English 中文(简体)
Searching files in NTFS
原标题:

We have a fairly large disk array with roughly 2-3 million XML files on it. The disk is formatted with NTFS and we would like to search the filesystem using wildcards. So something like * SomePartOfTheFilename * would be a typical search query.

We are using .Net and are finding that using DirectoryInfo appears to be slow.

DirectoryInfo directoryInfo = new DirectoryInfo(directory);

List<FileInfo> fileInfos = directoryInfo.GetFiles(searchString, SearchOption.AllDirectories).ToList();

Using Loops and recursion is also very slow.

Is there a lower level API call we can use to directly search the NTFS index?

Using dir * SomePartOfTheFilename * /s from the command line is almost instant. Is there something there that can be leveraged?

问题回答

I m not sure if you can use the Indexing service, but it may be handy for what you are trying to do:

http://msdn.microsoft.com/en-us/library/ee805985%28VS.85%29.aspx

http://www.codeproject.com/KB/database/Indexing_Service_HOW-TO.aspx

It allows you to create complex queries against the NTFS index of the files on a computer.

You may use MFT directly (See: NTFS Wiki). That is data table where all information about files is located. You can see the structure of MFT for example here or here. The Windows API ends up in the same table so you can alternatively try to speed the searches up to guarantee that it will be paged in memory before search (simple read of e.g. c:$Mft is enough).





相关问题
Anyone feel like passing it forward?

I m the only developer in my company, and am getting along well as an autodidact, but I know I m missing out on the education one gets from working with and having code reviewed by more senior devs. ...

NSArray s, Primitive types and Boxing Oh My!

I m pretty new to the Objective-C world and I have a long history with .net/C# so naturally I m inclined to use my C# wits. Now here s the question: I feel really inclined to create some type of ...

C# Marshal / Pinvoke CBitmap?

I cannot figure out how to marshal a C++ CBitmap to a C# Bitmap or Image class. My import looks like this: [DllImport(@"test.dll", CharSet = CharSet.Unicode)] public static extern IntPtr ...

How to Use Ghostscript DLL to convert PDF to PDF/A

How to user GhostScript DLL to convert PDF to PDF/A. I know I kind of have to call the exported function of gsdll32.dll whose name is gsapi_init_with_args, but how do i pass the right arguments? BTW, ...

Linqy no matchy

Maybe it s something I m doing wrong. I m just learning Linq because I m bored. And so far so good. I made a little program and it basically just outputs all matches (foreach) into a label control. ...

热门标签