Original title: Easiest way to process text from MS Word file
  • Time: 2009-12-11 10:20:43
  • Tags: c#, ms-word

I need to extract text from an old MS Word .doc file in C#. What is the easiest (or else the best) way to get that job done?


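First, you need to add a reference to the MS Word object library. Go to Project => Add Reference, select the COM tab, then find and select "Microsoft Word 10.0 Object Library". The version may differ on your computer. Click OK.

After you have done that, you can use the following code. It will open the MS Word doc and display each paragraph in a message box: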

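// Read an MS Word Doc
// Requires the COM reference described above. Depending on the interop
// assembly, you may need: using Word = Microsoft.Office.Interop.Word;
private void ReadWordDoc()
{
    try
    {
        Word.ApplicationClass wordApp = new Word.ApplicationClass();

        // Define file path
        string fn = @"c:\test.doc";

        // Create objects for passing (the COM API takes every argument by ref)
        object oFile = fn;
        object oNull = System.Reflection.Missing.Value;
        object oReadOnly = true;

        // Open Document (third argument opens it read-only)
        Word.Document Doc = wordApp.Documents.Open(ref oFile, ref oNull, 
                ref oReadOnly, ref oNull, ref oNull, ref oNull, ref oNull, 
                ref oNull, ref oNull, ref oNull, ref oNull, ref oNull, 
                ref oNull, ref oNull, ref oNull);

        // Read each paragraph and show it in a message box
        foreach (Word.Paragraph oPara in Doc.Paragraphs)
            MessageBox.Show(oPara.Range.Text);

        // Quit Word
        wordApp.Quit(ref oNull, ref oNull, ref oNull);
    }
    catch (Exception ex)
    {
        // Report any failure (for example, file not found or Word not installed)
        MessageBox.Show(ex.Message);
    }
}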
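
One thing to watch for when processing the extracted text: the Range.Text of a Word paragraph normally ends with the paragraph mark (a carriage return), and text taken from a table cell also carries an end-of-cell character. Below is a minimal sketch of a cleanup helper, assuming you want plain strings without those trailing marks. The class and method names (WordText, CleanParagraph) are made up for illustration and are not part of the Word object model.

```csharp
using System;

// Hypothetical helper, not part of the Word object library.
public static class WordText
{
    // Removes the trailing paragraph mark ('\r'), any line feed ('\n'),
    // and the end-of-cell mark ('\a') that Word appends to Range.Text,
    // then trims surrounding whitespace.
    public static string CleanParagraph(string raw)
    {
        if (raw == null)
        {
            return string.Empty;
        }

        return raw.TrimEnd('\r', '\n', '\a').Trim();
    }
}
```

Inside the foreach loop you could then call WordText.CleanParagraph(oPara.Range.Text) before showing or storing each paragraph. Empty paragraphs come back as empty strings, so they are easy to skip.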


