tags:

views:

57

answers:

2

I'm trying to open .doc file and read its content. But i can't find any way how to do this without launching MSWord.

Now I have following code:

    Microsoft.Office.Interop.Word.Application app = new Microsoft.Office.Interop.Word.Application();
    object nullObject = System.Reflection.Missing.Value;
    object file = @"C:\doc.doc";
    Microsoft.Office.Interop.Word.Document doc = app.Documents.Open(ref file, ref nullObject, ref nullObject,
             ref nullObject, ref nullObject, ref nullObject, ref nullObject, ref nullObject, ref nullObject,
             ref nullObject, ref nullObject, ref nullObject, ref nullObject, ref nullObject, ref nullObject,
             ref nullObject);
    doc.ActiveWindow.Selection.WholeStory();
    doc.ActiveWindow.Selection.Copy();
    IDataObject data = Clipboard.GetDataObject();
    string text = data.GetData(DataFormats.Text).ToString();
    doc.Close(ref nullObject, ref nullObject, ref nullObject);
    app.Quit(ref nullObject, ref nullObject, ref nullObject);

But it launches MSWord, any solution to do it without launching? Thanks in advance.

+1  A: 

Two possibilities: either use Microsoft's spec to write your own parser for the .doc format, or use an existing library for the purpose (e.g., from Aspose). Unless you have a couple of spare years to spend on the task, the latter is clearly the correct choice.

Jerry Coffin
thanks, i'll see Aspose lib.writing my own parser will take a few years :)
Vitali Fokin
+1  A: 

Last time I did this (via COM from C++), I recall a 'Visible' property in the Application interface (true=visible).

However, it seems to me that the default was false, so you had to set it to true to make Word appear.

Regardless of whether or not the user can see Word, you will still see winword.exe (or whatever it's called today) in your task manager. I don't think there's a way to access Word through this interface without it launching Word (behind the scenes or not).

If you don't want Word to launch at all, you may have to find another solution.

Marc Bernier
visibillity is enabled as default, so i can see msword anyway. even i set visibility to false, window appears and quickly collapses.
Vitali Fokin
i need to proceed lots of doc files, i takes too much time to launch word everytime
Vitali Fokin
Strange about the visibility property. I am using an older version of office (2003), maybe they changed the default. COM is very slow, you may be able to re-use some of the objects; the application object I think can stay alive as you cycle through each document. It may help a little.
Marc Bernier