views:

155

answers:

2

Hi, I want to extract information of bullets present in word document. I want something like this : Suppose the text below, is in word document :

Steps to Start car :

  • Open door
  • Sit inside
  • Close the door
  • Insert key
  • etc.

Then I want my text file like below :

Steps to Start car :

<BULET> Open door </BULET>

<BULET> Sit inside </BULET>

<BULET> Close the door </BULET>

<BULET> Insert key </BULET>

<BULET> etc.</BULET>

I am using C# language to do this.

I can extract paragraphs from word document and directly write them in text file with some formatting information like whether text is bold or is in italics, etc. but dont know how to extract this bullet information.

Can anyone please tell me how to do this?

Thanks in advance

A: 

I got the answer.....

Shekhar
Mind to share it with us?
Martijn
Sure.First I was converting doc on paragraph basis. But instead of that if we process doc file sentence by sentence basis, it is possible to determine whether that sentence contains bullet or any kind of shape or if that sentence is part of table. So once we get this information, then we can convert that sentence appropriately. If someone needs source code, I can share it.
Shekhar
A: 

Hi shekar,

Could you please share the code snippet to extract bullet information from the document here or mail me to [email protected]

I'm also facing the same problem

Thanks in advance

TAM