I have a word document in docx format with data in repeating format pattern.
I would like to take each data from the repeating set and upload to a row in the SQL table.
Sample of data here:
Question No : 1
How is LINQ to SQL different from Entities?
A. Answer 1
B. Answer 1
C. Answer 1
D. Answer 1
Answer : D
Explanations :
Some explanation.
Question No : 2
How is NVARCHAR different from VARCHAR
A. Answer 1
B. Answer 1
C. Answer 1
D. Answer 1
Answer : D
Explanations :
Some explanation.
I could think of few approaches:
- Read the document as docx using the Office API
- Save the document as XML from Word and parse XML [the converted XML document doesn't seem have a structure/schema]
- Save the document as HTML from Word and parse HTML [DOM structure not well formed]
Which among above would you suggest and why? Are there any tools to help convert a document and upload to a SQL table or access DB?
Thanks!