ansaurus

tags:

  • open
  • parsing
  • source
  • text-parsing

views:

13

answers:

0
+1  Q: 

Looking for Open Source document to text parser recommendations (ex: .PDF or .doc or to text)

Formatting should be preserved.

related questions

BNF grammar test case generation
Print stack trace information from C#
What is a good way to format logs?
How do you parse a filename in bash?
How to parse a string into a nullable int in C# (.NET 3.5)
An easy way to diff log files, ignoring the time stamps?
Learning Resources on Parsers, Interpreters, and Compilers
Does C# have built-in support for parsing page-number strings?
Resources for lexing, tokenising and parsing in python
Parsing, where can I learn about it.
Parsing XML using unix terminal
Equation (expression) parser with precedence?
What HTML parsing libraries do you recommend in Java
Where do I get the Antlr Ant task?
How do I put unicode characters in my Antlr grammar?
Resolving reduce/reduce conflict in yacc/ocamlyacc
Best Approach to Parse for SQL in PHP Files?
.Net Parse verses Convert
How can I learn about parser combinators?
Parse usable Street Address, City, State, Zip from a string
C# Save Dialogs
Delimited string parsing framework for .NET
Looking for algorithm that reverses the sprintf() function output
Split a string ignoring quoted sections
What is the best way to parse strings in Java
The questions, topics and answers come from serverfault.com, stackoverflow.com and superuser.com, and are licensed under the cc-wiki license.
© 2010 Chad Scira / icodeforlove