Hi,
I have a directory full (~103, 104) of XML files from which I need to extract the contents of several fields. I've tested different xml parsers, and since I don't need to validate the contents (expensive) I was thinking of simply using xml.parsers.expat (the fastest one) to go through the files, one by one to extract the data.
- Is there a more efficient way? (simple text matching doesn't work)
- Do I need to issue a new ParserCreate() for each new file (or string) or can I reuse the same one for every file?
- Any caveats?
Thanks!