I am currently working on an application that parses huge XML files.
For each file, there will be different processes but all of them will be parsed into a single object model.
Currently, the objects parsed from each XML file will go into a single collection.
This collection is also used during parsing, e.g. if a similar object already exists, it will modify the object's property instead, such as adding count.
Looking at the CPU graph when this application is running, it is clear that it only uses part of the CPU (one core at a time on 100%), so I assume that running it on parallel will help shave running time.
I am new into parallel programming, so any help is appreciated.