views:

67

answers:

2

Hi the aim is to parse a sizeable corpus like wikipedia to generate the most probable parse tree,and named entity recognition. Which is the best library to achieve this in terms of performance and accuracy? Has anyone used more than one of the above libraries?

+2  A: 

I use in my experiments the standford tagger but it really depends on the quality of your articles from wikipedia. Here you will find a comparison of different part-of-speech taggin implmentations - PoS on aclweb.

Skarab
The link needs the final ")" - well-known Markdown gotcha: [POS tagging state of the art](http://aclweb.org/aclwiki/index.php?title=POS_Tagging_%28State_of_the_art%29)
Christopher Manning
Thank you! The link is fixed.
Skarab
A: 

I'm currently using Enju HPSG parser which seems to be better than the others.

Sharmila