Hi the aim is to parse a sizeable corpus like wikipedia to generate the most probable parse tree,and named entity recognition. Which is the best library to achieve this in terms of performance and accuracy? Has anyone used more than one of the above libraries?
+2
A:
I use in my experiments the standford tagger but it really depends on the quality of your articles from wikipedia. Here you will find a comparison of different part-of-speech taggin implmentations - PoS on aclweb.
Skarab
2010-10-04 18:24:06
The link needs the final ")" - well-known Markdown gotcha: [POS tagging state of the art](http://aclweb.org/aclwiki/index.php?title=POS_Tagging_%28State_of_the_art%29)
Christopher Manning
2010-10-05 18:27:31
Thank you! The link is fixed.
Skarab
2010-10-05 21:35:08
A:
I'm currently using Enju HPSG parser which seems to be better than the others.
Sharmila
2010-10-14 18:36:46