I can't seem to find that in the documentation anywhere
views:
39answers:
1
+1
A:
The Penn Treebank has 4.5 million English words that are used for P.O.S tagging, and about half of that is used for skeletal parsing.
Check out page 327 of this document http://acl.ldc.upenn.edu/J/J93/J93-2004.pdf. It is a little outdated (2004) but I can't think of any new words that English speakers have introduced since then.
gnucom
2010-07-26 22:51:38
Thank you, that was really helpful!!
Lezan
2010-07-26 23:26:13