views:

261

answers:

2

I'm playing around with sentiment analysis, and I'm looking for some seed data. Is there a free dictionary around?

It can be really simple: 3 sets of texts/sentences, for "positive", "negative", "neutral". It doesn't have to be huge.

Eventually I'll probably generate my own seed data for my specific use case, but it would be great to have something to play with now while I'm building the thing.

Thanks!

+1  A: 

Bing Liu and Minqing Hu from UIC have a number of datasets:

Bo Pang from Cornell has some more.

Oren Trutner
The Cornell data looks like it'll do the trick. Thanks!
Ken
+1  A: 

Hi Ken,

If you're interested in sentiment dictionaries, many authors have presented work based on manually built lists, and other semi automated methods for obtaining lists of opinionated terms. One good approach is to derive it from the WordNet database, by extending a core of positive/negative words using relationships like synonyms etc.

A good example of a manually built list is the General Inquirer.

For a semi automated method that derives lists, check out SentiWordNet from Esuli and Sebastiani.

These I believe are generally available for research, but you may need to get in touch with the authors regarding the use of these resources for non-research purposes.

B.

bohana