views:

103

answers:

3

[Caveat] This is not directly a programing question, but it is something that comes up so often in language processing that I'm sure it's of some use to the community.

Does anyone have a good list of uninteresting (English) words that have been tested by more then a casual look? This would include all prepositions, conjunctions, etc... words that may have semantic meaning, but are often frequent in every sentence, regardless of the subject. I've built my own lists from time to time for personal projects but they've been ad-hoc; I continuously add words that I forgotten as they come in.

+6  A: 

These words are usually called stop words. The Wikipedia article contains much more information about them, including where to find some lists.

Greg Hewgill
+1 because you beat me by about 30 seconds :(
Mark Byers
+2  A: 

I think you mean stop words.

There's a few links to lists of stop words on Wikipedia, including this one.

Mark Byers
+1  A: 

List of English Stop Words

Anthony Forloney