Hi Guys,
We were set an algorithm problem in class today, as a "if you figure out a solution you dont have to do this subject". SO of course, we all thought we will give it a go.
Basically, we were provided a DB of 100 words and 10 categories. There is no match between either the words or the categories. So its basically a list of 100 words, and 10 categories.
We have to "place" the words into the correct category - that is, we have to "figure out" how to put the words into the correct category. Thus, we must "understand" the word, and then put it in the most appropriate category algorthmically.
i.e. one of the words is "fishing" the category "sport" --> so this would go into this category. There is some overlap between words and categories such that some words could go into more than one category.
If we figure it out, we have to increase the sample size and the person with the "best" matching % wins.
Does anyone have ANY idea how to start something like this? Or any resources ? Preferrably in C#? :)
P.S - Even a keyword DB or something might be helpful ? Anyone know of any free ones?
Edit: Can use external data - but sorry guys, Google "forbidden" :) (we all said this in class hhaa)