views:

834

answers:

3

In a follow-up to this answer I want to ask if any of you know any good (and more importantly easy to understand) tutorials and / or examples of data mining with the Weka toolkit.

I've been very interested in Data Mining ever since I've first heard of it and the things it can do, I've also have some experiments I'd like to do with some of my data and I've already bought four books and I found specially interesting the following two:

Intelligent Data Analysis Data Mining

The last one is written by the same authors of Weka and contains a lot of examples but still, I found it a little hard to understand the logic and specially the math. My math skills are currently very rough, I plan to go to the University this year and hopefully I'll learn and be able to better understand the math involved, but until then I want to gain some practice in Data Mining.

Is there any step-by-step tutorial with example data I can read to get me started with the Weka toolkit?

+3  A: 

When it comes to "applied" data mining, for the starters, you may not need to think about math at all. Weka is product of a university machine-learning project and offers 100+ algorithms. Contrast that with Microsoft SQL server SSAS which offers nine algorithms -- and they do not even bother to explain the math.

They both offer association, clustering, attribute selection, some kind of neural network. So, the trick is to understand what you are trying to achieve, not necessarily the math below. Try reading about Microsoft algorithms (good documentation) and see if you can figure out principles that SSAS and Weka have in common -- this should help you focus on basic principles and get you started.

There is a list of a few Weka tutorials here.

Damir Sudarevic
+1  A: 

Here are links to a collection of videos and a collection of tutorials.

datakid
+1  A: 

Personally, I would use RapidMiner5, as it's a super easy GUI environment (much better than v4). It has all of the WEKA functions available.

Get rapidminer here: http://www.rapid-i.com

Super easy intro videos to RapidMiner here:

https://dspace.ndlr.ie/jspui/handle/10633/2353

here:

http://www.neuralmarkettrends.com/

and here:

http://www.youtube.com/user/VancouverData

If you want to learn more about the statistics behind data mining, see this Standford prof's lecture series at Google:

http://www.youtube.com/watch?v=zRsMEl6PHhM&feature=related

el chief