Hello!
I need to develop a tool for web log data mining.
Having many sequences of urls, requested in a particular user session (retrieved from web-application logs), I need to figure out the patterns of usage and groups (clusters) of users of the website.
I am new to Data Mining, and now examining Google a lot. Found some useful info, i.e. querying Frequent Pattern Mining in Web Log Data seems to point to almost exactly similar studies.
So my questions are:
- Are there any python-based tools that do what I need or at least smth similar?
- Can Orange toolkit be of any help?
- Can reading the book Programming Collective Intelligence be of any help?
- What to Google for, what to read, which relatively simple algorithms to use best?
I am very limited in time (to around a week), so any help would be extremely precious. What I need is to point me into the right direction and the advice of how to accomplish the task in the shortest time.
Thanks in advance!