views:

66

answers:

2

On a website, everything is tagged with keywords assigned by the staff (it's not a community driven site, due to its nature). I am able to determine which tags a user is most active in (or, what tags they view the most). However, I'm not entirely sure how to choose the list. A few options present themselves, but they don't seem right to me.

My options, that I have determined, are:

  • Take the top n (or m < n if they have fewer than n viewed tags) tags
  • Take the top n tags where n is a percentage of the total tags viewed
  • Take the top n tags with m views where n and m are percentages of total tags viewed and total page views
  • Take all of the tags, regardless of views

The goal is to identify what is most interesting to the user and show them other things that they might be interested in, with respect to the tags that are assigned to the content.

+3  A: 

You could look at machine learning algorithms to find algorithms with which to evaluate the effectiveness of your choice.

Like for instance: http://en.wikipedia.org/wiki/Supervised_learning#Approaches_and_algorithms Stuff like nearest neighbour and bayes could help you improve your suggestions.

This is however overkill for just suggesting "Would you like to look at this too?", but it's an interesting approach to providing better tie-ins. It would, however require some method to figure out whether or not your users value your suggestions (e.g. "I like this!"-links or log-analysis based on time spent on links, etc.)

andy
Upvoted, but yes, I think it's overkill for a brand new feature. However, your recommendations for learning algorithms have been documented for future use. In the mean time, I'll be implementing Ilya Kochetov's solution (multiple reports, with a slant toward option 3).
Thomas Owens
By the way, in a perfect world, I would probably accept yours, but it's a little powerful for a new feature that might not even work out long-term.
Thomas Owens
+1  A: 

A simple solution is to try several reports and check which report is more informative. The nature of your site and your data may mean that some reports are unexpectedly useful and some are not. If a report get a 'flat' area chart for example - look for something else.

Even better give the consumers of the reports a choice and an ability to provide feedback. Tune the reports based on what they will be really looking for.

P.S. I would go fro the "Take the top n tags with m views where n and m are percentages of total tags viewed and total page views" report first

Ilya Kochetov