views:

496

answers:

4

Out of curiosity, I would love to know what tag clouds formats best serve the purpose of discovery of more and more (relevant)content?

I am aware of 3 formats, but don't know which one is the best.

1) delicious one - color shading

2) The standard one with font size variations -

3) The one on this site - numbers showing importance/usage.

So which ones do you prefer? and why?

Edit: Thanks to the answers below, I now have much more understanding of tag cloud visualization techniques.

4) Parallel Tag Clouds - a simple use of parallel coordinates technique. I find it more organized and readable.

5) voroni diagram - more useful for identifying tag relationships and making decisions based on them. Doesn't serves our purpose of discovery of relevant content.

6) Mind maps - They are good and can be employed to step by step filter content.

I found some more interesting techniques here - http://www.cs.toronto.edu/~ccollins/research/index.html

A: 

size adjusted voroni diagram - it shows which tags are inter-related

Justin
Thanks, I've added more details to the topic.
Arpit Tambi
+2  A: 

I really do think that depends on the content of the information and the audience. What's relevant to one is not relevant to another. If an audience is more specialized, then they will be more likely to think along the same lines, but it would still need to be analyzed and catered to by the content provider.

There are also multiple paths that a person can take to "discover more". Take the tag "DNS" for example. You could drill down to more specific details like "UDP Port 53" and "MX Record", or you could go sideways with terms like "IP address" "Hostname" and "URL". A Voronoi diagram shows clusters, but wouldn't handle the case where general terms could be related to many concepts. Hostname mapping to "DNS", "HTTP", "SSH" etc.

I've noticed that in certain tag clouds there's usually one or two items that are vastly larger than the others. Those sorts of things could be served by a mind map, where one central concept has others radiating out from it.

For the cases of lots of "main topics" where a mind map is inappropriate, there are parallel coordinates but that would be baffling to many net users.

I think that if we found an extremely well organized way of sorting clusters of tags while preserving links between generalities and specificities, that would be somewhat helpful to AI research.

In terms of which I personally prefer, I think the numeric approach is nice because infrequently referenced tags are still presented at a readable font size. I also think SO does it this way because they have vastly more tags to cover than the average size based cloud a la the standard.

Omniwombat
Thanks, I've added more details to the topic.
Arpit Tambi
+1  A: 

I would go with #2 out of the options you listed above.

  • #1 - The human eye recognizes and comprehends size differences much more effectively than color, when the color scale is along the same spectrum (ie, various blues as opposed to discrete individual colors).
  • #3 - Requires the user to scan the full list and mathematically compare each individual number while scanning. No real meaningful relationship between tags without a lot of work on the users part.

So, going with #2, there are several considerations to take into account:

  • Keep the tags alphabetical. This affords the user another method of searching and establishes a known relationship between each (assuming they know the alphabet!). If they're unordered, it's just a crapshoot to find a single one.
  • If size comparison is absolutely critical (this usually isn't the case, as you can scale up each level by a certain percentage or pixel amount), use a monospaced font. Otherwise, certain letter combinations may end up looking larger than they actually are.
  • Don't include any commas, pipes, or other dividers. You're already going to have a lot of data in a small area - no need to clutter it up with debris. Space the tags out with a decent amount of padding, of course. Just don't double the number of visual elements by adding more than just the data.
  • Set a min/max font size and scale between those. There are situations where one tag may be so popular that visually it may appear exponentially larger than the others. Likewise, you don't want a tag to end up rendering at 1px! Set the min/max and adjust between as necessary.
Mark Hurd
Thanks, I've added more details to the topic.
Arpit Tambi
A: 

My favorite tag cloud format is the Wordle format. It looks great and it also does a pretty good job of fitting a lot of tags in a small space.

narphorium
thanks, loved their designs. Would want to use them for my resume. However, I doubt they solve our purpose of discovering content.
Arpit Tambi