ansaurus

Question

Answer 1

+4 A:

That's a very good question. There are a few similar questions that have been asked here.

This article is probably a good place to start. Apparently upvotes minus downvotes is a bad way to do it. The better way is to use complicated maths to assign a score to each and sort by that.

Here is a scoring function in Ruby from the article:

require 'statistics2'

def ci_lower_bound(pos, n, power)
    if n == 0
        return 0
    end
    z = Statistics2.pnormaldist(1-power/2)
    phat = 1.0*pos/n
    (phat + z*z/(2*n) - z * Math.sqrt((phat*(1-phat)+z*z/(4*n))/n))/(1+z*z/n)
end

pos is the number of positive rating, n is the total number of ratings, and power refers to the statistical power: pick 0.10 to have a 95% chance that your lower bound is correct, 0.05 to have a 97.5% chance, etc.

As a usability thing, I would sort the data by the score, but I would not show the score to the user. I would only show the number of upvotes and downvotes.

David Johnstone 2010-09-14 01:23:20

Yeah, the scores come out to be some crazy decimals, currently. I only show votes, and then if you hover over a table row the (+/-) count un-hides.

Matt Egan 2010-09-14 02:13:18

@Matt Egan: I wouldn't show the upvotes minus downvotes at all as it is doesn't help anything. Also, your rankings don't look quite right at the moment, as "Hey Leonardo" (+1/-8) is ranked above a few +1/-2 songs.

David Johnstone 2010-09-14 03:51:12

I haven't implemented this popularity rating yet. I'm about to lie down into bed, I'm going to look over some of my Statistics notes from Sophomore year so I can actually comprehend what I'm doing here. I like to understand what I'm doing. - Also, now that you point that out, I don't think this whole "time" rating is working too well either :/

Matt Egan 2010-09-14 04:21:35

What should I/he use for `power`? I'm not exactly sure what it means.

Justin L. 2010-09-14 05:11:22

Using time to boost the ranking of new songs is not a bad idea (although you'd want to think carefully about what sort of mathematical function you would use) — you could make a useful "Sort by what's hot" using this in conjunction with the actual score (calculated using the lower bound of a confidence interval, as described in the article). I would love to see how this turns out.

David Johnstone 2010-09-14 05:26:40

@Justin L.: The short answer is to choose a desired confidence and use `power = (100% - confidence) * 2`, i.e., `power = (100% - 95%) * 2 = 0.10`. Understand that the equation is calculating the lower bound of what the *true* score of the item/song is. The long answer is to read through the Wikipedia page and related pages, but I can't help too much because statistics is not my forté.

David Johnstone 2010-09-14 05:39:55

Actually, if you want a "hotness" ranking, this is worth a read: http://www.evanmiller.org/rank-hotness-with-newtons-law-of-cooling.html

David Johnstone 2010-09-14 06:35:22

Yeah, I'm seriously wishing I had remembered my statistics unit from my mathematics class last year. I'm about to go talk to my teacher from last year between class changes, she might find it interesting as well. - Don't worry I'm working on it, I won't let you guys down :D

Matt Egan 2010-09-14 14:54:03

Ah, I'm reading this article about Newton's Law Of Cooling, and I remember covering this next year. This seems like it would be nice to implement, and a little less intensive than Wilson's Score interval. - One question on the Newton's cooling, if the process of upvoting increases a song's "temperature" should downvoting decrease the temperature (I'm thinking by a smaller amount, e.g. weigh upvotes more)?

Matt Egan 2010-09-14 15:00:19

@Matt Egan: The exponential decay "hotness" algorithm is certainly simpler than the Wilson's score, although they do different things. In the end, you will still need to be able to determine how good each song is (to be fair, you don't *need* a computer to work this out). Implementing it shouldn't be too hard, especially if you have a stats package like Ruby has in the example. Re downvoting and temperature: I was thinking about that myself, and I think they should both increase "hotness" (maybe upvotes +10, downvotes +5?)

David Johnstone 2010-09-14 23:26:57

What's your reasoning behind having downvotes increase hotness?

Matt Egan 2010-09-15 02:32:22

@Matt Egan: I was thinking "hotness" should be a measure of how actively people are interacting with it, and not be a measure of how good it is. But it depends on how you want it to behave. For example, what do you want to happen if heaps of people all vote up? Or heaps of people all vote down? Or vote half-half? Or 75% up? Or 75% down? Your call :-) Either way, you can implement it now, and tweak the numbers as you see fit.

David Johnstone 2010-09-15 03:08:24

The only problem with the hotness, is that from the start, I wasn't recording temperature, I'm trying to come up with something else... Any ideas?

Matt Egan 2010-09-15 03:43:29

@Matt Egan: That's a good point. If you haven't been logging when each vote was made since the start then you would probably have to fudge starting temperatures — if you want to go down that path. Anyway, I think that using the Wilson's score would be best. At least this gives the best possible ranking by popularity. Protip: you only *need* to understand the inner workings of things well enough to implement them :-)

David Johnstone 2010-09-15 03:58:54

Thanks David! I'm still working on it, I'm just a person who likes to understand how things work. - This is also SUPER interesting/exciting, I've never had such a dataset to work with on such a "real" population.

Matt Egan 2010-09-15 04:03:58

@Matt Egan: By the way, here's a PHP implementation (including an implementation of Ruby's Statistics2.pnormaldist): http://www.derivante.com/2009/09/01/php-content-rating-confidence/

David Johnstone 2010-09-15 04:05:36

Thanks, I figured I didn't need am implementation of the pnormaldist, as I could choose a confidence interval and calculate the normal distribution using my ole trusty TI-84. - One more thing I just realized, the Wilson score doesn't factor time into this equation...

Matt Egan 2010-09-15 04:11:37

And, as a bonus, that article also contains code for how you can boost new items (based solely on age), which sounds exactly what you'd want.

David Johnstone 2010-09-15 04:12:16

Ah, looks like this link has a section about that precise thing :P

Matt Egan 2010-09-15 04:13:05

Well, I got it working, for your viewing at : songs.taphappysoftware.com/index.php/songs/diagnostics - I'll have it working on the main table in a moment.

Matt Egan 2010-09-15 04:29:28

@Matt Egan: "Rank is determined by a complicated process involving confidence intervals and gravity over time." I like it! May I suggest not displaying the net number of votes, and always display the number of upvotes and downvotes? The net number of votes isn't a meaningful metric.

David Johnstone 2010-09-15 04:51:38

Done, done and done-er. Thanks for all your help David, it's be tremendous! Although, I am thinking about going back to a more understandable positive votes over time, except weighing positive votes more than downvotes... - We'll see :P

Matt Egan 2010-09-15 05:07:52

Answer 2

A:

How about sorting songs by posting time or number of votes (negative + positive)? If your goal is to give every song equal attention, this sounds good enough.

Nikita Rybak 2010-09-14 01:24:54

ansaurus

tags:

views:

answers:

Sorting A List Of Songs By Popularity

related questions