ansaurus

Question

Best way to calculate accuracy and display meaningful results

Answer 1

+1 A:

Relative percentages are a bad idea, because people are very bad at judging what that means in practice - for more explanation, see the book Bad Science.

Just display the sums in order from most accurate to least and explain the rating system. I don't think turning them into any sort of percentage is helpful, but it would be a good idea to give some guide figures or banding (say by colouring the text or background) of what good, middling and poor accuracy would be.

Finally, your question is very specific to your programming program and is unlikely to be of use to many other people the way it is phrased. Here we prefer question to be specific in technical topic but generally applicable to other problems, so if you phrase your problems more generally next time it makes for a better resource.

Martin 2010-01-15 09:52:16

@Martin Thanks for the insight. There is actualy color coding system implemented for results based upon it's accuracy. Problem with displaying the results is that the sums can become large and wide in range and is probably equivilant to displaying percentages. I would like to think there is a better way.Thanks for the info on the book, I'll browse the site also.

Cody N 2010-01-15 10:01:22

Answer 2

+1 A:

Hi

I tend to agree with @Martin that using numerical values to quantify the difference between qualitative measurements is a bit dodgy. However, people do it all the time, so if you want to carry on doing it go right ahead !

Now, what I really wanted to write is that your pseudo-code is not terribly pseudo- at all. Here's the pseudo-code that I would write:

ManhattanDistance[{56, 53, 50, 64},{56, 54, 52, 64}]

which specifies the same calculation as your version. Now, you may or may not recognise this to be a valid Mathematica statement, but that's beside the point. The point is that you have hit upon one of a myriad functions for measuring the distance between two vectors. Other distance measures include the Euclidean distance, and the Chessboard distance.

You could also use any one of a number of vector norms for measuring the distance between your vectors. For example, Mathematica gives the result sqrt(5) for the calculation:

Norm[S - A]

So, if you do want to indulge in some dodgy pseudo-statistics Google around for some definitions of vector distances and norms. I guess you'll find code or at least imperative algorithms too.

Regards

Mark

PS Don't tell anyone I helped you with pseudo-science :-)

High Performance Mark 2010-01-15 11:47:05

Wow, this is awesome input. This will lead to great learning experiences. I've decided to take yalls advice and contemplate a more elegant solution. In this case Google and research papers. All elements will deal with weather related variables. The case above was temperatures.

Cody N 2010-01-15 20:58:15

Answer 3

+1 A:

Your "position accuracy" is just an error which if normally distributed (as one would expect) can be modeled with a gaussian distribution. If so, since sums of gaussian random variables are themselves gaussian, your "sum of all accuracy" number is also a gaussian distributed random variable. You can compute a mean and variance of these error sums and have a gaussian PDF (probability distribution function) modeling your system and use it to answer questions like "that last clunky vector should be bright red because it had an error sum larger than 95% of all such vectors". Or "wow that last vector was A+ because it had an error less than 1% of all other such vectors".

This wiki post may help too.

Paul

Paul 2010-01-16 06:51:42

Answer 4

A:

Mean Squared Error is often used in engineering circles to quantify error between a solution and an estimate of the solution.

To avoid problems with a large variance in the values consider using log(error) ...of course this has it's own issues with log(0) being -infinity and if (0 < error < 1) log gives negative numbers

petantik 2010-01-18 12:34:47

ansaurus

tags:

views:

answers:

Best way to calculate accuracy and display meaningful results

related questions