In Weka I load an ARFF file. I can view the relationships between attributes using the Visualize tab.

However, I can't understand the meaning of the jitter slider. What is its purpose?

A: 

I don't know the products you mention, but jittering generally means randomising the sample positions. E.g., in ray tracing you would normally cast a ray through each pixel on the screen. Jittering adds a random offset to each ray to reduce artifacts caused by regular sampling (aliasing).

http://en.wikipedia.org/wiki/Jitter

rikh
A: 

I don't know Weka, but generally jitter is a term for the variation of a periodic signal relative to some reference interval. I'm guessing the slider allows you to set some range or threshold below which data points are treated as being regular, or to modify the output to introduce some variation. The Wikipedia entry can give you some background.

Update: from this pdf, the jitter slider is for this purpose:

“Jitter” option to deal with nominal attributes (and to detect “hidden” data points)

Based on the accompanying slide it looks like it introduces some variation in the visualisation, perhaps to show when two data points overlap.

Update 2: This Google Books extract (from Data Mining by Ian H. Witten and Eibe Frank) seems to confirm my guess:

[jitter] is a random displacement applied to X and Y values to separate points that lie on top of one another. Without jitter, 1000 instances at the same data point would look just the same as 1 instance.

Rich Seller
+1  A: 

You can find the answer in the mailing list archives:

The jitter function in the Visualize panel just adds artificial random noise to the coordinates of the plotted points in order to spread the data out a bit (so that you can see points that might have been obscured by others).
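
To make the idea concrete, here is a minimal Python sketch of that kind of jitter. It is not Weka's actual code: the `jitter` function, the `amount` parameter (a stand-in for the slider position, taken here as a fraction of each axis range) and the use of uniform noise are all my own assumptions for illustration.

```python
import random

def jitter(points, amount, x_range, y_range, seed=None):
    """Return a copy of points with uniform random noise added to each coordinate.

    amount is a fraction of the axis range (hypothetical stand-in for the slider).
    """
    rng = random.Random(seed)
    dx = amount * x_range  # largest horizontal displacement
    dy = amount * y_range  # largest vertical displacement
    return [(x + rng.uniform(-dx, dx), y + rng.uniform(-dy, dy))
            for x, y in points]

# 1000 instances that share the same pair of values, plus one instance elsewhere.
points = [(1.0, 2.0)] * 1000 + [(3.0, 1.0)]

# Without jitter only two distinct positions would be drawn.
distinct_before = len(set(points))

# With jitter the 1000 coincident points spread into a small cloud.
jittered = jitter(points, amount=0.05, x_range=2.0, y_range=1.0, seed=0)
distinct_after = len(set(jittered))

print(distinct_before, distinct_after)
```

With `amount` set to 0 the plot is unchanged; as it grows, the cloud around each original position gets wider, which shows how many instances were stacked there at the cost of less exact positions.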

Zed