ansaurus

Question

Answer 1

+1 A:

So I'm guessing something like JFreeChart just wasn't cutting it for your app? If you haven't gone down that road yet, I'd suggest checking it out before attempting to roll your own.

Anyway, if you're looking for the nearest point to a mouse event, getting the point with the minimum Euclidean distance (if it's below some threshold) and presenting that will give the most predictable behavior for the user. The downside is that Euclidean distance is relatively slow for large data sets. You can use tricks like ignoring the square root or BSP trees to speed it up a bit. But if those optimizations are even necessary really depends on how many data points you're working with. Profile a somewhat naive solution in a typical case before going into optimization mode.

j flemm 2010-08-11 13:57:23

JFreeChart would've added a layer of complexity that I didn't want to deal with, but mostly it was too slow. I'm redrawing thousands of points multiple times a second. Euclidean distance makes sense; it's essentially what I'm toying with right now, but reifying it into a method–sqrt (sq xdiff) (sq ydiff)–is helpful. Thank you!

Isaac Hodes 2010-08-11 14:21:31

@Isaac: Wow. So you're actually redrawing all those points multiple times a second? It might be worth looking into using an accelerated surface if you aren't already.Also: if you're searching thousand of points and want an interactive response some sort of spacial optimization like BSP trees is going to be a necessity. It's also going to be major bummer to write. BSP does parallelize relatively well though.Please keep us updated as you drill down to a solution. I'm interested in what you end up with.

j flemm 2010-08-11 14:44:23

Thanks for the tip re: accelerate surfaces; when I work on speeding up the drawing, I might give that a look. Right now, it isn't too much of a problem. I'm generally not redrawing more than a few thousand points a second, and that isn't too much to deal with. As for BSPs, from the Wikipedia article, it doesn't sound like their particularly applicable: perhaps I'm missing something?

Isaac Hodes 2010-08-11 20:03:58

Finally, Euclidean distance isn't working as well as I thought, as the distance between x-pts is always a fixed interval, and it's a small interval, but the difference between two y points can vary wildly (it's a graph of voltages over time, so that makes sense.) The issue is then that the y-vals are getting far more credence than the x-vals, though in reality they should be getting the same relative credence. Normalizing the ys doesn't seem to help much either… I'll report back with more results! Thanks for the help :)

Isaac Hodes 2010-08-11 20:04:36

@Isaac: Re: BSP: All a BSP tree would do is allow you to partition your space so you aren't checking every point for every click. Since each child region is nested within a larger parent region, you just find the child leaf and go up the tree however many steps that are required to be sure you've found the closest point. Re: Euclidean distance: Since it's fixed x interval and the y fluctuates wildly, Euclidean probably isn't the way to go. A 1-D Manhattan along the x-axis is probably better. Maybe check the two left and right neighboring points and do some y-axis arbitration for obvious cases.

j flemm 2010-08-12 14:25:22

Ah, interesting. I might consider it, though it may not be necessary, as the resolution of a space having too many point to process is so low (more pts than pxs), rending the whole method inaccurate and unlikely to be used anyway. I might do it anyway, though, it can't hurt. Re: 1-D Manhattan et al, I'm doing that now, and it's working decently. I'm going to add some heuristics to catch the cases where y's are obvious, as well. Thanks!

Isaac Hodes 2010-08-13 14:33:57

Answer 2

A:

I think your approach is decent. This basically only requires one iteration through your data array, a little simple maths and no allocations at each step so should be very fast.

It's probably as good as you are going to get unless you start using some form of spatial partitioning scheme like a quadtree, which would only really make sense if your data array is very large.

Some Clojure code which may help:

(defn squared-distance [x y point]
  (let [dx (- x (.x point))
        dy (- y (.y point))]
     (+ (* dx dx) (* dy dy))))

(defn closest 
  ([x y points]
    (let [v (first points)] 
      (closest x y (rest points) (squared-distance x y v) v)))
  ([x y points bestdist best]
    (if (empty? points)
      best
      (let [v (first points)
            dist (squared-distance x y v)] 
        (if (< dist bestdist)
          (recur x y (rest points) dist v)
          (recur x y (rest points) bestdist best))))))

mikera 2010-08-11 15:03:34

Thanks! Distance isn't working too well for me, in this case, but the spacial partition may–I'll look into it.

Isaac Hodes 2010-08-13 14:34:47

ansaurus

tags:

views:

answers:

Mapping Pixels to Data

related questions