(This is not exactly the problem that I have, but it's isomorphic, and I think that this explanation will be easiest for others to understand.)

Suppose that I have a set of points in an n-dimensional space. Using 3 dimensions for example:

A : [1,2,3]
B : [4,5,6]
C : [7,8,9]

I also have a set of vectors that describe possible movements in this space:

V1 : [+1,0,-1]
V2 : [+2,0,0]

Now, given a point dest, I need to find a starting point p and a set of vectors moves that will bring me to dest in the most efficient manner. Efficiency is defined as "fewest number of moves", not necessarily "least linear distance": it's permissible to select a p that's further from dest than other candidates if the move set is such that you can get there in fewer moves. The vectors in moves must be a subset of the available vectors, respecting multiplicity: you can't use the same vector more than once unless it appears more than once in the input set.

My input contains ~100 starting points and maybe ~10 vectors, and my number of dimensions is ~20. The starting points and available vectors will be fixed for the lifetime of the app, but I'll be finding paths for many, many different dest points. I want to optimize for speed, not memory. It's acceptable for the algorithm to fail (to find no possible paths to dest).
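To make the setup concrete, here is a tiny worked example in Python (the names are just illustrations of the problem statement): a "move" is component-wise vector addition, so a route is simply the sum of the chosen vectors added to the starting point.

```python
A = (1, 2, 3)
V1 = (1, 0, -1)
V2 = (2, 0, 0)

def apply_moves(p, moves):
    """Return the point reached from p after applying each vector once."""
    for v in moves:
        p = tuple(a + b for a, b in zip(p, v))
    return p

apply_moves(A, [V1, V2])  # → (4, 2, 2)
```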

Update w/ Accepted Solution

I adopted a solution very similar to the one marked below as "accepted". I iterate over all points and vectors and build a list of all reachable points with the routes to reach them. I convert this list into a hash of <dest, p+vectors>, selecting the shortest set of vectors for each destination point. (There is also a little bit of optimization for hash size, which isn't relevant here.) Subsequent dest lookups happen in constant time.
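The precomputation step described above can be sketched as follows (a minimal version; the names are illustrative, and the hash-size optimization mentioned is omitted). Enumerating subsets of vector *indices* rather than vectors makes duplicate vectors in the input each usable once:

```python
from itertools import combinations

def build_table(points, vectors):
    table = {}  # dest -> (start, tuple of vector indices)
    for p in points:
        for r in range(len(vectors) + 1):
            for idx in combinations(range(len(vectors)), r):
                dest = tuple(c + sum(vectors[i][d] for i in idx)
                             for d, c in enumerate(p))
                # keep only the shortest route to each destination
                if dest not in table or len(table[dest][1]) > r:
                    table[dest] = (p, idx)
    return table

table = build_table([(0, 0)], [(1, 0), (0, 1)])
table[(1, 1)]  # → ((0, 0), (0, 1))
```

Subsequent dest lookups are then a single dictionary access.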

+2  A: 

Given that you have the starting points and a fixed set of vectors, can you calculate the list of all reachable destinations and then just look up a given destination?

Chris Hulan
+4  A: 

I believe you would be able to make a generalized application of the A* (aka A star) path-finding algorithm. There is no reason why it can't be done in N-dimensional space. It guarantees an optimal path, provided you can describe the cost of each move.

http://en.wikipedia.org/wiki/A*_search_algorithm

Matt
There is [a reason] -- considering you have a fixed set of moves, distance is **NOT** a good approximation.
Kornel Kisielewicz
@Kornel I'm not sure I follow
Matt
@Matt -- The A* heuristic is distance-based -- that works well if you take single steps -- here, the movement can be more like that of a chessboard knight -- worse, you have only a few moves and then they run out. In short -- "hitting" the target is more of a problem than "getting close". It would be perfectly possible for the first move in the solution to take you *away* from the target -- and that could be the only solution. A* would then traverse the whole possibility tree. Not to mention that to decide whether a path is impossible, you'd have to traverse the tree anyway -- the heuristic is not much help here.
Kornel Kisielewicz
@Kornel A* heuristic is cost based, it just so happens the most common implementation of cost is distance. That doesn't have to be the case - it can be anything you want it to be. And you don't have to traverse the whole tree if there is a solution. A* is (sort of) a breadth first search, so you stop when you reach your destination, and it is guaranteed to be the most efficient. You can discard all unvisited paths because you already know they are more costly. You would end up traversing the whole tree if there was no solution, but I can't think of an algorithm that can avoid that.
Matt
@Matt : The algorithm presented by Strilanc is already much better, because it reduces the space to 2^10 possibilities.
Kornel Kisielewicz
@Kornel I wouldn't be convinced it's better without a mathematical proof comparing the two, or a real-life demonstration. Just because that approach reduces the search space to 2^10 doesn't mean anything - A*, as a heuristic, can eliminate dead/unimportant branches of the search space. Since we don't know what his cost evaluation is, or what his *real* problem is (since this is a simplified/isomorphic example), it will ultimately come down to which is better in the given use case. We don't have enough information to determine what that is though.
Matt
(cont) In other words, Strilanc's solution may work just fine. But you have to examine all 2^10 possibilities, and even then do some optimization after that (though I'm not sure what he meant there). A* could possibly find the solution without looking at 2^10 possibilities. It may also take longer, but only JS Bangs knows the answer to that.
Matt
A* is a very good heuristic, on the assumptions that you can reasonably calculate a distance, and getting close is good. I don't think this holds. The goal is not to get close to a destination point, but hit it, and that means that farther away is better if there's one big vector that will hit the destination. If you think A* would work, please suggest a heuristic function.
David Thornley
It reduces to breadth-first search with the heuristic being number of vectors added.
klochner
@David What is this business about "getting close", both you and Kornel mentioned it. A* will get you *exactly* at your destination, if a solution is available.
Matt
They mean that for any given point, you have no good heuristic for estimating total path length to the goal. Your heuristic will be existing path length, which is breadth first search.
klochner
Note that as usual in breadth-first search, you can search from both ends (from the target point *dest* and from the starting points) and try to meet in the middle. At first it is cheaper to search from *dest*, but once you get to 8 or 9 moves and still haven't reached some starting points, it is cheaper to do a round going in the opposite direction.
Jason Orendorff
A: 
  1. Begin at the start.
  2. Do while not at the destination:
  3. Get the distance to the destination.
  4. Test all possible moves and pick the one that gets you closest to the destination in one move.
  5. End do.

This may oscillate around the destination, but it will get you close.
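A minimal Python sketch of this greedy heuristic (names illustrative; each used vector is consumed, per the problem's one-use rule). To avoid oscillating, this version stops as soon as no remaining move brings it closer, so it can stop short of dest:

```python
import math

def greedy(start, vectors, dest, max_steps=100):
    p, remaining = list(start), list(vectors)
    for _ in range(max_steps):
        if tuple(p) == tuple(dest) or not remaining:
            break
        def dist_after(v):
            return math.dist([a + b for a, b in zip(p, v)], dest)
        best = min(remaining, key=dist_after)
        if dist_after(best) >= math.dist(p, dest):
            break  # no move improves things; stop instead of oscillating
        remaining.remove(best)
        p = [a + b for a, b in zip(p, best)]
    return tuple(p)

greedy((1, 2, 3), [(1, 0, -1), (2, 0, 0)], (4, 2, 2))  # → (4, 2, 2)
```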

xpda
+5  A: 

Actually, considering that you have around 10 vectors, you can, for a given dest point, calculate only the 1024 "targets" from the subsets of vectors -- i.e. every reachable space, along with the information about which set of moves gets there. That might be "slow" or "fast" depending on context (it would be absurdly fast if implemented on a parallel computing device like a GPU).

Having all the move sets and the offsets they produce, you can calculate the paths a lot quicker: for a given dest, subtract each subset sum from dest and check whether the result is one of your starting points, then pick the combination that reaches dest in the fewest moves.

(thanks to Strilanc)

Kornel Kisielewicz
You're going to want a dynamic programming approach with this solution.
klochner
@klochner Please explain your reasoning. If the dimensions and ranges are large, that's going to chow memory (and therefore time). 1024 destinations is sweet nothing.
marcog
Gave it some more thought and you're right - it's faster to just calculate all the sums (1024*20) and check for matches (1024*20*100 in the worst case).
klochner
The only problem that I see with this is that search time increases exponentially with every vector that I add. The # of vectors is _about_ 10, but it could maybe get as high as 20 or 30, which would put the performance of this algo through the floor.
JSBangs
One billion (2^30) is still easily computable, especially in parallel. CUDA would be a great tool for this task :>
Kornel Kisielewicz
Also, your problem is NP-complete, hence you can either go with an exponential algorithm, or try approximations that won't necessarily work.
Kornel Kisielewicz
It's not the search time that increases exponentially - it's the generation time for the reachability table, which can be done upfront (since it depends only on the vectors and starting points, which are fixed). The search time through that table can even be made O(1), since you can build a perfect hash (because the table contents are fixed).
caf
+4  A: 

So you want to find a subset of your set of vectors such that the subset sums to a given value (dest minus one of your starting points). In one dimension this is called the subset sum problem, and it is NP-complete.

Luckily you only have ~10 vectors, so your problem size is actually quite small and tractable. Start by just trying all 2^10 move combinations for each starting point and picking the best one. Then work from there looking for simple optimizations.
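The brute-force baseline described above could look something like this (a sketch, names illustrative): try every subset of moves from every starting point and keep the shortest route that lands exactly on dest, returning None when dest is unreachable.

```python
from itertools import combinations

def fewest_moves(points, vectors, dest):
    dest, best = tuple(dest), None
    for p in points:
        for r in range(len(vectors) + 1):
            if best is not None and r >= len(best[1]):
                break  # can't beat the best route found so far
            for idx in combinations(range(len(vectors)), r):
                reached = tuple(c + sum(vectors[i][d] for i in idx)
                                for d, c in enumerate(p))
                if reached == dest:
                    best = (p, idx)  # (start point, vector indices)
                    break
    return best
```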

Some easy optimizations that might work:

  • Prioritize searching subsets including vectors pointed in the right direction.
  • Meet-in-the-middle. Use a hash table to store all points reachable using subsets of the first half of your move set, and see if you can hit any using the second half of your move set starting from the end.
  • Go backwards. You only have one endpoint, so hash all reachable start points from there then check against all possible start points.
  • Concurrency
Strilanc
2^10? You've got your math mixed up ;> -- remember that the vectors can be applied in different orders, which gives us 10! = 3628800
Kornel Kisielewicz
Vector addition is commutative. The order doesn't affect where you end up.
Strilanc
@Strilanc: How will you know whether a given path takes you there in 3 or 8 moves, then?
Kornel Kisielewicz
(however, if the solution set is small then it's a big improvement anyway)
Kornel Kisielewicz
You keep track of that information during the computation, of course. Instead of storing just reachable points you store tuples of reachable points and their cost.
Strilanc
You're solving a linear program essentially by brute force (or with some simple heuristics). This would get blown away by any LP solver.
klochner
@klochner: Why? LP solvers are good for really big systems. When I first used one I was amazed at how big a system it could take and spit out an answer fast. This is a really small system.
David Thornley
it's fast for small problems as well, so long as they are LPs.
klochner
+1  A: 

As Kornel states, you have at most 2^10 = 1024 reachable destinations. So you could just generate all reachable destinations in 2^N time (where N is the number of vectors) by a simple recursive generation. This is going to be fast enough, of course. However, let's assume you wanted to stretch it.

You could optimise it to O(2^(N/2+1)) time by using a meet-in-the-middle solution. You split the vector set into two subsets and generate all reachable destinations for each subset independently. You then iterate through one set of reachable destinations, and for each location find the difference between it and the target destination. If that difference vector is in the other set of reachable destinations, you have a solution: combine the two and you're done. The difficulty here is in efficiently querying if you have the required vector in the other set: this can be done in O(1) time using a hash table.

That's O(2^(N/2)) time per subset, and two subsets give O(2^(N/2+1)). Joining the two also takes O(2^(N/2)) time, so the overall running time is O(2^(N/2+1)).
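A minimal sketch of this meet-in-the-middle scheme (names illustrative): split the vectors in half, enumerate each half's subset sums, and join the halves through a hash table, so each membership check is O(1).

```python
from itertools import combinations

def subset_sums(vecs, dim):
    out = {}  # offset -> cheapest subset of indices producing it
    for r in range(len(vecs) + 1):
        for idx in combinations(range(len(vecs)), r):
            s = tuple(sum(vecs[i][d] for i in idx) for d in range(dim))
            if s not in out:  # r only grows, so the first hit is cheapest
                out[s] = idx
    return out

def mitm(start, vectors, dest):
    dim, half = len(start), len(vectors) // 2
    a = subset_sums(vectors[:half], dim)
    b = subset_sums(vectors[half:], dim)
    need = tuple(d - s for d, s in zip(dest, start))  # required total offset
    best = None
    for sa, ia in a.items():
        ib = b.get(tuple(n - x for n, x in zip(need, sa)))
        if ib is not None and (best is None or len(ia) + len(ib) < best[0]):
            best = (len(ia) + len(ib), ia, tuple(half + i for i in ib))
    return best  # (move count, first-half indices, second-half indices) or None
```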

marcog