views:

37

answers:

0

I'm trying to create a mapping for tile/pixel positions in one tiled image dataset to tile/pixel position in a different dataset (that roughly cover the same image area). Due to rotations of the imaged area, panning axis, translations, and scale changes, there are many unknowns to account for in this mapping. This situation is illustrated below, where the rectangles represent the imaged area for each tile, and the layout of the tiles is governed by the relationship between the panning axis (black lines) and the camera axis:

alt textalt text

The problem reduces down to the following linear system which takes into account the scanning axis being rotated with respect to the camera axis differently in the two datasets, plus rotation,scale,and translations between the imaged areas in the two datasets. Unfortunately I'm not sure how to go about solving for the unknowns:

|tx ty|X|a b|+|px py|X|e f|+|i j|=|tx* ty*|X|k l|+|px* py*|  
        |c d|         |g h|                 |m n|

The unknowns in this system are (a,b,c,d,e,f,g,h,i,j,k,l,m,n) and basically describe a mapping from tile and pixel positions in dataset1 (tx,ty,px,py) to tile and pixel positions in dataset2(tx*,ty*,px*,py*). Coming up with 14 (or more) such correspondences isn't too big a deal for me, as I have a log-polar image registration that works pretty well for affine mapping a tile from dataset1 to a tile in dataset2 if they contain sufficient image overlap. The problem is I'd rather get the above mapping determined rather than doing this registration between every single tile in dataset1 and 2 would take forever!!.

Any help greatly appreciated! -Craig

Edit:

I just figured out I can rearrange the problem like so (I think?):

|tx ty px py 1|X|a b|=|tx* ty* px* py* 1|X|k l|  
                |c d|                     |m n|
                |e f|                     |1 0|
                |g h|                     |0 1|
                |i j|                     |1 1|

This is looking closer to what I need, but I still need help trying to figure out how to convert the answer I'd get back into a form I could use (most importantly I think I'd need to explicitly know (k,l,m,n) to convert the mapped coordinate back into a tile and pixel position in the target.... no?

Edit2: Changed the notation to be correct for matrix multiplication Edit3: Added illustration images since I now can!