ansaurus

Question

Are there good Patterns/Idioms for Data Translation/Transformation?

Answer 1

+1 A:

When designing a data compression code, you need to make sure that no code is a prefix of another, longer code; otherwise the beginning of a long code could be mistaken for a shorter one. This prefix-free property is the basis of Huffman coding. The alternative is to separate your tokens with a delimiter, as Morse code does with a short pause between characters.
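As a rough illustration of the requirement that no code may be the beginning of another, here is a minimal Python sketch that checks a set of codes for it. The function name `is_prefix_free` and the sample code sets are made up for this example.

```python
def is_prefix_free(codes):
    """Return True if no code is a prefix of a different code."""
    ordered = sorted(set(codes))
    # In sorted order, a code that is a prefix of any other code is
    # always a prefix of its immediate successor, so comparing
    # neighbours is sufficient.
    return all(not b.startswith(a) for a, b in zip(ordered, ordered[1:]))


# Hypothetical code sets, for illustration only.
unambiguous = ["0", "10", "110", "111"]
ambiguous = ["0", "01", "11"]   # "0" is the beginning of "01"

print(is_prefix_free(unambiguous))
print(is_prefix_free(ambiguous))
```

If the check fails, the options are to redesign the codes or to fall back to an explicit delimiter between tokens.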

Bill the Lizard 2009-08-05 12:39:44

Answer 2

+1 A:

Years ago I would have immediately said to look at Bison (http://www.gnu.org/software/bison/) or Yacc, but I haven't done anything like this for some time, so I don't know whether there is anything better now.

Using them might be a bit over the top for what you are doing, but the idioms they use might be helpful.

Dipstick 2009-08-05 12:41:36

Yes, it feels a lot like parsing. I need something more generic, though: something that can turn a set of input structs into a list of output objects.

Frerich Raabe 2009-08-05 12:51:17

Answer 3

+1 A:

You could define a graph where each node contains an input token and an associated output. The links of each node describe the possible next tokens. Thus, a path in the graph describes a possible transformation rule.

To transform the data, start from the node corresponding to the first input token and follow the longest possible path through the graph, matching each subsequent input token against the nodes linked to the current node. When no linked node matches the next input token, take the output associated with the current node as the result.
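The traversal described above could be sketched in Python roughly as follows. This is only one possible reading of the idea, not a definitive implementation: the names `Node`, `build_graph` and `translate` are invented for the example, the graph is built as a tree of rules, and the sketch assumes that a node may have no output when no rule ends there, in which case it falls back to the last node on the path that did have one.

```python
class Node:
    """One step in a rule: links to possible next tokens, plus an output."""

    def __init__(self):
        self.children = {}   # next input token -> Node
        self.output = None   # set only if some rule ends at this node


def build_graph(rules):
    """Build the graph from (input token sequence, output) pairs."""
    root = Node()
    for tokens, output in rules:
        node = root
        for token in tokens:
            node = node.children.setdefault(token, Node())
        node.output = output
    return root


def translate(root, tokens):
    """Repeatedly apply the longest matching rule to the input tokens."""
    results = []
    pos = 0
    while pos < len(tokens):
        node = root
        best_output, best_end = None, pos
        i = pos
        # Walk as far as the input allows, remembering the last node
        # on the path that carries an output.
        while i < len(tokens) and tokens[i] in node.children:
            node = node.children[tokens[i]]
            i += 1
            if node.output is not None:
                best_output, best_end = node.output, i
        if best_output is None:
            raise ValueError(f"no rule matches input at position {pos}")
        results.append(best_output)
        pos = best_end
    return results


# Hypothetical rules: the two-token rule wins over the one-token rule.
rules = [(("a",), "A"), (("a", "b"), "AB"), (("c",), "C")]
graph = build_graph(rules)
print(translate(graph, ["a", "b", "a", "c"]))  # ['AB', 'A', 'C']
```

Because the longest path always wins, the rules do not have to be listed in any particular order.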

Cătălin Pitiș 2009-08-05 12:43:00

Yes, this 'longest match wins' idea is an alternative. So far, I have manually sorted the order in which the rules are tried, which has worked well enough for me (but maybe it doesn't scale well enough, I don't know).

Frerich Raabe 2009-08-05 12:45:23

See the updated question for why I cannot define one big graph.

Frerich Raabe 2009-08-05 12:50:32
