ansaurus

Question

"Crypt Kicker Problem" (Programming Challenges)

Answer 1

+2 A:

A minor optimization could be done by enumerating possibilities before the backtracking run. In Python:

dictionary = ['and', 'dick', 'jane', 'puff', 'spot', 'yertle']
line = ['bjvg', 'xsb', 'hxsn', 'xsb', 'qymm', 'xsb', 'rqat', 'xsb', 'pnetfn']

# ------------------------------------

import collections

# Group the dictionary words by length.
words_of_length = collections.defaultdict(list)

for word in dictionary:
    words_of_length[len(word)].append(word)

# possibilities: encrypted letter -> set of candidate plain letters.
# certainities: encrypted letter -> plain letter, taken from word lengths
# that have exactly one dictionary word.
possibilities = collections.defaultdict(set)
certainities = {}

for word in line:
    length = len(word)
    for i, letter in enumerate(word):
        if len(words_of_length[length]) == 1:
            match = words_of_length[length][0]
            certainities[letter] = match[i]
        else:
            for match in words_of_length[length]:
                possibilities[letter].add(match[i])

# A plain letter that is already certain cannot be a candidate for any other letter.
for letter in certainities.values():
    for k in possibilities:
        possibilities[k].discard(letter)

for i, j in certainities.items():
    possibilities[i] = set([j])

# ------------------------------------

import pprint
pprint.pprint(dict(possibilities))

Output:

{'a': set(['c', 'f', 'o']),
 'b': set(['d']),
 'e': set(['r']),
 'f': set(['l']),
 'g': set(['f', 'k']),
 'h': set(['j', 'p', 's']),
 'j': set(['i', 'p', 'u']),
 'm': set(['c', 'f', 'k', 'o']),
 'n': set(['e']),
 'p': set(['y']),
 'q': set(['i', 'j', 'p', 's', 'u']),
 'r': set(['j', 'p', 's']),
 's': set(['n']),
 't': set(['t']),
 'v': set(['c', 'f', 'o']),
 'x': set(['a']),
 'y': set(['i', 'p', 'u'])}

If you have some single-element possibilities, you can eliminate them from the input and rerun the algorithm.
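One way to read "eliminate and rerun" is as a fixed-point loop over the candidate sets. The sketch below is only an interpretation of that idea, not code from the answer: the `propagate` helper is a hypothetical name, and it assumes `possibilities` maps each encrypted letter to a set of candidate plain letters, as in the code above.

```python
def propagate(possibilities):
    """Repeatedly remove resolved plain letters from the other candidate sets."""
    # Work on copies so the caller's sets are left untouched.
    result = dict((k, set(v)) for k, v in possibilities.items())
    changed = True
    while changed:
        changed = False
        # Plain letters already pinned down by a single-element set,
        # mapped back to the encrypted letter that owns them.
        resolved = dict((next(iter(v)), k) for k, v in result.items() if len(v) == 1)
        for cipher, candidates in result.items():
            for plain, owner in resolved.items():
                if owner != cipher and plain in candidates:
                    candidates.discard(plain)
                    changed = True
    return result


# Small hand-made example: resolving 'a' unlocks 'b', which unlocks 'c'.
example = {'a': set('x'), 'b': set('xy'), 'c': set('xyz')}
narrowed = propagate(example)
```

An empty set in the result would mean the candidate sets contradict each other. On the sample output above, this loop should change nothing, because the certain letters have already been discarded from every other set.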

EDIT: Switched to set instead of list and added printing code.

Max Shawabkeh 2010-02-01 08:41:59

Thank you, I'll take those optimizations into consideration!

Andrei Ciobanu 2010-02-01 10:31:32

Once the dictionary size grows, this is going to be less useful, no?

jk 2010-02-01 11:24:46

jk, yes. For larger inputs you're probably better off doing a simple depth-first search with nodes ordered by letter frequency, as per Sylvestre's answer.

Max Shawabkeh 2010-02-01 13:12:22

Answer 2

+1 A:

KeyArray will hold the replacement table.

- Start with an empty KeyArray; this is version 0.

- Match the longest encrypted word to the longest dictionary word and add the
  letter pairs to KeyArray (if there are two longest, pick any); this is version 1.

- Decrypt some letters of the next longest encrypted word.
- Check whether the decrypted letters match the letters in the same
  positions in any dictionary word of the same length.
- If none matches, go back to version 0 and try another word.
- If some letters match, add the rest of the letters to KeyArray; this is version 2.

- Decrypt some letters of the next longest encrypted word.
- Check whether the decrypted letters match the letters in the same
  positions in any dictionary word of the same length.
- If none matches, go back to version 1 and try another word.
- If some letters match, add the rest of the letters to KeyArray; this is version 3.

Repeat until all words are decrypted.

If, at version 0, none of the longest words produces a partial decryption of
the shorter words, there is very probably no solution.
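The versioned KeyArray above behaves like a backtracking search, which can be sketched roughly as follows. This is a minimal illustration under assumptions, not the answerer's code: the names `solve`, `extend` and `search` are made up here, each "version" is a copied dict, and instead of decrypting a few letters first, it simply tries every dictionary word of the same length and rejects those that conflict with the current key.

```python
def solve(encrypted_words, dictionary):
    """Return an encrypted->plain letter mapping that fits every word, or None."""
    # Longest words first (ties alphabetical): each guess fixes more letters.
    words = sorted(set(encrypted_words), key=lambda w: (-len(w), w))

    def extend(key, used, cipher, plain):
        """Add cipher->plain to copies of the key; return None on any conflict."""
        key, used = dict(key), set(used)
        for c, p in zip(cipher, plain):
            if c in key:
                if key[c] != p:
                    return None  # letter already decrypts to something else
            elif p in used:
                return None      # plain letter already taken by another letter
            else:
                key[c] = p
                used.add(p)
        return key, used

    def search(index, key, used):
        if index == len(words):
            return key
        cipher = words[index]
        for plain in dictionary:
            if len(plain) != len(cipher):
                continue
            step = extend(key, used, cipher, plain)
            if step is not None:
                found = search(index + 1, *step)
                if found is not None:
                    return found
        return None  # nothing fits: fall back to the previous version

    return search(0, {}, set())


dictionary = ['and', 'dick', 'jane', 'puff', 'spot', 'yertle']
line = ['bjvg', 'xsb', 'hxsn', 'xsb', 'qymm', 'xsb', 'rqat', 'xsb', 'pnetfn']
key = solve(line, dictionary)
decoded = [''.join(key[c] for c in word) for word in line]
```

Because every version is a fresh copy, "going back to version N" is just returning from the recursive call; nothing has to be undone by hand.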

Carlos Gutiérrez 2010-02-01 09:44:02

Thank you, I will look into this approach!

Andrei Ciobanu 2010-02-01 10:33:00

Answer 3

+1 A:

Another possible optimization: if you have "enough" text to deal with and you know the text's language, you can use letter frequencies (see http://en.wikipedia.org/wiki/Letter_frequency). This is of course a very approximate approach when dealing with 6 or 7 words, but it will be the fastest way if you have a few pages to decode.
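A rough sketch of the frequency idea, for illustration only: the `frequency_guess` helper is a hypothetical name, and the `ENGLISH_ORDER` string is an assumed, approximate ranking of English letters (published rankings differ in the tail). The result is a first guess to refine, not a finished key.

```python
import collections

# Assumption: approximate English letter order, most to least frequent.
ENGLISH_ORDER = 'etaoinshrdlcumwfgypbvkjxqz'


def frequency_guess(ciphertext):
    """Pair encrypted letters with English letters by frequency rank."""
    counts = collections.Counter(c for c in ciphertext.lower() if c.isalpha())
    # Most frequent first; ties are broken alphabetically so the result is stable.
    ranked = sorted(counts, key=lambda c: (-counts[c], c))
    return dict(zip(ranked, ENGLISH_ORDER))


guess = frequency_guess('xxx yy z')
```

In this toy input, `x` is the most frequent letter and is therefore guessed to be `e`. On a text of only a few words such guesses are mostly wrong, which is why the approach needs a few pages of input.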

EDIT: About Max's solution, you could also try to extract some characteristics of the words, such as repeated letters. For example, noticing that puff in the dictionary and qymm in the encrypted text are the only four-letter words ending with a double letter immediately gives 3 of the letters. In more complex scenarios, you should be able to narrow the possibilities for each letter pair.
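The repeated-letter characteristic can be captured as a word "pattern", as in the sketch below. The helper names `pattern` and `candidates` are invented for this illustration; the dictionary is the one from Max's answer.

```python
def pattern(word):
    """Number each letter by order of first appearance: 'puff' -> (0, 1, 2, 2)."""
    first_seen = {}
    return tuple(first_seen.setdefault(ch, len(first_seen)) for ch in word)


def candidates(cipher_word, dictionary):
    """Dictionary words with the same length and repeated-letter structure."""
    wanted = pattern(cipher_word)
    return [word for word in dictionary if pattern(word) == wanted]


dictionary = ['and', 'dick', 'jane', 'puff', 'spot', 'yertle']
matches = candidates('qymm', dictionary)
```

Here `matches` contains only `puff`, since it is the only dictionary word whose pattern equals that of `qymm`. Equal patterns also imply equal length, so no separate length check is needed.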

Sylvestre Equy 2010-02-01 09:50:29

And it's cool, because Sherlock Holmes used it in 'The Adventure of the Dancing Men' :) http://monpinillos.wordpress.com/2008/06/02/holmess-skills-cryptography/

Carlos Gutiérrez 2010-02-01 09:56:07

Unfortunately I'll have to generate "enough" text. But that will be a cool problem in itself: "generating encrypted text". Thanks for your advice

Andrei Ciobanu 2010-02-01 10:31:01
