views:

322

answers:

1

I've translated Double-Metaphone into ActionScript3 and I want to test it (obviously) before I release the source to ... um ... the open.

I'm looking for a long list of names with the primary and secondary codes. Google does not find anything except one list with pairs of names (presumably they should match).

Thanks

+3  A: 

You could find someone else's double metaphone implementation, run it on the same long list of words, and compare the results to your own.

For long lists of words, I like infochimps. They have lots of word lists, like this one of 350,000 english words or this one of place names, and many more.

Here are implementations you can compare your results against. Here is an online example, except that it tests only one word at a time - I guess you'll have to download and run one of the scripts to test a large list of words.

For each word, two codes will be returned; you'll probably want to test that both codes returned match the ones returned of another implementation. You probably know that the reference implementation is here with full source code here, but including the links anyway for others' benefit.

thomasrutter
thank you for the word list.I hoped that some other DM implementation had a unit test that I could re-use.Cheers
Richard Haven