phonetics

Machine learning challenge: learn English pronunciation

Say you want to take CMU's phonetic data set input that looks like this:

ABERRATION AE2 B ER0 EY1 SH AH0 N
ABERRATIONAL AE2 B ER0 EY1 SH AH0 N AH0 L
ABERRATIONS AE2 B ER0 EY1 SH AH0 N Z
ABERT AE1 B ER0 T
ABET AH0 B EH1 T
ABETTED AH0 B EH1 T IH0 D
ABETTING AH0 B EH1 T IH0 NG
ABEX EY1 B EH0 K S
ABEYANCE AH0 B EY1 AH0 N S

(The w...
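As a starting point for working with input of this shape, here is a minimal sketch of reading such lines into a word-to-phonemes mapping. It assumes each entry is a word followed by whitespace-separated phoneme symbols, with a trailing digit on vowels marking stress, as in the sample above. The names `parse_cmudict` and `strip_stress` are hypothetical helpers, not part of any CMU tooling, and the sketch does not handle alternate-pronunciation entries.

```python
import re

# A few entries copied from the excerpt above, one per line.
SAMPLE = """\
ABERRATION AE2 B ER0 EY1 SH AH0 N
ABET AH0 B EH1 T
ABEX EY1 B EH0 K S
"""

def parse_cmudict(text):
    """Map each word to its list of phoneme symbols (hypothetical helper)."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and ';;;' comment lines (assumed comment marker).
        if not line or line.startswith(";;;"):
            continue
        word, *phones = line.split()
        entries[word] = phones
    return entries

def strip_stress(phones):
    """Drop the trailing stress digit from each symbol, e.g. 'EH1' -> 'EH'."""
    return [re.sub(r"\d$", "", p) for p in phones]

pronunciations = parse_cmudict(SAMPLE)
print(pronunciations["ABET"])
print(strip_stress(pronunciations["ABET"]))
```

Whether to keep or strip the stress digits depends on what the learner is expected to predict; the sketch keeps both forms available.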

What is a good phonetic alphabet for a password application?

I'm writing an application that will create "difficult" passwords for the user. The user will only see the password once, and they'll copy it into a client that will remember it for them forever. Works great on a full-fledged PC, where I can select the whole password while it's on display in the browser, then paste it into the client t...

How to check if a string looks randomized, or human-generated and pronounceable?

For the purpose of identifying [possible] bot-generated usernames. Suppose you have a username like "bilbomoothof". It may be nonsense, but it still contains pronounceable sounds and so appears human-generated. I accept that it could have been randomly generated from a dictionary of syllables, or word parts, but let's assume for a mom...
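One crude way to approximate "contains pronounceable sounds" is to look at the mix of vowels and consonants. The sketch below is only an illustrative heuristic, not an established method: the function names and all thresholds (`max_run`, `min_ratio`, `max_ratio`) are assumptions chosen for the example, and treating "y" as a vowel is a simplification. It would also pass strings built at random from syllables, which the question already concedes.

```python
VOWELS = set("aeiouy")  # treating 'y' as a vowel is a simplifying assumption

def longest_consonant_run(name):
    """Length of the longest unbroken run of consonant letters."""
    longest = current = 0
    for ch in name.lower():
        if ch.isalpha() and ch not in VOWELS:
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest

def vowel_ratio(name):
    """Fraction of alphabetic characters that are vowels (0.0 if none)."""
    letters = [ch for ch in name.lower() if ch.isalpha()]
    if not letters:
        return 0.0
    return sum(ch in VOWELS for ch in letters) / len(letters)

def looks_pronounceable(name, max_run=3, min_ratio=0.25, max_ratio=0.65):
    """Heuristic check; the thresholds are illustrative guesses, not tuned."""
    return (longest_consonant_run(name) <= max_run
            and min_ratio <= vowel_ratio(name) <= max_ratio)

print(looks_pronounceable("bilbomoothof"))
print(looks_pronounceable("xkqzvbnt"))
```

A statistical approach, such as scoring character sequences against frequencies learned from real words, would likely separate the two classes better than fixed thresholds.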