I'm trying to identify [possibly] bot-generated usernames.
Suppose you have a username like "bilbomoothof". It may be nonsense, but it still contains pronounceable sounds and so appears human-generated.
I accept that it could have been randomly generated from a dictionary of syllables, or word parts, but let's assume for a moment that the bot in question is a bit rubbish.
- Suppose you have a username like "sdfgbhm342r3f". To a human this is clearly a random string, but can it be identified programmatically?
- Are there any algorithms available (similar to Soundex, etc.) that can identify pronounceable sounds within a string like this?
Solutions applicable in PHP/MySQL most appreciated.
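
To illustrate the kind of check I have in mind, here is a rough sketch in PHP. It is only a naive heuristic, not an established algorithm: it assumes a pronounceable name has a reasonable share of vowels and no long run of consonants. The function name `looksPronounceable` and both thresholds are made up for this example and would need tuning against real data.

```php
<?php
// Hypothetical helper, not part of PHP or any library.
// Both default thresholds are guesses chosen for illustration only.
function looksPronounceable(string $name, float $minVowelRatio = 0.25, int $maxConsonantRun = 4): bool
{
    $lower = strtolower($name);
    if ($lower === '') {
        return false;
    }

    // Share of vowels across the whole string ("y" is treated as a vowel here),
    // so digits and symbols lower the ratio.
    $vowelCount = preg_match_all('/[aeiouy]/', $lower);
    $vowelRatio = $vowelCount / strlen($lower);

    // Length of the longest unbroken run of consonants.
    $longestRun = 0;
    if (preg_match_all('/[bcdfghjklmnpqrstvwxz]+/', $lower, $matches)) {
        foreach ($matches[0] as $run) {
            $longestRun = max($longestRun, strlen($run));
        }
    }

    return $vowelRatio >= $minVowelRatio && $longestRun <= $maxConsonantRun;
}
```

With these thresholds, `looksPronounceable('bilbomoothof')` passes and `looksPronounceable('sdfgbhm342r3f')` fails, but a bot that builds names from syllables would pass too, which is why I'm asking whether something more principled exists.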