I receive data files from a source I have no control over (the government) and in the records they have a Company Name field that I actually need to associate with existing company records in my database. I'm concerned that some of the names will vary by minor differences such as 'Company X, Inc.' vs 'Company X Inc'.
So my initial thoughts would be to create a collation key field based on the name ToLower()
and apply a regex to strip out all white space, and special characters.
Is there any better methodology to apply to this?