In Oracle 10g, I'd like to create a regular expression to list the characters that are different between two strings.
Here is the reason : I have a table with a field that contains sometimes Unicode characters that are not in the french language.
I am able to list the rows containing these non standards characters to make a future cleanup with this query :
SELECT DataID, Name, CONVERT(NAME, 'WE8ISO8859P1', 'WE8DEC')
FROM table
WHERE NAME <> CONVERT(NAME, 'WE8ISO8859P1', 'WE8DEC' )
where WE8ISO8859P1 - West European (that I accept)
and WE8DEC - 8-bit character sets from Digital Equipment Corporation (that i know that the application support)
I imagine that with a Oracle regular expression I would be able to extract the list of all these non standards characters. But I'm not familiar with regexp in Oracle so any help would be appreciated.
Here is my (not working) idea :
select regexp_replace("éaé", '[a-z][A-Z]', '' ) from dual;
would give "é" as a character to cleanup.