tags:

views:

390

answers:

1

I'm working with a Sybase database that seems to have non-printable characters in some of the string fields and this is throwing off some of our processing code. At first glance, it seemed to only be newlines and carriage returns, but we also have an ASCII code 27 in there - an ESC character, some accented characters, and some other oddities in there.

I have no direct access to change the database, so changing the bad data isn't an option, yet. For now I have to make do with just filtering it out. We're trying to export the table data from one database and load it into a database used by another application in a nightly batch process.

Ideally, I'd like to have a function that I can pass a list of characters and just have Sybase return the data with those characters removed. I'd like to keep it something we could do in plain SQL if possible.

Something like this to remove characters that are ASCII 0 - 31.

select str_replace(FIELD1, (0-31), NULL) as FIELD1, str_replace(FIELD2, (0-31), NULL) as FIELD2 from TABLE

So far, str_replace is the nearest I can find, but it only allows replacing one string with another. No support for character ranges and won't let me do the above.

We're running on Sybase ASE 12.5 on Unix servers.

A: 

Something like this works in SQL Server, which uses T-SQL like Sybase:

while @@ROWCOUNT > 0
    UPDATE TheTable
    SET strColumn = REPLACE(strColumn, SUBSTRING(strColumn, 
        PATINDEX('%[^a-zA-Z0-9 ]%', 
        strColumn collate Latin1_General_BIN), 1), '')
    WHERE PATINDEX('%[^a-zA-Z0-9 ]%', 
        strColumn collate Latin1_General_BIN) <> 0

The patindex function at least appears to exist on Sybase.

The collation is required to match binary; otherwise [a] would match 'á'.

Andomar
Thanks for the info. I wasn't aware of the patindex function. That looks like it will get me almost there.Unfortunately, we don't have access to actually modify the data in the table, just filter out the bad data during the extract. I also don't think the version of Sybase we are using has the collate function. But we do have a function called "stuff."Stuff "Returns the string formed by deleting a specified number of characters from one string and replacing them with another string." So if we use patindex to find the chars we need to delete and use stuff to delete them, I think we are set.
Kenny Drobnack