views:

147

answers:

6

I am wondering what the literal for a Null character (e.g. '\0') is in TSQL.

Note: not a NULL field value, but the null character (see link).

I have a column with a mix of typical and a null character. I'm trying to replace the null character with a different value. I would have thought that the following would work but it is unsuccessfull:

select REPLACE(field_with_nullchar, char(0), ',') from FOO where BAR = 20
+2  A: 

There are two different behaviors in the Cade Roux's answer: replacement is successful (when SQL collation is used) and unsuccessful (Windows collation is used). The reason is in type of collation used.

This behaviour was submitted to Microsoft nearly 4 years ago:

Q: When trying a replace a NUL character with replace(), this works is the value has an SQL collation, but not a Windows collation.

A: This is due to the fact that 0x0000 is an undefined character in Windows collations. All undefined characters are ignored during comparison, sort, and pattern matching. So searing for 'a' + char(0) is really searching for ‘a’, and searching for char(0) is equivalent to empty string.

The way to handle undefined character is a bit confusing, but this is the way that Windows defined to sort them, and SQL Server conforms with the general Windows API.

In SQL collation, there is no notion of undefined character. Each code point is assigned a weight, that's why we don't see a problem there.

but unfortunately, it is still undocumented.

So, it seems the only one solution is to change collation to SQL collation (e.g. SQL_Latin1_General_CP1_CI_AS may be used as well).

* I removed my previous answer as unnecessary

Alex
Exactly. He has embedded string terminators (NULL in the C sense, not SQL) and wants to fix them.
GalacticCowboy
A: 

The null character literal is just an empty string '', e.g. the below code returns 1

SELECT CASE WHEN char(0) = '' THEN 1 ELSE 0 END

However removing empty strings from a field won't do anything. What are you actually trying to acheive?, perhaps it is another character you need to replace with commas.

Ben Robinson
`REPLACE('', char(0), ',')` doesn't replace anything, but `REPLACE(char(0), char(0), ',')` does.
Alex
-1, wrong. Did you test that code? It returns 0 for me. `char(0)` is the ascii character represented by the number 0.
Blorgbeard
It returns 1 when i run it but perhaps i am wrong.
Ben Robinson
Weird - what version of sql server? I'm on 2008.
Blorgbeard
Ah - it's a collation thing. Using `SQL_Latin1_General_CP1_CI_AS` it returns 0, using `SQL_Latin1_General_CP1_CS_AS` it return 1. Not sure why. I'd remove my downvote but it's been too long, sorry.
Blorgbeard
+3  A: 

Looks like the C-style terminator is a terminator in SQL as well:

SELECT  REPLACE(bad, CHAR(0), ' ')
FROM    (
         SELECT 'a' + CHAR(0) + 'b' AS bad
        ) AS X

Looks like it's also dependent on COLLATION:

SELECT  REPLACE(CAST(bad COLLATE SQL_Latin1_General_CP1_CI_AS AS varchar(10)), CHAR(0), ' ')
FROM    (
         SELECT 'a' + CHAR(0) + 'b' AS bad
        ) AS X

works as expected, compared to:

SELECT  REPLACE(CAST(bad COLLATE Latin1_General_CI_AS AS varchar(10)), CHAR(0), ' ')
FROM    (
         SELECT 'a' + CHAR(0) + 'b' AS bad
        ) AS X
Cade Roux
A: 

Are you certain they are null characters? How did you get them in there?

It looks like SQL Server treats them as string terminators. This query:

select 'aaa' + char(0) + 'bbb'

Returns aaa for me (on SQL Server 2008).

Edit: Above is wrong - it's just the results grid that treats them that way. They show up in text mode.

Blorgbeard
Have you looked at the results in text mode? When I run that in grid mode it displays as "aaa", but in text-mode for the result set it displays "aaa bbb"
Tom H.
@Tom - oh, you're right. Cheers.
Blorgbeard
A: 

I just ran the test below on my server (2008) and it was successful. It may have to do with an ANSI setting. I'll try flipping some settings here and see if I can reproduce your issue.

DECLARE @test_null_char VARCHAR(20)

SET @test_null_char = 'aaa' + CHAR(0) + 'bbb'

SELECT @test_null_char    -- Returns "aaa bbb"

SET @test_null_char = REPLACE(@test_null_char, CHAR(0), 'ccc')

SELECT @test_null_char    -- Returns "aaacccbbb"
Tom H.
A: 

A VARBINARY cast should work with any collation

SELECT 
   REPLACE(CAST(CAST(fld AS VARCHAR(5)) AS VARBINARY(5)), 0x0, ',')
FROM 
   (SELECT 'QQ' + CHAR(0) + 'WW' COLLATE Latin1_General_CI_AS AS fld) AS T

SELECT 
   REPLACE(CAST(CAST(fld AS VARCHAR(5)) AS VARBINARY(5)), 0x0, ',')
FROM 
   (SELECT 'QQ' + CHAR(0) + 'WW' COLLATE SQL_Latin1_General_CP1_CI_AS AS fld) AS T 

>>QQ,WW
>>QQ,WW
Alex K.