ansaurus

Question

How do reduce transaction log growth for batched nvarchar(max) updates

Answer 1

A:

What you may have to do is surround each "chunk" or group of chunks with it's own transaction and commit after each group. Surrounding the entire thing with your own ADO transaction is essentially doing the same thing as the implicit transaction does, so that won't help. You have to commit in smaller chunks to keep the log smaller.

BBlake 2010-01-06 17:05:39

Answer 2

+1 A:

If by 'chunks' you mean something like:

UPDATE table
SET blob = blob + @chunk
WHERE key = @key;

Then you are right that the operation is fully logged. You should follow the BLOB usage guidelines and use the .Write methods for chuncked updates:

UPDATE table
SET blob.Write(@chunk, NULL, NULL)
WHERE key = @key;

This will minimally log the update (if possible, see Operations That Can Be Minimally Logged):

The UPDATE statement is fully logged; however, partial updates to large value data types using the .WRITE clause are minimally logged.

Not only that this is minimally logged, but because the update is an explicit write at the end of the BLOB, the engine will know that you only updated a portion of the BLOB and will only log that. When you update with SET blob=blob+@chunk te engine will see that the entire BLOB has received a new value and won't detect the fact that you really only changed the BLOB by appending new data, so the it will log the entire BLOB (several times, as you already found out).

BTW you should use chunks of size multiple of 8040:

For best performance, we recommend that data be inserted or updated in chunk sizes that are multiples of 8040 bytes.

Remus Rusanu 2010-01-06 18:31:23

BTW, in full logged model you'll still see the log size reduction when using .WRITE just from the fact that the engine understands that you're doing a partial update and not an entire BLOB column update.

Remus Rusanu 2010-01-06 19:55:22

Thanks for your thorough answer. Unfortunately--which I neglected to mention--the column is actually XML (a big blob of text wrapped in XML tags), and .WRITE doesn't work for XML columns.

Christopher 2010-01-06 22:38:40

Each individual chunk is valid XML then? You can try column.modify, using XML X-Query insert to append the chunks, but I'm not sure how that works in regard to log space. See http://msdn.microsoft.com/en-us/library/ms175466.aspx

Remus Rusanu 2010-01-06 22:49:02

Not only you omitted to mention is XML, you said in the title is nvarchar(max) lol :) Anyway, it was actualy useful exercise for me to digg up all the info on chunked updates and I may make use in a project soon of what I learned.

Remus Rusanu 2010-01-06 22:53:23

The chunks are submitted as nvarchar(max), but the chunks are appended to the XML blob: <xml>[chunk1]</xml>…<xml>[chunk1][chunk2]</xml>…etc.

Christopher 2010-01-07 01:02:35

You can use XML modify then.

Remus Rusanu 2010-01-07 01:08:17

ansaurus

tags:

views:

answers:

How do reduce transaction log growth for batched nvarchar(max) updates

related questions