I'm writing a high volume trading system. We receive messages at around 300-500 per second and these messages then need to be saved to the database as quickly as possible. These messages get deposited on a Message Queue and are then read from there.

I've implemented a Competing Consumer pattern, which reads from the queue and allows for multithreaded processing of the messages. However I'm getting a frequent primary key violation while the app is running.

We're running SQL 2008. The sample table structure would be:

CREATE TABLE TableA
(
    MessageSequence INT PRIMARY KEY,
    Data VARCHAR(50)
)

A stored procedure gets invoked to persist this message and looks something like this:

BEGIN TRANSACTION

INSERT TableA (MessageSequence, Data)
SELECT @MessageSequence, @Data
WHERE NOT EXISTS
(
    SELECT TOP 1 MessageSequence FROM TableA WHERE MessageSequence = @MessageSequence
)

IF (@@ROWCOUNT = 0)
BEGIN
    UPDATE TableA
    SET Data = @Data
    WHERE MessageSequence = @MessageSequence
END

COMMIT TRANSACTION

All of this is in a TRY...CATCH block so if there's an error, it rolls back the transaction.

I've tried using table hints, like ROWLOCK, but it hasn't made a difference. Since the INSERT is evaluated as a single statement, it seems ludicrous that I'm still getting a primary key violation on the insert.

Does anyone have an idea why this is happening? And have you got ANY ideas which may point me in the direction of a solution?

A: 

It might be related to the transaction isolation level. You might need

SET TRANSACTION ISOLATION LEVEL READ COMMITTED

before you start the transaction.

Also, if you have more updates than inserts, try the UPDATE first, check @@ROWCOUNT, and do the INSERT second.
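A minimal sketch of that update-first shape, using the table and parameters from the question:

UPDATE TableA
SET Data = @Data
WHERE MessageSequence = @MessageSequence

IF (@@ROWCOUNT = 0)
BEGIN
    INSERT TableA (MessageSequence, Data)
    VALUES (@MessageSequence, @Data)
END

Note that this still has a race between the check and the INSERT, so under concurrency it still needs the TRY...CATCH or appropriate locking hints to be safe.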

Joe
A: 

This is very similar to post 939831. Ultimately you want to use the hints (ROWLOCK, READPAST, UPDLOCK). READPAST tells SQL Server to skip to the next record if the current one is locked. UPDLOCK tells SQL Server that the read lock is going to escalate to an update lock.

When I implemented something similar I locked the next record by the threadID

UPDATE TOP (1)
    foo
SET
    ProcessorID = @PROCID
FROM
    OrderTable foo WITH (ROWLOCK, READPAST, UPDLOCK)
WHERE
    ProcessorID = 0

Then selected the record

SELECT *
FROM foo WITH (NOLOCK)
WHERE ProcessorID = @PROCID

Then marked it as processed

UPDATE foo
SET ProcessorID = -1
WHERE ProcessorID = @PROCID

Later, during off hours, I perform the relatively expensive delete operation to clear the queue of processed records.
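That cleanup can be sketched as follows (assuming, as above, that processed rows are marked with ProcessorID = -1):

DELETE FROM OrderTable
WHERE ProcessorID = -1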

William Edmondson
+1  A: 

Common problem. Explained here:

Defensive database programming: eliminating IF statements

AlexKuznetsov
A: 

The atomicity of the following statement is what you are after:

INSERT TableA(MessageSequence, Data )
SELECT @MessageSequence, @Data
WHERE NOT EXISTS
(
  SELECT TOP 1 MessageSequence FROM TableA WHERE MessageSequence = @MessageSequence
)

According to this person, it depends on the current isolation level.

mbeckish
He then wants to do an update if there was already a row with that id.
John Saunders
Yes, but the pertinent issue is that the INSERT is causing a PK violation.
mbeckish
+1  A: 

Why is this happening?

SELECT TOP 1 MessageSequence FROM TableA WHERE MessageSequence = @MessageSequence

This SELECT will try to locate the row; if it is not found, the EXISTS operator returns FALSE and the INSERT proceeds. However, the decision to INSERT is based on a state that was true at the time of the SELECT, but that is no longer guaranteed to be true at the time of the INSERT. In other words, you have a race condition where two threads can both look up the same @MessageSequence, both find NOT EXISTS, and both try to INSERT; only the first one will succeed, and the second will cause a PK violation.

How do I solve it?

The quickest fix is to add a WITH (UPDLOCK) hint to the SELECT. This forces the lock placed on the @MessageSequence key to be retained, so the INSERT...SELECT behaves atomically:

INSERT TableA(MessageSequence, Data )
   SELECT @MessageSequence, @Data
   WHERE NOT EXISTS (
      SELECT TOP 1 MessageSequence FROM TableA WITH(UPDLOCK) WHERE MessageSequence = @MessageSequence)

To prevent SQL Server from doing fancy stuff like taking a page lock, you can also add the ROWLOCK hint.

However, that is not my recommendation. My recommendation may surprise you, but it is this: do the operation that is most likely to succeed, and handle the error if it fails. I.e., if your business case makes it more likely for the @MessageSequence to be new, try an INSERT and handle the PK violation if it fails. This way you avoid the spurious look-ups, and the cost of the catch/retry is amortized over the many cases when it succeeds on the first try.
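A minimal sketch of that insert-first approach; the check on error 2627 (SQL Server's "violation of PRIMARY KEY constraint" error) is my illustration, not part of the answer above:

BEGIN TRY
    INSERT TableA (MessageSequence, Data)
    VALUES (@MessageSequence, @Data)
END TRY
BEGIN CATCH
    -- 2627 = violation of PRIMARY KEY constraint: row exists, so update it
    IF ERROR_NUMBER() = 2627
        UPDATE TableA
        SET Data = @Data
        WHERE MessageSequence = @MessageSequence
    ELSE
        -- anything else is unexpected; surface it to the caller
        RAISERROR('Unexpected error persisting message', 16, 1)
END CATCH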

Also, it is perhaps worth investigating the built-in queues that come with SQL Server (Service Broker).

Remus Rusanu
A: 

On a tangent, if you're thinking of a high volume trading system you might want to consider a tick database designed for such data [I'm not exactly sure what "message" you are storing here], such as discussed in this thread for example: http://www.elitetrader.com/vb/showthread.php?threadid=81345.

These are typically in-memory solutions with proprietary query languages. We use kdb+ at our shop.

Joel Goodwin
A: 

I'm not sure what messaging product you use, but it may be worth looking at the transactions not at the DB level but at the MQ level.

Of course, if you are using a TM (transaction manager), the two operations, 1) get from MQ and 2) write to DB, are both 'bracketed' under the same parent commit.

So I am not sure if you are using an implicit or explicit or any TM here (for example, Microsoft's DTC).

  • MessageSequence is the PK, so could the same message from the MQ be getting processed twice?
  • When you perform a GET from the MQ, make sure the GET is committed (i.e. not a DB commit, but an MQ commit); that will ensure the same message cannot be 'popped' by the next thread that writes messages to the DB.
blispr