views: 256

answers: 5

We have three web services (/a, /b, /c) where each service maps to a method (go()) in a separate Java class (ClassA, ClassB, ClassC).

Only one service should run at any given time (i.e. /b cannot run while /a is running). However, as this is a REST API, there is nothing to prevent clients from requesting that the services run concurrently.

What is the best and simplest way to enforce on the server that the services don't run concurrently?


Update: This is an internal app; we will not have a large load and will have just a single app server.

Update: This is a subjective question, as you can make different arguments about the overall application design, which affects the final answer. I accepted overthink's answer as I found it the most interesting and helpful.

+3  A: 

Firstly, without knowing your architecture, you are probably going to run into issues if you have to enforce concurrency restrictions in the web service tier. Whilst you could use traditional locks etc. to serialise the requests across the services, what happens when you add a second web tier to scale your solution? If the locks are local to the web layer they will be next to useless.

I'm guessing there is probably a layer of some sort that sits below the web services, and it's there that you need to enforce these restrictions. If client B comes in after client A has made a conflicting request, the backend should reject the request when it detects that the state has changed, and you should then return a 409 (Conflict) to the second client. In the end race conditions are still possible, but you have to have your lowest common layer protect you from conflicting requests.
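As a rough illustration of that idea (a minimal JAX-RS-style sketch assuming a single JVM; ServiceB, X-Expected-Version and stateVersion are illustrative names, not anything from the question), the backend can carry a version of the shared state and refuse any request built against a stale version:

import java.util.concurrent.atomic.AtomicLong;
import javax.ws.rs.HeaderParam;
import javax.ws.rs.POST;
import javax.ws.rs.Path;
import javax.ws.rs.core.Response;

@Path("/b")
public class ServiceB {
    // Version of the shared state, bumped by every successful conflicting operation.
    private static final AtomicLong stateVersion = new AtomicLong(0);

    @POST
    public Response go(@HeaderParam("X-Expected-Version") long expectedVersion) {
        // If another request changed the state since this client last read it, refuse with 409.
        if (!stateVersion.compareAndSet(expectedVersion, expectedVersion + 1)) {
            return Response.status(409).build();
        }
        // ... do B's work ...
        return Response.ok().build();
    }
}

The client would obtain the expected version from an earlier read of the shared state, so a request based on a stale view is turned away rather than serialised.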

jkp
A: 

You could use a semaphore of some kind to keep access across services serial.
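A minimal sketch of that approach, assuming a single JVM (ServiceGate is an illustrative name); a single-permit java.util.concurrent.Semaphore plays the role of the shared gate:

import java.util.concurrent.Semaphore;

public class ServiceGate {
    // One permit shared by ClassA, ClassB and ClassC; only one go() can hold it at a time.
    public static final Semaphore PERMIT = new Semaphore(1, true); // fair: first come, first served
}

public class ClassA {
    public void go() {
        ServiceGate.PERMIT.acquireUninterruptibly();
        try {
            // do A stuff
        } finally {
            ServiceGate.PERMIT.release();
        }
    }
}

ClassB and ClassC would follow the same pattern. If queuing waiting requests is not desirable, tryAcquire() could be used instead to fail fast and return an error to the second client.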

Trent
+5  A: 

Assuming it's not ok to just force the web server to have only one listening thread serving requests... I suppose I'd just use a static lock (ReentrantLock probably, for clarity, though you could sync on any shared object, really):

import java.util.concurrent.locks.Lock;
import java.util.concurrent.locks.ReentrantLock;

public class Global {
    // Single lock shared by all three services.
    public static final Lock webLock = new ReentrantLock();
}

public class ClassA {
    public void go() {
        Global.webLock.lock();
        try {
            // do A stuff
        } finally {
            Global.webLock.unlock();
        }
    }
}

public class ClassB {
    public void go() {
        Global.webLock.lock();
        try {
            // do B stuff
        } finally {
            Global.webLock.unlock();
        }
    }
}

public class ClassC {
    public void go() {
        Global.webLock.lock();
        try {
            // do C stuff
        } finally {
            Global.webLock.unlock();
        }
    }
}
overthink
@overthink: What happens when he wants more service tiers? This solution won't scale if the locks are implemented on those tiers.
jkp
@overthink: So you're saying you could also use `synchronized(sharedObject) {...}` blocks instead of the `Global.webLock.lock()/unlock()` statements?
Marcus
@Marcus: Yes, you could use synchronized instead of lock/unlock. e.g. in my example above you could make `webLock` a simple `Object` and use `synchronized(webLock) {...}` instead.
overthink
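For reference, a minimal sketch of that synchronized variant (same structure as the answer above, with webLock as a plain Object):

public class Global {
    // Any shared object can serve as the monitor.
    public static final Object webLock = new Object();
}

public class ClassA {
    public void go() {
        synchronized (Global.webLock) {
            // do A stuff
        }
    }
}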
@jkp: I'm not attempting to give design advice; just answering the question as it was asked. You're correct that this wouldn't work if there are multiple web servers involved.
overthink
+3  A: 

Your design is flawed. The services should be idempotent. If the classes you have don't support that, redesign them until they do. Sounds like each of the three methods should be the basis for the services, not the classes.

duffymo
Can you elaborate?
Marcus
The only reason I can see for preventing the go() method in class A, B, and C from running concurrently is that they're sharing something. If that's class data or database data, you should try to redesign so they share nothing. That would allow them to run concurrently and not worry about interfering. If that's not possible, then you should have one service that calls them in proper sequence to ensure that they can't be called out of order. In either case, I think your design is flawed and the cure will be worse than the disease.
duffymo
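A minimal sketch of the "one service calls them in proper sequence" suggestion from the comment above (DailyRunService and runAll are illustrative names, assuming the existing go() methods):

public class DailyRunService {
    private final ClassA a = new ClassA();
    private final ClassB b = new ClassB();
    private final ClassC c = new ClassC();

    // Exposed as a single endpoint, so clients cannot invoke the steps out of order or concurrently.
    public synchronized void runAll() {
        a.go();
        b.go();
        c.go();
    }
}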
+100 if I could. There is a big problem somewhere in the design and semaphores, locks, etc are not the way to solve this.
Pascal Thivent
Point taken... this entire process involves a financial calculation. `C` is the final calculation run at the end of the day. `A` uploads some settings, which can be done at any time, and `B` is an intermediate set of calculations that are done midday. So under normal operations these three functions will never happen at the same time. But to be thorough we want to ensure they can't.
Marcus
Sounds to me like A is the only service here. The other two sound more like batch jobs that are kicked off at a particular time. The only concern you'd have with that approach is making sure that they finished within their appointed window. If that's a fair summary, I'd recommend looking at Spring Batch: http://static.springsource.org/spring-batch/
duffymo
"...REST API..." - don't confuse a SOAP-less web service with a true REST approach. If REST is modeled after the stateless HTTP protocol, your services don't merit the title - they aren't stateless.
duffymo
A: 

Why not use hypermedia to constrain access?

Use something like,

POST /A

to initiate the first process. Then, when it is complete, the results should provide a link to follow to initiate the second process,

<ResultsOfProcessA>
  <Status>Complete</Status>
  <ProcessB href="/B"/>
</ResultsOfProcessA>

Follow the link to initiate the second process,

POST /B

and repeat for part C.

Arguably a badly behaved client could cache the link to step B and attempt to re-use it in some future request to circumvent the sequence. However, it would not be too difficult to assign some kind of token when doing step A and require that the token be passed to steps B and C to prevent the client from constructing the URL manually.
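A minimal sketch of that token idea, assuming a single JVM (StepTokens, issueTokenForB and redeemTokenForB are illustrative names): step A issues a one-time token that is embedded in the link to /B, and step B consumes it.

import java.util.UUID;

public class StepTokens {
    private static String currentToken; // token that currently unlocks step B

    // Called when step A completes; the returned token is embedded in the link to /B.
    public static synchronized String issueTokenForB() {
        currentToken = UUID.randomUUID().toString();
        return currentToken;
    }

    // Called at the start of step B; consumes the token so the link cannot be replayed.
    public static synchronized boolean redeemTokenForB(String presented) {
        if (presented != null && presented.equals(currentToken)) {
            currentToken = null;
            return true;
        }
        return false;
    }
}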

Reading your comments further, it seems that you have a situation where A could be run either before or after B. In this case I would suggest creating a resource D that represents the status of the entire set of processes (A, B and C). When a client retrieves D it is presented with the URIs that it is allowed to follow. Once a client has initiated process A, the D resource should remove the B link for the duration of the processing. The opposite should occur when B is initiated before A.

The other advantage of this technique is that it is obvious if A or B has been run for the day as the status can be displayed in D. Once A and B have been run then D can contain a link for C.

The hypermedia is not a 100% foolproof solution, because you could have two clients with the same copy of D, both of which might think that process A has not been run and attempt to run it simultaneously. This could be addressed by putting some kind of "Last Modified" timestamp on D and updating it whenever the status of D changes, which would allow the later request to be denied. Based on the description of your scenario this seems more of an edge case, and the hypermedia would catch most attempts to run the processes in parallel.
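A minimal sketch of that timestamp check, assuming JAX-RS (StatusResourceD and the path are illustrative): the client sends back D's Last-Modified value in an If-Unmodified-Since header, and a request based on a stale copy of D is rejected with 412.

import java.util.Date;
import javax.ws.rs.POST;
import javax.ws.rs.Path;
import javax.ws.rs.core.Context;
import javax.ws.rs.core.Request;
import javax.ws.rs.core.Response;

@Path("/d/a")
public class StatusResourceD {
    private static Date lastModified = new Date(); // updated whenever the status of D changes

    @POST
    public Response startA(@Context Request request) {
        synchronized (StatusResourceD.class) {
            // Returns a non-null builder (412 Precondition Failed) if the client's copy of D is stale.
            Response.ResponseBuilder stale = request.evaluatePreconditions(lastModified);
            if (stale != null) {
                return stale.build();
            }
            lastModified = new Date();
            // ... kick off process A ...
            return Response.ok().lastModified(lastModified).build();
        }
    }
}

Note that HTTP dates only have one-second resolution, so an ETag on D would be a more precise variant of the same conditional-request check.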

Darrel Miller