ansaurus

Question

Answer 1

A:

You are considering two options

generating a list of required updates at the time of modification, client asks "what's in my list of updates"?
keeping timestamps such that a client can ask "My latest time for DSx is ..., do I need an update?"

I see option 2 as prefereable. I think it is more resilient to problems, recovery from client crashes etc. This is because each participant keeps only what it knows: the date of its data. The server does not need to "understand" who took what.

I think you can optimise the determination of what to upload. You speak as though the client needs to iterate through all it's data sets, retrieveing each time stamp one at a time, and making a decision to retrieve. Instead you could have a web service call:

I have DS1=<time>, DS2=<time> ...; which do I need to download?

The actual decision is made by the server on the basis of the data sent by the client, rather than the client fetching data to allow it to make the decision itself.

In passing I should point out that there are commercial products than handle all this. Do you really need to code it?

djna 2010-01-13 12:11:51

Hi Djna!Thanks for your answer. You do have an interesting suggestion regarding solution 2. However, if I have a large number of data sets, I am afraid that I would have to send a large number of timestamps to the server so that the server can make the decision of what to send back. For instance, if I have 10 000 data sets, I would have to send 10 000 timestamps. I think I should be able to represent a timestamp using 16 bytes and say another 4 bytes for a data set ID. Therefore, I guess the data itself, for 10 000 datasets would be like 200k. This thing could grow to 150k datasets.

Jason 2010-01-14 11:16:29

But if you 10,000 out of date data sets then the volume of data is going to be absurd, let alone the volume of the timestamps. If the set out out of date date is likely to be sparse compared with the overall available data then use some more cunning scheme. Such as organise the data hierarchically and update it hierarchically. "I have updated items under data/abc on 14 Jan 1t 14:15" anything changed since?

djna 2010-01-14 14:37:41

Thanks for the suggestion about a hierarchical structure. It sounds like a really good idea!

Jason 2010-01-15 22:22:37

ansaurus

tags:

views:

answers:

In house simple data synchronization

related questions