ansaurus

Question

What's a good algorithm for editing a "schedule" most efficiently?

Answer 1

+1 A:

You post is almost in the "too long; didnt read" category - shortening it will probably give you more feedback.

Anyway, on topic: you can try lookin into a thing called "Interval Algebra"

ADEpt 2008-10-05 19:14:15

Thanks, ADEpt, I think you're right. I'll give it a shot.

Adam Bellaire 2008-10-05 19:17:18

Answer 2

+1 A:

As I understand you, your users can only directly affect table A. Assuming you are programming in C#, you could use a simple ADO.Net DataSet to manage modifications to table A. The TableAdapter knows to leave untouched rows alone and to handle new, modified and deleted rows appropriately.

In addition you should define a cascading delete in order to automatically remove corresponding objects in table B.

The only case that is not handled this way is if a timespan in table A is shortened s.t. it does not subsume the corresponding record in Table B anymore. You could simply check for that case in an update stored procedure or alternatively define an update-trigger on table A.

Manu 2008-10-05 19:42:51

Answer 3

+1 A:

It seems to me like any algorithm for this will be involve a pass through NewA, matching ResourceID, StartTime, and EndTime, and keeping track of which elements from OldA get hit. Then you have two sets of non-matching data, UnmatchedNewA and UnmatchedOldA.

The simplest way I can think of to proceed is to basically start over with these: Write all of UnmatchedNewA to the DB, transfer elements of B from UnmatchedOldA into New A keys (just generated) where possible, deleting when not. Then wipe out all of UnmatchedOldA.

If there are a lot of changes, this is certainly not an efficient way to proceed. In cases where the size of the data is not overwhelming, though, I prefer simplicity to clever optimization.

It's impossible to know whether this final suggestion makes any sense without more background, but on the off chance that you didn't think of it this way:

Instead of passing the entire A collection back and forth, could you use event listeners or something similar to update the data model only where changes ARE needed? This way, the objects being altered would be able to determine which DB operations are required on the fly.

grossvogel 2008-10-05 19:56:14

Thanks, grossvogel, that was actually the approach that I was considering (wiping out old set/writing new set). It didn't seem terribly efficient, hence the question here, but there is something to be said for an approach that's easily explained. I may end up doing it this way.

Adam Bellaire 2008-10-05 20:45:06

Answer 4

+2 A:

I suggest you decouple your questions into two separate ones: The first should be something like: "How do I reason about resource scheduling, when representing a schedule atom as a resource with start time and end time?" Here, ADept's suggestion to use interval algebra seems fitting. Please see The Wikipedia entry 'Interval Graph' and The SUNY algorithm repository entry on scheduling. The second question is a database question: "Given an algorithm which schedules intervals and indicate whether two intervals overlap or one is contained in another, how do I use this information to manage a database in the given schema?" I believe that once the scheduling algorithm is in place, the database question will be much easier to solve. HTH, Yuval

Yuval F 2008-10-06 12:07:51

Answer 5

+4 A:

I have worked extensively with periods, but I'm afraid I don't understand entirely how table A and B work together, perhaps it's the word subsume that I don't understand.

Can you give some concrete examples of what you want done?

Do you mean that timespans recorded in table A contains entirely timespans in table B, like this?

|---------------- A -------------------|
    |--- B ----|      |--- B ---|

or overlaps with?

    |---------------- A -------------------|
|--- B ----|                        |--- B ---|

or the opposite way, timespans in B contains/overlaps with A?

Let's say it's the first one, where timespans in B are inside/the same as the linked timespan in table A.

Does this mean that:

* A removed A-timespan removes all the linked timespans from B
* An added A-timespan, what about this?
* A shortened A-timespan removes all the linked timespans from B that now falls outside A
* A lenghtened A-timespan, will this include all matching B-timespans now inside?

Here's an example:

|-------------- A1 --------------|    |-------- A2 --------------|
  |---- B1 ----|  |----- B2 ---|       |---- B3 ----|  |-- B4 --|

and then you lengthen A1 and shorten and move A2, so that:

|-------------- A1 ---------------------------------|  |--- A2 --|
  |---- B1 ----|  |----- B2 ---|       |---- B3 ----|  |-- B4 --|

this means that you want to modify the data like this:

1. Lengthen (update) A1
2. Shorten and move (update) A2
3. Re-link (update) B3 from A2 to A1 instead

how about this modification, A1 is lengthened, but not enough to contain B3 entirely, and A2 is moved/shortened the same way:

|-------------- A1 -----------------------------|      |--- A2 --|
  |---- B1 ----|  |----- B2 ---|       |---- B3 ----|  |-- B4 --|

Since B3 is now not entirely within either A1 or A2, remove it?

I need some concrete examples of what you want done.

Edit More questions

Ok, what about:

|------------------ A -----------------------|
  |------- B1 -------|  |------- B2 ------|
                           |---|                   <-- I want to remove this from A

What about this?

Either:

|------------------ A1 ----|   |---- A2 -----|
  |------- B1 -------|  |B3|   |--- B2 ---|

or:

|------------------ A1 ----|   |---- A2 -----|
  |------- B1 -------|

To summarize how I see it, with questions, so far:

You want to be able to do the following operations on A's
- Shorten
- Lengthen
- Combine when they are adjacent, combining two or more into one
- Punch holes in them by removing a period, and thus splitting it
B's that are still contained within an A after the above update, relink if necessary
B's that were contained, but are now entirely outside, delete them
B's that were contained, but are now partially outside, Edit: Delete these, ref data integrity
For all the above operations, do the least minimum work necessary to bring the data in line with the operations (instead of just removing everything and inserting anew)

I'll work on an implementation in C# that might work when I get home from work, I'll come back with more later tonight.

Edit Here's a stab at an algorithm.

Optimize the new list first (ie. combine adjacent periods, etc.)
"merge" this list with the master periods in the database in the following way:
1. keep track of where in both lists (ie. new and existing) you are
2. if the current new period is entirely before the current existing period, add it, then move to the next new period
3. if the current new period is entirely after the current existing period, remove the existing period and all its child periods, then move to the next existing period
4. if the two overlap, adjust the current existing period to be equal to the new period, in the following way, then move on to the next new and existing period
  1. if new period starts before existing period, simply move the start
  2. if new period starts after existing period, check if any child periods are in the difference-period, and remember them, then move the start
  3. do the same with the other end
with any periods you "remembered", see if they needs to be relinked or deleted

You should create a massive set of unit tests and make sure you cover all combinations of modifications.

Lasse V. Karlsen 2008-10-06 12:12:24

Hi lassevk, thanks for the response. Yes, it is the first scenario (B is totally contained within A). I'll try to add a concrete example to my request.

Adam Bellaire 2008-10-06 13:27:28

lassevk, I regret that I have but one upvote to give for your well-thought out response. I'll probably try to implement this tomorrow morning!

Adam Bellaire 2008-10-06 18:52:58

No worries .. I have one too :)

Learning 2008-12-18 12:23:44

ansaurus

tags:

views:

answers:

What's a good algorithm for editing a "schedule" most efficiently?

related questions