ansaurus

Question

Efficient way to calculate sum of deltas between consecutive rows?

Answer 1

+2 A:

Start with row_number, then join back to yourself.

with numbered as
(
SELECT value, row_number() over (order by timestamp) as Rownum 
FROM table
)
select sum(n1.value - n2.value)
from numbered n1
  join
  numbered n2  on n1.Rownum = n2.Rownum +1

Actually... you only want to pick up increases... so put a WHERE clause in, saying "WHERE n1.value > n2.value".

And... make sure I've put them the right way around... I've just changed it from -1 to +1, because I think I had it flipped.

Easy!

Rob

Rob Farley 2009-08-13 02:49:05

Answer 2

+2 A:

Much the same...

create table #temp ([timestamp] date,value int);
insert into #temp (timestamp,value) values ('2009-01-01',100)
insert into #temp (timestamp,value) values ('2009-01-02',105)
insert into #temp (timestamp,value) values ('2009-01-03',120)
insert into #temp (timestamp,value) values ('2009-01-04',0)
insert into #temp (timestamp,value) values ('2009-01-05',9);

with numbered as
(
    select ROW_NUMBER() over (order by timestamp) id,value from #temp
)
select sum(n1.value-n2.value) from numbered n1 join numbered n2 on n1.id=n2.id+1 where n1.value!=0

drop table #temp;

Result is 29, as specified.

spender 2009-08-13 02:57:46

+1 for the usage of row_number()

Rashmi Pandit 2009-08-13 04:19:45

Answer 3

A:

There are too many unnecessary joins in your algorithm.

Calculating the difference between each meter reading and its subsequent meter reading is a waste of resources. As a real world example, imagine if my electric company read my meter each day to how much electricity I used, and summed daily values to determine my monthly total - it just doesn't make sense. They simply determine the total based on the start value and the end value!

Simply calculate the difference between the first and last readings and adjust to account for the 'resets'. Your formula simply becomes:

total value = (final value) - (initial value) 
                 + (miscellaneous reductions in value, i.e. resets)
total value = (9) - (100) + (120)
            = 29

It's trivial to find the final value and initial value. Just find the total amount by which 'meter' was reduced during 'resets', and add this to the total. Unless there are more reset records than measurement records, this will always be more efficient.

To borrow from spender's solution, the 'reset' value could be calculated by

create table...

select sum(n1.value-n2.value) from numbered n1 join numbered n2 
     on n1.id=n2.id+1 where n1.value=0  //note value=0 rather than value!=0

Kirk Broadhurst 2009-08-13 02:58:51

ansaurus

tags:

views:

answers:

Efficient way to calculate sum of deltas between consecutive rows?

related questions