ansaurus

Question

How to calculate the smallest period of time between consecutive events?

Answer 1

A:

You could try this:

SELECT
    T1.*,
    (SELECT MIN(T2.time)
     FROM temperatures T2
     WHERE T2.time > T1.time)-T1.time diff
FROM
    temperatures T1
ORDER BY
    T1.time

Lasse V. Karlsen 2009-05-23 08:48:25

Works fine in a database that handles subqueries well, but takes ages on MySQL with a test of about 10,000 rows :( Also produces odd values for `diff`, e.g. -20090500997993

araqnid 2009-05-23 13:11:53
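
The odd `diff` values mentioned above are most likely what MySQL produces when two DATETIME values are subtracted with `-`: each value is first converted to a number of the form YYYYMMDDHHMMSS, so the result is not a duration. A minimal sketch of the same query using `TIMESTAMPDIFF` instead, assuming `time` is a DATETIME column (the alias `diff_seconds` is made up for illustration):

```sql
-- Sketch only: assumes temperatures.time is a DATETIME column.
-- TIMESTAMPDIFF(unit, start, end) returns end - start in the given unit.
SELECT
    T1.*,
    TIMESTAMPDIFF(
        SECOND,
        T1.time,
        (SELECT MIN(T2.time)          -- next reading after the current one
         FROM temperatures T2
         WHERE T2.time > T1.time)
    ) AS diff_seconds
FROM
    temperatures T1
ORDER BY
    T1.time;
```

The last row has no later reading, so its `diff_seconds` is NULL. The correlated subquery still runs once per row, so this does not address the speed problem.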

Answer 2

+2 A:

Try a query like this:

select 
    cur.timestamp as CurrentTime,
    prev.timestamp as PreviousTime,
    timediff(cur.timestamp,prev.timestamp) as TimeDifference,
    cur.temperature - prev.temperature as TemperatureDifference
from temperatures cur
left join temperatures prev on prev.timestamp < cur.timestamp
left join temperatures inbetween
    on prev.timestamp < inbetween.timestamp
    and inbetween.timestamp < cur.timestamp
where inbetween.timestamp is null

The first join finds all earlier rows for the current ("cur") row. The second join looks for rows that fall between the previous row and the current row. The WHERE clause requires that no such in-between row exists. That way, you get a list of rows, each paired with its immediately preceding row.

Andomar 2009-05-23 08:49:53

This will work, but will be quite slow and won't handle duplicate timestamps correctly.

Quassnoi 2009-05-23 17:03:14

Don't you need a MIN() function applied to the time difference, and an appropriate GROUP BY clause? You don't need to check that there is no reading between the two, because if there were, the time difference between the first and second would be smaller than the difference between the first and third, and hence MIN() would get rid of it. In other contexts, you have to ensure there is no intermediate reading between current and previous, and that complicates things no end.

Jonathan Leffler 2009-05-23 17:05:10

@Jonathan: there is a join with INBETWEEN that does this check.

Quassnoi 2009-05-23 17:16:35

Answer 3

+4 A:

What you need are the analytic functions LAG and MIN.

They are missing in MySQL, but can be easily emulated using session variables.
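
For comparison, on an engine that does provide window functions (MySQL added them in version 8.0), the LAG-plus-MIN idea might be sketched as follows. The sample table, its column types, and the names `gap_seconds`, `min_gap_seconds` and `gaps` are assumptions made for illustration, not part of the original question.

```sql
-- Hypothetical sample table; column names follow the queries in this answer.
CREATE TABLE temperatures (
    id          INT PRIMARY KEY,
    time        DATETIME NOT NULL,
    temperature DECIMAL(5,2) NOT NULL
);

INSERT INTO temperatures (id, time, temperature) VALUES
    (1, '2009-05-23 08:00:00', 20.5),
    (2, '2009-05-23 08:10:00', 21.0),
    (3, '2009-05-23 08:13:00', 21.4),
    (4, '2009-05-23 08:30:00', 22.1);

-- LAG(time) fetches the previous row's time in (time, id) order;
-- the outer MIN picks the smallest gap and ignores the NULL of the first row.
SELECT MIN(gap_seconds) AS min_gap_seconds
FROM (
    SELECT TIMESTAMPDIFF(SECOND,
                         LAG(time) OVER (ORDER BY time, id),
                         time) AS gap_seconds
    FROM temperatures
) gaps;
```

With the four sample rows the gaps are 600, 180 and 1020 seconds, so the query should return 180.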

This query returns all differences between consecutive records:

SELECT  (temperature - @r) AS diff,
        @r := temperature
FROM    (
        SELECT  @r := 0
        ) vars,
        temperatures
ORDER BY
        time

This one returns minimal time difference:

SELECT  mindiff
FROM    (
        SELECT  id,
                @m := LEAST(@m, TIMEDIFF(time, @r)) AS mindiff,
                @r := time
        FROM    (
                SELECT  @m := INTERVAL 100 YEAR,
                        @r := NULL
                ) vars,
                temperatures
        ORDER BY
                time, id
        ) qo
WHERE   qo.id = 
        (
        SELECT  id
        FROM    temperatures
        ORDER BY
                time DESC, id DESC
        LIMIT 1
        )

See this article in my blog on how to emulate analytic functions in MySQL:

Analytic functions: FIRST_VALUE, LAST_VALUE, LEAD, LAG

If you add a PRIMARY KEY to your table (which you should always, always do!), then you may use a more SQL-ish solution:

SELECT  temperature -
        (
        SELECT temperature
        FROM   temperatures ti
        WHERE  (ti.timestamp, ti.id) < (tout.timestamp, tout.id)
        ORDER BY
               ti.timestamp DESC, ti.id DESC
        LIMIT 1
        )
FROM    temperatures tout  -- TO is a reserved word, so the outer alias is "tout"
ORDER BY
       tout.timestamp, tout.id

This solution, though, is quite inefficient in MySQL due to bug 20111.

The subquery will not use the range access path, though it will use an index on (timestamp, id) for ordering.

This may be worked around by creating a UDF that returns previous temperature, given the current record's id.

See this article in my blog for details:

Analytic functions: optimizing LAG, LEAD, FIRST_VALUE, LAST_VALUE

If you don't use any filtering conditions, then the solution that uses session variables will be the most efficient, though it is MySQL-specific.

Similar solutions for SQL Server will look like this:

SELECT  temperature -
        (
        SELECT TOP 1 temperature
        FROM   temperatures ti
        WHERE  ti.timestamp < tout.timestamp
               OR (ti.timestamp = tout.timestamp AND ti.id < tout.id)
        ORDER BY
               ti.timestamp DESC, ti.id DESC
        )
FROM    temperatures tout  -- TO is a reserved word, so the outer alias is "tout"
ORDER BY
       tout.timestamp, tout.id

and

SELECT  MIN(mindiff)
FROM    (
        SELECT  timestamp -
                (
                SELECT TOP 1 timestamp
                FROM   temperatures ti
                WHERE  ti.timestamp < tout.timestamp
                       OR (ti.timestamp = tout.timestamp AND ti.id < tout.id)
                ORDER BY
                       ti.timestamp DESC, ti.id DESC
                ) AS mindiff
        FROM    temperatures tout  -- no ORDER BY here: not allowed in a derived table without TOP
        ) q

In SQL Server, this will work OK, provided you have an index on (timestamp, id) (or just on (timestamp), if your PRIMARY KEY is clustered).

Quassnoi 2009-05-23 12:24:34

Perverse. So wrong it's right.

araqnid 2009-05-23 13:12:18

@araqnid: MySQL (sigh).

Quassnoi 2009-05-23 14:16:56

+1 Very nice. Tried it on SQL Server but got "A SELECT statement that assigns a value to a variable must not be combined with data-retrieval operations."

Andomar 2009-05-23 15:57:33

@Andomar: no, it's MySQL specific. Oracle has native support for LAG, and in SQL Server you'll need to emulate it with subqueries.

Quassnoi 2009-05-23 16:44:16

@Andomar: I updated the post with a SQL Server solution.

Quassnoi 2009-05-23 17:00:15

The answers from Andomar and Lasse V. Karlsen are also great, but to me this one is cleaner. Thanks for the blog post, btw!

coma 2009-05-23 18:25:02

Answer 4

+3 A:

Assuming that there is a unique constraint on the time stamp (to prevent there being two recordings at the same time):

SELECT MIN(timediff(t2.`time`, t1.`time`)) AS delta_t
    FROM temperatures t1 JOIN temperatures t2 ON t1.`time` < t2.`time`

This answers the question rather precisely, and doesn't convey other useful information (such as which two timestamps or temperatures are involved).
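
If the two timestamps are wanted as well, one possible variation is to sort the pairs by their difference and keep only the first. This is a rough sketch under the same unique-timestamp assumption; the column aliases `earlier` and `later` are made up for illustration.

```sql
-- Sketch only: returns one closest pair of readings, not just the gap.
-- TIMEDIFF(a, b) returns a - b, so the later time goes first.
SELECT t1.`time` AS earlier,
       t2.`time` AS later,
       TIMEDIFF(t2.`time`, t1.`time`) AS delta_t
FROM temperatures t1
JOIN temperatures t2 ON t1.`time` < t2.`time`
ORDER BY delta_t
LIMIT 1;
```

If several pairs share the smallest gap, which one is returned is arbitrary. Like the query above, this compares every row with every later row, so it is likely to be slow on large tables.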

Jonathan Leffler 2009-05-23 17:10:59

What makes you think @op wants the time difference? Temperature difference makes more sense in this query.

Quassnoi 2009-05-23 17:17:48

Never mind, I missed the question title as usual :)

Quassnoi 2009-05-23 17:27:37
