ansaurus

Question

Selecting the latest values given data with missing records

Answer 1


Gotta love MySQL for allowing an order by in a subquery. That's not allowed by the SQL standard :)

You could rewrite the query in a standards-compliant way like:

select  *
from    YourTable a
where   not exists
        (
        select  *
        from    YourTable b
        where   a.id = b.id
        and     a.datetime < b.datetime
        )
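
As a quick check, here is a minimal sketch that runs the query above against an in-memory SQLite database from Python. The sample rows are made up for illustration, and the column names id, value and datetime are assumed from the queries in this answer; the original question's schema may differ.

```python
import sqlite3

# Hypothetical sample data: id 1 has two readings, id 2 has only one.
conn = sqlite3.connect(":memory:")
conn.execute("create table YourTable (id integer, value integer, datetime text)")
conn.executemany(
    "insert into YourTable values (?, ?, ?)",
    [
        (1, 10, "2010-10-01 10:00:00"),
        (1, 20, "2010-10-01 12:00:00"),
        (2, 30, "2010-10-01 09:00:00"),
    ],
)

# Keep a row only if no other row with the same id has a later datetime.
latest = conn.execute(
    """
    select  *
    from    YourTable a
    where   not exists
            (
            select  *
            from    YourTable b
            where   a.id = b.id
            and     a.datetime < b.datetime
            )
    order by a.id
    """
).fetchall()

print(latest)
# [(1, 20, '2010-10-01 12:00:00'), (2, 30, '2010-10-01 09:00:00')]
```

The row for id 2 is kept even though it is the only row for that id: the subquery finds no later row, so not exists is true.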

In case several rows for an id share the same latest datetime, the subquery can't tell them apart and the query returns all of them. You can then group by id and pick an arbitrary value:

select  a.id
,       max(a.value)
,       max(a.datetime)
from    YourTable a
where   not exists
        (
        select  *
        from    YourTable b
        where   a.id = b.id
        and     a.datetime < b.datetime
        )
group by
        a.id

This chooses the maximum a.value among the rows sharing the latest datetime. The datetime is the same for all of those duplicate rows, but standard SQL doesn't know that, so you have to specify a way to pick from the equal datetimes. Here, I'm using max, but min would work just as well; avg would too, although it can return a value that isn't stored in any row.
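
To illustrate the tie-breaking, here is a small sketch that runs the grouped query in an in-memory SQLite database from Python. The data is hypothetical: id 1 has two rows with the same latest datetime, and the column names are assumed from the queries above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("create table YourTable (id integer, value integer, datetime text)")
conn.executemany(
    "insert into YourTable values (?, ?, ?)",
    [
        (1, 10, "2010-10-01 10:00:00"),
        (1, 20, "2010-10-01 12:00:00"),
        (1, 25, "2010-10-01 12:00:00"),  # ties with the row above on datetime
        (2, 30, "2010-10-01 09:00:00"),
    ],
)

# The not exists filter keeps both tied rows; group by collapses them to one per id.
rows = conn.execute(
    """
    select  a.id
    ,       max(a.value)
    ,       max(a.datetime)
    from    YourTable a
    where   not exists
            (
            select  *
            from    YourTable b
            where   a.id = b.id
            and     a.datetime < b.datetime
            )
    group by
            a.id
    order by
            a.id
    """
).fetchall()

print(rows)
# [(1, 25, '2010-10-01 12:00:00'), (2, 30, '2010-10-01 09:00:00')]
```

Swapping max(a.value) for min(a.value) would return 20 for id 1 instead of 25; either way there is exactly one row per id.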

Andomar 2010-10-01 15:19:11

Nice! Okay - but we're assuming that there is more than one value per id, so we can compare and get the "latest" - but this doesn't account for the corner case where there is only one value, because it won't satisfy `a.datetime < b.datetime`. ;) I worked around that with a separate query to make sure there are always 2, but just food for thought. :)

Julian H. Lam 2010-10-01 15:44:58

@Julian H. Lam: The condition is behind `not exists`, so if no other row satisfies it, the row is included in the result set. I'd expect the query to work if there's only one value.

Andomar 2010-10-01 15:53:32
