ansaurus

Question

T-SQL absence by month from start date end date

Answer 1

+2 A:

I have had a similar issue where there has been a table of start/end dates designed for data storage but not for reporting.

I looked for the "fastest executing" solution and found that it was to create a second table holding the month start dates. I populated it with the months from Jan 2000 to Jan 2070. I'm expecting that will suffice, or that I get a large pay cheque in 2070 to come and update it...

CREATE TABLE months (start DATETIME NOT NULL PRIMARY KEY)
-- Populate with all month start dates that may ever be needed
-- As recommended, the table is keyed (and therefore indexed) by start

SELECT
    months.start,
    data.id,
    -- END is a reserved word in T-SQL, so the end column is bracketed
    SUM(CASE WHEN data.start < months.start
            THEN DATEDIFF(DAY, months.start, data.[end])
            ELSE DATEDIFF(DAY, data.start, DATEADD(month, 1, months.start))
        END) AS days
FROM
    data
INNER JOIN
    months
        ON data.start < DATEADD(month, 1, months.start)
        AND data.[end] > months.start
GROUP BY
   months.start,
   data.id
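
For anyone wanting to try this, here is a rough sketch of how the months table could be filled with every month start from Jan 2000 to Jan 2070. The loop and the @m variable are my own illustration, not something from the answer itself, and it assumes SQL Server with a table named months keyed on start.

```sql
-- Create the months table only if it does not exist yet (assumed name and shape).
IF OBJECT_ID('months', 'U') IS NULL
    CREATE TABLE months ([start] DATETIME NOT NULL PRIMARY KEY);

-- @m is a hypothetical loop variable holding the month start being inserted.
DECLARE @m DATETIME;
SET @m = '20000101';

WHILE @m <= '20700101'
BEGIN
    -- Skip months that are already present so the script can be re-run safely.
    IF NOT EXISTS (SELECT 1 FROM months WHERE [start] = @m)
        INSERT INTO months ([start]) VALUES (@m);

    SET @m = DATEADD(month, 1, @m);
END;
```

A row-by-row loop is slow in general, but this is a one-off load of a few hundred rows, so it should not matter here.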

That join can be quite slow for various reasons. I'll look for an answer to another question that shows why, and how to optimise the join.

EDIT:

Here is another answer relating to overlapping date ranges and how to speed up the joins...

http://stackoverflow.com/questions/452499/query-max-number-of-simultaneous-events/453180#453180

Dems 2009-01-30 16:11:51

That's pretty much my current thinking: join to a table holding a list of dates. Obviously I could maintain that with an overnight run or something, or generate it on the fly.

PeteT 2009-01-30 16:14:11

I did something similar to the date table, but I created a CLR UDF (this is with SQL Server) that would generate a table of dates. Again assuming SQL Server, you could use common table expressions to do the same.
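
As a rough illustration of the common table expression route mentioned here, the sketch below generates the same month starts on the fly with a recursive CTE. It assumes SQL Server 2005 or later; the CTE name months and the date range are only examples.

```sql
-- Recursive CTE: start at Jan 2000 and keep adding one month until Jan 2070.
WITH months ([start]) AS (
    SELECT CAST('20000101' AS DATETIME)
    UNION ALL
    SELECT DATEADD(month, 1, [start])
    FROM months
    WHERE [start] < '20700101'
)
SELECT [start]
FROM months
-- The default recursion limit is 100 levels, which is too few for 70 years of months.
OPTION (MAXRECURSION 1000);
```

The CTE can be joined to in place of a permanent table, at the cost of regenerating the rows on every query and having no index on them.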

Sean Bright 2009-01-30 16:14:29

The overhead of having a table of dates/months is really low. I would have it as a permanent table. You get the benefits of the indexes then too.

Dems 2009-01-30 16:16:26

Nah, it allows me to generate hour, day, week, month, or year ranges with whatever criteria I need and is lightning fast.

Sean Bright 2009-01-30 16:18:34

Still, having one or more permanent tables is always faster than generating them on the fly. And in my case the indexing of the table made a difference...

Dems 2009-01-30 16:20:38

Answer 2

A:

I tried this solution, but it gives incorrect results when there are one or more complete months between the start and end date, e.g. start date 15-5-2010, end date 15-7-2010. It correctly returns 17 days for May 2010, but then it incorrectly returns the remaining 45 days (30 for June + 15 for July) for June 2010, and then correctly returns the remaining 15 days for July 2010. So the 45 days for June is not correct. Does anyone have a solution for this?

Hennie 2010-06-09 12:22:52
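
One possible way around this, sketched under my own assumptions rather than taken from the original answer, is to clamp both ends of each absence to the month being counted: take the later of the two start dates and the earlier of the two end dates. The table variables @months and @data and the aliases m and d below are made up so the example is self-contained.

```sql
-- Sample month starts covering the example range.
DECLARE @months TABLE ([start] DATETIME PRIMARY KEY);
INSERT INTO @months VALUES ('20100501');
INSERT INTO @months VALUES ('20100601');
INSERT INTO @months VALUES ('20100701');

-- One absence from 15 May 2010 to 15 July 2010.
DECLARE @data TABLE (id INT, [start] DATETIME, [end] DATETIME);
INSERT INTO @data VALUES (1, '20100515', '20100715');

SELECT
    m.[start],
    d.id,
    SUM(DATEDIFF(DAY,
            -- later of the absence start and the month start
            CASE WHEN d.[start] > m.[start]
                 THEN d.[start] ELSE m.[start] END,
            -- earlier of the absence end and the start of the next month
            CASE WHEN d.[end] < DATEADD(month, 1, m.[start])
                 THEN d.[end] ELSE DATEADD(month, 1, m.[start]) END
        )) AS days
FROM @data AS d
INNER JOIN @months AS m
    ON  d.[start] < DATEADD(month, 1, m.[start])
    AND d.[end]   > m.[start]
GROUP BY
    m.[start],
    d.id;
```

With the end date treated as exclusive, this should give 17 days for May, 30 for June and 14 for July. If the end date is meant to be inclusive, which the 15 days quoted for July suggests, using DATEADD(day, 1, d.[end]) wherever d.[end] appears should give 17, 30 and 15 instead.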
