views: 351
answers: 6

I feel like I've seen this question asked before, but neither the SO search nor google is helping me... maybe I just don't know how to phrase the question. I need to count the number of events (in this case, logins) per day over a given time span so that I can make a graph of website usage. The query I have so far is this:

select 
   count(userid) as numlogins, 
   count(distinct userid) as numusers, 
   convert(varchar, entryts, 101) as date 
from 
   usagelog 
group by 
   convert(varchar, entryts, 101)

This does most of what I need (I get a row per date as the output containing the total number of logins and the number of unique users on that date). The problem is that if no one logs in on a given date, there will not be a row in the dataset for that date. I want it to add in rows indicating zero logins for those dates. There are two approaches I can think of for solving this, and neither strikes me as very elegant.

  1. Add a column to the result set that lists the number of days between the start of the period and the date of the current row. When I'm building my chart output, I'll keep track of this value and if the next row is not equal to the current row plus one, insert zeros into the chart for each of the missing days.
  2. Create a "date" table that has all the dates in the period of interest and outer join against it. Sadly, the system I'm working on already has a table for this purpose that contains a row for every date far into the future... I don't like that, and I'd prefer to avoid using it, especially since that table is intended for another module of the system and would thus introduce a dependency on what I'm developing currently.

Any better solutions or hints at better search terms for google? Thanks.

A: 
WITH q(n) AS
          (
          SELECT  0
          UNION   ALL
          SELECT  n + 1
          FROM    q
          WHERE   n < 99
          ),
    qq(n) AS 
          (
          SELECT  0
          UNION   ALL
          SELECT  n + 1
          FROM    qq
          WHERE   n < 99
          ),
    dates AS
          (
          SELECT  q.n * 100 + qq.n AS ndate
          FROM    q, qq
          )
SELECT    COUNT(userid) as numlogins,
          COUNT(DISTINCT userid) as numusers,
          CAST('2000-01-01' AS DATETIME) + ndate as date
FROM      dates
LEFT JOIN
          usagelog
ON        entryts >= CAST('2000-01-01' AS DATETIME) + ndate
          AND entryts < CAST('2000-01-01' AS DATETIME) + ndate + 1
GROUP BY
          ndate

This will select up to 10,000 dates constructed on the fly, which should be enough for roughly 27 years.

SQL Server has a limit of 100 recursions per CTE, which is why each of the inner queries can return at most 100 rows.

If you need more than 10,000, just add a third CTE qqq(n) and cross-join with it in dates.
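The same technique can be sketched in SQLite (driven from Python), whose `WITH RECURSIVE` syntax differs slightly from T-SQL's; the `usagelog` table and sample rows below are invented for illustration:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE usagelog (userid INTEGER, entryts TEXT)")
# Two logins on Jan 1, none on Jan 2, one on Jan 3
conn.executemany("INSERT INTO usagelog VALUES (?, ?)",
                 [(1, "2000-01-01"), (2, "2000-01-01"), (1, "2000-01-03")])

rows = conn.execute("""
    WITH RECURSIVE dates(d) AS (
        SELECT '2000-01-01'
        UNION ALL
        SELECT date(d, '+1 day') FROM dates WHERE d < '2000-01-03'
    )
    SELECT d,
           COUNT(u.userid)          AS numlogins,
           COUNT(DISTINCT u.userid) AS numusers
    FROM dates
    LEFT JOIN usagelog u ON u.entryts = d
    GROUP BY d
    ORDER BY d
""").fetchall()

print(rows)  # [('2000-01-01', 2, 2), ('2000-01-02', 0, 0), ('2000-01-03', 1, 1)]
```

Note how the `LEFT JOIN` against the generated date list produces a zero-count row for Jan 2, which a plain `GROUP BY` on `usagelog` alone would omit.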

Quassnoi
SQL Server does not have a limitation of 100 rows per CTE. I think it has a limit of 100 recursions in a CTE, but that is very different.
Tom H.
Just checked, and actually the DEFAULT limit is 100 recursions. You can set that with MAXRECURSION up to 32,767
Tom H.
Sure, you're right
Quassnoi
A: 

Create an in-memory table (a table variable), insert your date ranges into it, then outer join the logins table against it. Group by your start date; then you can perform your aggregations and calculations.

Adam Robinson
+1  A: 

The strategy I normally use is to UNION with the opposite of the query, generally a query that retrieves data for rows that don't exist.

If I wanted to get the average mark for a course, but some courses weren't taken by any students, I'd need to UNION with those not taken by anyone to display a row for every class:

SELECT AVG(mark), course FROM `marks` GROUP BY course
    UNION
SELECT NULL, course FROM courses WHERE course NOT IN
    (SELECT course FROM marks)

Your query will be more complex, but the same principle should apply. You may indeed need a table of dates for your second query.
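A runnable sketch of this UNION pattern in SQLite (via Python); the `courses`/`marks` schema and data are invented for illustration:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE courses (course TEXT);
    CREATE TABLE marks (course TEXT, mark REAL);
    INSERT INTO courses VALUES ('art'), ('math'), ('music');
    INSERT INTO marks VALUES ('math', 80), ('math', 90), ('art', 70);
""")

rows = conn.execute("""
    SELECT AVG(mark), course FROM marks GROUP BY course
    UNION
    SELECT NULL, course FROM courses
    WHERE course NOT IN (SELECT course FROM marks)
    ORDER BY course
""").fetchall()

print(rows)  # [(70.0, 'art'), (85.0, 'math'), (None, 'music')]
```

The second half of the UNION contributes a NULL-average row for 'music', the course no student has taken, so every course appears in the output.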

David Caunt
+3  A: 

Frankly, I'd do this programmatically when building the final output. You're essentially trying to read something from the database which is not there (data for days that have no data). SQL isn't really meant for that sort of thing.

If you really want to do that, though, a "date" table seems your best option. To make it a bit nicer, you could generate it on the fly, using e.g. your DB's date functions and a derived table.
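Filling the gaps programmatically when building the chart output might look like the following Python sketch; the function name and the sparse `(date, count)` rows are made up for illustration:

```python
from datetime import date, timedelta

def fill_missing_days(rows, start, end):
    """Expand sparse (date, numlogins) rows into one entry per day,
    inserting zero counts for days with no logins."""
    counts = dict(rows)
    out = []
    d = start
    while d <= end:
        out.append((d, counts.get(d, 0)))
        d += timedelta(days=1)
    return out

# Sparse query result: no row for Jan 2
rows = [(date(2000, 1, 1), 2), (date(2000, 1, 3), 1)]
filled = fill_missing_days(rows, date(2000, 1, 1), date(2000, 1, 3))
print(filled)
# [(datetime.date(2000, 1, 1), 2), (datetime.date(2000, 1, 2), 0), (datetime.date(2000, 1, 3), 1)]
```

This keeps the SQL simple at the cost of a small loop in the application layer, which is often the pragmatic trade-off for chart rendering.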

sleske
This is also a good idea :)
David Caunt
I ended up using the existing date table in my app and just tolerating the dependency it introduced. It was the fastest solution.
rmeador
A: 

  1. Create a temp table, insert the dates in the range, and do a left outer join with the usagelog.
  2. Programmatically insert the missing dates while evaluating the result set to produce the final output.

kishore
+1  A: 

I had to do exactly the same thing recently. This is how I did it in T-SQL (YMMV on speed, but I've found it performant enough over a couple of million rows of event data):

DECLARE @DaysTable TABLE ( [Year] INT, [Day] INT )

DECLARE @StartDate DATETIME
SET @StartDate = whatever

WHILE (@StartDate <= GETDATE())
BEGIN

  INSERT INTO @DaysTable ( [Year], [Day] )
  SELECT DATEPART(YEAR, @StartDate), DATEPART(DAYOFYEAR, @StartDate)

  SELECT @StartDate = DATEADD(DAY, 1, @StartDate)
END

-- This gives me a table of all days since whenever
-- (you could select @StartDate as the minimum date of your usage log)

SELECT days.Year, days.Day, events.NumEvents
FROM @DaysTable AS days
LEFT JOIN (
  SELECT
    COUNT(*) AS NumEvents,
    DATEPART(YEAR, LogDate) AS [Year],
    DATEPART(DAYOFYEAR, LogDate) AS [Day]
  FROM LogData
  GROUP BY
    DATEPART(YEAR, LogDate),
    DATEPART(DAYOFYEAR, LogDate)
) AS events ON days.Year = events.Year AND days.Day = events.Day
Keith Williams