ansaurus

Question

How do I GROUP BY on every given increment of a field value?

Answer 1

+1 A:

Create a table listing all weeks since the epoch, and JOIN it to your table of events.

CREATE TABLE Weeks (
  week INTEGER PRIMARY KEY
);

INSERT INTO Weeks (week) VALUES (200919); -- e.g. this week

SELECT w.week, e.org, COUNT(*)
FROM Events e JOIN Weeks w ON (w.week = strftime('%Y%W', e.time))
GROUP BY w.week, e.org;

There are only 52-53 weeks per year. Even if you populate the Weeks table for 100 years, that's still a small table.

Bill Karwin 2009-05-13 18:30:03

Answer 2

+1 A:

To do this in a set-based manner (which is what SQL is good at) you will need a set-based representation of your time increments. That can be a temporary table, a permanent table, or a derived table (i.e. subquery). I'm not too familiar with SQLite and it's been awhile since I've worked with UNIX. Timestamps in UNIX are just # seconds since some set date/time? Using a standard Calendar table (which is useful to have in a database)...

SELECT
     C1.start_time,
     C2.end_time,
     T.org,
     COUNT(time)
FROM
     Calendar C1
INNER JOIN Calendar C2 ON
     C2.start_time = DATEADD(dy, 6, C1.start_time)
INNER JOIN My_Table T ON
     T.time BETWEEN C1.start_time AND C2.end_time  -- You'll need to convert to timestamp here
WHERE
     DATEPART(dw, C1.start_time) = 1 AND    -- Basically, only get dates that are a Sunday or whatever other day starts your intervals
     C1.start_time BETWEEN @start_range_date AND @end_range_date  -- Period for which you're running the report
GROUP BY
     C1.start_time,
     C2.end_time,
     T.org

The Calendar table can take whatever form you want, so you could use UNIX timestamps in it for the start_time and end_time. You just pre-populate it with all of the dates in any conceivable range that you might want to use. Even going from 1900-01-01 to 9999-12-31 won't be a terribly large table. It can come in handy for a lot of reporting type queries.

Finally, this code is T-SQL, so you'll probably need to convert the DATEPART and DATEADD to whatever the equivalent is in SQLite.

Tom H. 2009-05-13 18:36:16

Answer 3

+1 A:

Not being familiar with SQLite I think this approach should work for most databases, as it finds the weeknumber and subtracts the offset

SELECT org, ROUND(time/604800) - week_offset, COUNT(*)
FROM table
GROUP BY org, ROUND(time/604800) - week_offset

In Oracle I would use the following if time was a date column:

SELECT org, TO_CHAR(time, 'YYYY-IW'), COUNT(*)
FROM table
GROUP BY org, TO_CHAR(time, 'YYYY-IW')

SQLite probably has similar functionality that allows this kind of SELECT which is easier on the eye.

stili 2009-05-13 20:14:00

yes, you can use the strftime function to format a date-time string (and the unixepoch modifier if that's what you start with), as well as the simpler approach based on truncating the unix-epoch number.

Alex Martelli 2009-05-13 21:23:32

Thanks! this is the solution; TO_CHAR doesn't exist in sqlite, but strftime functions do and I was able to sort it this way.

2009-05-14 17:00:54

ansaurus

tags:

views:

answers:

How do I GROUP BY on every given increment of a field value?

related questions