ansaurus

Question

In SQL, how can you "group by" in ranges?

Answer 1

+4 A:

create table scores (
   user_id int,
   score int
)

select t.range as [score range], count(*) as [number of occurences]
from (
      select user_id,
         case when score >= 0 and score < 10 then '0-9'
         case when score >= 10 and score < 20 then '10-19'
         ...
         else '90-99' as range
     from scores) t
group by t.range

tvanfosson 2008-10-24 03:32:37

Thanks! I tried this and the basic idea works great, although the syntax that I had to use is slightly different. Only the first "case" keyword is needed and then after the last condition, before the "as range" you need the keyword "end". Other than that, worked great- thanks!

Hugh 2008-10-24 04:05:49

Answer 2

+2 A:

select cast(score/10 as varchar) + '-' + cast(score/10+9 as varchar), 
       count(*)
from scores
group by score/10

James Curran 2008-10-24 03:32:48

I like this, but you have to fix up the ranges outside the query if you're going to display it.

tvanfosson 2008-10-24 03:35:01

In case you decide to fix your answer you need to change your score/10 on the first line to be (score/10)*10 for both of them otherwise you get 3 - 12 instead of 30-39 etc. As per my post below you could add an order by to get the results in the right order.

Timothy Walters 2008-10-24 05:57:05

Answer 3

+3 A:

In postgres (where || is the string concatenation operator):

select (score/10)*10 || '-' || (score/10)*10+9 as scorerange, count(*)
from scores
group by score/10
order by 1

gives:

 scorerange | count 
------------+-------
 0-9        |    11
 10-19      |    14
 20-29      |     3
 30-39      |     2

mhawke 2008-10-24 03:41:32

Answer 4

A:

Perhaps you're asking about keeping such things going...

Of course you'll invoke a full table scan for the queries and if the table containing the scores that need to be tallied (aggregations) is large you might want a better performing solution, you can create a secondary table and use rules, such as on insert - you might look into it.

Not all RDBMS engines have rules, though!

Richard T 2008-10-24 03:49:49

Answer 5

A:

declare @RangeWidth int

set @RangeWidth = 10

select
   Floor(Score/@RangeWidth) as LowerBound,
   Floor(Score/@RangeWidth)+@RangeWidth as UpperBound,
   Count(*)
From
   ScoreTable
group by
   Floor(Score/@RangeWidth)

Aheho 2008-10-24 03:58:11

Answer 6

+15 A:

I see answers here that won't work in SQL Server's syntax. I would use:

select t.range as [score range], count(*) as [number of occurences]
from (
  select case 
    when score between  0 and  9 then ' 0-9 '
    when score between 10 and 19 then '10-19'
    when score between 20 and 29 then '20-29'
    ...
    else '90-99' end as range
  from scores) t
group by t.range

EDIT: see comments

Ken Paul 2008-10-24 04:05:56

It is possibly because of the version of SQLServer I am using but to get your example to work (I test things before I vote them up) I had to move 'score' from after the 'case' to after each 'when'.

Ron Tuffin 2008-10-24 11:50:17

You're right, and thanks for the correction. Apparently when you put the variable after the keyword 'case', you can only do exact matches, not expressions. I learn as much from answering questions as from asking them. :-)

Ken Paul 2008-10-28 23:19:04

Answer 7

+3 A:

James Curran's answer was the most concise in my opinion, but the output wasn't correct. For SQL Server the simplest statement is as follows:

SELECT 
    [score range] = CAST((Score/10)*10 AS VARCHAR) + ' - ' + CAST((Score/10)*10+9 AS VARCHAR), 
    [number of occurrences] = COUNT(*)
FROM #Scores
GROUP BY Score/10
ORDER BY Score/10

This assumes a #Scores temporary table I used to test it, I just populated 100 rows with random number between 0 and 99.

Timothy Walters 2008-10-24 05:54:45

Ah... There's the advantage of actually taking the time to create the table. (I used an existing table with too few rows over too small a range)

James Curran 2008-10-24 13:47:46

Answer 8

+16 A:

Neither of the highest voted answers are correct on SQLServer 2000. Perhaps they were using a different version.

Here are the correct versions of both of them on SQLServer 2000.

select t.range as [score range], count(*) as [number of occurences]
from (
  select case  
    when score between 0 and 9 then ' 0- 9'
    when score between 10 and 19 then '10-19'
    else '20-99' end as range
  from scores) t
group by t.range

or

select t.range as [score range], count(*) as [number of occurences]
from (
      select user_id,
         case when score >= 0 and score< 10 then '0-9'
         when score >= 10 and score< 20 then '10-19'
         else '20-99' end as range
     from scores) t
group by t.range

Ron Tuffin 2008-10-24 12:01:46

Answer 9

+1 A:

An alternative approach would involve storing the ranges in a table, instead of embedding them in the query. You would end up with a table, call it Ranges, that looks like this:

LowerLimit   UpperLimit   Range 
0              9          '0-9'
10            19          '10-19'
20            29          '20-29'
30            39          '30-39'

And a query that looks like this:

Select
   Range as [Score Range],
   Count(*) as [Number of Occurences]
from
   Ranges r inner join Scores s on s.Score between r.LowerLimit and r.UpperLimit
group by Range

This does mean setting up a table, but it would be easy to maintain when the desired ranges change. No code changes necessary!

Walter Mitty 2008-10-25 12:20:44

ansaurus

tags:

views:

answers:

In SQL, how can you "group by" in ranges?

related questions