ansaurus

Question

SQL Server: Difference between PARTITION BY and GROUP BY

Answer 1

+2 A:

partition by doesn't actually roll up the data. It allows you to reset something on a per group basis. For example, you can get an ordinal column within a group by partitioning on the grouping field and using rownum() over the rows within that group. This gives you something that behaves a bit like an identity column that resets at the beginning of each group.

ConcernedOfTunbridgeWells 2010-03-08 20:41:53

Answer 2

+3 A:

They're used in different places. group by modifies the entire query, like:

select customerId, count(*) as orderCount
from Orders
group by customerId

But partition by just works on a window function, like row_number:

select row_number() over (order by orderId, partition by customerId)
    as OrderNumberForThisCustomer
from Orders

A group by normally reduces the number of rows returned by rolling them up and calculating averages or sums for each row. partition by does not affect the number of rows returned, but it changes how a window function's result is calculated.

Andomar 2010-03-08 20:43:25

OK, thanks, that cleared it up!

Mike Mooney 2010-03-08 20:49:14

Answer 3

A:

PARTITION BY Divides the result set into partitions. The window function is applied to each partition separately and computation restarts for each partition.

Found at this link: OVER Clause

Will Marcouiller 2010-03-08 20:44:44

Answer 4

+1 A:

It provides rolled-up data without rolling up

i.e. Suppose I want to return the relative position of sales region

Using PARTITION BY, I can return the sales amount for a given region and the MAX amount across all sales regions in the same row.

This does mean you will have repeating data, but it may suit the end consumer in the sense that data has been aggregated but no data has been lost - as would be the case with GROUP BY.

adolf garlic 2010-03-09 16:02:06

ansaurus

tags:

views:

answers:

SQL Server: Difference between PARTITION BY and GROUP BY

related questions