ansaurus

Question

New help with SQL Server select statement

Answer 1

+1 A:

For SQL Server 2005+, use:

SELECT TOP (@n) c.*
  FROM (
SELECT a.id,
       a.questionid,
       a.genreid
  FROM (SELECT t.*,
               ROW_NUMBER() OVER (PARTITION BY t.genreid) AS rank
          FROM TABLE t
         WHERE t.genreid NOT IN (SELECT TOP 1 --ensure only one genre, see order by
                                        t.genreid
                                   FROM TABLE t
                               GROUP BY t.genreid
                                 HAVING COUNT(*) = @y 
                               ORDER BY t.genreid) 
      ) a
 WHERE a.rank < @x
UNION ALL
SELECT b.id,
       b.questionid,
       b.genreid
  FROM TABLE b
 WHERE b.genreid IN (SELECT TOP 1 --ensure only one genre, see order by
                            t.genreid
                       FROM TABLE t
                   GROUP BY t.genreid
                     HAVING COUNT(*) = @y
                   ORDER BY t.genreid ) ) c

OMG Ponies 2010-06-05 21:31:52

I think @y is a different limit for a specific genere, it's not supposed to be used to find a genre. If it were, it could find more then one, contradicting the question

Andomar 2010-06-05 21:43:51

@Andomar: The OP states that the genreid that is excluded from the set has to have a row count of *y*. It's very likely there could be more than one genreid with *y* number of rows - TOP would handle that once we knew more about which of the duplicates would be selected.

OMG Ponies 2010-06-05 21:49:07

Thank you a lot :)

gio_333m 2010-06-06 12:21:49

Answer 2

A:

In SQL Server, you can do that with nested subqueries and top clauses:

select  top (@n) * 
from    (
        -- Retrieve @y rows from the special genre
        -- The prio field is used to ensure all these rows make it inside @n
        select  top (@y) 1 as prio, genreid, questionid
        from    @t
        where   genreid = @the_one

        -- And up to @x rows per non-special genre
        union all
        select  2 as prio, genreid, questionid
        from    (
                select  *
                ,       row_number() over (partition by genreid 
                                           order by newid()) as rownr
                from    @t
                where   genreid <> @the_one
                ) sub
        where rownr < @x
        ) sub2
order by
        prio, newid()

Sample data:

declare @t table (id int identity, QuestionId int, GenreId int)

insert @t (GenreId, QuestionId) values 
    (1,1),
    (2,1),(2,1),
    (3,1),(3,1),(3,1),
    (4,1),(4,1),(4,1),(4,1),
    (5,1),(5,1),(5,1),(5,1),(5,1)

declare @n int
declare @x int
declare @y int
declare @the_one int

set @n = 7 -- Total rows
set @x = 3 -- With less then 3 per genre
set @y = 3 -- Except three rows from genre @the_one
set @the_one = 3

Results in (one example, output differs on each run:

prio  genreid  questionid
1     3        1
1     3        3
1     3        2
2     4        1
2     1        1
2     5        1
2     5        4

Andomar 2010-06-05 21:33:07

Thank you a lot :)

gio_333m 2010-06-06 12:22:23

ansaurus

tags:

views:

answers:

New help with SQL Server select statement

related questions