ansaurus

Question

Answer 1

+4 A:

I'd say just ignore any duplicates. That will give you fewer than ten results for each of your batches, but I'd prefer that to enforcing uniqueness via grouping in the database (which could be expensive). The objective here is to maximize message throughput, right?

If you still want to do it in SQL:

 select senderUserId, max(receiverUserId) from messages group by senderUserId

Thilo 2010-09-27 09:06:37
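
To see what the grouping query returns, here is a minimal sketch that runs it against an in-memory SQLite database from Python. The table layout and the sample rows are assumptions made up for illustration; only the column names come from the query above.

```python
import sqlite3

# Hypothetical sample data; the schema is assumed from the query in the answer.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE messages (senderUserId INTEGER, receiverUserId INTEGER)")
conn.executemany(
    "INSERT INTO messages VALUES (?, ?)",
    [(1, 10), (1, 20), (2, 10), (2, 30), (3, 5)],
)

# GROUP BY collapses each sender to one row; MAX keeps the highest receiver id.
rows = conn.execute(
    "SELECT senderUserId, MAX(receiverUserId) "
    "FROM messages GROUP BY senderUserId ORDER BY senderUserId"
).fetchall()

print(rows)  # [(1, 20), (2, 30), (3, 5)]
```

In this sample, receiver 10 got messages from two senders but never shows up in the result, because MAX always prefers the higher id within each group.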

thanks, does what it should do...

eugeneK 2010-09-27 09:09:51

your answer is better than mine :)

Alexander 2010-09-27 09:10:10

Note that it is "unfair" against receivers with low IDs, since max always picks the highest receiver ID for each sender...

Thilo 2010-09-27 09:11:09

@Thilo, this query runs on request only a few times a day, so I don't worry about I/O or memory usage that much.

eugeneK 2010-09-27 09:12:41

@Thilo, i'm a racist against low IDs

eugeneK 2010-09-27 09:13:09

Instead of max you can use the LIST function, which returns all receivers for each sender: select senderUserId, list(receiverUserId) from messages group by 1

Andrei K. 2010-09-27 10:50:05
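
LIST is not available in every database (Firebird has it, for example). As a rough illustration of the same idea, the sketch below uses SQLite's group_concat as a stand-in, which is a swapped-in function and not the one from the comment. The schema and sample rows are assumptions.

```python
import sqlite3

# Hypothetical sample data; group_concat stands in for LIST here.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE messages (senderUserId INTEGER, receiverUserId INTEGER)")
conn.executemany("INSERT INTO messages VALUES (?, ?)", [(1, 10), (1, 20), (2, 30)])

# One row per sender, with all receiver ids joined into a comma-separated string.
rows = conn.execute(
    "SELECT senderUserId, group_concat(receiverUserId) "
    "FROM messages GROUP BY senderUserId ORDER BY senderUserId"
).fetchall()

# Split and sort in Python, since the order inside the joined string is not guaranteed.
receivers = {sender: sorted(int(r) for r in joined.split(",")) for sender, joined in rows}

print(receivers)  # {1: [10, 20], 2: [30]}
```

Unlike the max variant, no receiver is dropped, but each sender now carries a list that the caller has to unpack.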

Answer 2

+5 A:

;WITH cte AS
(
    SELECT senderUserID,
           recieverUserID,
           ROW_NUMBER() OVER (PARTITION BY senderUserID ORDER BY recieverUserID) AS RN
    FROM YourTable
)
SELECT senderUserID, recieverUserID
FROM cte
WHERE RN = 1

Martin Smith 2010-09-27 09:08:43
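
The ROW_NUMBER approach can be tried outside SQL Server as well. The following sketch runs the same query shape from Python against in-memory SQLite, assuming SQLite 3.25 or newer (the first version with window functions). The sample rows are made up; the table and column names are copied from the answer as written.

```python
import sqlite3

# Hypothetical sample data; names follow the answer's query as written.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE YourTable (senderUserID INTEGER, recieverUserID INTEGER)")
conn.executemany(
    "INSERT INTO YourTable VALUES (?, ?)",
    [(1, 20), (1, 10), (2, 30), (2, 10), (3, 5)],
)

# Number the rows within each sender by receiver id, then keep only the first one.
query = """
WITH cte AS (
    SELECT senderUserID,
           recieverUserID,
           ROW_NUMBER() OVER (PARTITION BY senderUserID ORDER BY recieverUserID) AS RN
    FROM YourTable
)
SELECT senderUserID, recieverUserID
FROM cte
WHERE RN = 1
ORDER BY senderUserID
"""
first_per_sender = conn.execute(query).fetchall()

print(first_per_sender)  # [(1, 10), (2, 10), (3, 5)]
```

Because the window is ordered by recieverUserID ascending, this variant keeps the lowest receiver per sender, the opposite bias of the max query. Changing the ORDER BY inside OVER changes which row survives.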

Seems to work. If eugeneK wants to select only 10 records, he should change the last lines to: SELECT TOP 10 senderUserID, recieverUserID FROM cte WHERE RN=1

devmake 2010-09-27 09:16:52

Thanks @Martin Smith... Thilo's query runs faster on my dataset, even though your solution works just fine...

eugeneK 2010-09-27 09:22:23


How to select unique rows only in SQL?
