ansaurus

Question

Answer 1

+9 A:

The values in an IN() predicate are considered to be a set, and the result returned by an SQL query has no way to automatically infer order from that set.

In general, the order of any SQL query is arbitrary unless you specify an order with an ORDER BY clause.

You can use a MySQL function FIND_IN_SET() to do what you want:

SELECT * FROM foo f where f.id IN (2, 3, 1)
ORDER BY FIND_IN_SET(f.id, '2,3,1');

Note that the list argument to FIND_IN_SET() isn't a variable length list like the arguments of IN(). It has to be a string literal or a SET.

Re questions about performance: I'm curious too, so I tried both FIND_IN_SET() and FIELD() methods against my copy of the StackOverflow data:

With no index on VoteTypeId:

SELECT * FROM Votes ORDER BY FIND_IN_SET(VoteTypeId, '13,1,12,2,11,3,10,4,9,5,8,6,7');

3618992 rows in set (31.26 sec)
3618992 rows in set (29.67 sec)
3618992 rows in set (28.52 sec)

SELECT * FROM Votes ORDER BY FIELD(VoteTypeId, 13,1,12,2,11,3,10,4,9,5,8,6,7);

3618992 rows in set (37.30 sec)
3618992 rows in set (49.65 sec)
3618992 rows in set (41.69 sec)

With an index on VoteTypeId:

SELECT * FROM Votes ORDER BY FIND_IN_SET(VoteTypeId, '13,1,12,2,11,3,10,4,9,5,8,6,7');

3618992 rows in set (14.71 sec)
3618992 rows in set (14.81 sec)
3618992 rows in set (25.80 sec)

SELECT * FROM Votes ORDER BY FIELD(VoteTypeId, 13,1,12,2,11,3,10,4,9,5,8,6,7);

3618992 rows in set (19.03 sec)
3618992 rows in set (14.59 sec)
3618992 rows in set (14.43 sec)

Conclusion: with limited testing, there is no great advantage to either method.

Bill Karwin 2009-10-27 15:48:17

much nicer that CASE 'x' THEN 1, CASE 'y' THEN 2 for an arbitrary order column

dnagirl 2009-10-27 15:51:39

thanks for clarifying that one - i went for the other answer as FIELD with variable arguments seems more intuitive for me. but i'd also be interested in a performance comparison.

msparer 2009-10-27 16:21:34

yeah didn't think there'd be much if any difference, but thanks for the test results :)

Ty W 2009-10-27 17:32:55

interesting. so there isn't a great advantage if there's an index on VoteTypeId - without an index however FIND_IN_SET seems to be the better option ... thanks for the testing!

msparer 2009-10-27 17:34:33

I'll add the disclaimer that I ran these tests on a Macbook with only 2GB RAM, and perhaps without adequate memory cooldown in between tests. So my margin of error might be a bit high. :-)

Bill Karwin 2009-10-27 17:48:37

Answer 2

+8 A:

As the other answer mentions: the query you posted has nothing about what order you'd like your results, just which results you'd like to get.

To order your results, I would use ORDER BY FIELD():

SELECT * FROM foo f where f.id IN (2, 3, 1)
ORDER BY FIELD(f.id, 2, 3, 1);

The argument list to FIELD can be variable length.

Ty W 2009-10-27 15:53:45

Oh, yes, that works too. +1

Bill Karwin 2009-10-27 15:56:40

I'd be interested to see if either method has a performance advantage in the general case of ordering rows by ID. I've always used FIELD() but I've seen a few examples of code that uses the FIND_IN_SET() method you posted.

Ty W 2009-10-27 15:59:24

ansaurus

tags:

views:

answers:

Maintaining order in MySQL "IN" query

related questions