ansaurus

Question

SQL: Selecting parents and their children

Answer 1

+2 A:

For MySQL (not MSSQL) Something like this would work;

SELECT q.text, a.text
FROM questions q
LEFT JOIN answers a ON 
    q.quiestionid = q.id
GROUP BY q.id, a.id
ORDER BY q.id

Let me know if you have any q's.

Christian 2009-07-27 03:01:37

Why don't you think that'd work on MSSQL? You just wrote ANSI SQL. The syntax will work fine in SQL Server.

Eric 2009-07-27 03:05:37

afaik, an ANSI-conforming rdbms requires that columns on SELECT clause should also appear on GROUP BY clause. so q.text, and a.text needed be included on GROUP BY clause. christian's example won't work on PostgreSQL and MSSQL. above will only work on pg and mssql using this: GROUP BY q.id, a.id, q.text, a.text

Michael Buen 2009-07-27 03:32:58

but as the .id fields are primary keys and aren't going to be repeated, why not just drop the useless group by?!

Alex Martelli 2009-07-27 03:56:08

@Michael: You're right. I mentally skipped over them because they are absolutely useless in this case (as Alex points out).

Eric 2009-07-27 04:43:52

Answer 2

+1 A:

Why do you think a left join would be inefficient? You'll get some duplicate data on questions that were answered many times, but that's just a few extra bytes over the wire, nothing to worry about.

One answer that was given works fine (in any real sql engine including mssql, sqlite, etc, as well as mysql as it was proposed for) but is redundant (it has a group-by on primary keys that don't get duplicated anyway). So the following simpler and typo-fixed version is fine and fast:

SELECT q.id, q.text, a.id, a.text
  FROM questions q
  LEFT JOIN answers a ON a.questionid = q.id
 ORDER BY q.id

Your client code must simply notice when q.id changes in order to group and display things "hierarchically" as you desire -- how to do it depends on the client-side language you use, in Python for example you'd use itertools.groupby to do it very simply (you don't need the group by on the SQL server side, but you do need it client-side, whether with a language supplied facility like in Python or by implementing it yourself, to get the hierarchy displayed as you desire).

Alex Martelli 2009-07-27 03:02:14

Yeah I was thinking the inefficiency would be in the duplicate data - assuming there could be 20 to 100 answers in the worst case. Given the simplicity I'll try it anyway and defer further optimisation once I've tested the performance.

cbp 2009-07-27 03:15:05

ansaurus

tags:

views:

answers:

SQL: Selecting parents and their children

related questions