ansaurus

Question

SQL primer: Why and how to use join statements?

Answer 1

+1 A:

That's a typical case for an inner join, such as:

select users.name, group.name from groups 
inner join group_users on groups.id = group_users.group_id
inner join users on group_users.user_id = users.id
where user_id = 47

Otávio Décio 2009-11-23 19:17:54

Answer 2

+2 A:

SELECT gu.id, gu.group_id, g.name, gu.user_id
FROM group_users gu
INNER JOIN Group g 
 ON gu.group_id = g.group_id
     WHERE user_id = 47

HLGEM 2009-11-23 19:19:05

Answer 3

+3 A:

I always find these charts to be very useful when doing joins:

http://www.codinghorror.com/blog/archives/000976.html

Carra 2009-11-23 19:20:43

Thanks! I really liked that post. Great for visual learners like me.

Andrew 2009-11-23 20:01:39

And it has ninjas *and* pirates!

Carra 2009-11-23 21:24:12

Answer 4

+7 A:

In general, you want to reduce the total number of queries to the database, by making each query do more. There are many reasons why this is a good thing, but the main one is that relational database management systems are specifically designed to be able to join tables quickly and are better at it than the equivalent code in some other language. Another reason is that it's usually more expensive to open many little queries than it is to run one large query that has everything you'll end up needing.

You want to take advantage of your RDBMS's strengths, so you should try to push data access into it in a few big queries rather than lots of little queries.

Now, that's just a general rule of thumb. There are cases when it's better to do some things outside of the database. It's important that you determine which is the right case for your situation by looking into bottlenecks if and only if they occur. Don't spend time worrying about performance until you find a performance problem.

But, in general, it's better to handle joins, lookups and all other query-related tasks in the database itself than it is to try to handle it in a general-purpose language.

That said, the kind of join you want is an inner join. You'd structure your join query like this:

SELECT groups.name, group_users.user_id
FROM group_users
INNER JOIN groups
  ON group_users.group_id = groups.group_id
WHERE groups.user_id = 47;

Welbog 2009-11-23 19:20:50

What happens to the result set after it has been joined? Are columns added? Or are the id values replaced with the name values?

Andrew 2009-11-23 20:04:14

I'm not entirely sure what you're asking. Columns are only added if you tell the server to add them. IDs aren't replaced with names. The values in the database remain unchanged during `SELECT` queries. Things can only be replaced if you're using an `UPDATE` statement. What will happen in this specific example query is that your result set will contain the names of all the groups associated with the user whose id is 47. There's no addition, replacing or anything sneaky going on. Just a regular query returning a result set.

Welbog 2009-11-23 20:33:32

Nevermind, I found out the answer to my question. The select statement determines which columns are returned, so instead of saying `select * from...` I could narrow it down and only return the group name and id by saying `select groups.name, groups.id from...`

Andrew 2009-11-23 23:03:54

ansaurus

tags:

views:

answers:

SQL primer: Why and how to use join statements?

related questions