ansaurus

Question

Answer 1

+3 A:

Assuming your server has GROUP_CONCAT because you didn't mention which one you're using:

SELECT GROUP_CONCAT(first_name SEPARATOR ' ')
FROM table
GROUP BY last_name
HAVING COUNT(first_name) > 1

Matti Virkkunen 2010-09-22 21:24:47

I like this a lot! For now, though, `GROUP_CONCAT` is only implemented in MySQL. There's an Oracle version at the following link, and the same blog has versions for PostGreSQL and SQL Server, too. http://explainextended.com/2009/04/05/group_concat-in-oracle-10g/

eksortso 2010-09-22 21:48:52

@eksortso: SQLite also supports the GROUP_CONCAT function.

OMG Ponies 2010-09-22 21:57:10

Answer 2

+1 A:

Select a.* from persons a inner join persons b on (a.personID<>b.PersonID and a.last_name=b.last_name)

PersonID is your table's primary key.

2010-09-22 21:24:56

Answer 3

+2 A:

I do not know if this is what you are asking for, but I think what you are looking for is

SELECT * FROM users u1, users u2 
WHERE (u1.last_name = u2.last_name AND COUNT(u1.last_name) > 1))

Java Dude 2010-09-22 21:28:53

That would get you a lot of duplicates if you have a very popular last name.

eksortso 2010-09-22 21:39:07

I've just answered the question... If you would like to check the other fields as well, then you'll just have to add them in the WHERE clause with the DISTINCT option.

Java Dude 2010-09-22 21:47:13

Can't use an aggregate (IE: COUNT) in the WHERE clause, outside of a subquery. After you fixed that error, `SELECT *` would return rows from both copies of the `USERS` table

OMG Ponies 2010-09-22 21:56:11

Answer 4

+6 A:

This should work on pretty much every SQL dialect:

SELECT last_name, first_name FROM names
WHERE last_name IN (
    SELECT last_name FROM names GROUP BY last_name HAVING COUNT(*) > 1
)

It will give you a result set like this, though:

Smith     Jack
Smith     Joe
Smith     Anna
Sixpack   Joe
Sixpack   Eve

The most elegant solution for displaying this in the desired format IMO is to just re-arrange the result set programmatically in the client application instead of pulling all sorts of obscure SQL stunts; something like (pseudocode):

for each row in resultset
   if row[last_name] <> previous_last_name
      print newline, print last_name
   print ' '
   print first_name

tdammers 2010-09-22 21:29:27

Answer 5

+1 A:

I tried to devise a solution that would work in most ANSI-compliant SQL database servers. Here's what I came up with.

The idea here is that you identify the duplicated last_names, then pull all the records that have one of those duplicates.

SELECT
   t.last_name, t.frst_name, t.frst_name1, t.frst_name2, ...
FROM our_table AS t
WHERE t.last_name IN (
   SELECT t0.last_name
   FROM our_table AS t0
   GROUP BY t0.last_name
   HAVING COUNT(*) > 1
)
ORDER BY
    t.last_name, t.frst_name, t.frst_name1, t.frst_name2, ...
;

eksortso 2010-09-22 21:37:05

Answer 6

+1 A:

Ha, lots of queries. Here is more

SELECT last_name, first_name FROM names n1
WHERE 
(
    SELECT count(*) FROM names n2 where n2.last_name = n1.last_name
) 
> 1

or if table has unique identifier

SELECT last_name, first_name FROM names n1
WHERE exists
(
    SELECT id FROM names n2 where n2.last_name = n1.last_name and n1.id <> n2.id
)

Danil 2010-09-22 22:27:52

I like this solution. Simple and works great.

rboarman 2010-10-22 21:24:59

ansaurus

tags:

views:

answers:

Find duplicate rows in database.

related questions