ansaurus

Question

Grouping fields that partially match in MySQL

Answer 1

A:

Something like this might work for you:

SELECT *
FROM members m1
inner join members m2 on m1.id <> m2.id
    and (
        m1.email = m2.email
        or m1.email like '%,' + m2.email
        or m1.email like m2.email + ',%'
        or m1.email like '%,' + m2.email + ',%'
    )

It depends on how consistently your email addresses are formatted when there are more than one. You might need to modify the query slightly if there is always a space after the comma, e.g., or if the quotes are actually part of your data.

RedFilter 2010-01-21 21:16:22

Thanks for the answer. Unfortunately the INNER JOIN of our members table is 94 million records and the query takes too long, which is why I was shying away from joins of this nature. I think that if I separated the email addresses out into their own table like they SHOULD be, I can accomplish what I want more easily.

Mitch Weaver 2010-01-22 15:53:22

Answer 2

A:

This works for me; may not do what you want:

SELECT MAX(ID) FROM members WHERE Email like "%someuser%" GROUP BY Email HAVING COUNT(Email) > 1

Nat 2010-01-21 21:28:08

This works great as long as you can guarantee your email field contains only one email. Our may contain multiples separated by commas, and I'm trying to group partial matches, which doesn't appear to be feasible as our schema exists now.

Mitch Weaver 2010-01-22 15:56:41

ansaurus

tags:

views:

answers:

Grouping fields that partially match in MySQL

related questions