ansaurus

Question

finding most common value across multiple tables

Answer 1

+1 A:

Try this...

select ip, count(*) 
from
(
select id, ip from bad_guys_1
union all
select id, ip from bad_guys_2
union all
select id, ip from bad_guys_3
union all
select id, ip from bad_guys_4
) a
group by ip
order by count(*) desc

Fosco 2010-08-19 14:07:17

you need UNION ALL not UNION, or repeated values in 2,3,and 4 tables will not be counted.

mdma 2010-08-19 14:12:23

updated.. though the likelihood of the ID and IP matching is small, you are correct.

Fosco 2010-08-19 14:20:02

Answer 2

+1 A:

Andy, You can use a "union" to create one big logical table (in memory) with just the IPs. Then you can do the normal

select count(ip), ip from 
(select ip from table1 union all select ip from table2 etc) unionedTable 
group by ip

[edited to add union all - thanks!]

Jeanne Boyarsky 2010-08-19 14:07:19

you need UNION ALL not UNION, or repeated values in different tables will not be counted.

mdma 2010-08-19 14:11:24

Fixed. Thanks mdma.

Jeanne Boyarsky 2010-08-19 15:30:56

Answer 3

+1 A:

       select ip, count(*) from
        (
        select id, ip from bad_guys_1
        union all
        select id, ip from bad_guys_2
        union all
        select id, ip from bad_guys_3
        union all
        select id, ip from bad_guys_4
        ) as ranking
        group by ip

order by count(*) desc

Yves M. 2010-08-19 14:07:43

you need UNION ALL not UNION, or repeated values in 2,3,and 4 tables will not be counted. (Assuming they also had the same id, which is possible.)

mdma 2010-08-19 14:11:52

Answer 4

+2 A:

 SELECT ip, count(*) c
 FROM 
 (
   SELECT ip
   from bad_guys_1 
   UNION ALL
   SELECT ip
   from bad_guys_2
   UNION ALL
   SELECT ip
   from bad_guys_3
   UNION ALL
   SELECT ip
   from bad_guys_4)
 group by ip
 order by 2 desc

Michael Pakhantsov 2010-08-19 14:08:28

Answer 5

+6 A:

Sorry to say, but the other answers using just union and not union all are wrong. If there is a selected row that appears in more than one table, it will only be counted in the first table if the other tables are included via union and not union all.

For those queries selecting both the ID and the address, the possibility of a row having the same ID and address in different tables still exists. Using UNION ALL ensures all values are unioned, whether they are duplicates or not - and we want the duplicates so they can be counted. Using UNION ALL is often less work for the database, since it does not need to find duplicates and remove them.

select ip, count(*) from
(
select ip from bad_guys_1
union ALL
select ip from bad_guys_2
union ALL
select ip from bad_guys_3
union ALL
select ip from bad_guys_4
) as ranking
group by ip
order by count(*) DESC

mdma 2010-08-19 14:10:29

yes, You're right. Running it with just union gives me a count of 1 for every result, but union all shows me correct total number of times each given ip shows up across all tables.

Andy 2010-08-19 14:25:44

ansaurus

tags:

views:

answers:

finding most common value across multiple tables

related questions