views:

27

answers:

3

I have the following table structure

+  id  +  word  +
+------+--------+

The table gets filled with the words in lower cas of a given text, so the text

Hello bye hello

would result in

+  id  +  word  +
+------+--------+
+   1  + hello  +
+------+--------+
+   2  + bye    +
+------+--------+
+   3  + hello  +
+------+--------+

I want to make a SELECT query that will return the number of words that get repeated at least two times in the table (like hello)

SELECT COUNT(id) FROM words WHERE (SELECT COUNT(words.word))>1

which of course is so wrong and super overloading when table is big. Any idea on how to achieve such purpose? In the given example inhere-above, I would expect 1

+5  A: 

To get a list of the words that appear more than once together with how often they occur, use a combination of Use GROUP BY and HAVING:

SELECT words, COUNT(*) AS cnt
FROM words
GROUP BY word
HAVING COUNT(*) > 1

To find the number of words in the above result set, use that as a subquery and count the rows in an outer query:

SELECT COUNT(*)
FROM
(
    SELECT NULL
    FROM words
    GROUP BY word
    HAVING COUNT(*) > 1
) T1
Mark Byers
+1. You are so fast.
a1ex07
you the boss! Thanks dude!
fabjoa
+1  A: 

SELECT count(word) as count FROM words GROUP BY word HAVING count >= 2;

bot403
+1  A: 

SELECT word, COUNT(*) FROM words GROUP by word HAVING COUNT(*) > 1

a1ex07