ansaurus

Question

Answer 1

A:

I'm guessing it's because all rows returned share the same value for O_ID. You can do a COUNT(*) or COUNT() on a key that is unique to each row to get the row count.

Håvard S 2010-02-03 13:55:46

All returned rows are unique because of the DISTINCT clause, proven by doing the query with COUNT().

GateKiller 2010-02-03 13:58:20

Can you provide a short excerpt of the results without the count, just showing a few O_ID?

Turnkey 2010-02-03 14:03:41

Well, COUNT(DISTINCT ...) will of course count unique non-NULL values, and that's just it. Don't do distinct, count(*) or count something that is unique across all your rows, WITHOUT a DISTINCT clause.

Håvard S 2010-02-03 14:06:18

Your right about how COUNT(DISTINCT ...) works and the result should be the same as the row count without the COUNT() function...

GateKiller 2010-02-03 14:35:08

Thanks for the confirm, just added as an answer.

Turnkey 2010-02-05 14:10:27

Answer 2

A:

Remove the DISTINCT and you'll get a count on all rows.

Turnkey 2010-02-03 13:59:34

True. But as you can see from the full query, there is a join involved so this would return duplicate ID's. And it doesn't answer the question of why COUNT() is returning 1 when it shouldn't.

GateKiller 2010-02-03 14:01:43

Yes, that is puzzling, thanks for posting the additional info. Did you run the exact same query to get the excerpt, just removing the count?

Turnkey 2010-02-03 14:08:37

Yeah, the excerpt is the exact same query without using the COUNT() function. Very puzzling indeed!

GateKiller 2010-02-03 14:33:31

I wonder if this could be data type related. I wonder if you could add a CAST in the COUNT clause to cast it to an INT type to see if that changes anything?

Turnkey 2010-02-03 14:56:27

@Turnkey: That totally worked "Count(Distinct Cast(O_ID as Int))" :D. Please can you submit that as an answer, I have some rep points for you.

GateKiller 2010-02-05 10:26:50

Answer 3

A:

Could you please run these queries:

SELECT  COUNT(DISTINCT O_ID)
FROM    vEmployers
INNER JOIN
        vEnrolment
ON      O_ID = E_EnrolmentEmployer
WHERE   E_START >= '01-AUG-2008' AND
        E_START < '01-AUG-2009'
        AND O_ID IN
        (
        SELECT  O_ID
        FROM    vEmployers
        INNER JOIN
                vEnrolment
        ON      O_ID = E_EnrolmentEmployer
        WHERE   E_Start < '01-AUG-2008'
                AND E_Start >= '01-AUG-2007'
        )

and

SELECT  DISTINCT TOP 5 O_ID
FROM    vEmployers
INNER JOIN
        vEnrolment
ON      O_ID = E_EnrolmentEmployer
WHERE   E_START >= '01-AUG-2008' AND
        E_START < '01-AUG-2009'
        AND O_ID IN
        (
        SELECT  O_ID
        FROM    vEmployers
        INNER JOIN
                vEnrolment
        ON      O_ID = E_EnrolmentEmployer
        WHERE   E_Start < '01-AUG-2008'
                AND E_Start >= '01-AUG-2007'
        )
ORDER BY
        O_ID

verbatim, without changing anything?

Quassnoi 2010-02-03 14:24:00

The first query returns one row with the value "1".The second query returns five rows of unique values.

GateKiller 2010-02-03 14:32:14

@GateKiller: could you please post the structure of the tables?

Quassnoi 2010-02-03 14:39:22

What information are you interested in? I'm not sure I would be allowed to post the full table schema + each table has ALOT of columns.

GateKiller 2010-02-03 14:47:06

@GateKiller: Just post the relevant columns: `O_ID`, `E_START`, `E_EnrolmentEmployer`: their datatypes, indexes if any and which tables they belong to. Also, it would be nice to see the execution plans for each query. Just run `SET SHOWPLAN_TEXT ON \n GO \n SELECT …` (`\n` is a newline)

Quassnoi 2010-02-03 14:50:11

Answer 4

A:

SELECT   
  COUNT(*)  
FROM    vEmployers  
INNER JOIN vEnrolment ON O_ID = E_EnrolmentEmployer
WHERE 
        E_START >= '01-AUG-2008' 
        AND E_START < '01-AUG-2009'
        AND O_ID IS NOT NULL AND O_ID IN (
          SELECT O_ID FROM vEmployers
          INNER JOIN vEnrolment ON O_ID = E_EnrolmentEmployer
          WHERE E_Start < '01-AUG-2008' and E_Start >= '01-AUG-2007'
        )
GROUP BY
  O_Id

Benoit Vidis 2010-02-03 16:27:53

Answer 5

+2 A:

The use of the numeric(16, 0) made me suspect that it was data type related. Add a CAST in the COUNT clause to cast it to an INT type:

Count(Distinct Cast(O_ID as Int))

Turnkey 2010-02-05 14:09:37

Thankyou Turnkey :)

GateKiller 2010-02-08 16:32:12

ansaurus

tags:

views:

answers:

Select Count(Distinct Value) returns 1

related questions