ansaurus

Question

Answer 1

+2 A:

Aggregate functions ignore NULL (except for COUNT(*)) so if you can convert outliers to NULL in your expression, that'd help.

AVG( CASE WHEN Start.EventTime = Finish.EventTime THEN NULL
     ELSE DATEDIFF(SECOND, Start.EventTime, Finish.EventTime) 
     END CASE )

Bill Karwin 2009-06-09 23:16:21

Note for any casual observers: Count(field_name) will ignore NULL.

Eric 2009-06-09 23:18:43

Answer 2

A:

Without having parsed your query in detail, my first idea is:

do your query into an table variable (or temptable)
remove outliers from the table using whatever metric you use use to define outliers
this metric might just be removing all values below or above a fixed threshold
and/or first calculating mean and stdev and then removing all entries more than x stdev away from mean
then do further analysis on the cleaned temptable

Ben Schwehn 2009-06-09 23:19:11

Taking an average in SQL after throwing away outliers