I want to efficiently check if a table contains any rows that match <condition A> and do not match <condition B>, where the conditions are arbitrary.
In Oracle, this almost works:
select count(*) from dual
where exists (
  select * from people
  where (<condition A>)
  and not (<condition B>)
);
-- returns zero if all rows that match <condition A> also match <condition B>
-- (well, almost)
The problem is the dreaded null values. Let's say <condition A> is name = 'Aaron' and <condition B> is age = 21. The query will correctly identify any Aarons whose age is not equal to 21, but it fails to identify any Aarons whose age is null.
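The culprit is SQL's three-valued logic: when age is null, not (age = 21) evaluates to unknown rather than true, so the where clause rejects the row. A quick demonstration (assuming the people table has name and age columns and contains a hypothetical row ('Aaron', null)):

select * from people
where name = 'Aaron'
and not (age = 21);
-- the ('Aaron', null) row is never returned, because
-- not (null = 21) is unknown, not true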
Here is a correct solution, but on a table with millions of records it can take a while:
select (
  select count(*) from people
  where (<condition A>)
) - (
  select count(*) from people
  where (<condition A>)
  and (<condition B>)
) from dual;
-- returns zero if all rows that match <condition A> also match <condition B>
-- (correct, but it is s l o w...)
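As an aside, the two counts can at least be folded into a single pass with a conditional aggregate. This is only a sketch, and it still has to visit every row matching <condition A>, so it doesn't fix the underlying cost:

select count(case when (<condition B>) then null else 1 end)
from people
where (<condition A>);
-- returns zero if all rows that match <condition A> also match <condition B>
-- (null-safe: rows where <condition B> is unknown are counted as mismatches,
--  since count() only counts the non-null results of the case expression)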
Unfortunately the two conditions will be arbitrary, complex, changing, and generally out of my control. They are generated by the application's persistence framework from user searches, and while we try to keep our indexes in step with our users, a lot of the time the conditions will cause big table scans. That is why the first query with the exists clause is so much faster than the second: it can stop as soon as it finds one matching record, and it doesn't have to do two separate scans.
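One direction might be to force <condition B> to a definite true or false with a case expression, which keeps the short-circuiting exists. This is just a sketch, and I don't know how well the optimizer copes with it:

select count(*) from dual
where exists (
  select * from people
  where (<condition A>)
  and case when (<condition B>) then 1 else 0 end = 0
);
-- returns zero if all rows that match <condition A> also match <condition B>
-- (the case expression maps unknown to 0, so nulls count as mismatches,
--  and the exists can still stop at the first counter-example)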
How can I do this efficiently without futzing up on the nulls?