ansaurus

Question

Checking a data conforms to a list of rules

Answer 1

A:

Your best bet might be to create a maintenance plan in SQL Server, with one step in the plan for each rule. Each rule would check the data and insert into an exception table if it found any nonconforming rules. This would allow for you to leverage the tools provided by SQL Server and maintain fairly easy maintenance of the rules themselves (adding, removing, and reordering).

Adam Robinson 2009-04-06 17:04:32

Answer 2

+1 A:

Create a script of SQL statements, with one statement being equal to a single rule. In your example, your statement might be:

INSERT INTO EXCEPTION 
   (RULE_NAME, DETAIL) 
VALUES 
("CASH_LEVEL_LOW", SELECT CUSTOMER_ID FROM CUSTOMER WHERE CUSTOMER.CASH < 50);

I'm not up-to-date on the syntax, but you should be able to get the gist of the idea from here. It would insert into another table one record per violation, with sufficient data so that you could locate the record easily.

Elie 2009-04-06 17:15:53

I have had similar thoughts. There are two things that concern me about this approach, A: it doesn't seem very easy to maintain. Say for example I want to quickly change the percentage, and we've got 200 rules. B: It worries me having too much business logic in the database.

MrEdmundo 2009-04-06 17:18:36

First, the logic is not in the database, but in a script file that runs on the database, which makes is easier to change. You can name your statements such that you would be able to find them later easily by just searching for the name.

Elie 2009-04-06 17:23:26

I appreciate what you're saying. When I imagine the scenario I'm describing I imagine laying a blueprint over a load of data and looking where the data is outside the lines. I'm not sure how neat that could be using scripts like we're talking about

MrEdmundo 2009-04-07 07:42:15

The scripts would be your blueprints... but you wouldn't need to keep them inside the database per se. I see your point, though.

Elie 2009-04-07 12:21:02

Answer 3

A:

I would quite like the system to be dynamic, i.e. it's quite easy to define new rules.

You already have such a system; it is your database. In particular, check constraints serve to prevent invalid data from being entered at all.

For a case like your example -- where you want to allow the value but flag it -- write a view, and have a client application issue an error if the view has any rows.

Here's an example: create view low_on_cash as select * from table where "Cash%" < 50 ;

In the client, you'd raise an error if "select count(*) from low_on_cash" didn't return 0;

If you established a convention that all such views were named with a prefix, e.g., "error_report", your client could select all such view names from the systables for the database in one query, then iterate that list by calling "select count(*) from " + viewname; logging an error for any that returned more than zero rows.

Since this would be data-driven, adding a new error report would consist of nothing more than creating a view with the proper name prefix; you'd not have to recompile the client.

The additional advantage is that adding any rule engine would require learning its Domain Specific Language for writing rules, training new staff on it, and even then inevitably there would be corner cases the rules didn't easily cover. Your coders already know SQL, and it's based on a 17 year old ANSI Standard based on 20 earler years of use, so most of the corner cases have been ironed out of the language.

tpdi 2009-04-06 17:23:30

That could potentially result in hundreds of views, and no central grouping of all entries across all views. It would work, but try generating a report of all rule violations.

Elie 2009-04-06 17:25:13

That is my point Elie, tpdi I don't disagree that you're idea would work, I just think it would be clunky and difficult to manage. Imagine 200 views. Ed

MrEdmundo 2009-04-06 17:27:01

Answer 4

A:

Personally I don't allow data to be entered in my database that doesn't follow the business rules. That is what check constraints, unique indexes, and triggers are for.

HLGEM 2009-04-06 17:32:27

Good call! Anytime I hear about people using scheduled jobs in the DB to manipulate or validate data it usually points to someone barking up the wrong tree.

JohnFx 2009-04-06 17:40:56

Who says it's invalid data? Maybe it's just a flag for data that may soon become invalid (approaching the invalid condition).

Elie 2009-04-06 17:50:12

And who says it isn't invalid data? The first step is to make sure not to put bad data into your database ever. If you want to check if something is approaching a limit that has nothing to do with data validation but is just like any other query or proc you want to run.

HLGEM 2009-04-06 18:38:24

I believe that what the question is about: looking for a way to report on certain bits of data which is not technically invalid, and hence can go into the database, but needs to be reviewed (e.g. if less than 50% cash then send a reminder). I agree that invalid data doesn't belong in the DB.

Elie 2009-04-06 18:47:36

In my mind HLGEM, validation of a data against a rule doesn't mean that the data been checked is completely invalid, it is only invalid when checked with the particular rule. It may be that the rule is the rule for the database, hence it's existence is invalid, this is not the case for my scenario.

MrEdmundo 2009-04-07 07:49:58

Answer 5

+2 A:

As it isn't mentioned yet, I'd suggest you'd take a look at the data mining capabilities of SQLServer. One of them is the abbility to highlight exceptions in your data.

Take a look at this Highlight Exceptions Video Tutorial to get you started.

Lieven 2009-04-06 17:38:18

I'm going to have a look at this when I get chance, a quick scan read of the manuscript suggests it might be a possible solution.

MrEdmundo 2009-04-07 07:38:02

ansaurus

tags:

views:

answers:

Checking a data conforms to a list of rules

related questions