Here's an interesting problem.
I have an oracle database with name & address information which needs to be kept current.
We get data feeds from a number of different gov't sources, and need to figure out matches, and whether or not to update the db with the data, or if a new record needs to be created.
There isn't any sort of unique i...
If I have a large set of data that describes physical 'things', how could I go about measuring how well that data fits the 'things' that it is supposed to represent?
An example would be if I have a crate holding 12 widgets, and I know each widget weighs 1 lb, there should be some data quality 'check' making sure the case weighs 13 lbs m...
Hi all,
I need to build a system that generates reports on data exceptions (e.g. this value is stale because it hasn't been updated in x days). Once they have a daily report on data quality issues my users would like to have a bunch of filtering capability (with customizable public and private filters) and then be able to categorize and...
This question is related to Tools for matching name/address data. There is a number commercial tools provided by SAS, Oracle, Microsoft, etc., that allow to de-duplicate or merging names of individuals or companies coming from multiple sources.
However, after reading the answers to the question mentioned before, I wondered why a seeming...
A user recently inquired (OK, complained) as to why a 19-digit account number on our web site was broken up into 4 individual text boxes of length [5,5,5,4]. Not being the original designer, I couldn't answer the question, but I'd always it assumed that it was done in order to preserve data quality and possibly to provide a better user...