data-quality

Tools for matching name/address data

Here's an interesting problem. I have an Oracle database with name & address information which needs to be kept current. We get data feeds from a number of different gov't sources, and need to figure out matches, and whether to update the db with the data or create a new record. There isn't any sort of unique i...

What are techniques and practices on measuring data quality?

If I have a large set of data that describes physical 'things', how could I go about measuring how well that data fits the 'things' that it is supposed to represent? An example would be if I have a crate holding 12 widgets, and I know each widget weighs 1 lb, there should be some data quality 'check' making sure the case weighs 13 lbs m...

Are there .NET development tools out there that can help serve as a data issue reporter and tracker?

Hi all, I need to build a system that generates reports on data exceptions (e.g. this value is stale because it hasn't been updated in x days). Once they have a daily report on data quality issues my users would like to have a bunch of filtering capability (with customizable public and private filters) and then be able to categorize and...

Are there free, low cost, or open source tools for matching name/address data?

This question is related to Tools for matching name/address data. There are a number of commercial tools provided by SAS, Oracle, Microsoft, etc., that allow de-duplicating or merging names of individuals or companies coming from multiple sources. However, after reading the answers to the question mentioned before, I wondered why a seeming...

Best usability practice for accepting long-ish account numbers.

A user recently inquired (OK, complained) as to why a 19-digit account number on our web site was broken up into 4 individual text boxes of length [5,5,5,4]. Not being the original designer, I couldn't answer the question, but I'd always assumed that it was done in order to preserve data quality and possibly to provide a better user...