Ratio of Data to Errors

One of the elements of a good data governance plan is establishing data quality metrics. Put another way, what are your measurements for how good your data really is?

One of the simplest but perhaps most powerful metrics is the ratio of data to errors (or, put differently, what percentage of your data is correct). Simply put, you count the total number of entries in a data set and compare that to the number of errors in it. For example, a committee list of 24 names and emails that has two errors on it would have a ratio of 24:2, or 12:1 (about 92% accuracy, if you prefer percentages).
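For those who would rather let a script do the arithmetic, here is a minimal sketch of the calculation in Python. The function name and its parameters are my own illustration, not part of any particular system, and it assumes each error falls on a separate entry, so that 2 errors in 24 entries leaves 22 correct ones.

```python
def data_to_error_ratio(total_records, error_count):
    """Return the data-to-errors ratio as text and the accuracy as a percentage.

    Assumes each error affects a different record (an illustrative
    simplification), so correct records = total_records - error_count.
    """
    if total_records <= 0:
        raise ValueError("total_records must be a positive number")
    if not 0 <= error_count <= total_records:
        raise ValueError("error_count must be between 0 and total_records")

    ratio = f"{total_records}:{error_count}"
    accuracy = (total_records - error_count) / total_records * 100
    return ratio, accuracy


# The committee list example: 24 names and emails, two errors.
ratio, accuracy = data_to_error_ratio(24, 2)
print(f"Ratio {ratio}, about {accuracy:.0f}% accurate")
```

Run the same calculation on a regular schedule and you have a trend line for data quality, not just an impression of it.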

I like this simple formula because it gives you an objective measure of data accuracy. Too often I hear clients say, "Our data is garbage," but they can't really quantify what "garbage" means or what data that is "not garbage" looks like.

There is a tendency to believe that data should be perfect. This is impossible, of course, as I've written many times over the years. But using a ratio of data to errors can help you quantify how good or bad your data is, and also help you set a measurable target for how good your data should be.
