Three thoughts on duplicate records


Duplicate records are a reality in any database of any size, so as database managers we always have to deal with them (or at least we should!). Here are three thoughts on managing them:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Find and fix any process that tends to create duplicate records. For example, I often hear from my clients, "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
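
To make the "run a query" idea in point 3 a little more concrete, here's a minimal sketch in Python. The field names (id, first_name, last_name, email) and the matching rules are assumptions for illustration only, not the schema or query of any particular system. It flags potential duplicates; a person still has to review each group and decide which ones are actual duplicates.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase the text and keep only letters and digits."""
    return "".join(ch for ch in value.lower() if ch.isalnum())


def find_potential_duplicates(records):
    """Group record ids that share an email or a normalized full name.

    `records` is a list of dicts with hypothetical keys:
    id, first_name, last_name, email.
    """
    groups = defaultdict(set)
    for rec in records:
        email = rec.get("email", "").strip().lower()
        if email:
            groups[("email", email)].add(rec["id"])
        name = normalize(rec.get("first_name", "") + rec.get("last_name", ""))
        if name:
            groups[("name", name)].add(rec["id"])
    # Only keys shared by more than one record are potential duplicates.
    return {key: sorted(ids) for key, ids in groups.items() if len(ids) > 1}


# Made-up sample data for illustration.
records = [
    {"id": 1, "first_name": "Ann", "last_name": "O'Neil", "email": "ann@example.org"},
    {"id": 2, "first_name": "ann", "last_name": "oneil", "email": "ANN@example.org"},
    {"id": 3, "first_name": "Bob", "last_name": "Smith", "email": "bob@example.org"},
    {"id": 4, "first_name": "Bob", "last_name": "Smith", "email": ""},
    {"id": 5, "first_name": "Cara", "last_name": "Diaz", "email": "cara@example.org"},
]

potential = find_potential_duplicates(records)
for (field, value), ids in sorted(potential.items()):
    print(f"{field} = {value}: records {ids}")
```

Matching on exact email and a normalized name is deliberately loose: two different people named Bob Smith will be flagged together, which is why the output is a review list and not a merge list.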

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

