Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

Action produces information

April 8, 2026

Action produces information “Action produces information. If you’re unsure of what to do, just do […]

Why a single source of truth is so beneficial…

March 25, 2026

Why a single source of truth is so beneficial… The holy grail of data management is […]

Just because you can, doesn’t mean you should

March 18, 2026

Just because you can, doesn’t mean you should I’ve worked in and with associations for well […]

Data doesn’t need to be perfect to be useful

March 4, 2026

Data doesn’t need to be perfect to be useful “Your data doesn’t have to be […]

Don’t just clean data, clean reports and queries

February 25, 2026

Don’t just clean data, clean reports and queries One of the reasons I exhort my […]

Don’t forget, your staff have day jobs…

February 18, 2026

Don’t forget, your staff have day jobs… The vast majority of my work is finite […]

Beware the automated “How did we do?” trap!

February 11, 2026

Beware the automated “How did we do?” trap! One of the downsides of technology is […]

Hindsight is 20/20

February 4, 2026

Hindsight is 20/20 I’m currently working with a client that is moving from their legacy […]

Cheaper now; costlier later.

January 28, 2026

Cheaper now; costlier later. Most of my clients are very cost-conscious, understandably. After all, who […]

The unified shopping cart: Dreams vs. reality

January 21, 2026

The unified shopping cart: Dreams vs. reality One of the more common requests I hear […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top