Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

When “overcommunicating” becomes overwhelming

November 19, 2025

When “overcommunicating” becomes overwhelming There is a common belief in project management and general business management that overcommunicating […]

Communicating when there is a major technology change

November 12, 2025

Communicating when there is a major technology change For those of you who are ASAE […]

Everything works, until it doesn’t.

November 5, 2025

Everything works, until it doesn’t. “If something cannot go on forever, it will stop.” – […]

Some things are unknowable

October 29, 2025

Some things are unknowable One of the most challenging aspects of moving to a new […]

Do we have access to that data?

October 22, 2025

Do we have access to that data? When I work with my clients on a […]

The three “buckets” of contact data

October 8, 2025

The three “buckets” of contact data Broadly speaking, when collecting data on contacts (individuals or organizations), […]

Balancing what is possible with what is realistic

October 1, 2025

Balancing what is possible with what is realistic When I work with clients on technology […]

“Humans want to be surrounded by beauty.”

September 24, 2025

“Humans want to be surrounded by beauty.” “Humans want to be surrounded by beauty.” – […]

Data managers vs data consumers

September 17, 2025

Data managers vs data consumers When I work with clients on any data management project, […]

Another universal truth

September 10, 2025

Another universal truth There are many universal truths in data management. Some examples: You will […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top