Three thoughts on duplicate records

Duplicate records are a reality in a database of any size, so as database managers we always have to deal with them (or should deal with them!). Here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Find and fix any process that tends to create duplicate records. For example, I often hear from my clients, "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
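To make the third point concrete, here is a minimal sketch of what a "potential duplicates" check might look like, written in Python rather than as a query in any particular AMS. The field names (`id`, `first_name`, `last_name`, `email`) and the matching rule (same email, or same full name, after lowercasing and trimming) are assumptions for illustration only; your system's fields and your own matching rules will differ. It only flags candidates, and a person still has to decide which ones are actual duplicates.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't hide a match."""
    return " ".join((value or "").lower().split())


def find_potential_duplicates(records):
    """Group record ids that share a normalized email or a normalized full name.

    Hypothetical field names: id, first_name, last_name, email.
    Returns a sorted list of id groups with more than one member.
    These are candidates for human review, not confirmed duplicates.
    """
    by_key = defaultdict(set)
    for rec in records:
        email = normalize(rec.get("email"))
        name = normalize(f"{rec.get('first_name') or ''} {rec.get('last_name') or ''}")
        if email:
            by_key[("email", email)].add(rec["id"])
        if name:
            by_key[("name", name)].add(rec["id"])

    # Keep only keys shared by two or more records, and drop repeated groups.
    groups = {tuple(sorted(ids)) for ids in by_key.values() if len(ids) > 1}
    return sorted(list(group) for group in groups)
```

Matching on exact normalized values is deliberately conservative. It will miss nicknames and typos, so treat it as a starting point for the quarterly review rather than a complete answer.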

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
