Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

Asking for more is a good sign…

July 26, 2023

Asking for more is a good sign… A client who had recently implemented a new […]

Take action…

July 19, 2023

Take action… “The greatest wisdom not applied to action and behavior is meaningless data.” – […]

The Rule of 100 and 1,000 revisited

July 12, 2023

The Rule of 100 and 1,000 revisited I’m finding that the “Rule of 100 and […]

You gotta wanna

July 5, 2023

You gotta wanna Long ago I heard a training consultant say you can’t train people […]

Be careful not to overbuy

June 14, 2023

Be careful not to overbuy I recently spoke with an association of ten staff that was […]

When is the best time to clean your data?

June 7, 2023

When is the best time to clean your data? One of the most common questions […]

Do the benefits outweigh the risks?

May 31, 2023

Do the benefits outweigh the risks? As the economist Thomas Sowell points out, there are […]

Painting the Bridge

May 24, 2023

Painting the Bridge According to this article, the Golden Gate Bridge is painted continuously year-round. […]

Maintenance isn’t sexy

May 17, 2023

Maintenance isn’t sexy I remember reading once long ago that one of the reasons our […]

“Will I still have a job when this is done?”

May 10, 2023

“Will I still have a job when this is done? While working with a client […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top