Three thoughts on duplicate records

Duplicate records are a reality in any database of meaningful size, so as database managers we always have to deal with them (or should deal with them!). Here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will have duplicates. So the goal isn't to eliminate duplicates entirely, but to keep their number as low as you can.
  2. Find and fix any process that tends to create duplicate records. For example, I often hear from my clients: "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. At least once a quarter, run a query that identifies potential duplicate records, then take the time to clean up the ones that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
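
To make the third point concrete, here is a minimal sketch of what a "potential duplicates" query might look like, written in Python rather than in any particular database's query language. Everything in it is an assumption for illustration: the field names (id, first_name, last_name, email), the sample records, and the matching rule (same email, or same full name, after ignoring case and extra spaces). Your own system will have its own fields and may need looser or stricter matching.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivially different spellings match."""
    return " ".join(value.lower().split())


def find_potential_duplicates(records):
    """Group record ids that share a normalized email or a normalized full name.

    Returns a dict mapping (match_type, matched_value) to a list of record ids.
    Only groups with more than one record are returned. These are candidates
    for a person to review, not confirmed duplicates.
    """
    by_key = defaultdict(list)
    for record in records:
        # Field names are hypothetical; adjust them to match your own database.
        email = normalize(record.get("email") or "")
        first = record.get("first_name") or ""
        last = record.get("last_name") or ""
        name = normalize(first + " " + last)
        if email:
            by_key[("email", email)].append(record["id"])
        if name:
            by_key[("name", name)].append(record["id"])
    return {key: ids for key, ids in by_key.items() if len(ids) > 1}


# Made-up sample data, for illustration only.
sample_records = [
    {"id": 1, "first_name": "Pat", "last_name": "Lee", "email": "pat.lee@example.org"},
    {"id": 2, "first_name": "pat", "last_name": "LEE ", "email": "Pat.Lee@Example.org"},
    {"id": 3, "first_name": "Sam", "last_name": "Ortiz", "email": "sam@example.org"},
    {"id": 4, "first_name": "Pat", "last_name": "Lee", "email": ""},
]

for (match_type, value), ids in find_potential_duplicates(sample_records).items():
    print(f"Possible duplicates by {match_type} '{value}': records {ids}")
```

Note that the output is only a list of candidates. Two different people can share a name, so someone still has to look at each group and decide which records are actual duplicates before cleaning them up.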

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
