Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

Some data analytics tips from the DAN “Science Fair”

October 11, 2023

Some data analytics tips from the DAN “Science Fair” Last week I had the opportunity […]

Are all your processes frictionless?

October 4, 2023

Are all your processes frictionless? I’m not a huge fan of buzzwords, but I love […]

Trends don’t need perfect data

September 27, 2023

Trends don’t need perfect data When it comes to analyzing data trends (changes in data […]

Start with “Why” before you move to “How”

September 20, 2023

Start with “Why” before you move to “How” Something I’ve noticed over my years in […]

The power of users groups!

September 13, 2023

The power of users groups! Last week I had the honor and pleasure of speaking […]

Associations are complex businesses!

September 6, 2023

Associations are complex businesses! One of the reasons managing data at an association can be so […]

Snapshots are required

August 30, 2023

Snapshots are required Recently a couple of different clients have asked me why it’s necessary […]

“It’s in the database…”

August 16, 2023

“It’s in the database…” I often joke with my clients that AMS nirvana looks like […]

Using the Pomodoro Technique for data management

August 9, 2023

Using the Pomodoro Technique for data management I’m a huge fan of the Pomodoro Technique, […]

You need a PLAN to deal with duplicates

August 2, 2023

You need a PLAN to deal with duplicates Here’s the thing about duplicate records: No […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top