Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

“Different” isn’t necessarily better or worse.

December 4, 2024

“Different” isn’t necessarily better or worse. One of the biggest challenges I face when working […]

The Rule of 100 and 1,000 and automation

November 20, 2024

The Rule of 100 and 1,000 and automation I originally coined the rule of 100 […]

Once you know, what will you do?

November 13, 2024

Once you know, what will you do? I’ve yet to meet a client who didn’t […]

If it’s not in your AMS, why not?

November 6, 2024

If it’s not in your AMS, why not? I like to tell my clients they’ll […]

Why checkboxes and tags are awesome and dangerous

October 30, 2024

Why checkboxes and tags are awesome and dangerous One of the most common functions in […]

Don’t miss obvious engagement data

October 23, 2024

Don’t miss obvious engagement data What I’ve experienced with my clients over the years is […]

All data requires active management

October 16, 2024

All data requires active management It’s a simple fact of data management that is often […]

Documentation is critical for consistency

October 9, 2024

Documentation is critical for consistency There are so many reasons why documenting your data management […]

Consumer demands change and technology changes

October 2, 2024

Consumer demands change and technology changes When I work with clients on the selection of […]

Why I write

September 25, 2024

Why I write Thirty years ago, I started a new job as director of membership […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top