Three thoughts on duplicate records

Duplicate records are a reality in a database of any size, so as database managers we always have to deal with them (or should deal with them!). Here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Find and fix any process that tends to create duplicate records. For example, I often hear from my clients, "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
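
To make the third point a little more concrete, here is a minimal sketch of what a "potential duplicates" check might look like if you exported your records and ran them through a short Python script. The field names (`id`, `first_name`, `last_name`, `email`) and the function names are assumptions for illustration, not the schema or tooling of any particular AMS, and matching on email or full name is only one simple way to flag candidates. The output is a list of records to review by hand, not a list of confirmed duplicates.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial variations compare equal."""
    return " ".join((value or "").lower().split())


def find_potential_duplicates(records):
    """Group record ids that share a normalized email or a normalized full name.

    `records` is a list of dicts with the hypothetical keys
    'id', 'first_name', 'last_name', and 'email'.
    Returns a dict mapping each match key to the ids that share it.
    """
    groups = defaultdict(list)
    for rec in records:
        email = normalize(rec.get("email"))
        name = normalize(f"{rec.get('first_name') or ''} {rec.get('last_name') or ''}")
        if email:
            groups[("email", email)].append(rec["id"])
        if name:
            groups[("name", name)].append(rec["id"])
    # Only keys shared by more than one record are worth a human review.
    return {key: ids for key, ids in groups.items() if len(ids) > 1}
```

Each group it returns still needs a person to decide whether the records are actual duplicates before anything is merged; two different people can share a name, and one person can have two email addresses.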

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
