Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

Back to basics

April 5, 2023

Back to basics Over the past couple of years I’ve noticed that some AMS vendors […]

Your people matter

March 29, 2023

Your people matter I’ve written many times about how people, process, and technology have to […]

We remember moments…

March 22, 2023

We remember moments… “We do not remember days, we remember moments.” – Cesare Pavese Another […]

Acknowledging problems is part of managing expectations

March 15, 2023

Acknowledging problems is part of managing expectations Research was done some time ago that suggested […]

Need data? Consider third-party sources

March 8, 2023

Need data? Consider third-party sources I always tell my clients, only collect data that you’re […]

Action must follow the decision

March 1, 2023

Action must follow the decision When I work with my clients on their projects (whether […]

Everything should be focused on improving user adoption

February 22, 2023

Everything should be focused on improving user adoption Your AMS is a tool, and a […]

Needs change over time, and that’s OK

February 15, 2023

Needs change over time, and that’s OK I was speaking with a couple of association […]

The vaguer the question, the vaguer the answer

February 9, 2023

The vaguer the question, the vaguer the answer As the old saying goes, the devil […]

The best choice given the information you have

February 2, 2023

The best choice given the information you have “Hindsight is 20/20” is a cliché because, […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top