Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

Patience and grace

August 17, 2022

Patience and grace A past client of mine recently told me: “You inspire confidence and […]

Sometimes you just have to try it and see what happens

August 10, 2022

Sometimes you just have to try it and see what happens The single greatest key […]

Training and testing

August 3, 2022

Training and testing I’ve written before that the best form of training follows this process: […]

How to save a “failing” project

July 27, 2022

How to save a “failing” project It is not unusual for me to receive a […]

How to avoid the “IT black hole”

July 20, 2022

How to avoid the “IT black hole” Going all the way back to my days […]

The “People” are important!

July 13, 2022

The “People” are important! I’ve written a lot about people, process, and technology over the […]

Always look for the MVP

July 6, 2022

Always look for the MVP I first wrote about minimum viable product (MVP) just three […]

The longer you take, the longer it will take

June 29, 2022

The longer you take, the longer it will take It may sound like a tautology, […]

Data is for action

June 22, 2022

Data is for action When deciding whether or not to collect a piece of data, […]

Have you shopped yourself?

June 15, 2022

Have you shopped yourself? I’ve always been fascinated by the “secret shopper” concept. (Maybe it’s […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top