Three thoughts on duplicate records
Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:
- Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
- Find and fix any processes that tend to create duplicate records. For example, I often hear from my clients, "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
- Consistently run a process for identifying potential duplicates, and clean them up. At least quarterly, run a query that identifies potential duplicate records, then take the time to merge or clean up the ones that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
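
To make the "run a query" idea concrete, here is a minimal sketch in Python of what a potential-duplicate check might look like. It is an illustration under stated assumptions, not a description of any particular system: the field names (`id`, `first_name`, `last_name`, `email`) and the matching rule (same email or same full name after lowercasing and trimming) are hypothetical, and most databases will need fuzzier rules than this.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't hide a match."""
    return " ".join(value.lower().split())


def find_potential_duplicates(records):
    """Group record ids that share a normalized email or a normalized full name.

    `records` is a list of dicts with hypothetical keys:
    id, first_name, last_name, email.
    Returns a sorted list of (match_key, [ids]) for every key shared by
    more than one record. These are candidates only; a person still has
    to confirm which ones are actual duplicates.
    """
    groups = defaultdict(list)
    for rec in records:
        email = normalize(rec.get("email") or "")
        name = normalize(f"{rec.get('first_name') or ''} {rec.get('last_name') or ''}")
        if email:
            groups[("email", email)].append(rec["id"])
        if name:
            groups[("name", name)].append(rec["id"])
    return sorted((key, ids) for key, ids in groups.items() if len(ids) > 1)


# Made-up sample data for illustration only.
records = [
    {"id": 1, "first_name": "Pat", "last_name": "Lee", "email": "pat.lee@example.org"},
    {"id": 2, "first_name": "pat", "last_name": "lee ", "email": "plee@example.com"},
    {"id": 3, "first_name": "Patricia", "last_name": "Lee", "email": "Pat.Lee@example.org"},
    {"id": 4, "first_name": "Sam", "last_name": "Ortiz", "email": "sam@example.org"},
]

for key, ids in find_potential_duplicates(records):
    print(key, ids)
```

With this sample data, records 1 and 3 are flagged because they share an email address, and records 1 and 2 because they share a name. Note that the output is a list of candidates for review, not a list of records to merge automatically: two different people can share a name, which is why the human cleanup step still matters.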
Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
