Three thoughts on duplicate records
Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:
- Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
- Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
- Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
![]()
Wes's Wednesday Wisdom Archives
Are you closing the loop?
Are you closing the loop? If your association does a call for presentations for any […]
Declare victory and move on
Declare victory and move on The law of diminishing returns is the point at which the […]
Clean as you go
Clean As You Go A good cook or baker knows that, when working in the […]
Baby Steps
One of the keys to developing good data management habits is to be aware of […]
Success Requires Discipline
When it comes to data management, most of us know what to do; we just don’t […]
Take a moment to be grateful
Because we’re so focused on always improving what we have now, it’s easy to overlook […]
KPIs and Dashboards
I saw DJ Muller from MemberClicks speak on KPIs (key performance indicators). In his session […]
Documenting Process is Critical
When it comes to managing data successfully, process is critical. For example, a client of […]
Motion vs. Action
In James Clear’s book Atomic Habits (I recommend it!), he discusses the concept of motion vs. action. […]
Are You Answering Your Calls?
I’ve written about this before, but apparently I have to keep repeating it. If you’ve […]
