Three thoughts on duplicate records
Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:
- Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
- Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
- Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
![]()
Wes's Wednesday Wisdom Archives
What are you doing with new contacts?
What Are You Doing with New Contacts/ I was very interested to read in a […]
Be Aware of Selection Bias
Be Aware of Selection Bias I wrote recently about the mistaken perception of older members […]
Some Things Just Take Time
Some Things Just Take Time I learned recently that an elephant’s gestation period is 18 […]
Sometimes It’s the Least Bad Choice
Sometimes It’s the Least Bad Choice Just like in life, sometimes when we’re making technology […]
Our Members Aren’t Tech Savvy
Our Members Aren’t Tech Savvy Having worked now in the association space for more than […]
Motion vs. Action
Motion vs. Action One key to successful data management is understanding the difference between motion […]
There is ALWAYS a Trade-off
There is ALWAYS a Trade-off I’ve written many times about trade-offs (you can read a […]
Little by little, a little becomes a lot
Little by little, a little becomes a lot “Little by little, a little becomes a […]
Why do we treat data management differently?
Why do we treat data management differently? A recent post on ASAE’s community read: “Looking […]
Don’t Forget Your Speakers!
Don’t Forget Your Speakers! A phenomenon I’ve noticed over the years is that my clients will […]
