Three thoughts on duplicate records

Duplicate records are a reality in a database of any size, so as database managers we always have to deal with them (or should deal with them!). Here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Find and fix any processes that tend to cause the creation of duplicate records. For example, I often hear from my clients, "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.
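
To make the third point concrete, here is a minimal sketch of what a "potential duplicates" query might look like, written in Python rather than against any particular database. Everything in it is an assumption for illustration: the field names (`id`, `first_name`, `last_name`, `email`) are hypothetical, and matching on a normalized email or a normalized full name is just one simple rule among many. It only flags candidates; a person still has to decide which ones are actual duplicates.

```python
from collections import defaultdict


def normalize(value):
    """Lowercase and collapse whitespace so trivial differences don't hide a match."""
    return " ".join(value.lower().split())


def find_potential_duplicates(records):
    """Group record ids that share a normalized email or a normalized full name.

    `records` is a list of dicts with hypothetical keys: id, first_name,
    last_name, email. Returns a dict mapping (match_type, matched_value)
    to a sorted list of ids, keeping only groups with two or more records.
    """
    buckets = defaultdict(set)
    for rec in records:
        email = normalize(rec.get("email", ""))
        name = normalize(rec.get("first_name", "") + " " + rec.get("last_name", ""))
        if email:
            buckets[("email", email)].add(rec["id"])
        if name:
            buckets[("name", name)].add(rec["id"])
    # Anything that matched only itself is not a candidate.
    return {key: sorted(ids) for key, ids in buckets.items() if len(ids) > 1}


if __name__ == "__main__":
    # Made-up sample data, purely for illustration.
    sample = [
        {"id": 1, "first_name": "Pat", "last_name": "Lee", "email": "pat@example.org"},
        {"id": 2, "first_name": "Pat", "last_name": "Lee", "email": "PAT@Example.org"},
        {"id": 3, "first_name": "Sam", "last_name": "Ortiz", "email": "sam@example.org"},
        {"id": 4, "first_name": "sam", "last_name": " ortiz", "email": "s.ortiz@example.org"},
        {"id": 5, "first_name": "Kim", "last_name": "Wu", "email": "kim@example.org"},
    ]
    for (match_type, value), ids in sorted(find_potential_duplicates(sample).items()):
        print(f"{match_type} match on {value!r}: records {ids}")
```

A rule this simple will miss some duplicates (nicknames, typos) and flag some false positives (two different people with the same name), which is why the output is a review list and not an automatic merge.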

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.
