Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

DAN – The Data Analytics Network

September 18, 2024

DAN – The Data Analytics Network I’m a huge fan of users groups (both internal […]

Process before technology

September 11, 2024

Process before technology In a conversation with a client recently, I was reminded (yet again) […]

Opting out and communication preferences

September 4, 2024

Opting out and communication preferences Last week’s newsletter discussed the need for associations to collect mobile […]

Are you collecting mobile phone numbers? You should be.

August 28, 2024

Are you collecting mobile phone numbers? You should be. Are you collecting (and using) the […]

Spend less time on data management and more on higher value activities

August 21, 2024

Spend less time on data management and more on higher value activities Data management is very […]

Change anything you want, except your name!

August 7, 2024

Change anything you want, except your name! This is an oldy but a goody, but […]

If you don’t trust your vendor…

July 24, 2024

If you don’t trust your vendor… When I start an AMS selection project with a […]

Your RFP should go to no more than five vendors!

July 17, 2024

Your RFP should go to no more than five vendors! As a rule, when I […]

Be concise!

July 10, 2024

Be concise! I started a monthly newsletter almost 25 years ago (which I recently discontinued). […]

A great example of a data integrity report!

June 26, 2024

A great example of a data integrity report! A couple months back I discussed the […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top