Three thoughts on duplicate records

Three thoughts on duplicate records

Duplicate records are a reality in any database of any size, so as database managers, we always have to deal with them (or should deal with them!). So here are three thoughts on managing duplicate records:

  1. Duplicate records will always exist. Always. Unless you're managing only a tiny number of records, you will always have duplicates. So the goal isn't to eliminate duplicates, but to minimize their number.
  2. Focus on and fix any processes that tend to cause creation of duplicate records. For example, I often hear from my clients "Our website password recovery process doesn't work, or is too cumbersome, so customers just create new records in order to register for an event quickly." Whatever process you have that leads to duplicate records, fix that process.
  3. Consistently run a process for identifying potential duplicates, and clean them up. On at least a quarterly basis you should run a query that helps you identify potential duplicate records and then take the time to clean up records that are actual duplicates. And of course, clean up duplicate records as you find them in your day-to-day work. But seeking them out and fixing them consistently, over time, is the best way to minimize duplicate records.

Duplicate records are a reality of life. But suffering with an overwhelming number of duplicates is a choice, and something you can fix, if you take the time to do so.

Wes's Wednesday Wisdom Archives

“If you keep doin’ what you’re doin’, you’ll keep gettin’ what you’re gettin’.”

June 5, 2024

“If you keep doin’ what you’re doin’, you’ll keep gettin’ what you’re gettin’.” I saw […]

Rather than adding something new, try subtracting

May 29, 2024

Rather than adding something new, try subtracting I read recently that sociological research suggests, when presented […]

It’s always people, process, and technology

May 22, 2024

It’s always people, process, and technology I speak and write a lot about people, process, […]

Once it’s lost, trust can be difficult to regain

May 15, 2024

Once it’s lost, trust can be difficult to regain I recall hearing once long ago […]

Share your successes!

May 8, 2024

Share your successes! I was speaking at an association meeting recently and one of the points […]

What are YOUR data integrity reports?

May 1, 2024

What are YOUR data integrity reports? Sitting in an AMS demo with a client recently, […]

You might have to do SOME of the work yourself!

April 24, 2024

You might have to do SOME of the work yourself! Many, many years ago I […]

Don’t manage to the exception!

April 17, 2024

Don’t manage to the exception! One of the universal truths about data management is, wherever possible, […]

It’s always about improvement

April 10, 2024

It’s always about improvement Talking with a client recently, she expressed frustration about one particular project […]

Don’t be a hoarder!

April 3, 2024

Don’t be a hoarder! The simple truth is that it’s almost “free” to collect data. […]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top