visibility Static layout preview of Find Duplicates, shown with a file imported and a completed run (results + match-group review). All pages →

Find Duplicates

Runs 100% locally

Find rows that repeat, then keep one and remove the extras.

description Using customers_export.csv from the upload screen.
Comma (,)
Auto-detected on upload. Change if the preview looks wrong.
Preview: customers_export.csv

18,442 rows, 6 columns

nameemailcityphonesignup_date
0Jane Doejane@acme.ioAustin512-555-01902024-01-04
1jane doeJANE@ACME.IOaustin(512) 555-019001/04/2024
2Bob Smithbob@globex.comDenver720-555-77812024-02-11
3R. Smithbob@globex.comDenver720-555-77812024-02-11
85
Higher means rows must look more alike to count as a duplicate.
the most-complete row
Which row survives in each group of duplicates.
Advanced options

Leave these empty to auto-detect which columns to compare. Otherwise, list the columns that must match exactly and the ones that only need to match approximately — together these are the columns used to find duplicates.

email
name
jaro_winkler
check Merge mode — fill missing fields in the surviving row


Results

Original rows
18,442
Duplicate rows
312
−312 removed
Match groups
147
Rows kept
18,130

Preview of an auto-resolved run: each group keeps its auto-picked survivor. Review the groups below to override any pending picks before the final download.


Match Groups

Differing columns are highlighted. The survivor row is kept; uncheck a row to split it out of the group.

Group 1 · 2 rows 98% match
keepnameemailcityphonesignup_date
keepJane Doejane@acme.ioAustin512-555-01902024-01-04
removejane doeJANE@ACME.IOaustin(512) 555-019001/04/2024
Group 2 · 2 rows 87% match
keepnameemailcityphonesignup_date
keepBob Smithbob@globex.comDenver720-555-77812024-02-11
removeR. Smithbob@globex.comDenver720-555-77812024-02-11

Decisions: 1 merged, 1 pending · Pending groups keep their auto-picked survivor unless you review them.

Processing Log
[00:00.01] Loaded 18,442 rows from customers_export.csv [00:00.04] Strategy: exact(email) + fuzzy(name, jaro_winkler ≥ 85) [00:00.91] Compared 18,442 rows → 147 match groups [00:01.02] Survivor rule: most-complete · merge=on [00:01.05] 312 rows flagged for removal
arrow_forwardDuplicates handled — your file is cleaned. Review the result or Back to Start here →