visibility Static layout preview of Find Duplicates, shown with a file imported and a completed run (results + match-group review). All pages →

Find Duplicates

Find rows that repeat, then keep one and remove the extras.

upload_file Drag and drop file here Up to 1.5 GB · CSV, TSV, XLSX, XLS · encoding & delimiter auto-detected
customers_export.csv 2.1 MB
Comma (,)
Auto-detected on upload. Change if the preview looks wrong.
Preview: customers_export.csv

18,442 rows, 6 columns

nameemailcityphonesignup_date
0Jane Doejane@acme.ioAustin512-555-01902024-01-04
1jane doeJANE@ACME.IOaustin(512) 555-019001/04/2024
2Bob Smithbob@globex.comDenver720-555-77812024-02-11
3R. Smithbob@globex.comDenver720-555-77812024-02-11
Options
Advanced Options
Leave empty to auto-detect
email
name
jaro_winkler
85
most-complete
check Merge mode — fill missing fields in the surviving row


Results

Original rows
18,442
Duplicate rows
312
−312 removed
Match groups
147
Rows kept
18,130

Match Groups

Group 1 · 2 rows 98% match
keepnameemailcityphonesignup_date
keepJane Doejane@acme.ioAustin512-555-01902024-01-04
removejane doeJANE@ACME.IOaustin(512) 555-019001/04/2024

Differing columns highlighted. The survivor row is kept; uncheck rows to split the group.

Group 2 · 2 rows 87% match
keepnameemailcityphonesignup_date
keepBob Smithbob@globex.comDenver720-555-77812024-02-11
removeR. Smithbob@globex.comDenver720-555-77812024-02-11

Decisions: 1 merged, 1 pending

Processing Log
[00:00.01] Loaded 18,442 rows from customers_export.csv [00:00.04] Strategy: exact(email) + fuzzy(name, jaro_winkler ≥ 85) [00:00.91] Compared 18,442 rows → 147 match groups [00:01.02] Survivor rule: most-complete · merge=on [00:01.05] 312 rows flagged for removal