Find Fuzzy Duplicates in CSV Files Online
Find near-duplicate rows in CSV files using Levenshtein edit distance or Jaro-Winkler similarity — all in your browser. Set your own threshold and download the matched pairs as CSV — no upload required.
Drop your .csv file here
or click to browse — max 50 MB
Frequently Asked Questions
Is my data private?
Yes — completely. Your file is never uploaded to any server. Everything runs locally in your browser using WebAssembly — processing happens entirely inside your tab. Once you close the tab, nothing is retained.
What is the maximum file size?
The free limit is 50 MB. For larger files, performance depends on your device's available memory — most modern machines handle 500 MB to 1 GB comfortably.
Related Tools
Convert CSV to Parquet Online
Convert CSV files to Parquet format directly in your browser. No upload required — your data never leaves your device.
Convert CSV to JSON Online
Convert CSV files to JSON format directly in your browser. No upload required — your data never leaves your device.
Convert CSV to NDJSON Online
Convert CSV files to NDJSON format directly in your browser. No upload required — your data never leaves your device.
Convert CSV to TSV Online
Convert CSV files to TSV format directly in your browser. No upload required — your data never leaves your device.
Convert CSV to Arrow Online
Convert CSV files to Arrow format directly in your browser. No upload required — your data never leaves your device.