Question 1

How does this compare to writing a Python script with fuzzywuzzy?

Accepted Answer

fuzzywuzzy (and its faster successor rapidfuzz) gives you string-distance functions, not a full match workflow. You still own: the data loading, the column normalization, the threshold tuning per field, the multi-field weighting, the false-positive handling, the output format, and the maintenance. ListMatchGenie wraps all of that in a configured workflow with a UI for review and a saved profile you can re-run. If you enjoy maintaining match scripts, Python is great. If you want the match done in 15 minutes, this is faster.

Question 2

Can I match a CSV against a database table?

Accepted Answer

Currently the workflow is CSV-to-CSV (or XLSX). Export the relevant table from your database to CSV — most databases have a built-in export — then upload both. We're working on direct database connectors; for now CSV is the universal interchange and avoids us needing read access to your database.

Question 3

What if my two CSVs have different encodings (UTF-8 vs Windows-1252)?

Accepted Answer

The Genie auto-detects encoding on upload and normalizes to UTF-8 internally. If a file has mixed encoding (rare, but happens with old exports), the cleansing pass will flag rows with character anomalies before matching starts. Output is UTF-8 with BOM by default — opens cleanly in Excel without garbled accents.

Question 4

How big a CSV can I upload?

Accepted Answer

Per file: 100MB hard limit on every tier (Free through Business). Per-row limits are tier-based — Free 1K source rows, Starter 25K, Pro 100K, Business 500K. The master file (reference data) can be larger than the source — the row limit applies to the rows being matched.

Question 5

Can I run this from the command line / API for automation?

Accepted Answer

API access is on the roadmap (developer-tier feature). Today the workflow is browser-based: upload, configure, download. For one-off matches that's faster than wiring up an API call; for recurring automation we recommend pulling the cleaned CSV from the export step into your downstream pipeline.

Question 6

Are the matched files stored after I download?

Accepted Answer

Files live in your account's regional S3 bucket (US, EU, or UK based on billing address). Default retention is 90 days, configurable per tier. You can delete a job and its associated files at any time; deletion is hard-delete (not soft-delete) and removes both the source upload and the match output.

Match two CSV files — fuzzy join without writing code.

The two CSVs aren't perfectly aligned. Standard joins miss most matches.

Two CSVs in. One merged CSV out. Five-minute workflow.

Drag-and-drop CSV-to-CSV match

Fuzzy matching on multiple fields together

Review queue with confidence + 'why' per cluster

Output: one merged CSV with both sides' columns

Save the match profile for re-runs

No code, no Python, no DB

Matching a CRM export against an event-attendance export

What changes when you use ListMatchGenie

Without ListMatchGenie

With ListMatchGenie

Questions about match two csv files

Let the Genie handle the grunt work.