Question 1

Which regions are supported today?

Accepted Answer

Twenty: US, UK, Ireland, Canada, Australia, New Zealand, Germany, Austria, Switzerland, Netherlands, France, Spain, Italy, Portugal, Sweden, Norway, Denmark, Poland, Mexico, and Brazil. Each region has validated handling of particles, compound surnames, diacritics, and local postal-code formats.

Question 2

What about Chinese, Japanese, Korean, or Arabic?

Accepted Answer

CJK (Chinese, Japanese, Korean) and right-to-left scripts (Arabic, Hebrew, Persian) are on our roadmap but not yet in the product. They present unique engineering challenges — name-order conventions, transliteration ambiguity, script composition — that deserve dedicated product work. Indic languages (Hindi, Bengali, Tamil), Thai, Vietnamese, and Finnish are also on the roadmap.

Question 3

Does it handle Spanish two-surname conventions?

Accepted Answer

Yes. Spanish and LatAm customer lists typically use paternal + maternal surnames (García López). The engine treats the paternal surname as the primary match key while using the maternal surname as a secondary signal — so María García in one file matches María García López in another without false positives.

Question 4

How are diacritics handled?

Accepted Answer

Region-by-region. German ä/ö/ü/ß fold per convention (Müller ↔ Mueller). French and Spanish accents fold to base characters. Scandinavian Å/Ø/Æ use region-appropriate conventions (Åke ↔ Ake in Swedish; Søren ↔ Soren in Danish). Polish ł/ń/ś/ż fold cleanly. Each region module ships with a validated diacritic-fold table.

Question 5

Can one match job span multiple regions?

Accepted Answer

Yes. A single file can contain records from any mix of the 20 supported regions. The engine applies region detection per row (based on postal code, country column, or name signals) and uses the appropriate regional module for each.

Question 6

What if my file has a region you don't list?

Accepted Answer

The engine still runs with a sensible Latin-script fallback — Unicode NFKD normalization, accent stripping, generic particle handling. You'll get reasonable results on Latin-script data from unlisted regions; you just won't get the per-country validation and specific folding tables the 20 listed regions have.

Question 7

How do you compare to other matching tools on international data?

Accepted Answer

Most matchers were built with US data in mind — they treat international name patterns as edge cases, if at all. WinPure and Match2Lists offer some European-region support but with limited language coverage. Excel and OpenRefine require manual normalization for every regional quirk. We're built international-first: 20 regions ship validated end-to-end, and the engine routes each row through the right regional module automatically.

International list matching across 20 regions — built for Madrid, Munich, and Manchester, not just Manhattan.

20 regions, validated end-to-end.

English-speaking

Western Europe (DACH + Benelux)

Southern Europe

Nordic

Eastern Europe

Latin America

Where off-the-shelf matchers fail.

Spanish full-name field with 4-5 tokens

Dutch particle 'van der' preserved and matched

German umlaut folding per convention

Québécois hyphenated saint-names

Brazilian Portuguese da/dos/de particles

Scandinavian Å/Ø/Æ regional folding

A real Spanish match, three rows.

How does this stack up against other matchers?

One engine, no config

Region modules, validated

GDPR-ready data residency

What's not in today's release.

International matching questions

Let the Genie handle the grunt work.