Matching people is largely about names. Matching companies, suppliers, accounts, or products is about identifiers and context — domains, SKUs, categories, street addresses that mean very different things than they do for people.
This guide covers matching non-person entities end-to-end, with the specific knobs to turn for company and product data.
When to use this guide
Use this workflow when you're matching:
- Companies — B2B accounts, supplier lists, firmographic data, CRM account records
- Products — SKUs, inventory items, catalog entries
- Any non-person entity with identifiers different from person names (email, phone)
For person matching, see Match leads to CRM. For finding duplicates inside a single list, see Deduplicate a customer list.
Why company matching is different
Company and product matching face issues person matching doesn't:
Naming is creative
Acme Inc., ACME INCORPORATED, acme, LLC, Acme Corp., The Acme Company, and Acme (formerly Beta Industries) can all refer to the same company. Person names don't have this — John Smith has much less variability than Johnson & Johnson Inc..
Match profiles for companies handle this with specific rules:
- Suffix stripping (
Inc,Incorporated,LLC,Ltd,Corp,Company,Co) - Article removal (
The,La,Le,Der) - Ampersand normalization (
&↔and) - Tokenization so word order matters less (
Smith Jones & Co↔Jones Smith & Co)
Domain beats name
If two records have the same primary domain (e.g. acme.com), they are overwhelmingly likely to be the same company — more reliable than any name comparison. The Company profile weights domain as the strongest signal when it's present.
Address is weak signal
Companies move, operate multiple locations, and list different addresses for billing vs shipping vs headquarters. Address carries far less weight in the Company profile than in Person.
Product matching is about SKUs
For products, the SKU (when present) is near-definitive. Product name comparison uses token-based matching so 12oz Blue Mug Ceramic matches Blue Ceramic Mug, 12oz despite different word order. Numeric tolerance handles 12.5in ~= 12.50" ~= 12-1/2 inch.
The workflow for companies
Prepare both files
For the source and master, include as many of these columns as you have:
- Company name (required)
- Domain (huge help — include if you have it)
- Street address, city, state, ZIP
- Phone
- Industry / SIC / NAICS code
- Any internal ID (your CRM's account ID, DUNS, etc.)
Clean each file individually first if you suspect internal duplicates.
Upload with correct profiling
On upload, verify column detection:
- Company name column profiled as
company_name - Domain profiled as
domain(not generic URL) - Address columns correctly identified as their parts
If the Genie misidentifies a column (e.g. a "parent_company" column as generic text), override it.
- Company name column profiled as
Pick the Company profile
Select Company as your match profile. The defaults are good for most B2B matching. Leave threshold at 70.
Custom tuning you might want:
- If domain is present in both files, consider raising domain weight further. Company domains rarely lie.
- If you're matching branches of companies where the same domain is shared, lower domain weight instead and raise address.
- If industry code is present in both files, add it as a low-weight tie-breaker.
Run and review
The match runs the same way as person matching. On Review, pay extra attention to:
- Same name, different location — often legitimate branches. Use the location context to decide whether to treat them as one or two entities.
- Same domain, different name — usually a rebrand. These are generally the same entity.
- Similar name, totally different industry — usually different entities. A "Sunrise Bakery" and a "Sunrise Consulting Group" are unrelated.
Export with enrichment
The exported file has your source rows plus:
- Every master column from the matched record (useful for enrichment)
- The
_lmg_metadata including match method and score - Any
_lmg_notesyou added during review
For CRM account reconciliation, filter to
_lmg_match_status = matchto get a clean lookup table of "source account → master account ID".
The workflow for products
Products follow the same overall workflow. The key differences:
Use the Product / SKU profile
This profile weights SKU as the strongest signal when present, with product name next, then brand and category as supporting signals.
Clean SKUs carefully
SKUs often have trailing suffixes (ABC-123-V2, ABC-123-V3 are often the same product). Decide ahead of time whether versioning matters to you and either preserve or strip suffixes in the column settings.
Use token-based comparison for names
Product names in the Product profile are tokenized and compared as sets of terms, not ordered strings. "Blue Ceramic Mug 12oz" and "12oz Blue Mug Ceramic" match because the token set is identical.
Numeric tolerance
For fields with numeric values (dimensions, weights, volumes), the profile applies a 2% tolerance by default. 12.5in, 12.50", and 12.5 inches all match. Override tolerance per-field in custom profiles.
Common gotchas
Parent/subsidiary confusion
Your master might list "Alphabet Inc" while your source has "Google LLC". Same real-world entity, different legal names. Handle this by:
- Enriching your master with "also known as" columns (e.g.
aka,parent,subsidiary_of) - Using the Company profile with relaxed thresholds (65), then manually reviewing the ambiguous cases
Suppliers with the same name
"Smith Manufacturing" in California and "Smith Manufacturing" in Texas could be the same company or two unrelated ones. The Company profile uses address and phone as tie-breakers, but if both are unknown, these go to review. Decide case-by-case; consider adding an industry column if your data has it.
Product variants
"Blue Ceramic Mug 12oz" and "Blue Ceramic Mug 16oz" are two products, not one. Make sure size/capacity/variant columns are weighted highly enough that they can discriminate.
Domain enrichment pays off
If your company data doesn't have domains, enriching it once (via Clearbit, ZoomInfo, or public WHOIS) makes every future match run dramatically more accurate. Domain is the most reliable company identifier short of a legal entity ID.
Related reading
- Match profiles — detail on Company and Product profiles
- Field mapping — cross-column alignment between source and master
- Choosing a profile — decision tree for all match types
