Kumo Co-Founder Hema Raghavan Named to Inc.’s 2026 Female Founders 500

Learn more
1Link Prediction · Identity Matching

Identity Matching

For each customer record, which other records in the database represent the same person?

Book a demo and get a free trial of the full platform: data science agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

A real-world example

For each customer record, which other records in the database represent the same person?

Customer databases contain 15-25% duplicates on average. Fuzzy string matching catches only obvious cases (typos). Kumo uses behavioral and relational signals — shared devices, overlapping transactions, same IP addresses — to find matches that string-based methods miss entirely. A single unresolved identity costs $10-50 in wasted marketing spend per year, and at enterprise scale that adds up to millions in misattributed revenue and duplicated outreach.

How KumoRFM solves this

Relational intelligence for identity resolution

Kumo builds a relational graph connecting customers to their interactions, devices, transactions, and channels. Instead of comparing name strings, Kumo learns that Customer C001 and Customer C847 share the same device fingerprint, transact at the same merchants, and browse from overlapping IP ranges. These behavioral signals create a rich identity graph where matches emerge from structural similarity — not surface-level text overlap. The graph captures identity signals that rules-based systems cannot encode.

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

1

Your data

The relational tables Kumo learns from

CUSTOMERS

customer_idnameemailphoneaddress
C001John Smithjohn@acme.com555-0142123 Oak St
C847J. Smithjsmith@gmail.com555-0142123 Oak Street
C302Maria Lopezmlopez@corp.io555-0891456 Elm Ave

INTERACTIONS

interaction_idcustomer_idchanneldevice_idtimestamp
INT-5001C001webDEV-A12025-09-14 10:23
INT-5002C847mobileDEV-A12025-09-14 14:05
INT-5003C302webDEV-B72025-09-15 09:11

DEVICES

device_iddevice_typebrowseros
DEV-A1laptopChromemacOS
DEV-B7desktopFirefoxWindows
DEV-C3mobileSafariiOS
2

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL
PREDICT LIST_DISTINCT(INTERACTIONS.CUSTOMER_ID, 0, 30, days)
FOR EACH CUSTOMERS.CUSTOMER_ID
3

Prediction output

Every entity gets a score, updated continuously

CUSTOMER_IDMATCHED_CUSTOMER_IDSCORETIMESTAMP
C001C8470.942025-10-01
C302C9180.872025-10-01
C455C7120.722025-10-01
4

Understand why

Every prediction includes feature attributions — no black boxes

Customer C001 (John Smith)

Predicted: 94% match with C847 (J. Smith)

Top contributing features

Shared device fingerprint (DEV-A1)

Same device

35% attribution

Phone number overlap

Exact match

25% attribution

Transaction merchant overlap

87%

20% attribution

IP address proximity

Same subnet

12% attribution

Address string similarity

0.91

8% attribution

Feature attributions are computed automatically for every prediction. No separate tooling required. Learn more about Kumo explainability

Bottom line: Resolve 15-25% hidden duplicates in your customer database — recovering millions in misattributed revenue and eliminating embarrassing double-outreach.

Topics covered

identity matching AIcustomer identity resolutiongraph neural network identityentity resolution machine learningrecord matching AIKumoRFMrelational deep learningpredictive query languagecustomer deduplicationidentity graphcross-system identity matchingbehavioral identity resolution

One Platform. One Model. Predict Instantly.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Data Science Agent for 30%+ higher accuracy than traditional models.

Book a demo and get a free trial of the full platform: data science agent, fine-tune capabilities, and forward-deployed engineer support.