Solution Background and Business Value
Credit card fraud is a widespread issue affecting financial institutions, businesses, and consumers. Fraud occurs when a malicious actor gains unauthorized access to a customer’s credit card and makes fraudulent transactions. Common fraud methods include:- Physical theft of the card.
- Skimming card details from payment terminals.
- Online breaches where card data is stolen and misused.
- Identify suspicious transactions early and intervene before money is lost.
- Reduce false positives, ensuring legitimate transactions are not blocked.
- Enhance fraud detection accuracy by leveraging graph-based patterns in transaction networks.
Data Requirements and Schema
Kumo AI processes relational data as interconnected tables using Graph Neural Networks (GNNs). This approach allows the model to learn from transaction patterns, account behavior, and merchant interactions without extensive feature engineering.
- Transactions Table
- Stores all recorded transactions.
- Key attributes:
transaction_id
: Unique identifier for each transaction.timestamp
: When the transaction occurred.credit_card_id
: Links transaction to a credit card.merchant_id
: Links transaction to a merchant.- Optional: Location, currency, amount, transaction type.
- Credit Cards Table
- Represents unique credit cards in the system.
- Key attributes:
credit_card_id
: Unique identifier.cc_open_date
: Date when the card was issued.cc_close_date
: Date when the card was closed (if applicable).- Optional: Credit limit, APR, fraud risk score.
- Fraud Reports Table
- Stores fraud labels for transactions.
- Key attributes:
transaction_id
: Links to the transaction flagged as fraudulent.timestamp
: When the fraud report was filed.label
: 1 if fraudulent, 0 if legitimate.
- Users Table: Links credit cards to customers.
- User Stats Table: Stores aggregated stats like transaction count and total spend.
- Merchants Table: Stores merchant details (e.g., category, location, risk rating).
Predictive Query for Credit Card Fraud Detection
1. Transaction-Level Fraud Detection Predict whether a transaction is fraudulent based on past fraud reports:- At inference time, leave
LABEL
empty for new transactions to generate fraud risk scores.
Deployment Strategy
1. Batch Predictions for Fraud Analysts- Fraud teams review high-risk transactions flagged by the ML model.
- Predictions are generated hourly or daily in batch mode.
- Fraud analysts label new fraudulent transactions, improving the model over time.
- The system generates instant fraud risk scores when a transaction occurs.
- High-risk transactions trigger manual review or two-factor authentication.
- ML embeddings enhance rule-based fraud detection.