Credit Card Fraud Detection Solution
Solution Background and Business Value
Credit card fraud is a widespread issue affecting financial institutions, businesses, and consumers. Fraud occurs when a malicious actor gains unauthorized access to a customer’s credit card and makes fraudulent transactions. Common fraud methods include:
-
Physical theft of the card.
-
Skimming card details from payment terminals.
-
Online breaches where card data is stolen and misused.
To mitigate financial losses, businesses can use machine learning models to detect fraud in real-time, allowing them to:
-
Identify suspicious transactions early and intervene before money is lost.
-
Reduce false positives, ensuring legitimate transactions are not blocked.
-
Enhance fraud detection accuracy by leveraging graph-based patterns in transaction networks.
Data Requirements and Schema
Kumo AI processes relational data as interconnected tables using Graph Neural Networks (GNNs). This approach allows the model to learn from transaction patterns, account behavior, and merchant interactions without extensive feature engineering.
Core Tables
-
Transactions Table
-
Stores all recorded transactions.
-
Key attributes:
-
transaction_id
: Unique identifier for each transaction. -
timestamp
: When the transaction occurred. -
credit_card_id
: Links transaction to a credit card. -
merchant_id
: Links transaction to a merchant. -
Optional: Location, currency, amount, transaction type.
-
-
-
Credit Cards Table
-
Represents unique credit cards in the system.
-
Key attributes:
-
credit_card_id
: Unique identifier. -
cc_open_date
: Date when the card was issued. -
cc_close_date
: Date when the card was closed (if applicable). -
Optional: Credit limit, APR, fraud risk score.
-
-
-
Fraud Reports Table
-
Stores fraud labels for transactions.
-
Key attributes:
-
transaction_id
: Links to the transaction flagged as fraudulent. -
timestamp
: When the fraud report was filed. -
label
: 1 if fraudulent, 0 if legitimate.
-
-
Additional Tables (Optional)
-
Users Table: Links credit cards to customers.
-
User Stats Table: Stores aggregated stats like transaction count and total spend.
-
Merchants Table: Stores merchant details (e.g., category, location, risk rating).
Entity Relationship Diagram (ERD)
Predictive Query for Credit Card Fraud Detection
1. Transaction-Level Fraud Detection
Predict whether a transaction is fraudulent based on past fraud reports:
- At inference time, leave
LABEL
empty for new transactions to generate fraud risk scores.
2. Time-Based Fraud Prediction
Predict whether a fraud report will be linked to a transaction in the next 30 days:
3. Credit Card Risk Prediction
Predict whether a credit card will be associated with fraudulent transactions in the next 7 days:
Deployment Strategy
1. Batch Predictions for Fraud Analysts
-
Fraud teams review high-risk transactions flagged by the ML model.
-
Predictions are generated hourly or daily in batch mode.
-
Fraud analysts label new fraudulent transactions, improving the model over time.
2. Real-Time Fraud Detection
-
The system generates instant fraud risk scores when a transaction occurs.
-
High-risk transactions trigger manual review or two-factor authentication.
-
ML embeddings enhance rule-based fraud detection.
Building models in Kumo SDK
1. Initialize the Kumo SDK
2. Connect data
3. Select tables
4. Create graph schema
5. Train the model