Berlin Tech Meetup: The Future of Relational Foundation Models, Systems, and Real-World Applications

Register now:
Learn13 min read

AI in Financial Services: Fraud, Risk, and the Relational Data Advantage

Banks sit on the richest relational data on earth. Customers, accounts, transactions, counterparties, merchants. Every dollar flows through a graph. The institutions that learn from that graph structure will win.

TL;DR

  • 1Global fraud losses reached $485 billion in 2023. Current flat models have 95-98% false positive rates, costing large banks $50M-200M annually in unnecessary investigations.
  • 2Graph-based fraud detection reduces false positives by 40-60% while catching organized fraud rings that flat models structurally cannot see. Every 10% reduction saves $50M-100M annually.
  • 3Credit risk models that incorporate network signals (counterparty risk, employer stability, transaction graph similarity) improve pricing for thin-file borrowers and reveal correlated default risks.
  • 4AML compliance costs $274 billion globally. Graph-based AML detects layering, structuring, and hub-and-spoke laundering patterns through network topology, outperforming decades-old rule-based systems.
  • 5A foundation model serves fraud, credit risk, AML, churn, and cross-sell from one platform. One data connection, one governance framework, dozens of prediction tasks in days, not months.

Global financial fraud losses reached $485 billion in 2023. That number comes from Nasdaq's Global Financial Crime Report, and it includes payment fraud, identity fraud, money laundering, and account takeover. In the US alone, banks reported $10.3 billion in direct fraud losses in 2024, according to FinCEN.

Every major bank runs fraud detection models. Every one of them still bleeds billions. The models work on individual transactions. They check if this purchase, from this card, at this merchant, looks anomalous compared to this cardholder's history. And they catch a lot. Visa's AI systems block an estimated $25 billion in fraud annually.

But the fraud they miss is the fraud that matters most. Organized fraud rings. Synthetic identity schemes. Money mule networks. These attacks exploit relationships between entities that single-transaction models cannot see.

Why banking data is different

A typical bank's core data spans 30-50 interconnected tables. Customers. Accounts (checking, savings, credit, investment). Cards. Transactions. Merchants. Wire transfers. Counterparties. Loans. Branches. Call center interactions. Online sessions. Device fingerprints. IP addresses.

Every table is connected to multiple others through foreign keys. A customer has accounts. Accounts generate transactions. Transactions flow to merchants. Merchants are categorized. Other customers transact at the same merchants. Those customers have their own accounts, their own transaction patterns, their own risk profiles.

This is a graph. A dense, temporal, heterogeneous graph where nodes are entities of different types and edges are relationships with timestamps and dollar amounts attached. Traditional ML flattens this structure into a single row per entity, losing the network information that carries the strongest predictive signals.

transactions — sample banking data

txn_idaccount_idmerchantamounttimestampdevice_id
T-4001A-110Gas Station #412$42.802025-03-01 08:14D-7701
T-4002A-112Wire Transfer Out$9,8002025-03-01 09:30D-7701
T-4003A-115Wire Transfer Out$9,7502025-03-01 09:45D-7703
T-4004A-118Wire Transfer Out$9,9002025-03-01 10:02D-7701
T-4005A-110Online Purchase$189.002025-03-01 14:22D-7701

Highlighted: three wire transfers just under $10,000 within 32 minutes. Individually unremarkable. In the graph, accounts A-110, A-112, and A-118 share device D-7701, and all three target the same offshore entity.

fraud_detection_comparison — flat vs graph model

MetricFlat Model (LightGBM)Graph Model (KumoRFM)Impact
True Positive Rate72%89%+24% more fraud caught
False Positive Rate97%42%55% fewer false alerts
Investigation Cost/Month$48.5M$21.4M$27.1M saved
Ring Detection Rate< 5%68%Rings visible in graph
Time to Ring Detection12-18 months (manual)2-5 days90%+ faster

Graph models reduce false positives by 55% while catching 24% more fraud. The biggest impact: detecting organized fraud rings that flat models structurally cannot see.

Fraud detection: from transactions to networks

Consider a fraud ring operating across a bank's customer base. Five accounts, opened at different branches over six months, with different names and identification documents. Individually, each account looks normal. Standard demographics, typical transaction patterns, unremarkable balances.

But in the graph, these accounts share a shipping address. Two share a device fingerprint. Three have received wire transfers from the same offshore entity. Their transaction patterns, when viewed as a network, form a distinctive hub-and-spoke topology that matches known laundering patterns.

A flat model scoring each transaction independently sees five normal accounts. A graph-based model sees a fraud ring.

The false positive problem

Current rule-based and flat ML fraud systems generate enormous numbers of false positives. The average bank's fraud detection system has a false positive rate of 95-98%, meaning 95-98 out of every 100 flagged transactions are legitimate. Each false positive requires human review, costing $15-50 per investigation.

For a bank processing 100 million transactions per month with a 2% flag rate, that is 2 million flagged transactions. At 97% false positive rate, 1.94 million of those are legitimate. At $25 per investigation, the bank spends $48.5 million per month reviewing transactions that should never have been flagged.

Graph-based models reduce false positives by 40-60% because they can distinguish between genuinely suspicious network patterns and normal coincidences. A customer transacting at an unusual merchant might look suspicious in isolation. But if the graph shows that the customer recently moved to a new city (address change, branch visit, utility payment) and the merchant is near their new address, the context explains the behavior.

Credit risk: beyond the credit score

Credit scoring is the original ML use case in banking. FICO scores, logistic regression, gradient-boosted trees. The models are mature and well-validated. But they operate on flat features: payment history, credit utilization, length of credit history, credit mix, new credit inquiries.

borrowers — two applicants with identical flat features

borrower_idFICOcredit_utilpayment_historyaccount_age_yrs
B-60168042%98% on-time6
B-60268041%97% on-time5

Nearly identical credit profiles. A flat model scores them within 2 points of each other.

transaction_network — what the graph reveals

borrower_idemployer_payrollemployer_trendpeer_default_ratecounterparty_risk
B-601TechCorp ($12K/mo)Stable (3 years)4% (below average)Low (utility + grocery)
B-602StartupXYZ ($11K/mo)Declining (-22% in 6 months)18% (3x average)High (cash advances + pawn)

B-602's employer payroll is declining 22% over 6 months. Borrowers with similar transaction graphs (same merchant categories, same employer trajectory) default at 18%, 3x the average. The flat credit score sees none of this.

Relational models add this dimension: the borrower's financial network. Who do they transact with? What is the risk profile of their counterparties? Are their income sources connected to stable or volatile entities? Has the default rate among borrowers with similar transaction patterns increased recently?

These network signals are particularly powerful for thin-file borrowers (those with limited credit history). A borrower with a FICO score of 650 might have a very different risk profile depending on whether their transaction graph resembles that of borrowers who defaulted or borrowers who stayed current. For the 800 million people globally who are "credit invisible," network signals are often the only signals available.

Portfolio-level risk

Beyond individual scoring, graph-based models reveal concentration risks that traditional portfolio analytics miss. If 500 borrowers in your portfolio are all connected through a single employer, and that employer's transaction volume has declined 40% over three months, you have a correlated default risk that no individual credit score reflects.

This is precisely the kind of systemic risk that caused the 2008 financial crisis: correlated defaults driven by shared network exposures that were invisible to models scoring borrowers independently.

Anti-money laundering: the compliance burden

Banks spend an estimated $274 billion annually on financial crime compliance globally, according to LexisNexis. AML compliance alone accounts for $35-40 billion in the US. Most of this spending goes to armies of compliance analysts manually reviewing suspicious activity reports (SARs).

Current AML systems rely on rules: flag transactions over $10,000, flag structured transactions just below $10,000, flag transfers to high-risk jurisdictions. These rules are decades old and widely known to criminals. They generate massive false positive volumes (estimated at 95-99%) while missing sophisticated laundering schemes that operate within the rules.

wire_transfers — layering pattern

transfer_idfrom_accountto_accountamounttimestampjurisdiction
W-201Corp-A (US)Shell-1 (Cayman)$9,900Mar 1, 09:00Cayman Islands
W-202Shell-1 (Cayman)Shell-2 (BVI)$9,700Mar 1, 14:30British Virgin Islands
W-203Shell-2 (BVI)Shell-3 (Panama)$9,500Mar 2, 10:15Panama
W-204Shell-3 (Panama)Corp-B (US)$9,200Mar 3, 11:00US
W-205Corp-B (US)Real Estate LLC$9,100Mar 3, 15:30US

Classic layering: $9,900 moves through 4 shell entities in 3 days, losing $700 in fees, before reaching a real estate purchase. Every individual transfer is under $10,000 and passes standard AML rules.

aml_rule_based_vs_graph — detection comparison

TransferRule-Based FlagGraph-Based FlagReason
W-201No (under $10K)Yes (0.91)Start of 4-hop layering chain
W-202No (under $10K)Yes (0.88)Shell-to-shell, rapid succession
W-203No (under $10K)Yes (0.92)3rd jurisdiction in 2 days
W-204No (under $10K)Yes (0.89)Circular: same beneficial owner
W-205No (under $10K)Yes (0.94)Terminal node: real estate purchase

Every transfer passes the $10,000 rule. The graph model detects the chain topology: 5 accounts, 4 jurisdictions, decreasing amounts, 3-day velocity. Classic layering pattern.

Graph-based AML detects laundering by analyzing the flow of money through the network. Layering (moving money through multiple accounts to obscure its origin) creates distinctive graph patterns: rapid sequential transfers, fan-out-fan-in topologies, and circular flows. These patterns are invisible when you examine transactions individually but obvious when you view the network.

Traditional financial AI

  • Scores individual transactions or customers
  • Flat feature tables with manual aggregations
  • 95-98% false positive rate in fraud detection
  • Rule-based AML with known circumventions
  • Misses organized fraud rings and network patterns

Graph-based financial AI

  • Analyzes the full entity-transaction-counterparty network
  • Learns directly from relational graph structure
  • 40-60% reduction in false positives
  • Detects layering, structuring, and ring patterns
  • Captures multi-hop signals invisible to flat models

Customer retention and cross-sell

Fraud and risk get the headlines, but customer retention may be the highest-ROI application of relational AI in banking. Acquiring a new banking customer costs 5-7x more than retaining an existing one. For large banks, a 1% improvement in retention translates to $100M-500M in preserved annual revenue.

Traditional churn models use account-level features: balance trends, transaction frequency, product holdings, support interactions. Graph-based models add the relational dimension: has the customer's transaction pattern shifted to a competitor bank? Have customers with similar financial profiles recently churned? Is the customer's employer showing signs of downsizing (reduced payroll deposits across connected accounts)?

Cross-sell models benefit similarly. A customer who just received a large wire transfer, opened a brokerage account, and has transaction patterns similar to customers who purchased investment advisory services is a strong cross-sell candidate. That signal spans four tables and three relationship types. No flat model captures it.

PQL Query

PREDICT fraud_probability
FOR EACH transactions.txn_id
WHERE transactions.timestamp > '2025-03-01'

One query scores every transaction against the full account-merchant-device-counterparty graph. The model detects ring patterns, structuring, and shared-device clusters in a single pass.

Output

txn_idfraud_probrisk_typenetwork_signal
T-40010.04LowNormal merchant pattern
T-40020.92RingShared device + structured amount + offshore target
T-40030.88RingSame target entity + timing cluster
T-40040.94Ring3-account hub via device D-7701
T-40050.11LowConsistent with account history

The foundation model shift

Building custom graph models for each banking use case (fraud, credit risk, AML, churn, cross-sell) requires separate data pipelines, separate feature engineering, and separate model training. At a large bank, this means 5-10 separate ML teams working on 5-10 separate pipelines, each costing $1M-3M per year to maintain.

A relational foundation model changes this equation. KumoRFM connects directly to the bank's data warehouse, understands the full relational schema, and serves any prediction task without task-specific feature engineering or model training.

On the RelBench benchmark, which includes financial transaction data, KumoRFM zero-shot outperforms supervised GNNs that were specifically trained on each task. The same model that detects fraud also predicts churn, scores credit risk, and identifies cross-sell opportunities.

For a bank, this means: one platform, one data connection, one governance framework, and dozens of prediction tasks served from the same model. The time from business question to production prediction drops from 6-12 months to days. The cost per use case drops from millions to a fraction of a single data scientist's salary.

The banks that adopt this approach first will not just save money on ML infrastructure. They will be able to ask questions about their data that were previously too expensive to answer. And in an industry where every basis point of risk pricing and every percentage point of fraud detection translates directly to the bottom line, that speed advantage compounds.

Frequently asked questions

How is AI used in financial services?

AI is used across fraud detection ($485B annual problem), credit risk scoring, anti-money laundering (AML), customer churn prediction, personalized product recommendations, claims processing, algorithmic trading, and regulatory compliance. The highest-ROI applications involve predictive tasks on relational data: scoring transactions, customers, accounts, and counterparties based on patterns across interconnected tables.

Why is graph-based AI better for fraud detection than traditional ML?

Traditional fraud models score individual transactions using features from a single table. Graph-based AI analyzes the full network: accounts, transactions, merchants, counterparties, devices, and IP addresses. This reveals fraud rings (multiple accounts connected through shared identifiers), money mule chains (layered transfers that appear normal individually but form suspicious patterns in aggregate), and identity fraud (synthetic identities linked through shared attributes). These patterns are invisible to flat models.

What is the relational data advantage in banking?

Banks store data across dozens of interconnected tables: customers, accounts, transactions, loans, credit cards, branches, merchants, counterparties, and more. Every dollar flows through a graph. Models that learn from this graph structure capture signals that flat feature tables miss: multi-hop relationships between entities, temporal patterns across transaction sequences, and network effects where one customer's behavior predicts another's.

How much does financial fraud cost annually?

Global fraud losses reached $485 billion in 2023 according to Nasdaq's Global Financial Crime Report. In the US alone, banks reported $10.3 billion in direct fraud losses in 2024. For every dollar of fraud, financial institutions incur $4.41 in total costs when accounting for investigation, remediation, regulatory fines, and customer attrition. False positives add another layer: the average bank spends $50M-200M annually reviewing legitimate transactions flagged incorrectly.

Can AI replace human fraud analysts?

AI does not replace fraud analysts. It makes them dramatically more effective. A graph-based AI system can reduce false positive rates by 40-60%, meaning analysts spend their time investigating actual fraud rather than clearing legitimate transactions. At a bank processing 100 million transactions monthly, reducing false positives by 50% saves 25,000-50,000 analyst hours per year. The AI handles pattern detection at scale; humans handle complex investigations and judgment calls.

See it in action

KumoRFM delivers predictions on relational data in seconds. No feature engineering, no ML pipelines. Try it free.