Executive AI Dinner hosted by Kumo - Austin, April 8

Register here
6Regression · Customer Value

Customer Lifetime Value

What is each customer's 3-year lifetime value?

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

A real-world example

What is each customer's 3-year lifetime value?

Banks allocate relationship-manager time, fee waivers, and retention budgets uniformly across segments, wasting resources on low-value accounts while under-serving high-potential customers. A regional bank with 2M retail customers found that its top 8% of households generated 42% of total revenue, yet these customers received the same service level as accounts generating $200/year. Without accurate forward-looking LTV, banks cannot differentiate treatment, leading to $20-40M in misallocated service costs and lost high-value relationships annually.

Quick answer

The most accurate CLV models for banking connect customer profiles, product holdings, transaction histories, branch interactions, and digital engagement into a relational graph. This approach produces LTV estimates 30-40% more accurate than segment-based averages because it captures forward-looking signals like investment account growth, product consolidation patterns, and engagement trajectory rather than static snapshots.

Approaches compared

4 ways to solve this problem

1. Segment-based averages

Assign each customer a segment (Mass Market, Mass Affluent, Premier) and use the segment's historical average revenue as the LTV estimate.

Best for

Simple and fast. No model required. Works as a starting point for resource allocation decisions.

Watch out for

Treats all Premier customers the same. A Premier customer growing their investment account by $15K/quarter has very different forward value than one whose balances are flat. The average hides the variance.

2. RFM scoring (Recency, Frequency, Monetary)

Score customers on recency of last transaction, frequency of interactions, and monetary value of their accounts. Combine into a composite LTV score.

Best for

Better than segment averages. Captures basic behavioral differences within segments. Well-understood by marketing teams.

Watch out for

Still a static snapshot. Does not capture trajectories: a customer whose frequency is declining rapidly looks the same as one whose frequency just dipped once and recovered.

3. XGBoost on engineered features

Build features from product count, balance trends, transaction frequency, tenure, and channel usage. Train a regression model to predict 3-year revenue.

Best for

Meaningful accuracy improvement over RFM. Handles nonlinear interactions between features.

Watch out for

Manual feature engineering collapses rich temporal data into static aggregates. 'Average quarterly balance growth' loses the information about whether growth is accelerating, decelerating, or spiking from a single event.

4. KumoRFM (relational graph ML)

Connect customer profiles, product holdings, revenue history, engagement events, and transaction data. Write a PQL query to predict 3-year revenue sum. The GNN learns forward-looking value signals from the full relational graph.

Best for

Captures the full trajectory: investment account growing at $15K/quarter + increasing branch advisor visits + mortgage consolidation signals = high and rising LTV. These cross-table trajectories are invisible to flat models.

Watch out for

Requires revenue history data at the customer-quarter level. If revenue attribution is messy (common in shared household accounts), data cleaning is the bottleneck.

Key metric: Relational ML produces CLV estimates 30-40% more accurate than segment-based averages, enabling precise resource allocation across customer tiers.

Why relational data changes the answer

Customer value is not a number in a single table. It is the sum of behavior across products (checking, savings, investment, mortgage), channels (branch, app, web), and time (growing, stable, declining). A flat feature table collapses all of this into static aggregates like 'total balance' and 'product count,' destroying the trajectories that differentiate a $47K customer from a $2K customer.

Relational models read the full customer-product-engagement graph and learn sequences like 'investment account growing at $15K/quarter, branch advisor meetings twice per month, mortgage rate inquiry last week, and 12-year tenure with no product closures.' On RelBench, relational approaches score 76.71 vs 62.44 for single-table baselines on prediction tasks. For LTV, that accuracy gap means the difference between allocating a relationship manager to a $47K customer vs wasting that manager's time on a $2K account.

Estimating customer value from segment averages is like valuing a house by its neighborhood average. Two houses on the same street can differ by $500K based on renovation history, lot size, and recent upgrades. The appraisal (relational model) inspects the house itself. The Zillow estimate (segment average) just looks at the zip code.

How KumoRFM solves this

Relational intelligence built for banking and financial data

Kumo connects customer profiles, product holdings, transaction histories, branch interactions, digital engagement, and life-event signals into a relational graph. The model predicts that Customer C-10078 will generate $47,200 over the next 3 years because her investment account is growing, she is adding direct-deposit payroll, and her branch-visit frequency suggests she will consolidate a competitor mortgage. These cross-table signals produce LTV estimates 30-40% more accurate than segment-based averages.

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

1

Your data

The relational tables Kumo learns from

CUSTOMERS

customer_idsegmenttenure_yearsproducts_heldrelationship_mgr
C-10078Premier12.16RM-042
C-10042Mass Affluent4.23RM-018
C-10115Mass Market1.81None

REVENUE_HISTORY

customer_idquarterfee_revenueinterest_revenuetotal
C-100782025-Q2$1,240$2,800$4,040
C-100422025-Q2$320$890$1,210
C-101152025-Q2$45$120$165

PRODUCT_HOLDINGS

customer_idproductbalancemonthly_activity
C-10078Investment Account$340,00012 trades
C-10078Mortgage$280,0001 payment
C-10042Checking$12,30045 txns

ENGAGEMENT_EVENTS

customer_idchannelevent_typefrequency_30d
C-10078Branchadvisor_meeting2
C-10078Mobileinvestment_review8
C-10042Mobilebalance_check22
2

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL
PREDICT SUM(REVENUE_HISTORY.TOTAL, 0, 36, months)
FOR EACH CUSTOMERS.CUSTOMER_ID
3

Prediction output

Every entity gets a score, updated continuously

CUSTOMER_IDSEGMENTPREDICTED_3YR_LTVCURRENT_ANNUALLTV_TIER
C-10078Premier$47,200$16,160Platinum
C-10042Mass Affluent$18,400$4,840Gold
C-10115Mass Market$2,100$660Standard
4

Understand why

Every prediction includes feature attributions — no black boxes

Customer C-10078 (Maria Gonzalez)

Predicted: $47,200 predicted 3-year lifetime value

Top contributing features

Investment account growth trajectory

+$15K/qtr

28% attribution

Product depth (6 products held)

Top 5%

23% attribution

Branch advisor meeting frequency

2x/month

19% attribution

Mortgage consolidation signals

Rate inquiry

17% attribution

Tenure and relationship stability

12.1 years

13% attribution

Feature attributions are computed automatically for every prediction. No separate tooling required. Learn more about Kumo explainability

Frequently asked questions

Common questions about customer lifetime value

How do you calculate customer lifetime value in banking?

The most accurate approach uses relational ML to predict forward-looking revenue by connecting customer profiles, product holdings, transaction history, and engagement data. This captures growth trajectories (investment account growing, product consolidation signals) rather than just current balances. The result is a 3-year revenue prediction per customer, 30-40% more accurate than segment-based averages.

What is the difference between CLV and current revenue?

Current annual revenue is a backward-looking measure. CLV predicts forward value, which can be very different. A Mass Affluent customer currently generating $4,800/year but growing savings rapidly, visiting branches for advisor meetings, and showing mortgage-readiness signals may be worth $18,400 over 3 years. A Premier customer generating $16K/year but showing disengagement signals may be worth less than projected.

How do banks use customer lifetime value predictions?

Three primary uses: (1) allocate relationship-manager time to the highest-value and highest-growth customers, (2) size retention offers proportionally to the value at risk when a high-LTV customer shows churn signals, and (3) prioritize acquisition channels that attract high-LTV customers rather than volume.

What data do you need for a banking CLV model?

Customer profiles, product holdings with balances, revenue history (fee + interest income per quarter), transaction data, and engagement events (branch visits, app logins, advisor meetings). The more tables you connect with clear foreign keys, the more forward-looking signals the model can learn.

How accurate are ML-based CLV predictions?

Relational ML models produce CLV estimates 30-40% more accurate than segment-based averages, measured by mean absolute error on held-out data. On RelBench benchmarks, relational models score 76.71 vs 62.44 for single-table approaches. The accuracy matters because it drives resource allocation: every dollar of RM time spent on a low-value customer is a dollar not spent on a high-value one.

Bottom line: Allocate relationship-manager time and retention budgets to the top 8% of customers who drive 42% of revenue, recovering $20-40M in annual misallocated service costs.

Topics covered

customer lifetime value bankingCLV prediction financial servicescustomer value AIbanking customer analyticsgraph neural network CLVKumoRFMrelationship value predictionrelational deep learning bankingcustomer profitability modelLTV scoring bank

One Platform. One Model. Infinite Predictions.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Research Agent for 30%+ higher accuracy than traditional models.

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.