What data is needed for usage forecasting?

Kumo connects directly to your existing relational tables: ACCOUNTS, USAGE_DAILY, FEATURES, BILLING, ALERTS. No ETL or feature engineering required. Write a PQL query and get explainable predictions in minutes.

6Regression · Usage Forecasting

Usage Forecasting

“What will this account's compute usage be next month?”

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

Loved by data scientists, ML engineers & CXOs at

A real-world example

What will this account's compute usage be next month?

Usage-based SaaS companies must forecast compute consumption for both infrastructure planning and revenue recognition. A cloud platform serving 2,000 accounts where usage estimates are off by 25% either over-provisions $8M in infrastructure or faces capacity crunches that degrade service for top accounts. Billing surprises from unpredictable usage also drive 18% of customer complaints. The usage trajectory depends on feature adoption, team growth, seasonal patterns, and the account's own business cycles.

Quick answer

Accurate SaaS usage forecasting requires connecting daily usage data, feature adoption events, billing history, and cross-account seasonal patterns in a relational model. The key signal per-account time-series models miss is the usage spike curve that follows feature adoption: accounts that enable batch processing show a 3x usage spike in weeks 2-4 before stabilizing. Graph ML improves forecasting accuracy from 75% to 92%.

Approaches compared

4 ways to solve this problem

1. Per-account time-series (ARIMA/Prophet)

Train a time-series model on each account's historical daily usage to forecast the next 30 days.

Best for

Captures account-specific seasonality and trends. Works well for stable, mature accounts with 12+ months of history.

Watch out for

Each account is modeled independently. When an account adopts a new feature that will spike usage 3x, the time-series model has no way to anticipate it. Also misses cross-account seasonality (Q4 e-commerce spikes).

2. Cohort-based usage projections

Group accounts by industry and plan tier, then project usage based on cohort growth curves.

Best for

Good for capacity planning at the portfolio level. Works when accounts within cohorts behave similarly.

Watch out for

Cohort averages hide individual variance. One account adopting batch processing will spike usage while others in the same cohort stay flat. Over-provisioning for the cohort average wastes infrastructure budget.

3. Committed-tier forecasting

For committed-usage customers, forecast based on their committed tier plus historical overage patterns.

Best for

Simple and works for committed-tier customers where usage is bounded by contract terms.

Watch out for

Does not work for pay-as-you-go accounts, which are the fastest growing and hardest to forecast. Also misses accounts that will exceed their committed tier due to feature adoption or team growth.

4. KumoRFM (relational graph ML)

Connect accounts, daily usage, feature events, billing history, and alert data into a temporal graph. The GNN learns feature-adoption usage curves, cross-account seasonality, and growth trajectory patterns.

Best for

Highest accuracy (92% vs 75% for time-series). Captures the 3x usage spike from new feature adoption and cross-account Q4 seasonality that per-account models miss.

Watch out for

Requires feature-level adoption tracking, not just aggregate usage. If you only log total compute hours without feature attribution, the adoption-curve signal is unavailable.

Key metric: Graph ML improves SaaS usage forecasting accuracy from 75% to 92%, saving $8M in infrastructure over-provisioning and eliminating billing surprises that drive 18% of churn complaints.

Why relational data changes the answer

Usage trajectories are shaped by events in other tables: feature adoption (an account enabling batch processing will spike compute 3x), team growth (5+ new users per month accelerates API calls), and cross-account patterns (all e-commerce accounts spike in Q4). A per-account time-series model with 'last_month_hours = 3,400' projects a smooth growth curve and misses that this account just adopted batch processing two days ago, which means a 3x spike is coming in weeks 2-4.

Relational models connect usage data to feature adoption events and cross-account seasonality patterns. They learn that accounts adopting batch processing follow a specific usage curve (3x spike, then stabilization at 1.5x baseline) and that similar-industry accounts share seasonal patterns. For a platform serving 2,000 accounts, improving accuracy from 75% to 92% saves $8M in infrastructure over-provisioning. It also eliminates billing surprises: when the model predicts a usage spike, the account team can proactively notify the customer rather than surprising them with a bill 25% higher than expected.

Forecasting account usage from historical trends alone is like forecasting household electricity from last year's bills. You miss that the family just bought an electric vehicle (feature adoption), that they are having a pool installed (team growth), and that it is going to be the hottest summer on record (cross-account seasonality). The connected context around the usage number matters more than the number itself.

How KumoRFM solves this

Graph-learned product intelligence across your entire account base

Kumo connects accounts, daily usage, feature events, billing history, and alert patterns into a temporal graph. It learns that accounts that recently adopted the batch-processing feature show a 3x usage spike in weeks 2-4 before stabilizing. The model captures seasonality (e-commerce accounts spike in Q4), growth trajectories (accounts adding 5+ users per month accelerate usage), and cross-account infrastructure patterns (when a shared cluster's accounts all grow simultaneously, capacity planning must account for contention).

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

Your data

The relational tables Kumo learns from

ACCOUNTS

account_id	plan	industry	active_users	contract_type
ACC401	Usage-based	E-commerce	85	Pay-as-you-go
ACC402	Usage-based	Finance	30	Committed
ACC403	Usage-based	Technology	120	Pay-as-you-go

USAGE_DAILY

usage_id	account_id	date	compute_hours	storage_gb	api_calls
UD01	ACC401	2025-03-01	450	2800	125,000
UD02	ACC401	2025-03-02	480	2850	132,000
UD03	ACC402	2025-03-01	120	500	28,000

FEATURES

feature_id	account_id	feature	enabled_date	usage_30d
FT01	ACC401	Batch processing	2025-02-15	High
FT02	ACC401	Real-time API	2024-12-01	High
FT03	ACC402	Batch processing	2025-03-01	New

BILLING

billing_id	account_id	month	amount	vs_estimate
BL401	ACC401	2025-02	$12,400	+18%
BL402	ACC401	2025-01	$10,200	+8%
BL403	ACC402	2025-02	$3,100	-5%

ALERTS

alert_id	account_id	type	timestamp	threshold_pct
AL01	ACC401	Usage spike	2025-02-20	150%
AL02	ACC403	Quota warning	2025-03-01	90%

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL

PREDICT SUM(USAGE_DAILY.COMPUTE_HOURS, 0, 30, days)
FOR EACH ACCOUNTS.ACCOUNT_ID

Prediction output

Every entity gets a score, updated continuously

ACCOUNT_ID	INDUSTRY	LAST_MONTH_HOURS	PREDICTED_NEXT_MONTH
ACC401	E-commerce	12,800	16,200
ACC402	Finance	3,400	5,800
ACC403	Technology	8,900	9,100

Understand why

Every prediction includes feature attributions — no black boxes

Account ACC402 -- Finance, 30 active users

Predicted: 5,800 compute hours predicted next month (+71%)

Top contributing features

New batch processing adoption

Enabled 2 days ago

32% attribution

Similar-account batch adoption curve

3x spike, weeks 2-4

24% attribution

User growth rate (30d)

+5 new users

18% attribution

Historical monthly growth

+12% MoM avg

14% attribution

Committed tier buffer

80% of committed used

12% attribution

Feature attributions are computed automatically for every prediction. No separate tooling required. Learn more about Kumo explainability

PQL Documentation

Learn the Predictive Query Language — SQL-like syntax for defining any prediction task in 2–3 lines.

Read docs

Python SDK

Integrate Kumo predictions into your pipelines. Train, evaluate, and deploy models programmatically.

Read docs

Explainability Docs

Understand feature attributions, model evaluation metrics, and how to build trust with stakeholders.

Read docs

Frequently asked questions

Common questions about usage forecasting

How do you forecast SaaS compute usage per account?

Connect daily usage data, feature adoption events, team growth metrics, and billing history into a relational model. The model learns account-specific growth trajectories, feature-adoption usage curves, and cross-account seasonal patterns. Graph ML forecasts 30-day usage with 92% accuracy vs 75% for per-account time-series models.

Why are per-account time-series models inaccurate?

Per-account models treat each account as an island. They cannot anticipate usage changes driven by feature adoption (batch processing causes a 3x spike), team growth (new users adding API calls), or cross-account seasonality (all e-commerce accounts spike in Q4). These signals live in related tables, not in the account's own usage history.

How does feature adoption affect usage forecasting?

Feature adoption is the single biggest source of forecasting error. An account enabling batch processing shows a 3x compute spike in weeks 2-4 before stabilizing at 1.5x baseline. Per-account models miss this entirely. Relational models learn the specific usage curve associated with each feature adoption and apply it proactively when an account enables a new feature.

How do you prevent billing surprises in usage-based SaaS?

Billing surprises drive 18% of churn-related complaints. Accurate usage forecasting lets account teams proactively notify customers when a usage spike is predicted. This turns a negative billing surprise into a positive customer conversation and an opportunity to discuss committed-tier upgrades that save the customer money.

What is the ROI of better usage forecasting in SaaS?

A platform serving 2,000 accounts that improves forecasting accuracy from 75% to 92% saves $8M in infrastructure over-provisioning annually. It also reduces churn from billing complaints by eliminating usage surprises. The combined impact of infrastructure savings and retained revenue makes usage forecasting one of the highest-ROI ML applications in SaaS.

Bottom line: A usage-based SaaS platform serving 2,000 accounts that improves usage forecasting accuracy from 75% to 92% saves $8M in infrastructure over-provisioning and eliminates billing surprises that drive 18% of churn-related complaints. Kumo captures feature-adoption usage curves and cross-account seasonality that time-series models on individual accounts cannot learn.

Related use cases

Explore more B2B SaaS use cases

Use Case #1Account ChurnLearn more

Use Case #2Expansion RevenueLearn more

Use Case #5Support EscalationLearn more

Previous#5 Support Escalation

Topics covered

SaaS usage forecastingcompute usage predictionusage-based billing AIinfrastructure capacity MLaccount usage modelgraph neural network forecastingKumoRFM usage forecastingconsumption-based pricing AIcloud cost prediction

From a leadership team with proven experience

Vanja Josifovski

CEO and Co-Founder, ex-CTO Airbnb, ex-CTO Pinterest

Jure Leskovec

Co-Founder & Chief Scientist, Stanford Professor

Hema Raghavan

Co-Founder & Head of Engineering, ex-AI Lead, LinkedIn

One Platform. One Model. Infinite Predictions.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Research Agent for 30%+ higher accuracy than traditional models.

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.

Book a Demo Try Free