What data is needed for patient deterioration prediction?

Kumo connects directly to your existing relational tables: PATIENTS, VITALS, LABS, MEDICATIONS, NURSING_NOTES. No ETL or feature engineering required. Write a Predictive Query Language (PQL) query and get explainable predictions in minutes.

#6 · Binary Classification · Early Warning

Patient Deterioration Prediction

“Which inpatients will deteriorate in the next 12 hours?”

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.


A real-world example

Which inpatients will deteriorate in the next 12 hours?

Failure to rescue costs U.S. hospitals $1.1B annually. Traditional early warning scores (NEWS, MEWS) generate false alarm rates above 90%, causing alert fatigue. A 600-bed hospital averages 15 unexpected ICU transfers per month; each costs $31,000 more than a planned transfer. Catching deterioration 6 hours earlier would save $5.6M per year and 40+ lives.
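As a rough check on the $5.6M figure, the sketch below multiplies the transfer numbers quoted above. It assumes the saving corresponds to the excess cost of every unexpected ICU transfer in a year; the page does not state the derivation, so treat that as an assumption.

```python
# Figures quoted in the paragraph above
unexpected_transfers_per_month = 15
excess_cost_per_transfer = 31_000  # USD more than a planned transfer

# Assumption: the annual saving equals the excess cost of all unexpected transfers
annual_transfers = unexpected_transfers_per_month * 12
annual_excess_cost = annual_transfers * excess_cost_per_transfer

print(annual_transfers)    # 180
print(annual_excess_cost)  # 5580000, i.e. roughly $5.6M
```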

Quick answer

AI predicts inpatient deterioration by fusing vital sign trajectories, lab trends, medication administrations, and nursing assessment patterns into a real-time relational graph. Traditional early warning scores (NEWS, MEWS) use single-point-in-time vital thresholds and generate false alarm rates above 90%, causing alert fatigue. Graph-based models detect deterioration 6 hours earlier with 80% fewer false alarms by learning temporal patterns across multiple data streams and cross-patient signals from the same unit.

Approaches compared

4 ways to solve this problem

1. NEWS/MEWS (National and Modified Early Warning Scores)

Aggregate scores built from vital-sign thresholds: heart rate, blood pressure, respiratory rate, temperature, SpO2, and consciousness level. The score is recalculated at each nursing assessment, and escalation is triggered when it crosses a set threshold.

Best for

Universal screening where every patient needs a baseline risk assessment. Simple to calculate bedside. Supported by extensive clinical validation.

Watch out for

Single-point-in-time scoring misses trajectories. A heart rate of 102 scores the same whether it was 72 four hours ago (concerning trend) or 108 four hours ago (improving trend). False alarm rates above 90% cause alert fatigue, and nurses learn to ignore the warnings.
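The heart-rate example can be made concrete with a small sketch. The point bands below follow the NEWS2 pulse bands and are included for illustration only; `heart_rate_points` and `trend_per_hour` are hypothetical helpers, not part of any scoring library.

```python
def heart_rate_points(hr: int) -> int:
    """Points for a single heart-rate reading, using NEWS2-style bands (illustrative)."""
    if hr <= 40 or hr >= 131:
        return 3
    if 111 <= hr <= 130:
        return 2
    if 41 <= hr <= 50 or 91 <= hr <= 110:
        return 1
    return 0  # 51-90 bpm


def trend_per_hour(earlier: int, now: int, hours: float) -> float:
    """Average change in bpm per hour between two readings."""
    return (now - earlier) / hours


# The two patients from the text: same current reading, opposite histories
rising = {"four_hours_ago": 72, "now": 102}
falling = {"four_hours_ago": 108, "now": 102}

for name, p in (("rising", rising), ("falling", falling)):
    print(name, heart_rate_points(p["now"]), trend_per_hour(p["four_hours_ago"], p["now"], 4))
# rising 1 7.5
# falling 1 -1.5
```

Both patients receive the same point-in-time score, while the per-hour trend separates them.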

2. EHR-Embedded Sepsis Alerts

Vendor-provided sepsis screening tools (Epic Sepsis Model, Cerner St. John Sepsis Agent) that monitor for SIRS criteria, qSOFA, and sepsis-related lab patterns. Integrated into clinical workflow with automated alerts.

Best for

Sepsis-specific detection where early antibiotics save lives. Embedded in the nursing workflow without additional tools.

Watch out for

Sepsis-specific: misses cardiac, respiratory, neurological, and post-surgical deterioration entirely. High false-positive rates (90%+ in published studies of the Epic Sepsis Model) reduce clinical trust.

3. Single-Patient Time-Series Models

Deep learning models (LSTMs, transformers) trained on individual patient vital-sign and lab-result time series to predict deterioration within 4-12 hours. Captures temporal trends within a single patient's data.

Best for

Patients with rich temporal data streams (ICU step-downs, telemetry patients) where the trajectory within the patient's own data is the primary signal.

Watch out for

Operates on one patient at a time. Cannot detect unit-level signals: when multiple patients on the same floor show subtle vital changes simultaneously, it may indicate a staffing problem, equipment issue, or medication error that affects the whole unit.

4. Graph Neural Networks (Kumo's Approach)

Fuses vitals trajectories, lab trends, medication administrations, nursing assessments, and unit-level context into a temporal relational graph. Detects both individual patient deterioration patterns and cross-patient signals on the same unit.

Best for

Catching deterioration 6-8 hours before traditional scores trigger, with 80% fewer false alarms. Detects unit-level risk factors (understaffing, concurrent deterioration) that affect individual patient outcomes.

Watch out for

Requires real-time data feeds from vitals monitors, labs, and medication administration records. Latency in data availability directly impacts prediction lead time.

Key metric: NEWS/MEWS generate 90%+ false alarms. Graph-based models achieve 80% fewer false alarms while detecting deterioration 6 hours earlier, saving $5.6M annually and 40+ lives at a 600-bed hospital.

Why relational data changes the answer

Flat deterioration models analyze each patient's vitals independently: is heart rate above threshold? Is SpO2 below threshold? They catch the patient whose vitals have already crossed into dangerous territory. But they miss the patient whose mean arterial pressure (MAP) has been declining slowly for 4 hours, who just received a new vasopressor (indicating the care team is already concerned), whose lactate came back elevated, and whose nursing note mentions 'increased confusion.' These signals span four different data streams (vitals, medications, labs, notes), and their temporal co-occurrence is what predicts deterioration 6-8 hours before any single vital sign crosses a threshold. A flat model would need each of these as a pre-engineered feature, and it would still miss the timing relationships between them.

Relational learning connects these data streams temporally. The model tracks the trajectory of vitals over sliding windows, correlates them with medication changes (a new vasopressor following a blood pressure drop is a strong signal), lab trends (rising lactate concurrent with vital changes confirms the pattern), and nursing sentiment (concern documented before vitals cross thresholds). It also detects cross-patient signals: when 3 patients on the same Med-Surg unit show subtle vital changes within the same 2-hour window, the model weighs unit-level risk factors (staffing ratio, recent shift change) that may explain the concurrent deterioration. This multi-stream, multi-patient approach produces a C-statistic above 0.90 with a false alarm rate below 15%, compared to NEWS/MEWS at 0.70-0.75 with false alarms above 90%.
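The cross-patient signal described above (3 patients on one unit changing within the same 2-hour window) can be sketched as a simple windowed count. This is a hand-written illustration, not how the model computes it; the event list, patient IDs, and the `concurrent_units` helper are all hypothetical.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Hypothetical events: (patient_id, unit, time a subtle vital change was noticed)
changes = [
    ("P1", "Med-Surg 4A", datetime(2025, 3, 3, 8, 10)),
    ("P2", "Med-Surg 4A", datetime(2025, 3, 3, 9, 0)),
    ("P3", "Med-Surg 4A", datetime(2025, 3, 3, 9, 45)),
    ("P4", "Telemetry 3B", datetime(2025, 3, 3, 9, 15)),
]


def concurrent_units(events, window=timedelta(hours=2), min_patients=3):
    """Return units where at least `min_patients` distinct patients changed within `window`."""
    by_unit = defaultdict(list)
    for patient, unit, ts in events:
        by_unit[unit].append((ts, patient))

    flagged = set()
    for unit, rows in by_unit.items():
        rows.sort()
        # Slide a window that starts at each event and count distinct patients inside it
        for start_ts, _ in rows:
            in_window = {p for ts, p in rows if start_ts <= ts <= start_ts + window}
            if len(in_window) >= min_patients:
                flagged.add(unit)
                break
    return flagged


print(concurrent_units(changes))  # {'Med-Surg 4A'}
```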

Detecting patient deterioration from single-point vitals is like monitoring a building for structural failure by checking one sensor at a time. The temperature sensor says 'normal,' the vibration sensor says 'normal,' the moisture sensor says 'normal.' But if you connect them, you see that temperature is rising slowly, vibration shifted 2 hours ago, and moisture appeared at the foundation yesterday. The building is not failing yet, but the pattern across sensors says it will. Relational deterioration models are the connected building-monitoring system for hospitalized patients.

How KumoRFM solves this

Graph-learned clinical intelligence across your entire patient network

Kumo builds a temporal graph from vitals streams, lab results, medication administrations, and nursing assessments. It learns that a specific pattern of trending MAP decline combined with a new vasopressor order and a nursing note sentiment shift precedes deterioration 8 hours before traditional scores trigger. The model weighs cross-patient signals: when multiple patients on the same unit show subtle vital changes simultaneously, it detects environmental or staffing-related risk factors.

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

Your data

The relational tables Kumo learns from

PATIENTS

patient_id	age	unit	admit_date
P6001	71	Med-Surg 4A	2025-03-01
P6002	55	Med-Surg 4A	2025-03-02
P6003	83	Telemetry 3B	2025-02-28

VITALS

vitals_id	patient_id	timestamp	hr	bp_sys	spo2	temp
VT01	P6001	2025-03-03 06:00	88	132	96	37.1
VT02	P6001	2025-03-03 10:00	102	108	93	38.2
VT03	P6002	2025-03-03 08:00	72	128	98	36.8

LABS

lab_id	patient_id	test_name	value	timestamp
LB01	P6001	Lactate	3.8	2025-03-03 07:00
LB02	P6001	WBC	16.4	2025-03-03 07:00
LB03	P6002	Lactate	1.1	2025-03-03 06:00

MEDICATIONS

med_id	patient_id	drug	route	admin_time
MD01	P6001	Norepinephrine	IV	2025-03-03 09:30
MD02	P6002	Metoprolol	PO	2025-03-03 08:00
MD03	P6003	Heparin drip	IV	2025-03-03 06:00

NURSING_NOTES

note_id	patient_id	timestamp	assessment
NN01	P6001	2025-03-03 08:00	Pt appears more confused, family concerned
NN02	P6002	2025-03-03 08:30	Resting comfortably, no complaints
NN03	P6003	2025-03-03 07:00	Steady, ambulating in hallway
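To show how these tables relate, the sketch below merges the sample rows into one time-ordered event list per patient, keyed on `patient_id`. It is only an illustration of the join the tables imply; the tuple layout and the `timeline` structure are assumptions, not Kumo's internal representation.

```python
from collections import defaultdict

# Sample rows copied from the tables above: (patient_id, timestamp, detail)
vitals = [
    ("P6001", "2025-03-03 06:00", "hr=88 bp_sys=132 spo2=96 temp=37.1"),
    ("P6001", "2025-03-03 10:00", "hr=102 bp_sys=108 spo2=93 temp=38.2"),
    ("P6002", "2025-03-03 08:00", "hr=72 bp_sys=128 spo2=98 temp=36.8"),
]
labs = [
    ("P6001", "2025-03-03 07:00", "Lactate=3.8"),
    ("P6001", "2025-03-03 07:00", "WBC=16.4"),
    ("P6002", "2025-03-03 06:00", "Lactate=1.1"),
]
medications = [
    ("P6001", "2025-03-03 09:30", "Norepinephrine IV"),
    ("P6002", "2025-03-03 08:00", "Metoprolol PO"),
    ("P6003", "2025-03-03 06:00", "Heparin drip IV"),
]
nursing_notes = [
    ("P6001", "2025-03-03 08:00", "Pt appears more confused, family concerned"),
    ("P6002", "2025-03-03 08:30", "Resting comfortably, no complaints"),
    ("P6003", "2025-03-03 07:00", "Steady, ambulating in hallway"),
]

# Group every event under its patient, tagged with the table it came from
timeline = defaultdict(list)
sources = (("vitals", vitals), ("labs", labs), ("meds", medications), ("notes", nursing_notes))
for source, rows in sources:
    for patient_id, timestamp, detail in rows:
        timeline[patient_id].append((timestamp, source, detail))

# "YYYY-MM-DD HH:MM" strings sort chronologically
for events in timeline.values():
    events.sort()

for timestamp, source, detail in timeline["P6001"]:
    print(timestamp, source, detail)
```

For P6001 the merged timeline reads in order: baseline vitals, elevated lactate and WBC, a nursing note about confusion, a new vasopressor, then worsening vitals.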

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL

PREDICT BOOL(VITALS.RAPID_RESPONSE, 0, 12, hours)
FOR EACH PATIENTS.PATIENT_ID
WHERE PATIENTS.UNIT != 'ICU'
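One way to read this query is as a boolean label per non-ICU patient: true when a rapid-response event falls within the 12 hours after the prediction time. The sketch below builds that label by hand to make the target concrete. The event times and the `label` helper are hypothetical, and this is not a description of how Kumo evaluates PQL.

```python
from datetime import datetime, timedelta

# Hypothetical rapid-response events: patient_id -> event times
rapid_response_events = {
    "P6001": [datetime(2025, 3, 3, 16, 40)],
    "P6003": [datetime(2025, 3, 4, 2, 0)],
}
units = {"P6001": "Med-Surg 4A", "P6002": "Med-Surg 4A", "P6003": "Telemetry 3B"}


def label(patient_id, anchor, horizon=timedelta(hours=12)):
    """True if the patient has an event in (anchor, anchor + horizon]."""
    events = rapid_response_events.get(patient_id, [])
    return any(anchor < t <= anchor + horizon for t in events)


anchor = datetime(2025, 3, 3, 10, 15)  # prediction time
# Mirror the WHERE clause: skip patients already in the ICU
labels = {pid: label(pid, anchor) for pid, unit in units.items() if unit != "ICU"}
print(labels)  # {'P6001': True, 'P6002': False, 'P6003': False}
```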

Prediction output

Every entity gets a score, updated continuously

PATIENT_ID	UNIT	TIMESTAMP	DETERIORATION_PROB
P6001	Med-Surg 4A	2025-03-03 10:15	0.91
P6002	Med-Surg 4A	2025-03-03 10:15	0.07
P6003	Telemetry 3B	2025-03-03 10:15	0.15

Understand why

Every prediction includes feature attributions — no black boxes

Patient P6001 -- 71yo, Med-Surg 4A

Predicted: 91% deterioration probability in next 12 hours

Top contributing features

Feature	Value	Attribution
Heart rate trend (last 4h)	+14 bpm	27%
Lactate level	3.8 mmol/L	24%
New vasopressor initiated	Norepinephrine	21%
SpO2 decline rate	-3% in 4h	16%
Nursing assessment sentiment	Concern noted	12%

Feature attributions are computed automatically for every prediction. No separate tooling required.

PQL Documentation

Learn the Predictive Query Language — SQL-like syntax for defining any prediction task in 2–3 lines.


Python SDK

Integrate Kumo predictions into your pipelines. Train, evaluate, and deploy models programmatically.


Explainability Docs

Understand feature attributions, model evaluation metrics, and how to build trust with stakeholders.


Frequently asked questions

Common questions about patient deterioration prediction

How does AI predict patient deterioration in hospitals?

AI predicts deterioration by fusing multiple real-time data streams: vital sign trajectories, lab result trends, medication administration patterns, and nursing assessment narratives. Graph-based models learn that specific temporal patterns across these streams (e.g., declining MAP + new vasopressor + rising lactate + nursing concern) predict deterioration 6-8 hours before traditional early warning scores trigger.

What is the false alarm rate for early warning scores like NEWS?

Traditional early warning scores (NEWS, MEWS) generate false alarm rates above 90% in most published studies. This means more than 9 out of 10 alerts are false positives, causing severe alert fatigue. Nurses learn to dismiss warnings, and real deterioration events get lost in the noise. Graph-based models reduce false alarms to below 15% while detecting deterioration earlier.

How many hours in advance can AI detect patient deterioration?

Graph-based models detect deterioration 6-8 hours before traditional threshold-based scores trigger alerts. The lead time comes from tracking vital-sign trajectories and correlating them with medication changes and lab trends. A declining MAP that has not yet crossed a threshold is invisible to NEWS/MEWS but is captured by trajectory-aware models.

What is the cost of failure to rescue in hospitals?

Failure to rescue (death following a complication that could have been treated if detected earlier) costs U.S. hospitals $1.1B annually. Each unexpected ICU transfer costs $31,000 more than a planned transfer. A 600-bed hospital averages 15 unexpected ICU transfers per month. Catching deterioration 6 hours earlier prevents 40+ deaths and saves $5.6M annually.

Bottom line: A 600-bed hospital catching inpatient deterioration 6 hours earlier prevents 40+ deaths and saves $5.6M annually in unplanned ICU transfers. Kumo fuses vitals trajectories, lab trends, medication signals, and nursing narrative patterns into a single real-time score that outperforms NEWS/MEWS with 80% fewer false alarms.

Related use cases

Explore more healthcare use cases

Use Case #1: Readmission Prediction

Use Case #2: No-Show Prediction

Use Case #3: Length of Stay


Topics covered

patient deterioration prediction, early warning score AI, rapid response prediction, clinical deterioration ML, ICU transfer prediction, graph neural network vitals, KumoRFM deterioration, inpatient safety AI, sepsis early detection

From a leadership team with proven experience

Vanja Josifovski

CEO and Co-Founder, ex-CTO Airbnb, ex-CTO Pinterest

Jure Leskovec

Co-Founder & Chief Scientist, Stanford Professor

Hema Raghavan

Co-Founder & Head of Engineering, ex-AI Lead, LinkedIn

One Platform. One Model. Infinite Predictions.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Research Agent for 30%+ higher accuracy than traditional models.
