Executive AI Dinner hosted by Kumo - Austin, April 8

Register here
6Binary Classification · Early Warning

Patient Deterioration Prediction

Which inpatients will deteriorate in the next 12 hours?

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.

By submitting, you accept the Terms and Privacy Policy.

Loved by data scientists, ML engineers & CXOs at

Catalina Logo

A real-world example

Which inpatients will deteriorate in the next 12 hours?

Failure to rescue costs U.S. hospitals $1.1B annually. Traditional early warning scores (NEWS, MEWS) generate false alarm rates above 90%, causing alert fatigue. A 600-bed hospital averages 15 unexpected ICU transfers per month; each costs $31,000 more than a planned transfer. Catching deterioration 6 hours earlier would save $5.6M per year and 40+ lives.

Quick answer

AI predicts inpatient deterioration by fusing vital sign trajectories, lab trends, medication administrations, and nursing assessment patterns into a real-time relational graph. Traditional early warning scores (NEWS, MEWS) use single-point-in-time vital thresholds and generate false alarm rates above 90%, causing alert fatigue. Graph-based models detect deterioration 6 hours earlier with 80% fewer false alarms by learning temporal patterns across multiple data streams and cross-patient signals from the same unit.

Approaches compared

4 ways to solve this problem

1. NEWS/MEWS (National Early Warning Score)

Aggregate scores based on vital sign thresholds: heart rate, blood pressure, respiratory rate, temperature, SpO2, and consciousness level. Calculated at each nursing assessment. Escalation triggers at score thresholds.

Best for

Universal screening where every patient needs a baseline risk assessment. Simple to calculate bedside. Supported by extensive clinical validation.

Watch out for

Single-point-in-time scoring misses trajectories. A heart rate of 102 scores the same whether it was 72 four hours ago (concerning trend) or 108 four hours ago (improving trend). False alarm rates above 90% cause alert fatigue, and nurses learn to ignore the warnings.

2. EHR-Embedded Sepsis Alerts

Vendor-provided sepsis screening tools (Epic Sepsis Model, Cerner St. John Sepsis) that monitor for SIRS criteria, qSOFA, and sepsis-related lab patterns. Integrated into clinical workflow with automated alerts.

Best for

Sepsis-specific detection where early antibiotics save lives. Embedded in the nursing workflow without additional tools.

Watch out for

Sepsis-specific: misses cardiac, respiratory, neurological, and post-surgical deterioration entirely. High false-positive rates (90%+ in published studies of the Epic Sepsis Model) reduce clinical trust.

3. Single-Patient Time-Series Models

Deep learning models (LSTMs, transformers) trained on individual patient vital-sign and lab-result time series to predict deterioration within 4-12 hours. Captures temporal trends within a single patient's data.

Best for

Patients with rich temporal data streams (ICU step-downs, telemetry patients) where the trajectory within the patient's own data is the primary signal.

Watch out for

Operates on one patient at a time. Cannot detect unit-level signals: when multiple patients on the same floor show subtle vital changes simultaneously, it may indicate a staffing problem, equipment issue, or medication error that affects the whole unit.

4. Graph Neural Networks (Kumo's Approach)

Fuses vitals trajectories, lab trends, medication administrations, nursing assessments, and unit-level context into a temporal relational graph. Detects both individual patient deterioration patterns and cross-patient signals on the same unit.

Best for

Catching deterioration 6-8 hours before traditional scores trigger, with 80% fewer false alarms. Detects unit-level risk factors (understaffing, concurrent deterioration) that affect individual patient outcomes.

Watch out for

Requires real-time data feeds from vitals monitors, labs, and medication administration records. Latency in data availability directly impacts prediction lead time.

Key metric: NEWS/MEWS generate 90%+ false alarms. Graph-based models achieve 80% fewer false alarms while detecting deterioration 6 hours earlier, saving $5.6M annually and 40+ lives at a 600-bed hospital.

Why relational data changes the answer

Flat deterioration models analyze each patient's vitals independently: is heart rate above threshold? Is SpO2 below threshold? They catch the patient whose vitals have already crossed into dangerous territory. But they miss the patient whose MAP has been declining slowly for 4 hours, who just received a new vasopressor (indicating the care team is already concerned), whose lactate came back elevated, and whose nursing note mentions 'increased confusion.' These signals span four different data streams (vitals, medications, labs, notes), and their temporal co-occurrence is what predicts deterioration 6-8 hours before any single vital sign crosses a threshold. A flat model would need each of these as a pre-engineered feature, and it would still miss the timing relationships between them.

Relational learning connects these data streams temporally. The model tracks the trajectory of vitals over sliding windows, correlates them with medication changes (a new vasopressor following a blood pressure drop is a strong signal), lab trends (rising lactate concurrent with vital changes confirms the pattern), and nursing sentiment (concern documented before vitals cross thresholds). It also detects cross-patient signals: when 3 patients on the same Med-Surg unit show subtle vital changes within the same 2-hour window, the model weighs unit-level risk factors (staffing ratio, recent shift change) that may explain the concurrent deterioration. This multi-stream, multi-patient approach produces a C-statistic above 0.90 with a false alarm rate below 15%, compared to NEWS/MEWS at 0.70-0.75 with false alarms above 90%.

Detecting patient deterioration from single-point vitals is like monitoring a building for structural failure by checking one sensor at a time. The temperature sensor says 'normal,' the vibration sensor says 'normal,' the moisture sensor says 'normal.' But if you connect them, you see that temperature is rising slowly, vibration shifted 2 hours ago, and moisture appeared at the foundation yesterday. The building is not failing yet, but the pattern across sensors says it will. Relational deterioration models are the connected building-monitoring system for hospitalized patients.

How KumoRFM solves this

Graph-learned clinical intelligence across your entire patient network

Kumo builds a temporal graph from vitals streams, lab results, medication administrations, and nursing assessments. It learns that a specific pattern of trending MAP decline combined with a new vasopressor order and a nursing note sentiment shift precedes deterioration 8 hours before traditional scores trigger. The model weighs cross-patient signals: when multiple patients on the same unit show subtle vital changes simultaneously, it detects environmental or staffing-related risk factors.

From data to predictions

See the full pipeline in action

Connect your tables, write a PQL query, and get predictions with built-in explainability — all in minutes, not months.

1

Your data

The relational tables Kumo learns from

PATIENTS

patient_idageunitadmit_date
P600171Med-Surg 4A2025-03-01
P600255Med-Surg 4A2025-03-02
P600383Telemetry 3B2025-02-28

VITALS

vitals_idpatient_idtimestamphrbp_sysspo2temp
VT01P60012025-03-03 06:00881329637.1
VT02P60012025-03-03 10:001021089338.2
VT03P60022025-03-03 08:00721289836.8

LABS

lab_idpatient_idtest_namevaluetimestamp
LB01P6001Lactate3.82025-03-03 07:00
LB02P6001WBC16.42025-03-03 07:00
LB03P6002Lactate1.12025-03-03 06:00

MEDICATIONS

med_idpatient_iddrugrouteadmin_time
MD01P6001NorepinephrineIV2025-03-03 09:30
MD02P6002MetoprololPO2025-03-03 08:00
MD03P6003Heparin dripIV2025-03-03 06:00

NURSING_NOTES

note_idpatient_idtimestampassessment
NN01P60012025-03-03 08:00Pt appears more confused, family concerned
NN02P60022025-03-03 08:30Resting comfortably, no complaints
NN03P60032025-03-03 07:00Steady, ambulating in hallway
2

Write your PQL query

Describe what to predict in 2–3 lines — Kumo handles the rest

PQL
PREDICT BOOL(VITALS.RAPID_RESPONSE, 0, 12, hours)
FOR EACH PATIENTS.PATIENT_ID
WHERE PATIENTS.UNIT != 'ICU'
3

Prediction output

Every entity gets a score, updated continuously

PATIENT_IDUNITTIMESTAMPDETERIORATION_PROB
P6001Med-Surg 4A2025-03-03 10:150.91
P6002Med-Surg 4A2025-03-03 10:150.07
P6003Telemetry 3B2025-03-03 10:150.15
4

Understand why

Every prediction includes feature attributions — no black boxes

Patient P6001 -- 71yo, Med-Surg 4A

Predicted: 91% deterioration probability in next 12 hours

Top contributing features

Heart rate trend (last 4h)

+14 bpm

27% attribution

Lactate level

3.8 mmol/L

24% attribution

New vasopressor initiated

Norepinephrine

21% attribution

SpO2 decline rate

-3% in 4h

16% attribution

Nursing assessment sentiment

Concern noted

12% attribution

Feature attributions are computed automatically for every prediction. No separate tooling required. Learn more about Kumo explainability

Frequently asked questions

Common questions about patient deterioration prediction

How does AI predict patient deterioration in hospitals?

AI predicts deterioration by fusing multiple real-time data streams: vital sign trajectories, lab result trends, medication administration patterns, and nursing assessment narratives. Graph-based models learn that specific temporal patterns across these streams (e.g., declining MAP + new vasopressor + rising lactate + nursing concern) predict deterioration 6-8 hours before traditional early warning scores trigger.

What is the false alarm rate for early warning scores like NEWS?

Traditional early warning scores (NEWS, MEWS) generate false alarm rates above 90% in most published studies. This means more than 9 out of 10 alerts are false positives, causing severe alert fatigue. Nurses learn to dismiss warnings, and real deterioration events get lost in the noise. Graph-based models reduce false alarms to below 15% while detecting deterioration earlier.

How many hours in advance can AI detect patient deterioration?

Graph-based models detect deterioration 6-8 hours before traditional threshold-based scores trigger alerts. The lead time comes from tracking vital-sign trajectories and correlating them with medication changes and lab trends. A declining MAP that has not yet crossed a threshold is invisible to NEWS/MEWS but is captured by trajectory-aware models.

What is the cost of failure to rescue in hospitals?

Failure to rescue (death following a complication that could have been treated if detected earlier) costs US hospitals $1.1B annually. Each unexpected ICU transfer costs $31,000 more than a planned transfer. A 600-bed hospital averages 15 unexpected ICU transfers per month. Catching deterioration 6 hours earlier prevents 40+ deaths and saves $5.6M annually.

Bottom line: A 600-bed hospital catching inpatient deterioration 6 hours earlier prevents 40+ deaths and saves $5.6M annually in unplanned ICU transfers. Kumo fuses vitals trajectories, lab trends, medication signals, and nursing narrative patterns into a single real-time score that outperforms NEWS/MEWS with 80% fewer false alarms.

Topics covered

patient deterioration predictionearly warning score AIrapid response predictionclinical deterioration MLICU transfer predictiongraph neural network vitalsKumoRFM deteriorationinpatient safety AIsepsis early detection

One Platform. One Model. Infinite Predictions.

KumoRFM

Relational Foundation Model

Turn structured relational data into predictions in seconds. KumoRFM delivers zero-shot predictions that rival months of traditional data science. No training, feature engineering, or infrastructure required. Just connect your data and start predicting.

For critical use cases, fine-tune KumoRFM on your data using the Kumo platform and Research Agent for 30%+ higher accuracy than traditional models.

Book a demo and get a free trial of the full platform: research agent, fine-tune capabilities, and forward-deployed engineer support.