KumoRFM training did not incorporate the SALT dataset. The examples on this page evaluate KumoRFM on SALT tasks without prior model training on SALT data.
What SALT Is
SALT, short for Sales Autocompletion Linked Business Tables, is a real enterprise resource planning (ERP) dataset built around sales-order workflows. It contains four linked business tables:- sales documents
- sales document items
- customers
- addresses
What KumoRFM Predicts on SALT
SALT is not primarily a regression or binary classification benchmark. Instead, the main SALT tasks are multi-class classification problems where KumoRFM fills in missing business attributes. The notebook focuses on eight categorical prediction targets:SALESOFFICESALESGROUPCUSTOMERPAYMENTTERMSSHIPPINGCONDITIONHEADERINCOTERMSCLASSIFICATIONPLANTSHIPPINGPOINTITEMINCOTERMSCLASSIFICATION
What the SALT Notebook Does
The SALT notebook is an evaluation walkthrough. It shows how to take a realistic enterprise dataset, convert it into a Kumo graph, and run classification tasks using Predictive Query Language. At a high level, the notebook does the following:- Installs the required packages, including
kumoaiand Hugging Face dataset tooling. - Authenticates with KumoRFM and initializes the SDK.
- Authenticates with Hugging Face to access the SALT dataset.
- Loads the SALT tables into pandas DataFrames.
- Reconstructs the raw relational dataset by concatenating train and test splits where needed.
- Cleans and normalizes the data:
- merges
CREATIONDATEandCREATIONTIMEinto a single datetime field - propagates timestamps to item rows
- removes auto-generated index columns
- adds a stable primary key for item-level tasks
- renames overlapping target columns for clarity
- merges
- Selects one target task at a time and removes the remaining target columns from the feature set.
- Masks the test labels to prevent temporal leakage.
- Builds a
LocalGraphwith explicit primary keys, time columns, and links across sales, items, customers, and addresses. - Initializes
KumoRFMon that graph. - Defines a Predictive Query Language statement for the chosen target.
- Runs batched predictions on the held-out entities.
- Evaluates the predictions using Mean Reciprocal Rank (MRR).
KumoRFM Prediction Pattern
For SALT-style tasks, KumoRFM predicts a categorical target for each entity in the relevant table. In the notebook, the PQL takes one of these forms:More Reading
- SALT notebook with KumoRFM
- SALT paper
- SALT dataset source
- SAP SALT press release
- RelBench SALT dataset page