Kumo is most commonly used for personalization and recommendation use cases, for good reason: as the first end-to-end personalization solution powered by GNNs and large language models (LLMs), Kumo makes it easy to develop custom GNN+LLM recommenders for your data, supporting over a dozen personalization solutions, ranging from personalized email notifications, to in-app carousels of related items, and even search reranking.Kumo’s flexible model architecture works particularly well for teams with unique business requirements and challenges, including: cold start, massive scale, data sparsity, text/image data, and complex business logic. Additionally, Kumo offers data warehouse-native deployments that allow you to perform data processing within your Snowflake or Databricks accounts, reducing the need for time-consuming security reviews.
Kumo’s predictive query language (PQL) provides the flexibility to build a wide variety of personalization models. This language lets data scientists declare the entity, target, filters, and optimization goal of a predictive task, in a purely declarative manner.In practice, this enables teams to quickly build GNN-powered recommendations for dozens of user-facing surfaces within their product, including in-app carousels, notifications, related items, cross/up-sells, and reranking of product listing pages (search/browse).Here are a few of the solutions that Kumo supports.
Data science and machine learning teams looking to get the best recommendations across many surfaces in their product
Product managers and directors of personalization that want to use the best-performing recommendation algorithms in academia and industry (GNNs)
Businesses with unique requirements that typical personalization vendors do not support, such as a used item marketplace, consumer apps, streaming platform, ticket reseller, or food delivery service
Engineering teams looking to improve their existing retrieval and re-ranking systems by using graph embeddings or GNN-powered candidate sources.
Organizations looking to invest in a general machine learning platform to up-level a wide variety of solutions beyond personalization, including LTV/conversion prediction for user segmentation, fraud detection, demand forecasting, and price optimization.
Hybrid GNN Model Architecture to achieve 2.7x better mean average precision on a public benchmarks compared to popular alternatives like two-tower models.
Solutions for both batch and real time recommendations, enabling a single platform to power asynchronous (eg. email) and in-product user journeys.
Prediction filters, enabling recommendations to be contextualized for specific events or campaigns. For example, Kumo can create a personalized list of winter events that might interest you.
Great handling of cold start. The model can make high-quality recommendations for new users and items for which there is little to no information using inductive representation learning.
GNN model planner enables data scientists to tune the model architecture for the dataset, including the training split, neighborhood sampling strategy, and model hyperparameters.
Support for visual signal (image-based), and LLM-powered text understanding.
Ability to export graph embeddings for all users/items, which can be used as a feature in your existing re-ranker models, or loaded into your search retrieval engine.
REST API and SDK, enabling data scientists and ML engineers to develop, test, and deploy recommendation models directly from notebooks or as part of an automated workflow.
Kumo does not require data to be transformed to fit a prescriptive schema, nor does it require the installation of a tracking pixel.Instead, Kumo generates recommendations directly from the raw data that already exists in the data warehouse. The Kumo graph builder makes it easy to stitch together data from many different sources. Just connect the tables to the graph and go.For the best personalization results, Kumo encourages using data such as:
Customer profile information
Auth/unauth browsing history
Purchases, likes, searches, reviews
Product inventory
Text descriptions and product images
Promotion and discount history
Online and in-store sales data
User notification history
Second and third party data
By using all of your data, Kumo can achieve better recommendation quality compared to solutions that can only use a subset.
Kumo reads and writes data directly to the client’s data lakehouse, supporting cloud-first data science workflows. Supported lakehouses include Snowflake, Databricks, AWS S3, GCP BigQuery, and Azure Synapse (coming soon). For example, users have found success using Kumo as part of a DBT-based development environment, using Airflow for orchestration, and Streamlit for consumption.
Additionally, Kumo provides data warehouse native deployment options, which keeps your data secure by performing data processing within your Snowflake or Databricks account. This makes Kumo suitable for use in highly regulated environments, including banking, healthcare, and government.
Kumo uses a distributed GNN training system, written in C++, which can handle multi-terabyte datasets with tens of billions of rows, and has customers that make daily recommendations for more than 100M active users or more than 10M inventory items.Because GNNs are great at discovering complex patterns in sparse data, Kumo is also a good fit for small datasets containing 1000’s of users, and only 10’s of items.
In order to cover both in-product and out-of-product personalization journeys, Kumo supports a variety of serving methods, .
Batch Export: Export bulk predictions via UI or API. This is an easy way to generate personalized emails or notifications. It is also the easiest way to personalize many in-product surfaces, such as product carousels, related item recommendations, candidate generation, and feed reranking.
Online Serving: Recommend or rerank items for your in-product surfaces, leveraging real-time in-session signal and Kumo’s GNN artifacts. This can be provided as a portable code snippet, or served by Kumo behind a REST API.
Embedding Export: Export graph embeddings for use in existing retrieval and re-ranking models. Many large tech companies have achieved engagement gains of 10-30%, simply by adding graph signal to existing systems.
Kumo recommendations are powered by a GNN architecture, inspired by several academic papers in recent history. Data scientists can benefit from these advances in model architecture, without needing to code them up manually.Here is some of the research that is used by Kumo AI:
GraphSAGE does inductive representation learning to deliver great recommendations for users with very little interaction data such as first or second time visitors
ID-GNN is a training process that enables the model to learn patterns such as repeat purchase or brand affinity
PNA introduces a variety of aggregation operators which are explored by Kumo AutoML
GCN describes mean-pooling aggregation, which captures similarity between users with similar item purchases
GIN captures frequency signal to learn more complex user behavior like power users vs resurrected users
NBF networks reduce the computational cost of models, by providing an efficient way to capture paths between nodes
GraphMixer uses temporal representation learning, to interpret sequences of user actions such as on-site browsing history
RDL describes temporal sampling, which learns from past sequences of user actions to predict the future.
The Kumo model planner empowers data scientists to quickly iterate and apply their domain knowledge to the model.Specifically, the Kumo model planner gives control over:
training table splits
neighborhood sampling
column encoding
training process
GNN architecture
optimization goals
For example, when creating recommendations for a used-item marketplace, one could disable ID-based item embeddings, in order to improve accuracy on new (cold start) items, while trading off accuracy on older items.Data Scientists have improved sales by $100M USD/year using the fine-grained controls offered in the model planner.
PQL is a declarative syntax for defining machine learning problems. It is highly flexible and easy to learn, supporting inline filters, boolean expressions, and aggregation functions. Data scientists can quickly experiment with many different and complex predictive formulations of a machine learning problem in very few lines of code.For example, the following query will recommend 5 winter clearance items that each of your users will purchase with the highest likelihood.
Copy
Ask AI
PREDICT LIST_DISTINCT( purchases.item_id WHERE purchases.markdown > 0.2 AND item.season = 'winter', 0, 14)RANK TOP 5FOR EACH customers.customer_id
As part of the training process, Kumo automatically computes data visualizations and metrics to help understand the model’s strengths and weaknesses.
Learning Curves and Distribution: Detects under and overfitting by monitoring the convergence rates. Tracks distribution of training data over time for balance.
Backtesting on Holdout: All models are back-tested on a configurable holdout dataset. The holdout data set may be downloaded for customer analysis.
Standard Evaluation Metrics and Charts: Including: ROC and PRC curve, cumulative gain chart, AUPRC, AUROC, predicted vs actual scatter plot and histogram, MAE, MSE, RMSE, SMAPE, average precision, per-category recall, F1, MAP
Baseline Comparison: Models are benchmarked against an automatically generated analytic baseline.
Column Explainability: A visualization highlighting which columns have the greeted predictive power helps prove that the model has no data leakage
Row Level Explainability: Users can understand the reason for individual predictions by seeing which rows contributed most to the result.
In order to support ongoing validation of model correctness, Kumo has the following features related to MLOps:
Data Source Snapshotting: During each job, data source statistics are snapshotted including size, time range, and import time, to enable faster root cause analysis.
Drift Detection: Distributions of features and predictions are monitored for drift. This enables early detection of issues, preventing bad predictions from being published to production.
Champion/Challenger: A champion/challenger approach can be adopted to validate key metrics of a newly trained model when orchestrating automatic job retraining through the REST API
Because Kumo writes directly to the data lakehouse, it is easy to connect with other cloud software for personalization use cases. Here are just a few examples:
Marketing Orchestration: Braze, Adobe/Marketo, Klaviyo, Omnisend, etc… Create ML-powered customer segments and insert personalized dynamic content such as product recommendations
Advertising: Google Performance Max, Meta Advantage Plus, etc… Assign users to custom audiences using ML, to show creatives that are more likely to convert
Search/Vector Databases: Elasticsearch, OpenSearch… Personalize your search engine by using Kumo graph embeddings
API Servers/Websites: DynamoDB, Redis, MySQL, Cosmos, MongoDB… Serve predictions to power personalized in-product carousels.
Chatbots/Language Models: OpenAI, Cortex, Bard, DBRX… Use Kumo predictions and explanations to generate personalized messages based on recent activity.