Cold Start Recommendation
Solution Background and Business Value
Cold start recommendation is a common challenge in e-commerce platforms like Amazon and eBay, where new products are constantly introduced. The challenge is even greater in platforms like Eventbrite or Ticketmaster, where items (e.g., events or tickets) are always new and never repeat.
This problem is crucial for businesses where recommendation quality directly impacts user engagement and revenue. Improving cold start recommendations leads to:
- Better user experiences by ensuring relevant suggestions, even for new items.
- Higher conversions and sales by surfacing new but relevant products.
- Stronger user retention by keeping recommendations fresh and personalized.
At Kumo AI, we approach this problem using two key strategies:
- Feature-Based Learning: Kumo learns patterns from rich item attributes (e.g., category, brand, price) to infer relevance for cold start items.
- Graph Neural Network (GNN) Propagation: Kumo connects new items to existing ones using shared attributes (e.g., brand, category, location). This allows the system to leverage existing item signals for cold start recommendations.
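To make the propagation idea concrete, here is a minimal sketch in plain Python. It is not Kumo's GNN: it replaces learned message passing with a single averaging step, in which a cold start item borrows the interaction signal of existing items that share an attribute value. The item names, attribute values, and counts are hypothetical.

```python
from collections import defaultdict

# Hypothetical catalog: item_id -> attributes. "new_tee" has no interaction history.
items = {
    "tee_a": {"brand": "acme", "category": "shirts"},
    "tee_b": {"brand": "acme", "category": "shirts"},
    "mug_a": {"brand": "zen", "category": "kitchen"},
    "new_tee": {"brand": "acme", "category": "shirts"},
}

# Hypothetical interaction counts for items that already have history.
interaction_counts = {"tee_a": 40, "tee_b": 20, "mug_a": 5}


def build_attribute_index(items):
    """Map each (attribute, value) pair to the set of items carrying it."""
    index = defaultdict(set)
    for item_id, attrs in items.items():
        for key, value in attrs.items():
            index[(key, value)].add(item_id)
    return index


def propagated_score(item_id, items, index, counts):
    """Average the counts of items with history that share an attribute value."""
    neighbors = set()
    for key, value in items[item_id].items():
        neighbors |= index[(key, value)]
    neighbors.discard(item_id)  # an item is not its own neighbor
    warm = [counts[n] for n in neighbors if n in counts]
    return sum(warm) / len(warm) if warm else 0.0


index = build_attribute_index(items)
cold_score = propagated_score("new_tee", items, index, interaction_counts)
```

A learned model would weight neighbors and attributes rather than averaging them uniformly, but the data flow is the same: the new item reaches existing signal only through the attributes it shares.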
Data Requirements and Schema
To build an effective cold start recommendation model, we need a structured dataset. Kumo AI allows us to enhance this model by incorporating additional data sources over time.
Core Tables
The three essential tables required for this solution are:
- Orders Table
  - Stores interactions between users and items (e.g., purchases, event registrations).
  - Key attributes:
    - customer_id: User identifier.
    - item_id: Item identifier.
    - timestamp: When the interaction occurred.
    - Other optional features: purchase amount, event type.
- Customers Table
  - Stores user-related information.
  - Key attributes:
    - customer_id: Unique user identifier.
    - Other optional features: age, location, join date.
- Items Table
  - Stores product or event details.
  - Key attributes:
    - item_id: Unique identifier for each item.
    - start_timestamp/end_timestamp: Availability period of the item.
    - Other optional features: category, brand, price, color.
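The three core tables can be sketched as plain Python records to show how the keys line up. This is only an illustration of the schema described above, not Kumo's table API; the class names, the choice of optional fields, and the `find_dangling_orders` helper are assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class Customer:
    customer_id: str                 # unique user identifier
    location: Optional[str] = None   # optional feature


@dataclass(frozen=True)
class Item:
    item_id: str                     # unique item identifier
    start_timestamp: datetime        # start of the availability period
    end_timestamp: datetime          # end of the availability period
    category: Optional[str] = None   # optional feature


@dataclass(frozen=True)
class Order:
    customer_id: str                 # references Customer.customer_id
    item_id: str                     # references Item.item_id
    timestamp: datetime              # when the interaction occurred


def find_dangling_orders(orders, customers, items):
    """Return orders whose customer_id or item_id has no matching row."""
    customer_ids = {c.customer_id for c in customers}
    item_ids = {i.item_id for i in items}
    return [
        o for o in orders
        if o.customer_id not in customer_ids or o.item_id not in item_ids
    ]


customers = [Customer("u1", "Berlin")]
items = [Item("concert_1", datetime(2024, 1, 1), datetime(2024, 1, 31), "music")]
orders = [
    Order("u1", "concert_1", datetime(2024, 1, 5)),
    Order("u1", "concert_404", datetime(2024, 1, 6)),  # item missing from Items
]
dangling = find_dangling_orders(orders, customers, items)
```

Checking for dangling keys before building the graph is worthwhile because an order that points to a missing item cannot contribute an edge between a user and that item.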
Additional Tables (Optional)
For improved cold start recommendations, consider adding these:
- Brands Table: Links items with brands, enabling item-to-item similarity.
- Item Metadata Tables: Captures hierarchical relationships between items (e.g., event type, location).
- Behavioral Tables: Includes clicks, wishlist adds, reviews, which can be used to enrich recommendations.
Entity Relationship Diagram (ERD)
Predictive Queries
We can train two types of recommendation models to handle cold start cases:
I. Temporal Recommendation (For Personalized Suggestions)
- Handles cold start items dynamically by leveraging item features.
- Best when there is a mix of new and existing items.
II. Static Link Prediction (For Item-to-Item Recommendations)
- Uses relationships like brand, category, or event type to recommend similar items.
- Best when the orders table lacks timestamps and connections between items are strong.
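To illustrate why the temporal formulation surfaces cold start items, the sketch below builds prediction targets from an orders log: for a given anchor time, each user's target is the set of distinct items they interact with in the following window. Any target item with no interactions before the anchor is a cold start item from the model's point of view. This is a plain-Python illustration, not Kumo's internal procedure; the 7-day horizon, the sample orders, and the function names are assumptions.

```python
from datetime import datetime, timedelta

# Hypothetical orders log: (customer_id, item_id, timestamp).
orders = [
    ("u1", "concert_1", datetime(2024, 1, 2)),
    ("u1", "concert_2", datetime(2024, 1, 9)),
    ("u2", "concert_1", datetime(2024, 1, 3)),
    ("u2", "festival_9", datetime(2024, 1, 10)),
    ("u2", "festival_9", datetime(2024, 1, 11)),
]


def build_targets(orders, anchor, horizon_days):
    """Distinct items each user interacts with in (anchor, anchor + horizon]."""
    end = anchor + timedelta(days=horizon_days)
    targets = {}
    for customer_id, item_id, ts in orders:
        if anchor < ts <= end:
            targets.setdefault(customer_id, set()).add(item_id)
    return targets


def cold_items(orders, anchor, targets):
    """Target items that had no interactions at or before the anchor time."""
    seen = {item_id for _, item_id, ts in orders if ts <= anchor}
    future = set().union(*targets.values()) if targets else set()
    return future - seen


anchor = datetime(2024, 1, 7)
targets = build_targets(orders, anchor, horizon_days=7)
cold = cold_items(orders, anchor, targets)
```

In this sample every target item is cold at the anchor time, so a model trained on such examples can only succeed by using item features and shared attributes rather than past interactions with the item itself.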
Deployment Strategy
Cold start recommendation deployment is similar to traditional personalized recommendation systems, with some modifications:
- Precompute Recommendations for New Users & Items:
  - Generate default item recommendations for users with no history.
  - Assign category-based fallback recommendations if needed.
- Embedding-Based Candidate Generation:
  - Generate user and item embeddings daily.
  - Use these embeddings for real-time ranking and filtering.
- Integrate with Real-Time Systems:
  - Use precomputed recommendations for new users.
  - Apply real-time reranking for logged-in users with purchase history.
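The serving logic above can be sketched as a small lookup: users with an embedding get items ranked by dot-product similarity, and users without one fall back to a precomputed default list. This is a minimal sketch under stated assumptions, not a production design; the embedding values, the `recommend` function, and the fallback list are hypothetical, and a real system would typically use an approximate nearest-neighbor index instead of sorting every item.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def recommend(customer_id, user_embeddings, item_embeddings, fallback, k=2):
    """Rank items by embedding similarity, or use the fallback for unknown users."""
    user_vec = user_embeddings.get(customer_id)
    if user_vec is None:
        # New user with no history: serve precomputed default recommendations.
        return fallback[:k]
    ranked = sorted(
        item_embeddings,
        key=lambda item_id: dot(user_vec, item_embeddings[item_id]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical embeddings, assumed to be refreshed by a daily batch job.
user_embeddings = {"u1": [1.0, 0.0]}
item_embeddings = {
    "tee_a": [0.9, 0.1],
    "mug_a": [0.1, 0.9],
    "new_tee": [0.8, 0.2],  # cold start item, embedded from its features
}
fallback = ["tee_a", "mug_a"]

known_user_recs = recommend("u1", user_embeddings, item_embeddings, fallback)
new_user_recs = recommend("u_new", user_embeddings, item_embeddings, fallback)
```

Note that the cold start item can appear in a known user's results even without interactions, because its embedding is assumed to come from item features rather than purchase history.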
Building Models in the Kumo SDK
1. Initialize the Kumo SDK
2. Connect data
3. Select tables
4. Create graph schema
5. Train the Model