
Inductive Learning: GNNs That Generalize to Unseen Nodes and Graphs

Inductive GNNs learn a parameterized function that can compute embeddings for any node based on its features and local neighborhood. This is what makes GNNs deployable in production where new data arrives continuously.

PyTorch Geometric

TL;DR

  • Inductive learning means the GNN can predict on nodes and graphs not seen during training. It learns a general embedding function, not fixed per-node embeddings.
  • All parameterized GNN layers (GCNConv, GATConv, SAGEConv) are inherently inductive: they apply learned weight matrices to node features and neighborhood structure. GraphSAGE was explicitly designed for this setting.
  • Critical for production: enterprise databases change constantly. New customers, new transactions, new products. Inductive models handle this without retraining.
  • The model applies the same learned weights to new nodes' neighborhoods at inference time. A new customer gets an embedding immediately by aggregating their first few transactions.
  • KumoRFM is a foundation model that takes inductive learning further: it generalizes across entirely different relational databases, not just to new nodes in the same graph.

Inductive learning in graph neural networks means the model learns a parameterized function that can compute embeddings for nodes or graphs not present during training, enabling real-time predictions on continuously evolving data. Unlike transductive approaches that learn fixed embeddings for specific nodes, inductive GNNs learn weight matrices that transform any node's features based on its local neighborhood structure. When a new node appears, the model applies the same learned function to compute its embedding on the fly.

Why it matters for enterprise data

Enterprise databases are not static. Every day:

  • New customers register and place their first orders
  • Existing customers create new transactions
  • New products are added to the catalog
  • New support tickets, claims, and interactions appear

A transductive model trained on Monday's graph cannot predict on Tuesday's new customers without retraining. An inductive model can. As soon as a new customer places their first order, the model aggregates their order features and produces an embedding. The customer gets a churn prediction, fraud score, or product recommendation immediately.

This is not optional for production. Any enterprise ML system that touches relational data must handle new entities. Inductive learning makes this architecturally guaranteed rather than requiring engineering workarounds.

How inductive learning works

An inductive GNN learns two things during training:

  1. Weight matrices (W) that transform node features
  2. Aggregation strategy (how to combine neighbor information)

At inference on a new node:

  1. Sample or collect the new node's neighbors
  2. Apply the learned weight matrices to neighbor features
  3. Aggregate using the learned strategy
  4. Produce the node embedding
inductive_inference.py
import torch
from torch_geometric.nn import SAGEConv
from torch_geometric.loader import NeighborLoader

class InductiveGNN(torch.nn.Module):
    def __init__(self, in_dim, hidden_dim, out_dim):
        super().__init__()
        self.conv1 = SAGEConv(in_dim, hidden_dim)
        self.conv2 = SAGEConv(hidden_dim, out_dim)

    def forward(self, x, edge_index):
        x = self.conv1(x, edge_index).relu()
        return self.conv2(x, edge_index)

# Train on existing graph
model = InductiveGNN(16, 64, 7)
# ... training loop on data_train ...

# New customer appears with 3 orders.
# Just add them to the graph and run inference; add_new_customer is a
# placeholder for your own graph-update logic.
new_data = add_new_customer(existing_graph, new_customer_features,
                            new_order_edges)
# input_nodes takes a tensor of seed-node indices
loader = NeighborLoader(new_data, num_neighbors=[10, 10],
                        input_nodes=new_customer_idx)
model.eval()
with torch.no_grad():
    for batch in loader:
        pred = model(batch.x, batch.edge_index)
        # Seed nodes come first in each batch, so pred[0] is the
        # new customer's embedding/prediction

GraphSAGE (SAGEConv) is the canonical inductive GNN. NeighborLoader samples local neighborhoods for scalable inference on new nodes.
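The idea behind neighbor sampling can be sketched in plain Python. This is an illustrative sketch, not PyG's actual implementation: `sample_neighborhood`, the `adj` dictionary, and the fanout values are all made up for this example.

```python
import random

def sample_neighborhood(adj, seed, fanouts, rng=random.Random(0)):
    """Uniformly sample a multi-hop neighborhood around a seed node.

    adj: dict mapping each node to a list of its neighbors
    fanouts: neighbors to sample per hop, e.g. [10, 10] for two hops
    Returns the set of sampled nodes (including the seed).
    """
    frontier = [seed]
    sampled = {seed}
    for fanout in fanouts:
        next_frontier = []
        for node in frontier:
            nbrs = adj.get(node, [])
            # Keep all neighbors if there are few; otherwise subsample
            picks = nbrs if len(nbrs) <= fanout else rng.sample(nbrs, fanout)
            for n in picks:
                if n not in sampled:
                    sampled.add(n)
                    next_frontier.append(n)
        frontier = next_frontier
    return sampled

# Tiny graph: new node 0 connects to nodes 1 and 2; node 1 also knows node 3
adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}
print(sample_neighborhood(adj, seed=0, fanouts=[2, 2]))
```

Capping the fanout per hop is what keeps inference cost bounded even when a node (say, a popular product) has thousands of neighbors.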

Concrete example: real-time fraud scoring for new accounts

A bank processes 10,000 new account openings per day. Each new account:

  • Has features: [age, income, device_fingerprint, application_channel]
  • Connects to existing entities: shared device with other accounts, shared IP address, shared phone number

An inductive GNN trained on historical fraud patterns:

  1. Takes the new account's features
  2. Aggregates features from accounts sharing the same device/IP/phone
  3. Produces a fraud probability within milliseconds
  4. No retraining needed, even though this account never existed in training data
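The steps above can be sketched as a single GraphSAGE-style mean-aggregation step followed by a logistic score. Everything here is illustrative: the weights are hand-picked rather than trained, and the feature values are toy numbers, but the flow matches what an inductive GNN does for a brand-new account.

```python
import math

def fraud_score(account_feats, neighbor_feats, w_self, w_nbr, bias):
    """One inductive message-passing step, then a logistic fraud score.

    account_feats: feature vector of the new account
    neighbor_feats: feature vectors of accounts sharing a device/IP/phone
    w_self, w_nbr: "learned" weights (illustrative values, not trained)
    """
    dim = len(account_feats)
    # Mean-aggregate neighbor features (GraphSAGE-style mean aggregator)
    if neighbor_feats:
        agg = [sum(f[i] for f in neighbor_feats) / len(neighbor_feats)
               for i in range(dim)]
    else:
        agg = [0.0] * dim  # cold start: no neighbors yet
    # Combine self and aggregated representations with the learned weights
    z = sum(w_self[i] * account_feats[i] + w_nbr[i] * agg[i]
            for i in range(dim))
    return 1.0 / (1.0 + math.exp(-(z + bias)))  # fraud probability

# New account: [age, income, device_risk, channel_risk] (toy, normalized)
new_account = [0.3, 0.4, 0.9, 0.7]
# Two existing accounts share its device fingerprint
neighbors = [[0.5, 0.2, 0.9, 0.8], [0.4, 0.3, 0.9, 0.6]]
w_self, w_nbr, bias = [0.2, -0.3, 1.5, 0.8], [0.1, -0.2, 2.0, 1.0], -2.0

score = fraud_score(new_account, neighbors, w_self, w_nbr, bias)
print(f"fraud probability: {score:.3f}")
```

The key point is that nothing in `fraud_score` depends on *which* account is being scored: the same weights apply to any account with the right feature schema, which is exactly what makes the model inductive.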

Limitations and what comes next

  1. Cold start: New nodes with no connections have no neighbors to aggregate. Their embedding is based solely on their own features, losing the graph advantage. This improves as the node accumulates connections.
  2. Feature schema must match: The new node must have the same feature dimensions as training nodes. If the feature schema changes, the model needs updating.
  3. Distribution shift: If new nodes have fundamentally different feature distributions than training nodes (e.g., a new market segment), the learned function may not transfer well.

Foundation models like KumoRFM push inductive learning further by generalizing across different relational database schemas, achieving an average zero-shot AUROC of 76.71 on unseen RelBench tasks.

Frequently asked questions

What is inductive learning in GNNs?

Inductive learning means the GNN can generate embeddings and make predictions for nodes or graphs that were not present during training. The model learns a general function (parameterized by neural network weights) that computes embeddings from local neighborhood structure. At inference time, this function can be applied to any new node with features and neighbors, even in entirely new graphs.

How is inductive learning different from transductive learning in GNNs?

Transductive learning requires the entire graph (including test nodes) to be present during training. The model learns fixed embeddings for specific nodes. Inductive learning learns a parameterized function that can compute embeddings for any node based on its features and local structure. Inductive models generalize to new nodes; transductive models cannot.
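The contrast can be made concrete in a few lines. This is a hypothetical minimal sketch: a transductive model amounts to a fixed lookup table keyed by node ID, while an inductive model is a function of node features (here a plain linear map with made-up weights).

```python
import random

rng = random.Random(0)
feat_dim, emb_dim = 3, 4

# Transductive: a fixed embedding table, one row per training node ID.
# Nodes 0..99 exist; an unseen node ID simply has no row.
table = {node_id: [rng.gauss(0, 1) for _ in range(emb_dim)]
         for node_id in range(100)}

# Inductive: a learned function applied to any node's features.
W = [[rng.gauss(0, 1) for _ in range(feat_dim)] for _ in range(emb_dim)]

def embed(features):
    """Map a feature vector to an embedding, regardless of node identity."""
    return [sum(W[i][j] * features[j] for j in range(feat_dim))
            for i in range(emb_dim)]

new_node = [0.2, -1.0, 0.5]   # features of a node unseen during training
print(len(embed(new_node)))    # inductive map produces an embedding
print(table.get(123))          # → None: no table row for an unseen node ID
```

Real inductive GNNs additionally condition `embed` on the node's neighborhood, but the failure mode of the lookup table is the same either way.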

Which GNN layers support inductive learning?

All parameterized GNN layers (GCNConv, GATConv, SAGEConv, GINConv) are inherently inductive because they learn weight matrices applied to node features, not fixed per-node embeddings. GraphSAGE was explicitly designed for inductive settings with neighbor sampling. Methods like node2vec and DeepWalk are transductive because they learn fixed per-node embeddings.

Why does inductive learning matter for enterprise production systems?

Enterprise databases change constantly. New customers sign up, new transactions occur, new products are added. A transductive model would need to be retrained every time the graph changes. An inductive model can immediately compute embeddings for new entities by aggregating their neighbors' features, enabling real-time predictions without retraining.

Can inductive GNNs work on entirely new graphs?

Yes, if the new graph has the same feature schema. A GNN trained on one company's customer-order-product graph can potentially be applied to another company's graph with the same node/edge types and features. This is the foundation of transfer learning in GNNs and is how foundation models like KumoRFM work across different relational databases.

Learn more about graph ML

PyTorch Geometric is the open-source foundation for graph neural networks. Explore more layers, concepts, and production patterns.