Berlin Tech Meetup: The Future of Relational Foundation Models, Systems, and Real-World Applications

Register now:
PyG Guide

All Datasets

30 dataset guides covering the most important benchmarks in graph ML. From toy datasets (Karate Club) to billion-scale benchmarks (OGB-Papers100M).

120+ datasets in PyG30 guides

Cora

Citation

Node Classification

2.7K nodes10.6K edges

CiteSeer

Citation

Node Classification

3.3K nodes9.1K edges

PubMed

Citation

Node Classification

19.7K nodes88.6K edges

Amazon Computers

Co-purchase

Node Classification

13.8K nodes491.7K edges

Amazon Photo

Co-purchase

Node Classification

7.7K nodes238.2K edges

Reddit

Social

Node Classification

233.0K nodes114.6M edges

Flickr

Social

Node Classification

89.3K nodes899.8K edges

Yelp

Social

Multi-label Classification

716.8K nodes14.0M edges

Coauthor CS

Co-author

Node Classification

18.3K nodes163.8K edges

Coauthor Physics

Co-author

Node Classification

34.5K nodes495.9K edges

OGB-Products

OGB

Node Classification

2.4M nodes61.9M edges

OGB-Papers100M

OGB

Node Classification

111.1M nodes1615.7M edges

MUTAG

Molecular

Graph Classification

188 graphs~18 nodes40 edges

ENZYMES

Molecular

Graph Classification

600 graphs~33 nodes124 edges

PROTEINS

Molecular

Graph Classification

1.1K graphs~39 nodes146 edges

QM9

Molecular

Graph Regression (19 targets)

130.8K graphs~18 nodes37 edges

ZINC

Molecular

Graph Regression

249.5K graphs~23 nodes50 edges

MNIST Superpixels

Vision

Graph Classification

70.0K graphs~75 nodes1.4K edges

Karate Club

Social

Node Classification

34 nodes156 edges

PPI

Biological

Multi-label Node Classification

24 graphs~2.2K nodes61.3K edges

Elliptic Bitcoin

Financial

Node Classification (fraud)

203.8K nodes234.4K edges

DGraphFin

Financial

Node Classification (fraud)

3.7M nodes4.3M edges

MovieLens 1M

Recommendation

Link Prediction

9.9K nodes1.0M edges

ShapeNet

3D

Segmentation

16.9K graphs~2.6K nodes

ModelNet40

3D

Classification

12.3K graphs~17.7K nodes

NELL

Knowledge Graph

Node Classification

65.8K nodes251.6K edges

FB15k-237

Knowledge Graph

Link Prediction

14.5K nodes310.1K edges

PATTERN

GNN Benchmark

Node Classification

14.0K graphs~119 nodes6.1K edges

CLUSTER

GNN Benchmark

Node Classification

12.0K graphs~117 nodes4.3K edges

Peptides-func

Long Range

Multi-label Graph Classification

15.5K graphs~151 nodes307 edges

Your data is more complex than any benchmark.

KumoRFM works on real enterprise data: heterogeneous, temporal, billion-scale. No dataset formatting, no feature engineering.