How do I assign different weights to training samples?
Learn how to use instance-level weights to influence model training in Kumo.
Why assign different weights to training samples?
In some use cases, you may want to assign different levels of importance to samples in the training table. For instance, on an e-commerce platform, it can be valuable to emphasize high-value customers more during training. This can be done by assigning larger weights to those training instances.
Kumo supports sample-level weighting via the training table. Using the kumo-sdk, you can add a weight column to scale each row’s relative contribution to the loss function. These weights directly influence the model’s optimization process and should be used thoughtfully.
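As a minimal sketch of what this looks like in practice, the snippet below builds a training table with a weight column using pandas. The column name `weight`, the threshold, and the weighting scheme are all illustrative assumptions; check the kumo-sdk documentation for how your version expects the weight column to be named and registered.

```python
import pandas as pd

# Hypothetical training table: one row per customer, with a binary target.
train = pd.DataFrame({
    "customer_id": [101, 102, 103, 104],
    "target": [1, 0, 1, 0],
    "lifetime_value": [5000.0, 120.0, 8000.0, 90.0],
})

# Up-weight high-value customers (illustrative rule: 2x weight above
# a $1,000 lifetime-value threshold). The column name "weight" is an
# assumption -- consult the kumo-sdk docs for your version.
train["weight"] = (train["lifetime_value"] > 1000.0).map({True: 2.0, False: 1.0})

print(train[["customer_id", "weight"]])
```

Rows for customers 101 and 103 then contribute twice as much to the loss as the others.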
Guidelines for Setting Weights
- Negative weights: Kumo allows you to assign negative weights to some (but not all) training instances, and only for non-link-prediction tasks. This can be useful in advanced setups such as hard negative mining. However, large or excessive negative weights can severely degrade model performance, since they can make the loss unbounded and the training process unstable.
- Zero weights: Kumo allows you to assign zero weight to some (but not all) training instances. Setting a weight of zero effectively removes that sample from training, which can be helpful for excluding noisy or irrelevant instances. That said, assigning zero weights to too many samples shrinks the effective loss signal and can slow convergence.
- Highly skewed weights: Moderate variation in weights is usually fine, but extreme disparities between sample weights can destabilize training, slow convergence, and lead to poor generalization. If you use a skewed weight distribution, monitor its effect on the training loss curve closely.
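The guidelines above can be made concrete with a small standalone calculation (this is an illustration of weighted loss in general, not Kumo's internal implementation). The sketch below computes a weight-normalized binary cross-entropy and shows two effects: a zero weight drops a sample from the loss entirely, and a heavily skewed weight lets one sample dominate the average.

```python
import numpy as np

def weighted_bce(y_true, p_pred, w):
    """Weighted binary cross-entropy, normalized by the total weight.

    Note: with this normalization, negative weights can drive the
    denominator toward zero, which is one way the loss becomes unbounded.
    """
    per_sample = -(y_true * np.log(p_pred) + (1 - y_true) * np.log(1 - p_pred))
    return float(np.sum(w * per_sample) / np.sum(w))

y = np.array([1.0, 0.0, 1.0])
p = np.array([0.9, 0.2, 0.6])   # model's predicted probabilities

uniform = weighted_bce(y, p, np.array([1.0, 1.0, 1.0]))
# Zero weight: the third (hardest) sample no longer contributes at all.
zeroed = weighted_bce(y, p, np.array([1.0, 1.0, 0.0]))
# Skewed weight: the first sample dominates, pulling the loss toward
# its own per-sample value and hiding the signal from the other two.
skewed = weighted_bce(y, p, np.array([100.0, 1.0, 1.0]))
```

Here `zeroed < uniform` because the excluded sample was the hardest one, and `skewed` sits close to the first sample's individual loss, illustrating how extreme weights can mask the rest of the training signal.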
Instance weighting can better align training with business value, but carefully evaluate the effect of your chosen weights on loss stability and model performance before deploying your model.