CutMix

CutMix is a data augmentation technique used in training deep neural networks for computer vision. It creates new training samples by taking a patch from one image and pasting it into another, and blending the corresponding labels according to the patch area. The method aims to encourage the model to attend to multiple regions of an image and to improve generalization.

Procedure: Given two samples (x1, y1) and (x2, y2), a mixing ratio lambda is drawn from a

Variations and parameters: Different implementations vary the shape and size of the pasted patch and the distribution

Applications and effects: CutMix has been shown to improve accuracy and robustness on image classification benchmarks

Implementation notes: CutMix is relatively simple to implement and is supported in many deep-learning frameworks. When

A

1

−

a

=

+

−

Hyperparameters,