Tensorflow Keras implementation of : CutMix data augmentation for image classification

The full credit goes to: Sayan Nath

The Data augmentation strategy

CutMix is a data Augmentation strategy where some portion of the training example is removed and pasted with the content from other images in the training set. The labels are also mixed based on the proportion of the pixels that were combined. The full paper is at https://arxiv.org/pdf/1905.04899.pdf by Yun et. al., 2019.

CutMix augmented examples from CIFAR-10

Here are a few examples of augmented images. Few examples

Model and Dataset used

The Data augmentation is applied to the CIFAR-10 Data set. The model used here is the Resnet-20 with Categorical Cross-Entropy loss.

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'Adam', 'learning_rate': 0.001, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
  • training_precision: float32

Training Metrics

After 20 Epocs, the accuracy of the model trained on the CutMix augmented data is 79.61%, while the accuracy of the model trained on the original data is 75.62%. I also found that the training on the original data was slightly faster.

Model Plot

View Model Plot

Model Image

Downloads last month
18
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support image-classification models for tf-keras library.

Space using keras-io/CutMix_data_augmentation_for_image_classification 1