BUPT-CBFace: A Class-Balanced Dataset for Efficient Face Recognition



Description

In this page, we present the BUPT-CBFace dataset to help convenient and effective deep face recognition models training.

The medium-scale face recognition training set BUPT-CBFace is built by exploring the optimal data structure from massive data to show how class-balanced training can promote face recognition performance. It is characterized by the uniformly distributed sample size per class, as well as the balance between the number of classes and the number of samples in one class. In recognition tasks, BUPT-CBFace not only considers the balance between cross-age and cross-pose recognition but also reduces recognition bias to certain extent. Compared to the long-tailed CASIA-WebFace dataset, training deep models using BUPT-CBFace of the same size can significantly improve the face recognition accuracy.

Due to its small size and good recognition performance, BUPT-CBFace can be easily trained on a single NVIDIA GTX 1080Ti GPU to achieve the same level results as large-scale parallel training, which is very friendly to many institutes. We made available two versions of BUPT-CBFace as the alternative options to the existing long-tailed datasets:

  • BUPT-CBFace-50: Dataset with 10,000 classes and 50 images per class.

  • BUPT-CBFace-12: Dataset with 41,667 classes and 12 images per class.

Datasets # of photos # of subjects # photos per subject Distribution
VGGFace21 3.31M 9,131 362.6 Long-tailed
MS1M-IBUG2 3.8M 84.2K 44.2 Long-tailed
CASIA-WebFace3 494,414 10,575 46.8 Long-tailed
BUPT-CBFace-50 500,000 10,000 50 Uniform
BUPT-CBFace-12 500,004 41,667 12 Uniform

  1. Qiong Cao, Li Shen, Weidi Xie, Omkar M. Parkhi, Andrew Zisserman. VGGFace2: A dataset for recognising faces across pose and age. FG, 2018.
  2. Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, Jianfeng Gao. Ms-celeb-1m: A dataset and benchmark for large-scale face recognition. ECCV, 2016.
  3. Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z Li. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.

For more information, please refer to the paper Class-Balanced Training for Deep Face Recognition

Results

We compare face recognition performance of three same-scale datasets (CASIA-WebFace, BUPT-CBFace-12, and BUPT-CBFace-50) and two larger-scale datasets (MS1M-IBUG, VGGFace2) on the IJB-C benchmark. They are all trained with ArcFace loss and ResNet-34 architecture, and the batch size is set 256. BUPT-CBFace-12 can reach higher face recognition accuracy than BUPT-CBFace-50 but may occupy more GPU memory if they are trained under the same batch size. Researchers can choose any of the two according to their needs.

Training dataset 1e-6 1e-5 1e-4 1e-3 1e-2 1e-1 # of iterations (k)
VGGFace2 73.61% 82.78% 89.55% 94.28% 97.18% 98.76% 140
MS1M-IBUG 66.52% 82.20% 90.39% 94.28% 96.57% 98.31% 280
CASIA-WebFace 48.16% 68.04% 80.71% 89.34% 95.18% 98.32% 180
BUPT-CBFace-50 71.61% 81.03% 88.61% 93.95% 96.85% 98.60% 180
BUPT-CBFace-12 83.54% 88.66% 92.70% 95.52% 97.41% 98.83% 180


Reference

If you use this dataset in your research, please kindly cite our work as:


Yaobin Zhang and Weihong Deng. Class-balanced training for deep face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 824–825, 2020. [pdf]

BibTeX entry:
@inproceedings{zhang2020class,
    title={Class-Balanced Training for Deep Face Recognition},
    author={Zhang, Yaobin and Deng, Weihong},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
    pages={824--825},
    year={2020}
  }

Download the database

You can download the datasets from BaiduNetDisk (password: 64up), OneDrive or DropBox.

There are two zip files named "BUPT-CBFace-12.zip" and "BUPT-CBFace-50.zip". In each file, all the face images are stored in different folders in "images/" according to their identities, and the meta information is stored in "landmark.tsv". For example, one line in landmark.tsv

"m.051vglx/0 32 26 227 288 84 125 178 126 131 178 86 211 175 212"

shows the bounding box (upper left, lower right) and five facial landmarks (left eye, right eye, nose, left corner of mouth and right corner of mouth) information of face image "images/m.051vglx/0.jpg".


Contact

Please contact Yaobin Zhang (zhangyaobin@bupt.edu.cn) and Weihong Deng for questions about the database.