Attention: you must agree to the MSR-LA before downloading.
Cropped face images
Purpose: Faces are cropped with enough surrounding background that participants can run their own face detector and alignment, which allows more flexibility.
- Download link
  - One big TSV file, 150 GB download (option A)
  - OneDrive: 14 files, 106 GB download
- Some statistics:
  - Number of entities: 99,892
  - Number of lines: 8,456,240
  - Image resolution: up to 300×300
  - Average number of images per entity: 85
  - Total file size (uncompressed): 152 GB
- File format: text files; each line is an image record containing 7 columns, delimited by TAB.
  - Column 1: Freebase MID
  - Column 2: ImageSearchRank
  - Column 3: ImageURL
  - Column 4: PageURL
  - Column 5: FaceID
  - Column 6: FaceRectangle_Base64Encoded (four floats, relative coordinates of the upper-left and bottom-right corners)
  - Column 7: FaceData_Base64Encoded
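
The record layout above can be read with a few lines of Python. The following is a minimal sketch, not an official loader. It assumes that the four floats in column 6 are 32-bit little-endian values and that column 7 decodes to the raw bytes of an image file; neither detail is stated above, so verify both against the actual data. The names `FaceRecord` and `parse_record` are hypothetical.

```python
import base64
import struct
from typing import NamedTuple, Tuple


class FaceRecord(NamedTuple):
    """One image record, mirroring the 7 tab-delimited columns."""
    mid: str                                      # Column 1: Freebase MID
    search_rank: str                              # Column 2: ImageSearchRank (kept as text)
    image_url: str                                # Column 3: ImageURL
    page_url: str                                 # Column 4: PageURL
    face_id: str                                  # Column 5: FaceID
    face_rect: Tuple[float, float, float, float]  # Column 6, decoded
    face_data: bytes                              # Column 7, decoded


def parse_record(line: str) -> FaceRecord:
    """Split one line on TAB and decode the two Base64 columns."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 7:
        raise ValueError(f"expected 7 tab-delimited columns, got {len(fields)}")
    mid, rank, image_url, page_url, face_id, rect_b64, data_b64 = fields

    rect_bytes = base64.b64decode(rect_b64)
    # Assumption: four 32-bit little-endian floats (16 bytes in total).
    if len(rect_bytes) != 16:
        raise ValueError(f"expected 16 bytes for the face rectangle, got {len(rect_bytes)}")
    face_rect = struct.unpack("<4f", rect_bytes)

    # Assumption: the decoded bytes are an encoded image file.
    face_data = base64.b64decode(data_b64)
    return FaceRecord(mid, rank, image_url, page_url, face_id, face_rect, face_data)
```

Because the uncompressed data is about 152 GB, it is safer to iterate over the file line by line, calling `parse_record` on each line, than to load it into memory at once.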
Disclaimers
- The data is released for non-commercial research purposes only. You must read and agree to the MSR Data License Agreement before downloading the data;
- Please contact us if you are a celebrity but do not want to be included in this data set. We will remove related entries upon request;
- In all related publications, please cite the paper "MS-Celeb-1M: A Dataset and Benchmark for Large Scale Face Recognition" and provide the link to http://msceleb.org.
@INPROCEEDINGS{guo2016msceleb,
  author = {Guo, Yandong and Zhang, Lei and Hu, Yuxiao and He, Xiaodong and Gao, Jianfeng},
  title = {M{S}-{C}eleb-1{M}: A Dataset and Benchmark for Large Scale Face Recognition},
  booktitle = {European Conference on Computer Vision},
  year = {2016},
  organization = {Springer}
}