Attention: you must agree to the MSR-LA before downloading.
Full ImageThumbnails data
Purpose: Whole images are downsampled to thumbnails of at most 300*300 pixels, which are intended to preserve the complete context surrounding each face.
- Download link
- OneDrive: 20 files, 158GB download
- Some statistics:
- # of Entities: 99,952
- # of Lines: 10,490,534
- Image Resolution: up to 300*300
- Average Image # per Entity: 105
- Total file size (uncompressed): 214GB
- File format: text files. Each line is one image record with 6 columns, delimited by TAB.
- Column1: Freebase MID
- Column2: Query/Name
- Column3: ImageSearchRank
- Column4: ImageURL
- Column5: PageURL
- Column6: ImageData_Base64Encoded
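The record layout above can be read with a few lines of Python. This is a minimal sketch, not an official loader: the names `ImageRecord`, `parse_record`, and `iter_records` are hypothetical, the UTF-8 file encoding is an assumption, and the rank column is kept as a string because its exact format is not stated here.

```python
import base64
from typing import Iterator, NamedTuple


class ImageRecord(NamedTuple):
    """One line of a thumbnail file (hypothetical helper type)."""
    mid: str            # Column1: Freebase MID
    name: str           # Column2: Query/Name
    search_rank: str    # Column3: ImageSearchRank (kept as text; format assumed unknown)
    image_url: str      # Column4: ImageURL
    page_url: str       # Column5: PageURL
    image_bytes: bytes  # Column6: decoded from ImageData_Base64Encoded


def parse_record(line: str) -> ImageRecord:
    """Split one TAB-delimited line into its 6 columns and decode the image data."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 6:
        raise ValueError(f"expected 6 TAB-delimited columns, got {len(fields)}")
    mid, name, rank, image_url, page_url, data_b64 = fields
    return ImageRecord(mid, name, rank, image_url, page_url,
                       base64.b64decode(data_b64))


def iter_records(path: str) -> Iterator[ImageRecord]:
    """Stream records one line at a time, so a large file is never loaded whole."""
    # Assumption: the text files are UTF-8 encoded.
    with open(path, "r", encoding="utf-8") as handle:
        for line in handle:
            if line.strip():  # skip blank lines, if any
                yield parse_record(line)
```

Reading line by line matters here because the uncompressed data totals 214GB. The decoded `image_bytes` can be written to disk or passed to an image library of your choice.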
Disclaimers
- The data is released for non-commercial research purposes only. You must read and agree to the MSR Data License Agreement before downloading the data;
- Please contact us if you are a celebrity but do not want to be included in this data set. We will remove related entries upon request;
- In all related publications, please cite the paper "MS-Celeb-1M: A Dataset and Benchmark for Large Scale Face Recognition" and provide a link to http://msceleb.org.
@INPROCEEDINGS{guo2016msceleb,
  author = {Guo, Yandong and Zhang, Lei and Hu, Yuxiao and He, Xiaodong and Gao, Jianfeng},
  title = {M{S}-{C}eleb-1{M}: A Dataset and Benchmark for Large Scale Face Recognition},
  booktitle = {European Conference on Computer Vision},
  year = {2016},
  organization = {Springer}
}