Attention: you must agree to the MSR Data License Agreement (MSR-LA) before downloading.

Cropped face images

Purpose: Faces are cropped with enough surrounding background that participants can run their own face detection and alignment, which allows more flexibility.

  • Download link
  • Some statistics:
    • # of Entities: 99,892
    • # of Lines: 8,456,240
    • Image Resolution: up to 300×300 pixels
    • Average # of Images per Entity: 85
    • Total file size (uncompressed): 152GB
  • File format: text files in which each line is one image record with 7 TAB-delimited columns.
    • Column1: Freebase MID
    • Column2: ImageSearchRank
    • Column3: ImageURL
    • Column4: PageURL
    • Column5: FaceID
    • Column6: FaceRectangle_Base64Encoded (four floats, relative coordinates of UpperLeft and BottomRight corner)
    • Column7: FaceData_Base64Encoded
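
The record layout above can be read with a few lines of Python. The following is a minimal sketch, not an official loader: it assumes that Column6 decodes to exactly four little-endian 32-bit floats (16 bytes) and that Column7 decodes to the raw bytes of an encoded image. Neither detail is stated on this page, so verify both against a real record. The names FaceRecord, parse_line, and read_records are hypothetical and introduced only for illustration.

```python
import base64
import struct
from typing import Iterator, NamedTuple, Tuple


class FaceRecord(NamedTuple):
    """One line of the cropped-face file (hypothetical container type)."""
    mid: str                 # Column1: Freebase MID
    search_rank: str         # Column2: ImageSearchRank (kept as text)
    image_url: str           # Column3: ImageURL
    page_url: str            # Column4: PageURL
    face_id: str             # Column5: FaceID
    face_rect: Tuple[float, float, float, float]  # Column6, decoded
    face_data: bytes         # Column7, decoded


def parse_line(line: str) -> FaceRecord:
    """Split one TAB-delimited record and decode its Base64 columns."""
    cols = line.rstrip("\r\n").split("\t")
    if len(cols) != 7:
        raise ValueError(f"expected 7 columns, got {len(cols)}")
    mid, rank, image_url, page_url, face_id, rect_b64, data_b64 = cols

    # Assumption: the rectangle is four little-endian float32 values
    # (relative coordinates of the upper-left and bottom-right corners).
    # struct.unpack raises struct.error if the payload is not 16 bytes.
    rect = struct.unpack("<4f", base64.b64decode(rect_b64))

    # Assumption: the payload is an encoded image; it is returned as raw
    # bytes so the caller can pick an image library to decode it.
    data = base64.b64decode(data_b64)
    return FaceRecord(mid, rank, image_url, page_url, face_id, rect, data)


def read_records(path: str) -> Iterator[FaceRecord]:
    """Stream records one at a time; the file is far too large to load whole."""
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            if line.strip():
                yield parse_line(line)
```

Streaming line by line matters here because the uncompressed file is about 152GB. To turn the relative rectangle into pixel coordinates, multiply the first and third values by the image width and the second and fourth by the image height, assuming the order is left, top, right, bottom.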

Disclaimers

  1. The data is released for non-commercial research purposes only. You must read and agree to the MSR Data License Agreement before downloading the data;
  2. Please contact us if you are a celebrity but do not want to be included in this data set. We will remove the related entries upon request;
  3. In all related publications, please cite the paper "MS-Celeb-1M: A Dataset and Benchmark for Large Scale Face Recognition" and provide a link to http://msceleb.org.
    @inproceedings{guo2016msceleb,
        author = {Guo, Yandong and Zhang, Lei and Hu, Yuxiao and He, Xiaodong and Gao, Jianfeng},
        title = {M{S}-{C}eleb-1{M}: A Dataset and Benchmark for Large Scale Face Recognition},
        booktitle = {European Conference on Computer Vision},
        year = {2016},
        organization = {Springer}
    }