Articles producció científicaEnginyeria Informàtica i Matemàtiques

Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks

  • Dades identificatives

    Identificador:  imarina:9452161
    Autors:  Haffar, Rami; Sanchez, David; Domingo-Ferrer, Josep
    Resum:
    Human facial data offers valuable potential for tackling classification problems, including face recognition, age estimation, gender identification, emotion analysis, and race classification. However, recent privacy regulations, particularly the EU General Data Protection Regulation, have restricted the collection and usage of human images in research. As a result, several previously published face data sets have been removed from the internet due to inadequate data collection methods and privacy concerns. While synthetic data sets have been suggested as an alternative, they fall short of accurately representing the real data distribution. Additionally, most existing data sets are labeled for just a single task, which limits their versatility. To address these limitations, we introduce the Multi-Task Face (MTF) data set, designed for various tasks including face recognition and classification by race, gender, and age, as well as for aiding in training generative networks. The MTF data set comes in two versions: a non-curated set containing 132,816 images of 640 individuals, and a manually curated set with 5,246 images of 240 individuals, meticulously selected to maximize their classification quality. Both data sets were ethically sourced, using publicly available celebrity images in full compliance with copyright regulations. Along with providing detailed descriptions of data collection and processing, we evaluated the effectiveness of the MTF data set in training five deep learning models across the aforementioned classification tasks, achieving up to 98.88% accuracy for gender classification, 95.77% for race classification, 97.60% for age classification, and 79.87% for face recognition with the ConvNeXT model. Both MTF data sets can be accessed through the following link. https://github.com/RamiHaf/MTF_data_set
  • Altres:

    Enllaç font original: https://ieeexplore.ieee.org/document/10960301
    Referència de l'ítem segons les normes APA: Haffar, Rami; Sanchez, David; Domingo-Ferrer, Josep (2025). Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks. Ieee Access, 13(), 63827-63840. DOI: 10.1109/ACCESS.2025.3559310
    Referència a l'article segons font original: Ieee Access. 13 63827-63840
    DOI de l'article: 10.1109/ACCESS.2025.3559310
    Any de publicació de la revista: 2025
    Entitat: Universitat Rovira i Virgili
    Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
    Data d'alta del registre: 2025-04-30
    Autor/s de la URV: Domingo Ferrer, Josep / Sánchez Ruenes, David
    Departament: Enginyeria Informàtica i Matemàtiques
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    Tipus de publicació: Journal Publications
    Autor segons l'article: Haffar, Rami; Sanchez, David; Domingo-Ferrer, Josep
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Àrees temàtiques: Ciência da computação, Computer science (all), Computer science (miscellaneous), Computer science, information systems, Electrical and electronic engineering, Engenharias iii, Engenharias iv, Engineering (all), Engineering (miscellaneous), Engineering, electrical & electronic, General computer science, General engineering, General materials science, Materials science (all), Materials science (miscellaneous), Telecommunications
    Adreça de correu electrònic de l'autor: josep.domingo@urv.cat, david.sanchez@urv.cat
  • Paraules clau:

    Artificial intelligence
    Data models
    Deep learnin
    Deep learning
    Ethics
    Europe
    Face images
    Face recognition
    Faces
    General data protection regulation
    Image classification
    Image data set
    Law
    Regulation
    Training
    Computer Science (Miscellaneous)
    Computer Science
    Information Systems
    Engineering (Miscellaneous)
    Engineering
    Electrical & Electronic
    Materials Science (Miscellaneous)
    Telecommunications
    Ciência da computação
    Computer science (all)
    Electrical and electronic engineering
    Engenharias iii
    Engenharias iv
    Engineering (all)
    General computer science
    General engineering
    General materials science
    Materials science (all)
  • Documents:

  • Cerca a google

    Search to google scholar