Articles producció científica> Enginyeria Informàtica i Matemàtiques

Federated learning-based natural language processing: a systematic literature review

  • Identification data

    Identifier: imarina:9388759
    Authors:
    Khan, YounasSanchez, DavidDomingo-Ferrer, Josep
    Abstract:
    Federated learning (FL) is a decentralized machine learning (ML) framework that allows models to be trained without sharing the participants' local data. FL thus preserves privacy better than centralized machine learning. Since textual data (such as clinical records, posts in social networks, or search queries) often contain personal information, many natural language processing (NLP) tasks dealing with such data have shifted from the centralized to the FL setting. However, FL is not free from issues, including convergence and security vulnerabilities (due to unreliable or poisoned data introduced into the model), communication and computation bottlenecks, and even privacy attacks orchestrated by honest-but-curious servers. In this paper, we present a systematic literature review (SLR) of NLP applications in FL with a special focus on FL issues and the solutions proposed so far. Our review surveys 36 recent papers published in relevant venues, which are systematically analyzed and compared from multiple perspectives. As a result of the survey, we also identify the most outstanding challenges in the area.
  • Others:

    Author, as appears in the article.: Khan, Younas; Sanchez, David; Domingo-Ferrer, Josep
    Department: Enginyeria Informàtica i Matemàtiques
    URV's Author/s: Domingo Ferrer, Josep / Khan, Younas / Sánchez Ruenes, David / Sánchez Torres, David
    Keywords: Federated learning Natural language processing Privacy Security Systematic literature revie Systematic literature review
    Abstract: Federated learning (FL) is a decentralized machine learning (ML) framework that allows models to be trained without sharing the participants' local data. FL thus preserves privacy better than centralized machine learning. Since textual data (such as clinical records, posts in social networks, or search queries) often contain personal information, many natural language processing (NLP) tasks dealing with such data have shifted from the centralized to the FL setting. However, FL is not free from issues, including convergence and security vulnerabilities (due to unreliable or poisoned data introduced into the model), communication and computation bottlenecks, and even privacy attacks orchestrated by honest-but-curious servers. In this paper, we present a systematic literature review (SLR) of NLP applications in FL with a special focus on FL issues and the solutions proposed so far. Our review surveys 36 recent papers published in relevant venues, which are systematically analyzed and compared from multiple perspectives. As a result of the survey, we also identify the most outstanding challenges in the area.
    Thematic Areas: Artificial intelligence Biotecnología Ciência da computação Ciências biológicas i Ciencias humanas Ciencias sociales Computer science, artificial intelligence Engenharias iv Filologia, lingüística i sociolingüística Language and linguistics Linguistics Linguistics and language Medicina i Psicología Psychology
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Author's mail: josep.domingo@urv.cat david.sanchez@urv.cat younas.khan@urv.cat younas.khan@urv.cat
    Author identifier: 0000-0001-7213-4962 0000-0001-7275-7887
    Record's date: 2024-11-02
    Papper version: info:eu-repo/semantics/publishedVersion
    Papper original source: Artificial Intelligence Review. 57 (12): 320-
    APA: Khan, Younas; Sanchez, David; Domingo-Ferrer, Josep (2024). Federated learning-based natural language processing: a systematic literature review. Artificial Intelligence Review, 57(12), 320-. DOI: 10.1007/s10462-024-10970-5
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Entity: Universitat Rovira i Virgili
    Journal publication year: 2024
    Publication Type: Journal Publications
  • Keywords:

    Artificial Intelligence,Computer Science, Artificial Intelligence,Language and Linguistics,Linguistics and Language
    Federated learning
    Natural language processing
    Privacy
    Security
    Systematic literature revie
    Systematic literature review
    Artificial intelligence
    Biotecnología
    Ciência da computação
    Ciências biológicas i
    Ciencias humanas
    Ciencias sociales
    Computer science, artificial intelligence
    Engenharias iv
    Filologia, lingüística i sociolingüística
    Language and linguistics
    Linguistics
    Linguistics and language
    Medicina i
    Psicología
    Psychology
  • Documents:

  • Cerca a google

    Search to google scholar