Articles producció científica> Enginyeria Informàtica i Matemàtiques

An information theoretic approach to improve semantic similarity assessments across multiple ontologies

  • Dades identificatives

    Identificador:  imarina:9369758
    Autors:  Batet, Montserrat; Harispe, Sebastien; Ranwez, Sylvie; Sanchez, David; Ranwez, Vincent
    Resum:
    Semantic similarity has become, in recent years, the backbone of numerous knowledge-based applications dealing with textual data. From the different methods and paradigms proposed to assess semantic similarity, ontology-based measures and, more specifically, those based on quantifying the Information Content (IC) of concepts are the most widespread solutions due to their high accuracy. However, these measures were designed to exploit a single ontology. They thus cannot be leveraged in many contexts in which multiple knowledge bases are considered. In this paper, we propose a new approach to achieve accurate IC-based similarity assessments for concept pairs spread throughout several ontologies. Based on Information Theory, our method defines a strategy to accurately measure the degree of commonality between concepts belonging to different ontologies - this is the cornerstone for estimating their semantic similarity. Our approach therefore enables classic IC-based measures to be directly applied in a multiple ontology setting. An empirical evaluation, based on well-established benchmarks and ontologies related to the biomedical domain, illustrates the accuracy of our approach, and demonstrates that similarity estimations provided by our approach are significantly more correlated with human ratings of similarity than those obtained via related works. © 2014 Elsevier Inc. All rights reserved.
  • Altres:

    Autor segons l'article: Batet, Montserrat; Harispe, Sebastien; Ranwez, Sylvie; Sanchez, David; Ranwez, Vincent
    Departament: Enginyeria Informàtica i Matemàtiques
    Autor/s de la URV: Batet Sanromà, Montserrat / Sánchez Ruenes, David
    Paraules clau: Information theory; Mesh; Ontology; Semantic similarity; Snomed-ct
    Resum: Semantic similarity has become, in recent years, the backbone of numerous knowledge-based applications dealing with textual data. From the different methods and paradigms proposed to assess semantic similarity, ontology-based measures and, more specifically, those based on quantifying the Information Content (IC) of concepts are the most widespread solutions due to their high accuracy. However, these measures were designed to exploit a single ontology. They thus cannot be leveraged in many contexts in which multiple knowledge bases are considered. In this paper, we propose a new approach to achieve accurate IC-based similarity assessments for concept pairs spread throughout several ontologies. Based on Information Theory, our method defines a strategy to accurately measure the degree of commonality between concepts belonging to different ontologies - this is the cornerstone for estimating their semantic similarity. Our approach therefore enables classic IC-based measures to be directly applied in a multiple ontology setting. An empirical evaluation, based on well-established benchmarks and ontologies related to the biomedical domain, illustrates the accuracy of our approach, and demonstrates that similarity estimations provided by our approach are significantly more correlated with human ratings of similarity than those obtained via related works. © 2014 Elsevier Inc. All rights reserved.
    Àrees temàtiques: Administração pública e de empresas, ciências contábeis e turismo; Artificial intelligence; Astronomia / física; Biodiversidade; Ciência da computação; Ciências agrárias i; Ciências ambientais; Ciências biológicas i; Ciencias sociales; Computer science applications; Computer science, information systems; Comunicação e informação; Control and systems engineering; Engenharias i; Engenharias iii; Engenharias iv; Ensino; Information systems and management; Interdisciplinar; Matemática / probabilidade e estatística; Medicina ii; Software; Theoretical computer science
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Adreça de correu electrònic de l'autor: david.sanchez@urv.cat; montserrat.batet@urv.cat
    Data d'alta del registre: 2024-10-12
    Versió de l'article dipositat: info:eu-repo/semantics/acceptedVersion
    Enllaç font original: https://www.sciencedirect.com/science/article/abs/pii/S0020025514006677
    Referència a l'article segons font original: Information Sciences. 283 197-210
    Referència de l'ítem segons les normes APA: Batet, Montserrat; Harispe, Sebastien; Ranwez, Sylvie; Sanchez, David; Ranwez, Vincent (2014). An information theoretic approach to improve semantic similarity assessments across multiple ontologies. Information Sciences, 283(), 197-210. DOI: 10.1016/j.ins.2014.06.039
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    DOI de l'article: 10.1016/j.ins.2014.06.039
    Entitat: Universitat Rovira i Virgili
    Any de publicació de la revista: 2014
    Tipus de publicació: Journal Publications
  • Paraules clau:

    Artificial Intelligence,Computer Science Applications,Computer Science, Information Systems,Control and Systems Engineering,Information Systems and Management,Software,Theoretical Computer Science
    Information theory
    Mesh
    Ontology
    Semantic similarity
    Snomed-ct
    Administração pública e de empresas, ciências contábeis e turismo
    Artificial intelligence
    Astronomia / física
    Biodiversidade
    Ciência da computação
    Ciências agrárias i
    Ciências ambientais
    Ciências biológicas i
    Ciencias sociales
    Computer science applications
    Computer science, information systems
    Comunicação e informação
    Control and systems engineering
    Engenharias i
    Engenharias iii
    Engenharias iv
    Ensino
    Information systems and management
    Interdisciplinar
    Matemática / probabilidade e estatística
    Medicina ii
    Software
    Theoretical computer science
  • Documents:

  • Cerca a google

    Search to google scholar