Tesis doctoralsDepartament d'Enginyeria Informàtica i Matemàtiques

Ontology based semantic clustering

  • Dades identificatives

    Identificador:  TDX:896
    Autors:  Batet Sanromà, Montserrat
    Resum:
    Clustering algorithms have focused on the management of numerical and categorical data. However, in the last years, textual information has grown in importance. Proper processing of this kind of information within data mining methods requires an interpretation of their meaning at a semantic level. In this work, a clustering method aimed to interpret, in an integrated manner, numerical, categorical and textual data is presented. Textual data will be interpreted by means of semantic similarity measures. These measures calculate the alikeness between words by exploiting one or several knowledge sources. In this work we also propose two new ways of compute semantic similarity based on 1) the exploitation of the taxonomical knowledge available on one or several ontologies and 2) the estimation of the information distribution of terms in the Web. Results show that a proper interpretation of textual data at a semantic level improves clustering results and eases the interpretability of the classifications
  • Altres:

    Editor: Universitat Rovira i Virgili
    Data: 2011-02-15
    Identificador: urn:isbn:9788469432327, http://hdl.handle.net/10803/31913
    Departament/Institut: Departament d'Enginyeria Informàtica i Matemàtiques, Universitat Rovira i Virgili.
    Idioma: eng
    Autor: Batet Sanromà, Montserrat
    Director: Gibert Oliveras, Karina, Valls, Aïda
    Font: TDX (Tesis Doctorals en Xarxa)
    Format: application/pdf, 193 p.
  • Paraules clau:

    unsupervised classification
    classificació no supervisada
    Semantic Similarity
    Semblanza Semántica
    Semblança Semàntica
    Ontologies
    004 - Informàtica
  • Documents:

  • Cerca a google

    Search to google scholar