Articles producció científicaEnginyeria Química

Nonunique UPGMA clusterings of microsatellite markers

  • Dades identificatives

    Identificador:  imarina:9280466
    Autors:  Segura-Alabart, Natalia; Serratosa, Francesc; Gomez, Sergio; Fernandez, Alberto
    Resum:
    Agglomerative hierarchical clustering has become a common tool for the analysis and visualization of data, thus being present in a large amount of scientific research and predating all areas of bioinformatics and computational biology. In this work, we focus on a critical problem, the nonuniqueness of the clustering when there are tied distances, for which several solutions exist but are not implemented in most hierarchical clustering packages. We analyze the magnitude of this problem in one particular setting: the clustering of microsatellite markers using the Unweighted Pair-Group Method with Arithmetic Mean. To do so, we have calculated the fraction of publications at the Scopus database in which more than one hierarchical clustering is possible, showing that about 46% of the articles are affected. Additionally, to show the problem from a practical point of view, we selected two opposite examples of articles that have multiple solutions: one with two possible dendrograms, and the other with more than 2.5 million different possible hierarchical clusterings.© The Author(s) 2022. Published by Oxford University Press.
  • Altres:

    Enllaç font original: https://academic.oup.com/bib/article/23/5/bbac312/6652780
    Referència de l'ítem segons les normes APA: Segura-Alabart, Natalia; Serratosa, Francesc; Gomez, Sergio; Fernandez, Alberto (2022). Nonunique UPGMA clusterings of microsatellite markers. Briefings In Bioinformatics, 23(5), bbac312-bbac312. DOI: 10.1093/bib/bbac312
    Referència a l'article segons font original: Briefings In Bioinformatics. 23 (5): bbac312-bbac312
    DOI de l'article: 10.1093/bib/bbac312
    Any de publicació de la revista: 2022
    Entitat: Universitat Rovira i Virgili
    Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
    Data d'alta del registre: 2025-01-28
    Autor/s de la URV: Fernández Sabater, Alberto / Gómez Jiménez, Sergio / Segura Alabart, Natàlia / Serratosa Casanelles, Francesc d'Assís
    Departament: Enginyeria Informàtica i Matemàtiques, Enginyeria Química
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    Tipus de publicació: Journal Publications
    Autor segons l'article: Segura-Alabart, Natalia; Serratosa, Francesc; Gomez, Sergio; Fernandez, Alberto
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Àrees temàtiques: Molecular biology, Medicine (all), Mathematical & computational biology, Information systems, Ciências biológicas i, Ciência da computação, Biotechnology & applied microbiology, Biochemical research methods
    Adreça de correu electrònic de l'autor: natalia.segura@urv.cat, natalia.segura@urv.cat, natalia.segura@urv.cat, alberto.fernandez@urv.cat, sergio.gomez@urv.cat, francesc.serratosa@urv.cat
  • Paraules clau:

    Upgma
    Tie in proximity
    Str
    Ssr
    Microsatellite repeats
    Microsatellite marker
    Genetic diversity
    Dendrogram
    Computational biology
    Cluster analysis
    simple sequences
    l.
    Biochemical Research Methods
    Biotechnology & Applied Microbiology
    Information Systems
    Mathematical & Computational Biology
    Molecular Biology
    Medicine (all)
    Ciências biológicas i
    Ciência da computação
  • Documents:

  • Cerca a google

    Search to google scholar