Contamination of fungal genomes of Onygenaceae (Phylum Ascomycota) in public databases: incidence, detection, and impact

  • Identifying data

    Identifier:  imarina:9469348
    Authors:  Granados-Casas, AO; Fernández-Bravo, A; Stchigel, AM; Cano-Lira, JF
    Abstract:
    Genomic datasets often contain unwanted, foreign, or erroneous nucleotide sequences that do not belong to the organism under study. Such contamination can significantly compromise genome analyses, reducing the accuracy and reliability of the results. Despite its potential impact, few studies have addressed the contamination of fungal genomes by exogenous sequences. Here, we analyzed eleven publicly available genomes of fungi from the family Onygenaceae, retrieved from the National Center for Biotechnology Information (NCBI) database. A comprehensive quality assessment was performed, evaluating genome completeness, contiguity, and contamination levels. Genomes with lower quality statistics and putative contamination were selected for further improvement. To enhance assembly quality, we built a custom Kraken 2 database including four high-quality genomes of closely related fungal taxa. After filtering, we reassessed the genomes to compare contiguity, completeness, and contamination levels before and after the process. Furthermore, structural and functional annotation was conducted to evaluate changes in predicted proteins, protein families, and domains. Additionally, average nucleotide identity and phylogenetic analyses were performed to further assess the impact of the filtering. Four genomes showed low-quality statistics and contamination levels between 5 and 12%, mainly of bacterial origin. After removing the contaminated regions, assembly quality metrics improved, and contamination levels dropped below 3% in all cases. Functional annotation of the filtered assemblies revealed a reduction in bacteria-associated protein families. Our results demonstrate the presence of contamination in publicly available Onygenaceae fungal genomes and highlight its potential to bias downstream analyses. We emphasize the importance of contamination screening and removal to ensure reliable genomic data for fungal research.
  • Other:

    Original source link: https://link.springer.com/journal/12864
    Item reference in APA style: Granados-Casas, AO; Fernández-Bravo, A; Stchigel, AM; Cano-Lira, JF (2025). Contamination of fungal genomes of Onygenaceae (Phylum Ascomycota) in public databases: incidence, detection, and impact. BMC GENOMICS, 26(1), 1057-. DOI: 10.1186/s12864-025-12223-3
    Article reference according to the original source: BMC GENOMICS. 26 (1): 1057-
    Article DOI: 10.1186/s12864-025-12223-3
    Journal publication year: 2025-11-19
    Institution: Universitat Rovira i Virgili
    Deposited article version: info:eu-repo/semantics/publishedVersion
    Record creation date: 2026-05-16
    URV author(s): Cano Lira, José Francisco / Fernández Bravo, Ana / Granados Casas, Alan Omar / Stchigel Glikman, Alberto Miguel
    Department: Ciències Mèdiques Bàsiques
    License document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication type: Journal Publications
    Author according to the article: Granados-Casas, AO; Fernández-Bravo, A; Stchigel, AM; Cano-Lira, JF
    Link to the license of use: https://creativecommons.org/licenses/by/3.0/es/
    e-ISSN: 1471-2164
    Research group: Unitat de Micologia i Microbiologia Ambiental
    Subject areas: Genetics & heredity, Genetics, Agricultural sciences I, Biotechnology & applied microbiology, Biotechnology, Astronomy / physics
    Author email addresses: alanomar.granados@urv.cat, ana.fernandez@urv.cat, albertomiguel.stchigel@urv.cat, jose.cano@urv.cat
  • Keywords:

    Whole genome sequencing
    Taxonomy
    Software
    Quality assessment
    Phylogeny
    Onygenales
    Molecular sequence annotation
    Genomics
    Genome, fungal
    Fungi
    DNA contamination
    Databases, genetic
    Coverage
    Contamination
    Bacteria
    Ascomycota
    Algorithm
    Biotechnology
    Biotechnology & Applied Microbiology
    Genetics
    Genetics & Heredity
    Agricultural sciences I
    Astronomy / physics