Articles producció científica> Bioquímica i Biotecnologia

An Unsupervised Algorithm for Host Identification in Flaviviruses

  • Identification data

    Identifier: imarina:9216859
    Authors:
    Phuoc Truong NguyenGarcia-Valle, SantiagoPuigbo, Pere
    Abstract:
    Early characterization of emerging viruses is essential to control their spread, such as the Zika Virus outbreak in 2014. Among other non-viral factors, host information is essential for the surveillance and control of virus spread. Flaviviruses (genus Flavivirus), akin to other viruses, are modulated by high mutation rates and selective forces to adapt their codon usage to that of their hosts. However, a major challenge is the identification of potential hosts for novel viruses. Usually, potential hosts of emerging zoonotic viruses are identified after several confirmed cases. This is inefficient for deterring future outbreaks. In this paper, we introduce an algorithm to identify the host range of a virus from its raw genome sequences. The proposed strategy relies on comparing codon usage frequencies across viruses and hosts, by means of a normalized Codon Adaptation Index (CAI). We have tested our algorithm on 94 flaviviruses and 16 potential hosts. This novel method is able to distinguish between arthropod and vertebrate hosts for several flaviviruses with high values of accuracy (virus group 91.9% and host type 86.1%) and specificity (virus group 94.9% and host type 79.6%), in comparison to empirical observations. Overall, this algorithm may be useful as a complementary tool to current phylogenetic methods in monitoring current and future viral outbreaks by understanding host-virus relationships.
  • Others:

    Author, as appears in the article.: Phuoc Truong Nguyen; Garcia-Valle, Santiago; Puigbo, Pere;
    Department: Bioquímica i Biotecnologia
    URV's Author/s: Garcia Vallve, Santiago / PUIGBÒ AVALOS, PEDRO
    Keywords: Virus Host identification Genus Flavivirus Codon usage bias Codon adaptation index Algorithm Adaptation
    Abstract: Early characterization of emerging viruses is essential to control their spread, such as the Zika Virus outbreak in 2014. Among other non-viral factors, host information is essential for the surveillance and control of virus spread. Flaviviruses (genus Flavivirus), akin to other viruses, are modulated by high mutation rates and selective forces to adapt their codon usage to that of their hosts. However, a major challenge is the identification of potential hosts for novel viruses. Usually, potential hosts of emerging zoonotic viruses are identified after several confirmed cases. This is inefficient for deterring future outbreaks. In this paper, we introduce an algorithm to identify the host range of a virus from its raw genome sequences. The proposed strategy relies on comparing codon usage frequencies across viruses and hosts, by means of a normalized Codon Adaptation Index (CAI). We have tested our algorithm on 94 flaviviruses and 16 potential hosts. This novel method is able to distinguish between arthropod and vertebrate hosts for several flaviviruses with high values of accuracy (virus group 91.9% and host type 86.1%) and specificity (virus group 94.9% and host type 79.6%), in comparison to empirical observations. Overall, this algorithm may be useful as a complementary tool to current phylogenetic methods in monitoring current and future viral outbreaks by understanding host-virus relationships.
    Thematic Areas: Space and planetary science Paleontology General biochemistry,genetics and molecular biology Ecology, evolution, behavior and systematics Biology Biochemistry, genetics and molecular biology (miscellaneous) Biochemistry, genetics and molecular biology (all)
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Author's mail: santi.garcia-vallve@urv.cat
    Author identifier: 0000-0002-0348-7497
    Record's date: 2024-10-26
    Papper version: info:eu-repo/semantics/publishedVersion
    Link to the original source: https://www.mdpi.com/2075-1729/11/5/442
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Papper original source: Life. 11 (5):
    APA: Phuoc Truong Nguyen; Garcia-Valle, Santiago; Puigbo, Pere; (2021). An Unsupervised Algorithm for Host Identification in Flaviviruses. Life, 11(5), -. DOI: 10.3390/life11050442
    Article's DOI: 10.3390/life11050442
    Entity: Universitat Rovira i Virgili
    Journal publication year: 2021
    Publication Type: Journal Publications
  • Keywords:

    Biochemistry, Genetics and Molecular Biology (Miscellaneous),Biology,Ecology, Evolution, Behavior and Systematics,Paleontology,Space and Planetary Science
    Virus
    Host identification
    Genus
    Flavivirus
    Codon usage bias
    Codon adaptation index
    Algorithm
    Adaptation
    Space and planetary science
    Paleontology
    General biochemistry,genetics and molecular biology
    Ecology, evolution, behavior and systematics
    Biology
    Biochemistry, genetics and molecular biology (miscellaneous)
    Biochemistry, genetics and molecular biology (all)
  • Documents:

  • Cerca a google

    Search to google scholar