Tesis doctoralsDepartament d'Enginyeria Química

Structural Pattern Recognition for Chemical-Compound Virtual Screening

  • Dades identificatives

    Identificador:  TDX:3798
    Autors:  García Hernández, Carlos Jesús
    Resum:
    Molecules are naturally shaped as networks, making them ideal for studying by employing their graph representations, where nodes represent atoms and edges represent the chemical bonds. An alternative for this straightforward representation is the extended reduced graph, which summarizes the chemical structures using pharmacophore-type node descriptions to encode the relevant molecular properties. Once we have a suitable way to represent molecules as graphs, we need to choose the right tool to compare and analyze them. Graph edit distance is used to solve the error-tolerant graph matching; this methodology estimates a distance between two graphs by determining the minimum number of modifications required to transform one graph into the other. These modifications (known as edit operations) have an edit cost (also known as transformation cost) associated, which must be determined depending on the problem. This study investigates the effectiveness of a graph-only driven molecular comparison employing extended reduced graphs and graph edit distance as a tool for ligand-based virtual screening applications. Those applications estimate the bioactivity of a chemical employing the bioactivity of similar compounds. An essential part of this study focuses on using machine learning and natural language processing techniques to optimize the transformation costs used in the molecular comparisons with the graph edit distance. Overall, this work shows a framework that combines graph reduction and comparison with optimization tools and natural language processing to identify bioactivity similarities in a structurally diverse group of molecules. We confirm the efficiency of this framework with several chemoinformatic tests applied to regression and classification problems over different publicly available datasets.
  • Altres:

    Editor: Universitat Rovira i Virgili
    Data: 2021-11-12, 2022-02-08T10:42:07Z, 2022-02-08T10:42:07Z
    Identificador: http://hdl.handle.net/10803/673441
    Departament/Institut: Departament d'Enginyeria Química, Universitat Rovira i Virgili.
    Idioma: eng
    Autor: García Hernández, Carlos Jesús
    Director: Serratosa Casanelles, Francesc, Fernández Sabater, Alberto
    Font: TDX (Tesis Doctorals en Xarxa)
    Format: application/pdf, application/pdf, 170 p.
  • Paraules clau:

    virtual screening
    graph edit distance
    molecular similarity
    filtrado virtual
    distancia de edición de grafos
    filtrat virtual
    istància d'edició de grafs
    semblança molecular
    519.1
    Ciències
  • Documents:

  • Cerca a google

    Search to google scholar