Articles producció científicaBioquímica i Biotecnologia

PDB-CAT: A user-friendly tool to classify and analyze PDB protein-ligand complexes

  • Dades identificatives

    Identificador:  imarina:9469032
    Autors:  Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix
    Resum:
    The Protein Data Bank (PDB) contains more than 235,000 three-dimensional biostructures and is growing at a rate of nearly 10% per year. The PDB is essential to gain knowledge on how proteins and ligands interact and how these interactions are correlated with the quantitative activity of each ligand/target pair. Unfortunately, the lack of a tool that can classify structures as apo or holo, that is by no means straightforward, and differentiate between covalent and non-covalent ligand-protein complexes makes it difficult to obtain the structures that belong to each set. To address this issue, we present PDB-CAT, a user-friendly tool that facilitates the categorization and extraction of key information from PDBx/mmCIF files through an efficient parallelized implementation. PDB-CAT uses a blacklist-based approach to automatically identify the ligand in a complex. It then classifies the PDB files based on ligand presence: structures without a ligand are classified as apo, whereas those with a ligand are classified as covalently or non-covalently bound, depending on the type of binding. As well as making this classification, the program can verify if there are any mutations in the protein sequence by comparing it to a reference sequence. An example is included to illustrate two different uses: the classification of SARS-CoV-2 Main Protease complexes depending on their variant, and the complete screening of the PDBbindv2020, achieved in
  • Altres:

    Autor segons l'article: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix
    Departament: Bioquímica i Biotecnologia
    Autor/s de la URV: Garcia Vallve, Santiago / Gimeno Vives, Aleix / Pujadas Anguiano, Gerard
    Paraules clau: Databases, protein; Humans; Ligands; Pdbx/mmcif; Protein binding; Protein conformation; Protein data bank; Protein-ligand complexes; Proteins; Protein–ligand complexes; Sars-cov-2; Software; Structural bioinformatics; Structure-based drug discovery; Structure‐based drug discovery
    Resum: The Protein Data Bank (PDB) contains more than 235,000 three-dimensional biostructures and is growing at a rate of nearly 10% per year. The PDB is essential to gain knowledge on how proteins and ligands interact and how these interactions are correlated with the quantitative activity of each ligand/target pair. Unfortunately, the lack of a tool that can classify structures as apo or holo, that is by no means straightforward, and differentiate between covalent and non-covalent ligand-protein complexes makes it difficult to obtain the structures that belong to each set. To address this issue, we present PDB-CAT, a user-friendly tool that facilitates the categorization and extraction of key information from PDBx/mmCIF files through an efficient parallelized implementation. PDB-CAT uses a blacklist-based approach to automatically identify the ligand in a complex. It then classifies the PDB files based on ligand presence: structures without a ligand are classified as apo, whereas those with a ligand are classified as covalently or non-covalently bound, depending on the type of binding. As well as making this classification, the program can verify if there are any mutations in the protein sequence by comparing it to a reference sequence. An example is included to illustrate two different uses: the classification of SARS-CoV-2 Main Protease complexes depending on their variant, and the complete screening of the PDBbindv2020, achieved in
    Àrees temàtiques: Biochemistry; Biochemistry & molecular biology; Biotecnología; Ciências biológicas i; Ciências biológicas ii; Ciências biológicas iii; Farmacia; Interdisciplinar; Medicine (miscellaneous); Molecular biology; Química
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Adreça de correu electrònic de l'autor: gerard.pujadas@urv.cat; santi.garcia-vallve@urv.cat; aleix.gimeno@urv.cat; aleix.gimeno@urv.cat; aleix.gimeno@urv.cat
    Data d'alta del registre: 2026-02-13
    Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
    Enllaç font original: https://onlinelibrary.wiley.com/doi/10.1002/pro.70379
    Referència a l'article segons font original: Protein Science. 34 (12): e70379-
    Referència de l'ítem segons les normes APA: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix (2025). PDB-CAT: A user-friendly tool to classify and analyze PDB protein-ligand complexes. Protein Science, 34(12), e70379-. DOI: 10.1002/pro.70379
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    DOI de l'article: 10.1002/pro.70379
    Entitat: Universitat Rovira i Virgili
    Any de publicació de la revista: 2025-11-12
    Tipus de publicació: Journal Publications
  • Paraules clau:

    Biochemistry,Biochemistry & Molecular Biology,Medicine (Miscellaneous),Molecular Biology
    Databases, protein
    Humans
    Ligands
    Pdbx/mmcif
    Protein binding
    Protein conformation
    Protein data bank
    Protein-ligand complexes
    Proteins
    Protein–ligand complexes
    Sars-cov-2
    Software
    Structural bioinformatics
    Structure-based drug discovery
    Structure‐based drug discovery
    Biochemistry
    Biochemistry & molecular biology
    Biotecnología
    Ciências biológicas i
    Ciências biológicas ii
    Ciências biológicas iii
    Farmacia
    Interdisciplinar
    Medicine (miscellaneous)
    Molecular biology
    Química
  • Documents:

  • Cerca a google

    Search to google scholar