Articles producció científicaBioquímica i Biotecnologia

PDB-CAT: A user-friendly tool to classify and analyze PDB protein-ligand complexes

  • Identification data

    Identifier:  imarina:9469032
    Authors:  Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix
    Abstract:
    The Protein Data Bank (PDB) contains more than 235,000 three-dimensional biostructures and is growing at a rate of nearly 10% per year. The PDB is essential to gain knowledge on how proteins and ligands interact and how these interactions are correlated with the quantitative activity of each ligand/target pair. Unfortunately, the lack of a tool that can classify structures as apo or holo, that is by no means straightforward, and differentiate between covalent and non-covalent ligand-protein complexes makes it difficult to obtain the structures that belong to each set. To address this issue, we present PDB-CAT, a user-friendly tool that facilitates the categorization and extraction of key information from PDBx/mmCIF files through an efficient parallelized implementation. PDB-CAT uses a blacklist-based approach to automatically identify the ligand in a complex. It then classifies the PDB files based on ligand presence: structures without a ligand are classified as apo, whereas those with a ligand are classified as covalently or non-covalently bound, depending on the type of binding. As well as making this classification, the program can verify if there are any mutations in the protein sequence by comparing it to a reference sequence. An example is included to illustrate two different uses: the classification of SARS-CoV-2 Main Protease complexes depending on their variant, and the complete screening of the PDBbindv2020, achieved in
  • Others:

    Link to the original source: https://onlinelibrary.wiley.com/doi/10.1002/pro.70379
    APA: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix (2025). PDB-CAT: A user-friendly tool to classify and analyze PDB protein-ligand complexes. Protein Science, 34(12), e70379-. DOI: 10.1002/pro.70379
    Paper original source: Protein Science. 34 (12): e70379-
    Article's DOI: 10.1002/pro.70379
    Journal publication year: 2025-11-12
    Entity: Universitat Rovira i Virgili
    Paper version: info:eu-repo/semantics/publishedVersion
    Record's date: 2026-02-13
    URV's Author/s: Garcia Vallve, Santiago / Gimeno Vives, Aleix / Pujadas Anguiano, Gerard
    Department: Bioquímica i Biotecnologia
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication Type: Journal Publications
    Author, as appears in the article.: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Thematic Areas: Biochemistry, Biochemistry & molecular biology, Biotecnología, Ciências biológicas i, Ciências biológicas ii, Ciências biológicas iii, Farmacia, Interdisciplinar, Medicine (miscellaneous), Molecular biology, Química
    Author's mail: gerard.pujadas@urv.cat, santi.garcia-vallve@urv.cat, aleix.gimeno@urv.cat, aleix.gimeno@urv.cat, aleix.gimeno@urv.cat
  • Keywords:

    Databases
    protein
    Humans
    Ligands
    Pdbx/mmcif
    Protein binding
    Protein conformation
    Protein data bank
    Protein-ligand complexes
    Proteins
    Protein–ligand complexes
    Sars-cov-2
    Software
    Structural bioinformatics
    Structure-based drug discovery
    Structure‐based drug discovery
    Biochemistry
    Biochemistry & Molecular Biology
    Medicine (Miscellaneous)
    Molecular Biology
    Biotecnología
    Ciências biológicas i
    Ciências biológicas ii
    Ciências biológicas iii
    Farmacia
    Interdisciplinar
    Química
  • Documents:

  • Cerca a google

    Search to google scholar