Autor segons l'article: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix
Departament: Bioquímica i Biotecnologia
Autor/s de la URV: Garcia Vallve, Santiago / Gimeno Vives, Aleix / Pujadas Anguiano, Gerard
Paraules clau: Databases, protein; Humans; Ligands; Pdbx/mmcif; Protein binding; Protein conformation; Protein data bank; Protein-ligand complexes; Proteins; Protein–ligand complexes; Sars-cov-2; Software; Structural bioinformatics; Structure-based drug discovery; Structure‐based drug discovery
Resum: The Protein Data Bank (PDB) contains more than 235,000 three-dimensional biostructures and is growing at a rate of nearly 10% per year. The PDB is essential to gain knowledge on how proteins and ligands interact and how these interactions are correlated with the quantitative activity of each ligand/target pair. Unfortunately, the lack of a tool that can classify structures as apo or holo, that is by no means straightforward, and differentiate between covalent and non-covalent ligand-protein complexes makes it difficult to obtain the structures that belong to each set. To address this issue, we present PDB-CAT, a user-friendly tool that facilitates the categorization and extraction of key information from PDBx/mmCIF files through an efficient parallelized implementation. PDB-CAT uses a blacklist-based approach to automatically identify the ligand in a complex. It then classifies the PDB files based on ligand presence: structures without a ligand are classified as apo, whereas those with a ligand are classified as covalently or non-covalently bound, depending on the type of binding. As well as making this classification, the program can verify if there are any mutations in the protein sequence by comparing it to a reference sequence. An example is included to illustrate two different uses: the classification of SARS-CoV-2 Main Protease complexes depending on their variant, and the complete screening of the PDBbindv2020, achieved in
Àrees temàtiques: Biochemistry; Biochemistry & molecular biology; Biotecnología; Ciências biológicas i; Ciências biológicas ii; Ciências biológicas iii; Farmacia; Interdisciplinar; Medicine (miscellaneous); Molecular biology; Química
Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
Adreça de correu electrònic de l'autor: gerard.pujadas@urv.cat; santi.garcia-vallve@urv.cat; aleix.gimeno@urv.cat; aleix.gimeno@urv.cat; aleix.gimeno@urv.cat
Data d'alta del registre: 2026-02-13
Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
Enllaç font original: https://onlinelibrary.wiley.com/doi/10.1002/pro.70379
Referència a l'article segons font original: Protein Science. 34 (12): e70379-
Referència de l'ítem segons les normes APA: Llop-Peiro, Ariadna; Trujillo-De Leon, Said; Pujadas, Gerard; Garcia-Vallve, Santiago; Gimeno, Aleix (2025). PDB-CAT: A user-friendly tool to classify and analyze PDB protein-ligand complexes. Protein Science, 34(12), e70379-. DOI: 10.1002/pro.70379
URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
DOI de l'article: 10.1002/pro.70379
Entitat: Universitat Rovira i Virgili
Any de publicació de la revista: 2025-11-12
Tipus de publicació: Journal Publications