Data from: Genome-scale annotation of protein binding sites via language model and geometric deep learning (Q6710241)
From MaRDI portal
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Data from: Genome-scale annotation of protein binding sites via language model and geometric deep learning | Dataset published at Zenodo repository. | |
Statements
The dataset contains training and test sets of protein binding sites for DNA, RNA, peptides, proteins, ATP, HEM (heme), Zn2+, Ca2+, Mg2+ and Mn2+. Each protein is represented by three lines giving, in order, the protein name (PDB accession code and chain), the sequence, and the per-residue labels (0 for non-binding, 1 for binding). ESMFold-predicted structures are also provided.
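The three-line record layout described above could be read with a small parser along the following lines. This is a minimal sketch, not code shipped with the dataset: the names `BindingRecord` and `parse_records` are hypothetical, and it assumes that the labels are written as one contiguous string of `0`/`1` characters with exactly one character per residue, and that a name line may carry a FASTA-style `>` prefix. Check both assumptions against the actual files before relying on it.

```python
from typing import Iterator, List, NamedTuple


class BindingRecord(NamedTuple):
    """One protein entry (hypothetical container, not part of the dataset)."""
    name: str          # protein name: PDB accession code and chain
    sequence: str      # amino-acid sequence
    labels: List[int]  # per-residue labels: 1 = binding, 0 = non-binding


def parse_records(text: str) -> Iterator[BindingRecord]:
    """Yield one BindingRecord per group of three non-empty lines."""
    # Drop blank lines so trailing newlines do not break the grouping.
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 3 != 0:
        raise ValueError("expected a multiple of three non-empty lines")

    for i in range(0, len(lines), 3):
        name, sequence, label_str = lines[i:i + 3]
        # Assumption: tolerate an optional FASTA-style '>' before the name.
        name = name.lstrip(">")
        # Assumption: one label character per residue.
        if len(sequence) != len(label_str):
            raise ValueError(f"{name}: sequence and label lengths differ")
        if set(label_str) - {"0", "1"}:
            raise ValueError(f"{name}: labels must be 0 or 1")
        yield BindingRecord(name, sequence, [int(c) for c in label_str])


def binding_positions(record: BindingRecord) -> List[int]:
    """Return the 0-based indices of residues labelled as binding."""
    return [i for i, label in enumerate(record.labels) if label == 1]
```

For example, passing the made-up record `"1abc_A\nMKTAY\n00110\n"` to `parse_records` yields a single entry whose `binding_positions` are the third and fourth residues (indices 2 and 3). Validating lengths up front is a deliberate choice: a sequence and label string that fall out of step usually mean the file does not follow the assumed layout.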
20 March 2024