Protein Function Embeddings: First Beta Release of Datasets (Q6693412)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: [[]] |
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Protein Function Embeddings: First Beta Release of Datasets |
Dataset published at Zenodo repository. |
Statements
This release corresponds to the datasets generated from athesis work that explores how information for protein functions can be exploited through embeddings so that the produced information can be used to improve protein function annotations. The underlying hypothesis here is that any pair of proteins with high sequence similarity will also share a similar biological function which would be reflected by the corresponding protein embeddings. The comparison and evaluation of this is done using two text-driven embedding approaches: Word2doc2Vec and Hybrid-Word2doc2Vec.
0 references
2 April 2023
0 references
v1.0.0
0 references