Data Sets and Results for "Improved data sets and evaluation methods for the automatic prediction of DNA-binding proteins" - MaRDI portal

From MaRDI portal



DOI: 10.5281/zenodo.5153906
Zenodo: 5153906
MaRDI QID: Q6702189

Dataset published at Zenodo repository.

Author name not available

Publication date: 2 August 2021

Copyright license: No records found.



Data sets and results for "Improved data sets and evaluation methods for the automatic prediction of DNA-binding proteins".

The file dna_binding_protein_sequences.zip has the training and testing sets from the paper:

RLL - random_train/test_full_1000.csv
RSL - random_train/test_40.csv
RSLL - random_train/test_40_1000.csv
RLL where included positive examples have verified DNA binding activity - random_train/test_hq_1000.csv
The 10 RSLL data sets - random_train/test_40_1000.csv + random_train/test_40_1000_cv_0-8.csv

The results files are named similarly. See see_results.ipynb in the codebase that supplements these data sets.

The species data sets are derived from uniprot_data_bac.tab and uniprot_data_not_bac.tab. See code.

The ESM embeddings used by the XGBoost model are in dna_binding_protein_esm.zip.
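The exact member names inside the archive are abbreviated above (for example, random_train/test_40.csv stands for a train/test pair), so it can help to inspect the archive before loading anything. The following is a minimal sketch, not part of the original codebase: the helper names list_csv_members and split_train_test are hypothetical, and it assumes only that dna_binding_protein_sequences.zip has been downloaded to the working directory and that the training and testing files contain "train" or "test" in their names.

```python
import zipfile
from pathlib import Path


def list_csv_members(zip_path):
    """Return the sorted names of all .csv members in the archive."""
    with zipfile.ZipFile(zip_path) as archive:
        return sorted(
            name for name in archive.namelist() if name.lower().endswith(".csv")
        )


def split_train_test(names):
    """Group member names by whether the file name contains 'train' or 'test'.

    This naming rule is an assumption based on the dataset description,
    not something verified against the archive itself.
    """
    groups = {"train": [], "test": [], "other": []}
    for name in names:
        base = Path(name).name.lower()
        if "train" in base:
            groups["train"].append(name)
        elif "test" in base:
            groups["test"].append(name)
        else:
            groups["other"].append(name)
    return groups


# Only runs if the archive is present in the working directory.
archive_path = Path("dna_binding_protein_sequences.zip")
if archive_path.exists():
    members = list_csv_members(archive_path)
    for kind, names in split_train_test(members).items():
        print(kind, len(names))
```

Listing the members first avoids guessing at full file names; any individual file can then be read from the archive once its real name is known.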





