DAVI: A Dataset for Automatic Variant Interpretation - MaRDI portal



From MaRDI portal



DOI: 10.5281/zenodo.12697421; Zenodo: 12697421; MaRDI QID: Q6722199

Dataset published at Zenodo repository.

Author name not available

Publication date: 9 July 2024

Copyright license: No records found.



The analysis of an individual's genetic material may uncover genetic variants, which can be classified as disease-causing (pathogenic) or benign. Identifying pathogenic variants among millions of variants relies on the search for evidence in support of or against variant pathogenicity, a process regulated by the American College of Medical Genetics and Genomics (ACMG) guidelines, which leverage data from the scientific literature. Despite recent improvements towards automation, searching the literature for pieces of evidence of pathogenicity still requires manual curation, a time-consuming process given the ever-growing number of published papers. In this work, we built DAVI (Dataset for Automatic Variant Interpretation), a reliable, manually curated dataset comprising 1239 sentences extracted from 311 (variant, article) associations for a pool of 41 variants. Of these, 597 sentences contain (positive) evidence activating one of two opposing ACMG criteria, namely PS3 and BS3, while the remaining 642 do not contain evidence activating either of the two criteria (negative). (variant, article) associations containing at least one positive sentence are classified as positive, while (variant, article) associations containing no positive sentence are classified as negative. Accordingly, DAVI also contains 154 positive and 157 negative (variant, article) associations.
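
The association-level rule described above (an association is positive if at least one of its sentences is positive, negative otherwise) can be sketched in a few lines of Python. This is only an illustrative sketch: the tuple layout `(variant, article, is_positive)`, the function name `label_associations`, and the toy identifiers are assumptions made for the example and do not reflect the actual file format or schema of the dataset.

```python
from collections import defaultdict


def label_associations(sentences):
    """Aggregate sentence-level labels into (variant, article) labels.

    `sentences` is an iterable of (variant, article, is_positive) tuples.
    This layout is hypothetical; it is not the dataset's real schema.
    """
    labels = defaultdict(bool)  # every association starts as negative
    for variant, article, is_positive in sentences:
        key = (variant, article)
        # One positive sentence is enough to make the association positive.
        labels[key] = labels[key] or bool(is_positive)
    return dict(labels)


# Toy rows with made-up identifiers, for illustration only.
toy = [
    ("variant_A", "article_1", False),
    ("variant_A", "article_1", True),
    ("variant_A", "article_2", False),
]
result = label_associations(toy)
```

Here `result` maps each (variant, article) pair to `True` (positive) or `False` (negative), so the first toy association is positive and the second is negative.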





