Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Multi-method gene clusters at species-level resolution for 125 prokaryotic pangenomes - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Multi-method gene clusters at species-level resolution for 125 prokaryotic pangenomes

From MaRDI portal



DOI10.5281/zenodo.8406578Zenodo8406578MaRDI QIDQ6696458

Dataset published at Zenodo repository.

Author name not available (Why is that?)

Publication date: 4 October 2023

Copyright license: No records found.



This dataset contains 9 sets of species-level gene clusters and high-resolution species trees for 125 representative bacterial and archaeal species, encompassing a total of 6,851 nearly complete genomes. Each set represents a different approach to homology-, orthology-, and synteny-based gene clustering as implemented by 6 popular tools for comparative genomics and pangenome analysis (Roary, panX, OrthoFinder, MMseqs2/PanACoTa, CD-HIT, and eggNOG-mapper). For Escherichia coli, Cutibacterium acnes, Bacteroides uniformis, and Staphylococcus epidermidis, we provide additional sets that combine high-quality genomes with different proportions of medium- and low-quality metagenome-assembled genomes (MAGs). This dataset is a helpful resource for benchmarking gene clustering tools and pangenome analysis workflows, as well as for testing their robustness with respect to the presence of incomplete or contaminated genomic assemblies. Reference: Manzano-Morales S, Liu Y, Gonzlez-Bod S, Huerta-Cepas J, Iranzo J. 2022. Comparison of gene clustering criteria reveals intrinsic uncertainty in pangenome analyses. bioRxiv doi: 10.1101/2022.09.25.509376






This page was built for dataset: Multi-method gene clusters at species-level resolution for 125 prokaryotic pangenomes