Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The pan-genome of Saccharomyces cerevisiae - MaRDI portal

Deprecated: Use of MediaWiki\Skin\SkinTemplate::injectLegacyMenusIntoPersonalTools was deprecated in Please make sure Skin option menus contains `user-menu` (and possibly `notifications`, `user-interface-preferences`, `user-page`) 1.46. [Called from MediaWiki\Skin\SkinTemplate::getPortletsTemplateData in /var/www/html/w/includes/Skin/SkinTemplate.php at line 691] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

Deprecated: Use of QuickTemplate::(get/html/text/haveData) with parameter `personal_urls` was deprecated in MediaWiki Use content_navigation instead. [Called from MediaWiki\Skin\QuickTemplate::get in /var/www/html/w/includes/Skin/QuickTemplate.php at line 131] in /var/www/html/w/includes/Debug/MWDebug.php on line 372

The pan-genome of Saccharomyces cerevisiae

From MaRDI portal



DOI10.5281/zenodo.3407352Zenodo3407352MaRDI QIDQ6719885

Dataset published at Zenodo repository.

Author name not available (Why is that?)

Publication date: 13 March 2019

Copyright license: No records found.



Thesedatasets are related to The pan-genome of Saccharomyces cerevisiae (Li G., Ji B., and Nielsen J.). This deposition contains following datasets: (1) Genomes.tar.gz: a compressed file containing 1392 Saccharomyces cerevisiaegenome assembles analyzed (2) genome_information_2.0.tsv: a tab-separated text file that contains the basic information of above genomes, including genomeSize, contigNums, N50, busco_C(%), busco_S(%), busco_D(%), busco_F(%), busco_M(%), busco_n, number_of_genes, number_of_partial_genes, download_from, Eco_Source, Ploidy, Aneuploidies. (3) ClusterFasta.tar.gz: a compressed file that contains a list of fasta files. Each fasta file contains protein sequences in a cluster. The name of the fasta file is the name of the representative sequence of that cluster. (4) sc_gene_cluster_info_0.7_v4.tsv: a tab-separated text file that contains the properties of gene clusters. (5)gene_presence_absence_v4.tsv: a tab-separated text file that contains the gene-presence/absence information. Each columns is a gene cluster. Each row is a genome. Y/Nis used to present presence/absence. (6)gene_num_in_clusters_of_each_strain_v4.tsv: a tab-sparated text file that contains the gene number of each genome in each cluster (copy number).Each columns is a gene cluster. Each row is a genome. (7)feature_importances_cv5_pa_cnv.tsv: a tab-separated file that contains the feature importance from a random forest classifier in a 5-fold cross-validation approach. The classifier was trained on gene presence/absence table (PA) or copy number table (CNV). The columns pa_x indicate the feature importance in each fold of cross-validation on PA dataset.The columns cnv_x indicate the feature importance in each fold of cross-validation on CNVdataset.






This page was built for dataset: The pan-genome of Saccharomyces cerevisiae