Sars-Cov-2 and Mers sequences from human host with no unknown characters (Q6694266)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Sars-Cov-2 and Mers sequences from human host with no unknown characters |
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Sars-Cov-2 and Mers sequences from human host with no unknown characters |
Dataset published at Zenodo repository. |
Statements
The datasets are organized as follows: first column, number of bases in a given sequence; second, third, fourth and fifth columns, number of bases of type A, C, G and T, respectively, in the same sequence. 1) Sars-Cov-2 dataset. This dataset contains number of bases for the complete genome sequences from a human host, with none unknown characters. In the NCBI database, there are about 950.000 sequences with these characteristics. 2) Restricted Sars-Cov-2 dataset: This dataset contains number of bases for the complete sequences from a human host, with no unknown characters, with 29903 bases, that is of the same length as the reference sequence NC045512.2. We obtained, from the NCBI database, about 5600 sequences with such features. 3) Mers dataset: This dataset contains number of bases for the complete sequences of about 200 complete genome sequences from a human host, with no unknown characters.
0 references
2 January 2024
0 references