Simulated wastewater sequencing data for benchmarking SARS-CoV-2 variant abundance estimation (Q6698994)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Simulated wastewater sequencing data for benchmarking SARS-CoV-2 variant abundance estimation |
Dataset published at Zenodo repository.
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Simulated wastewater sequencing data for benchmarking SARS-CoV-2 variant abundance estimation |
Dataset published at Zenodo repository. |
Statements
To evaluate the accuracy of variant abundancepredictions from wastewater sequencing, we built a collection of benchmarking datasets that resemble real wastewater samples. For each variant (B.1.1.7, B.1.351, B.1.427, B.1.429, P.1) we created a series of 33 benchmarks by simulating sequencing reads from a variant genome, as well as a collection of background (non-variant of concern/interest) sequences, such that the variant abundance ranges from 0.05% to 100%. Analogously, we created a second series of benchmarks, simulating reads only from the Spike gene of each SARS-CoV-2 genome. We refer to the first set of benchmarks as whole genome (WG)and to the second set of benchmarks as S-only. We repeated these simulations at different sequencing depths: 100x and 1000x coverage for the whole genome benchmarks, and 100x, 1000x, and 10,000x coverage for the S-only benchmarks.
0 references
29 August 2021
0 references