NASCUP: Nucleic Acid Sequence Classification by Universal Probability

Publication:6267459

arXiv: 1511.04944 | MaRDI QID: Q6267459

Author name not available

Publication date: 16 November 2015

Abstract: Motivated by the need for fast and accurate classification of unlabeled nucleotide sequences on a large scale, we developed NASCUP, a new classification method that captures statistical structures of nucleotide sequences by compact context-tree models and universal probability from information theory. NASCUP achieved BLAST-like classification accuracy consistently for several large-scale databases in orders-of-magnitude reduced runtime, and was applied to other bioinformatics tasks such as outlier detection and synthetic sequence generation.
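The abstract does not describe the algorithm in detail, so the following is only a rough illustration of the general idea of classifying a sequence by the probability that per-class context models assign to it. It is not NASCUP's implementation: it assumes a fixed context length instead of a context tree, uses add-1/2 (Krichevsky-Trofimov style) smoothing of training counts, and accepts only the symbols A, C, G, T. The names `train_counts`, `log_prob`, `classify`, and the `order` parameter are hypothetical.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"  # assumption: input contains only these four symbols


def train_counts(sequences, order=2):
    """Count next-symbol occurrences after every length-`order` context."""
    counts = defaultdict(lambda: [0] * len(ALPHABET))
    for seq in sequences:
        for i in range(order, len(seq)):
            context = seq[i - order:i]
            counts[context][ALPHABET.index(seq[i])] += 1
    return counts


def log_prob(seq, counts, order=2):
    """Log2-probability of `seq` under add-1/2 smoothed context counts."""
    total = 0.0
    for i in range(order, len(seq)):
        context = seq[i - order:i]
        # Unseen contexts fall back to all-zero counts, i.e. a uniform estimate.
        c = counts.get(context, [0] * len(ALPHABET))
        k = ALPHABET.index(seq[i])
        total += math.log2((c[k] + 0.5) / (sum(c) + 0.5 * len(ALPHABET)))
    return total


def classify(seq, models, order=2):
    """Return the label whose model assigns `seq` the highest probability."""
    return max(models, key=lambda label: log_prob(seq, models[label], order))
```

Under these assumptions, one would build `models` as a dictionary mapping each class label to the result of `train_counts` on that class's training sequences, then call `classify` on an unlabeled sequence. A sequence is assigned to the class whose model compresses it best, which is the intuition behind probability-based classification.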




Has companion code repository: https://github.com/nascup/nascup








