On compound Poisson approximation for sequence matching (Q2711618)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: On compound Poisson approximation for sequence matching |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | On compound Poisson approximation for sequence matching |
scientific article |
Statements
24 April 2001
0 references
degree of local similarity between two molecular sequences
0 references
Stein-Chen method
0 references
total variation distance
0 references
On compound Poisson approximation for sequence matching (English)
0 references
Let \(A_1,\dots, A_m\) and \(B_1,\dots, B_n\) each be independent and identically distributed sequences of random variables from distributions \(\mu\) and \(\nu\), respectively, over a finite alphabet. The paper considers the distribution of the number \(W\) of pairs \(i\), \(j\) such that \(A_{i+l}= B_{j+l}\) for all but at most \(r\) values of \(l\), \(0\leq l\leq k-1\), for \(k\) suitably large; the motivation is to approximate the null hypothesis distributions of statistics measuring the degree of local similarity between two molecular sequences, where some mismatches are allowed. The Stein-Chen method is used to establish the accuracy of a compound Poisson approximation to the distribution of \(W\) with respect to the total variation distance; the distance between the two is shown to be asymptotically negligible in a wide variety of settings.
0 references