Automatic classification of protein structure using the maximum contact map overlap metric (Q1736719)

From MaRDI portal





scientific article; zbMATH DE number 7042289
Language Label Description Also known as
English
Automatic classification of protein structure using the maximum contact map overlap metric
scientific article; zbMATH DE number 7042289

    Statements

    Automatic classification of protein structure using the maximum contact map overlap metric (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    26 March 2019
    0 references
    Summary: In this work, we propose a new distance measure for comparing two protein structures based on their contact map representations. We show that our novel measure, which we refer to as the maximum contact map overlap (max-CMO) metric, satisfies all properties of a metric on the space of protein representations. Having a metric in that space allows one to avoid pairwise comparisons on the entire database and, thus, to significantly accelerate exploring the protein space compared to no-metric spaces. We show on a gold standard superfamily classification benchmark set of 6759 proteins that our exact \(k\)-nearest neighbor (\(k\)-NN) scheme classifies up to 224 out of 236 queries correctly and on a larger, extended version of the benchmark with 60, 850 additional structures, up to 1361 out of 1369 queries. Our \(k\)-NN classification thus provides a promising approach for the automatic classification of protein structures based on flexible contact map overlap alignments.
    0 references
    maximum contact map overlap
    0 references
    protein space metric
    0 references
    \(k\)-nearest neighbor classification
    0 references
    superfamily classification
    0 references
    scop
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references