Markov reliability models of fault-tolerant distributed computing systems (Q1822487)

From MaRDI portal





scientific article; zbMATH DE number 4003494
Language Label Description Also known as
English
Markov reliability models of fault-tolerant distributed computing systems
scientific article; zbMATH DE number 4003494

    Statements

    Markov reliability models of fault-tolerant distributed computing systems (English)
    0 references
    0 references
    0 references
    0 references
    1986
    0 references
    A hierarchical view of fault-tolerant distributed computers is presented, viewing a distributed computing system as composed of interconnected, interacting, functional modules. Each module, modeled by a directed-state graph, is governed by internal random failure events and counteracting recovery processes, and also by coupling of external random events from other modules. It is shown that, under certain assumptions, the system is governed by a multidimensional Markov process, with non-Markov module processes as components. Mathematical properties of this model are formally analyzed. Performance measures are found from the steady-state distribution and visitation rate of each system and module state. A numerical example is presented exemplifying its practical application. The results are shown to fit very well the actual statistical data collected on an AT\&T bell Laboratories Electronic Switching System.
    0 references
    random failure events
    0 references
    recovery processes
    0 references
    multidimensional Markov process
    0 references
    Performance measures
    0 references

    Identifiers