Parallel complexity of matrix multiplication (Q1404305)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Parallel complexity of matrix multiplication |
scientific article; zbMATH DE number 1968850
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Parallel complexity of matrix multiplication |
scientific article; zbMATH DE number 1968850 |
Statements
Parallel complexity of matrix multiplication (English)
0 references
21 August 2003
0 references
Matrix multiplication is a kernel of many numerical algorithms, thus finding efficient and parallel solution to this problem is one of the most important topics in scientific computing. In this paper, the parallel complexity of multiplying two matrices (using the standard row/column method) on parallel distributed-memory computers is studied. The author considers three basic aspects of designing distributed algorithms, namely local computational tasks, the initial data layout, and the communication schedule with respect to LogP, the well known theoretical model of parallel distributed-memory computers, which allows to consider the basic characteristics of an interconnection network regardless of its physical structure. The main result describes the lower bounds on running time of distributed matrix multiplication. Then several algorithms with running times within a constant factor of the lower bounds are described and analyzed. It should be pointed out that this excellent paper provides some important facts about designing optimal distributed-memory parallel algorithms. For example it defines several classes of algorithms, data layouts and communication schedules which collectively produce the optimal running times.
0 references
parallel complexity
0 references
matrix multiplication
0 references
parallel models
0 references
LogP model
0 references
lower bounds
0 references
parallel computation
0 references
scientific computing
0 references
distributed-memory computers
0 references
algorithms
0 references