Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units - MaRDI portal

A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units

From MaRDI portal

Publication:2940025

Jump to:navigation, search

DOI10.1137/130930352zbMath1307.65055arXiv1307.6209OpenAlexW2101511474MaRDI QIDQ2940025

Moritz Kreutzer, A. R. Bishop, Georg Hager, Gerhard Wellein, Holger Fehske

Publication date: 23 January 2015

Published in: SIAM Journal on Scientific Computing (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1307.6209

zbMATH Keywords

algorithm numerical examples sparse matrix performance model sparse matrix-vector multiplication single instruction multiple data data format

Mathematics Subject Classification ID

Computational methods for sparse matrices (65F50) Complexity and performance of numerical algorithms (65Y20) Packaged methods for numerical algorithms (65Y15)

Related Items

GPU-accelerated preconditioned GMRES method for two-dimensional Maxwell's equations ⋮ A Task-Scheduling Approach for Efficient Sparse Symmetric Matrix-Vector Multiplication on a GPU ⋮ Increasing the Performance of the Jacobi--Davidson Method by Blocking ⋮ A Factored Sparse Approximate Inverse Preconditioned Conjugate Gradient Solver on Graphics Processing Units ⋮ Optimal strategy for modelling turbulent flows with ensemble averaging on high performance computing systems ⋮ A two-scale approach for efficient on-the-fly operator assembly in massively parallel high performance multigrid codes ⋮ A new sparse matrix vector multiplication graphics processing unit algorithm designed for finite element problems ⋮ Algebraic Multigrid Using a Stencil–CSR Hybrid Format on GPUs ⋮ Development of an unstructured mesh gyrokinetic particle-in-cell code for exascale fusion plasma simulations on GPUs ⋮ Estimating the effect of indices compression in the CSR-like data storage formats for matrix-vector multiplications and solving linear systems ⋮ The \textsc{Dune} framework: basic concepts and recent developments ⋮ Strategies for the Vectorized Block Conjugate Gradients Method ⋮ GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review ⋮ Efficient CSR-based sparse matrix-vector multiplication on GPU ⋮ A novel CSR-based sparse matrix-vector multiplication on GPUs ⋮ SELL_C_sigma ⋮ ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures ⋮ Benefits from using mixed precision computations in the ELPA-AEO and ESSEX-II eigensolver projects ⋮ Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units

Uses Software

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2940025&oldid=15925178"