Accelerating BLAS and LAPACK via Efficient Floating Point Architecture Design
From MaRDI portal
Publication:5087834
DOI10.1142/S0129626417500062zbMath1492.65375arXiv1610.08705MaRDI QIDQ5087834
Anupam Chattopadhyay, Samir Kumar Nandy, Farhad Merchant, Soumyendu Raha, Ranjani Narayan
Publication date: 4 July 2022
Published in: Parallel Processing Letters (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1610.08705
parallel computinghigh-performance computingfloating point unitinstruction level parallelismpower-performance trade-offs
Parallel numerical computation (65Y05) Numerical linear algebra (65Fxx) Numerical algorithms for specific classes of architectures (65Y10)
Uses Software
Cites Work
This page was built for publication: Accelerating BLAS and LAPACK via Efficient Floating Point Architecture Design