Pages that link to "Item:Q991102"
From MaRDI portal
The following pages link to Towards dense linear algebra for hybrid GPU accelerated manycore systems (Q991102):
Displaying 32 items.
- Performance models and workload distribution algorithms for optimizing a hybrid CPU-GPU multifrontal solver (Q316653)
- Extending the length and time scales of Gram-Schmidt Lyapunov vector computations (Q347779)
- GPU accelerated computation of the isogeometric analysis stiffness matrix (Q461008)
- Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing (Q608851)
- A new approach to the lattice Boltzmann method for graphics processing units (Q646031)
- A new era in scientific computing: domain decomposition methods in hybrid CPU-GPU architectures (Q653690)
- Direct numerical simulations of reacting flows with detailed chemistry using many-core/GPU acceleration (Q1615464)
- A parallel computing method using blocked format with optimal partitioning for SpMV on GPU (Q1678174)
- Direct numerical simulations of turbulent reacting flows with shock waves and stiff chemistry using many-core/GPU acceleration (Q2028165)
- Productivity, performance, and portability for computational fluid dynamics applications (Q2294028)
- A heterogeneous parallel LU factorization algorithm based on a basic column block uniform allocation strategy (Q2298336)
- LU factorization on heterogeneous systems: an energy-efficient approach towards high performance (Q2403146)
- GPU-acceleration of stiffness matrix calculation and efficient initialization of EFG meshless methods (Q2449903)
- Quantum circuits synthesis using Householder transformations (Q2698830)
- Divide and conquer on hybrid GPU-accelerated multicore systems (Q2904837)
- Computing Least Squares Condition Numbers on Hybrid Multicore/GPU Systems (Q3459696)
- Adapting Regularized Low-Rank Models for Parallel Architectures (Q4646456)
- A novel, blocked algorithm for the reduction to Hessenberg-triangular form (Q5060666)
- GPU parameter tuning for tall and skinny dense linear least squares problems (Q5113719)
- Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems (Q5161159)
- Randomized GPU Algorithms for the Construction of Hierarchical Matrices from Matrix-Vector Operations (Q5230631)
- A linear algebra method to decompose forms whose length is lower than the number of variables into weighted sum of squares (Q5240712)
- Simulating Low Precision Floating-Point Arithmetic (Q5241264)
- Exploiting Lower Precision Arithmetic in Solving Symmetric Positive Definite Linear Systems and Least Squares Problems (Q5857837)
- ELSI -- an open infrastructure for electronic structure solvers (Q6040122)
- GPU acceleration of all-electron electronic structure theory using localized numeric atom-centered basis functions (Q6040775)
- GPU-acceleration of the ELPA2 distributed eigensolver for dense symmetric and Hermitian eigenproblems (Q6159210)
- Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct Methods (Q6487403)
- ARKODE: a flexible IVP solver infrastructure for one-step methods (Q6601375)
- A LAPACK implementation of the dynamic mode decomposition (Q6604151)
- DG-IMEX method for a two-moment model for radiation transport in the \(\mathcal{O}(v/c)\) limit (Q6648382)
- GPU accelerated Newton for Taylor series solutions of polynomial homotopies in multiple double precision (Q6660339)