Cache optimization and performance modeling of batched, small, and rectangular matrix multiplication on Intel, AMD, and Fujitsu processors (Q6601380)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Cache optimization and performance modeling of batched, small, and rectangular matrix multiplication on Intel, AMD, and Fujitsu processors |
scientific article; zbMATH DE number 7910038
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Cache optimization and performance modeling of batched, small, and rectangular matrix multiplication on Intel, AMD, and Fujitsu processors |
scientific article; zbMATH DE number 7910038 |
Statements
Cache optimization and performance modeling of batched, small, and rectangular matrix multiplication on Intel, AMD, and Fujitsu processors (English)
0 references
10 September 2024
0 references
low-rank matrix multiplication
0 references
batched matrix multiplication
0 references
Cache blocking
0 references
performance modeling
0 references
0 references
0 references