Underfox on Twitter: "For the first time, researchers have developed a new GPU-based framework to perform sparse general matrix matrix multiplication using Nvidia Tensor Cores. https://t.co/tdlQKUmJWV https://t.co/HkvoELpDV8" / Twitter
Running a parallel matrix multiplication program using CUDA on FutureGrid
Accelerating Matrix Multiplication with Block Sparse Format and NVIDIA Tensor Cores | NVIDIA Technical Blog
CUDA – Matrix Multiplication | The Elancer
Matrix Multiplication Optimization – Brian C. Becker
Nvidia's GeForce RTX 3080 Ti GPU Enters The Matrix | Tom's Hardware