May 1, 2023 · In this paper, we present a design for a high-performance GEMM with algorithm-based fault tolerance for use on GPUs.
Jun 21, 2023 · In this paper, we present a design of a high-performance GPU-based GEMM that integrates an algorithm-based fault tolerance scheme that detects ...
Jun 23, 2023 · We propose a template-based code generation strategy to automatically generate high-performance GEMM kernels with or without fault tolerance for ...
A design of a high-performance GPU-based GEMM that integrates an algorithm-based fault tolerance scheme that detects and corrects silent data corruptions at ...
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs · Install and compile the code by running the following command. · Test the FT-SGEMM by ...
Anatomy of high-performance gemm with online fault tolerance on gpus. S Wu, Y ... Ft-gemm: A fault tolerant high performance gemm implementation on x86 cpus.
Our experimental evaluations on NVIDIA T4 GPU and A100 GPU demonstrate that FT K-Means without fault tolerance outperforms cuML's K-Means implementation, ...
May 9, 2023 · We incorporate the fault tolerant functionality at algorithmic level by fusing the memory-intensive operations into the GEMM assembly kernels.
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs. 2023, Proceedings of the International Conference on Supercomputing. Resilient error ...
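Several of the snippets above mention an algorithm-based fault tolerance scheme that detects and corrects silent data corruptions in GEMM. As a rough illustration of the general idea only, the sketch below uses the classic row and column checksum approach on small list-of-lists matrices in plain Python. It is not the fused GPU kernel design from the paper; the function names `matmul`, `abft_checksums`, and `detect_and_correct` are hypothetical, and the sketch assumes at most one corrupted element in the result.

```python
def matmul(a, b):
    """Plain matrix product C = A @ B for list-of-lists matrices."""
    k = len(b)
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(len(b[0]))]
            for i in range(len(a))]


def abft_checksums(a, b):
    """Expected row sums and column sums of C, computed from A and B only.

    row_check = A @ (B @ e) and col_check = (e^T @ A) @ B, where e is the
    all-ones vector. Neither quantity touches the computed C.
    """
    k = len(b)
    b_row_sums = [sum(row) for row in b]                      # B @ e
    a_col_sums = [sum(row[p] for row in a) for p in range(k)]  # e^T @ A
    row_check = [sum(row[p] * b_row_sums[p] for p in range(k)) for row in a]
    col_check = [sum(a_col_sums[p] * b[p][j] for p in range(k))
                 for j in range(len(b[0]))]
    return row_check, col_check


def detect_and_correct(c, row_check, col_check, tol=1e-9):
    """Find and repair a single corrupted element of c in place.

    Returns the (row, col) that was repaired, or None if c is consistent.
    Raises ValueError when the mismatch pattern is not a single element
    (an assumption of this sketch, not a general limit of the technique).
    """
    bad_rows = [i for i, row in enumerate(c)
                if abs(sum(row) - row_check[i]) > tol]
    bad_cols = [j for j in range(len(c[0]))
                if abs(sum(row[j] for row in c) - col_check[j]) > tol]
    if not bad_rows and not bad_cols:
        return None
    if len(bad_rows) != 1 or len(bad_cols) != 1:
        raise ValueError("mismatch pattern is not a single corrupted element")
    i, j = bad_rows[0], bad_cols[0]
    # The row-sum discrepancy equals the negated error in c[i][j].
    c[i][j] += row_check[i] - sum(c[i])
    return (i, j)


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    row_check, col_check = abft_checksums(a, b)
    c = matmul(a, b)
    c[1][0] += 100  # simulate a silent data corruption
    print(detect_and_correct(c, row_check, col_check), c)
```

The intersection of the mismatching row and column locates the faulty element, and the size of the row mismatch gives the correction. With floating-point data, `tol` would need to scale with the matrix size and value range, which this sketch does not attempt.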