A C++ library for accelerating mixed-precision matrix multiply-accumulate (MMA) operations on AMD GPU hardware. rocWMMA makes it easier to break MMA problems down into fragments and to distribute block-wise MMA operations in parallel across GPU wavefronts.
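The block-wise decomposition mentioned above can be sketched on the CPU in plain C++. This is only an illustration of the idea of splitting D = A * B + C into small tiles ("fragments") that could each be handled independently. It does not use the rocWMMA API or run on a GPU, and the names `tiled_mma` and `kTile` are hypothetical, chosen for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical tile edge for illustration; real fragment shapes depend on
// the GPU architecture and data types.
constexpr std::size_t kTile = 2;

// Computes D = A * B + C for row-major square n x n matrices, one
// kTile x kTile output block at a time. n must be a multiple of kTile.
std::vector<float> tiled_mma(const std::vector<float>& a,
                             const std::vector<float>& b,
                             const std::vector<float>& c,
                             std::size_t n)
{
    assert(n % kTile == 0);
    assert(a.size() == n * n && b.size() == n * n && c.size() == n * n);

    std::vector<float> d(c); // start from the accumulator C

    for (std::size_t bi = 0; bi < n; bi += kTile)             // block row of D
        for (std::size_t bj = 0; bj < n; bj += kTile)         // block column of D
            for (std::size_t bk = 0; bk < n; bk += kTile)     // step along the shared dimension
                // Multiply one tile of A by one tile of B and
                // accumulate the product into the matching tile of D.
                for (std::size_t i = bi; i < bi + kTile; ++i)
                    for (std::size_t j = bj; j < bj + kTile; ++j)
                        for (std::size_t k = bk; k < bk + kTile; ++k)
                            d[i * n + j] += a[i * n + k] * b[k * n + j];

    return d;
}
```

Each (bi, bj) output block depends only on one block row of A and one block column of B, which is what allows such blocks to be distributed across parallel workers; on a GPU those workers would be wavefronts.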
URL: https://github.com/ROCm/rocWMMA
Author: Advanced Micro Devices, Inc.
Maintainer: The T2 Project <t2 [at] t2-project [dot] org>
License: MIT
Status: Stable
Version: 6.3.3
Remark: Does cross compile (as setup and patched in T2).
Download: https://github.com/ROCm/rocWMMA/ rocWMMA-rocm-6.3.3.tar.gz
T2 source: rocwmma.desc
Build time (on reference hardware): n.a.
Installed size (on reference hardware): n.a.
Dependencies (build time detected): n.a.
Installed files (on reference hardware): n.a.
1) This page was automatically generated from the T2 package source. Corrections, such as dead links, URL changes or typos, need to be made directly in that source.
2) Compatible with Linux From Scratch's "Standard Build Unit" (SBU).