rocblasdgemm
rocblas_dgemm is the double-precision general matrix-matrix multiplication routine in the ROCm library rocBLAS. It computes C = alpha * op(A) * op(B) + beta * C, where A, B, and C reside on the device. The operation op(X) is determined by transA and transB (none, transpose, or conjugate transpose; the latter is equivalent to transpose for real numbers). Following BLAS conventions, matrices are treated as column-major and the leading dimensions lda, ldb, and ldc specify the physical strides between columns.
The API is exposed as a C-style function in rocBLAS. A typical prototype is: rocblas_status rocblas_dgemm(rocblas_handle
Dimensional rules follow op(A) being m-by-k and op(B) being k-by-n, so the result C is m-by-n. Therefore
rocblas_dgemm is used as a building block in dense linear algebra on AMD GPUs. In addition to