ROCm Documentation is transitioning to this site. For the legacy documentation, please visit docs.amd.com. For more information or to provide feedback about this documentation transition, please see our announcement.

API

Contents

API#

hipBLAS interface examples#

The hipBLAS interface is compatible with rocBLAS and cuBLAS-v2 APIs. Porting a CUDA application which originally calls the cuBLAS API to an application calling hipBLAS API should be relatively straightforward. For example, the hipBLAS SGEMV interface is

GEMV API#

hipblasStatus_t
hipblasSgemv( hipblasHandle_t handle,
            hipblasOperation_t trans,
            int m, int n, const float *alpha,
            const float *A, int lda,
            const float *x, int incx, const float *beta,
            float *y, int incy );

Batched and strided GEMM API#

hipBLAS GEMM can process matrices in batches with regular strides. There are several permutations of these API’s, the following is an example that takes everything

hipblasStatus_t
hipblasSgemmStridedBatched( hipblasHandle_t handle,
            hipblasOperation_t transa, hipblasOperation_t transb,
            int m, int n, int k, const float *alpha,
            const float *A, int lda, long long bsa,
            const float *B, int ldb, long long bsb, const float *beta,
            float *C, int ldc, long long bsc,
            int batchCount);

hipBLAS assumes matrices A and vectors x, y are allocated in GPU memory space filled with data. Users are responsible for copying data from/to the host and device memory.