rocm-rs: Safe Rust wrappers for AMD ROCm Libraries

This project provides Rust bindings for AMD's ROCm (Radeon Open Compute) libraries, allowing Rust developers to leverage AMD GPUs for high-performance computing.

Disclaimer

This project is not affiliated with, endorsed by, or sponsored by Advanced Micro Devices, Inc. (AMD).
ROCm and AMD are trademarks of Advanced Micro Devices, Inc.

Current Status

Note: This project is in early development.

Currently implemented:

✅ rocFFT - Fast Fourier Transform library (raw bindings + safe wrappers)
✅ HIP - Heterogeneous-Compute Interface for Portability (raw bindings + safe wrappers)
✅ rocBLAS - Basic Linear Algebra Subprograms (raw bindings + safe wrappers)
✅ MIOpen - Deep learning primitives (raw bindings + safe wrappers)
✅ rocRAND - Random number generation (raw bindings + safe wrappers)
✅ rocSOLVER - Linear system solvers (raw bindings + safe wrappers)
✅ rocSPARSE - Sparse linear algebra (raw bindings only)
✅ ROCArray - GPU array struct with api similar to Vec (to be deprecated in favor of DeviceMemoryExt)
✅ rocmsmi - system managment interface (refer to rocm_smi_lib)
✅ rocm_kernel_macros - macros for writing gpu kernels in rust(refer to rocm_kernel_macros)

The project currently focuses on providing raw FFI bindings for most libraries, with safe Rust wrappers available for rocFFT. Additional safe wrappers for other libraries are planned for future development.

Prerequisites

AMD ROCm installed (version 6.3 or later recommended.It may work on older versions, but I did not test that)
Ubuntu 24.04 / Fedora 42
Rust toolchain (1.65.0 or later recommended)
A compatible AMD GPU

Installation

Add this to your Cargo.toml:

[dependencies]
rocm-rs = "5.1"

Usage

First, ensure that the ROCm libraries are in your library path or set the ROCM_PATH environment variable.

Writing your own kernels with rust

use std::path::PathBuf;

use rocm_rs::{
    hip::{kernel::AsKernelArg, *},
    kernel_args,
    rocm_kernel_macros::{amdgpu_global, amdgpu_kernel_finalize, amdgpu_kernel_init},
};

const LEN: usize = 1024;

// initializing rust gpu kernel
amdgpu_kernel_init!();

// marking code that will be coppied to gpu kernel
#[amdgpu_global]
fn kernel(input: *const u32, output: *mut u32) {
    // retriving data from buffere by workitem
    let num = read_by_workitem_id_x(input);

    // writing data back
    write_by_workitem_id_x(output, num * 3);
}

// compiling gpu kernel
const AMDGPU_KERNEL_BINARY_PATH: &str = amdgpu_kernel_finalize!();

fn main() -> Result<()> {
    // setting up device
    let device = Device::new(0)?;
    device.set_current()?;

    // loading gpu kerenel (runs in runtime!)
    let kernel_path = PathBuf::from(AMDGPU_KERNEL_BINARY_PATH);
    assert!(kernel_path.exists());

    let module = Module::load(kernel_path)?;

    // acquiring function handle from gpu kernel
    let function = module.get_function("kernel")?;

    // preparing host side buffers
    let mut in_host: Vec<u32> = vec![0; LEN];
    let mut out_host: Vec<u32> = vec![0; LEN];

    for i in 0..LEN {
        in_host[i] = i as u32;
    }

    // preparing gpu side buffers
    let mut input = DeviceMemory::<u32>::new(LEN)?;
    let output = DeviceMemory::<u32>::new(LEN)?;

    input.copy_from_host(&in_host)?;

    // providing arguments for kernel
    let kernel_args = kernel_args!(input, output);

    // setting up launch args
    let grid_dim = Dim3 { x: 2, y: 1, z: 1 };
    let block_dim = Dim3 {
        x: (LEN / 2) as u32,
        y: 1,
        z: 1,
    };

    function.launch(grid_dim, block_dim, 0, None, &mut kernel_args.clone())?;

    // retriving computed data
    output.copy_to_host(&mut out_host)?;

    println!("Output: {:?}", &out_host[..256]);

    Ok(())
}

For async operations with streams:

use std::path::PathBuf;

use rocm_rs::{hip::{kernel::AsKernelArg, *}, rocm_kernel_macros::*};

const LEN: usize = 1024;

amdgpu_kernel_init!();

#[amdgpu_global]
fn kernel(input: *const u32, output: *mut u32) {
    let num = read_by_workitem_id_x(input);
    write_by_workitem_id_x(output, num * 3);
}

const AMDGPU_KERNEL_BINARY_PATH: &str = amdgpu_kernel_finalize!();

fn main() -> Result<()> {
    let device = Device::current()?;
    
    // Create a stream for async operations
    let stream = device.get_stream()?;
    stream.add_callback(|| println!("callback"))?;

    let kernel_path = PathBuf::from(AMDGPU_KERNEL_BINARY_PATH);
    let module = Module::load(kernel_path)?;
    let function = module.get_function("kernel")?;

    let in_host: Vec<u32> = (0..LEN).map(|i| i as u32).collect();
    let out_host: Vec<u32> = vec![0; LEN];

    let input = DeviceMemory::<u32>::new(LEN)?;
    let output = DeviceMemory::<u32>::new(LEN)?;

    // Copy data from host to device asynchronously
    input.copy_from_host_async(in_host, &stream)?;

    let kernel_args = [input.as_kernel_arg(), output.as_kernel_arg()];

    let grid_dim = Dim3 { x: 2, y: 1, z: 1 };
    let block_dim = Dim3 { x: (LEN / 2) as u32, y: 1, z: 1 };

    function.launch(grid_dim, block_dim, 0, Some(&stream), &mut kernel_args.clone())?;

    // Retrieve computed data asynchronously
    let pending = output.copy_to_host_async(out_host, &stream)?;

    // Synchronize memory (awaiting for copy to finish)
    let out_host = stream.synchronize_memory(pending)?;
    println!("Output: {:?}", &out_host[..256]);

    Ok(())
}


### Using rocFFT with safe wrappers:

```rust
use rocm_rs::rocfft::{self, plan, execution, field};

fn main() {
    // Initialize the rocFFT library
    // Use the safe wrappers for rocFFT
    let plan = plan::Plan::new(/* parameters */);
    let field = field::Field::new(/* parameters */);
    let execution = execution::Execution::new(/* parameters */);
    
    // Perform FFT operations
    // ...
}

Using other libraries with raw bindings:

use rocm_rs::hip::*;

fn main() {
    unsafe {
        // Example of using HIP raw bindings
        let mut device_count = 0;
        hipGetDeviceCount(&mut device_count);
        println!("Found {} HIP devices", device_count);
        
        // Use other raw bindings as needed
        // ...
    }
}

Building from Source

Important: When building from source, you need to run cargo build first to generate the bindings files before you can use the library or run tests.

# Clone the repository
git clone https://github.com/RustNSparks/rocm-rs
cd rocm-rs

# Set the ROCm path if not in the default location
export ROCM_PATH=/opt/rocm

# Build the project (generates bindings)
cargo build

Feature flags

rocm_smi - enables bindings and wrappers for rocm_smi_lib

Examples

The project includes a workspace with examples for each sub-library. Run examples with:

cargo run --package <example_name> --example <example_name>

HIP

vector_add_example - Vector addition with kernel written in HIP
rust_kernel - Kernel written in Rust using macros
rust_kernel_async - Async kernel execution with streams
saxpy - SAXPY operation (Single-precision A*X + Y)
sort - GPU sorting example

MIOpen

miopen_basic - Basic MIOpen usage example
multi_tensor - Multi-tensor operations

rocBLAS

rocblas_basic - Basic rocBLAS usage example

rocRAND

normal - Random number generation with normal distribution

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

When contributing:

Run cargo build first to generate the bindings
Add tests for new functionality
Update documentation as needed

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

AMD for developing and maintaining ROCm
The Rust community for bindgen and other tools used in this project

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.github		.github
.idea		.idea
include		include
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
build.rs		build.rs
rocm_attr.lock		rocm_attr.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

rocm-rs: Safe Rust wrappers for AMD ROCm Libraries

Disclaimer

Current Status

Prerequisites

Installation

Usage

Writing your own kernels with rust

Using other libraries with raw bindings:

Building from Source

Feature flags

Examples

HIP

MIOpen

rocBLAS

rocRAND

Contributing

License

Acknowledgments

About

Uh oh!

Releases 3

Sponsor this project

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

rocm-rs: Safe Rust wrappers for AMD ROCm Libraries

Disclaimer

Current Status

Prerequisites

Installation

Usage

Writing your own kernels with rust

Using other libraries with raw bindings:

Building from Source

Feature flags

Examples

HIP

MIOpen

rocBLAS

rocRAND

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages