
AMD ROCm™ Software

AMD ROCm software is AMD's open-source software stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories

  1. ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell · 5.2k stars · 425 forks

  2. hip Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ · 4k stars · 551 forks

  3. MIOpen Public

    AMD's Machine Intelligence Library

    Assembly · 1.1k stars · 244 forks

  4. tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ · 690 stars · 98 forks

  5. HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ · 569 stars · 85 forks

  6. ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell · 459 stars · 69 forks

Repositories

Showing 10 of 312 repositories
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    LLVM · 142 stars · 13,439 forks · 16 open issues · 5 open pull requests · Updated Apr 12, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    Assembly · 86 stars · MIT license · 114 forks · 13 open issues · 88 open pull requests · Updated Apr 12, 2025
  • device-metrics-exporter Public

    Device Metrics Exporter exports metrics from AMD devices (GPUs) to collectors like Prometheus.

    Go · 11 stars · Apache-2.0 license · 13 forks · 5 open issues · 7 open pull requests · Updated Apr 12, 2025
  • TransformerEngine Public
    Python · 27 stars · 13 forks · 9 open issues · 10 open pull requests · Updated Apr 12, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ · 376 stars · 169 forks · 33 open issues (1 issue needs help) · 71 open pull requests · Updated Apr 12, 2025
  • aomp Public

    AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

    Fortran · 217 stars · Apache-2.0 license · 51 forks · 2 open issues · 47 open pull requests · Updated Apr 12, 2025
  • onnxruntime Public Forked from microsoft/onnxruntime

    ONNX Runtime: cross-platform, high performance scoring engine for ML models

    C++ · 6 stars · MIT license · 3,221 forks · 0 open issues · 8 open pull requests · Updated Apr 12, 2025
  • AMDMIGraphX Public

    AMD's graph optimization engine.

    C++ · 214 stars · MIT license · 97 forks · 363 open issues (1 issue needs help) · 55 open pull requests · Updated Apr 12, 2025
  • rocMLIR Public
    MLIR · 141 stars · 40 forks · 1 open issue · 24 open pull requests · Updated Apr 12, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 72 stars · Apache-2.0 license · 6,884 forks · 11 open issues · 27 open pull requests · Updated Apr 12, 2025