    Repositories list

    • torchada

      Public
      An adapter layer that ensures torch_musa🔦 delivers a CUDA-compatible PyTorch experience.
      Python
      MIT License
      113710 Updated Jun 24, 2026
    • vllm-musa

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Other
      18k10211 Updated Jun 24, 2026
    • C++
      20900 Updated Jun 23, 2026
    • mate

      Public
      MUSA AI Tensor Engine
      C++
      Apache License 2.0
      01010 Updated Jun 23, 2026
    • mutlass

      Public
      MUSA Templates for Linear Algebra Subroutines
      C++
      Other
      1.9k4610 Updated Jun 22, 2026
    • Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
      C++
      Other
      509000 Updated Jun 12, 2026
    • Fast Library for Approximate Nearest Neighbors
      C++
      Other
      664000 Updated Jun 11, 2026
    • Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
      C++
      BSD 3-Clause "New" or "Revised" License
      16000 Updated Jun 11, 2026
    • Forked from https://gitlab.com/libeigen/eigen
      C++
      Mozilla Public License 2.0
      0000 Updated Jun 11, 2026
    • muThrust

      Public
      The C++ parallel algorithms library.
      C++
      Other
      760400 Updated Jun 8, 2026
    • muAlg

      Public
      Cooperative primitives for MUSA C++.
      Cuda
      Other
      462700 Updated Jun 8, 2026
    • Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
      C++
      Other
      6135600 Updated Jun 5, 2026
    • tvm_musa

      Public
      Open Machine Learning Compiler Framework
      Python
      Apache License 2.0
      3.9k200 Updated Jun 5, 2026
    • TileOPs

      Public
      High-performance LLM operator library built on TileLang.
      Python
      Other
      44000 Updated Jun 1, 2026
    • MTClaw

      Public
      Local tool-routing proxy for openclaw/opencode/hermes, accelerating tool calls by up to 7x before forwarding general requests to upstream models.
      Python
      MIT License
      42612 Updated May 21, 2026
    • tvm-ffi

      Public
      Open ABI and FFI for Machine Learning Systems
      C++
      Apache License 2.0
      80000 Updated May 20, 2026
    • Python
      Other
      0000 Updated May 13, 2026
    • Python
      Other
      0000 Updated May 13, 2026
    • SimuMax

      Public
      A static analytical model for LLM distributed training
      Python
      Other
      2816100 Updated May 11, 2026
    • C++
      Apache License 2.0
      0100 Updated May 7, 2026
    • TypeScript
      Other
      1101 Updated May 7, 2026
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Training 3DGS in 50 seconds!
      Cuda
      Other
      3438060 Updated Apr 10, 2026
    • mujoco_warp_musa is a Python package extending MuJoCo Warp with MUSA compute backend, enabling GPU-accelerated physics simulation on MT MUSA architecture. Forked…
      Python
      Other
      01320 Updated Mar 30, 2026
    • mujoco_musa is a C++ sub-repository providing native MUSA kernel libraries for GPU-accelerated physics simulation in mujoco_warp_musa.
      C++
      Other
      0100 Updated Mar 27, 2026
    • axinfra is a lightweight array and compute infrastructure library for MUSA/CPU, providing device/stream management, array operations, and zero-copy interoperabi…
      Python
      Other
      0000 Updated Mar 27, 2026
    • torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      Other
      36499300 Updated Mar 17, 2026
    • kineto

      Public
      HTML
      Other
      3100 Updated Mar 16, 2026
    • PyTorch Extension Library of Optimized Graph Cluster Algorithms
      C++
      MIT License
      164000 Updated Mar 13, 2026
    • Provides a Python interface to GPU management and monitoring functions. This is a wrapper around the MTML library.
      C
      MIT License
      5910 Updated Mar 10, 2026
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better per…
      Python
      Apache License 2.0
      755901 Updated Feb 5, 2026