Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 412 70

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 358 52

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.2k 1.5k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.6k 217

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.6k 407

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.4k 831

Repositories

Showing 10 of 595 repositories
  • bionemo-framework Public

    BioNeMo Framework: For building and adapting AI models in drug discovery at scale

    NVIDIA/bionemo-framework’s past year of commit activity
    Jupyter Notebook 504 88 50 (1 issue needs help) 91 Updated Sep 10, 2025
  • NeMo-Agent-Toolkit Public

    The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

    NVIDIA/NeMo-Agent-Toolkit’s past year of commit activity
    Python 1,336 Apache-2.0 358 56 21 Updated Sep 10, 2025
  • TensorRT-LLM Public

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    C++ 11,545 Apache-2.0 1,732 741 383 Updated Sep 10, 2025
  • NeMo-Guardrails Public

    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

    NVIDIA/NeMo-Guardrails’s past year of commit activity
    Python 5,053 536 126 (5 issues need help) 49 Updated Sep 10, 2025
  • TensorRT-Model-Optimizer Public

    A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.

    NVIDIA/TensorRT-Model-Optimizer’s past year of commit activity
    Python 1,324 Apache-2.0 148 112 19 Updated Sep 10, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 354 66 173 (14 issues need help) 178 Updated Sep 10, 2025
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    NVIDIA/cuda-python’s past year of commit activity
    Python 2,956 203 147 15 Updated Sep 10, 2025
  • TransformerEngine Public

    A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

    NVIDIA/TransformerEngine’s past year of commit activity
    Python 2,711 Apache-2.0 503 213 87 Updated Sep 10, 2025
  • numba-cuda Public

    The CUDA target for Numba

    NVIDIA/numba-cuda’s past year of commit activity
    Python 181 BSD-2-Clause 38 94 24 Updated Sep 10, 2025
  • gpu-operator Public

    NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes

    NVIDIA/gpu-operator’s past year of commit activity
    Go 2,284 Apache-2.0 380 380 70 Updated Sep 10, 2025