NVIDIA Corporation · GitHub

NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 482 stars · 85 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 371 stars · 54 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.3k stars · 1.5k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.6k stars · 222 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 3.8k stars · 425 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.5k stars · 883 forks

Repositories

Showing 10 of 616 repositories
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    Go · 52 stars · Apache-2.0 · 10 forks · 14 issues · 3 pull requests · Updated Oct 23, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    C++ · 358 stars · 68 forks · 206 issues (15 issues need help) · 184 pull requests · Updated Oct 23, 2025
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 826 stars · 294 forks · 413 issues (17 issues need help) · 92 pull requests · Updated Oct 23, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    Python · 13,928 stars · 3,176 forks · 305 issues · 98 pull requests · Updated Oct 23, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    C++ · 11,941 stars · Apache-2.0 · 1,817 forks · 746 issues · 426 pull requests · Updated Oct 23, 2025
  • cccl Public

    CUDA Core Compute Libraries

    C++ · 1,986 stars · 282 forks · 1,073 issues (6 issues need help) · 170 pull requests · Updated Oct 23, 2025
  • nv-ingest Public

    NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

    Python · 2,751 stars · Apache-2.0 · 271 forks · 96 issues (1 issue needs help) · 37 pull requests · Updated Oct 23, 2025
  • context-aware-rag Public

    Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.

    Python · 36 stars · Apache-2.0 · 10 forks · 1 issue · 0 pull requests · Updated Oct 23, 2025
  • spark-rapids-jni Public

    RAPIDS Accelerator JNI For Apache Spark

    Cuda · 51 stars · Apache-2.0 · 74 forks · 76 issues · 9 pull requests · Updated Oct 23, 2025
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    MLIR · 112 stars · 17 forks · 37 issues (1 issue needs help) · 12 pull requests · Updated Oct 23, 2025