Skip to content
Change the repository type filter

All

    Repositories list

    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      BSD 3-Clause "New" or "Revised" License
      1.5k8.6k61861Updated Jan 19, 2025Jan 19, 2025
    • C++
      BSD 3-Clause "New" or "Revised" License
      1036713Updated Jan 19, 2025Jan 19, 2025
    • Python
      Apache License 2.0
      0756Updated Jan 18, 2025Jan 18, 2025
    • core

      Public
      The core library and APIs implementing the Triton Inference Server.
      C++
      BSD 3-Clause "New" or "Revised" License
      104114017Updated Jan 17, 2025Jan 17, 2025
    • The Triton backend for the PyTorch TorchScript models.
      C++
      BSD 3-Clause "New" or "Revised" License
      4413803Updated Jan 17, 2025Jan 17, 2025
    • Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
      Python
      25220Updated Jan 17, 2025Jan 17, 2025
    • client

      Public
      Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
      Python
      BSD 3-Clause "New" or "Revised" License
      2345863626Updated Jan 16, 2025Jan 16, 2025
    • The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
      C++
      MIT License
      31132226Updated Jan 16, 2025Jan 16, 2025
    • Third-party source packages that are modified for use in Triton.
      C
      BSD 3-Clause "New" or "Revised" License
      58705Updated Jan 16, 2025Jan 16, 2025
    • The Triton backend for the ONNX Runtime.
      C++
      BSD 3-Clause "New" or "Revised" License
      58136733Updated Jan 15, 2025Jan 15, 2025
    • backend

      Public
      Common source, scripts and utilities for creating Triton backends.
      C++
      BSD 3-Clause "New" or "Revised" License
      9130503Updated Jan 15, 2025Jan 15, 2025
    • Python
      BSD 3-Clause "New" or "Revised" License
      1921605Updated Jan 13, 2025Jan 13, 2025
    • tutorials

      Public
      This repository contains tutorials and examples for Triton Inference Server
      Python
      BSD 3-Clause "New" or "Revised" License
      101624812Updated Jan 13, 2025Jan 13, 2025
    • The Triton backend for TensorRT.
      C++
      BSD 3-Clause "New" or "Revised" License
      306801Updated Jan 13, 2025Jan 13, 2025
    • The Triton backend for TensorFlow.
      C++
      BSD 3-Clause "New" or "Revised" License
      204502Updated Jan 13, 2025Jan 13, 2025
    • Simple Triton backend used for testing.
      C++
      BSD 3-Clause "New" or "Revised" License
      4200Updated Jan 13, 2025Jan 13, 2025
    • An example Triton backend that demonstrates sending zero, one, or multiple responses for each request.
      C++
      BSD 3-Clause "New" or "Revised" License
      7500Updated Jan 13, 2025Jan 13, 2025
    • TRITONCACHE implementation of a Redis cache
      C++
      BSD 3-Clause "New" or "Revised" License
      41320Updated Jan 13, 2025Jan 13, 2025
    • Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
      C++
      BSD 3-Clause "New" or "Revised" License
      153578011Updated Jan 13, 2025Jan 13, 2025
    • OpenVINO backend for Triton.
      C++
      BSD 3-Clause "New" or "Revised" License
      163064Updated Jan 13, 2025Jan 13, 2025
    • Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Server models.
      Python
      Apache License 2.0
      76446264Updated Jan 13, 2025Jan 13, 2025
    • Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API
      C++
      BSD 3-Clause "New" or "Revised" License
      1510Updated Jan 13, 2025Jan 13, 2025
    • Example Triton backend that demonstrates most of the Triton Backend API.
      C++
      BSD 3-Clause "New" or "Revised" License
      12600Updated Jan 13, 2025Jan 13, 2025
    • C++
      101804Updated Jan 13, 2025Jan 13, 2025
    • common

      Public
      Common source, scripts and utilities shared across all Triton repositories.
      C++
      BSD 3-Clause "New" or "Revised" License
      746603Updated Jan 13, 2025Jan 13, 2025
    • The Triton repository agent that verifies model checksums.
      C++
      BSD 3-Clause "New" or "Revised" License
      71000Updated Jan 13, 2025Jan 13, 2025
    • Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
      Python
      Apache License 2.0
      2619340Updated Jan 13, 2025Jan 13, 2025
    • FIL backend for the Triton Inference Server
      Jupyter Notebook
      Apache License 2.0
      3576512Updated Jan 8, 2025Jan 8, 2025
    • The Triton TensorRT-LLM Backend
      Python
      Apache License 2.0
      11274827921Updated Jan 7, 2025Jan 7, 2025
    • pytriton

      Public
      PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
      Python
      Apache License 2.0
      5376380Updated Nov 19, 2024Nov 19, 2024