- Infinigence
- Beijing
- konnase.github.io
Stars
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
rhiswell / rdma-tutorial
Forked from jcxue/RDMA-Tutorial
A tutorial on RDMA based programming using code examples
Secure reverse proxy over the SSH protocol. Helps expose your local server to the private network.
Collective communications library with various primitives for multi-machine training.
A simple, decentralized mesh VPN with WireGuard support.
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…
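A lightweight, cross-platform Mini Kubernetes AI Dashboard that integrates multi-cluster management, intelligent analysis, real-time anomaly detection, and natural-language queries. It supports multiple architectures and single-file deployment, helping with efficient cluster management and operations optimization.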
Libtpa(Transport Protocol Acceleration), a DPDK based userspace TCP stack implementation.
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Peter00796 / my_megatron
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
PyTorch native quantization and sparsity for training and inference
real time face swap and one-click video deepfake with only a single image
An easy-to-use and powerful chaos engineering experiment toolkit. (A simple, easy-to-use, and powerful chaos experiment injection tool open-sourced by Alibaba.)
Example models using DeepSpeed
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".
The easiest way to run WireGuard VPN + Web-based Admin UI.
A high-throughput and memory-efficient inference and serving engine for LLMs
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Easily and securely send things from one computer to another 🐊 📦