Skip to content
View konnase's full-sized avatar

Block or report konnase

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 3,389 248 Updated Feb 26, 2025

A tutorial on RDMA based programming using code examples

C 37 7 Updated Jun 1, 2019

Secure Reverse Proxy over SSH protocol. Help to expose your local server to the private network.

Go 6 Updated Jan 20, 2025

Collective communications library with various primitives for multi-machine training.

C++ 1,264 313 Updated Feb 26, 2025

Pygloo provides Python bindings for Gloo.

C++ 20 11 Updated Oct 29, 2024

A simple, decentralized mesh VPN with WireGuard support.

Rust 2,891 278 Updated Feb 21, 2025

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 92 7 Updated Feb 21, 2025

一款轻量级、跨平台的 Mini Kubernetes AI Dashboard,集成多集群管理、智能分析、实时异常检测和自然语言查询功能,支持多架构并可单文件部署,助力高效集群管理与运维优化。

Go 95 12 Updated Feb 26, 2025

Linux kernel source tree

C 188,530 55,272 Updated Feb 26, 2025

Libtpa(Transport Protocol Acceleration), a DPDK based userspace TCP stack implementation.

C 112 16 Updated Mar 19, 2024

RDMA core userspace libraries and daemons

C 1,693 720 Updated Feb 26, 2025

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Python 357 59 Updated Feb 26, 2025

Ongoing research training transformer models at scale

Python 1 Updated Oct 15, 2024

PyTorch native quantization and sparsity for training and inference

Python 1,861 223 Updated Feb 26, 2025

A PyTorch Native LLM Training Framework

Python 735 41 Updated Dec 27, 2024

real time face swap and one-click video deepfake with only a single image

Python 44,221 6,495 Updated Feb 19, 2025

An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)

Go 6,048 957 Updated Feb 20, 2025

Example models using DeepSpeed

Python 6,316 1,068 Updated Feb 14, 2025

🍳 Recipes for the Prodigy, our fully scriptable annotation tool

Jupyter Notebook 488 117 Updated Aug 4, 2024

Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".

Python 46 1 Updated Jul 12, 2024
Python 295 39 Updated Aug 20, 2024

The easiest way to run WireGuard VPN + Web-based Admin UI.

JavaScript 17,512 1,689 Updated Feb 25, 2025

CUDA checkpoint and restore utility

C 297 15 Updated Jan 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,475 5,914 Updated Feb 26, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,537 1,116 Updated Feb 26, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 68,662 7,387 Updated Feb 26, 2025

Easily and securely send things from one computer to another 🐊 📦

Go 28,825 1,142 Updated Feb 25, 2025
Next
Showing results