chuanyi-zjc

  • Jupyter Notebook Apache License 2.0 Updated Feb 12, 2025
  • sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python Apache License 2.0 Updated Feb 8, 2025
  • guidance Public

    Forked from guidance-ai/guidance

    A guidance language for controlling large language models.

    Jupyter Notebook MIT License Updated Feb 8, 2025
  • deepeval Public

    Forked from confident-ai/deepeval

    The LLM Evaluation Framework

    Python Apache License 2.0 Updated Jan 26, 2025
  • JupyterLab computational environment.

    TypeScript Other Updated Jan 24, 2025
  • gpustack Public

    Forked from gpustack/gpustack

    Manage GPU clusters for running AI models

    Python Apache License 2.0 Updated Jan 24, 2025
  • ModelScope: bring the notion of Model-as-a-Service to life.

    Python Apache License 2.0 Updated Jan 21, 2025
  • HAMi Public

    Forked from Project-HAMi/HAMi

    Heterogeneous AI Computing Virtualization Middleware

    Go Apache License 2.0 Updated Jan 21, 2025
  • ms-swift Public

    Forked from modelscope/ms-swift

    Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

    Python Apache License 2.0 Updated Jan 16, 2025
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Jan 14, 2025
  • This repository contains tutorials and examples for Triton Inference Server

    Python BSD 3-Clause "New" or "Revised" License Updated Jan 8, 2025
  • 📖 A curated list of Awesome LLM/VLM Inference Papers with code, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

    GNU General Public License v3.0 Updated Jan 8, 2025
  • Go Apache License 2.0 Updated Jan 7, 2025
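  • This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).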

    HTML Apache License 2.0 Updated Jan 4, 2025
  • AISystem Public

    Forked from chenzomi12/AISystem

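    AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.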

    Jupyter Notebook Apache License 2.0 Updated Jan 2, 2025
  • HAMi-core compiles libvgpu.so, which enforces a hard limit on GPU use inside a container

    C Updated Dec 25, 2024
  • Ingress NGINX Controller for Kubernetes

    Go Apache License 2.0 Updated Oct 22, 2024
  • logger-demo Public

    Go Updated Oct 9, 2024
  • okg-sidecar Public

    Forked from magicsong/kidecar

    Sidecar for OpenKruise

    Go Apache License 2.0 Updated Sep 27, 2024
  • Kubernetes performance and scale test orchestration framework written in golang

    Go Apache License 2.0 Updated Aug 30, 2024
  • cadvisor Public

    Forked from google/cadvisor

    Analyzes resource usage and performance characteristics of running containers.

    Go Other Updated Aug 20, 2024
  • cat Public

    Forked from dianping/cat

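    CAT is a foundational component for server-side projects. It provides clients in multiple languages, including Java, C/C++, Node.js, Python, and Go, and is deeply integrated into Meituan-Dianping's infrastructure middleware (MVC framework, RPC framework, database framework, cache framework, message queue, configuration system, etc.), giving each Meituan-Dianping business line rich system performance metrics, health status, real-time alerting, and more.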

    Java Apache License 2.0 Updated Aug 20, 2024
  • go-wrk Public

    Forked from tsliwowicz/go-wrk

    go-wrk - an HTTP benchmarking tool based in spirit on the excellent wrk tool (https://github.com/wg/wrk)

    Go Apache License 2.0 Updated Aug 18, 2024
  • llama3 Public

    Forked from meta-llama/llama3

    The official Meta Llama 3 GitHub site

    Python Other Updated Aug 12, 2024
  • Clean up Kubernetes yaml and json output to make it readable

    Go Apache License 2.0 Updated Jul 12, 2024
  • The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

    Go Apache License 2.0 Updated May 14, 2024
  • A unified scheduler for online and offline tasks

    Go Apache License 2.0 Updated Apr 15, 2024
  • Takin-web Public

    Forked from shulieTech/Takin-web

    Java Updated Apr 7, 2024
  • hubble Public

    Forked from cilium/hubble

    Hubble - Network, Service & Security Observability for Kubernetes using eBPF

    Go Apache License 2.0 Updated Apr 6, 2024
  • wrk2 Public

    Forked from giltene/wrk2

    A constant throughput, correct latency recording variant of wrk

    C Apache License 2.0 Updated Mar 3, 2024