-
pai-examples Public
Forked from aliyun/pai-examplesJupyter Notebook Apache License 2.0 UpdatedFeb 12, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedFeb 8, 2025 -
guidance Public
Forked from guidance-ai/guidanceA guidance language for controlling large language models.
Jupyter Notebook MIT License UpdatedFeb 8, 2025 -
deepeval Public
Forked from confident-ai/deepevalThe LLM Evaluation Framework
Python Apache License 2.0 UpdatedJan 26, 2025 -
jupyterlab Public
Forked from jupyterlab/jupyterlabJupyterLab computational environment.
TypeScript Other UpdatedJan 24, 2025 -
gpustack Public
Forked from gpustack/gpustackManage GPU clusters for running AI models
Python Apache License 2.0 UpdatedJan 24, 2025 -
modelscope Public
Forked from modelscope/modelscopeModelScope: bring the notion of Model-as-a-Service to life.
Python Apache License 2.0 UpdatedJan 21, 2025 -
HAMi Public
Forked from Project-HAMi/HAMiHeterogeneous AI Computing Virtualization Middleware
Go Apache License 2.0 UpdatedJan 21, 2025 -
ms-swift Public
Forked from modelscope/ms-swiftUse PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
Python Apache License 2.0 UpdatedJan 16, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJan 14, 2025 -
tutorials Public
Forked from triton-inference-server/tutorialsThis repository contains tutorials and examples for Triton Inference Server
Python BSD 3-Clause "New" or "Revised" License UpdatedJan 8, 2025 -
Awesome-LLM-Inference Public
Forked from DefTruth/Awesome-LLM-Inference📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
GNU General Public License v3.0 UpdatedJan 8, 2025 -
-
llm-action Public
Forked from liguodongiot/llm-action本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
HTML Apache License 2.0 UpdatedJan 4, 2025 -
AISystem Public
Forked from chenzomi12/AISystemAISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Jupyter Notebook Apache License 2.0 UpdatedJan 2, 2025 -
HAMi-core Public
Forked from Project-HAMi/HAMi-coreHAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container
C UpdatedDec 25, 2024 -
ingress-nginx Public
Forked from kubernetes/ingress-nginxIngress NGINX Controller for Kubernetes
Go Apache License 2.0 UpdatedOct 22, 2024 -
-
okg-sidecar Public
Forked from magicsong/kidecarsidecar for open kruise
Go Apache License 2.0 UpdatedSep 27, 2024 -
kube-burner Public
Forked from kube-burner/kube-burnerKubernetes performance and scale test orchestration framework written in golang
Go Apache License 2.0 UpdatedAug 30, 2024 -
cadvisor Public
Forked from google/cadvisorAnalyzes resource usage and performance characteristics of running containers.
Go Other UpdatedAug 20, 2024 -
cat Public
Forked from dianping/catCAT 作为服务端项目基础组件,提供了 Java, C/C++, Node.js, Python, Go 等多语言客户端,已经在美团点评的基础架构中间件框架(MVC框架,RPC框架,数据库框架,缓存框架等,消息队列,配置系统等)深度集成,为美团点评各业务线提供系统丰富的性能指标、健康状况、实时告警等。
Java Apache License 2.0 UpdatedAug 20, 2024 -
go-wrk Public
Forked from tsliwowicz/go-wrkgo-wrk - a HTTP benchmarking tool based in spirit on the excellent wrk tool (https://github.com/wg/wrk)
Go Apache License 2.0 UpdatedAug 18, 2024 -
llama3 Public
Forked from meta-llama/llama3The official Meta Llama 3 GitHub site
Python Other UpdatedAug 12, 2024 -
kubectl-neat Public
Forked from itaysk/kubectl-neatClean up Kubernetes yaml and json output to make it readable
Go Apache License 2.0 UpdatedJul 12, 2024 -
kubesphere Public
Forked from kubesphere/kubesphereThe container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️
Go Apache License 2.0 UpdatedMay 14, 2024 -
godel-scheduler Public
Forked from kubewharf/godel-scheduleran unified scheduler for online and offline tasks
Go Apache License 2.0 UpdatedApr 15, 2024 -
-
hubble Public
Forked from cilium/hubbleHubble - Network, Service & Security Observability for Kubernetes using eBPF
Go Apache License 2.0 UpdatedApr 6, 2024 -
wrk2 Public
Forked from giltene/wrk2A constant throughput, correct latency recording variant of wrk
C Apache License 2.0 UpdatedMar 3, 2024