Skip to content
View acelyc111's full-sized avatar
:octocat:
working
:octocat:
working

Organizations

@apache @XiaoMi @pegasus-kv

Block or report acelyc111

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SGLang is a fast serving framework for large language models and vision language models.

Python 11,607 1,181 Updated Mar 9, 2025

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 4,447 333 Updated Mar 7, 2025

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 7,246 537 Updated Mar 5, 2025

AI-native (edge and LLM) proxy for agents. Move faster by letting Arch handle the pesky heavy lifting in building agentic apps -- ⚡️ query understanding and routing, seamless integration of prompts…

Rust 1,927 95 Updated Mar 8, 2025

Framework to bring LLM applications to production

Python 313 37 Updated Mar 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,512 825 Updated Mar 7, 2025

Paranoid text spacing in JavaScript

JavaScript 4,473 292 Updated Dec 30, 2023

Container runtimes on macOS (and Linux) with minimal setup

Go 21,576 431 Updated Mar 7, 2025

✍ WeChat Markdown Editor | 一款高度简洁的微信 Markdown 编辑器:支持 Markdown 语法、色盘取色、多图上传、一键下载文档、自定义 CSS 样式、一键重置等特性

Vue 7,799 1,241 Updated Mar 8, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG

Python 32,713 2,769 Updated Mar 8, 2025

A command-line tool to download photos from iCloud

Python 7,766 596 Updated Feb 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,810 6,142 Updated Mar 9, 2025

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,465 1,292 Updated Sep 5, 2024

A lightweight, fast, and secure code execution environment that supports multiple programming languages

Go 640 153 Updated Oct 30, 2024

Simple, unified interface to multiple Generative AI providers

Python 11,590 1,122 Updated Mar 6, 2025

10x Faster Long-Context LLM By Smart KV Cache Optimizations

Python 566 56 Updated Mar 9, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,543 2,295 Updated Mar 9, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,898 177 Updated Mar 8, 2025

An Alpine Linux container for the iCloud Photos Downloader command line utility

Shell 2,093 184 Updated Feb 27, 2025

The Cloud-Native API Gateway and AI Gateway

Lua 14,858 2,558 Updated Mar 7, 2025

Official inference library for Mistral models

Jupyter Notebook 10,062 898 Updated Nov 12, 2024

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 9,188 847 Updated Mar 7, 2025

Real-time monitor and web admin for Celery distributed task queue

Python 6,630 1,105 Updated Sep 1, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 43,424 3,879 Updated Mar 8, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 33,037 3,066 Updated Mar 9, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 19,063 2,031 Updated Oct 15, 2024

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 22,354 1,532 Updated Mar 7, 2025

A Python vector database you just need - no more, no less.

Python 597 46 Updated Mar 4, 2024
Next
Showing results