Stars
SGLang is a fast serving framework for large language models and vision language models.
Performance-portable, length-agnostic SIMD with runtime dispatch
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
AI-native (edge and LLM) proxy for agents. Move faster by letting Arch handle the pesky heavy lifting in building agentic apps -- ⚡️ query understanding and routing, seamless integration of prompts…
Framework to bring LLM applications to production
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Container runtimes on macOS (and Linux) with minimal setup
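✍ WeChat Markdown Editor | A highly minimalist WeChat Markdown editor: supports Markdown syntax, color-palette picking, multi-image upload, one-click document download, custom CSS styles, one-click reset, and more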
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG
A command-line tool to download photos from iCloud
A high-throughput and memory-efficient inference and serving engine for LLMs
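Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use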
A lightweight, fast, and secure code execution environment that supports multiple programming languages
Simple, unified interface to multiple Generative AI providers
10x Faster Long-Context LLM By Smart KV Cache Optimizations
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
An Alpine Linux container for the iCloud Photos Downloader command line utility
Official inference library for Mistral models
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Real-time monitor and web admin for Celery distributed task queue
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
A Python vector database you just need - no more, no less.