Stars
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Simple, unified interface to multiple Generative AI providers
A high-throughput and memory-efficient inference and serving engine for LLMs
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.
🐶 Kubernetes CLI To Manage Your Clusters In Style!
Set up your own OpenVPN server on Debian, Ubuntu, Fedora, CentOS, Arch Linux and more
HunyuanVideo: A Systematic Framework For Large Video Generation Model
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The Robot Operating System, is a meta operating system for robots.
Main ROS.org landing website
Multi-threaded AWS inventory collection tool with a focus on security-relevant resources and metadata.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Pretrain a model on ciphered text so only you can use it
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Curated list of datasets and tools for post-training.
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
A template for PostgreSQL High Availability with Etcd, Consul, ZooKeeper, or Kubernetes
Postgres operator creates and manages PostgreSQL clusters running in Kubernetes
Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by Clickhouse and OpenTelemetry.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Torii ⛩️ is a simple, powerful and extensible open-source Internal Developer Portal
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
Open-source Infrastructure as Code (IaC) orchestration platform: GitOps workflows, orchestration, code generation, observability, drift detection, asset management, policies, Slack notifications, a…
Humanitec AWS Reference Architecture implementation