Skip to content
@EmbeddedLLM

EmbeddedLLM

EmbeddedLLM is the creator behind JamAI Base, a platform designed to orchestrate AI with spreadsheet-like simplicity.

Pinned Loading

  1. JamAIBase JamAIBase Public

    The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate…

    Python 793 25

  2. vllm vllm Public

    Forked from vllm-project/vllm

    vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 88 5

  3. embeddedllm embeddedllm Public

    EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU

    Python 31 1

Repositories

Showing 10 of 47 repositories
  • vllm Public Forked from vllm-project/vllm

    vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs

    EmbeddedLLM/vllm’s past year of commit activity
    Python 88 Apache-2.0 5,608 2 0 Updated Feb 6, 2025
  • vllm-rocmfork Public Forked from ROCm/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    EmbeddedLLM/vllm-rocmfork’s past year of commit activity
    Python 0 Apache-2.0 5,608 0 0 Updated Feb 5, 2025
  • aiter Public Forked from ROCm/aiter

    AI Tensor Engine for ROCm

    EmbeddedLLM/aiter’s past year of commit activity
    Cuda 0 MIT 3 0 0 Updated Feb 4, 2025
  • JamAIBase Public

    The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.

    EmbeddedLLM/JamAIBase’s past year of commit activity
    Python 793 Apache-2.0 25 1 0 Updated Feb 4, 2025
  • lmcache-vllm Public Forked from LMCache/lmcache-vllm

    The driver for LMCache core to run in vLLM

    EmbeddedLLM/lmcache-vllm’s past year of commit activity
    Python 0 Apache-2.0 13 0 0 Updated Jan 24, 2025
  • LMCache Public Forked from LMCache/LMCache

    ROCm support of Ultra-Fast and Cheaper Long-Context LLM Inference

    EmbeddedLLM/LMCache’s past year of commit activity
    Python 0 Apache-2.0 44 0 0 Updated Jan 24, 2025
  • EmbeddedLLM/lmcache-tests’s past year of commit activity
    Python 0 7 0 0 Updated Jan 23, 2025
  • EmbeddedLLM/production-stack’s past year of commit activity
    Python 0 Apache-2.0 29 0 0 Updated Jan 22, 2025
  • kvpress Public Forked from NVIDIA/kvpress

    LLM KV cache compression made easy

    EmbeddedLLM/kvpress’s past year of commit activity
    Python 0 Apache-2.0 25 0 0 Updated Jan 21, 2025
  • litellm Public Forked from BerriAI/litellm

    Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

    EmbeddedLLM/litellm’s past year of commit activity
    Python 0 2,091 0 0 Updated Jan 13, 2025

Top languages

Loading…

Most used topics

Loading…