diff --git a/config/sidebar-learn.json b/config/sidebar-learn.json index f487ea6ca..a10658def 100644 --- a/config/sidebar-learn.json +++ b/config/sidebar-learn.json @@ -54,6 +54,11 @@ "label": "Deactivate AI-powered search", "slug": "deactivate_ai_powered_search" }, + { + "source": "learn/ai_powered_search/choose_an_embedder.mdx", + "label": "Which embedder should I choose?", + "slug": "choose_an_embedder" + }, { "source": "learn/ai_powered_search/difference_full_text_ai_search.mdx", "label": "Differences between full-text and AI-powered search", diff --git a/learn/ai_powered_search/choose_an_embedder.mdx b/learn/ai_powered_search/choose_an_embedder.mdx new file mode 100644 index 000000000..15da304e0 --- /dev/null +++ b/learn/ai_powered_search/choose_an_embedder.mdx @@ -0,0 +1,32 @@ +--- +title: Which embedder should I choose? — Meilisearch documentation +description: General guidance on how to choose the embedder best suited for projects using AI-powered search. +--- + +# Which embedder should I choose? + +Meilisearch officially supports many different embedders, such as OpenAI, Hugging Face, and Ollama, as well as the majority of embedding generators with a RESTful API. + +This article contains general guidance on how to choose the embedder best suited for your project. + +## When in doubt, choose OpenAI + +OpenAI returns relevant search results across different subjects and datasets. It is suited for the majority of applications and Meilisearch actively supports and improves OpenAI functionality with every new release. + +In the majority of cases, and especially if this is your first time working with LLMs and AI-powered search, choose OpenAI. + +## If you are already using a specific AI service, choose the REST embedder + +If you are already using a specific model from a compatible embedder, choose Meilisearch's REST embedder. This ensures you continue building upon tooling and workflows already in place with minimal configuration necessary. + +## If dealing with non-textual content, choose the user-provided embedder + +Meilisearch does not support searching images, audio, or any other content not presented as text. This limitation applies to both queries and documents. For example, Meilisearch's built-in embedder sources cannot search using an image instead of text. They also cannot use text to search for images without attached textual metadata. + +In these cases, you will have to supply your own embedder. + +## Only choose Hugging Face when self-hosting small static datasets + +Although it returns very relevant search results, the Hugging Face embedder must run directly in your server. This may lead to lower performance and extra costs when you are hosting Meilisearch in a service like DigitalOcean or AWS. + +That said, Hugging Face can be a good embedder for datasets under 10k documents that you don't plan to update often. Meilisearch Cloud does not support Hugging Face embedders.