
How can I evaluate Ollama models such as Llama 3.3 using RAGAS? #1857

Open
HaniaMo opened this issue Jan 20, 2025 · 4 comments
Labels
question Further information is requested

Comments


HaniaMo commented Jan 20, 2025

[ ] I checked the documentation and related resources and couldn't find an answer to my question.

HaniaMo added the question (Further information is requested) label on Jan 20, 2025
Vidit-Ostwal (Contributor) commented

@dosu Can you solve this?


dosubot bot commented Jan 20, 2025

You can use RAGAS with Ollama models with the following code:

from langchain_community.chat_models import ChatOllama
from langchain_community.embeddings import OllamaEmbeddings

ollama_llm = "mistral"
model = ChatOllama(model=ollama_llm)
embeddings_ollama = OllamaEmbeddings(model="mistral")

This solution was suggested in a similar closed issue and received positive feedback [1].
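
For reference, a minimal end-to-end sketch of wiring these objects into evaluate() might look like the following. This is an assumption-laden sketch rather than something taken from the thread: it presumes ragas' LangchainLLMWrapper / LangchainEmbeddingsWrapper classes and a local Ollama server with the mistral model pulled, and the question/answer/contexts values are placeholders.

from datasets import Dataset
from langchain_community.chat_models import ChatOllama
from langchain_community.embeddings import OllamaEmbeddings
from ragas import evaluate
from ragas.metrics import faithfulness
from ragas.llms import LangchainLLMWrapper
from ragas.embeddings import LangchainEmbeddingsWrapper

# Wrap the local Ollama model so ragas can use it as judge LLM and for embeddings
ollama_llm = LangchainLLMWrapper(ChatOllama(model="mistral"))
ollama_embeddings = LangchainEmbeddingsWrapper(OllamaEmbeddings(model="mistral"))

# Placeholder single-row evaluation dataset
dataset = Dataset.from_dict({
    "question": ["What is Ollama?"],
    "answer": ["Ollama runs large language models locally."],
    "contexts": [["Ollama is a tool for running LLMs on your own machine."]],
})

results = evaluate(dataset=dataset, metrics=[faithfulness],
                   llm=ollama_llm, embeddings=ollama_embeddings)
print(results)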


HaniaMo (Author) commented Jan 22, 2025

I successfully ran RAGAS to evaluate an Ollama model. However, I get a timeout error during evaluation and NaN values for the metrics, even after increasing the timeout in the RunConfig.

Evaluating: 0%| | 0/1 [00:00<?, ?it/s]ERROR:ragas.executor:Exception raised in Job[0]: TimeoutError()
{'faithfulness': nan}

This is the function I used to calculate the evaluation metrics:

def RAGAS_metrics(user_query, model_response, contexts_list):
    # 6.2 - Run RAGAS metrics
    print("\nRunning RAGAS evaluation metrics...")
    # Build the dataset dict
    data = {
        "question": [user_query],        # user_query is a string of length 182
        "answer": [model_response],      # model_response is a string of length 3361
        "contexts": contexts_list,       # list of contexts: [[size=10]]
    }
    dataset = Dataset.from_dict(data)
    results = evaluate(dataset=dataset, metrics=[faithfulness], llm=llm_factory(),
                       embeddings=embedding_factory(),
                       run_config=RunConfig(max_workers=8, timeout=1000, log_tenacity=True))
    print("Evaluation Results from ragas:")
    print(results)
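
One thing worth checking here (my assumption, not something confirmed in the thread): llm_factory() and embedding_factory() construct ragas' default judge LLM and embeddings rather than the local Ollama model, so the faithfulness calls may be going to a different, possibly unreachable or slow, endpoint and timing out. A hedged sketch of passing the local Ollama model explicitly instead; the model name and dataset values are placeholders:

from datasets import Dataset
from langchain_community.chat_models import ChatOllama
from langchain_community.embeddings import OllamaEmbeddings
from ragas import evaluate
from ragas.metrics import faithfulness
from ragas.run_config import RunConfig
from ragas.llms import LangchainLLMWrapper
from ragas.embeddings import LangchainEmbeddingsWrapper

# Hypothetical judge built from the same local Ollama model being evaluated
judge_llm = LangchainLLMWrapper(ChatOllama(model="llama3.3"))
judge_embeddings = LangchainEmbeddingsWrapper(OllamaEmbeddings(model="llama3.3"))

# Placeholder dataset in the same shape RAGAS_metrics builds above
dataset = Dataset.from_dict({
    "question": ["..."],
    "answer": ["..."],
    "contexts": [["..."]],
})

results = evaluate(dataset=dataset, metrics=[faithfulness],
                   llm=judge_llm, embeddings=judge_embeddings,
                   run_config=RunConfig(max_workers=8, timeout=1000, log_tenacity=True))
print(results)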

jjmachan (Member) commented

Sadly, this is a duplicate of #1170.
Today we don't support Ollama models, but we will get this fixed in the coming weeks.
