
developer__text_editor tool calls doesn't update files #929

Closed
Kamariw95 opened this issue Jan 30, 2025 · 8 comments
Assignees
Labels
help wanted Great issue for non-Block contributors

Comments

@Kamariw95

Kamariw95 commented Jan 30, 2025

Describe the bug
When using the CLI or the Desktop app, the developer__text_editor tool call doesn't actually update the files as Goose expects.

To Reproduce
Steps to reproduce the behavior:

  1. Start a session with goose session
  2. Conduct a conversation that results in a <tool_call>
  3. Check the repo via git status for file updates.
  4. See no changes in the repository (a minimal shell sketch of these steps follows below).
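For reference, a minimal shell sketch of the steps above (the prompt text is purely illustrative; `goose session` and `git status` are the commands from the steps):

```sh
# 1. start an interactive Goose session from the repo root
cd /path/to/repo
goose session

# 2. inside the session, ask for an edit, e.g.
#      "Add a usage section to README.md"
#    Goose responds as if developer__text_editor applied the change

# 3-4. in another terminal, check whether anything actually changed
git status    # expected: README.md modified; observed: working tree clean
```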

Expected behavior
The file that Goose has been given context on should actually be updated locally after the tool call.

Screenshots
[screenshot attached]

Please provide the following information:

  • OS & Arch: macOS M2 Pro 32GB
  • Interface: primarily CLI, but also happening in UI.
  • Version: v1.0.0
  • Extensions enabled:
    • UI: Developer, Computer Controller, Memory, git, Knowledge Graph Memory, Fetch
    • CLI: Developer
  • Provider & Model: Ollama qwen2.5:7b.

Additional context
When I initially downloaded Goose, I downloaded the Desktop application and it repeatedly asked for the same file permissions. This leads me to believe I'm hitting some wonky permissions error, but I'm also new to running models and this may be user error; forgive me if it is.

@jscott-yps

In my experience so far, Goose is also extremely resistant to actually editing files. I will ask it to edit a file and it will just respond with the changes in the message flow. It takes one or two follow-up attempts to get it to actually make the edit itself.

"EDIT THE FILE, DON'T TELL ME HOW" 😄

@salman1993
Collaborator

salman1993 commented Jan 30, 2025

During our testing, we have found the tool calling capabilities of 7B models (text editing, bash) are much worse compared to the larger models. The gap is wider for tool calling than for general chat completion.

Goose works with Anthropic's Claude 3.5 Sonnet. Among free options, I'd recommend the Gemini free tier for now. AFAIK DeepSeek 70B doesn't have great tool calling yet; hopeful that will change soon!

@jscott-yps

jscott-yps commented Jan 30, 2025

> during our testing, we have found the tool calling capabilities of 7B models (text editing, bash) is much worse compared to the larger models. the gap is wider when it comes to tool calling vs. general chat completion.
>
> goose works with Anthropic's Claude 3.5 Sonnet. if you're looking for a free option, i'd recommend Gemini free tier.

I'm not the reporter so I'll shut up 😄 but just wanted to add some context for my comment: I am using GPT-4o.

@salman1993 salman1993 added the help wanted Great issue for non-Block contributors label Jan 30, 2025
@Kamariw95
Author

Kamariw95 commented Jan 30, 2025

Thank you for your quick response @salman1993.

To be fair, I've swapped to using Claude and this tool is extremely impressive. Is there anything we could do to improve the performance of open-source models (e.g. DeepSeek, Llama, etc.)? Should I just use more CPU/GPU power?

@kamal94

kamal94 commented Jan 30, 2025

Thank you @salman1993. I have had good experience using Goose with Sonnet-3.5, but as @Kamariw95 noted, it is practically unusable with local models (I tried using Qwen, Llama-3.2, deepseek).

Considering Anthropic's API is rather expensive at scale (I burned through a dollar with a few commands setting up a local repo), local models seem like a great use case for Goose given their cost savings (practically just the cost of electricity).

Is there planned/active work on improving the agent's performance (tool calling in this case) with lower parameter models?

P.S. I found it amusing that our usernames are
salman19**93**
kamal**94**
Kamariw**95**

@salman1993
Collaborator

salman1993 commented Jan 30, 2025

Haha, that is funny! Yeah, we are testing out some open models internally right now; for example, watt-tool-70B looks promising.

@Kamariw95 So far, I think Llama 3.3 70B Instruct is not bad since it was finetuned for tool calling, but you can see the difference when you compare it to Claude 3.5 Sonnet. I find Llama 3.3 struggles after 6-8 tool calls in a loop.

We would love to suggest a cheaper and/or open-source model (hopeful about DeepSeek when they support tool calling natively)!
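For anyone who wants to compare models themselves, a rough sketch of switching providers (this assumes the `goose configure` command and the GOOSE_PROVIDER / GOOSE_MODEL environment variables; the provider and model names below are examples, check your version's configuration docs for the exact values):

```sh
# interactive setup: choose a provider, model and API key
goose configure

# or override per invocation via environment variables
GOOSE_PROVIDER=ollama    GOOSE_MODEL=qwen2.5:7b                goose session
GOOSE_PROVIDER=google    GOOSE_MODEL=gemini-1.5-flash          goose session
GOOSE_PROVIDER=anthropic GOOSE_MODEL=claude-3-5-sonnet-latest  goose session
```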

@michaelneale
Collaborator

I have been trying to get deepseek-r1 and others to do tool calling; it isn't great. But the default Qwen 2.5 with the change @wendytang has (and another variant of it here: #1021) may help with some of these local models that can do tool calling.

@baxen
Collaborator

baxen commented Feb 7, 2025

> I'm not the reporter so i'll shut up 😄 but just wanted to add some context for my comment, i am using GPT-4o

@jscott-yps 100%, I really have to nudge GPT-4o into making the edits, but once it attempts them it makes good ones and uses the tools correctly.

We've confirmed that the original issue here was the model using an incorrect format for the tool call - and so the tool call itself didn't actually process. Going to close this for now as in the current state we need larger models to get consistent results. But we'll be working on improvements to try to get this to work better with smaller models!
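To make the "incorrect format" failure concrete, here is a hedged sketch against Ollama's /api/chat tool-calling interface (the `text_editor` tool schema below is illustrative, not Goose's exact one): a well-formed reply puts the call in a structured `tool_calls` field, whereas smaller models often emit the call as plain text inside the message content, which the client can't execute, so no file ever changes.

```sh
# hedged sketch: ask Ollama's /api/chat endpoint for a tool call directly
# (tool name and schema are illustrative placeholders)
curl -s http://localhost:11434/api/chat -d '{
  "model": "qwen2.5:7b",
  "messages": [{"role": "user", "content": "Create hello.txt containing hi"}],
  "tools": [{
    "type": "function",
    "function": {
      "name": "text_editor",
      "description": "Create or edit a file",
      "parameters": {
        "type": "object",
        "properties": {
          "path": {"type": "string"},
          "file_text": {"type": "string"}
        },
        "required": ["path", "file_text"]
      }
    }
  }],
  "stream": false
}'

# a well-formed reply carries the call in message.tool_calls, e.g.:
#   "message": {"role": "assistant",
#     "tool_calls": [{"function": {"name": "text_editor",
#       "arguments": {"path": "hello.txt", "file_text": "hi"}}}]}
#
# the failure mode described above is the call landing as plain text instead:
#   "message": {"role": "assistant",
#     "content": "<tool_call>{\"name\": \"text_editor\", ...}</tool_call>"}
# in which case the client never executes it and no file changes.
```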

@baxen baxen closed this as completed Feb 7, 2025