Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Even with proper tool call descriptions, I've had quite a few occasions where the LLM didn't know how to use the tool.

The tools provided by the MCP server were definitely in context and there were only two or three servers with a small amount of tools enabled.

It feels too model dependant at the moment, this was Gemini 2.5 Pro which is normally state of the art but has lots of quirks for tool use it seems.

Agreed on hoping models are going to be trained to be better at using MCP.



Right, my workflow to get even a basic prompt working consistently rarely involves fewer than like 10 cycles of [run it 10 times -> update the prompt extensively to knock out problems in the first step]

And then every time I try to add something new to the prompt, all the prompting for previously existing behavior often needs to be updated as well to account for the new stuff, even if it's in a totally separate 'branch' of the prompt flow/logic.

I'd anticipate that each individual MCP I wanted to add would require a similar process to ensure reliability.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: