This is so much cheaper than re-prompting each tool use.
I wish this were extended further: for example, you could give the model an API endpoint it can call to execute JS code, the only requirement being that your API responds within 5 seconds (maybe less, actually).
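A toy sketch of that constraint, assuming the provider enforces a hard response deadline on the user's endpoint (using a Python subprocess as a stand-in for a sandboxed JS runtime; names are illustrative):

```python
import subprocess
import sys

def execute_code(code: str, timeout_s: float = 5.0) -> str:
    """Run submitted code in a subprocess, enforcing the provider's
    response deadline. Toy sketch only: a real service would sandbox
    a JS runtime rather than spawn a Python interpreter."""
    try:
        result = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True, text=True, timeout=timeout_s,
        )
        return result.stdout
    except subprocess.TimeoutExpired:
        return "error: execution exceeded deadline"

print(execute_code("print(2 + 2)"))  # prints "4"
```

If the endpoint misses the deadline, the provider could return an error string to the model instead of blocking the generation loop.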
I wonder if this is what OpenAI is planning to do in the upcoming API update to support tools in o3.
I imagine there wouldn’t be much of a cost to the provider on that API call, so much longer timeouts may be possible. It’s not like this would hold up the LLM in any way: execution would be suspended while the call is made, and the TPU/GPU would serve another request.
They need to keep the KV cache to avoid reprocessing the prompt, so during longer API calls they would have to move it to RAM/NVMe to free the GPU for another request.
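The scheduling idea in that comment can be sketched as follows. This is a toy model, not a real serving framework: all names are illustrative, and the "offload"/"reload" methods stand in for actual DMA copies of the KV cache between HBM and host RAM/NVMe.

```python
import asyncio

class Request:
    """Toy stand-in for an in-flight generation request."""
    def __init__(self, rid: int):
        self.rid = rid
        self.kv_location = "gpu"  # "gpu" | "ram"

    def offload(self):
        # Stand-in for copying the KV cache to host RAM/NVMe,
        # freeing accelerator memory for other requests.
        self.kv_location = "ram"

    def reload(self):
        # Stand-in for paging the KV cache back before decoding resumes.
        self.kv_location = "gpu"

async def tool_call(req: Request, call):
    req.offload()          # free HBM during the wait
    result = await call()  # external API runs; GPU serves other requests
    req.reload()           # restore KV cache, resume decoding
    return result

async def demo():
    req = Request(1)
    async def slow_api():
        await asyncio.sleep(0.01)  # pretend external tool latency
        return "tool result"
    out = await tool_call(req, slow_api)
    return out, req.kv_location

print(asyncio.run(demo()))  # prints "('tool result', 'gpu')"
```

The point is just that the wait is pure I/O from the serving side, so the cost is the two cache copies, not idle accelerator time.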
This common feature requires the user of the API to implement the tool; in this case, the user is responsible for running the code the API outputs. The post you replied to suggests that Gemini will run the code for the user behind the API call.
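The distinction can be seen in the standard client-side function-calling loop: the API only *emits* a tool call, and the user's own code executes it and sends the result back. A minimal sketch with a stubbed model (no real provider SDK; the message shapes are illustrative):

```python
# The user implements and runs the tools themselves.
TOOLS = {"add": lambda a, b: a + b}

def model_step(messages):
    # Stub: a real API would return either text or a tool call here.
    last = messages[-1]
    if last["role"] == "user":
        return {"tool_call": {"name": "add", "args": {"a": 2, "b": 3}}}
    return {"text": f"The result is {last['content']}."}

def chat(user_msg: str) -> str:
    messages = [{"role": "user", "content": user_msg}]
    reply = model_step(messages)
    while "tool_call" in reply:
        call = reply["tool_call"]
        # This line is the point: execution happens on the *user's* side.
        result = TOOLS[call["name"]](**call["args"])
        messages.append({"role": "tool", "content": result})
        reply = model_step(messages)
    return reply["text"]

print(chat("What is 2 + 3?"))  # prints "The result is 5."
```

With provider-side execution (as described for Gemini), the `while` loop above would live behind the API instead of in user code.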
Could you elaborate? I thought function calling was a common feature among models from different providers.