I am a paying customer with credits and the API endpoints rate-limited me to the...

square_usual · 2024-11-22T18:37:43 1732300663

When working with AI coding tools commit early, commit often becomes essential advice. I like that aider makes every change its own commit. I can always manicure the commit history later, I'd rather not lose anything when the AI can make destructive changes to code.

webstrand · 2024-11-22T19:21:37 1732303297

I can recommend https://github.com/tkellogg/dura for making auto-commits without polluting main branch history, if your tool doesn't support it natively

teaearlgraycold · 2024-11-22T20:25:14 1732307114

Why not just continue the migration manually?

htrp · 2024-11-22T18:37:13 1732300633

Control your own inference endpoints.

its_down_again · 2024-11-22T20:24:35 1732307075

Could you explain more on how to do this? e.g if I am using the Claude API in my service, how would you suggest I go about setting up and controlling my own inference endpoint?

handfuloflight · 2024-11-22T20:25:03 1732307103

You can't. He means by using the open source models.

datavirtue · 2024-11-22T20:30:45 1732307445

Runa local LLM tuned for coding on LM Studio. It has a server and provides endpoints.

datavirtue · 2024-11-22T20:29:15 1732307355

You aren't running against a local LLM?

TeMPOraL · 2024-11-22T20:55:28 1732308928

That's like asking if they aren't paying the neighborhood drunk with wine bottles for doing house remodeling, instead of hiring a renovation crew.

rybosome · 2024-11-22T22:53:34 1732316014

That’s funny, but open weight, local models are pretty usable depending on the task.

TeMPOraL · 2024-11-22T23:08:52 1732316932

You're right, but that's also subject to compute costs and time value of money. The calculus is different for companies trying to exploit language models in some way, and different for individuals like me who have to feed the family before splurging for a new GPU, or setting up servers in the cloud, when I can get better value by paying OpenAI or Claude a few dollars and use their SOTA models until those dollars run out.

FWIW, I am a strong supporter of local models, and play with them often. It's just that for practical use, the models I can run locally (RTX 4070 TI) mostly suck, and the models I could run in the cloud don't seem worth the effort (and cost).

alwayslikethis · 2024-11-23T01:01:17 1732323677

For the money for a 4070ti, you could have bought a 3090, which although less efficient, can run bigger models like Qwen2.5 32b coder. Apparently it performs quite well for code

rjh29 · 2024-11-23T00:43:15 1732322595

I guess the cost model doesn't work because you're buying gpu that you use about 0.1% of the day

neumann · 2024-11-22T23:08:35 1732316915

That's what my grandma did in the village in Hungary. But with schnapps. And the drunk was also the professional renovation crew.

rty32 · 2024-11-23T13:11:07 1732367467

Not everyone has a 4090 or M4 Max at home.