
> > Do these techniques train models while performing the modifications?

> Depends on what you mean by training; they do change the weights.

What I wonder: is there a separate model, not the LLM, that gets trained only on how to modify LLMs?

I imagine a model that could learn something like: “if I remove this whole network here, then the LLM runs 50% faster but drops 30% in accuracy for certain topics”, or “if I add these connections, the LLM will be able to solve more complex mathematical problems”.

So, a model that is not itself an LLM, but is trained on how to modify LLMs for certain goals.
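To make the idea concrete, here is a toy sketch of what I have in mind. It is not a claim about how this tool works. Everything in it is made up for illustration: the four "blocks", the `TRUE_DROP` numbers, and the `measure_accuracy_drop` stand-in that would, in reality, mean ablating part of the LLM and running a benchmark. The "separate model" here is just a linear predictor fit on (which blocks were removed, observed accuracy drop) pairs.

```python
from itertools import product

# Hypothetical: an LLM with 4 removable blocks. These numbers are synthetic.
TRUE_DROP = [0.30, 0.02, 0.10, 0.00]  # made-up accuracy drop per removed block

def measure_accuracy_drop(mask):
    """Stand-in for really ablating blocks and benchmarking the LLM."""
    return sum(d for d, removed in zip(TRUE_DROP, mask) if removed)

def fit_surrogate(samples, steps=2000, lr=0.1):
    """Fit drop ~ w . mask by plain gradient descent on squared error."""
    n = len(samples[0][0])
    w = [0.0] * n
    for _ in range(steps):
        grad = [0.0] * n
        for mask, drop in samples:
            err = sum(wi * mi for wi, mi in zip(w, mask)) - drop
            for i, mi in enumerate(mask):
                grad[i] += err * mi
        w = [wi - lr * g / len(samples) for wi, g in zip(w, grad)]
    return w

def predict(w, mask):
    """Predicted accuracy drop for a modification the surrogate is asked about."""
    return sum(wi * mi for wi, mi in zip(w, mask))

# "Training data": try every combination of removed blocks (1 = removed).
samples = [(m, measure_accuracy_drop(m)) for m in product((0, 1), repeat=4)]
weights = fit_surrogate(samples)

# Ask the surrogate about removing blocks 0 and 2 without touching the LLM.
estimate = predict(weights, (1, 0, 1, 0))
```

A real version would need far richer features than an on/off mask, and effects would not add up this neatly, but the shape of the question is the same: a small model that predicts what a given edit does to the big one.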

Is that how this tool works?



