Cool, what software? | Hacker News

modeless 4 months ago | on: Running GPT-OSS-120B at 500 tokens per second on N...

Cool, what software?

asabla 4 months ago

Initial testing has only been done with Ollama. Plan on testing out llama.cpp and vLLM when there is enough time.
