
The knowledge that's baked into those LLMs comes from sites like Stack Overflow. Without them, how can the LLMs learn new things?



That is the big question with LLMs: how can we tell whether what's being fed in is original content, or just the model's own output fed back in, like a recursive fractal?


That also means a lot of wrong information from Stack Overflow is probably baked into the training too. Hopefully they accounted for this during training, but there's no way of knowing.

I haven't really had many accuracy issues with GPT, but then again, I'm probably not savvy enough to spot them most of the time anyway.


If it's posted on Stack Overflow, it's not new; it's merely been published. If this is the bar for LLM "learning", then they are doomed to live in a hazy bubble of the recent past.


SO isn't the only source. Official docs/wikis, GitHub issues, etc., are also good knowledge sources.



