GPT-4 is an amazing achievement, but it is still just a language model. LLMs (large language models) are well documented in the literature, and GPT-4 is essentially a much larger version (more parameters) of those models. Training of LLMs is also well documented; GPT-4 has simply been trained on a very large subset of the Internet.
Of course there are proprietary models that will be improved versions of the academic LLMs, but there are no big secrets or mysteries.
The individual components are well documented, but which specific arrangements produce the best results is still very much an active research area.
As far as training goes, the differences between GPT-3 and GPT-3.5 (the latter being a smaller model!) demonstrate just how important fine-tuning and reinforcement learning from human feedback are to the quality of the model. Merely throwing more content from the Internet at it doesn't automatically improve things.
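For anyone wondering what "fine-tuning" concretely means here: it's roughly just continued training of the pretrained model on curated prompt/response demonstrations instead of raw web text, with RLHF layered on afterwards. A minimal sketch of that supervised step, using GPT-2 and a made-up two-example dataset as stand-ins (not anything OpenAI has published):

```python
# Sketch of supervised fine-tuning: keep training a pretrained causal LM,
# but on curated demonstrations rather than more web scrapes.
# Model name, data, and hyperparameters are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in for a much larger base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.train()

# Tiny hypothetical instruction-following dataset.
pairs = [
    ("Write a haiku about the sea.",
     "Waves fold into foam\nsalt wind combs the fading light\nthe tide keeps its word"),
    ("Summarize: the cat sat on the mat.",
     "A cat rested on a mat."),
]

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

for prompt, response in pairs:
    # Same next-token objective as pretraining, applied to the
    # concatenated prompt + desired response.
    text = prompt + "\n" + response + tokenizer.eos_token
    inputs = tokenizer(text, return_tensors="pt")
    outputs = model(**inputs, labels=inputs["input_ids"])
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    print(f"loss: {outputs.loss.item():.3f}")
```

The point is that nothing exotic happens at this stage; the leverage comes from what you train on (human-written demonstrations) rather than how much.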
Exactly, this has been my guess as well. They must have trained the model specifically to write poems, haikus, and other such things, so that the output looks much more polished than it really is.