Just a reminder that LLaMA is not open—in order to use it legally you have to ag...

immibis · on July 11, 2023

If anyone has a copyright claim to an LLM, the creators of the input data have more of a copyright claim than the company that trained it. There's a good chance they are not copyrightable at all. I'd bet there's a lot of people willing to take on that risk.

However, they might still fall under trade secret law.

fallingknife · on July 11, 2023

Why would an LLM be any less copyrightable than any other piece of software?

nokcha · on July 11, 2023

The "software" part of an LLM is pretty trivial -- the interesting piece is the the weights. Since the weights are mechanically generated by a computer, it can be argued that the weights are not copyrightable, just like a photograph taken by a monkey isn't copyrightable.

meithecatte · on July 11, 2023

The software is the matrix multiplication and gradient descent. We are talking about the numbers in the matrices. They are the output of a training algorithm, so we can only talk about the copyright on the training algorithm, and on its input data.

KingMob · on July 11, 2023

The model weights could be seen as a derived work, for which they didn't get the permission of the original copyright holders. Alternatively, it can be argued that the LLMs are no different than a fanfic writer trying to imitate the style of their favor author.

It's not obvious which way it will go, but I can see the point of those arguing that LLM data are ill-gotten gains.

abtinf · on July 11, 2023

For the same reason that phone books cannot have copyright.

eulers_secret · on July 11, 2023

People always bring this up like it’s a big deal, but most users aren’t interested in starting a business. We just wanna play with LLMs.

Frankly, I’m glad we don’t have a bunch of llamas in different skins being hawked like the current crop of “AI” startups that are just thin layers over OpenAI’s API.

kirill5pol · on July 11, 2023

That hasn’t been true for a while. Falcon 40B seemingly outperforms LLaMA 60B according to the OpenLLM leaderboard

https://huggingface.co/tiiuae/falcon-40b

lolinder · on July 11, 2023

Fair enough. I haven't really looked at Falcon as a replacement for LLaMA yet because it isn't supported by llama.cpp, but it looks promising.

logicchains · on July 11, 2023

Falcon is an open (Apache licensed) replacement for LLaMA, with a 40B version that's competitive with LLaMA 65B on benchmarks.