Viewing a single comment thread. View all comments

throwaway957280 t1_jcsjj07 wrote

Is OpenAI actually legally allowed to do that? How is using their model for training different from training on copyrighted data which all these models do?

19

Anjz t1_jcsktsf wrote

It's probably untested in courts, there's so many loopholes and variables too, what's considered a competing AI model? Companies usually just spew a bunch of stuff in their terms of use, some of which have no legal basis.

19

kex t1_jcsm7kh wrote

I'd say enjoy it while it lasts, at the very least

6

hughperman t1_jcswzfh wrote

Train a model that's designated as non-competing but open, then train another model from the output of that that's competing.

4

starstruckmon t1_jct0s11 wrote

They are. It's less to do with copyright and more to do with the fact that you signed the T&C before using their system ( and then broke ). It's simmilar to the LinkedIn data scraping case where the court ruled that it wasn't illegal for them to scrape ( nor did it violate copyright ) but they still got in trouble ( and had to settle ) because of violating the T&C.

One way around this is to have two parties, one generating and publishing the dataset ( doesn't violate T&C ) and another independant party ( who didn't sign the T&C ) fine-tuning a model on the dataset.

6

RoyalCities t1_jctcu1m wrote

Couldnt it be possible to set up a large community Q/A repositiry then? Just crowdsource whatever it outputs and document collectively.

2

bitchslayer78 t1_jcsz4s3 wrote

No they aren’t , they have no claim on transformers that would be google brain , but you don’t see alphabet throwing a sissy fit

1