frownGuy12 t1_jcsfnh7 wrote on March 19, 2023 at 5:07 AM

If OpenAI wants people to respect their IP they should take the word “open” out of their name. They scraped our data to train their models, after all. It’s not like OpenAI themselves aren’t pushing the boundaries of what’s acceptable when it comes to copyright law.

Legally it’s questionable, but ethically speaking I think it’s a fine idea.

throwaway957280 t1_jcsjj07 wrote on March 19, 2023 at 5:53 AM

Is OpenAI actually legally allowed to do that? How is using their model for training different from training on copyrighted data which all these models do?

Anjz t1_jcsktsf wrote on March 19, 2023 at 6:09 AM

It's probably untested in courts. There are so many loopholes and variables too: what's considered a competing AI model? Companies usually just spew a bunch of stuff in their terms of use, some of which has no legal basis.

kex t1_jcsm7kh wrote on March 19, 2023 at 6:28 AM

I'd say enjoy it while it lasts, at the very least

hughperman t1_jcswzfh wrote on March 19, 2023 at 9:02 AM

Train a model that's designated as non-competing but open, then train a second, competing model on the output of the first.

starstruckmon t1_jct0s11 wrote on March 19, 2023 at 9:57 AM

They are. It's less to do with copyright and more to do with the fact that you signed the T&C before using their system (and then broke them). It's similar to the LinkedIn data scraping case, where the court ruled that the scraping wasn't illegal (nor did it violate copyright), but the scraper still got in trouble (and had to settle) for violating the T&C.

One way around this is to have two parties: one generating and publishing the dataset (doesn't violate the T&C), and another, independent party (who didn't sign the T&C) fine-tuning a model on the dataset.

RoyalCities t1_jctcu1m wrote on March 19, 2023 at 12:29 PM

Couldn't it be possible to set up a large community Q/A repository then? Just crowdsource whatever it outputs and document it collectively.

[deleted] OP t1_jd0nazd wrote on March 20, 2023 at 11:44 PM

[removed]

BraianP t1_jdfjbq4 wrote on March 24, 2023 at 12:47 AM

so, open assistant?

bitchslayer78 t1_jcsz4s3 wrote on March 19, 2023 at 9:34 AM

No they aren’t. They have no claim on transformers; that would be Google Brain, but you don’t see Alphabet throwing a sissy fit.

Long19980 t1_jcsllwx wrote on March 19, 2023 at 6:20 AM

They can go cry about it.

yaosio t1_jcsob5z wrote on March 19, 2023 at 6:56 AM

The output of AI can't be copyrighted so OpenAI has no say in what somebody does with the output.

lxe t1_jcsqk7t wrote on March 19, 2023 at 7:29 AM

Copyright and license terms are different things.

yaosio t1_jcsqxwf wrote on March 19, 2023 at 7:35 AM

It doesn't matter what the license terms say if they can't be enforced.

Uptown-Dog t1_jct32n7 wrote on March 19, 2023 at 10:30 AM

I think you'd be dismayed at how easy it is to enforce these things when you have OpenAI money.

starstruckmon t1_jct0v0k wrote on March 19, 2023 at 9:58 AM

It's not about copyright

https://www.reddit.com/r/MachineLearning/comments/11v4h5z/-/jct0s11

objectdisorienting t1_jcsu3xk wrote on March 19, 2023 at 8:21 AM

Will be interesting to see where lawmakers and courts ultimately land on this, but the current status quo is that AI generated text and images (or any other works) cannot be copyrighted. In other words for now all output is public domain and OpenAI can kick rocks on this. A TOS violation just means you might get banned from using their service lol.

VertexMachine t1_jct3b51 wrote on March 19, 2023 at 10:33 AM

It's most likely enforceable, but even if it's not, they can simply ban OP for doing that. If OP is using their API in any way that's important to him, it's something to consider.

[P] The next generation of Stanford Alpaca

ThatInternetGuy t1_jcs253z wrote on March 19, 2023 at 2:57 AM