Chuyito t1_jcbu40y wrote on March 15, 2023 at 6:41 PM

1, We are about to see a new push for a "robots.txt" equivalent for training data. E.g If yelp had a "datarules.txt file indicating no training on its comments for private use. Idea being that you could specify a license which allows training on your data for open source, but not for profit. Benefit for yelp is similar to the original Netflix training data set we all used at some point.

2, Its going to create a massive push for open frameworks. I can see nvda going down the path of "Appliances" similar to what IBM and many tech companies did for servers with pre-installed software. Many of those were open-source software, configured and ready to use/tune to your app. If you want to adjust the weight on certain bias filters, but not write the model from scratch.. Having an in house instance of your "assistant" will be favorable to many (E.g. if you are doing research on bioFuels, chatGpt will sensor way too much in trying to push "green", and lose track of research in favor of policy.)

thecity2 t1_jcc8549 wrote on March 15, 2023 at 8:07 PM

Yes to point 1! Not enough people are talking about this aspect. The data wars are on imo. How will Google protect their mountains of video data for example.

lapurita t1_jcf4fs6 wrote on March 16, 2023 at 11:51 AM

Ugh I'm not psyched about this at all, it will just protect the big companies from competitors and result in worse products for everyone