lazybottle t1_jec8i0c wrote on March 30, 2023 at 11:19 PM Reply to comment by wind_dude in [D] Instruct Datasets for Commercial Use by JohnyWalkerRed Alpaca is not Apache 2.0 https://huggingface.co/datasets/tatsu-lab/alpaca#licensing-information > The dataset is available under the Creative Commons NonCommercial (CC BY-NC 4.0). Edit: I see the source of confusion. https://github.com/tatsu-lab/stanford_alpaca While the code is released under apache 2.0, the instruct dataset as pointed out by OP is not. One could potentially repro the steps, possibly with human ground truth, and release under a more amenable data license. Permalink Parent 1
lazybottle t1_jec8i0c wrote
Reply to comment by wind_dude in [D] Instruct Datasets for Commercial Use by JohnyWalkerRed
Alpaca is not Apache 2.0
https://huggingface.co/datasets/tatsu-lab/alpaca#licensing-information
> The dataset is available under the Creative Commons NonCommercial (CC BY-NC 4.0).
Edit: I see the source of confusion. https://github.com/tatsu-lab/stanford_alpaca
While the code is released under apache 2.0, the instruct dataset as pointed out by OP is not. One could potentially repro the steps, possibly with human ground truth, and release under a more amenable data license.