Viewing a single comment thread. View all comments

starstruckmon t1_jct06xj wrote

  • There's a already a couple high quality instruction datasets/compilations like FLAN that I think should also be mixed in.

  • Be sure to check the generated dataset for issues. Might require some cleanup like the original did.

3