visarga t1_itwxzgs wrote
Reply to comment by Southern-Trip-1102 in [D] What's the best open source model for GPT3-like text-to-text generation on local hardware? by AuspiciousApple
My experience is that models that have not had the instruction-tuning treatment don't behave nicely.
Southern-Trip-1102 t1_itwyur3 wrote
Could that be because Bloom was trained on a more varied dataset rather than being focused on English, since it was trained on multiple languages and programming languages?