Viewing a single comment thread. View all comments

visarga t1_itwxzgs wrote

My experience is that models that have not had the instruction tuning treatment don't behave nice.

1

Southern-Trip-1102 t1_itwyur3 wrote

Could that be because of Bloom being trained on a more varied datasets as opposed to being focused on English, as it was trained on multiple languages and programming langs?

2