visarga t1_it8wcpw wrote on October 21, 2022 at 8:02 PM

Reply to comment by Paladia in New open-source language model from Google AI: Flan-T5 🍮 by Ezekiel_W

Currently running it on my desktop, loaded via AutoModelForSeq2SeqLM.from_pretrained("ArthurZ/flan-t5-xl").

It seems to be very good at solving tasks where the necessary information is in the prompt, but not as good as GPT-3 at general knowledge and code generation. I think it could be considered a mini-GPT-3 you can run on your own machine. I'm thinking about building an agent inside the web browser on top of it, plus Whisper for speech.
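For anyone who wants to try the same thing, here is a rough sketch of loading the checkpoint with Hugging Face transformers and giving it a task where the facts are in the prompt. The checkpoint ID is the one from the call above and may have moved since; build_prompt, answer, and the prompt template are made-up names for illustration, not anything from the Flan-T5 release. It assumes transformers and PyTorch are installed.

```python
def build_prompt(context: str, question: str) -> str:
    """Put the facts the model needs directly into the prompt.

    The template is an illustrative assumption, not an official Flan-T5 format.
    """
    return (
        "Answer the question using only the context. "
        f"Context: {context} Question: {question}"
    )


def answer(prompt: str,
           model_name: str = "ArthurZ/flan-t5-xl",
           max_new_tokens: int = 64) -> str:
    """Load a seq2seq checkpoint and generate a reply for one prompt."""
    # Imported here so build_prompt can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

    # T5-style models are encoder-decoder: encode the prompt, then decode.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the weights on first use, several GB for the XL size):
# prompt = build_prompt("The meeting was moved to Friday at 3 PM.",
#                       "When is the meeting?")
# print(answer(prompt))
```

Reloading the model on every call is wasteful; for anything beyond a quick test, load the tokenizer and model once and reuse them.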

bortvern t1_it8x6td wrote on October 21, 2022 at 8:07 PM

Is there a tutorial for setting this up somewhere, or do you recommend just reading the docs on the t5x project?

very_bad_programmer t1_itagith wrote on October 22, 2022 at 3:17 AM

How is it conversationally?

visarga t1_itau1y1 wrote on October 22, 2022 at 5:43 AM

I can't seem to get conversation out of it. It's a T5 variant, and it seems to be geared towards task solving. There is also a GPT variant that they haven't released.