Submitted by Ezekiel_W t3_ya1oih in singularity
visarga t1_it8wcpw wrote
Reply to comment by Paladia in New open-source language model from Google AI: Flan-T5 🍮 by Ezekiel_W
Currently running it on my desktop, AutoModelForSeq2SeqLM.from_pretrained("ArthurZ/flan-t5-xl")
Seems to be very good at solving tasks that have the necessary information in the prompt, but not as great for general knowledge and code generation compared to GPT-3. I think it could be considered like a mini-GPT-3 you can run on your machine. I'm thinking about doing and agent inside the web browser on top of it + Whisper for speech.
bortvern t1_it8x6td wrote
Is there a tutorial for setting this up somewhere, or you recommend just reading the docs on the t5x project?
very_bad_programmer t1_itagith wrote
How is it conversationally?
visarga t1_itau1y1 wrote
Can't seem to get conversation from it, it's a T5 variant, and it seems to be geared towards task solving. There is also a GPT variant they don't release.
Viewing a single comment thread. View all comments