
visarga t1_it8wcpw wrote

Currently running it on my desktop with Hugging Face transformers: AutoModelForSeq2SeqLM.from_pretrained("ArthurZ/flan-t5-xl")

Seems to be very good at solving tasks where the necessary information is in the prompt, but not as good as GPT-3 at general knowledge and code generation. I think of it as a mini-GPT-3 you can run on your own machine. I'm thinking about building an agent inside the web browser on top of it, plus Whisper for speech.
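For anyone who wants to try the same thing, here is a minimal sketch of the load-and-generate step. It assumes the transformers and torch packages are installed (the T5 tokenizer likely needs sentencepiece as well). The checkpoint name is the one from the comment above; the build_prompt and answer helpers and the prompt wording are hypothetical, made up for illustration, not a format the model requires. The XL checkpoint has roughly 3 billion parameters, so expect a download of several gigabytes.

```python
# Checkpoint name taken from the comment above.
MODEL_ID = "ArthurZ/flan-t5-xl"


def build_prompt(context: str, question: str) -> str:
    """Hypothetical helper: put the needed facts directly in the prompt,
    which is the kind of task the comment says the model handles well."""
    return (
        "Answer the question using the context.\n\n"
        f"Context: {context}\n\n"
        f"Question: {question}"
    )


def answer(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 64) -> str:
    """Load the seq2seq model and generate a reply for one prompt.

    The import is done here so the module can be loaded without
    transformers installed; the first call downloads the weights.
    """
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # Tokenize to PyTorch tensors and generate with default (greedy) decoding.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (not executed here, since it downloads the model):
#   prompt = build_prompt("The meeting was moved to Friday.", "When is the meeting?")
#   print(answer(prompt))
```

This reloads the model on every call to keep the sketch short; for repeated use you would load the tokenizer and model once and reuse them.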

13

bortvern t1_it8x6td wrote

Is there a tutorial for setting this up somewhere, or do you recommend just reading the docs for the t5x project?

8