zoontechnicon
zoontechnicon t1_j69b6g5 wrote
Reply to comment by zoontechnicon in [D] Simple Questions Thread by AutoModerator
The solution, as evidenced by code in huggingface/transformers is to force the probability of the end token to -Inf. What a hack...
zoontechnicon t1_j5q9atd wrote
Reply to comment by Cyclone4096 in [D] Simple Questions Thread by AutoModerator
Would you mind giving more details about the domain and the purpose of the loss function? Maybe people can give you hints based on that.
zoontechnicon t1_j5oizdf wrote
Reply to comment by kernel_KP in [D] Simple Questions Thread by AutoModerator
You could build an autoencoder using CNNs and use the latent vectors as input to a clustering algorithm.
zoontechnicon t1_j5oiraa wrote
Reply to [D] Simple Questions Thread by AutoModerator
I'm trying to use this model to summarize text: https://huggingface.co/bigscience/mt0-large Text generation seems to end after the special end token </s> however. I wonder how I would coax it to generate longer texts. Any ideas?
zoontechnicon t1_je10zfj wrote
Reply to comment by Flag_Red in [P] 🎉 Announcing Auto-Analyst: An open-source AI tool for data analytics! 🎉 by aadityaubhat
> Auto-Analyst leverages the OpenAI API
I feel like frontends for OpenAI/ChatGPT do not belong here