
Disastrous_Elk_6375 t1_jdlj4rn wrote

> and uses a different base model and claims it’s a big innovation

Huh? My read of their blog was that they wanted to highlight the fact that you can fine-tune a ~2-year-old LLM and still get decent results. I don't think they claimed this is innovative, or that the innovation is theirs to boast about...

I played with GPT-Neo (non-X) and GPT-J when they were released, and the results were rough. You had to do a ton of prompt-engineering work and exploration to find useful cases. This shows that even smaller, older models can be fine-tuned with the method proposed in Alpaca.
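For context, the "Alpaca method" here is basically supervised fine-tuning on ~52k instruction/response pairs rendered into a fixed prompt template. A rough sketch from memory (the exact strings live in the Stanford Alpaca repo, so wording may differ slightly):

```python
# Rough sketch of the Alpaca-style prompt template (from memory; the exact
# strings live in the Stanford Alpaca repo). Each record in the instruction
# dataset is rendered into one training string: prompt + expected response.
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:\n"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)

def build_example(record: dict) -> str:
    """Render one instruction record into a single training string."""
    if record.get("input"):
        prompt = PROMPT_WITH_INPUT.format(
            instruction=record["instruction"], input=record["input"]
        )
    else:
        prompt = PROMPT_NO_INPUT.format(instruction=record["instruction"])
    return prompt + record["output"]
```

None of this is specific to LLaMA, which is why the same data and template can be pushed through an older base model like GPT-J.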

14

SeymourBits t1_jdlkln7 wrote

I second this. I was able to extract fairly useful results from Neo, but it took a huge amount of prompt trial and error; I eventually got decent/stable results, but not in the same ballpark as GPT-3+. The Dolly training results here seem good, if not expected. I'm now ready to move to a superior model like LLaMA/Alpaca though. What are you running?

7

dreamingleo12 t1_jdll44j wrote

I’ve been experimenting with Alpaca and was able to fine-tune it on the provided dataset in 40 minutes with 8 A100s on spot instances. It actually works well.

3

Daveboi7 t1_jdm8aby wrote

What platform are you using for training?

2

dreamingleo12 t1_jdn511a wrote

By platform you mean?

2

Daveboi7 t1_jdnczd9 wrote

My bad. Did you train the model locally on your PC or in the cloud?

1

dreamingleo12 t1_jdndszl wrote

I trained the model in the cloud.

2

Daveboi7 t1_jdndvq0 wrote

With Databricks?

1

dreamingleo12 t1_jdndzmt wrote

No, I don’t use Databricks. I’ve only tried LLaMA and Alpaca.

1

Daveboi7 t1_jdnedrd wrote

But which cloud service did you use to train them?

I tried using Databricks to train a model but the setup was too complicated.

I’m wondering if there is a more straightforward platform to train on?

1

dreamingleo12 t1_jdnel6b wrote

You can just follow the Stanford Alpaca GitHub instructions, as long as you have the LLaMA weights. It’s straightforward.
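Roughly, it boils down to a standard causal-LM fine-tune over alpaca_data.json. A minimal sketch with the plain Hugging Face Trainer (not the actual Alpaca train.py, which launches with torchrun and FSDP; the paths and hyperparameters below are placeholders):

```python
# Minimal sketch: fine-tune a converted LLaMA checkpoint on the Alpaca
# instruction data with the Hugging Face Trainer. This is NOT the official
# Stanford Alpaca train.py; paths, batch sizes, and the learning rate are
# placeholder assumptions.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

MODEL_PATH = "path/to/hf-converted-llama-7b"  # placeholder: your converted LLaMA weights
DATA_PATH = "alpaca_data.json"                # the 52k-example file from the Alpaca repo

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH)

def to_text(example):
    # Render instruction/input/output into one training string
    # (same idea as the Alpaca prompt template).
    prompt = f"### Instruction:\n{example['instruction']}\n\n"
    if example.get("input"):
        prompt += f"### Input:\n{example['input']}\n\n"
    return {"text": prompt + f"### Response:\n{example['output']}"}

def tokenize(example):
    return tokenizer(example["text"], truncation=True, max_length=512)

dataset = (
    load_dataset("json", data_files=DATA_PATH, split="train")
    .map(to_text)
    .map(tokenize, remove_columns=["instruction", "input", "output", "text"])
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="alpaca-llama-sft",
        per_device_train_batch_size=4,
        gradient_accumulation_steps=8,
        num_train_epochs=3,
        learning_rate=2e-5,
        bf16=True,
        logging_steps=10,
    ),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

On a multi-GPU box you’d typically launch something like this under torchrun or accelerate rather than as a single process.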

2

Daveboi7 t1_jdneqdx wrote

Ah. I’m trying to train the Dolly model developed by Databricks.

1

dreamingleo12 t1_jdnewt2 wrote

It’s just Alpaca with a different base model. Databricks boasted too much.

1

Daveboi7 t1_jdnf18o wrote

Yeah but the comparisons I have seen between Dolly and Alpaca look totally different.

Somehow the Dolly answers look much better imo

Edit: spelling

1

dreamingleo12 t1_jdnf4qn wrote

I don’t trust DB’s results tbh. LLaMA is a better model than GPT-J.

2

Daveboi7 t1_jdnf96e wrote

Somebody posted results on Twitter and they looked pretty good. I don’t think he worked for DB either. But who knows, really.

1

dreamingleo12 t1_jdlkbxl wrote

WSJ:

“Databricks Launches ‘Dolly,’ Another ChatGPT Rival. The data-management startup introduced an open-source language model for developers to build their own AI-powered chatbot apps.” (Apparently DB paid them.)

DB’s blog:

“Democratizing the magic of ChatGPT with open models”

Introduced? ChatGPT rival? Didn’t you just follow Stanford’s approach? You used Stanford’s dataset, which was generated by GPT, right? Huh? This is Stanford’s achievement, not DB’s. DB went too far with the marketing.

1

Disastrous_Elk_6375 t1_jdllii0 wrote

> https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html

This is the blog post that I've read. I can't comment on the WSJ article, and your original message implied a bunch of things that, IMO, were not found in the blog post. If you don't like the WSJ angle, your gripe should be with them, not Databricks. shrug

From the actual blog:

> We show that anyone can take a dated off-the-shelf open source large language model (LLM) and give it magical ChatGPT-like instruction following ability by training it in 30 minutes on one machine, using high-quality training data.

> Acknowledgments
>
> This work owes much to the efforts and insights of many incredible organizations. This would have been impossible without EleutherAI open sourcing and training GPT-J. We are inspired by the incredible ideas and data from the Stanford Center for Research on Foundation Models and specifically the team behind Alpaca. The core idea behind the outsized power of small dataset is thanks to the original paper on Self-Instruct. We are also thankful to Hugging Face for hosting, open sourcing, and maintaining countless models and libraries; their contribution to the state of the art cannot be overstated.

More to the point of your original message, I searched for "innovative", "innovation", and "innovate" and found 0 results in the blog post. I stand by my initial take: the blog post was fair, informative, and pretty transparent about what they did, how, and why.

7

dreamingleo12 t1_jdllxww wrote

Well, if you’d ever worked with marketing or communications teams you’d know that DB co-authored the WSJ article. My point is that the democratization is an achievement of the Stanford Alpaca team, not DB. DB marketed it as if they did the major work, which is untrue.

−6

Disastrous_Elk_6375 t1_jdlm6qd wrote

That's fair. But you commented out of context, on a post that linked to the blog and not the WSJ article. That's on you.

6

dreamingleo12 t1_jdlmhcq wrote

Well, if you had connections you would’ve seen that they made a good number of posts.

−6