dreamingleo12 t1_jdndszl wrote

I trained the model using a cloud service

2

Daveboi7 t1_jdndvq0 wrote

With Databricks?

1

dreamingleo12 t1_jdndzmt wrote

No, I don’t use Databricks. I’ve only tried LLaMA and Alpaca.

1

Daveboi7 t1_jdnedrd wrote

But which cloud service did you use to train them?

I tried using databricks to train a model but the setup was too complicated.

I’m wondering is there a more straightforward platform to train on?

1

dreamingleo12 t1_jdnel6b wrote

You can just follow the Stanford Alpaca GitHub instructions, as long as you have the LLaMA weights. It’s straightforward.
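
For a sense of what that recipe boils down to, here’s a rough Python sketch of Alpaca-style instruction fine-tuning with Hugging Face Transformers. This is not the repo’s actual `train.py`: the model path, max length, and hyperparameters are illustrative assumptions, and the real repo launches with `torchrun` plus FSDP for multi-GPU training.

```python
# Rough sketch of Alpaca-style instruction fine-tuning with Hugging Face
# Transformers. NOT the Stanford Alpaca repo's train.py; paths, max_length,
# and hyperparameters below are illustrative assumptions.
import json

from datasets import Dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

MODEL_PATH = "./llama-7b-hf"      # assumed path to HF-converted LLaMA weights
DATA_PATH = "./alpaca_data.json"  # 52K instruction dataset from the Alpaca repo

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
tokenizer.pad_token = tokenizer.eos_token  # LLaMA's tokenizer has no pad token
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH)

# Simplified version of the Alpaca prompt template (no-input variant).
PROMPT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n{output}"
)

def tokenize(example):
    # str.format ignores extra keys, so records with an "input" field still work.
    text = PROMPT.format(**example) + tokenizer.eos_token
    return tokenizer(text, truncation=True, max_length=512)

with open(DATA_PATH) as f:
    records = json.load(f)
dataset = Dataset.from_list(records)
dataset = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="./alpaca-out",
        num_train_epochs=3,
        per_device_train_batch_size=4,
        gradient_accumulation_steps=8,
        learning_rate=2e-5,
        warmup_ratio=0.03,
        lr_scheduler_type="cosine",
        bf16=True,
        logging_steps=10,
    ),
    train_dataset=dataset,
    # mlm=False gives plain causal-LM labels (labels == input_ids).
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```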

2

Daveboi7 t1_jdneqdx wrote

Ah. I’m trying to train the Dolly model developed by Databricks.

1

dreamingleo12 t1_jdnewt2 wrote

It’s basically Alpaca with a different base model. Databricks overhyped it.
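
If that framing is right (my reading, not a Databricks statement), swapping the base model in a script like the sketch above would be roughly a one-line change:

```python
# Hypothetical: same Alpaca-style recipe, GPT-J base instead of LLaMA.
# Dolly's original release was built on GPT-J 6B.
MODEL_PATH = "EleutherAI/gpt-j-6b"
```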

1

Daveboi7 t1_jdnf18o wrote

Yeah, but the comparisons I’ve seen between Dolly and Alpaca look totally different.

Somehow the Dolly answers look much better imo

Edit: spelling

1

dreamingleo12 t1_jdnf4qn wrote

I don’t trust DB’s results tbh. LLaMA is a better model than GPT-J.

2

Daveboi7 t1_jdnf96e wrote

Somebody posted results on Twitter and they looked pretty good. I don’t think he worked for DB either. But who knows really

1