darkshenron t1_jd9lmz6 wrote on March 22, 2023 at 8:32 PM

I was looking for something similar and realised you can just apply an appropriate system prompt to GPT4 to narrow its focus. Some variant of “you are a helpful programming assistant. You help users answer questions related to programming in python language. If the question is not related to programming you decline to answer. “

currentscurrents t1_jdaq9xo wrote on March 23, 2023 at 1:11 AM

Right, but you're still loading the full GPT4 to do that.

The idea is that domain-specific chatbots might have better performance at a given model size. You can see this with StableDiffusion models, the ones trained on just a few styles have much higher quality than the base model - but only for those styles.

This is basically the idea behind mixture of experts.

darkshenron t1_jdc4wvn wrote on March 23, 2023 at 10:25 AM

Yes from an efficient angle for sure