darkshenron t1_jd9lmz6 wrote
I was looking for something similar and realised you can just apply an appropriate system prompt to GPT4 to narrow its focus. Some variant of “you are a helpful programming assistant. You help users answer questions related to programming in python language. If the question is not related to programming you decline to answer. “
currentscurrents t1_jdaq9xo wrote
Right, but you're still loading the full GPT4 to do that.
The idea is that domain-specific chatbots might have better performance at a given model size. You can see this with StableDiffusion models, the ones trained on just a few styles have much higher quality than the base model - but only for those styles.
This is basically the idea behind mixture of experts.
darkshenron t1_jdc4wvn wrote
Yes from an efficient angle for sure
Viewing a single comment thread. View all comments