Submitted by mrx-ai t3_zgr7nr in MachineLearning
jcasper t1_izj4owg wrote
Reply to comment by shadowknight094 in [R] Large language models are not zero-shot communicators by mrx-ai
Zero-shot means a large language model (LLM) performs a task without seeing any examples of the task being done. One-shot or few-shot means the prompt includes one or a few examples of the task before the actual query.
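To make the distinction concrete, here is a minimal sketch of how the two kinds of prompt differ. The `build_prompt` helper and the "Input:/Output:" layout are hypothetical, made up for illustration; they are not from the paper or any particular library.

```python
def build_prompt(instruction, query, examples=()):
    """Build a text prompt for an LLM.

    With no examples the prompt is zero-shot; with k examples it is k-shot.
    The "Input:/Output:" layout is an assumption chosen for illustration.
    """
    lines = [instruction]
    # Each in-prompt example shows the task being done once.
    for source, target in examples:
        lines.append(f"Input: {source}")
        lines.append(f"Output: {target}")
    # The actual query comes last, with the output left for the model to fill in.
    lines.append(f"Input: {query}")
    lines.append("Output:")
    return "\n".join(lines)


instruction = "Translate English to French."

# Zero-shot: only the instruction and the query, no worked examples.
zero_shot = build_prompt(instruction, "cheese")

# Few-shot (here two-shot): the same query, preceded by worked examples.
few_shot = build_prompt(
    instruction,
    "cheese",
    examples=[("hello", "bonjour"), ("thank you", "merci")],
)

print(zero_shot)
print("---")
print(few_shot)
```

In both cases the examples live only in the prompt; the model's weights are not updated.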
soraki_soladead t1_izjpxd6 wrote
fwiw, it is very difficult to know if the model has seen the task (or similar tasks) before or not due to the nature of the data collection
I feel like zero shot / few shot has taken a much less rigorous tone when applied to LLMs
FutureIsMine t1_izkmv5n wrote
Hmm, to some extent I think you might be onto something. On the other hand, seeing enough data means the model has seen bits and pieces of a zero-shot task, so it's not exactly "seen", but it's not a brand-new, novel task either; it's just piecing it together from multiple other tasks.
Same_Smoke6922 t1_izkdzgw wrote
Giving some examples in the prompt is zero-shot. Few shot means using few examples in the training.