songstar13 t1_j57nunw wrote

You can ask a translator a question about the world, and if they have knowledge of that topic, they can answer you with certainty.

Current GPT models are basically super-powered predictive text bots that answer questions. It would be like trying to answer a question using the suggested words on your phone keyboard, but far more sophisticated.
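As a rough illustration, here is a minimal sketch of that "predictive text" loop: generate one word at a time from a probability distribution over possible next words, then feed the result back in and repeat. The `next_word_probs` function here is a made-up, hard-coded stand-in; in a real GPT model it would be a large neural network scoring every token in its vocabulary.

```python
import random

def next_word_probs(context):
    # Stand-in for a real language model: in GPT this would be a
    # neural network scoring every vocabulary token given the
    # context. Here it's just a hard-coded toy distribution.
    toy = {
        "the": {"cat": 0.5, "dog": 0.3, "<end>": 0.2},
        "cat": {"sat": 0.6, "ran": 0.2, "<end>": 0.2},
        "dog": {"barked": 0.7, "<end>": 0.3},
    }
    return toy.get(context[-1], {"<end>": 1.0})

def generate(prompt, max_words=10):
    words = prompt.split()
    for _ in range(max_words):
        probs = next_word_probs(words)
        # Sample the next word in proportion to its probability,
        # then append it and loop -- that's the whole generation process.
        choices, weights = zip(*probs.items())
        word = random.choices(choices, weights=weights)[0]
        if word == "<end>":
            break
        words.append(word)
    return " ".join(words)

print(generate("the"))  # e.g. "the cat sat"
```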

They are fully capable of lying to you or giving inconsistent answers to the same question, because they don't "know" anything other than patterns of word association and grammar rules.

At least, this was my understanding of them fairly recently. Please correct me if that has changed.

1

Surur t1_j57w5tj wrote

I imagine you understand that LLMs are a bit more sophisticated than Markov chains. GPT-3, for example, has 175 billion parameters, which are loosely analogous to the connections between neurons in the brain, and the weights of these connections influence which word the system outputs.

These weights allow the LLM to see the connections between words and understand the concepts much like you do. Sure, they do not have a visual or intrinsic physical understanding, but they do have clusters of 'neurons' which activate for both 'animal' and 'cat', for example.
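A toy way to see what "connections between words" means: words get mapped to vectors, and related words end up near each other. The three-dimensional vectors below are invented purely for illustration; real models learn vectors with hundreds or thousands of dimensions from data.

```python
import math

# Made-up 3-d embeddings, just to illustrate the idea.
embeddings = {
    "cat":    [0.9, 0.8, 0.1],
    "animal": [0.8, 0.9, 0.2],
    "car":    [0.1, 0.2, 0.9],
}

def cosine_similarity(a, b):
    # Standard cosine similarity: 1.0 means identical direction.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# "cat" sits much closer to "animal" than to "car".
print(cosine_similarity(embeddings["cat"], embeddings["animal"]))  # ~0.99
print(cosine_similarity(embeddings["cat"], embeddings["car"]))     # ~0.30
```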

In short, a Markov chain uses a look-up table to predict the next word, while an LLM uses a deep (in GPT-3's case, 96-layer) neural network with 175 billion parameters, tuned on a huge fraction of the text on the internet, to choose its next word.
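For comparison, a bigram Markov chain really is just a look-up table built by counting which word follows which, with no notion of meaning at all. A minimal sketch, using a tiny made-up corpus:

```python
import random
from collections import defaultdict

corpus = "the cat sat on the mat the cat ran".split()

# Build the look-up table: for each word, record every word
# that was observed to follow it in the corpus.
table = defaultdict(list)
for prev, nxt in zip(corpus, corpus[1:]):
    table[prev].append(nxt)

def markov_generate(word, n=5):
    out = [word]
    for _ in range(n):
        followers = table.get(word)
        if not followers:
            break
        word = random.choice(followers)  # pick a recorded follower at random
        out.append(word)
    return " ".join(out)

print(markov_generate("the"))  # e.g. "the cat sat on the mat"
```

That table look-up is the entire "model"; everything an LLM adds beyond it lives in those billions of learned weights.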

Just because it confabulates sometimes does not mean it's all smoke and mirrors.

11

songstar13 t1_j58shuh wrote

Thank you for the more detailed explanation! I was definitely underestimating how much more complex some of these AI models have become.

3