Submitted by LevelWriting t3_zk0qek in singularity
katiecharm t1_j00db5z wrote
Reply to comment by Relative_Rich8699 in Character ai is blowing my mind by LevelWriting
I haven’t used it yet, but I highly doubt it’s GPT-2 if it’s impressive. GPT-2 is a neat trick, but I wouldn’t call it impressive here in 2022.
oopiex t1_j00z2f6 wrote
When I tried it, it wasn't impressive.
Relative_Rich8699 t1_j02bg4c wrote
Agreed, but if it's using LaMDA or something more advanced than BERT/GPT-2, why is it hallucinating and giving me incorrect information about its own platform?
fingin t1_j031gnr wrote
Even GPT-4 will make silly mistakes. That's what happens when a model is trained to find probable word sequences instead of actually having knowledge of language like people do.
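To make the "probable word sequences" point concrete, here's a toy sketch. It is not how GPT or LaMDA actually work (those are transformer networks over subword tokens); it's just a bigram counter, and the tiny corpus and the function names are made up for illustration. The idea it shares with the big models is that it picks whatever usually came next in its training text, with no notion of whether that is true.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count, for every word, which words follow it in the training text."""
    counts = defaultdict(Counter)
    words = text.lower().split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def most_probable_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def generate(counts, start, length=5):
    """Greedily extend `start` by repeatedly taking the most probable next word."""
    out = [start]
    for _ in range(length):
        nxt = most_probable_next(counts, out[-1])
        if nxt is None:
            break
        out.append(nxt)
    return " ".join(out)


# Hypothetical training text: "swim" follows "can" more often than "fly" does.
corpus = "ducks can swim . ducks can fly . ducks can swim ."
model = train_bigrams(corpus)

print(generate(model, "ducks", length=2))
print(most_probable_next(model, "geese"))
```

Ask it about ducks and it answers "swim" because that was the more common continuation, not because it knows anything about ducks. Ask it about geese and it has nothing at all to go on.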
Relative_Rich8699 t1_j033bjo wrote
Yes. But I was speaking to "the company's" bot on purpose and I would only say that it should be trained with company data for those questions. When I inquire about ducks it can use the world's written word.
fingin t1_j03181d wrote
I asked the character.ai bot what model it used and it told me T5. Insisted, even. Regardless of the veracity of this, all of these models use a transformer-based architecture, with improvements between model versions coming from more parameters (and correspondingly larger and higher-quality training data sets). Crazy to think that in two months we might be at GPT-4 level and laugh about this tech we are blown away by today.