mettle
mettle t1_j7hrck0 wrote
Reply to comment by yeluapyeroc in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
is it though? how would you even do that? i think if you have that actually figured out, it's easily a $1b idea.
mettle OP t1_j6jgkz8 wrote
Reply to comment by currentscurrents in [Discussion] ChatGPT and language understanding benchmarks by mettle
Sure, but the question is how often does it happen to get the right answer vs. the wrong answer, and how would we measure that?
mettle OP t1_j6imy6h wrote
Reply to comment by Jean-Porte in [Discussion] ChatGPT and language understanding benchmarks by mettle
perfect, thank you!
mettle OP t1_j6im95b wrote
Reply to comment by EmmyNoetherRing in [Discussion] ChatGPT and language understanding benchmarks by mettle
this is true so far, it would seem.
you'd think there'd be some clever folks trying to quantify things better.
mettle OP t1_j6im3ap wrote
Reply to comment by fmai in [Discussion] ChatGPT and language understanding benchmarks by mettle
Thanks for this thoughtful answer.
Re: 2, are there solid numbers we would conceptually even be able to get? Are there known ongoing efforts?
mettle OP t1_j6ilm6q wrote
Reply to comment by Jean-Porte in [Discussion] ChatGPT and language understanding benchmarks by mettle
Is there some alternative benchmark that measures factual accuracy of output?
Or is that impossible to create and use because any model would overfit to that data?
Submitted by mettle t3_10oyllu in MachineLearning
mettle t1_j3xtzp1 wrote
everyone's scaling back assistant efforts, though, and cortana is basically dead, so, interesting idea, but i don't think so.
mettle t1_j0mrkqp wrote
Reply to comment by CriticalTemperature1 in [D] ChatGPT, crowdsourcing and similar examples by mvujas
lots of implicit signals to look at based on what the user does after.
mettle t1_j7i5ign wrote
Reply to comment by farmingvillein in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada
the true human in the loop.