ThePhantomPhoton t1_jdsyzhn wrote

It’s easier to gauge the effectiveness of these large language models within the context of what they are actually doing: predicting likely continuations of text they’ve seen during training, conditioned on the prompt provided by the user. They are not “reasoning,” although the language they produce can lead us to believe that is the case. If you’re disappointed by their coding, you will certainly be disappointed by their mathematics.
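Concretely, that “repeating language” is just a next-token loop. Here’s a rough sketch (using the Hugging Face transformers library and the small public gpt2 checkpoint purely for illustration, not any particular model under discussion):

```python
# Minimal sketch of what an LLM actually does: repeatedly predict the
# next token given the prompt plus everything generated so far.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

ids = tokenizer.encode("The proof follows by", return_tensors="pt")
with torch.no_grad():
    for _ in range(20):
        logits = model(ids).logits        # scores over the whole vocabulary
        next_id = logits[0, -1].argmax()  # greedily take the most likely token
        ids = torch.cat([ids, next_id.view(1, 1)], dim=-1)

print(tokenizer.decode(ids[0]))
```

No step in that loop checks the output against the world; it only checks it against the statistics of the training text, which is the whole point.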

2

ThePhantomPhoton t1_iywi3sq wrote

That’s a very good story! The biggest challenge in building these chatbots is that, as the generated text grows longer, they tend toward “untruths” once the earlier text falls outside their context windows and the story moves from one “scene” to another. For instance, if this story continued, it’s possible we would start to see Batman using X-ray vision and saving Lois Lane. It’s a tough nut to crack given finite memory.
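To make the context-window point concrete, here’s a toy sketch (made-up window size, no real model involved):

```python
# Toy illustration of the finite-context problem: the model conditions
# only on the last `CONTEXT_WINDOW` tokens, so facts established early
# in a long story eventually fall out of view entirely.
story = ["Batman", "has", "no", "superpowers", "."]
story += ["filler"] * 2000  # the story keeps going, scene after scene

CONTEXT_WINDOW = 1024  # hypothetical window size

def visible_context(tokens, window=CONTEXT_WINDOW):
    # This slice is everything the model can condition on when
    # predicting the next word; anything earlier is simply gone.
    return tokens[-window:]

# False: the early fact has dropped out, so nothing in the conditioning
# context stops the model from handing Batman X-ray vision next scene.
print("superpowers" in visible_context(story))
```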

18

ThePhantomPhoton t1_iyk2wnq wrote

I think you have a good argument for images, but language is more challenging because we rely on positional encodings (a kind of “time”) to provide contextual clues that beat out a purely statistical language model of the form Pr(x_{t+1} | x_0, x_1, ..., x_t). (Edit: that is, predicting the next word in the sequence given all preceding words in the sequence.)
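For the curious, here’s a quick sketch of the standard sinusoidal positional encoding from the original Transformer paper (one common scheme among several), just to show what that “time” signal looks like:

```python
# Sinusoidal positional encodings from "Attention Is All You Need":
# each position gets a unique vector of sines and cosines, which is
# how the model recovers the ordering of the sequence.
import numpy as np

def positional_encoding(seq_len, d_model):
    pos = np.arange(seq_len)[:, None]  # (seq_len, 1)
    i = np.arange(d_model)[None, :]    # (1, d_model)
    angles = pos / np.power(10000, (2 * (i // 2)) / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles[:, 0::2])  # even dimensions get sine
    pe[:, 1::2] = np.cos(angles[:, 1::2])  # odd dimensions get cosine
    return pe

pe = positional_encoding(seq_len=50, d_model=512)
print(pe.shape)  # (50, 512): one encoding vector per position
```

These vectors get added to the token embeddings, so the next-word distribution above is conditioned not just on which words appeared but on where they appeared.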

2

ThePhantomPhoton t1_iw92cxj wrote

This is very interesting! I’m a fat neckbeard who works in medicine, but one of my colleagues who was interested in baseball went on to work for the Boston Red Sox. If you’re interested in these analyses, maybe ping a baseball team or two and see if they’d want this kind of work. Very cool topic for ML!

2