hpstring
hpstring t1_ja6pm05 wrote
Do you specifically mean applications in NLP? RL seems to have a lot of applications in fields like game playing, robotics, neural theorem proving, etc. which seems to have no direct connection with LLMs
hpstring t1_j9kkkr0 wrote
Interesting research! Let's try to convince artists with this kind of work
hpstring t1_j9kf6te wrote
Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
This is a very good answer! I want to add that apart from generalization, the fact that we have efficient optimization algorithms that can find quite good minima also contributes a lot to the deep learning magic.
hpstring t1_j9jxzpm wrote
Reply to comment by inspired2apathy in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Well the traditional ml + dimensionality reduction cannot crack e.g. imagenet recognition
hpstring t1_j9juk1f wrote
Reply to comment by GraciousReformer in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
Q1: We don't know yet. Q2: Probably there are other classes but they haven't been discovered or are only at the early age of research.
hpstring t1_j9jbhwv wrote
Reply to comment by Fancy-Jackfruit8578 in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer
This is correct for two-layer NNs, not general NNs.
hpstring t1_j9jb96f wrote
Universal approximation is not enough, you need efficiency to make things work.
DL is the only class of algorithms that beats the curse of dimensionality for discovering certain (very general) class of high dimensional functions(something related to Barron space). Point me out if this is not accurate.
hpstring t1_j9ens2o wrote
Reply to Best free and open Math AI? [D] by lorentzofthetwolakes
What level of math do you want to do?
hpstring t1_j8zw7i5 wrote
Reply to comment by baffo32 in [D] HuggingFace considered harmful to the community. /rant by drinkingsomuchcoffee
Lots of thanks! I didn't receive training from software engineering perspective, which seems to be an important aspect in machine learning.
hpstring t1_j8x5iyh wrote
Reply to comment by drinkingsomuchcoffee in [D] HuggingFace considered harmful to the community. /rant by drinkingsomuchcoffee
I'm a beginner in this field and I was wondering what it means for code to be "centralized" and "dry". Does "centralized" mean putting a lot of code in a single file and "dry" means raw code that is not very easy to read but is efficient or have some other advantages?
hpstring t1_ja6uzk4 wrote
Reply to comment by [deleted] in [D] Is RL dead/worth researching these days? by [deleted]
Understood. That depends on personal prediction of the research landscape in the future but I would say it is still researched by institutions like DeepMind. But both RL and LLM share a common aspect: they are very, very expensive.