hpstring t1_ja6uzk4 wrote on February 27, 2023 at 7:29 AM

Reply to comment by [deleted] in [D] Is RL dead/worth researching these days? by [deleted]

Understood. That depends on personal prediction of the research landscape in the future but I would say it is still researched by institutions like DeepMind. But both RL and LLM share a common aspect: they are very, very expensive.

hpstring t1_ja6pm05 wrote on February 27, 2023 at 6:22 AM

Reply to [D] Is RL dead/worth researching these days? by [deleted]

Do you specifically mean applications in NLP? RL seems to have a lot of applications in fields like game playing, robotics, neural theorem proving, etc. which seems to have no direct connection with LLMs

hpstring t1_j9kkkr0 wrote on February 22, 2023 at 5:39 PM

Reply to [R] Provable Copyright Protection for Generative Models by vyasnikhil96

Interesting research! Let's try to convince artists with this kind of work

hpstring t1_j9kf6te wrote on February 22, 2023 at 5:06 PM

Reply to comment by activatedgeek in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

This is a very good answer! I want to add that apart from generalization, the fact that we have efficient optimization algorithms that can find quite good minima also contributes a lot to the deep learning magic.

hpstring t1_j9jxzpm wrote on February 22, 2023 at 2:43 PM

Reply to comment by inspired2apathy in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

Well the traditional ml + dimensionality reduction cannot crack e.g. imagenet recognition

hpstring t1_j9juk1f wrote on February 22, 2023 at 2:18 PM

Reply to comment by GraciousReformer in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

Q1: We don't know yet. Q2: Probably there are other classes but they haven't been discovered or are only at the early age of research.

hpstring t1_j9jbhwv wrote on February 22, 2023 at 11:24 AM

Reply to comment by Fancy-Jackfruit8578 in [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

This is correct for two-layer NNs, not general NNs.

hpstring t1_j9jb96f wrote on February 22, 2023 at 11:22 AM

Reply to [D] "Deep learning is the only thing that currently works at scale" by GraciousReformer

Universal approximation is not enough, you need efficiency to make things work.

DL is the only class of algorithms that beats the curse of dimensionality for discovering certain (very general) class of high dimensional functions(something related to Barron space). Point me out if this is not accurate.

hpstring t1_j9ens2o wrote on February 21, 2023 at 10:59 AM

Reply to Best free and open Math AI? [D] by lorentzofthetwolakes

What level of math do you want to do?

hpstring t1_j8zw7i5 wrote on February 18, 2023 at 3:53 AM

Reply to comment by baffo32 in [D] HuggingFace considered harmful to the community. /rant by drinkingsomuchcoffee

Lots of thanks! I didn't receive training from software engineering perspective, which seems to be an important aspect in machine learning.

hpstring t1_j8x5iyh wrote on February 17, 2023 at 4:13 PM

Reply to comment by drinkingsomuchcoffee in [D] HuggingFace considered harmful to the community. /rant by drinkingsomuchcoffee

I'm a beginner in this field and I was wondering what it means for code to be "centralized" and "dry". Does "centralized" mean putting a lot of code in a single file and "dry" means raw code that is not very easy to read but is efficient or have some other advantages?