Submitted by Ok-Variety-8135 t3_11c9zum in singularity
turnip_burrito t1_ja2f02y wrote
Sure, you can do it if you have enough data, and a powerful enough computer.
Idk how you're going to do reinforcement learning to update the transformer weights though (I assume you want to use a transformer?). That's a lot of computation. The bigger your model is, the slower this update step will be.
Are you separating hearing and speaking/moving in time? Like are they separate steps that can't happen at the same time? My question then is why not make them simultaneous?
visarga t1_ja3637d wrote
A recent approach saves past experience data and loads it back for in-context-learning. The model itself can be task generic. So it learns by collecting new data.
Viewing a single comment thread. View all comments