Submitted by smallest_meta_review t3_yng63w in MachineLearning
smallest_meta_review OP t1_ivam34g wrote
Reply to comment by smurfpiss in [R] Reincarnating Reinforcement Learning (NeurIPS 2022) - Google Brain by smallest_meta_review
Yeah, or even across different classes of RL methods: reusing a policy for training a value-based RL (e.g, DQN) or model-based RL method.
Viewing a single comment thread. View all comments