Berke80 OP t1_j9gdg93 wrote on February 21, 2023 at 7:13 PM

Thanks, I’m guessing reinforcement learning is not as useful in advancing LLMs.

alexiuss t1_j9go6ip wrote on February 21, 2023 at 9:00 PM

Ye. It's way too easy to trick LaMDA into writing infinite lewd stories.

I wouldn't have a problem with that.