Viewing a single comment thread. View all comments

Berke80 OP t1_j9gdg93 wrote

Thanks, I’m guessing reinforcement learning is not as useful in advancing LLMs.

4

alexiuss t1_j9go6ip wrote

Ye. It's way too easy to trick lamda into writing infinite lewd stories.

1

Wyrade t1_j9gy1g1 wrote

I wouldn't have a problem with that.

4