Comment by rohtashotas

2 years ago

It's not a silly mistake. It was RLHF'd to do this intentionally.

When the results are more extremist than the unfiltered model's, it's no longer a 'small mistake'.

rohtashotas

RLHF: reinforcement learning from human feedback

Realistically, it was probably just how Gemini was prompted to use the image generator tool.