Comment by wepple, 1 year ago:
RLHF: reinforcement learning from human feedback

gnicholas, 1 year ago:
How is this pronounced out loud?

wepple, 1 year ago:
I was just saving folks a google, as I had no idea what the acronym was. I propose rill-hiff until someone who actually knows what they’re doing shows up!