Comment by porridgeraisin

1 year ago

Waymo uses reinforcement learning in the classic, pre-LLM sense of the term (TD3+BC, according to one of their blog posts).

EMMA is something they tried, but further down the article they explain why they don't use it as such yet.

Yep. It's an interesting experiment and really stretched my understanding of what an LLM is and can do.