Comment by porridgeraisin

1 year ago

Waymo uses reinforcement learning (what it was before LLMs) (TD3+BC according to one of their blogs)

Emma is something they tried, but further down the article they explain why they don't use it as such yet.

1 comment

porridgeraisin

Yep. It's an interesting experiment and really stretched my understanding of what an LLM is and can do.