Comment by subset
2 days ago
Ooh this looks really neat! I'd love to see more content in the future on Structured outputs/Guided generation and sampling. Another great reference on inference-time algorithms for sampling is here: https://rentry.co/samplers
Thanks for the recommendation, I'm actually working on something similar for this part of the docs (I'm also working at BentoML).
Wow that's really thorough