Slacker News Slacker News logo featuring a lazy sloth with a folded newspaper hat
  • top
  • new
  • show
  • ask
  • jobs
Library

Comment by tarruda

10 days ago

I'm only interested in the local, single user use case. Plus I use a Mac studio for inference, so vLLM is not an option for me.

1 comment

tarruda

Reply

mycall  9 days ago

You can get concurrency gains [0] as local/single user (multi-agent) use case with vLLM with your Mac Studio.

[0] https://youtu.be/Ze5XLooTt6g?t=658

Slacker News

Product

  • API Reference
  • Hacker News RSS
  • Source on GitHub

Community

  • Support Ukraine
  • Equal Justice Initiative
  • GiveWell Charities