Comment by IncreasePosts

10 hours ago

Maybe the margins are just very large for Google because they predict so much demand for 3.5?

This, combined with locally runnable models getting pretty good recently (e.g. Qwen 3.6), tells me that it's time to seriously consider a local dev setup again.

  • Besides the cost savings, you get control, transparency, and the ability to identify small language models or LoRA adapters that you can serve even more cost-effectively.

  • This should become Apple's new hardware and software play. I am hopeful about the new CEO.