Comment by cco

1 month ago

Not that far away, you can run a useful model on flagship phones today, something around GPT 3.5's level.

So we're probably only a few years out from today's SOTA models on our phones.

> you can run a useful model on flagship phones today

How?

  • Cactus Chat is probably the easiest.

    Just download the app and it has a few built-in model options. The best of those is probably Gemma-3 1B Q4 but on my Pixel 10 Pro I find the best performing model it can reasonably run is Qwen3 8B Q4_K_M.

    You can download and run any GGUF compatible model with that app.