Comment by dadachi

4 days ago

Same on mobile. I use Apple's Vision framework on-device to find people in photos for a printing app. Sending users' personal photos to an image-model API is a non-starter on privacy, latancy, and per-photo cost alone. Less flexible than a V-LLM, but for "find the people, give me box" it's instant, free, and works offline.