Comment by mlpicker

12 hours ago

What I'm curious about is whether this is actually on-device. Apple's framework caps local models around 3B params last I looked, and Claude is way bigger than that. So either there's some hybrid setup I haven't seen documented, or this is mostly a Claude SDK in FM clothing. Anyone tried it on a plane?

2 comments

mlpicker

brookst 8 hours ago

Read the linked article? It is absolutely a cloud service. Neither Apple nor Anthropic is suggesting otherwise

ABS 7 hours ago

it's cloud, the doc is explicit that requests go straight to api.anthropic.com with Apple not in the way.

so Claude via FM dies offline while Apple's on-device SystemLanguageModel (the ~3B one) keeps working. It isn't a hybrid really: the framework just has both implement the same LanguageModelSession protocol so "local 3B" and "remote frontier model" become a one-argument swap.

IMHO what's worth internalising is that the two share an API but nothing else: the on-device path runs on Apple's Neural Engine and costs battery (you can watch ANE power ramp while it works) while the cloud path costs API credits/tokens and does zero local compute. Same code, opposite cost model.