Comment by utopiah
2 months ago
Interesting, seems you are doing the recognition server-side, no reliable way to do it locally using e.g. WASM on the client?
2 months ago
Interesting, seems you are doing the recognition server-side, no reliable way to do it locally using e.g. WASM on the client?
Actually actively exploring this very topic! I have a feature-flag version where the inference runs via WASM / WebGPU (onnxruntime-web specifically).
My only pause behind rolling this out further is the performance isn't as fast as I'd like (1.5s~ latencies), and the widely varying support for WebGPU / WASM across browsers and OS pairs.
Still testing it out (and learning about ViT performance on various hardware), so hopefully more news on that front soon!