Comment by wren6991
13 hours ago
Apple muddied the waters by calling them "neural accelerators" but it seems like what they actually added in the M5 generation is tensor instructions for the existing GPU cores. It's not a separate accelerator like the ANE.
llama.cpp's Metal backend does use them when they're available.
No comments yet
Contribute on Hacker News ↗