Comment by freakynit

9 days ago

Does this prove cerebras chips are generic enough to be able to run the most common architectures of LLM's? Even the proprietary ones?

Not at all, the limitation is software to get the model on the chip and executing correctly. My bet is that they had a FDE who specializes in the chip implement Spark’s architecture on device.