Comment by adithyassekhar

13 hours ago

Skimmed through the page no mention of what spark is. Is it a new ISA? SoC with CPU, GPU and NPU? Or just GPU+AI?

It is the same CPU previously used in the overpriced DGX Spark.

It has 10 big Cortex-X925 cores, which are competitive with the Intel P-cores and with the AMD Zen cores, plus 10 small Cortex-A725 cores, which are similar in performance with the older Intel E-cores, from the Meteor Lake, Raptor Lake and Alder Lake generations. The current Intel E-cores are similar to Cortex-X4, i.e. they are much faster.

This Arm based CPU is more powerful that any Arm-based CPU previously used in a non-Apple PC, but in multi-threaded applications it is inferior to AMD Strix Halo CPUs.

The GPU of this is different from that of DGX, which was good only at ML/AI, but poor for graphics.

Here the GPU is likely to be good for graphics, and the top model will have up to 6144 FP32 execution units compared to 2560 of Strix Halo. But I assume that at least the top models will also be much more expensive than Strix Halo.

This NVIDIA CPU+GPU is limited to 128 GB of DRAM, while the successor of Strix Halo, which has been announced recently, offers up to 192 GB of DRAM, so NVIDIA continues its tradition of always providing less memory than its competitors, in order to have better profit margins.

From somewhere in the middle of Nvidia’s endless press waffle:

“The RTX Spark superchip features an NVIDIA Blackwell RTX GPU with 6,144 CUDA cores and fifth-generation Tensor Cores with FP4 precision, connected via the NVIDIA NVLink®-C2C chip-to-chip interconnect to a high-performance, 20-core NVIDIA Grace™ CPU.

MediaTek, a market leader in Arm-based system-on-a-chip designs, collaborated with NVIDIA on the custom CPU design, contributing to its best-in-class power efficiency, performance and connectivity.“

https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-...

Nvidia Grace is an ARM core.

  • Sooooo they binned all their old DGX Spark crap to pawn on poor clueless consumers to try to be an ARM fighter and probably still fail miserably behind Apple Silicon and even likely older Qualcomm metal running Windows.

    And Mediatek? Oof. I assume the SOC comes pre compromised out of the box.

    • > And Mediatek? Oof. I assume the SOC comes pre compromised out of the box.

      If by compromised you mean “china bad”, mediatek is taiwanese not chinese. Same home as asus, asrock, tsmc, htc, acer, d link, adata, biostar, insyde, gskill, foxconn, realtek and many others. From the chip to bios in your pc probably.

      If you mean quality, they make efficient and more powerful chipsets especially gpu wise compared to qualcomm. Most probably only know it from cheap chinese phones.

      6 replies →

    • It’s kind of irrelevant that they might be behind apple silicon because they’re targeting the non-apple, windows-using, section of the laptop market with a chip that can ostensibly be used for running AI models locally. Whether there’s much appetite for business users doing that remains to be seen.

      1 reply →

As best as I can tell its something like the Apple M series SoC, but for Windows: CPU + GPU with unified memory.

It has 6,144 CUDA cores is similar to a RTX 4070 (5,888) but a lot less than a 4090 (16,384), but what it does have is support for FP4.

When they claim "1 Petaflop AI compute", thats what they mean. For comparison, a RTX 4090 has ~1.3 Petaflops of FP8 processing.

The second big deal is the NVLink-C2C interconnect, which provides up to 900 GB/s of bidirectional bandwidth between GPU and CPU. For comparison, the Apple M4 has 120 GB/s and the M3 Ultra has 819 GB/s. Notably, the Apple M series does not have FP4 support, so this could mean a significant performance improvement over Apple's offerings.

  • I don't understand why this isn't bigger news - this is a laptop SoC with actual gaming hardware running on ARM - unlike Apple's M series, which tend to have rather underwhelming perf in games compared to what the specs would suggest, finally we can have a thin-and-light with an efficient gaming GPU.

    Considering how much Valve invested into ARM emulation, it's quite possible the next Steam Deck/handheld will use a variation of this (or at least there will be one using this as the SoC).

    • It's seeming to be going to be an another DGX Sparks that aren't so faster than maxed out Mac Studio, nor cheaper than 4x Blackwell on a workstation, nor cloud tokens. That's why.

      1 reply →

    • Well, we don't have any information on cost, battery life or performance yet, which all matter. Could very well run laps around the M series at half the battery life and twice the cost.

      1 reply →