← Back to context Comment by nicman23 10 hours ago > 260k contextwith a single 5090? 2 comments nicman23 Reply kgeist 7 hours ago Yep, Gated DeltaNet in Qwen3.6 requires much less VRAM for the KV cache than previous generations. Plus the KV cache is 8-bit. nicman23 7 hours ago is it in llama.cpp?
kgeist 7 hours ago Yep, Gated DeltaNet in Qwen3.6 requires much less VRAM for the KV cache than previous generations. Plus the KV cache is 8-bit. nicman23 7 hours ago is it in llama.cpp?
Yep, Gated DeltaNet in Qwen3.6 requires much less VRAM for the KV cache than previous generations. Plus the KV cache is 8-bit.
is it in llama.cpp?