← Back to context Comment by qeternity 21 hours ago Yes, absolutely in deep learning. Custom fused CUDA kernels everywhere. 1 comment qeternity Reply Scene_Cast2 20 hours ago Yep. MoE, FlashAttention, or sparse retrieval architectures for example.
Yep. MoE, FlashAttention, or sparse retrieval architectures for example.