
Comment by winwang

1 year ago

Having talked to many engineers using distributed compute today, I get the impression they think (single-node) parallel compute hasn't changed much since ~2010 or so.

It's quite frustrating, and exacerbated by frequent intro-level CUDA blog posts which often just repeat what they've read.

re: raytracing, this might be crazy, but do you think we could use RT cores to accelerate control flow on the GPU? That would be hilarious!

RT cores? No. Too primitive and specific.

But there seems to be something generalizable in the raytracing software ecosystem. I dunno how much the software / hardware needs to advance here, but we are at the point where Intel RT cores are passing stack pointers / instruction pointers between shaders (!!!). Yes, through specialist hardware, but surely this can be generalized to something awesome in the future?

------

For now, I'm happy with stream expansion / stream compaction and looping over consume buffers and producer/append buffers.
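
For anyone unfamiliar with the pattern, here is a rough CPU-side sketch in Python of what that loop looks like. It is only an illustration: the names `compact`, `expand`, `run`, `children`, and `is_live` are made up for this example, and on a real GPU each `for` loop would be a parallel kernel with the prefix sum done by a scan primitive.

```python
from itertools import accumulate

def compact(items, keep):
    """Stream compaction: drop dead items, pack survivors densely, keep order."""
    flags = [1 if keep(x) else 0 for x in items]
    # Exclusive prefix sum: each survivor's slot in the output buffer.
    offsets = list(accumulate(flags, initial=0))[:-1]
    out = [None] * sum(flags)
    for x, f, o in zip(items, flags, offsets):
        if f:
            out[o] = x  # scatter (one thread per item on a GPU)
    return out

def expand(items, children):
    """Stream expansion: each item emits a variable number of outputs."""
    produced = [children(x) for x in items]
    # Exclusive prefix sum of output counts: where each item starts writing.
    offsets = list(accumulate((len(p) for p in produced), initial=0))
    out = [None] * offsets[-1]
    for p, o in zip(produced, offsets):
        for i, c in enumerate(p):
            out[o + i] = c
    return out

def run(roots, children, is_live):
    """Loop until the consume buffer is empty, swapping buffers each pass."""
    consume = list(roots)
    visited = []
    while consume:
        visited.extend(consume)
        append = expand(consume, children)   # producer/append buffer
        consume = compact(append, is_live)   # becomes next pass's consume buffer
    return visited

# Hypothetical workload: walk an implicit binary tree, cutting off at node 7.
order = run([0], lambda n: [2 * n + 1, 2 * n + 2], lambda n: n < 7)
```

The point of the sketch is that data-dependent control flow ("does this item spawn more work?") turns into buffer sizes instead of divergent branches, so every pass runs over a dense array.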