Comment by spease
10 hours ago
There was a library for Rust called “faster” which worked similarly to Rayon, but for SIMD.
The simpleminded way to do what you’re saying would be to have the compiler create separate PTX and native versions of a Rayon structure, and then choose which to invoke at runtime.
Why past tense? I would use that if it truly acted like Rayon! I.e minimal friction.