Comment by vkazanov

1 month ago

It seems that proper vectorization requires a different kind of language, something similar to CUDA and the like, not a general-purpose scalar language.

I remember Intel had something like that, but it went nowhere.

That is ispc.

You don't want "vectorization", though. You want either

a) a code generation tool that generates exactly the platform-specific code you want and can't silently fail.

b) or, at least, a fundamentally vectorized language that does "scalarization" instead of the other way round.
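
To make (a) a bit more concrete, here is a toy sketch of what "can't silently fail" could look like. It is purely illustrative: the generator, its op table, and the names `emit_avx2_f32` and `UnsupportedOp` are made up for this example. Only the `_mm256_*` intrinsic names it emits are real AVX ones. The point is that an operation with no known lowering is a hard error instead of a quiet scalar fallback.

```python
# Hypothetical toy generator: maps a tiny op vocabulary to AVX f32 intrinsics.
# The table and function names are assumptions made up for illustration.
AVX2_F32_OPS = {
    "add": "_mm256_add_ps",
    "sub": "_mm256_sub_ps",
    "mul": "_mm256_mul_ps",
}


class UnsupportedOp(Exception):
    """Raised when the requested op has no vector lowering in the table."""


def emit_avx2_f32(op: str, dst: str, a: str, b: str) -> str:
    """Return one C statement computing dst[i..i+8) = a[i..i+8) <op> b[i..i+8)."""
    try:
        intrinsic = AVX2_F32_OPS[op]
    except KeyError:
        # Fail loudly: no scalar fallback is ever generated.
        raise UnsupportedOp(f"no AVX f32 lowering for {op!r}") from None
    return (
        f"_mm256_storeu_ps({dst} + i, "
        f"{intrinsic}(_mm256_loadu_ps({a} + i), _mm256_loadu_ps({b} + i)));"
    )
```

Calling `emit_avx2_f32("mul", "out", "x", "y")` gives you exactly one store/mul/load statement, while `emit_avx2_f32("mod", "out", "x", "y")` raises `UnsupportedOp`. With an auto-vectorizing compiler the second case would typically just compile to a scalar loop without telling you.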