Comment by hashxyz
5 hours ago
Pretty sure this is just vectorization. You can pack several 8-bit ints into a machine-word-sized 32-bit int and add them together; that is vectorization.
> Pretty sure this is just vectorization. You can pack several 8-bit ints into a machine-word-sized 32-bit int and add them together; that is vectorization.
I don't think that's true when the add overflows. You wouldn't want a lane's overflow to carry into an adjacent lane.
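To make the carry problem concrete, here is a rough sketch in Python, with a 32-bit mask standing in for a machine word. The helper names (`pack`, `unpack`, `naive_add`, `swar_add`) are made up for illustration, and the lane-safe version is the usual mask-off-the-top-bits trick, assuming you want each 8-bit lane to wrap independently:

```python
MASK32 = 0xFFFFFFFF     # emulate a 32-bit machine word
HIGH_BITS = 0x80808080  # top bit of each 8-bit lane
LOW_BITS = 0x7F7F7F7F   # low 7 bits of each 8-bit lane


def pack(lanes):
    """Pack four 8-bit values into one 32-bit word, lane 0 in the low byte."""
    word = 0
    for i, value in enumerate(lanes):
        word |= (value & 0xFF) << (8 * i)
    return word


def unpack(word):
    """Split a 32-bit word back into four 8-bit lanes."""
    return [(word >> (8 * i)) & 0xFF for i in range(4)]


def naive_add(a, b):
    """Plain 32-bit add: a lane's overflow carries into the next lane."""
    return (a + b) & MASK32


def swar_add(a, b):
    """Lane-wise add: each 8-bit lane wraps on its own, no cross-lane carry."""
    # Add only the low 7 bits of each lane, so no sum can spill past bit 7.
    low = (a & LOW_BITS) + (b & LOW_BITS)
    # Fold the top bits back in with XOR, which adds them without a carry out.
    return (low ^ ((a ^ b) & HIGH_BITS)) & MASK32


a = pack([0xFF, 0x01, 0x00, 0x00])
b = pack([0x01, 0x01, 0x00, 0x00])
print(unpack(naive_add(a, b)))  # lane 0 overflows into lane 1
print(unpack(swar_add(a, b)))   # lane 0 wraps, lane 1 is unaffected
```

With the plain add, lane 1 comes out as 3 instead of 2 because lane 0's carry leaks into it; the masked version keeps the lanes independent. Real SIMD add instructions give you that lane isolation in hardware.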