Comment by ack_complete
6 months ago
This happens with partial autovectorization, too. Compiler fails to vectorize a first loop and then vectorizes the second, result is a store forwarding failure at the start of the second loop trying to read the output of the first loop, erasing the vectorization gains.
No comments yet
Contribute on Hacker News ↗