Comment by sweetjuly
1 year ago
This would require specifying a cache line size in the ABI, which is a somewhat odd uarch detail to bubble up. While 64 bytes is conventional for large application processors and has been for a long time, I wouldn't want to make it a requirement.
It's definitely worth analyzing though.
For example, measure how big a block you need to get 90% of the compression benefit.
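A rough sketch of what that measurement could look like, assuming you have a representative sample of the data as bytes. zlib is only a stand-in compressor here, not the scheme under discussion, and the names `compressed_size` and `smallest_block_for` are made up for illustration. zlib's per-stream overhead will penalize very small blocks, so treat the numbers as a shape, not a result.

```python
import zlib


def compressed_size(data: bytes, block_size: int) -> int:
    """Total size when each block_size chunk is compressed independently."""
    total = 0
    for start in range(0, len(data), block_size):
        total += len(zlib.compress(data[start:start + block_size]))
    return total


def smallest_block_for(data: bytes, candidates, target: float = 0.90):
    """Smallest candidate block size that keeps `target` of the savings
    obtained by compressing the whole buffer as one block.

    Returns None if the data does not compress at all or no candidate
    reaches the target.
    """
    # Bytes saved with no block-size limit (the "full" benefit).
    whole_saved = len(data) - len(zlib.compress(data))
    if whole_saved <= 0:
        return None
    for block_size in sorted(candidates):
        saved = len(data) - compressed_size(data, block_size)
        if saved >= target * whole_saved:
            return block_size
    return None
```

Calling `smallest_block_for(sample, [64, 128, 256, 512, 1024])` on real samples would show whether a 64-byte block is anywhere near the knee of the curve, or whether the ABI would need to name something larger.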