← Back to context

Comment by gpderetta

8 hours ago

Well, HTM was not useful per se, except accelerating an STM implementation.

It isn't very useful for that, but you can use it to implement other higher-level concurrency primitives like multi-word compare and swap efficiently.

  • True. At some point in the now distant past, AMD had a proposal for a very restricted form of HTM that allowed CAS up to 7 memory locations as they had some very specific linked list algorithms that they wanted optimize and the 7 location restrictions worked well with the number of ways of their memory.

    Nothing came out of it unfortunately.

    • I'd like to see what kind of hardware acceleration would help STMs without imposing severe limitations on their generality.

      To me, the appealing things about STMs are the possibility of separating concerns of worst-case execution time and error handling, which are normally pervasive concerns that defeat modularity, from the majority of the system's code. I know this is not the mainstream view, which is mostly about manycore performance.

      2 replies →