← Back to context

Comment by tapirl

1 day ago

Many thanks for providing the source. It also works on my machine.

TIL.

I tried the others on my x86 machine and they all do something for me - not nearly as much as the article, but something.

  • The "_ [0]byte" trick has no base in my knowledge. For the author's specified example, [1024]float64 will be always allocated on one whole page, aka, always 64-byte aligned.

    For "Array of Structs vs Struct of Arrays", using slices as fields is a good idea. If the purpose is to make fields allocated on their respective memory block, just use pointers instead.

    • > The "_ [0]byte" trick has no base in my knowledge. For the author's specified example, [1024]float64 will be always allocated on one whole page, aka, always 64-byte aligned.

      You're right - I read the results I had wrong on that one. That one is slower, not faster, on both my M2 and on x86 machine.