Comment by throwaway314155 21 days ago PyTorch MPS is about 10x faster per the README.md. 2 comments throwaway314155 Reply antirez 21 days ago I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow. Numerlor 21 days ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
antirez 21 days ago I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow. Numerlor 21 days ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
Numerlor 21 days ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow.
Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?