← Back to context Comment by throwaway314155 1 month ago PyTorch MPS is about 10x faster per the README.md. 2 comments throwaway314155 Reply antirez 1 month ago I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow. Numerlor 1 month ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
antirez 1 month ago I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow. Numerlor 1 month ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
Numerlor 1 month ago Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?
I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow.
Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?