Comment by ashdnazg
19 hours ago
I'm porting my code[1] for finding binary & decimal palindromes[0] to CUDA, which I had no experience with before starting this project.
It already works and is slightly faster than the CPU version, but that's far from an acceptable result. The occupancy (a term I first learned this week) is currently at a disappointing 50%, so there's a clear target for optimisation.
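For anyone else new to the term: theoretical occupancy is roughly the number of warps that can be resident on one streaming multiprocessor (SM) divided by the maximum the SM supports. Here is a rough sketch of that arithmetic in plain C++. The struct names, the function name and every number in the usage note are made up for illustration and are not taken from my kernel or GPU, and the sketch ignores register allocation granularity, so treat it as an approximation only.

```cpp
#include <algorithm>

// Hypothetical per-SM hardware limits (illustrative, not a specific GPU).
struct SmLimits {
    int max_warps;         // maximum resident warps per SM
    int max_blocks;        // maximum resident blocks per SM
    int registers;         // 32-bit registers available per SM
    int shared_mem_bytes;  // shared memory available per SM
};

// Hypothetical resource usage of one kernel launch configuration.
struct KernelUsage {
    int threads_per_block;
    int registers_per_thread;   // assumed to be > 0
    int shared_mem_per_block;   // bytes, may be 0
};

// Simplified theoretical occupancy: resident warps / max warps per SM.
// Assumption: allocation granularity of registers is ignored.
double theoretical_occupancy(const SmLimits& sm, const KernelUsage& k) {
    const int warp_size = 32;
    const int warps_per_block = (k.threads_per_block + warp_size - 1) / warp_size;

    // How many blocks fit on one SM under each separate limit.
    const int by_warps = sm.max_warps / warps_per_block;
    const int by_regs =
        sm.registers / (k.registers_per_thread * warps_per_block * warp_size);
    const int by_smem = k.shared_mem_per_block > 0
                            ? sm.shared_mem_bytes / k.shared_mem_per_block
                            : sm.max_blocks;

    // The tightest limit decides how many blocks are actually resident.
    const int blocks = std::min({sm.max_blocks, by_warps, by_regs, by_smem});
    return static_cast<double>(blocks * warps_per_block) / sm.max_warps;
}
```

With the made-up limits `SmLimits{64, 32, 65536, 65536}`, a kernel launched with 256 threads per block and 64 registers per thread comes out at 0.5, because registers cap the SM at 4 blocks (32 warps out of 64). Dropping to 32 registers per thread gives 1.0. In real code the CUDA runtime can report the resident block count directly via `cudaOccupancyMaxActiveBlocksPerMultiprocessor`.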
Once I'm satisfied with how the code runs on my modest GPU at home, the plan is to use some online GPU renting service to make it go brrrrrrrrrr and see how many new elements I can find in the series.
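For context, the check at the heart of the search can be sketched like this, assuming the series is the numbers that read the same forwards and backwards in both base 2 and base 10. This is a naive CPU-side illustration with hypothetical function names, not the actual code from [1], which would need a much smarter way to generate candidates.

```cpp
#include <cstdint>

// True if n reads the same forwards and backwards in the given base (base >= 2).
inline bool is_palindrome_in_base(std::uint64_t n, unsigned base) {
    unsigned digits[64] = {};  // a 64-bit value has at most 64 digits in base 2
    int count = 0;

    // Extract digits, least significant first.
    do {
        digits[count++] = static_cast<unsigned>(n % base);
        n /= base;
    } while (n > 0);

    // Compare digits from both ends towards the middle.
    for (int i = 0, j = count - 1; i < j; ++i, --j) {
        if (digits[i] != digits[j]) {
            return false;
        }
    }
    return true;
}

// True if n is a palindrome in both decimal and binary.
inline bool is_binary_decimal_palindrome(std::uint64_t n) {
    return is_palindrome_in_base(n, 10) && is_palindrome_in_base(n, 2);
}
```

For example, 585 qualifies because it is 1001001001 in binary, while 11 does not because its binary form 1011 is not symmetric. Comparing stored digits rather than building a reversed integer avoids overflow for values near the top of the 64-bit range.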