Comment by cannoneyed

18 days ago

Sorry about the hug of death - while I spent an embarrassing amount of money on rented H100s, I couldn't be bothered to spend $5 on Cloudflare Workers... Hope you all enjoy it; it should be back up now.

3 comments

Octoth0rpe 18 days ago

> while I spent an embarrassing amount of money on rented H100s

Would you mind sharing a ballpark estimate?

KolmogorovComp 18 days ago

(Not OP) Judging by the clues in the article and some trial and error, probably ~1,000 USD for the project.

storystarling 17 days ago

Curious if the H100s were strictly for VRAM capacity? Since this is a batch job and latency doesn't really matter, it seems like you could have run this on 4090s for a fraction of the cost. Unless the model was too large to fit in a 4090's VRAM, the price-to-performance ratio on consumer hardware is usually much better for this kind of workload.
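
To make that comparison concrete, here is a rough Python sketch of the arithmetic. The VRAM sizes are the published ones (80 GB for an H100, 24 GB for a 4090), but the hourly prices and throughput figures are made-up placeholders, not measurements or quotes, and the names `Gpu`, `cost_per_item`, and `cheapest_that_fits` are hypothetical helpers for illustration only.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Gpu:
    name: str
    vram_gb: float         # memory available for the model
    usd_per_hour: float    # PLACEHOLDER rental price, not a real quote
    items_per_hour: float  # PLACEHOLDER batch throughput, not a benchmark


def cost_per_item(gpu: Gpu) -> float:
    """Dollars spent per generated item; latency is ignored for a batch job."""
    return gpu.usd_per_hour / gpu.items_per_hour


def cheapest_that_fits(gpus: Sequence[Gpu], model_vram_gb: float) -> Optional[Gpu]:
    """Pick the lowest cost-per-item GPU whose VRAM can hold the model."""
    candidates = [g for g in gpus if g.vram_gb >= model_vram_gb]
    if not candidates:
        return None  # the model does not fit on any single card listed
    return min(candidates, key=cost_per_item)


# Hypothetical numbers purely for illustration.
h100 = Gpu("H100", vram_gb=80, usd_per_hour=2.0, items_per_hour=1000)
rtx4090 = Gpu("RTX 4090", vram_gb=24, usd_per_hour=0.5, items_per_hour=400)

if __name__ == "__main__":
    for size_gb in (20, 40):
        best = cheapest_that_fits([h100, rtx4090], model_vram_gb=size_gb)
        print(f"{size_gb} GB model: {best.name if best else 'nothing fits'}")
```

With these placeholder inputs the 4090 wins whenever the model fits in 24 GB, and the H100 only wins once VRAM becomes the constraint, which is the question being asked here. Real prices and throughput would need to be plugged in to say anything about the actual project.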