Comment by dragonwriter
3 days ago
> Even a 5090 can't handle that. You have to use multiple GPUs.
It takes about 40 GB with the fp8 version fully loaded, but with enough system RAM available, ComfyUI can partially load models into VRAM during inference and swap as needed (at reduced speed), letting it run on systems with too little VRAM to hold the full model. The NVidia page linked in the BFL announcement specifically highlights NVidia working with ComfyUI to improve this existing capability precisely to enable Flux.2.
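The partial-loading idea can be sketched roughly like this (a hypothetical toy model, not ComfyUI's actual code): keep the full set of weights in system RAM, and swap layers into a limited VRAM budget as the forward pass touches them, evicting the least-recently-used layers when space runs out.

```python
# Toy sketch of layer swapping under a VRAM budget. All names and sizes
# are illustrative assumptions, not ComfyUI internals.
from collections import OrderedDict

class LayerSwapper:
    def __init__(self, layer_sizes_gb, vram_budget_gb):
        self.sizes = layer_sizes_gb       # layer name -> size in GB (resident in RAM)
        self.budget = vram_budget_gb
        self.vram = OrderedDict()         # layers currently in VRAM, in LRU order
        self.swaps = 0                    # count of host -> device transfers

    def ensure_loaded(self, name):
        if name in self.vram:
            self.vram.move_to_end(name)   # already resident: refresh LRU position
            return
        # Evict least-recently-used layers until the new one fits the budget.
        while sum(self.vram.values()) + self.sizes[name] > self.budget:
            self.vram.popitem(last=False)
        self.vram[name] = self.sizes[name]
        self.swaps += 1

    def run_inference(self, layer_order):
        for name in layer_order:          # a forward pass touches layers in order
            self.ensure_loaded(name)

# A ~40 GB model split into 8 x 5 GB blocks, on a card with ~24 GB free:
model = {f"block{i}": 5 for i in range(8)}
s = LayerSwapper(model, vram_budget_gb=24)
s.run_inference(list(model))
```

Every layer gets transferred at least once per pass when the model exceeds the budget, which is why the approach trades speed (extra PCIe traffic) for the ability to run at all.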