Comment by Dylan16807 3 months ago Depends on what you're doing. I'm pretty sure the bandwidth for inference isn't much. 1 comment Dylan16807 Reply eurekin 3 months ago Depends, if it's tensor parallel or pipeline parallel. Only PP doesn't pass too much. TP does
eurekin 3 months ago Depends, if it's tensor parallel or pipeline parallel. Only PP doesn't pass too much. TP does
Depends, if it's tensor parallel or pipeline parallel. Only PP doesn't pass too much. TP does