Comment by 7e

3 months ago

Wow, no, not at all. It’s better to have a set of smaller, faster cliques connected by a slow network than a slower-than-clique flat network that connects everything. The cliques connected by a slow DCN can scale to arbitrary size. Even Google has had to resort to that for its biggest clusters.

1 comment

markhahn 3 months ago

Is this claim based on observed comm patterns in some particular AI architecture?