Comment by yid
9 years ago
> If it's "machines" plural, than you can do replication between the two.
This is the start of a scaling path that winds down Distributed Systems Avenue, and eventually leads to a place called Hadoop.
(Replication and consensus are remarkably difficult problems that Hadoop solves).
Fair 'nuff, but if you don't distribute compute, and you store the dataset locally on all the systems (not necessarily the results of mutations, just the datasets and the work that's being done), you'll still possibly reap massive perf gains over Hadoop in certain contexts.
> you'll still possibly reap massive perf gains over Hadoop in certain contexts.
Certainly, and unfortunately, the exact point at which Hadoop becomes the better option over big iron is generally an ongoing debate and shifting target. But there's no doubt that such a point actually exists.
...I'm not sure if it does. Bain's Law still stands.
But if it does, then it's a pretty big chunk of data, and a very fast network.
2 replies →