← Back to context

Comment by kingstnap

2 days ago

Because they have unrealistic targets so they make up fake uptime numbers. 99.999% would mean not even having an hour of downtime in 10 years.

I remember reddit being down for like a whole day or so and they claimed 99.5% in that month.

Ma Bell hit that decently often.

  • Is that even knowable? Like, I know they called it “The Astonishing, Unfailing, Bell System” but if they had an outage somewhere did they actually have an infrastructure of “canary phones” and such to tell in real time? (As in, they’d know even if service was restored in an hour)

    Not trying to snark, I legit got nerdsniped by this comment.

    • They absolutely did. Note that the reliability estimates exclude the last mine because trees falling and the like but they had a lot of self repair, reporting, and management facilities.

      Engineering and Operations in the Bell System is pretty great for this.

  • Running a much simpler system with much more independent nodes.

    It's a lot easier to keep packets flowing than to keep non-self-contained servers serving.