- What does Cisco have against Quebec?
- Attrition.org nails another nitwit
- Diary of a deliberately spammed housewife
- Seven cloud-computing security risks
- 20 great Windows open source projects
News | Newsletters | Podcasts | Chats | Opinions | RSS Feeds | This Week In Print | IT Careers | Community | Reports | Downloads | Slideshows | New Data Center
Partner Sites:App Performance | On Demand Security | Networking Solution | SOA | Value of WDS
I recently had a junior engineer in our lab ask me about the term "five nines." I launched into my canned script about the term representing 99.999% uptime, translating into a 5-minute, 15-second downtime budget. Still with furrowed brow, he peppered me with more questions. "Is that 5 minutes, 15 seconds in a year or over the life of the box?" "How do you define 'downtime?'" "Does that include software patches?"
The kid caught me flat-footed, and I realized I was passing along "spoon-fed" information about an important subject. I needed to educate myself on what five nines really means.
Five nines is not a hard metric, but rather the result of a predictive calculation. When a company claims that its device is five-nines reliable, it is talking about an absurdly complicated mathematical calculation based on industry-standard formulas used to predict the reliability of the box. For every possible definition of "failure" - ranging from a hint of trouble to a total meltdown - these formulas take into account the extent of the failures, the probability with which they will occur, how quickly the failures can be diagnosed, and how soon service can be restored.
Five-nines discussions blur the line between availability and reliability. A five-nines claim could be referring to either availability or reliability, depending upon which predictive formula is used. It's important to understand the difference between the two ways vendors can spin these terms.
For any given product, availability equals the total amount of time the product was up. Reliability means the number of instances in which the product went down. So you can have one big outage, and the box will reflect high reliability, but low availability. Or you could have two dozen outages of 5 seconds or less, and the box could be accurately described as being highly available, but unreliable.
Confusing, I know. But that's the point. When a vendor or marketeer says "five nines," they're probably using it as a catchall phrase that is probably devoid of any real meaning.
If you invoked the "spirit" of the law, very few products would stand the test. One of the originators of the concept of five nines is Telcordia, formerly Bellcore, which geared its specifications toward Local Access and Transport Area office and tandem switching systems. When I engaged in the ascetic exercise of reading the core specification document, I found out that "five nines" - however you define it - is entirely too forgiving. For example, as it relates to availability, Telcordia's allowable downtime budget for an entire system failure, in which all end users are down, is 24 seconds per year. So much for 5 minutes, 15 seconds.
3com 5500g is really very fast! - Anonymous
Partner Content
Simplify Your Branch Infrastructure
Learn how to simplify your branch infrastructure while dramatically increasing app performance with Citrix Branch Repeater.
Download the Free Info Kit
Next-Gen Load Balancing
Free Guide: “Next Gen Load Balancing: 8 Things You Need to Handle Today’s Network Traffic” shows you the functionality needed in your next load balancer.
Download the Free Guide
Accelerate Your Web Apps by up to 5x
Free Guide: “The Secret to Getting Maximum Speed from your Web Applications.” Learn how you can deliver Web apps up to 5x faster.
Download the Free Guide
Comment