A critical failure occurs when something like a CPU or system board fails. Critical server failures result in the loss of access to applications or data, a significant issue that impacts overall business productivity.
A non-critical failure occurs when a component like a disk drive or power supply fails. Modern data center equipment has built-in redundancies for these components, so there is no data loss.
Contrary to what the Bathtub Curve portrays, the sample data above shows failure rates don’t drastically increase. By combining reliability and longevity data, we can better understand how equipment performs over time.
Testing the myth
At Service Express, we’ve collected over 15 years of equipment data from over half a million devices to understand equipment longevity and reliability better.
Our previous article only studied equipment longevity and how it stacks up against the Bathtub Curve. In the past two years, we’ve implemented real-time reliability studies that allow for previously unseen granularity. But what’s the difference between longevity and reliability?
Longevity reporting details a product’s expected failure rate over time. This can be useful for CapEx budget planning purposes. However, beyond equipment longevity, reliability plays a critical role. Reliability studies examine how equipment has performed over time. This practice is useful in identifying outliers within a product model number or family.
Below is an example of equipment and its reliability. This graph utilizes data sets from over 500,000 pieces of equipment under agreement for over 6,000 customers and compares performance using 55,000 average annual service calls.