Data Loss Examples

Some things are hard to talk about. Some customers have vendor contracts that prohibit them from discussing data loss experiences, and others stay quiet for fear of embarrassing their company. Even so, there are many well-documented cases of corruption and data loss in HPC, mission-critical, and Big Data systems.

CERN, one of the world's leading physics laboratories, studied the magnitude of data loss in its own systems. Its accelerators smash particles together, and the experiments collect many TBs of data per second. A tremendous amount of analysis has to be done on that data, and done correctly. If even minuscule data corruption occurs, the accuracy of the findings and research is in jeopardy. See the CERN report and the industry analysis of it.

In another documented case study, CERN examined the problem of Silent Data Corruption, and the industry discussed it more broadly in the report "CERN: Data Corruption is worse than you know." The study concluded that Silent Data Corruption is a fact of life, that detection is critical, and that elimination seems impossible. Big Parity was invented and developed to solve exactly this problem. Read more about Big Parity's ability to solve the Silent Data Corruption problem.
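To make "detection is critical" concrete, here is a minimal, generic sketch of checksum-based detection: a digest is recorded when data is written and recomputed when it is read back. This is an illustration only and does not describe how Big Parity or CERN's systems work; the function names and the sample record are hypothetical.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Return a SHA-256 digest of the data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def is_intact(data: bytes, expected_digest: str) -> bool:
    """Recompute the digest on read and compare it with the stored one."""
    return checksum(data) == expected_digest


# Hypothetical record; the digest is stored alongside it at write time.
original = b"detector event record 0001"
stored_digest = checksum(original)

# Simulate silent corruption: flip a single bit without any I/O error.
corrupted = bytearray(original)
corrupted[3] ^= 0x01
corrupted = bytes(corrupted)

print(is_intact(original, stored_digest))
print(is_intact(corrupted, stored_digest))
```

A checksum like this only detects corruption. Repairing the data requires redundancy, such as a replica or parity information.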

The most cited study on potential data loss is by Bianca Schroeder and Garth Gibson of the Computer Science Department at Carnegie Mellon University. In their report, "Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?", they show that annual disk replacement rates measured in the field are commonly 2-4%, several times higher than the rates implied by manufacturers' data sheets, and as high as 13% on some systems. Read the report here.
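As a rough illustration of what a data sheet MTTF implies, the sketch below converts an MTTF into a nominal annual failure rate using the simple approximation of hours per year divided by MTTF. It then compares expected yearly replacements at the data sheet rate and at a field rate. The fleet size of 10,000 disks and the 3% field rate are assumed values chosen for illustration, not figures taken from the report.

```python
HOURS_PER_YEAR = 24 * 365  # 8,760 hours


def nominal_afr(mttf_hours: float) -> float:
    """Approximate annual failure rate implied by a data sheet MTTF."""
    return HOURS_PER_YEAR / mttf_hours


def expected_replacements(num_disks: int, annual_rate: float) -> float:
    """Expected number of disk replacements per year for a fleet."""
    return num_disks * annual_rate


datasheet_rate = nominal_afr(1_000_000)  # roughly 0.88% per year
field_rate = 0.03                        # assumed field rate for illustration
fleet_size = 10_000                      # assumed fleet size for illustration

print(f"Data sheet AFR: {datasheet_rate:.2%}")
print(f"Expected replacements at data sheet rate: "
      f"{expected_replacements(fleet_size, datasheet_rate):.0f}")
print(f"Expected replacements at field rate: "
      f"{expected_replacements(fleet_size, field_rate):.0f}")
```

Under these assumptions, the same fleet would see several times as many replacements per year as the data sheet suggests.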

NetApp supplied its service data to academic researchers to better understand the sources of data loss and silent data corruption. The resulting report reveals the prevalence and frequency of data loss and the patterns it exhibits. Read the report here.

Google has a massive amount of storage, and it needs that storage to be not only highly available but also free from corruption and errors. Google studied its storage and reported to the industry the types of corruption it found. Read the report here. Read the industry discussion.

