Big Parity® from StreamScale leads the industry in RAID reliability and processing speed, eliminating data loss and corruption due to disk failure, service errors, silent data corruption, and unrecoverable read errors. 30X faster than competitive RAID systems, Big Parity is the only product that provides advanced Verified Erasure Coding® which enable up to 127 parity drives to keep data safe and accurate during long data analysis cycles or rebuilds.

Data Loss Examples

Some things are hard to talk about. For example, some customers have vendor contracts that don’t let them talk about data loss experiences. Some people might not talk about data loss for fear of embarrassing their company. Even so, there are many well documented cases of corruption and data loss in HPC, mission critical and Big Data systems.

CERN, the leading Physics lab in the world studied the magnitude of data loss in their labs. They smash atoms in a linear accelerator collecting many TBs of data per second. There is a tremendous amount of analysis that needs to be done and done correctly. If even a miniscule data corruption occurs, the accuracy of the findings and research is in jeopardy. CERN Analysis by the industry. CERN Report.

In another documented case study, CERN studied the problem of Silent Data Corruption and the industry discussed it more broadly in this report CERN: Data Corruption is worse than you know. The conclusions of the study are Silent Data corruption is a fact of life, detection is critical, and elimination seems impossible.  Big Parity was invented and developed to solve exactly this problem.  Read more about Big Parity’s ability to solve the Silent Data Corruption problem.

The most cited study as it relates to potential data loss is by Bianca Scroeder and Garth Gibson from the Computer Science Department of Carnegie Mellon University. In their report Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? They detail the measured disk failures in the field to be over 2-4 times as high as manufacturers’ data sheet rates and as high as 13%. Read report here

NetApp supplied their service data to academic researchers to better understand the sources of data loss and silent data corruption.  Their report was able to reveal the prevalence and frequency of data loss and the patterns it exhibits. Read the report here.

Google has a massive amount of storage and they not only need to keep it highly available they need it free from corruption and errors.  They studied their storage and reported to industry the types of corruption they found Read the report here.   Read the industry discussion.

Big Parity with Verified Erasure Coding® can protect you from data loss and keep your HPC, mission critical or Big Data system online and 100% correct. Contact StreamScale now to see how to add Big Parity to your Storage system.

Go to our forum discussion and tell us your data loss or corruption story. 

Number of parity drives is the weak link in RAID storage