How Big is Big Data?
Before diving into the insights in this report, it’s important to first define big data. BIG DATA is a common term for large amounts of data. To be qualified as big data, data must be coming into the system at a high velocity, with large variation, or at high volumes. The business case for adopting Big Data is to ultimately turn data into insight. Examples of using big data span all industries, whether combining buying habits with location data to push individually-tailored offers in the retail sector, or using predictive analytics in manufacturing plants to optimize plant maintenance.
And the reason to adopt new data technologies is more complex than whether or not data volume, velocity, variety physically requires a specialized algorithm or tool. One interesting finding is that when respondents were asked whether they were “planning to adopt new ‘Big Data’ technologies in the next six months:”
- 37% of respondents reported that they are planning to do so.
- Of the 37% (426 respondents out of 1154), the results show that, in spite of the explosion of non-batch tools and frameworks, Hadoop is still the go-to Big Data technology for new adopters.
What’s also worth noting here is that three basic, NoSQL storage models are highlighted in this “top seven” list. Document-oriented MongoDB, column-(family-)oriented Cassandra, and graph Neo4jwas the only particular graph DBMS mentioned more than once, with none clearly dominating.
As momentum of enterprise adoption builds and these distributed databases gain more traction, it’s important to understand that these databases while very useful in processing large volumes of data at a high velocity, can benefit from a backup and recovery solution purpose-built for the cloud era. That is, as these organizations adopt next-generation non-relational databases, they also need an ecosystem of data management products to protect the data and extract value from that data.
And the reason to protect data are as follows:
- First, it’s to minimize application downtime in the event of data loss due to hardware failures or human error. Human errors occur all the time, such as fat-fingers. In today’s ‘always-on’ world, customers want instant access and application downtime can be detrimental, leading to loss of customers, loss of brand, and ultimately loss of revenue.
- Second, there are compliance requirements in certain verticals that require organizations to retain and be able to recover data over its lifetime.
The challenge is that non-relational databases that support these next-generation applications (Analytics, IoT, SaaS and more) lack enterprise-class data protection solutions. And it’s this gap that’s putting enterprises at the risk of data loss and furthermore limiting their adoption of this new infrastructure. Enterprises will not be able to onboard their mission critical applications on distributed databases unless this critical gap is addressed.
That is why we are hard at work working with customers to protect their next-generation applications and cloud databases to deliver reliable recovery at scale. For the full report, please click here.