Part One: Data Lake in the Cloud

Posted by Kannan Rajagopalan on Sep 26, 2016 9:11:55 AM

Read More

Topics: Hadoop Expert, Data Lake

Chlorine for your Data Swamp: Four Key Areas for Automation

Posted by Adam Diaz on Sep 22, 2016 3:10:38 PM

Maybe we’re talking more about algaecide and not chlorine, but microbiology aside, a data lake often gets rather cloudy and disorganized shortly after being opened for use. Hadoop’s promise of schema on read lures many in, but often ends up forcing a soul-searching reevaluation of one’s principles related to data management -- not to mention a new strategy (and cost) for cleaning up a swampy data lake.

Read More

Topics: Hadoop Expert, Big Data, Data Lake

Top Streaming Technologies for Data Lakes and Real-Time Data

Posted by Greg Wood on Sep 20, 2016 10:52:47 AM

More than ever, streaming technologies are at the forefront of the Hadoop ecosystem. As the prevalence and volume of real-time data continues to increase, the velocity of development and change in technology will likely do the same. However, as the number and complexity of streaming technologies grow, consumers of Hadoop must face an increasing number of choices with increasingly blurred delineation of functionality.

Read More

Topics: Hadoop Expert, Tech Insights, Data Lake

Bedrock DLM: Big Data Lifecycle Management for the Data Lake

Posted by Scott Gidley on Sep 14, 2016 1:59:49 PM

Apache knows there’s an urgent need for data lifecycle management for big data – and now offers Heterogeneous Storage for different storage types, as well as Hadoop Archive Storage with hot, warm, cold and other storage categories.

Read More

Topics: Big Data, Product Updates, Bedrock, Data Lake

Data Fracking: Going Deep into the Data Lake Using Drill

Posted by Greg Wood on Sep 14, 2016 10:25:07 AM

Your data lake is finally live. After months and months of planning, designing, tinkering, configuring and reconfiguring, your company is ready to see the fruits of your labor. There’s just one issue: the quarter close is coming up, and data analysts are asking for their functionality yesterday, not next week. That means there’s no time to go through the motions of setting up workflows, rewriting queries to function on Hive or HBase, and working through the kinks of a new architecture. The data lake may be the best, most flexible, and most scalable architecture available, but there is one thing it is not: quick to deploy. How can all of your hard-won socialization and hype for the data lake be saved? Enter Apache Drill.

Read More

Topics: Big Data, Data Lake, apache drill

Zaloni, NetApp Partner for Data Lifecycle Management Solution

Posted by Scott Gidley, VP of Product on Sep 8, 2016 4:15:02 PM

Right-size the enterprise data lake with policy-driven, highly scalable and cost-effective data lifecycle management and cloud tiering.

Today we announced a new solution developed in partnership with NetApp: Zaloni Bedrock DLM and NetApp StorageGRID. The solution addresses the growing need for big data lifecycle management as more enterprises deploy data lakes for managing the growing variety and volume of data, including mobile, cloud-based apps and Internet of Things (IoT) data.

Read More

Topics: Product Management, Data Lake

Open Source: How Open Is It?

Posted by Adam Diaz on Sep 1, 2016 3:54:17 PM

This is the second in a multi-part series of blogs discussing Hadoop distribution differences to help enterprises focus on the important factors involved in choosing a Hadoop distributionwithout hype or marketing spin. Zaloni has a long history of helping companies gain tangible business value from Hadoop, regardless of the distribtion. 

Read More

Topics: Hadoop Expert, Big Data, MapReduce

How to Choose a Hadoop Distribution: Understanding Versioning

Posted by Adam Diaz on Aug 24, 2016 9:11:06 AM

This is the first in a multi-part series of blogs discussing Hadoop distribution differences to help enterprises focus on the important factors involved in choosing a Hadoop distribution—without hype or marketing spin. It was written in collaboration with Gregory Wood. Zaloni has a long history of helping companies gain tangible business value from Hadoop, regardless of the distribution. 

Read More

Topics: Hadoop Expert, Tech Insights, MapReduce

Kafka in action: 7 steps to real-time streaming from RDBMS to Hadoop

Posted by Rajesh Nadipalli on Aug 23, 2016 10:25:42 AM

For enterprises looking for ways to more quickly ingest data into their Hadoop data lakes, Kafka is a great option. What is Kafka? Kafka is a distributed, scalable and reliable messaging system that integrates applications/data streams using a publish-subscribe model. It is a key component in the Hadoop technology stack to support real-time data analytics or monetization of Internet of Things (IOT) data. 

Read More

Topics: Hadoop Expert, Big Data, Data Lake, kafka,

So, you want to be a tech visionary? An executive guide to data lakes

Posted by Greg Wood on Aug 19, 2016 11:47:54 AM

You’ve heard it time and time again: cloud is the future; those who don’t adopt modern big data practices will fall behind the pack; the next wave of IT disruption is right around the corner. And yet, at the same time, budgets are shrinking, demand is growing and pressure on the IT organization to show value is at an all-time high. As an executive, you have the full force of your business behind you and more options than ever to achieve both short- and long-term goals with business data. So many options, in fact, that the landscape has become a confusing, often contradictory mess of competing solutions.

Read More

Topics: Hadoop Expert, Tech Insights, Data Lake, EDW