Zaloni Zip: Solving the Challenges of Hybrid Data Lake Architecture

Posted by Parth Patel on Feb 14, 2017 3:47:07 PM

In this Zaloni Zip, we will discuss the challenges of a Hybrid Data-Lake architecture and how Zaloni’s centralized data-lake management platform tackles those challenges head-on. 

Read More

Topics: Big Data Ecosystem, Bedrock, Zaloni Zip, Data Lake, Data Management, Data Governance, Metadata Management

Zaloni Zip: Building a Modern Data Lake Architecture Pt. 2

Posted by Rajesh Nadipalli on Dec 13, 2016 11:27:48 AM

In the last video, we looked at the pain points of traditional data warehouse architecture and the high level architecture of the next generation Data Lake based on Hadoop.

In this video, I will discuss the key components you need to build a new architecture.

Read More

Topics: Hadoop, Big Data Ecosystem, Data Warehouse, Zaloni Zip, Data Lake, Data Management

Zaloni Zip: Using Transient Clusters and Keeping Your Metadata

Posted by Parth Patel on Dec 6, 2016 9:26:47 AM

As the name suggests, transient clusters are compute clusters that automatically shut down and stop billing when processing is finished. However, using this cost-effective approach has been an issue because metadata is automatically deleted by the cloud provider when a transient cluster is shut down.

This is noteworthy because metadata is the key to getting value from big data. Therefore, most enterprises have opted to pay for persistent compute across the board in order to maintain the metadata. How can enterprises leverage transient clusters for cost-savings and maintain their metadata? Read More

Topics: Hadoop, Big Data Ecosystem, Bedrock, Zaloni Zip, Data Management, Metadata Management

Zaloni Zip: Data Lineage

Posted by Adam Diaz on Dec 2, 2016 7:57:39 AM

Maintaining a lineage of data in your data lake is not just a “nice to have” feature. Many organizations from Finance to Healthcare face government regulations around not only privacy but also data lineage. In fact, “how” data traverses the data lake from ingestion through transformation is often a legal matter.

Read More

Topics: Hadoop, Big Data Ecosystem, Bedrock, Zaloni Zip

Zaloni Zip: Data Quality

Posted by Adam Diaz on Nov 22, 2016 10:23:52 AM

The term “data quality” refers to not only the properties make up good data vs. bad data but also what to do with that data after a decision has been made.

Read More

Topics: Hadoop, Big Data Ecosystem, Bedrock, Zaloni Zip

Zaloni Zip: Microservices

Posted by Aashish Majethia on Nov 15, 2016 8:29:30 AM

Microservices are a key enabler of large, scalable implementations making optimum use of hardware. Companies such as Netflix have pioneered their use and have shown the promise of what they can achieve. Netflix has used Microservices to support over 35% of all internet download traffic in North America.

Read More

Topics: Hadoop, Big Data Ecosystem, Bedrock, Zaloni Zip

Zaloni Zip: Managed Ingestion

Posted by Adam Diaz on Nov 8, 2016 1:43:33 PM

The classic method for data ingestion in Hadoop relies on a number of different technologies each with its own configuration and scaling issues. These technologies require expertise to correctly ingest the data and ensure the ingest meets the SLAs of the organization.

Read More

Topics: Hadoop, Data Lifecycle Management, Zaloni Zip

Zaloni Zip: Data Warehouse Architecture

Posted by Rajesh Nadipalli on Nov 2, 2016 11:53:26 AM

In the latest Zaloni Zip, Raj Nadipalli discusses how to modernize your data warehouse (DW) architecture. He specifically addresses the traditional DW architecture, pain points of DW architecture and the modern data lake architecture.

Read More

Topics: Hadoop, Big Data Ecosystem, Data Warehouse, Zaloni Zip

Zaloni Zip: A Breakdown of Data Lifecycle Management

Posted by Parth Patel on Oct 17, 2016 2:41:00 PM

Data Lifecycle Management optimizes utilization of HDFS by leveraging the tiered storage solution provided by Hadoop. You can optimize big data storage based on the frequency of data usage, thereby reducing the cost in an effective manner. By implementing tiered storage, data files that are not used frequently, are stored in nodes with higher density, low compute power, and low cost.

Read More

Topics: Big Data Ecosystem, Bedrock, Data Lifecycle Management, Zaloni Zip