Browsed by
Category: BigData

Cloudera Distributed Hadoop(CDH) Installation and Configuration on Google Cloud Platform

Cloudera Distributed Hadoop(CDH) Installation and Configuration on Google Cloud Platform

CDH(Cloudera Distributed Hadoop) is Cloudera’s open source platform, is the most popular distribution of Hadoop and related projects in the world (with support available via a Cloudera Enterprise subscription) CDH integrates Hadoop with more than a dozen other critical open source projects. Cloudera has created a functionally advanced system that helps end-to-end Bigdata workflows. Hadoop Basics: The Hadoop platform was designed to solve problems where huge amount of data need to processed. It is for the situations where complex analytical…

Read More Read More

What Is Big Data

What Is Big Data

Big data is a term used to refer to data sets that are too large or complex for traditional data-processing application software to adequately deal with. Data with many cases offer greater statistical power, while data with higher complexity may lead to a higher false discovery rate.