Hadoop – What is DistCp (Distributed copy) in Hadoop

What is DistCp in Hadoop

Design Structure of DistCp (Distributed copy): DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. It uses MapReduce for its distribution, error handling and recovery, and reporting. It expands a list………

Continue reading...

Hadoop – What is Native Hadoop Library

NATIVE HADOOP LIBRARY

Native Hadoop Library: Because Java implementations are not available for certain components, Hadoop provides native implementations of them. The library that contains these components is called the Native Hadoop Library. This library On the………

Continue reading...

Hadoop Archives Guide – How to Create an Archive in Hadoop

LEARN HADOOP archive commands

Hadoop Archives Guide: Hadoop archives are special-format archives. A Hadoop archive maps to a file system directory. A Hadoop archive always has a *.har extension. A Hadoop archive directory………

Continue reading...

Hadoop – How to Set Up the Hadoop Environment

hadoop-environment-setup

Hadoop is supported on the GNU/Linux platform and its flavors. This guide from tipcircle.com shows how to set up the Hadoop environment. Pre-installation Setup: Before installing Hadoop into the Linux environment, we need to set up Linux………

Continue reading...

Apache Hadoop: What is high performance big data analytics

hadoop big data

What is high performance big data analytics? Big data means extremely large datasets that are hard to handle using traditional computing techniques. Big data is not merely………

Continue reading...