What is Hadoop? 

Hadoop is an open-source platform developed under the Apache Software Foundation and written in the Java programming language. It is built for very large volumes of data and offers new ways of storing them that make managing big data cheap and efficient. Hadoop is not just a data store for archiving massive amounts of data; it also provides a wide variety of fast data processing methods. It has two main parts: a data processing framework (MapReduce) and a distributed file system (HDFS). Besides making it easier to work with big data, Hadoop is able to scale up to cluster...
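To make the idea of a data processing framework more concrete, here is a minimal sketch of the map, shuffle, and reduce steps that Hadoop's processing model is built around. It is plain single-machine Python, not Hadoop's actual API: the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical and chosen only for illustration. On a real cluster, the framework would run the map and reduce steps in parallel on many machines and read its input from the distributed file system.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data is big", "data is everywhere"]))
```

Because each line is mapped independently and each key is reduced independently, the work can in principle be split across many machines, which is the property that lets this style of processing scale.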

"Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis, and visualization. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as compared to separate smaller sets with the same total amount of data, allowing correlations to be found to "spot business trends, d...