Lightning talk hadoop
11
LEANDRO DA COSTA GONÇALVES
-
Upload
leandro-costa -
Category
Technology
-
view
40 -
download
2
description
Transcript of Lightning talk hadoop
LEANDRO DA COSTA GONÇALVES
INTRODUTION Framework for distributed computing; Used in clusters/grids; Thousands of nodes; Common hardware; Petabytes of data; Open Source (Apache license); Java; Originally inspired by Google's MapReduce and GFS.
WHO USES HADOOP?
FACEBOOK; GOOGLE; YAHOO; STOCK EXCHANGE FROM NEW YORK; CERN.
PRINCIPALS COMPONENTS
Auto-recoveryHigh bandwidth consuption
ClusteringHighly tolerant of failures
Fault ToleranceDistributed processing
LIFE CYCLE OF MAPREDUCE
MAPREDUCE OF WORD COUNT
MAPPER
REDUCER
THE MAIN
THE HADOOP ECOSYSTEM
CONCLUSION
BigDataThe foundations Apache and Spring
SourceGoogle and FacebookSocial networks