The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
Big Data and Hadoop Ecosystem
-
Upload
canburak-tuemer -
Category
Data & Analytics
-
view
100 -
download
2
Transcript of Big Data and Hadoop Ecosystem
Big Data & Hadoop
EcosystemCanburak Tümer
• Ege University, BSc. Computer Engineering, ’07-’12• Libera Universitá di Bolzano, BSc. Computer Science,
’09-’10• İstanbul Technical University, MSc. Computer
Engineering, ’13-’16 (expected)• Turkcell Technology, ETL & DWH Developer, ’11-’12• Oracle, Consultant, ’12-’13• MAKEIT Software & Consulting, BI&DW Specialist ’14-...• www.canburaktumer.com/blog @canburakTblog
https://www.linkedin.com/in/canburaktumer
About MeCanburak Tümer
Agenda• Big Data• NoSQL• Hadoop• HDFS• MapReduce• Management Tools• Data Access Tools• Data Processing and Mining Tools
VOLUME VALUE
VARIETYVERIFICATION VELOCITY
- Open source big data platform- Started by developers from Yahoo!- Two main distributors now : Cloudera, Hortonworks- Both storage and processing
- HDFS for storage- MapReduce for processing- Spark engine is replacing MapReduce day by day
HDFS
Map Reduce
Managing Tools for Hadoop
Data Access Tools for Hadoop
Data Processing and Mining Tools