Esri UC2013 . Technical Workshop .
Technical Workshop
2013 Esri International User Conference July 8–12, 2013 | San Diego, California
Big Data Spatial Analytics – An Introduction
Marwa Mabrouk Mansour Raad
Agenda
• What is Big Data?
• What is Hadoop?
• How does Spatial integrate with Big Data and Hadoop?
• How do I get started?
New Challenges for Organizations
• Better decision making
• Intelligence
• Insight / Foresight
• Social data analysis
• Log file analysis
• Fraud detection
Collect Data!!!
What is Big Data???
Is it Satellite & Aerial Imagery???
Is it a large network of a million points???
Is it a simple Geodatabase with Billions of records???
Is it this technology called Hadoop?
Big Data, a new Data Type
• Volume - Ever-growing data, petabytes / zettabytes
  - Use 350 billion annual meter readings to better predict power consumption
• Velocity - Time-sensitive, two minutes can be too late
  - Turn 12 TB of tweets into updates about people's conditions during a storm for emergency response
• Variety - Any type of data: text / audio / video / click streams / other
  - Exploit documents, images, voice recordings, and videos in customer experience analysis programs to improve customer satisfaction
What is Hadoop?
HDFS
MapReduce
Common
• Parallel framework
• Executes MapReduce tasks
• Reads HDFS
• Java/Python/Awk
$> cat input | Map | sort | Reduce > out
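The shell pipeline above can be imitated in a single process to show what each stage contributes. This is a minimal sketch, not Hadoop code: the token-counting job and the function names `map_phase`, `reduce_phase`, and `run_pipeline` are assumptions chosen for illustration. The point it makes is that the reducer only works because the sort step has already placed identical keys next to each other.

```python
from itertools import groupby
from operator import itemgetter


def map_phase(lines):
    """Map: emit one (key, 1) pair per whitespace-separated token."""
    for line in lines:
        for token in line.split():
            yield (token.lower(), 1)


def reduce_phase(sorted_pairs):
    """Reduce: sum the values of each run of identical, adjacent keys."""
    for key, group in groupby(sorted_pairs, key=itemgetter(0)):
        yield (key, sum(count for _, count in group))


def run_pipeline(lines):
    """In-process stand-in for: cat input | Map | sort | Reduce."""
    mapped = map_phase(lines)
    shuffled = sorted(mapped, key=itemgetter(0))  # the "sort" stage
    return list(reduce_phase(shuffled))


if __name__ == "__main__":
    sample = ["beirut tripoli", "Beirut sidon"]  # made-up input lines
    for key, total in run_pipeline(sample):
        print(key, total)
```

On a real cluster the map and reduce functions run on many machines and the framework performs the sort between them; the data flow is otherwise the same.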
Apache Hive
SQL
MapReduce Job
HDFS CSV TSV JSON BINARY
MapReduce
hive> select * from cities where country = 'lebanon';
What About Spatial ?
Big Data – A new data type for Geospatial
[Diagram: geospatial data sources: Imagery, DBMS, Services, Sensor Networks, Spreadsheets, Maps, Social Media, Big Data]
Geospatial in Big Data
• Big Data with location in Enterprise Big Data warehouse
• Run spatial queries on data where it resides
• Pull Results in ArcGIS: Visualize results as a map; Include in a report; Publish in a web or mobile app
GIS Tools for Hadoop
• http://esri.github.com/gis-tools-for-hadoop/
• Supports running geometry-based spatial queries inside Hadoop
• Open source - Apache 2.0 license
• Two types of users
  - Developers
  - ArcGIS users
GIS tools for Hadoop libraries
• Esri Geometry API: build Map/Reduce spatial apps in Java
• Spatial Framework: run Hive queries with spatial operators
• Geoprocessing Tools: connect from ArcGIS to Hadoop using GP
• GIS Tools for Hadoop: use samples with Hadoop & Hive to get started
GIS Tools for Hadoop GitHub repos
• Esri Geometry API for Java (EGA)
• Spatial Framework for Hadoop
• Geoprocessing Tools for Hadoop
• GIS Tools for Hadoop
Starting points: Developer, ArcGIS User
http://esri.github.com/gis-tools-for-hadoop/
Spatial Storage
• CSV, TSV: Lat, Lon
• Esri JSON format
  - {"geometry":{"x":-123,"y":45},"attributes":{}}
• Custom
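To make the two point storage formats concrete, here is a minimal sketch of reading one record of each kind. It assumes one record per line, standard JSON with quoted keys for the Esri JSON case, and latitude before longitude in the CSV case, as the slide's "Lat,Lon" suggests. The function names and the `lat_index` / `lon_index` parameters are hypothetical helpers, not part of any Esri library.

```python
import json


def parse_point_record(line):
    """Parse one Esri-JSON-style point record into (x, y, attributes).

    Assumes one JSON object per line with "geometry" and optional "attributes".
    """
    record = json.loads(line)
    geometry = record["geometry"]
    return float(geometry["x"]), float(geometry["y"]), record.get("attributes", {})


def parse_csv_record(line, lat_index=0, lon_index=1, delimiter=","):
    """Parse one delimited record into (x, y), that is (lon, lat).

    Column positions are assumptions; pass delimiter="\t" for TSV.
    """
    fields = line.rstrip("\n").split(delimiter)
    lat = float(fields[lat_index])
    lon = float(fields[lon_index])
    return lon, lat


if __name__ == "__main__":
    print(parse_point_record('{"geometry":{"x":-123,"y":45},"attributes":{}}'))
    print(parse_csv_record("45,-123"))
```

Both helpers return x (longitude) first so that downstream spatial code sees a consistent axis order regardless of how the file stored the columns.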
User Defined Functions
• select tolower('ESRI');
• select * from mytable where cos(rad) < 0.1;
Spatial UDF !
select * from cities where near(x,y,-84.2,39.4);
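The slide does not say how `near` decides that two points are close, so the following is only one plausible reading: a great-circle (haversine) distance compared against a threshold. The `max_km` parameter and its default of 50 km are assumptions for illustration, and x, y are taken to be longitude and latitude in degrees.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in kilometres between two lon/lat points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def near(x, y, target_x, target_y, max_km=50.0):
    """Hypothetical predicate: True when (x, y) lies within max_km of the target."""
    return haversine_km(x, y, target_x, target_y) <= max_km


if __name__ == "__main__":
    # Made-up city coordinates (lon, lat) tested against the slide's target point.
    cities = {"A": (-84.19, 39.76), "B": (-0.13, 51.51)}
    for name, (lon, lat) in cities.items():
        print(name, near(lon, lat, -84.2, 39.4))
```

Inside Hive, a predicate like this would be evaluated once per row, which is why a spatial UDF fits the same slot as `cos(rad) < 0.1` in a `where` clause.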
select * from cities where contains(x,y,'#mypolys');
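A `contains` test of a point against a set of polygons can be sketched with the classic ray-casting (even-odd) rule. This is an illustration of the idea only, not the Esri Geometry API: it assumes simple polygons given as lists of (x, y) vertices with no holes, treats coordinates as planar, and the names `point_in_ring` and `contains` are hypothetical.

```python
def point_in_ring(x, y, ring):
    """Even-odd ray casting: count edge crossings of a ray going in +x from (x, y)."""
    inside = False
    j = len(ring) - 1  # index of the previous vertex, starting with the last one
    for i in range(len(ring)):
        xi, yi = ring[i]
        xj, yj = ring[j]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (yi > y) != (yj > y):
            x_cross = xi + (y - yi) * (xj - xi) / (yj - yi)
            if x < x_cross:
                inside = not inside
        j = i
    return inside


def contains(x, y, polygons):
    """Hypothetical predicate: True when any polygon in the set contains (x, y)."""
    return any(point_in_ring(x, y, ring) for ring in polygons)


if __name__ == "__main__":
    square = [(0, 0), (4, 0), (4, 4), (0, 4)]  # made-up polygon
    print(contains(2, 2, [square]))
    print(contains(5, 2, [square]))
```

Points that fall exactly on an edge are not handled consistently by this simple rule; a production geometry library defines boundary behavior explicitly.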
How to get started? Technology
• Download GIS Tools for Hadoop
  - Sample (jars + sample data / query)
• Hadoop cluster
  - Amazon Web Services EMR
  - Cloudera Manager and VM, CDH
• Run sample
How to get started? Data and Query
• Use case, problem definition
• Dataset
  - Formats
  - Conversion
  - Location
  - Loading
• Query
  - Hive script
AWS Elastic Map Reduce GP Tools
Cloud
Cloudera Map Reduce
On Premise
Big Data The Data warehouse
Big Data Partners
Marwa Mabrouk - [email protected]
Mansour Raad - [email protected], http://thunderheadxpler.blogspot.com/
Please fill out the session evaluation
First Offering ID: 2333
Online – www.esri.com/ucsessionsurveys
Paper – pick up and put in drop box
Thank you