EMC-Isilon__CAUDIT-RDSI_Nov2013

Page 1: EMC-Isilon__CAUDIT-RDSI_Nov2013

7/25/2019 EMC-Isilon__CAUDIT-RDSI_Nov2013

http://slidepdf.com/reader/full/emc-isiloncaudit-rdsinov2013 1/33

© Copyright 2013 EMC Corporation. Company Confidential, do not distribute without explicit permission

The POWE of EMC ISI To Trans eResea

Greg Rogers, RTM, EMC ISILON Australia

Charles Sevior, CTO, EMC ISILON Asia-Pac

ISILON – CAUDIT presentation to RDSI

Page 2: EMC-Isilon__CAUDIT-RDSI_Nov2013

Agenda – Topics to be covered

1. Isilon eResearch customer references - Charles

2. Isilon NAS brief overview - Charles

3. Aspera integration into Isilon OneFS – JC

4. Hadoop analytics integration with Isilon - Charles

5. Isilon OneFS software / hardware roadmap – Charles

6. Next steps and partnerships - Greg

Page 3: EMC-Isilon__CAUDIT-RDSI_Nov2013

UCLA

EMC Isilon Established Leadership: Healthcare, Life Sciences, eResear

Page 4: EMC-Isilon__CAUDIT-RDSI_Nov2013

Columbia University Center for Computational Biology and Bioinformatics (“

Challenge: Using a traditional NAS system that strug

huge amounts of input/output demands o our computing infrastructure

Imminently outgrowing their infrastructu

Solution: X-Series, NL-Series, SmartPools, SmartQuotas

Results: Now supports some 4,000 CPUs, which ca

heavy I/O and data analysis demands of

Supports the storage needs of three addi within the Columbia University network

Applicati Genomic

“After switching to Isilon, we no longer had to worry t couldn’t handle our research demands. We knew that independently scale capacity and performance, so tha we need, when we need it.”

JOHN LOWELL WOFFORD, Director IT Services

Page 5: EMC-Isilon__CAUDIT-RDSI_Nov2013

3TIER: Turns Big Data Into Accurate Renewable Energy Intelli

Challenge: Traditional RAID array couldn’t scale to m of its big data workflow

Complicated management

Escalating costs

Solution: X-Series, NL-Series, SmartPools

Results: Single file system and point of managem performance tiers

Unifying all operations on a single, shared

Simplifying Big Data management

Applicati energy a forecasti

“Before Isilon, our team had to do everything from ma migrations to mapping directory paths, which simply for a business growing as quickly as ours. With Isilon simple. We’ve eliminated data management and stora freeing up resources to focus on the services that del our clients.”

PAUL ENGLISH, Director of IT at 3TIER

Page 6: EMC-Isilon__CAUDIT-RDSI_Nov2013

RENCI: Renaissance Computing Institute of the University of N

• Turning Big Data into Insight in the lab and therapy in the clinic

• Knowledge based medicine programs for epilepsy & prostate cancer

• Secure medical workspace

• Informatics for Genetic Sequencing (IGS)

• Blending traditional cluster computing with iRODS and Hadoop analytics

http://www.emc.com/collateral/white-papers/h11692-life-sciences-renci-big-data-manage-info-wp

Page 7: EMC-Isilon__CAUDIT-RDSI_Nov2013

eResear

Challeng

Page 8: EMC-Isilon__CAUDIT-RDSI_Nov2013

User requirements for RDSI?

Growing data sets – mostly unstructured

Reliable storage integrity / preservation

Manage migration issues – active and cold d

Leverage MapReduce and Hadoop analytics

Ease of use and simple management

Trust / Security / Immutability

Sharing / Collaboration / Cloud – REST API

Page 9: EMC-Isilon__CAUDIT-RDSI_Nov2013

EMC Isil Unique Differentia

Page 10: EMC-Isilon__CAUDIT-RDSI_Nov2013

Challenges arising from standard

• Utilisation rates vary across islands

• QOS headroom

• Performance considerations

= Storage Efficiency 50% - 65%

Page 11: EMC-Isilon__CAUDIT-RDSI_Nov2013

[Diagram: Client/Application Layer (Servers/Clients, Compute Nodes), Ethernet Layer, Clustered Storage Layer]

EMC Isilon – NODE-Based Architect

Page 12: EMC-Isilon__CAUDIT-RDSI_Nov2013

Focus on the d

not the stora

Page 13: EMC-Isilon__CAUDIT-RDSI_Nov2013

AutoBalance: Automated data balancing across node reduces costs, complexity and risks for scaling storage

• AutoBalance migrates con storage nodes while syste and in production

• NO manual intervention

• NO reconfiguration

• NO server or client mount application changes

• Eliminates “Hot Spots”
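
The effect described in the bullets above can be pictured with a small calculation. The sketch below is illustrative only and is not the OneFS algorithm: it assumes that balancing means bringing every node to the same fill fraction, and the node capacities and usage figures are hypothetical.

```python
def balanced_targets(used_tb, capacity_tb):
    """Return the per-node usage that gives every node the same fill fraction."""
    if len(used_tb) != len(capacity_tb):
        raise ValueError("need exactly one capacity value per node")
    total_capacity = sum(capacity_tb)
    if total_capacity <= 0:
        raise ValueError("cluster capacity must be positive")
    # Cluster-wide fill fraction after the new nodes have joined.
    fill = sum(used_tb) / total_capacity
    return [fill * cap for cap in capacity_tb]


# Hypothetical cluster: four nearly full nodes plus one newly added empty node.
before = [36.0, 36.0, 36.0, 36.0, 0.0]
capacity = [40.0] * 5
after = balanced_targets(before, capacity)

# Positive values mean data moves onto the node, negative values mean data moves off it.
moves = [target - used for target, used in zip(after, before)]
print(after)
print(moves)
```

The total amount of stored data is unchanged; only its distribution across nodes differs, which is why no client-side change would be needed in this model.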

[Figure: storage nodes labelled EMPTY and FULL, then BALANCED]

EMC Isilon – On-The-Fly Expansio

Page 14: EMC-Isilon__CAUDIT-RDSI_Nov2013

Data Layout - protection and throu

Data is striped across the nodes

 – Not across disks – FEC not RAID

Data breakdown:

 – 8 KB blocks

 – 16 blocks per stripe unit

 – 128 KB stripe width per drive
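
The data breakdown above is simple arithmetic: 16 blocks of 8 KB give a 128 KB stripe unit. The sketch below works through that calculation and then spreads a file's stripe units across nodes. The round-robin placement and the three-node cluster are assumptions made for illustration; the slide does not describe the actual placement policy.

```python
BLOCK_SIZE = 8 * 1024            # 8 KB blocks (from the slide)
BLOCKS_PER_STRIPE_UNIT = 16      # 16 blocks per stripe unit (from the slide)
STRIPE_UNIT_SIZE = BLOCK_SIZE * BLOCKS_PER_STRIPE_UNIT  # 128 KB


def stripe_units_needed(file_size_bytes):
    """Number of stripe units needed to hold a file (ceiling division)."""
    return -(-file_size_bytes // STRIPE_UNIT_SIZE)


def spread_across_nodes(file_size_bytes, node_count):
    """Count stripe units per node under a hypothetical round-robin placement."""
    counts = [0] * node_count
    for unit in range(stripe_units_needed(file_size_bytes)):
        counts[unit % node_count] += 1
    return counts


print(STRIPE_UNIT_SIZE // 1024, "KB per stripe unit")
print(spread_across_nodes(1024 * 1024, 3))  # a 1 MiB file on a 3-node cluster
```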

[Figure: clients writing a file]

Page 15: EMC-Isilon__CAUDIT-RDSI_Nov2013

With N+2, N+3, and N+4 protection, data is 100% available if multiple drives or nodes fail together

With N+1 protection, data is 100% available even if one drive or one node fails
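
The N+M notation above can be read as generic erasure-coding arithmetic: M protection units per stripe tolerate M simultaneous failures, at a capacity cost of M out of every N+M units. The sketch below shows only that arithmetic. The stripe width of 8 data units is an assumption for illustration; the slide does not state the actual OneFS stripe layouts.

```python
def protection_summary(data_units, protection_units):
    """Summarise an N+M layout: failures tolerated and capacity efficiency."""
    if data_units < 1 or protection_units < 0:
        raise ValueError("need at least one data unit and no negative protection")
    total = data_units + protection_units
    return {
        "tolerated_failures": protection_units,
        "overhead": protection_units / total,
        "efficiency": data_units / total,
    }


# Hypothetical stripe of 8 data units at each protection level named on the slide.
for m in (1, 2, 3, 4):
    summary = protection_summary(8, m)
    print(f"8+{m}: tolerates {summary['tolerated_failures']} failure(s), "
          f"efficiency {summary['efficiency']:.0%}")
```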

Built-in high availability clustered architecture – No RA

[Figure: cluster nodes labelled 100%, with two marked FAILED]

Fastest rebuild time, and with Isilon, the more nodes in the cluster, the faster drive rebuild time

EMC Isilon – Leading Data Protect

Page 16: EMC-Isilon__CAUDIT-RDSI_Nov2013

“No Node Left Behind”

Generations of Isilon nodes can coexist

– Heterogeneous nodes mix and match

– Storage investment protected

Rapid introduction of the latest technology

– Faster, denser, greener storage each year

– New capabilities like Hadoop and MobileIQ

– No server, network or application changes

Push-button node retire – SmartFail

– Storage older than 5 years is a waste of space!

EMC Isilon – Storage Never Obsole

No More Data Migration. Keep up with Moore

Page 17: EMC-Isilon__CAUDIT-RDSI_Nov2013

Isilon+Aspe Store & Move Big

JC Dioma Aspera

Page 18: EMC-Isilon__CAUDIT-RDSI_Nov2013

EMC Isilon with integrated Aspera

Page 19: EMC-Isilon__CAUDIT-RDSI_Nov2013

Aspera and Isilon have partnered for many

create a predictable, non-disruptive, high-performance wide area file and data deliver solution specifically for moving large data se long distances at the fastest possible speed

Aspera’s cluster-aware high-speed transfer coupled with EMC Isilon OneFS operating sy and scale-out NAS architecture is a powerfu combination for distributed storage environ

The Benefits of Isilon + Aspera

Page 20: EMC-Isilon__CAUDIT-RDSI_Nov2013

The Benefits of Isilon + Aspera

Maximum cluster-to-cluster performance ov campus and wide-area networks

Massive concurrency

Easy scalability of WAN transfer performanc

No single point of failure

Single point of management

Web services and industry-standard API sup

Page 21: EMC-Isilon__CAUDIT-RDSI_Nov2013

EMC Isil Hadoop Analy

Page 22: EMC-Isilon__CAUDIT-RDSI_Nov2013

Next-Gen Analytics

ANALYTICS, MOBILE

Key B

Hadoop 2

Native HD

Pivotal H

Simultane & HDFS 2

Distribute

Support F Hadoop A

No Single

Improved

Page 23: EMC-Isilon__CAUDIT-RDSI_Nov2013

HDFS: Standard Hadoop Cluster

[Diagram: data sources (Log Files, Decision Support Databases, User/WEB Click data, OLAP) feed Landing Zone Servers over NFS, HTTP, CIFS and FTP; the cluster shows Compu and Data nodes with a Name node; copies are labelled 3X]

Step 1: Data is copied into the Landing Zone

Step 2: Data is copied from landing zone onto a node in the Cluster

Step Data is loca from a node Cluster (3

S MapRed
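
The copy steps of the standard workflow above imply a data-movement cost that can be estimated with simple arithmetic. The sketch below assumes that the "3X" label marks the HDFS default replication factor of three, so that Step 2 stores three copies of the data, and it counts Step 1 as one further copy. The dataset size is hypothetical.

```python
def tb_written_standard(dataset_tb, replicas=3):
    """Total data written when staging through a landing zone into HDFS."""
    landing_zone_copy = dataset_tb        # Step 1: copy into the landing zone
    cluster_copies = dataset_tb * replicas  # Step 2: replicated copy into the cluster
    return landing_zone_copy + cluster_copies


# Hypothetical 100 TB dataset.
dataset_tb = 100
total = tb_written_standard(dataset_tb)
print(f"{total} TB written to make {dataset_tb} TB available for analysis")
```

Under these assumptions, every terabyte of source data results in four terabytes of writes before a job can start.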

Page 24: EMC-Isilon__CAUDIT-RDSI_Nov2013

HDFS: Integrated Isilon and Hado

[Diagram: Isilon cluster accessed over SMB, NFS, HTTP, FTP and HDFS; nodes labelled "name no"; data sources are OLAP, Log Files, Decision Support Databases and User/WEB Click data]

Step 1: Much or all of the Data lives on the Isilon Cluster

Step 2: Jobs are run

Page 25: EMC-Isilon__CAUDIT-RDSI_Nov2013

ARCHITECTURE FOR HIGH PERFORMANCE COMPUTING WITH INTEGRA

Page 26: EMC-Isilon__CAUDIT-RDSI_Nov2013

Future Directio

Page 27: EMC-Isilon__CAUDIT-RDSI_Nov2013

OneFS 7.0 - Maveric

Scalability, Enterprise Mana

Performance
– Improved Throughput
– Reduced Latency

Capacity
– SnapshotIQ File Clones
– 20 PB File System Ready

Extensibility
– REST Platform API

Data Pr
– SyncIQ:
– Snapsho

Automa Provisio
– Improve
– Higher R

Availabi
– Upgrade

Multi-Tenancy
– Authentication Zones
– Role Based Administration

VMware
– Reduced I/O Latency
– VAAI, VASA, SRM

Security and Archive
– SmartLock v2 (SEC 17a-4)

2H2012

2H

Page 28: EMC-Isilon__CAUDIT-RDSI_Nov2013

OneFS 7.1 - Waikiki

Efficiency Scale

Deduplication
– Post process, block level deduplication
– Increased data efficiency
– Multi-level directory configuration
– Dry-run dedupe savings estimation tool
– SmartPools Enhancements
– CLI and API management
– RBAC integration
– User defined node-pools
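
The idea behind block-level deduplication and a dry-run savings estimate can be shown with a generic sketch. This is not the OneFS implementation: it assumes fixed-size blocks identified by a SHA-256 hash, and it borrows the 8 KB block size from the data layout slide as an assumption. The function name and sample data are hypothetical.

```python
import hashlib

BLOCK_SIZE = 8 * 1024  # assumed block size, borrowed from the data layout slide


def estimate_dedupe_savings(data, block_size=BLOCK_SIZE):
    """Dry run: count duplicate fixed-size blocks without modifying any data."""
    seen = set()
    total_blocks = 0
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        total_blocks += 1
        seen.add(hashlib.sha256(block).digest())
    unique_blocks = len(seen)
    saved = total_blocks - unique_blocks
    return {
        "total_blocks": total_blocks,
        "unique_blocks": unique_blocks,
        "saved_fraction": saved / total_blocks if total_blocks else 0.0,
    }


# Hypothetical data: three identical blocks followed by one different block.
sample = b"A" * BLOCK_SIZE * 3 + b"B" * BLOCK_SIZE
print(estimate_dedupe_savings(sample))
```

Because the estimate only reads and hashes blocks, it can run as a post-process pass over existing data, which matches the "post process" and "dry-run" wording on the slide in spirit.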

Perfo

SyncIQ &
– “Continuo
– ~1 minut

Job Engine
– Improved
– Multiple s
– Enhanced monitorin

Breaking b
– 3 simulta jobs

Enterprise Security and Archive
– CEE support. Varonis first partner for Audit
– Improved RBAC Coverage
– Encrypted Nodes

Backup
– Faster incrementals and incrementals forever

Manageability & Extensibility
– RBAC improvements to API
– Broaden pAPI coverage incl. SyncIQ and JobEngine
– ESRS Gateway Integration

2H2013

Page 29: EMC-Isilon__CAUDIT-RDSI_Nov2013

Growth In Applications

Sources: IDC, Gartner, AWS Workload Estimates

Next Gen Cloud Ap: 6M (2012), 48M (2016)

Traditional Applications: 83M (2012), 141M (2016)

70%
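
The two growth figures above can be compared by converting them to compound annual growth rates. The sketch below uses only the 2012 and 2016 values from the slide; the function name is hypothetical, and the slide does not say how its own "70%" label was derived, so no link to that figure is claimed here.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate between two values."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start value and years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


years = 2016 - 2012
next_gen = cagr(6, 48, years)         # next-gen cloud applications, in millions
traditional = cagr(83, 141, years)    # traditional applications, in millions

print(f"Next gen: {next_gen:.1%} per year")
print(f"Traditional: {traditional:.1%} per year")
```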

Page 30: EMC-Isilon__CAUDIT-RDSI_Nov2013

Making Compute the Bottleneck

A Giant Leap in Capacity

Around-the-Clock, Around-the-Globe

Absolute Freedom and Control

Isilon’s OneFS Operating System 

Strategic Vectors

Page 31: EMC-Isilon__CAUDIT-RDSI_Nov2013

NEXT GEN APPLICATIONS

Hybrid

Private (Enterprise)

OneFS Storage

Isilon, The Big Data Platform

VERTIA

Public

Page 32: EMC-Isilon__CAUDIT-RDSI_Nov2013

Isilon strengths in eResearch

Area: Strength

Onsite data collection
• Multi-protocol support for instruments & workstations
• CIFS (SMB), NFS, FTP, HTTP

Repository
• SmartPools (Automatic storage tiering)
• GNS (Global Namespace, with metadata acceleration)

Workspaces
• SmartQuotas (Policy-based storage management)
• Home Directories

Collaboration
• Aspera FASP protocol integrated into OneFS
• Aspera Enterprise Server and Connect Server hosted within the

HPC/Hadoop
• HDFS (native support)
• 10GbE (2/node)
• SmartConnect
• Concurrent File Access support (OneFS 7.0)

Archival
• SnapshotIQ
• WORM – immutable writes
• NL Series

Page 33: EMC-Isilon__CAUDIT-RDSI_Nov2013
