vBACD - Distributed Petabyte-Scale Cloud Storage with GlusterFS - 2/28
Distributed Petabyte-Scale Cloud Storage with GlusterFS
The Future of GlusterFS and Gluster.org
John Mark Walker
GlusterFS Community Guy
Red Hat, Inc.
February 28, 2012
February 28, 2012 The Future of Gluster.org - John Mark Walker
The Roots of GlusterFS
● Distributed storage solutions difficult to find
● Decided to write their own
● No filesystem experts – Pro & Con
● Applied lessons from microkernel architecture
  – GNU Hurd
The Roots of GlusterFS
● All storage solutions were either
  ● Too expensive, or…
  ● Not scalable, or…
  ● Single purpose, or…
  ● Don't support legacy apps, or…
  ● Don't support new apps, or…
  ● Do some combo of the above, but not very well
The Roots of GlusterFS
● The challenge:
  ● Create a storage system that was…
    – Scalable
    – Seamlessly integrated in the data center
    – Future-proof
● The solution: GlusterFS
  ● Scalable, with DHT
  ● POSIX-compliant
  ● Stackable
  ● User-space
GlusterFS Client Architecture
● Creating a file system in user space
● Utilizes fuse module
  – Kernel goes through fuse, which hands off to glusterd
[Diagram: Applications; Linux kernel (Fuse, Ext4); glusterd]
No Centralized Metadata
[Diagram: Client A, Client B, Client C; Server X, Server Y, Server Z, each holding Files and Extended Attr.]
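The idea of locating files without a central metadata server can be sketched in a few lines. This is a simplified illustration only, not GlusterFS's actual hash function or layout logic: the MD5-based hash, the even split of the hash range, and the names `BRICKS` and `brick_for` are all assumptions made for the example. The point is that every client can compute the same answer on its own from the file name.

```python
import hashlib

# Hypothetical brick names, matching the servers in the diagram above.
BRICKS = ["server-x", "server-y", "server-z"]


def brick_for(filename, bricks):
    """Pick a brick from the file name alone; no metadata server is consulted."""
    digest = hashlib.md5(filename.encode("utf-8")).digest()
    # Use the first 4 bytes of the digest as a 32-bit hash value.
    hash_value = int.from_bytes(digest[:4], "big")
    # Split the 32-bit hash space into equal ranges, one per brick.
    span = 2**32 // len(bricks)
    # The top few hash values spill past the last full range, so clamp the index.
    index = min(hash_value // span, len(bricks) - 1)
    return bricks[index]


if __name__ == "__main__":
    for name in ["report.pdf", "vm-image.qcow2", "notes.txt"]:
        print(name, "->", brick_for(name, BRICKS))
```

Because the placement is a pure function of the name and the brick list, Client A and Client B reach the same server for the same file without talking to each other.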
What is a Translator?
● Add/remove layers
● Reorder layers
● Move layers between client and server
● Implement new layers
  ● e.g. encryption
● Replace old layers
  ● e.g. replication
FUSE Interface Layer
Performance Layer
Distribution Layer
Replication Layer
Protocol Layer
Local Filesystem Layer
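The stacking idea can be illustrated with a toy model. This is a conceptual sketch, not GlusterFS's translator API: the class names, the two-method interface, and the XOR "encryption" are all invented for the example. Each layer exposes the same operations and hands the call to the layer below it, which is what makes layers easy to add, remove, or reorder.

```python
class Translator:
    """Hypothetical base layer: forwards every call to its child."""

    def __init__(self, child=None):
        self.child = child

    def write(self, path, data):
        self.child.write(path, data)

    def read(self, path):
        return self.child.read(path)


class Storage(Translator):
    """Bottom of the stack: stands in for the local filesystem."""

    def __init__(self):
        super().__init__()
        self.files = {}

    def write(self, path, data):
        self.files[path] = data

    def read(self, path):
        return self.files[path]


class XorEncryption(Translator):
    """Toy 'encryption' layer (XOR with a fixed byte; not real security)."""

    KEY = 0x5A

    def write(self, path, data):
        self.child.write(path, bytes(byte ^ self.KEY for byte in data))

    def read(self, path):
        return bytes(byte ^ self.KEY for byte in self.child.read(path))


class Replicate(Translator):
    """Writes to every child, reads from the first one."""

    def __init__(self, children):
        super().__init__()
        self.children = children

    def write(self, path, data):
        for child in self.children:
            child.write(path, data)

    def read(self, path):
        return self.children[0].read(path)


if __name__ == "__main__":
    stack = XorEncryption(Replicate([Storage(), Storage()]))
    stack.write("/hello.txt", b"hello")
    print(stack.read("/hello.txt"))
```

Reordering is just a different constructor call: `Replicate([XorEncryption(Storage()), XorEncryption(Storage())])` encrypts below the replication layer instead of above it.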
Some Features
● Distributed, replicated and/or striped volumes
● Global namespace
● High availability
● Geo-replication
● Rebalancing
● Remove or replace bricks
● Self healing
● volume profile and top metrics
No one ever expects the Red Hat acquisition
Red Hat Invests in GlusterFS
● Unstructured data volume to grow 44x by 2020
● Cloud and virtualization are driving scale-out storage growth
● Scale-out storage shipments to exceed 63,000 PB by 2015 (74% CAGR)
● 40% of core cloud spend related to storage
● GlusterFS-based solutions up to 50% less than other storage systems
Red Hat Invests in GlusterFS
● GlusterFS adds to the Red Hat stack
  ● Complements other Red Hat offerings
  ● Many integration points
● More engineers hacking on GlusterFS than ever before
[Diagram: RHEL, RHEV, BareMetal, Clouds, JBoss; GlusterFS Unified Storage]
Red Hat Invests in GlusterFS
● Acceleration of community investment
● GlusterFS needs to be “bigger than Red Hat”
● Transformation of GlusterFS from product to project
  – From “open core” to upstream
● More resources for engineering and community outreach
● Red Hat's success rests on economies of scale
  – Critical mass of users and developers
Join a Winning Team
● We're hiring hackers and engineers
● Looking for community collaborators
  ● ISVs, students, IT professionals, fans, et al.
“Join me, and together, we can rule the galaxy...”
The Immediate Future
The Gluster Community
● 300,000+ downloads
  ● ~35,000 /month
  ● >300% increase Y/Y
● 1000+ deployments
  ● 45 countries
● 2,000+ registered users
  ● Mailing lists, Forums, etc.
Global adoption
The Gluster Community
● Why are we changing?
  ● Only 1 non-Red Hat core contributor
    – There were 2, but he acquired us
  ● Want to be the software standard for distributed storage
  ● Want to be more inclusive, more community-driven
Goal: create global ecosystem that supports ISVs, service providers and more
Towards “Real” Open Source
● GlusterFS, prior to acquisition
  ● “Open Core”
  ● Tied directly to Gluster products
    – No differentiation
  ● Very little outside collaboration
  ● Contributors had to assign copyright to Gluster
    – Discouraged would-be contributors
Towards “Real” Open Source
“Open Core”
● All engineering controlled by project/product sponsor
● No innovation outside of core engineering team
● All open source features also in commercial product
● Many features in commercial product not in open source code
[Diagram: Open Source Code, Commercial Product]
Towards “Real” Open Source
“Real” Open Source
● Many points of collaboration and innovation in open source project
● Engineering team from multiple sources
● Project and product do not completely overlap
● Commercial products are hardened, more secure and thoroughly tested
[Diagram: Open Source Code, Commercial Products]
Towards “Real” Open Source
“Real” Open Source
● Enables more innovation on the fringes
● Engineering team from multiple sources
● Open source project is “upstream” from commercial product
● “Downstream” products are hardened, more secure and thoroughly tested
[Diagram: Fedora Linux, RHEL]
Towards “Real” Open Source
“Real” Open Source
● Enables more innovation on the fringes
● Engineering team from multiple sources
● Open source project is “upstream” from commercial product
● “Downstream” products are hardened, more secure and thoroughly tested
[Diagram: GlusterFS, Red Hat Storage]
Project Roadmaps
GlusterFS 3.3 ETA in Q2/Q3 2012
What's New in GlusterFS 3.3
● New features
  ● Unified File & Object access
  ● Hadoop / HDFS compatibility
● New Volume Type
  ● Replicated + striped (+ distributed) volumes
● Enhancements to Distributed volumes (DHT translator)
  ● Rebalance can migrate open files
  ● Remove-brick can migrate data to remaining bricks
● Enhancements to Replicated volumes (AFR translator)
  ● Change replica count on an active volume, add replication to distribute-only volumes
  ● Granular locking: much faster self-healing for large files
  ● Proactive self-heal process starts without FS stat
  ● Round-trip reduction for lower latency
  ● Quorum enforcement: avoid split-brain scenarios
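Why quorum helps against split brain can be shown with a small check. This is a sketch under a stated assumption: it uses a plain strict-majority rule, and GlusterFS's actual quorum policy and its configuration options may differ. The function name `has_quorum` is invented for the example.

```python
def has_quorum(reachable_replicas, replica_count):
    """Allow writes only when a strict majority of replicas is reachable.

    Assumed rule for illustration; not taken from the GlusterFS source.
    """
    return reachable_replicas > replica_count // 2


if __name__ == "__main__":
    # A 3-way replica splits into a group of 2 and a group of 1.
    print(has_quorum(2, 3))  # the larger side may keep writing
    print(has_quorum(1, 3))  # the smaller side must refuse writes
```

Two disjoint groups of replicas cannot both hold a strict majority, so at most one side of a network partition accepts writes and the copies cannot diverge.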
File and Object Storage
● Traditional SAN / NAS support either file or block storage
● New storage methodologies implement RESTful APIs over HTTP
● Demand for unifying the storage infrastructure increasing
● Treats files as objects and volumes as buckets
● Available now in 3.3 betas
● Soon to be backported to 3.2.x
● Contributing to OpenStack project
  ● Re-factored Swift API
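The "files as objects, volumes as buckets" mapping can be pictured as a simple path translation. This is a hypothetical sketch, not the actual GlusterFS or Swift code: the mount root `/mnt/gluster`, the function `object_to_path`, and the validation rules are assumptions made for the example.

```python
from pathlib import PurePosixPath

# Hypothetical directory under which each volume is mounted.
MOUNT_ROOT = PurePosixPath("/mnt/gluster")


def object_to_path(bucket, key):
    """Map an object request (bucket, key) onto a file path in a volume."""
    if not bucket or "/" in bucket or bucket in (".", ".."):
        raise ValueError("invalid bucket name")
    if not key or key.startswith("/") or ".." in PurePosixPath(key).parts:
        raise ValueError("invalid object key")
    # The bucket names the volume; the key is the file's path inside it.
    return MOUNT_ROOT / bucket / key


if __name__ == "__main__":
    print(object_to_path("photos", "2012/feb/talk.jpg"))
```

Under this view the same bytes are reachable two ways: as an object through an HTTP API, and as an ordinary file through the mounted volume.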
Technology Integrations
[Diagram: OpenStack Imaging Services; Unified File & Object Storage; … Compute; API Layer; Mobile Apps, Web Clients, Enterprise Software Ecosystem]
GlusterFS used as VM storage system
● Pause and re-start VMs, even on another hypervisor
● HA and DR for VMs
● Faster VM deployment
● vMotion-like capability
Shared storage ISOs and appliances
● oVirt / RHEV
● CloudStack
● OpenStack
Goal: The standard for cloud storage
HDFS/Hadoop Compatibility
● HDFS compatibility library
  ● Simultaneous file and object access within Hadoop
● Benefits
  ● Legacy app access to MapReduce applications
  ● Enables data storage consolidation
● Simplify and unify storage deployments
● Provide users with file level access to data
● Enable legacy applications to access data via NFS
  ● Analytic apps can access data without modification
The Gluster Community
● What is changing?
  ● HekaFS / CloudFS being folded into Gluster project
    – HekaFS == GlusterFS + multi-tenancy and SSL for auth and data encryption
    – HekaFS.org
    – ETA ~9 months
What else?
GlusterFS Advisory Board
● Advisory board
  ● Consists of industry and community leaders from Facebook, Citrix, Fedora, and OpenStack
    – Richard Wareing, Storage Engineer, Facebook
    – Jeff Darcy, Filesystem Engineer, Red Hat; Founder, HekaFS Project
    – AB Periasamy, Co-Founder, GlusterFS project
    – Ewan Mellor, Xen Engineer, Citrix; Member, OpenStack project
    – David Nalley, CloudStack Community Mgr; Fedora Advisory Board
    – Louis Zuckerman, Sr. System Administrator, Picture Marketing
    – Joe Julian, Sr. System Administrator, Ed Wyse Beauty Products
    – Greg DeKoenigsberg, Community VP, Eucalyptus; co-founder, Fedora
    – John Mark Walker, Gluster.org Community Guy (Chair)
Gluster.org Web Site
● Services for users and developers
● Developer section with comprehensive docs
● Collaborative project hosting
● Continuing development of end user documentation and interactive tools
● Published roadmaps
● Transparent feature development
GlusterFS Downloads
● Where's the code?
  ● GlusterFS 3.3
    – Simultaneous file + object
    – HDFS compatibility
    – Improved self-healing + VM hosting
      ● Granular locking
    – Beta 3 due Feb/Mar 2012
    – http://download.gluster.org/pub/gluster/glusterfs
Gluster.org Services
● Gluster.org
  ● Portal into all things GlusterFS
● Community.gluster.org
  ● Self-support site; Q&A; HOWTOs; tutorials
● Patch review, CI
  ● review.gluster.com
● #gluster
  ● IRC channel on Freenode
Development Process
● Source code
  ● Hosted at github.com/gluster
● Bugs and Feature Requests
  ● Bugzilla.redhat.com – select GlusterFS from menu
● Patches
  ● Submit via Gerrit at review.gluster.com
● See Development Work Flow doc:
  ● gluster.org/community/documentation/index.php/Development_Work_Flow
Thank You
● GlusterFS contacts
  ● Gluster.org/interact/mailinglists
  ● @RedHatStorage & @GlusterOrg
  ● #gluster on Freenode
● My contact info
  ● [email protected]
  ● Twitter & identi.ca: @johnmark