Bigtable A Distributed Storage System for...

Bigtable: A Distributed Storage System for Structured

Data by GoogleSUNNIE CHUNG

CIS 612

Google Bigtable

� A distributed storage system for managing structured

data that is designed to scale to a very large size:

petabytes of data across thousands of commodity

servers.

� Many projects at Google store data in Bigtable, including

web indexing, Google Earth, and Google Finance.

� These applications place very different demands on

Bigtable, both in terms of :

� Data size (from URLs to web pages to satellite imagery) a

� Latency requirements (from backend bulk processing to

real-time data serving).

Google Bigtable

� Wide Applicability

� Scalability

� High Performance

� High Availability

� Bigtable is used by:

• Google Analytics

• Google Finance

• Google Earth

• Google Search / Personalized Search

• Orkut etc.

Google Bigtable is

� a sparse,

� distributed,

� persistent,

� multi dimensional sorted map which is indexed by

� a Row Key,

� a Column Key and

� a Timestamp,

� where each value in the map is an un-interpreted

array of bytes.

Data Model

( row key : string, column : string, timestamp : int64) �� string

T9 T8<html>…

<html>…

“CNN

“CNN.COM

“com.cnn.www”

“Contents;”

“anchor:cnnsi.com;”

“anchor:my.look.ca”

Fig: A slice of an eg table (consider name Webtable) storing web pages

• Arbitrary strings; currently up to 64KB in size

allowed; typical size for string data 10-100 bytes

• Every read or write is atomic

• Lexicographic ordering of keys

• Row range definition for a table is dynamic called a

Tablet

Column Families:

• Column keys are grouped into sets called column

families

• Syntax: family:qualifier

• Eg: “anchor:my.look.ca”: in the figure from previous slide

• language:LanguageID: This stores the language for

webpage

Bigtable Rows and Column Families

Timestamps:

• Multiple versions of the data are allowed in a

single cell; indexed by timestamps

• Timestamps are 64 bit integer values

• Client or Bigtable can generate timestamps for

the data

• Stored in decreasing order keeping newest data

on top of storage

• Bigtable column family can have their own

garbage collection mechanism defined to help

keep only recent ‘n’ no. of data values

Timestamps

Bigtable API Provides functions for

• Creating and Deleting tables

• Creating and Deleting column families

• Changing cluster, table and column family

metadata

• Access to data rows for write and delete

operations

• Scan through data for particular column

families, rows and data values with filters

applied

• Batch and atomic writes

Bigtable API

Google File System

Google SSTableChubby

Bigtable Building Blocks

11Bigtable Building Blocks

Google File System:

To Store log and data files

Google SSTable:

Used internally to store data in Bigtable

Chubby:

Paxos based system for consensus in network of

unreliable processors

Provides a namespace to store Directories and small

files. Each Dir or file can be used as a lock and every

access is atomic

Chubby Unavailable Bigtable Unavailable

12Implementation

Three Major components:

A Library that is linked to every client

One master server:Responsible for assigning tablets to Tablet Servers

Many Tablet servers:Dynamically added and removed from a cluster to

accommodate changes in workloads

Recall: Tablet: Every table in Bigtable is dynamically partitioned

on the basis of row keys / range. Each row range is called a

tablet.

One master server:• Responsible for assigning tablets to Tablet Servers

• Detecting the addition and expiration of Tablet Servers

• Balancing Tablet Server Load

• Garbage collection of files in GFS

Note: No client practically communicates with Master Server

Many Tablet servers:• Dynamically added and removed from a cluster to

accommodate changes in workloads

• Each Tablet Server manages a set of Tablet

• Handles Read and Write requests to the Tablets

• Split tablets when they have grown too large

Implementation Cntd.

Chubby

Other Metadata Tablets

User Table 1

User Table N

Root Tablet(1st metadata

Tablet)

Fig: Tablet Location Hierarchy

Tablets

15Tablet Assignment

Master Server and Tablet Assignment:

• Assign a tablet to only one Tablet Server at a given time

• Keep track of all the Tablet servers including tablets

assignments and un-assignments

• Master is responsible for:

• Detecting when a tablet server is no longer serving

its tablets

• Keep track of lock / Session with Chubby

Steps Taken by Master at Startup:• Grab a unique Master lock in Chubby

• Scan the servers directory to find live servers

• Communicate with every live server to check assignments

• Scan the Metadata table to learn the set of Tablets

memtable Read Op

Write Op

SSTable Files

tablet log

Memory

Tablet Representation

17Tablet Serving

Read Operation on Tablet

• Checked for Well

formedness and

proper authorization

• Valid read operation

is executed on a

merged view of

sequence of SSTables

and memtable

Write Operation on Tablet

When write operation arrives

at Tablet:

• Server checks for

well formedness and

proper authorization

• Valid mutation is

written to the

commit log

• Group commit is

used to improve the

throughput

• After write is

committed, contents

are inserted to

memtable

18Tablet Compactions

Everything that happens with data here is atomic!!!

Minor Compactions:

• Shrinks memory usage of Tablet server

• Reduces amount of data that has to be recreated from

log in case of failure

Merging Compactions:

• Performed in background on SSTables created by minor

compactions

Major Compactions:

• All SSTables are merged and rewritten to one SSTable.

• Bigtable performs this operation regularly.

19Refinements

• Locality Groups

• Compression

• Caching for Read performance

• Bloom Filters

• Commit Log Implementation

• Speeding up Tablet recovery

• Exploiting immutability

20Summary

Bigtable is a distributed storage system for storing structured data

at Google

In operation since 2005, by August 2006 more than 60 projects are

using Bigtable

Effective performance, High availability and Scalability are the

key features for most of the clients

Control over architecture allows Google to customize the

product as needed.

Use by old and new clients demonstrates that Bigtable

architecture works.

This Lecture Notes from

the paper published by Google in OSDI 2006

http://grail.csuohio.edu/~sschung/cis612/googlebigt

able-osdi06.pdf

Bigtable A Distributed Storage System for...

Documents

Transcript of Bigtable A Distributed Storage System for...

Bigtable: A Distributed Storage System for Structured Data · Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size:

BigTable A System for Distributed Structured Storage

Distributed Systems - Computer Sciencepxk/417/notes/content/17-bigtable-slides.pdf · A big table •Bigtable is NOT a relational database •Bigtable appears as a large table –“A

BigTable: A System for Distributed Structured Storage · 2019. 10. 22. · 1 BigTable: A System for Distributed Structured Storage Jeff Dean !! Joint work with: Mike Burrows, Tushar

Bigtable - A Distributed Storage System for …...Bigtable - A Distributed Storage System for Structured Data Fay Chang, Je rey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach,

Database Design - cis.csuohio.edu

GFS, Mapreduce and Bigtable - cs.helsinki.fi · • BigTable is a distributed storage system for managing structured data. • Designed to scale to a very large size • Petabytes

Distributed Systems - Computer Sciencepxk/417/notes/content/19-bigtable-slides.pdf · API: Operations on Bigtable •Create/deletetables & column families •Changecluster, table,

Bigtable: A Distributed Storage System for …cs.uwec.edu/~buipj/teaching/cs.491.s13/static/pdf/05_bigtable_a...Bigtable: A Distributed Storage System for Structured Data · 4: 3 Fig.

Bigtable A Distributed Storage System for Structured Data ...eecs.csuohio.edu/~sschung/...BigTable_Updated.pdf · Google Bigtable A distributed storage system for managing structured

Bigtable: A Distributed Storage System for Structured Dataranger.uta.edu/~sjiang/CSE6350-spring-19/Presentations-materials/Topic... · •Bigtable is a distributed multi-dimensional

CS 347Lecture 91 CS 347: Parallel and Distributed Data Management Notes X: BigTable, HBASE, Cassandra.

BigTable: A System for Distributed Structured Storagepages.cs.wisc.edu/~remzi/Classes/739/Fall2017/Papers/bigtable-slides-05.pdf · Bigtable master Bigtable tablet server Bigtable

Bigtable: A Distributed Storage System for …Bigtable •Distributed multi-level map •Fault-tolerant, persistent •Scalable –Thousands of servers –Terabytes of in-memory data

Bigtable: A Distributed Storage System for Structured Data · Bigtable is designed to reliably scale to petabytes of data and thousands of machines. Bigtable has achieved several

Cassandra - O'Reilly Mediaassets.en.oreilly.com/1/event/27/Cassandra_ Open Source Bigtable... · Two famous papers Bigtable: A distributed storage system for structured data, 2006

Bigtable: A Distributed Storage System for Structured Data

iBigTable: Practical Data Integrity for BigTable in Public ...feihu.eng.ua.edu/bigdata/week14_2.pdf · Bigtable is a distributed storage system for managing structured data. A table

Bigtable: A Distributed Storage System for …tozsu/courses/CS742/W13...Bigtable uses the distributed Google File System (GFS) to store log and data files The Google SSTable file format

Bigtable: A Distributed Storage System for Structured Data · 2016-01-09 · Introduction •Bigtable is a distributed storage system for managing structured data. •Goals of Bigtable