School of Computer Science
Principles of Software Construction: Objects, Design, and Concurrency
Part 6: Concurrency and distributed systems
Distributed Systems
Jonathan Aldrich, Charlie Garrod
Administrivia
• Homework 5b due Thursday, 11:59 p.m.
  – Finish by Friday 10 a.m. if you want to be considered as a "Best Framework" for Homework 5c
• Our evaluation considers:
  – Novelty
  – Functional correctness
  – Documentation
  – …
Key concepts from last Thursday
Concurrency at the language level
• Consider, in Java:
  int sum = 0;
  Iterator<Integer> i = coll.iterator();
  while (i.hasNext()) { sum += i.next(); }
• In Python:
  sum = 0
  for item in coll:
      sum += item
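As an aside, both loops are inherently sequential. A sketch of a more declarative version that the runtime is free to parallelize, using Java 8+ streams (this assumes coll is a Collection<Integer>; the sample data is invented for illustration):

import java.util.List;

public class ParallelSum {
    public static void main(String[] args) {
        List<Integer> coll = List.of(13, 9, -4, 19, -6, 2, 6, 3);
        // parallelStream() lets the runtime split the summation across threads;
        // addition is associative, so the result matches the sequential loop.
        int sum = coll.parallelStream().mapToInt(Integer::intValue).sum();
        System.out.println(sum); // 42
    }
}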
Parallel prefix sums algorithm, winding
• Computes the partial sums in a more useful manner
[13, 9, -4, 19, -6, 2, 6, 3]
[13, 22, -4, 15, -6, -4, 6, 9]
[13, 22, -4, 37, -6, -4, 6, 5]
[13, 22, -4, 37, -6, -4, 6, 42]
…
Parallel prefix sums algorithm, unwinding
• Now unwinds to calculate the other sums
[13, 22, -4, 37, -6, -4, 6, 42]
[13, 22, -4, 37, -6, 33, 6, 42]
[13, 22, 18, 37, 31, 33, 39, 42]
• Recall, we started with: [13, 9, -4, 19, -6, 2, 6, 3]
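A minimal in-place sketch of both passes, assuming the array length is a power of two (this helper is illustrative, not the course's implementation). Each inner loop's iterations touch disjoint elements, which is exactly what a parallel implementation would split across threads:

public class PrefixSums {
    /** Replaces a with its inclusive prefix sums, in place. */
    static void prefixSums(int[] a) {
        int n = a.length;
        // Winding (up-sweep): accumulate sums over power-of-two-sized blocks.
        for (int h = 1; h < n; h *= 2) {
            for (int i = 2 * h - 1; i < n; i += 2 * h) {
                a[i] += a[i - h];
            }
        }
        // Unwinding (down-sweep): fill in the remaining prefix sums.
        for (int h = n / 2; h >= 2; h /= 2) {
            for (int i = h - 1; i + h / 2 < n; i += h) {
                a[i + h / 2] += a[i];
            }
        }
    }

    public static void main(String[] args) {
        int[] a = {13, 9, -4, 19, -6, 2, 6, 3};
        prefixSums(a);
        System.out.println(java.util.Arrays.toString(a));
        // [13, 22, 18, 37, 31, 33, 39, 42]
    }
}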
A framework for asynchronous computation
• The java.util.concurrent.Future<V> interface:
  V get();
  V get(long timeout, TimeUnit unit);
  boolean isDone();
  boolean cancel(boolean mayInterruptIfRunning);
  boolean isCancelled();
• The java.util.concurrent.ExecutorService interface:
  Future<?> submit(Runnable task);
  <T> Future<T> submit(Callable<T> task);
  <T> List<Future<T>> invokeAll(Collection<? extends Callable<T>> tasks);
  <T> T invokeAny(Collection<? extends Callable<T>> tasks);
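A small usage sketch combining the two interfaces (the pool size and the summing task are arbitrary choices for illustration):

import java.util.concurrent.*;

public class FutureDemo {
    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        // submit() returns immediately; the task runs asynchronously in the pool.
        Future<Integer> result = pool.submit(() -> {
            int sum = 0;
            for (int i = 1; i <= 100; i++) sum += i;
            return sum;
        });
        System.out.println(result.isDone()); // likely false: still computing
        System.out.println(result.get());    // blocks until done; prints 5050
        pool.shutdown();
    }
}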
Fork/Join: another common computational pattern
• In a long computation:
  – Fork a thread (or more) to do some work
  – Join the thread(s) to obtain the result of the work
• The java.util.concurrent.ForkJoinPool class
  – Implements ExecutorService
  – Executes java.util.concurrent.ForkJoinTask<V> or java.util.concurrent.RecursiveTask<V> or java.util.concurrent.RecursiveAction
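For instance, a sketch of the pattern with RecursiveTask (the array-sum task and the threshold are invented for illustration):

import java.util.concurrent.*;

public class ForkJoinSum extends RecursiveTask<Long> {
    private static final int THRESHOLD = 1_000;
    private final int[] data;
    private final int lo, hi;

    ForkJoinSum(int[] data, int lo, int hi) {
        this.data = data; this.lo = lo; this.hi = hi;
    }

    @Override protected Long compute() {
        if (hi - lo <= THRESHOLD) {  // base case: small enough to sum sequentially
            long sum = 0;
            for (int i = lo; i < hi; i++) sum += data[i];
            return sum;
        }
        int mid = (lo + hi) / 2;
        ForkJoinSum left = new ForkJoinSum(data, lo, mid);
        left.fork();                                        // fork: run left half asynchronously
        long rightSum = new ForkJoinSum(data, mid, hi).compute();
        return left.join() + rightSum;                      // join: wait for the forked half
    }

    public static void main(String[] args) {
        int[] data = new int[1_000_000];
        java.util.Arrays.fill(data, 1);
        long sum = ForkJoinPool.commonPool().invoke(new ForkJoinSum(data, 0, data.length));
        System.out.println(sum); // 1000000
    }
}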
Parallel prefix sums algorithm
• How good is this?
  – Work: O(n)
  – Depth: O(lg n)
• See PrefixSumsSequentialImpl.java
  – n-1 additions
  – Memory access is sequential
• For PrefixSumsNonsequentialImpl.java
  – About 2n useful additions, plus extra additions for the loop indexes
  – Memory access is non-sequential
• The punchline: Constants matter.
Today: Distributed system design
• Java networking fundamentals
• Introduction to distributed systems
  – Motivation: reliability and scalability
  – Failure models
  – Techniques for:
    • Reliability (availability)
    • Scalability
    • Consistency
Our destination: Distributed systems
• Multiple system components (computers) communicating via some medium (the network)
• Challenges:
  – Heterogeneity
  – Scale
  – Geography
  – Security
  – Concurrency
  – Failures
(courtesy of http://www.cs.cmu.edu/~dga/15-440/F12/lectures/02-internet1.pdf)
Communication protocols
• Agreement between parties for how communication should take place
Friendly greeting.
Muttered reply.
Destination?
Pittsburgh.
Thank you.
(courtesy of http://www.cs.cmu.edu/~dga/15-440/F12/lectures/02-internet1.pdf)
Abstractions of a network connection
  HTML | Text | JPG | GIF | PDF | …
  HTTP | FTP | …
  TCP | UDP | …
  IP
  data link layer
  physical layer
Internet addresses and sockets
• For IP version 4 (IPv4), a host address is a 4-byte number
  – e.g., 127.0.0.1
  – Hostnames are mapped to host IP addresses via DNS
  – ~4 billion distinct addresses
• A port is a 16-bit number (0-65535)
  – Assigned conventionally
    • e.g., port 80 is the standard port for web servers
Packet-oriented and stream-oriented connections
• UDP: User Datagram Protocol
  – Unreliable, discrete packets of data
• TCP: Transmission Control Protocol
  – Reliable data stream
Networking in Java
• The java.net.InetAddress class:
  static InetAddress getByName(String host);
  static InetAddress getByAddress(byte[] b);
  static InetAddress getLocalHost();
• The java.net.Socket class:
  Socket(InetAddress addr, int port);
  boolean isConnected();
  boolean isClosed();
  void close();
  InputStream getInputStream();
  OutputStream getOutputStream();
• The java.net.ServerSocket class:
  ServerSocket(int port);
  Socket accept();
  void close();
  …
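Putting these pieces together, a minimal line-echo server sketch (this is not one of the course demos below; the port number 9000 is an arbitrary choice):

import java.io.*;
import java.net.*;

public class EchoServer {
    public static void main(String[] args) throws IOException {
        try (ServerSocket server = new ServerSocket(9000)) {
            while (true) {
                // accept() blocks until a client connects.
                try (Socket client = server.accept();
                     BufferedReader in = new BufferedReader(
                             new InputStreamReader(client.getInputStream()));
                     PrintWriter out = new PrintWriter(client.getOutputStream(), true)) {
                    String line;
                    while ((line = in.readLine()) != null) {
                        out.println(line); // echo each line back to the client
                    }
                }
            }
        }
    }
}

A client can reach it with new Socket(InetAddress.getByName("localhost"), 9000) and read and write the corresponding streams.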
Simple sockets demos
• NetworkServer.java
• A basic chat system:
  – TransferThread.java
  – TextSocketClient.java
  – TextSocketServer.java
Higher levels of abstraction
• Application-level communication protocols
• Frameworks for simple distributed computation
  – Remote Procedure Call (RPC)
  – Java Remote Method Invocation (RMI)
• Common patterns of distributed system design
• Complex computational frameworks
  – e.g., distributed map-reduce
Today
• Java networking fundamentals
• Introduction to distributed systems
  – Motivation: reliability and scalability
  – Failure models
  – Techniques for:
    • Reliability (availability)
    • Scalability
    • Consistency
Aside: The robustness vs. redundancy curve
[Figure: robustness plotted against redundancy; the shape of the curve is the open question]
Metrics of success
• Reliability
  – Often in terms of availability: the fraction of time the system is working
    • 99.999% available is "5 nines of availability"
• Scalability
  – Ability to handle workload growth
A case study: Passive primary-backup replication
• Architecture before replication:
  – Problem: Database server might fail
[Diagram: two clients, each behind a front-end, sharing a single database server that holds {alice:90, bob:42, …}]
• Solution: Replicate data onto multiple servers
[Diagram: the same clients and front-ends, now backed by a primary database server plus two backups, each holding a copy of {alice:90, bob:42, …}]
Passive primary-backup replication protocol
1. Front-end issues request with unique ID to primary DB
2. Primary checks request ID
   – If it has already executed the request, it re-sends the response and exits the protocol
3. Primary executes the request and stores the response
4. If the request is an update, primary DB sends the updated state, ID, and response to all backups
   – Each backup sends an acknowledgement
5. After receiving all acknowledgements, primary DB sends the response to the front-end
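A sketch of the primary's side of steps 2-5 (the Backup interface, the request shape, and all names here are hypothetical, invented for illustration rather than taken from the lecture code):

import java.util.*;

/** Hypothetical backup stub: update() returns only after the backup acknowledges. */
interface Backup {
    void update(UUID requestId, String key, String value);
}

/** Single-threaded sketch of the primary's request handling. */
class Primary {
    private final Map<String, String> state = new HashMap<>();
    private final Map<UUID, String> executed = new HashMap<>(); // request ID -> response
    private final List<Backup> backups;

    Primary(List<Backup> backups) { this.backups = backups; }

    /** Handles one front-end request; value == null means a read. */
    String handle(UUID requestId, String key, String value) {
        // Step 2: if this request was already executed, re-send the stored response.
        if (executed.containsKey(requestId)) {
            return executed.get(requestId);
        }
        // Step 3: execute the request and store the response.
        String response;
        if (value == null) {
            response = state.get(key);
        } else {
            state.put(key, value);
            response = "OK";
            // Step 4: send the update to all backups; each update() call
            // returns only once that backup has acknowledged.
            for (Backup b : backups) {
                b.update(requestId, key, value);
            }
        }
        executed.put(requestId, response);
        // Step 5: all acknowledgements are in; reply to the front-end.
        return response;
    }
}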
Issues with passive primary-backup replication
• If the primary DB crashes, front-ends need to agree upon which unique backup is the new primary DB
  – Primary failure vs. network failure?
• If a backup DB becomes the new primary, surviving replicas must agree on the current DB state
• If a backup DB crashes, the primary must detect the failure to remove the backup from the cluster
  – Backup failure vs. network failure?
• If a replica fails* and recovers, it must detect that it previously failed
• Many subtle issues with partial failures
• …
More issues…
• Concurrency problems?
  – Out-of-order message delivery?
• Time…
• Performance problems?
  – 2n messages for n replicas
  – Failure of any replica can delay the response
  – Routine network problems can delay the response
• Scalability problems?
  – All replicas are written for each update
  – The primary DB responds to every request
Today
• Java networking fundamentals
• Introduction to distributed systems
  – Motivation: reliability and scalability
  – Failure models
  – Techniques for:
    • Reliability (availability)
    • Scalability
    • Consistency
Types of failure behaviors
• Fail-stop
• Other halting failures
• Communication failures
  – Send/receive omissions
  – Network partitions
  – Message corruption
• Data corruption
• Performance failures
  – High packet loss rate
  – Low throughput
  – High latency
• Byzantine failures
Common assumptions about failures
• Behavior of others is fail-stop (ugh)
• Network is reliable (ugh)
• Network is semi-reliable but asynchronous
• Network is lossy but messages are not corrupt
• Network failures are transitive
• Failures are independent
• Local data is not corrupt
• Failures are reliably detectable
• Failures are unreliably detectable
Some distributed system design goals
• The end-to-end principle
  – When possible, implement functionality at the end nodes (rather than the middle nodes) of a distributed system
• The robustness principle
  – Be strict in what you send, but be liberal in what you accept from others
    • Protocols
    • Failure behaviors
• Benefit from incremental changes
• Be redundant
  – Data replication
  – Checks for correctness
Replication for scalability: Client-side caching
• Architecture before replication:
  – Problem: Server throughput is too low
• Solution: Cache responses at (or near) the client
  – Cache can respond to repeated read requests
[Diagram: each client's front-end first consults a cache; only cache misses reach the database server holding {alice:90, bob:42, …}]
Replication for scalability: Client-side caching
• Hierarchical client-side caches:
[Diagram: caches arranged in a tree: each client has a nearby cache, which falls back to a shared higher-level cache before reaching the database server holding {alice:90, bob:42, …}]
Replication for scalability: Server-side caching
• Architecture before replication:
  – Problem: Database server throughput is too low
• Solution: Cache responses on multiple servers
  – Cache can respond to repeated read requests
[Diagram: multiple cache servers sit between the front-ends and the database server holding {alice:90, bob:42, …}; front-ends query a cache, and only misses reach the database]
Cache invalidation
• Time-based invalidation (a.k.a. expiration)
  – Read-any, write-one
  – Old cache entries are automatically discarded
  – No expiration date needed for read-only data
• Update-based invalidation
  – Read-any, write-all
  – DB server broadcasts an invalidation message to all caches when the DB is updated
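A sketch of time-based invalidation as a small in-process wrapper (the TTL policy and all names are illustrative, not from the lecture code):

import java.util.*;

/** Cache whose entries expire a fixed time after insertion. */
class ExpiringCache<K, V> {
    private static class Entry<V> {
        final V value; final long expiresAt;
        Entry(V value, long expiresAt) { this.value = value; this.expiresAt = expiresAt; }
    }
    private final Map<K, Entry<V>> map = new HashMap<>();
    private final long ttlMillis;

    ExpiringCache(long ttlMillis) { this.ttlMillis = ttlMillis; }

    /** Returns the cached value, or null if absent or expired. */
    V get(K key) {
        Entry<V> e = map.get(key);
        if (e == null) return null;
        if (System.currentTimeMillis() >= e.expiresAt) { // entry too old: discard it
            map.remove(key);
            return null;
        }
        return e.value;
    }

    void put(K key, V value) {
        map.put(key, new Entry<>(value, System.currentTimeMillis() + ttlMillis));
    }
}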
Cache replacement policies
• Problem: caches have finite size
• Common* replacement policies:
  – Optimal (Belady's) policy
    • Discard the item not needed for the longest time in the future
  – Least Recently Used (LRU)
    • Track the time of previous access; discard the item accessed least recently
  – Least Frequently Used (LFU)
    • Count the # of times each item is accessed; discard the item accessed least frequently
  – Random
    • Discard a random item from the cache
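LRU in particular is nearly free in Java, since LinkedHashMap supports access-order iteration; a minimal sketch:

import java.util.*;

/** An LRU cache built on LinkedHashMap's access-order mode. */
class LruCache<K, V> extends LinkedHashMap<K, V> {
    private final int capacity;

    LruCache(int capacity) {
        super(capacity, 0.75f, true); // true = iterate in access order, not insertion order
        this.capacity = capacity;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        return size() > capacity; // evict the least recently used entry when over capacity
    }
}

A three-entry LruCache, for example, silently evicts its least recently accessed key on the fourth insertion.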
Partitioning for scalability
• Partition data based on some property, and put each partition on a different server
[Diagram: three partitions on three servers: CMU server {cohen:9, bob:42, …}, Yale server {alice:90, pete:12, …}, MIT server {deb:16, reif:40, …}; clients' front-ends route each request to the right server]
Horizontal partitioning
• a.k.a. "sharding"
• A table of data:

  username | school | value
  ---------+--------+------
  cohen    | CMU    |   9
  bob      | CMU    |  42
  alice    | Yale   |  90
  pete     | Yale   |  12
  deb      | MIT    |  16
  reif     | MIT    |  40
Recall: Basic hash tables
• For an n-bucket hash table, put each item X in the bucket: X.hashCode() % n
[Diagram: buckets 0-12 of a 13-bucket table; {reif:40}, {bob:42}, {pete:12}, {deb:16}, {alice:90}, and {cohen:9} are scattered across the buckets, with colliding items chained within a bucket]
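One Java-specific detail: hashCode() may be negative, so a sketch of the bucket computation typically uses Math.floorMod rather than the raw % operator:

/** Bucket index for an n-bucket table; floorMod keeps the result in [0, n). */
static int bucketFor(Object x, int n) {
    return Math.floorMod(x.hashCode(), n);
}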
Partitioning with a distributed hash table
• Each server stores the data for one bucket
• To store or retrieve an item, the front-end server hashes the key, then contacts the server storing that bucket
[Diagram: front-ends route requests to Server 0, Server 1, Server 3, and Server 5, each holding one bucket, such as {reif:40}, {bob:42}, {pete:12, alice:90}, and { }]
Consistent hashing
• Goal: Benefit from incremental changes
  – Resizing the hash table (i.e., adding or removing a server) should not require moving many objects
• E.g., interpret the range of hash codes as a ring
  – Each bucket stores data for a range of the ring
    • Assign each bucket an ID in the range of hash codes
    • To store item X, don't compute X.hashCode() % n. Instead, place X in the bucket with the same ID as, or the next higher ID than, X.hashCode()
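A sketch of that ring using a TreeMap (the class and its names are invented for illustration, and it assumes at least one server has been added; real systems also typically hash each server to several points on the ring):

import java.util.*;

/** Consistent-hashing ring mapping keys to server names. */
class ConsistentHashRing {
    // Bucket ID (a point on the ring of hash codes) -> server name.
    private final NavigableMap<Integer, String> ring = new TreeMap<>();

    void addServer(int bucketId, String server) { ring.put(bucketId, server); }
    void removeServer(int bucketId) { ring.remove(bucketId); }

    /** The server whose ID is equal to, or next higher than, the key's hash. */
    String serverFor(Object key) {
        Map.Entry<Integer, String> e = ring.ceilingEntry(key.hashCode());
        // Wrap around the ring if the hash is above the highest bucket ID.
        return (e != null) ? e.getValue() : ring.firstEntry().getValue();
    }
}

Removing one server now reassigns only that server's range to its successor on the ring; all other keys stay put.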
Problems with hash-based partitioning
• Front-ends need to determine the server for each bucket
  – Each front-end stores a look-up table?
  – A master server stores the look-up table?
  – Routing-based approaches?
• Places related content on different servers
  – Consider range queries:
    SELECT * FROM users WHERE lastname STARTSWITH 'G'
Master/tablet-based systems
• Dynamically allocate range-based partitions
  – Master server maintains the tablet-to-server assignments
  – Tablet servers store the actual data
  – Front-ends cache the tablet-to-server assignments
[Diagram: the master holds the assignment {a-c:[2], d-g:[3,4], h-j:[3], k-z:[1]}; tablet server 1 stores k-z: {pete:12, reif:42}, tablet server 2 stores a-c: {alice:90, bob:42, cohen:9}, tablet server 3 stores d-g: {deb:16} and h-j: { }, and tablet server 4 stores a second copy of d-g: {deb:16}]
Coming next…
• More distributed systems
  – MapReduce