CRDTs and their uses. SE 2016

CRDT Data Structures

And their real-world uses

Distributed counter (motivating example)

Problem

[Figure: four copies of the counter, each showing 21]

The problem

Several active servers

Semi-online systems (navigators, sales support software)

Distributed systems (replication in general)

Consistency

“... all nodes see the same data at the same time ...”

Consistency has costs

Consistency

Eventual consistency

“... eventually all accesses to that item will return the last updated value ...”

Strong eventual consistency

“... any two nodes that have received the same (unordered) set of updates will be in the same state ...”

CRDT

Conflict-free Replicated Data Type

CvRDT (Convergent) aka 'state-based objects'

CmRDT (Commutative) aka 'ops-based objects'

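First CRDT: G-Counter

[Figure: replicas a and b both start at {a:4,b:5}. Replica a increments to {a:5,b:5} and replica b increments to {a:4,b:6}. After merging, both hold {a:5,b:6}.]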

The value of the counter is the sum of all parts
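
A minimal Python sketch of a state-based G-Counter, replaying the states shown on the slide. The class name `GCounter` and its method names are illustrative assumptions, not taken from any particular library.

```python
from typing import Dict, Optional


class GCounter:
    """State-based grow-only counter: one slot per replica (illustrative sketch)."""

    def __init__(self, replica_id: str, state: Optional[Dict[str, int]] = None) -> None:
        self.replica_id = replica_id
        self.state: Dict[str, int] = dict(state or {})

    def increment(self, amount: int = 1) -> None:
        # A replica only ever touches its own slot, and only upwards.
        if amount < 0:
            raise ValueError("a G-Counter can only grow")
        self.state[self.replica_id] = self.state.get(self.replica_id, 0) + amount

    def value(self) -> int:
        # The counter value is the sum of all per-replica parts.
        return sum(self.state.values())

    def merge(self, other: "GCounter") -> None:
        # Element-wise maximum of the two states.
        for rid, count in other.state.items():
            self.state[rid] = max(self.state.get(rid, 0), count)


a = GCounter("a", {"a": 4, "b": 5})
b = GCounter("b", {"a": 4, "b": 5})
a.increment()  # a now holds {a:5, b:5}
b.increment()  # b now holds {a:4, b:6}
a.merge(b)
b.merge(a)
print(a.state, a.value())  # {'a': 5, 'b': 6} 11
```

Both replicas end in the same state regardless of which one merges first.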

CRDT counter: alternative

[Figure: two replicas, each showing 20]

G-Counter extension: PN-Counter

What if a decrement operation is also required?

Use two G-Counters: one for increments, another for decrements
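
A sketch of this idea in Python, assuming each grow-only part is kept as a plain per-replica dictionary. The names `PNCounter`, `p` and `n` are illustrative.

```python
from typing import Dict


class PNCounter:
    """Two grow-only maps: p counts increments, n counts decrements (sketch)."""

    def __init__(self, replica_id: str) -> None:
        self.replica_id = replica_id
        self.p: Dict[str, int] = {}
        self.n: Dict[str, int] = {}

    def increment(self, amount: int = 1) -> None:
        self.p[self.replica_id] = self.p.get(self.replica_id, 0) + amount

    def decrement(self, amount: int = 1) -> None:
        # A decrement is recorded as growth of the 'n' part.
        self.n[self.replica_id] = self.n.get(self.replica_id, 0) + amount

    def value(self) -> int:
        return sum(self.p.values()) - sum(self.n.values())

    def merge(self, other: "PNCounter") -> None:
        # Merge each part independently with element-wise max.
        for mine, theirs in ((self.p, other.p), (self.n, other.n)):
            for rid, count in theirs.items():
                mine[rid] = max(mine.get(rid, 0), count)


x, y = PNCounter("x"), PNCounter("y")
x.increment(3)
y.decrement(1)
x.merge(y)
y.merge(x)
print(x.value(), y.value())  # 2 2
```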

State based vs ops based

State based:

Merge (⨂) should be associative, commutative and idempotent

Associative: (A ⨂ B) ⨂ C = A ⨂ (B ⨂ C), e.g. max(max(1, 2), 3) = max(1, max(2, 3))

Commutative: A ⨂ B = B ⨂ A, e.g. max(1, 2) = max(2, 1)

Idempotent: A ⨂ B = A ⨂ B ⨂ B, e.g. max(1, 2) = max(max(1, 2), 2)
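
The three laws can be spot-checked mechanically. The sketch below is an assumption-laden illustration: `max_merge` is an element-wise max over per-replica dictionaries, `sum_merge` is a deliberately wrong alternative, and `laws_hold` is a hypothetical helper that tests only the given samples (it is not a proof).

```python
from itertools import product
from typing import Callable, Dict, List

State = Dict[str, int]


def max_merge(x: State, y: State) -> State:
    # Element-wise maximum: the merge used by counters and vector clocks.
    return {k: max(x.get(k, 0), y.get(k, 0)) for k in x.keys() | y.keys()}


def sum_merge(x: State, y: State) -> State:
    # Element-wise sum: associative and commutative, but NOT idempotent.
    return {k: x.get(k, 0) + y.get(k, 0) for k in x.keys() | y.keys()}


def laws_hold(merge: Callable[[State, State], State], states: List[State]) -> bool:
    for a, b, c in product(states, repeat=3):
        if merge(merge(a, b), c) != merge(a, merge(b, c)):  # associative
            return False
        if merge(a, b) != merge(b, a):                      # commutative
            return False
        if merge(a, b) != merge(merge(a, b), b):            # idempotent
            return False
    return True


samples: List[State] = [{"a": 1}, {"a": 2, "b": 1}, {"b": 3}, {}]
print(laws_hold(max_merge, samples))  # True
print(laws_hold(sum_merge, samples))  # False
```

With a sum-based merge, receiving the same state twice would count it twice, which is why idempotence matters when states may be re-sent.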

State based vs ops based

Ops based:

Replication channels should have exactly-once semantics and support causal ordering

[Figure: a replica holding 21 receives the operation +1]

Concurrent operations should commute
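
A rough sketch of an ops-based counter in Python. It assumes a hypothetical wire format where each operation carries a unique id, and it emulates exactly-once delivery by dropping ids that were already applied. Causal ordering is not modelled here: additions commute, so any delivery order gives the same value for this particular type.

```python
from typing import List, Set, Tuple

Op = Tuple[str, int]  # (unique op id, delta): a hypothetical message format


class OpCounter:
    """Ops-based counter: replicas exchange operations, not states (sketch)."""

    def __init__(self) -> None:
        self.value = 0
        self.seen: Set[str] = set()

    def apply(self, op: Op) -> None:
        op_id, delta = op
        if op_id in self.seen:
            # Redelivered operation: ignore it to emulate exactly-once delivery.
            return
        self.seen.add(op_id)
        self.value += delta


ops: List[Op] = [("a-1", +1), ("b-1", +5), ("a-2", -2)]

r1, r2 = OpCounter(), OpCounter()
for op in ops:
    r1.apply(op)
# r2 receives the same operations in another order, with one duplicate.
for op in reversed(ops + [ops[0]]):
    r2.apply(op)

print(r1.value, r2.value)  # 4 4
```

Without the `seen` check the duplicate would be applied twice, and the replicas would diverge.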

Example: Cassandra

Cassandra’s distributed counters are implemented as PN-counters (almost)

Cassandra’s general replication is implemented as another CRDT (LWW-Register)

[Figure: merging (data1,12) with (data2,14) leaves (data2,14) on every replica, because 14 is the later timestamp]
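
A minimal Python sketch of an LWW-Register (last write wins), using the (value, timestamp) pairs from the slide. This is not Cassandra's implementation. The class name and the tie-breaking rule on equal timestamps are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class LWWRegister:
    value: Any
    timestamp: int

    def merge(self, other: "LWWRegister") -> "LWWRegister":
        # Keep the write with the larger timestamp. On equal timestamps,
        # fall back to comparing repr(value) so that the result does not
        # depend on argument order (an assumed tie-break rule).
        return max(self, other, key=lambda r: (r.timestamp, repr(r.value)))


old = LWWRegister("data1", 12)
new = LWWRegister("data2", 14)
print(old.merge(new))  # LWWRegister(value='data2', timestamp=14)
print(new.merge(old))  # LWWRegister(value='data2', timestamp=14)
```

The cost of this simplicity is that the write with the older timestamp is silently discarded, which is why clock quality matters for LWW.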

MV-Register (multi value)

Vector clock

Very similar to a G-Counter: {a:100, b:98, c:101}

Each operation on a server increments that server's entry

Merge takes the maximum of corresponding elements

One event happened BEFORE another if its vector clock is less than the other's, that is, no entry is greater and at least one entry is smaller:

{a: 10, b: 11, c: 12} < {a:10, b:12, c:12}
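
The comparison and merge rules can be sketched in Python as follows. The function names are illustrative, and missing entries are assumed to count as zero.

```python
from typing import Dict

Clock = Dict[str, int]


def tick(clock: Clock, node: str) -> Clock:
    # A local operation on `node` increments that node's entry.
    updated = dict(clock)
    updated[node] = updated.get(node, 0) + 1
    return updated


def merge(x: Clock, y: Clock) -> Clock:
    # Element-wise maximum of corresponding entries.
    return {k: max(x.get(k, 0), y.get(k, 0)) for k in x.keys() | y.keys()}


def happened_before(x: Clock, y: Clock) -> bool:
    # x < y: no entry of x is greater, and at least one is strictly smaller.
    keys = x.keys() | y.keys()
    no_greater = all(x.get(k, 0) <= y.get(k, 0) for k in keys)
    some_smaller = any(x.get(k, 0) < y.get(k, 0) for k in keys)
    return no_greater and some_smaller


def concurrent(x: Clock, y: Clock) -> bool:
    # Neither clock is ahead of the other, and they are not equal.
    keys = x.keys() | y.keys()
    equal = all(x.get(k, 0) == y.get(k, 0) for k in keys)
    return not equal and not happened_before(x, y) and not happened_before(y, x)


base = {"a": 10, "b": 11, "c": 12}
on_b = tick(base, "b")  # {a:10, b:12, c:12}
on_a = tick(base, "a")  # {a:11, b:11, c:12}

print(happened_before(base, on_b))  # True
print(concurrent(on_a, on_b))       # True
print(merge(on_a, on_b) == {"a": 11, "b": 12, "c": 12})  # True
```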

MV-Register

The value is stored and replicated together with its vector clock

On merge, if one vector clock is greater than the other, keep the value with the greater vector clock

Otherwise keep both and let the client choose which one to keep
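
A sketch of that merge rule in Python, assuming a register is represented as a list of (value, vector clock) versions. The helper names and the sample values are hypothetical.

```python
from typing import Any, Dict, List, Tuple

Clock = Dict[str, int]
Version = Tuple[Any, Clock]


def dominates(x: Clock, y: Clock) -> bool:
    """True if x >= y in every entry, i.e. x has seen everything y has."""
    return all(x.get(k, 0) >= v for k, v in y.items())


def merge(left: List[Version], right: List[Version]) -> List[Version]:
    candidates = left + right
    kept: List[Version] = []
    for value, clock in candidates:
        # A version is superseded if another clock is strictly greater.
        superseded = any(
            dominates(c, clock) and not dominates(clock, c) for _, c in candidates
        )
        if not superseded and (value, clock) not in kept:
            kept.append((value, clock))
    return kept


older = [("home", {"a": 1})]
newer = [("work", {"a": 2})]
sibling = [("cafe", {"a": 1, "b": 1})]

# One clock is greater: only the newer value survives.
print(merge(older, newer))    # [('work', {'a': 2})]
# Concurrent clocks: both values are kept for the client to resolve.
print(merge(newer, sibling))  # [('work', {'a': 2}), ('cafe', {'a': 1, 'b': 1})]
```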

Example: Riak

It uses MV-Register for general replication if multi-value mode is enabled

It uses LWW-Register otherwise

PN-counter is used to implement distributed counters

Other CRDTs are also used (Sets, flags)

G-Set (Grow only)

Merge operation is set union, which is idempotent, commutative and associative

Only add operation is supported
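
A very small Python sketch of a G-Set. The class name and the sample elements are illustrative.

```python
from typing import Hashable, Iterable, Set


class GSet:
    """Grow-only set: elements can be added but never removed (sketch)."""

    def __init__(self, items: Iterable[Hashable] = ()) -> None:
        self.items: Set[Hashable] = set(items)

    def add(self, item: Hashable) -> None:
        self.items.add(item)

    def merge(self, other: "GSet") -> None:
        # Set union is idempotent, commutative and associative.
        self.items |= other.items


s1, s2 = GSet({"x"}), GSet({"y"})
s1.add("z")
s1.merge(s2)
s2.merge(s1)
s2.merge(s1)  # merging the same state again changes nothing
print(s1.items == s2.items == {"x", "y", "z"})  # True
```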

OR-Set (observed remove)

The idea is to use two sets, one for added and one for removed items, but...

Store each element with a unique id, assigned when the element is added

‘Added’ set: (a, id1), (a, id2), (b, id3), (c, id4)

‘Removed’ set: (b, id3), (a, id2)

An element can be removed only if it is in the ‘added’ set with the same id
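
A sketch of an OR-Set in Python along these lines. It assumes ids are random UUIDs and that a remove marks every (element, id) pair the local replica has observed. The class and method names are illustrative.

```python
import uuid
from typing import Hashable, Set, Tuple

Tagged = Tuple[Hashable, str]  # (element, unique id)


class ORSet:
    """Observed-remove set built from an 'added' and a 'removed' set (sketch)."""

    def __init__(self) -> None:
        self.added: Set[Tagged] = set()
        self.removed: Set[Tagged] = set()

    def add(self, element: Hashable) -> None:
        # Every add gets a fresh id, so re-adding creates a new pair.
        self.added.add((element, uuid.uuid4().hex))

    def remove(self, element: Hashable) -> None:
        # Only pairs already present in 'added' on this replica are removed.
        self.removed |= {pair for pair in self.added if pair[0] == element}

    def elements(self) -> Set[Hashable]:
        return {element for element, _ in self.added - self.removed}

    def merge(self, other: "ORSet") -> None:
        self.added |= other.added
        self.removed |= other.removed


r1, r2 = ORSet(), ORSet()
r1.add("a")
r2.merge(r1)
r2.remove("a")  # r2 removes the 'a' it has observed
r1.add("a")     # concurrently, r1 adds 'a' again under a new id
r1.merge(r2)
r2.merge(r1)
print(r1.elements(), r2.elements())  # {'a'} {'a'}
```

The concurrent add survives because its id was never observed by the replica that performed the remove.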

Real life example: TomTom (navigators)

Challenges:

Account data (like favorites) can be modified from different devices concurrently

A device can be offline while new information is being added

Huge number of accounts

Real life example: TomTom (navigators)

CRDT to the rescue!

They use a combination of CRDTs: MV-Register + modified OR-Set (OUR-Set)

*Original image from TomTom’s presentation

Real life example: Swarm.js

Swarm is a reactive data sync library and middleware

Swarm uses CRDTs to merge data

Real life example: Spark’s accumulator

Docblock of Accumulable class:

A data type that can be accumulated, ie has an commutative and associative "add" operation

You must define how to add data, and how to merge two of these together

It looks like a CRDT!
