Managing 50K+ Redis Databases Over 4 Public Clouds ... with a Tiny Devops Team

powering lightning fast apps

The newest NoSQL

The fastest data store available today (served entirely

from RAM)

Among the top 3 databases chosen by developers

Much more than a simple key/value - Strings, Hashes,

Lists, Sets, Sorted Set, LUA, transactions, Bits

operations

Strong use cases, dynamic community, large eco-

system

Leading the commercial Redis market

Founded in 2011; GA in 02/2013

2,400+ paying customers; 52,000+ DBs; 100+

new DBs/day

2nd largest contributor to open source Redis

Raised $13M - Bain/Carmel/Strategic/Angels

Offices in Santa Clara and Tel-Aviv

Redis Labs

Redis Cloud Memcached Cloud

Our offering

Fully-managed cloud services.

On-prem server license - soon.

100msec =

Fast apps requirements

max E2E response time, under any load

50msec = average Internet latency

50msec = required app response time (includes processing & multi DB accesses)

1msec = required DB response time

The only database to meet requirement

DB performance comparison@<1mse

@<1msec

@<20msec

@<10-50msec

@<100msec

@>100msec

Why is Redis efficient ?

Many data-structures

Many cool commands (atomicity

maintained)

Complexity aware

Real world use case:

•500+GB

•400K writes/sec

•1500 reads/sec

•37.5KB average object size

Efficiency

No extra work at app level

1.5Gbps 120Gbps

Tones of work at

app level

6 Nodes cluster

150+ Nodes cluster

Timeline

Followers

Caching

Messaging

Geo search

Leaderboards

Job management

RT analytics

Verticals & main use cases

Online advertisin

Social Gaming

Financial Services

• Multi-TB in memory

• ~ 300,000 reads/sec

• ~ 5,000*N writes/sec

N - # of followers

Twitter

Every Timeline

(800 tweets per user)

is on Redis

• 20TB+ in memory

• ~ 6,000,000 reads/sec

• ~ 600,000 writes/sec

Weibo (Chinese Twitter)

• Counting

• Reverse cache

• Top 10 lists

• Last Index

• Relational list/Message Queue

• Fast transactions w/ LUA

Object graph:

• Per user (Sorted Set w/ timestamp as

score)

store the users followed (explicit+

implicit)

store the user’s followers

(explicit+implicit)

• Per board

Redis Hash for storing explicit followers

Redis Set for storing explicit unfollowers

Stack Overflow

Three levels of cache:

• Local cache (no persistence)

sessions, and pending view count

updates

• Site cache

hot question id lists, users acceptance

rates..

• Global cache

Inboxes, API usage quotas, …

Github

• Redis is used for routing info

• Matching user repositories to server

Hipchat

• Which users are in which room

• Who is online

• XMPP server balancing

Youporn

Most data is found in Hashes with ordered Sets used to

know what data to show

(1) ZinterStore on:

{videos:filters:release}{videos:filters:orientation:straig

{videos:filters:categories(id)}{videos:ordering:rating}

(2) Perform a ZRANGE to get the pages we want and get

the list of video_ids back

(3) Start pipelining to get all the videos from Hashes

Snapchat

• 500+ instances

• 15-50TB

• Running on GCE

400M messages/day

Why Redis Labs ?

Infinite seamless scalability

True high-availability

Stable top performance

Zero management

Users choose us because..

Dynamic Clustering Technology

Zero-latency proxy

Cluster

manager

In-Memory Node

Cross-shard processor

In-Memory Cluster

Challenge #1

How to serve users from the same data-center ?

4 clouds /10 regions

18 data-centers / 30 clusters

AWS zones mapping dilemma

Redis Labs Userus-east-1a us-east-1c

us-east-1b

us-east-1c us-east-1e

us-east-1d us-east-1a

us-east-1e us-east-1b

Eric Hammond’s post on: Matching EC2 Availability

Zones Across AWS Accounts

How did we solve it

Redis Labs

Challenge #2

Which instance type shall we use for our cluster?

Various instance types in the same cluster• High load scenarios • High memory usage scenarios • New generation of instances

Dedicated instances

As cheap as possible

Cluster’s node requirements

Adrian Cockcroft's Blog - Understanding and using Amazon EBS - Elastic Block Store

• use large instances and get dedicated instances for free

The tip

What we use today

C3 & R3 A4/5/6/7n1-standardn1-highmemn1-highcpu

Challenge #3

How to mange data-persistence with high volumes

of ‘writes’ and slow cloud storage ?

Ephemeral vs. Persistence storage

Ephemeral

EBS/Cloud Drive/Persistent

Disk/SAN

Network attachedPersistent

Direct attachedEphemeral

“Fast”

Adrian’ s Blog use the larger EBSes if you want speed

Google (GCP) “Larger volumes can achieve higher I/O levels than smaller volumes”

The tips

We use large volumes (1TB+)

We use both ephemeral and persistent storage

We improved/tuned/optimized the Redis persistent storage interface

If replication is enabled, slave writes to disk

We don’t use PIOPS

What we do

Why not PIOPS

Challenge #4

How to monitor 50K+ databases, 30+ clusters and

hundreds of nodes ?

Zabbix (not Nagios) - per node metrics

Limbic (home made) - databases’ metrics• 50K (databases) x 100+(metrics) x 10K+(time

resolutions)

• Based on Python, RRD, Redis

Redis adminUI – cluster configuration

Monitoring

Team/Method/Spirit

Team /Method/Spirit

Tiny devops team

Core dev. team knows ops (very well)

Baby steps, especially in production

The practical approach always wins

Review your plans every 3 months

We are hiring !

Thank You

Why is Redis efficient ?

Many data-structures

Many cool commands (atomicity

maintained)

Complexity aware

Think data-structure • Strings

• Hashes

• Lists

• Sets Sorted Sets

• HyperLogLogs

Cool commands• SET if it doesn’t exist – O(1)

• Blocking POP (with timeout) – O(1)

• (blocking) POP from one list, PUSH to another – O(1)

• Get/Set string ranges (and bit operation) – O(N)

• Union/Intersect/Ranges of SETs – O(N)+O(Mxlog(M))

• Pub/Sub – O(1)/O(M)/O(M+N)

• LUA / Transactions / Pipelining

Managing 50K+ Redis Databases Over 4 Public Clouds ... with a Tiny Devops Team

Technology

Transcript of Managing 50K+ Redis Databases Over 4 Public Clouds ... with a Tiny Devops Team

RedisConf17 - Redis Enterprise: Continuous Availability, Unlimited Scaling, Secure and Cost-Effective Redis

Redis Presentation

Redis cluster

Redis for search - Dvir Volk, Redis Labs

What's new with enterprise Redis - Leena Joshi, Redis Labs

Taller Redis

Redis Labcamp

Redis Overview

50K Index Back

Redis for Security Data : SecurityScorecard JVM Redis Usage

Mini-Training: Redis

RedisConf17 - Building Large High Performance Redis Databases with Redis Enterprise

Multiple NoSQL Use Cases with Redis Modules · Redis Modules C extensions to Redis Work at native speed Add commands Can implement their own native data types Can extend redis or

redis überall

Amazon · Amazon ElastiCache for Redis ElastiCache for Redis User Guide Table of Contents What Is ElastiCache for Redis

Redis begins

Intel® Memory Drive Technology Redis Performance ... · Redis servers on the system and Redis clients on the client machines. • The Redis server is started with multiple instances

Redis - Baidu

Redis - BlagoevgradConf 2013

Redis Persistence