for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite...

18
@vkulichenko Apache Ignite 101: Key Deployment Strategies for Database Acceleration Valentin Kulichenko Aug 5, 2020

Transcript of for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite...

Page 1: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

@vkulichenko

Apache Ignite 101:Key Deployment Strategiesfor Database Acceleration

Valentin KulichenkoAug 5, 2020

Page 2: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Memory is Much … Much Faster Than Disk

System Event Actual Latency Computer Latency at a Human Scale

One CPU cycle 0.4 ns 1 sLevel 1 cache access 0.9 ns 2 sLevel 2 cache access 2.8 ns 7 sLevel 3 cache access 28 ns 1 minMain memory access (DDR DIMM) ~100 ns 4 minIntel Optane DC persistent memory access ~350 ns 15 minIntel Optane DC SSD I/O < 10 µs 7 hrsNVMe SSD I/O ~25 µs 17 hrsSSD I/O 50-150 µs 1.5 - 4 daysRotational disk I/O 1 – 10ms 1 – 9 monthsInternet: SF to NY 65 ms 5 years

Page 3: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Dirty Secret of In-Memory Systems

Page 4: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Apache Ignite In-Memory Computing Platform

Mainframe NoSQL HadoopDistributed Ignite Persistence

Disk Tier

RDBMS

Machine and Deep Learning

EventsStreamingMessagingTransaction

sSQLKey-Value

Service GridCompute Grid

Application Layer

Web SaaS SocialMobile IoT

Rol

ling

Upg

rade

s

Sec

urity

& A

uditi

ng

Mon

itorin

g &

Man

agem

ent

Seg

men

tatio

n P

rote

ctio

n

Dat

a C

ente

r Rep

licat

ion

Net

wor

k B

acku

ps

Full,

Incr

emen

tal,

Con

tinuo

us B

acku

ps

Poi

nt-in

-Tim

e R

ecov

ery

Het

erog

eneo

us R

ecov

ery

Distributed In-Memory Tier

GridGain Enterprise FeaturesApache Ignite Features

Page 5: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Mode Description Major Advantage

In-Memory 100% data in the In-Memory Store (only) Maximum performance possible(data is never written to disk)

In-Memory + 3rd Party DB

Data in the In-Memory Data Store as a caching layer (aka. in-memory data grid)

3rd Party DB (RDBMS, NoSQL, etc) used for persistence

Horizontal scalabilityFaster reads and writes

In-Memory + Persistent Store

The whole data set is stored both in memory and on disk Survives cluster failures

100% on Disk + In-Memory Cache 100% of data is in GridGain Persistent Store anda subset is in memory

Unlimited data scale beyond RAM capacity

Multi-Tier Architecture Advantages

Page 6: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Ignite Memory Tier

Off-Heap Memory

Java Heap

Off-Heap Memory

Java Heap

Off-Heap Memory

Java Heap

Permanent Storage for Data & Indexes

Interim Objects(records requested by

apps)

Page 7: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Ignite Native Persistence

• Distributed Persistence Tier– Fully transactional and consistent– No need to cache 100% of data in RAM– No need to warm-up RAM on restarts

• Performance vs. Cost Tradeoff– Cache more for fastest performance– Cache less to reduce infrastructure costs

Page 8: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Apache Ignite as a Cache

Page 9: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Apache Ignite as a Data Grid

Page 10: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Apache Ignite as a Database

Page 11: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Change Data Capture

ChangeDataCapture

Page 12: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Change Data Capture Options

• Database triggers• Oracle GoldenGate• Streaming (e.g. Debezium+Kafka)

Page 13: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

CDC with Debezium and Kafka

Blog and Demo by Evgenii Zhuravlev:

https://www.gridgain.com/resources/blog/change-data-capture-between-mysql-and-gridgain-debezium

Page 14: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Hadoop Acceleration

Page 15: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Digital Integration Hub

Page 16: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Stay connected with Apache Ignite users & experts

meetup.com/Apache-Ignite-Virtual-Meetup/

Page 17: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Join Apache Ignite Community

Discuss https://ignite.apache.org/community/resources.html#mail-lists

Check demos https://github.com/GridGain-Demos/

Connect https://twitter.com/apacheignite

Join events https://ignite.apache.org/events.html

Contribute https://ignite.apache.org/community/contribute.html

Page 18: for Database Acceleration Key Deployment Strategies Apache ... · Mainframe Distributed Ignite Persistence NoSQL Hadoop Disk Tier RDBMS Machine and Deep Learning Messaging Streaming

2020 © GridGain Systems @vkulichenko

Questions?