Fluentd: Unified Logging Layer at CWT2014

Masahiro Nakagawa Nov 6, 2014 Cloudera World Tokyo Fluentd Unified Logging Layer

Fluentd presentation at Cloudera World Tokyo

Page 1: Fluentd: Unified Logging Layer at CWT2014

Masahiro Nakagawa

Nov 6, 2014

Cloudera World Tokyo

Fluentd: Unified Logging Layer

Page 2: Fluentd: Unified Logging Layer at CWT2014

Who are you?

> Masahiro Nakagawa
> github/twitter: @repeatedly

> Treasure Data, Inc.
> Senior Software Engineer
> Fluentd / td-agent developer

> I love OSS :)
> D language - Phobos committer
> Fluentd - Main maintainer
> MessagePack / RPC - D and Python (only RPC)
> The organizer of Presto Source Code Reading
> etc…

Page 3: Fluentd: Unified Logging Layer at CWT2014

Structured logging !

Reliable forwarding !

Pluggable architecture

http://fluentd.org/

Page 4: Fluentd: Unified Logging Layer at CWT2014

Collect Store Process Visualize

Data source

Reporting

Monitoring

Data Processing

Page 5: Fluentd: Unified Logging Layer at CWT2014

Related Products

Store Process

Cloudera

Hortonworks

Treasure Data

Collect Visualize

Tableau

Excel

R

easier & shorter time

???

Page 6: Fluentd: Unified Logging Layer at CWT2014

What’s Fluentd?

• Open sourced log collector written in Ruby

• Customization is essential: small core + many plugins


It’s like syslogd, but uses JSON for log messages
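As a minimal sketch of what "JSON for log messages" means in practice, the snippet below represents one log entry as a structured event. Fluentd events carry a tag, a time, and a record; the tag `apache.access`, the field names, and the `make_event` helper are hypothetical and chosen only for illustration.

```python
import json
import time


def make_event(tag, record, timestamp=None):
    """Bundle a tag, a Unix timestamp, and a key-value record into one event."""
    if timestamp is None:
        timestamp = int(time.time())
    return {"tag": tag, "time": timestamp, "record": record}


# Hypothetical access-log entry: every field is addressable by name,
# so downstream tools do not need to re-parse a free-form text line.
event = make_event(
    "apache.access",
    {"host": "127.0.0.1", "method": "GET", "path": "/index.html", "code": 200},
    timestamp=1415232000,  # arbitrary fixed timestamp for a reproducible example
)

line = json.dumps(event, sort_keys=True)
print(line)
```

Because the record is a set of named fields rather than a text line, a consumer can filter on `code` or `path` without writing a parser for each source.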

Page 7: Fluentd: Unified Logging Layer at CWT2014

Divide & Conquer & Retry

(Diagram labels: error, retry)

Page 8: Fluentd: Unified Logging Layer at CWT2014

Core / Plugins

Core

• Divide & Conquer

• Buffering & Retrying

• Error handling

• Message routing

• Parallelism

Plugins

• read / receive data

• write / send data
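The buffering and retrying responsibility listed above can be sketched as follows. This is an illustrative model, not Fluentd's actual implementation: the `BufferedOutput` class, its parameters, and the exponential backoff schedule are assumptions made for this example.

```python
import time
from collections import deque


class BufferedOutput:
    """Queue records in chunks and retry failed writes with exponential backoff."""

    def __init__(self, write, chunk_size=3, max_retries=4,
                 base_wait=0.01, sleep=time.sleep):
        self.write = write            # callable that sends one chunk; may raise IOError
        self.chunk_size = chunk_size
        self.max_retries = max_retries
        self.base_wait = base_wait
        self.sleep = sleep            # injectable so tests need not really wait
        self.current = []             # chunk still being filled
        self.queue = deque()          # full chunks waiting to be flushed

    def emit(self, record):
        """Add a record; move the chunk to the queue once it is full."""
        self.current.append(record)
        if len(self.current) >= self.chunk_size:
            self.queue.append(self.current)
            self.current = []

    def flush(self):
        """Try to send every queued chunk; return the chunks that were given up on."""
        failed = []
        while self.queue:
            chunk = self.queue.popleft()
            for attempt in range(self.max_retries + 1):
                try:
                    self.write(chunk)
                    break
                except IOError:
                    if attempt == self.max_retries:
                        failed.append(chunk)
                    else:
                        # Wait base_wait, 2*base_wait, 4*base_wait, ... between tries
                        self.sleep(self.base_wait * 2 ** attempt)
        return failed


# Usage: a destination that fails twice before accepting data.
attempts = {"n": 0}
delivered = []


def flaky_write(chunk):
    attempts["n"] += 1
    if attempts["n"] <= 2:
        raise IOError("destination unavailable")
    delivered.extend(chunk)


out = BufferedOutput(flaky_write, chunk_size=2, sleep=lambda seconds: None)
for i in range(4):
    out.emit({"n": i})
failed = out.flush()
```

Splitting the stream into small chunks means a failure only delays the chunk being sent, and each chunk can be retried on its own.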


Page 9: Fluentd: Unified Logging Layer at CWT2014

Pluggable Architecture

Input (pluggable)

> Forward > HTTP > File tail > dstat > ...

Engine

Buffer (pluggable)

> File > Memory

Output (pluggable)

> Forward > File > MongoDB > ...

Output

> rewrite > ...
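To make the "small core + plugins" idea concrete, here is a minimal sketch of a plugin registry with a tiny engine that connects one input to one output by name. It does not reflect Fluentd's real plugin API; `register`, `run`, and the plugin names `static`, `memory`, and `count` are all hypothetical.

```python
INPUTS = {}
OUTPUTS = {}


def register(registry, name):
    """Class decorator that records a plugin class under a name."""
    def decorator(cls):
        registry[name] = cls
        return cls
    return decorator


@register(INPUTS, "static")
class StaticInput:
    """Input plugin that replays a fixed list of records."""
    def __init__(self, records):
        self.records = records

    def read(self):
        yield from self.records


@register(OUTPUTS, "memory")
class MemoryOutput:
    """Output plugin that keeps every record in a list."""
    def __init__(self):
        self.stored = []

    def write(self, record):
        self.stored.append(record)


@register(OUTPUTS, "count")
class CountOutput:
    """Output plugin that only counts records."""
    def __init__(self):
        self.count = 0

    def write(self, record):
        self.count += 1


def run(input_name, output_name, input_args=()):
    """Engine: look up both plugins by name and move records between them."""
    source = INPUTS[input_name](*input_args)
    sink = OUTPUTS[output_name]()
    for record in source.read():
        sink.write(record)
    return sink


records = [{"msg": "a"}, {"msg": "b"}]
memory_sink = run("static", "memory", input_args=(records,))
count_sink = run("static", "count", input_args=(records,))
```

The engine never names a concrete plugin class, so swapping `memory` for `count` changes the destination without touching the core.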

Page 10: Fluentd: Unified Logging Layer at CWT2014

Before


Page 11: Fluentd: Unified Logging Layer at CWT2014

After


Page 12: Fluentd: Unified Logging Layer at CWT2014

M x N → M + N

Sources: Access logs (Apache), App logs (Frontend, Backend), System logs (syslogd), Databases

Middle: buffer / routing

Destinations: Alerting (Nagios), Analysis (MongoDB, MySQL, Hadoop), Archiving (Amazon S3)
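The M x N → M + N claim can be checked with simple arithmetic. The sketch below counts integrations for point-to-point wiring versus routing everything through one central layer; the values of M and N are example numbers, not figures from the talk.

```python
def point_to_point(sources, destinations):
    """Every source needs its own integration with every destination."""
    return sources * destinations


def via_hub(sources, destinations):
    """Every source and every destination connects once to a central layer."""
    return sources + destinations


m, n = 4, 5  # example values: 4 log sources, 5 destinations
print(point_to_point(m, n))  # 20
print(via_hub(m, n))         # 9
```

Adding one more destination costs M new integrations in the first model but only one in the second.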

Page 13: Fluentd: Unified Logging Layer at CWT2014

Ecosystem

Page 14: Fluentd: Unified Logging Layer at CWT2014

Treasure Agent

• Treasure Data distribution of Fluentd

• including Ruby, core libraries and third-party plugins

• Current stable is 2.1.1

• rpm / deb / dmg

• 2.1.2 will be released soon with fluentd-ui


Page 15: Fluentd: Unified Logging Layer at CWT2014

fluentd-forwarder

• Forwarding agent written in Go

• mainly for Windows support

• less mature than Fluentd

• Bundles TCP input/output and TD output

• No plugin mechanism

• Similar products: ik, fluent-agent-hydra, etc.


Page 16: Fluentd: Unified Logging Layer at CWT2014

fluentd-ui

• Manage a Fluentd instance using a web GUI


Page 17: Fluentd: Unified Logging Layer at CWT2014

Use-case

Page 18: Fluentd: Unified Logging Layer at CWT2014

Users

• Companies

• Web service, Ad Tech, Game, Media, EC, IoT, PoS, etc.

• Products

• Kubernetes, Cloudn, Harvester, Bloomberg Clustered Private Cloud, etc.


Page 19: Fluentd: Unified Logging Layer at CWT2014

Simple forwarding

Destinations: HDFS, HBase, Mongo, S3

Page 20: Fluentd: Unified Logging Layer at CWT2014

Separate log stream
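Separating a log stream comes down to routing each event by its tag. The following is a simplified sketch of tag-based routing, assuming dot-separated tags where `*` matches exactly one tag part, `**` matches zero or more parts, and the first matching rule wins. The rule table and destination names are hypothetical, and the matcher is a simplified model rather than Fluentd's own code.

```python
def _match(pattern_parts, tag_parts):
    """Recursively match pattern parts against tag parts."""
    if not pattern_parts:
        return not tag_parts
    head, rest = pattern_parts[0], pattern_parts[1:]
    if head == "**":
        # '**' may absorb zero or more tag parts
        return any(_match(rest, tag_parts[i:]) for i in range(len(tag_parts) + 1))
    if not tag_parts:
        return False
    if head == "*" or head == tag_parts[0]:
        return _match(rest, tag_parts[1:])
    return False


def tag_matches(pattern, tag):
    """Return True if a dot-separated tag matches a dot-separated pattern."""
    return _match(pattern.split("."), tag.split("."))


def route(rules, tag):
    """Return the destination of the first rule whose pattern matches the tag."""
    for pattern, destination in rules:
        if tag_matches(pattern, tag):
            return destination
    return None


# Hypothetical rule table: errors go to alerting, other app logs to analysis,
# and everything else to the archive.
rules = [
    ("app.error", "alerting"),
    ("app.*", "analysis"),
    ("**", "archive"),
]

for tag in ("app.error", "app.access", "sys.kernel.warn"):
    print(tag, "->", route(rules, tag))
```

Ordering the rules from most to least specific lets a catch-all pattern at the end pick up anything the earlier rules did not claim.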


Page 21: Fluentd: Unified Logging Layer at CWT2014

CEP for stream processing


Page 22: Fluentd: Unified Logging Layer at CWT2014

Roadmap

• v0.10 (current)

• v0.12 (Nov, 2014)

• Filter / Label / at-least-once

• v0.14 (TBD)

• New Plugin APIs, ServerEngine

• v1 (TBD, in the first half of 2015)


Page 23: Fluentd: Unified Logging Layer at CWT2014

Check: treasuredata.com

Cloud service for the entire data pipeline