Analyzing Data with the ELK Stack

Advancing the Elastic Stack -It’s more than just log aggregation!

Introduction

Mike ClarkeDevOps Engineer/SA

Mike KeithSenior Software Engineer

Agenda● Project/Problem Overview

○ Our environment and problem we were solving○ Initially to solve distributed log problem

● Elastic Stack Overview● Kibana and ElasticSearch Demo

Architecture Overview● Our Environment

○ Multiple Geographical Regions/Zones○ Ingest processing application○ Webservice application

■ Our webservice application logs tell us a lot about what is going on with customers sending us information.

○ Access logs for JBOSS○ Data archive application

JBossWebservice

JBossUI

Architecture Overview

NoSQL DB

Project / Problem● Log aggregation is hard● No historical reference, as logs age off● Obtaining stats was painful

○ Realistically when all your service stats are in your logs what do you do?● Cluster SSH only helps so much

Obtaining stats was painful ?!?!?!

cat log | grep "someword" | awk '{print $8}' | paste -sd+ | bc

host@me$: cat log | grep "someword" | awk '{print $8}' | paste -sd+ | bc

host@me$: cat log | grep "someword" | awk '{print $8}' | paste -sd+ | bc...………

20host@me$: cat log | grep "someword" | awk '{print $8}' | paste -sd+ | bc1240

Technical Overview● For the most part restricted to FOSS products● Needed to be easily obtainable● Available options

○ GrayLog○ Grafana○ Airbrake○ Splunk○ Elastic Stack

Elastic Stack (formerly ELK) Overview

Elasticsearch - Distributed, RESTful search and analytics engine

Logstash - Server-side data processing pipeline

Kibana - Powerful visualization UI

Beats - Single-purpose, lightweight data shippers

X-Pack - Powerful features which enhance the Elastic Stack

Elastic Stack (formerly ELK) Overview

Initial Solution - Log Aggregation● Single node servers● Installed Elastic Stack and began shipping all application server logs to a

centralized server.● Near Realtime● Raw log message transitioned into a fielded log message● Grok parsing (text pattern matching)● Filters etc.

Elasticsearch

Logstash

Filebeat

KibanaFilebeat

Filebeat

Architecture Overview

Filebeatfilebeat.prospectors:

- input_type: log

paths:- /data/logs/apache/*.log

fields:type: apache

fields_under_root: true

#----------------------------- Logstash output --------------------------------output.logstash:

hosts: ["localhost:5443"]bulk_max_size: 1024

Logstash - Input & Outputinput {

beats {port => 5443ssl => truessl_certificate => "/etc/pki/tls/certs/logstash-forwarder.crt"ssl_key => "/etc/pki/tls/private/logstash-forwarder.key"

output {elasticsearch {

hosts => ["localhost:9200"]index => "%{[@metadata][beat]}-%{[@metadata][type]}-%{+YYYY.MM.dd}"document_type => "%{type}"user => "elastic"password => "*******"

Kibana - Discover

Logstash - Filters

filter {grok {

match => { "message" => "%{IPORHOST:remote_ip} - %{DATA:user_name} \[%{HTTPDATE:time}\] \"%{WORD:method} %{DATA:url} HTTP/%{NUMBER:http_version}\" %{NUMBER:response_code:int} %{NUMBER:bytes:int} "}

}mutate {

add_field => { "read_timestamp" => "%{@timestamp}" }}date {

match => [ "time", "dd/MMM/YYYY:H:m:s Z" ]remove_field => "time"

Kibana

● We change from looking at who is talking to us, to what they are talking to us about.

○ We kept adding more to our logs just so we could see it in Kibana.○ Our data was already in Avro format, which made it easy to convert to JSON ○ Then we used the JSON Codec for logstash to input directly into elasticsearch.

● Considered Accumulo○ But there was just too much we had to build to get it to a usable state.

Evolution of the solution

Kibana Twitter Demo● Let’s take a look at some interesting things you can see in kibana● Counting very easily across different fields in your data (makes aggregating

and histograms very easy)● Data changes over time, sometimes you need to go back and update

something you already stored?○ State changes or updates of some kind to the original document.

Twitter Data DemoBasic twitter JSON:

{ screen_name, text, retweeted_status.user.screen_name, retweeted_status.retweet_count, retweeted_status.text, ... }

Data Storage Elastic Stack Architecture

ElasticsearchData Node 1

Logstash Node 1

Kibana

Filebeat

Logstash Node 4

ElasticsearchData Node 20

... ElasticsearchClient Node

ElasticsearchMaster Node 1

ElasticsearchMaster Node 2

... ...

Conclusion & Takeaways● Low Barrier to Entry● Quickly Search Across Data● Horizontally Scalable● Easily Visualize Data

About Clarity Business Solutions● We are a team of Software and System Engineers● Customer focused and mission driven● For more about us, please visit: www.claritybizsol.com

● Follow us:

@claritybizsol

Analyzing Data with the ELK Stack

Software

Transcript of Analyzing Data with the ELK Stack

Log Consolidation with ELK Stack

Parallel Coordinates Visualization in the ELK Stack

BUILDING HA ELK STACK FOR DRUPAL

ELK stack - soit.sk fileSplunk Widely used Easy to use Cross platform Expensive Complex set up process ELK stack Easy installation Open Source

Bill centralises logs, be more like Bill - Percona · Bill centralises logs ... ELK Stack. ELK + B. Elastic Stack. Beats: Shipper. Beats Lightweight go binary ... Now you know you

Automation of Log Analysis Using the Hunting ELK Stack

The ELK Stack - Conygreneueda.conygre.com/citi/content/elk/elk.pdf · The ELK Stack Elastic Logging. Agenda 1. Logging and analysis 2. The ELK stack 3. Logs & Elasticsearch Lab 1

Learning ELK Stack - Sample Chapter

The Elastic Stack - openSUSE · The Elastic stack ... The Elastic stack (formerly: ELK stack) Elasticsearch Kibana Logstash. Database UI Log server / parser. 12

Centralised logging with ELK stack

ELK Stack - Turn boring logfiles into sexy dashboard

Using the ELK Stack for CASTOR Application Logging at RAL

Bitnami ELK for Huawei Enterprise Cloud ELK for Huawei Enterprise Cloud Description The ELK stack is a log management platform consisting of Elasticsearch (deep search and data analytics),

Diventare famosi con lo stack ELK - Alfonso Iannotta

ELK stack Big Data visualization using D3 library

using LOD and the ELK stack - Semantics...Building business tools for the scholarly publishing domain using LOD and the ELK stack SEMANTiCS Vienna 2018 Analytics Markus Kaindl Senior

The ELK Stack - Conygreneueda.conygre.com/citi/content/elk/latestelk.pdf · The ELK Stack Centralise your view of logs and get what you need. ... //qbox.io/blog/welcome-to-the-elk-stack-elasticsearch-logstash

Suricata IDS/IPS - Vereniging NLUUG · Suricata creator and lead dev ... – Suricata – ELK stack ... – e.g. various ELK parts from the official ELK images

ELK - Stack - Munich .net UG

ELK stack & log parsing - agenda.infn.it · Output configuration to route parsed data in a search analytics engine (Elasticsearch). ELK stack & log parsing TOMMASO DIOTALEVI Parse