Poster Title

1
Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000. SAND No. 2011-XXXXP The Hybrid Toolkit The Hybrid Toolkit is a flexible, extensible collection of software tools for capturing, storing, analyzing, and visualizing data. Hybrid was initially developed under the Sandia LDRD program as a system to bring scalable data analytics to enterprise security analysts working on the problem of detecting phishing and spear phishing attacks in email messages. The broad applicability of the capabilities currently in Hybrid has led to adoption of the toolkit for addressing a wider range of problems: data exfiltration, malware analysis, host-based anomaly detection, and large-scale network emulation, to name a few. Contacts: Warren Davis (PI) [email protected] Danny Dunlavy [email protected] Christopher Nebergall [email protected] Python implementation Worker-Manager-Executor computation model Data and I/O abstraction Stateful data Data storage • CouchDB • MongoDB Data analysis • Descriptive statistics • Clustering: Partitional (e.g. K-Means) and hierarchical Topic modeling: Latent Dirichlet Allocation Classification: k- nearest neighbor, multi-layer perceptrons, sentiment analysis Data processing Plaintext (Unicode and ASCII) • PDF MS Word • HTML • SMTP Application integration • Bro (http://www.bro.org ) • Splunk ( http://www.splunk.com ) Hybrid Toolkit Details Benefits Cross platform development and application Windows, MacOS, Linux Deployment extensibility Desktop, server, HPC system Development support Multi-core computation Extensible logging module Rapid prototyping Proven production experience Web interfaces support cyber team collaboration

description

Poster Title. The Hybrid Toolkit. - PowerPoint PPT Presentation

Transcript of Poster Title

Page 1: Poster Title

Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin

Corporation, for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-AC04-94AL85000.

SAND No. 2011-XXXXP

The Hybrid ToolkitThe Hybrid Toolkit is a flexible, extensible collection of software tools for capturing, storing, analyzing, and visualizing data.

Hybrid was initially developed under the Sandia LDRD program as a system to bring scalable data analytics to enterprise security analysts working on the problem of detecting phishing and spear phishing attacks in email messages.

The broad applicability of the capabilities currently in Hybrid has led to adoption of the toolkit for addressing a wider range of problems: data exfiltration, malware analysis, host-based anomaly detection, and large-scale network emulation, to name a few.

Contacts: • Warren Davis (PI)

[email protected]• Danny Dunlavy

[email protected]• Christopher Nebergall

[email protected]

Python implementationWorker-Manager-Executor computation modelData and I/O abstractionStateful dataData storage• CouchDB• MongoDB

Data analysis • Descriptive statistics• Clustering: Partitional (e.g.

K-Means) and hierarchical• Topic modeling: Latent

Dirichlet Allocation• Classification: k-nearest

neighbor, multi-layer perceptrons, sentiment analysis

Data processing• Plaintext (Unicode and ASCII)• PDF• MS Word• HTML• SMTP

Application integration• Bro (http://www.bro.org)• Splunk (http://www.splunk.com)

Hybrid Toolkit Details

Benefits Cross platform

development and application Windows, MacOS,

Linux Deployment extensibility

Desktop, server, HPC system

Development support Multi-core

computation Extensible logging

module Rapid prototyping Proven production

experience

Web interfaces support cyber team collaboration