Stratio platform overview v4.1

Post on 07-Jul-2015

229 views 5 download

description

Stratio is a Big Data platform based on Spark. It is 100% open source and enterprise - ready. In Stratio we are Pure Spark, since it is the only technology in the market able to combine stored data analyses with real time streaming data, all in the same query. We are unique in integrating Spark processing with the main NoSql databases: Cassandra, MongoDB, ElasticSearch, ...

Transcript of Stratio platform overview v4.1

SELECT * FROM tweets WHERE lucene=

'{

filter :

{

type : "range",

field : "time",

lower : "2014/04/25",

upper : "2014/04/1"

},

query :

{

type : "phrase",

field : "body",

values : ["big", "data"]

},

sort :

{

fields: [ {field:"retweets”, reverse:true} ]

}

}';

CASSANDRA

Kafka

STRATIO DEEP

STRATIO DEEP

readClobreadCSVreadLinereadMultiLinereadAvroreadJson

addCurrentTimeaddLocalHostgeoIPfindReplaceSplit

generateUUIDdecompressIfextractJsonPathsdetectMimeType

xqueryextractURIComponentsxsltGrok (regular expressions)

exec

spooling SNMP

Kite SoftwareDevelopment Kit