Introduction to Graph Analytics and Oracle Cloud Service · PDF file Oracle Graph Analytics...
date post
14-Mar-2020Category
Documents
view
7download
0
Embed Size (px)
Transcript of Introduction to Graph Analytics and Oracle Cloud Service · PDF file Oracle Graph Analytics...
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Introduction to Graph Analytics and Oracle Cloud Service
Hans Viehmann Jean Ihm Korbi Schmid Product Manager EMEA Product Manager US Engineering Manager Oracle Oracle Oracle
@SpatialHannes @JeanIhm
October 22, 2018
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, timing, and pricing of any features or functionality described for Oracle’s products may change and remains at the sole discretion of Oracle Corporation.
2
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Program Agenda
Product Introduction
Use Cases
Feature Overview
Demo
1
2
3
4
3
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. | 4
https://twitter.jeffprod.com
Following, no follow back
Follower, no follow back
Follow each other
https://twitter.jeffprod.com/
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Oracle’s Spatial and Graph Strategy Enabling Spatial and Graph use cases on every platform
5
Oracle Big Data Spatial and Graph
Oracle Database Spatial and Graph
Spatial and Graph in Oracle Cloud
Database Cloud Service Exadata Cloud Service
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Two Graph Data Models
RDF Data Model
• Data federation • Knowledge representation • Semantic Web
Social Network Analysis
Financial Retail, Marketing Social Media Smart Manufacturing
Linked Data Semantic Web
Property Graph Model
• Path Analytics • Social Network Analysis • Entity analytics
Life Sciences Health Care Publishing Finance
Use Case Graph Model Industry Domain
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Graph Database Features:
• Scalability, Performance, Security • Graph Analytics • Graph Query Language • Graph Visualization • Standard Interfaces • Integration with Machine Learning tools
7
Courtesy Tom Sawyer Perspectives
Courtesy Linkurious
Courtesy Tom Sawyer Perspectives
Courtesy Linkurious
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Spatial and Graph
• Available for Big Data platform – Hadoop, HBase, Oracle NoSQL
• Supported both on BDA and commodity hardware – CDH and Hortonworks
• Database connectivity through Big Data Connectors or Big Data SQL
• Part of Big Data Cloud Service
Oracle Spatial and Graph (DB option)
• Available with Oracle 12.2/18c (EE) • Using tables for graph persistence • In-database graph analytics
– Sparsification, shortest path, page rank, triangle counting, WCC, sub graph generation…
• SQL and PGQL queries possible • Included in Database Cloud Services
8
Oracle Products Supporting Property Graphs
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Use Cases
9
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Graph Analysis for Business Insight
10
Identify Influencers
Discover Graph Patterns in Big Data
Generate Recommendations
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• Customer profitability analysis – Part of larger Hadoop/Big Data project
• Analysis of banking transactions – Focus on corporate customers
• Identification of undesired behavioural patterns, eg. – Customers using other banks to make large
numbers of transactions – Many of which flow back to Banco Galicia
• Increase fees, terminate contracts, or move activities to Banco Galicia
• Implemented by Oracle Consulting
11
Banco de Galicia
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• Creating Knowledge Graphs from all kinds of content – Social media networks, documents, images,
audio, video, structured data – Using machine learning (text analysis,
classification, entity extraction, face recognition, speech2text, ...)
• Enabling relationship analysis and semantic search
• bigCONNECT platform built by mWARE – Running on Big Data Applicance, Big Data Cloud
Service or commodity Hadoop
12
Romanian Police Force BIG DATA since 2012
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Ministry of Finance, Eastern Europe • Detecting relationships between people,
accounts, companies – Similar to Paradise Papers
• Identifying suspicious patterns – Circular money transfers – Connections (existing path/shortest path) to
companies in tax havens
• Ingesting accounting data in SAF-T format – Hadoop-based processing (Oozie, Spark, Hive) – Terabytes of data, rapidly growing
• Interactive graph analysis in Apex with Cytoscape.js
EU VAT fraud
13
Company in other EU member
country
BO RD
ER 0
% V
A T
BO RD
ER 0
% V
A T
BO RD
ER 0
% V
A T
BO RD
ER
Importer „fake” company
Buffer Reseller
Tax authority
No t p
ay in
g VA
T
ImporterCompany in other EU member
country
Buffer Reseller
Tax authority Tax authority
Pa yi
ng V
AT
Exporter company
Ge tti
ng
VA T
re fu
nd
BO RD
ER 0
% V
A T
BO RD
ER 0
% V
A T
BO RD
ER 0
% V
A T
BO RD
ER
Company in other EU member
country
Company in other EU member
country
?
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• Management of Bill-of-materials – Automotive manufacturing process – Supporting high variance and short
innovation cycles
• Data coming from various sources • Complex PGQL queries to associate
parts and subcomponents – Performance as key requirement – Happy with response times and
scaleability
14
Mazda
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Feature Overview
15
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Oracle Graph Analytics Architecture
Scalable and Persistent Storage
Graph Storage Management
Graph Analytics In-memory Analytic Engine
Blueprints & SolrCloud / Lucene
Property Graph Support on Apache HBase, Oracle NoSQL or Oracle 12.2
REST W eb Service
Python, Perl, PHP, Ruby, Javascript, …
Java APIs
Java APIs/JDBC/SQL/PLSQL
16
Cy to
sc ap
e Pl
ug -in
R In
te gr
at io
n (O
AA gr
ap h)
Sp ar
k in
te gr
at io
n
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Interacting with the Graph
• Access through APIs – Implementation of Apache Tinkerpop Blueprints APIs – Based on Java, REST plus SolR Cloud/Lucene support for text search
• Scripting – Groovy, Python, Javascript, ... – Apache Zeppelin integration, Javascript (Node.js) language binding
• Graphical UIs – Cytoscape, plug-in available – Commercial Tools such as TomSawyer Perspectives
On-premise product geared towards data scientists and developers
17
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
Example: Betweenness Centrality in Big Data Graph Code
18
D
A
C
E
B
F
I
J
H
K
G
analyst.vertexBetweennessCentrality(pg) .getTopKValues(15)
Copyright © 2018, Oracle and/or its affiliates. All rights reserved. |
• Finding a given pattern in graph – Fraud detection – Anomaly detection – Subgraph extraction – ...
• SQL-like syntax but with graph pattern description and property access – Interactive (real-time) analysis – Supporting aggregates, comparison,
such as max, min, order by, group by
• Proposed for standardization by Oracle – Specification available on-line – Open-sourced front-end (i.e. parser)
Pattern matching in Property Graphs using PGQL
https://github.com/oracle/pgql-lang
19