An Introduction to Neo4j

[Title graph] Nodes: Stephan Pirnbaum (:Person), NeosCon (:Conference), Dresden (:City), An Introduction to Neo4j (:Talk)
Relationship types: :SPEAKS_AT, :IN, :HOSTS, :SPEAKS_ABOUT

https://neo4j.com/blog/neo4j-community-june-2016/

• The world is a graph
  ◼ Full of connected people, events, and other things
  ◼ Relations matter!
https://www.businessinsider.com/explainer-what-exactly-is-the-social-graph-2012-3?IR=T

• The IT world is full of graphs
  ◼ Software projects consist of modules, packages, classes, ... All of these are related to each other
  ◼ Where are JOIN tables in reality (and how do you explain them)?

• Data volume is increasing and data is getting more connected
  ◼ Online transactions
  ◼ Social networks
  ◼ Smart devices
https://www.sensorsexpo.com/iot-ecosystem

• High value in data relationships
  ◼ Connecting data in new ways can improve existing use cases and create new ones
• Brings many advantages over competitors
• Plenty of data is created every second in today's world

https://memegenerator.net/instance/84458628/questioning-african-kid-so-youre-telling-me-relational-databases-are-not-so-relational

• Relational DBs can't handle relationships well
  ◼ Cannot model or store data and relationships without complexity
  ◼ Performance degrades with the number and levels of relationships and with DB size
  ◼ Query complexity grows with the need for JOINs
  ◼ Adding new types of data and relationships requires schema redesign

Internal Applications
• Master Data Management
• Network and IT Operations
• Fraud Detection
• …
Customer-Facing Applications
• Real-Time Recommendations
• Graph-Based Search
• Identity and Access Management
• Knowledge Graph
• …
https://neo4j.com/use-cases/

• Usage for Good
  ◼ Offshore Leaks
  ◼ Panama Papers
  ◼ Paradise Papers
https://neo4j.com/blog/neo4j-power-behind-paradise-papers/

• Usage for Good
  ◼ Cancer research: Candiolo Cancer Institute
  ◼ Diabetes research: German Center for Diabetes Research
https://www.it-zoom.de/it-director/e/diabetesforschung-nutzt-neo4j-21116/

• ACID-compliant
• Transactional
• Native graph storage and processing
• Property graph model
• Open-source and commercial licensing
• Offers drivers for Python, .NET, Java, PHP, …

Werner Vogels, CTO of Amazon

• "The whiteboard model is the graph model"
  ◼ No need for object-relational mapping
  ◼ Same understanding of the data model for IT and business
  ◼ No need for complex join tables and the like
https://de.slideshare.net/neo4j/the-graph-database-universe-neo4j-overview

• Nodes
  ◼ Objects in the graph
  ◼ Store data using name-value properties
  ◼ Can have labels attached
• Relationships
  ◼ Relate nodes by type and direction
  ◼ Store data using name-value properties
Example: (Stephan :Person:Author) -[:WROTE]-> (Neo4j – Part 1 :Article)
  Stephan: firstName: Stephan, lastName: Pirnbaum, birthday: 26.11.1993
  Neo4j – Part 1: title: Neo4j – Part 4, state: Published, publishedOn: 5/11/2019
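
To make the property graph model concrete, here is a minimal in-memory sketch in Python. It is not how Neo4j stores data; the `Node` and `Relationship` classes and the `outgoing` helper are hypothetical names chosen for illustration. It only shows that nodes carry labels and properties, and that relationships carry a type, a direction, and their own properties.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    labels: set                                   # e.g. {"Person", "Author"}
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    start: Node                                   # direction: start -> end
    type: str                                     # e.g. "WROTE"
    end: Node
    properties: dict = field(default_factory=dict)


# The example from the slide, with a subset of its properties.
stephan = Node({"Person", "Author"}, {"firstName": "Stephan", "lastName": "Pirnbaum"})
article = Node({"Article"}, {"title": "Neo4j – Part 1", "state": "Published"})
wrote = Relationship(stephan, "WROTE", article)


def outgoing(node, relationships, rel_type):
    """Return the end nodes of all relationships of rel_type that start at node."""
    return [r.end for r in relationships if r.start is node and r.type == rel_type]


articles = outgoing(stephan, [wrote], "WROTE")
print([a.properties["title"] for a in articles])
```

Because relationships are directed, following `WROTE` from the article instead of from the author yields nothing.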

[Figure: Relational Model vs. Graph Model. Relational: tables Author, Article, and join table Author-Article. Graph: STEPHAN connected by WROTE to Neo4j – Part 1, Neo4j – Part 2, and Neo4j – Part 3]
https://logisima.developpez.com/tutoriel/nosql/neo4j/introduction-neo4j/

• Let's model
  ◼ Article
  ◼ Tag
  ◼ Category
  ◼ Person
  ◼ Author
  ◼ Comment
[Slide groups these into: Master Data, Activity]

[Diagram: data model with the labels :Person, :Author, :Article, :Category, :Comment, :Tag and the relationship types :WROTE, :CONTAINS, :HAS_COMMENT]

• Schema Design and Migration
  ◼ Neo4j has no schema in the classical sense
  ◼ Indexes can be defined on properties
  ◼ Uniqueness and existence constraints can be defined for properties

• Cypher-based LOAD CSV capability
• Command-line bulk loader bin/neo4j-import
• JSON/XML loader
• ETL tool for RDBMS
• NoSQL DB access
https://neo4j.com/developer/neo4j-etl/

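(Stephan :Person:Author) -[:WROTE]-> (Neo4j – Part 1 :Article)
    MATCH (:Author {firstName: "Stephan"})-[:WROTE]->(article:Article)
    RETURN article
Annotations: (:Author {firstName: ...}) is a node with the label Author and the property firstName; [:WROTE] is a relationship; (article:Article) is a node bound to the variable article with the label Article.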

• Find all authors
    MATCH (a:Author)
    RETURN a.firstName, a.lastName

• Find all articles of all authors, grouped by author
    MATCH (p:Person)-[:WROTE]->(a:Article)
    RETURN p.firstName, p.lastName, collect(a {.title, .publishedOn}) AS Articles
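
The grouping in this query is implicit: the non-aggregated return values (first and last name) act as the grouping key, and collect gathers the remaining values into one list per key. A rough Python equivalent, with hypothetical sample rows standing in for the matched (person, article) pairs, might look like this:

```python
from collections import defaultdict

# Hypothetical rows, one per matched (p)-[:WROTE]->(a) pair.
rows = [
    ("Stephan", "Pirnbaum", {"title": "Neo4j – Part 1"}),
    ("Stephan", "Pirnbaum", {"title": "Neo4j – Part 2"}),
    ("Jane", "Doe", {"title": "Hypothetical article"}),
]

# The names form the grouping key; the articles are collected per key.
grouped = defaultdict(list)
for first_name, last_name, article in rows:
    grouped[(first_name, last_name)].append(article)

for (first_name, last_name), articles in grouped.items():
    print(first_name, last_name, [a["title"] for a in articles])
```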

• Let's find out who has written articles that are tagged "Neo4j"
  ◼ In SQL…
    SELECT DISTINCT p.firstName, p.lastName
    FROM Person p
    LEFT JOIN Article a ON p.personId = a.personId
    LEFT JOIN ArticleTag aT ON a.articleId = aT.articleId
    LEFT JOIN Tag t ON aT.tagId = t.tagId
    WHERE t.name = 'Neo4j'
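
To try the SQL variant without setting up a database server, the following sketch runs it against an in-memory SQLite database from Python. The table layout is inferred from the column names in the query, and the sample rows (including the second author) are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Schema inferred from the query; sample data is made up for illustration.
conn.executescript("""
CREATE TABLE Person (personId INTEGER PRIMARY KEY, firstName TEXT, lastName TEXT);
CREATE TABLE Article (articleId INTEGER PRIMARY KEY, personId INTEGER, title TEXT);
CREATE TABLE Tag (tagId INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE ArticleTag (articleId INTEGER, tagId INTEGER);

INSERT INTO Person VALUES (1, 'Stephan', 'Pirnbaum'), (2, 'Jane', 'Doe');
INSERT INTO Article VALUES (10, 1, 'Neo4j - Part 1'), (11, 1, 'Neo4j - Part 2'),
                           (12, 2, 'Something else');
INSERT INTO Tag VALUES (100, 'Neo4j'), (101, 'Other');
INSERT INTO ArticleTag VALUES (10, 100), (11, 100), (12, 101);
""")

QUERY = """
SELECT DISTINCT p.firstName, p.lastName
FROM Person p
LEFT JOIN Article a ON p.personId = a.personId
LEFT JOIN ArticleTag aT ON a.articleId = aT.articleId
LEFT JOIN Tag t ON aT.tagId = t.tagId
WHERE t.name = 'Neo4j'
"""

rows = conn.execute(QUERY).fetchall()
print(rows)
```

Three joins and a join table are needed just to walk from a person to a tag, which is the point the slide makes.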

• Let's find out who has written articles that are tagged "Neo4j"
  ◼ In Cypher ☺
    MATCH (p:Author)-[:WROTE]->(a:Article),
          (a)-[:TAGGED_BY]->(t:Tag {name: "Neo4j"})
    RETURN DISTINCT p.firstName, p.lastName

• Find co-occurring tags
  ◼ Useful to identify new categories
    MATCH (t1:Tag)-->()<--(t2:Tag)
    RETURN t1.name, t2.name, count(*) AS cooccurrences
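
The idea behind the co-occurrence count can be sketched in plain Python: for every article, take each pair of tags attached to it and count how often that pair appears. The sample data is hypothetical. Unlike the symmetric Cypher pattern, which matches every pair in both orders, this sketch counts each unordered pair once.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample data: article title -> tags attached to it.
article_tags = {
    "Neo4j – Part 1": {"Neo4j", "Graph", "Database"},
    "Neo4j – Part 2": {"Neo4j", "Cypher"},
    "Neo4j – Part 3": {"Neo4j", "Graph"},
}

cooccurrences = Counter()
for tags in article_tags.values():
    # sorted() gives every pair a stable order, so (a, b) and (b, a) coincide.
    for t1, t2 in combinations(sorted(tags), 2):
        cooccurrences[(t1, t2)] += 1

for (t1, t2), count in cooccurrences.most_common():
    print(t1, t2, count)
```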

• Build a recommendation engine for articles
[Diagram: data model with the labels :Person, :Author, :Article, :Category, :Comment, :Tag and the relationship types :WROTE, :CONTAINS, :HAS_COMMENT]

    MATCH (p:Person)-[:COMMENTED]->(a1:Article),
          (peer:Person)-[:WROTE|:COMMENTED]->(a1),
          (peer:Person)-[:WROTE|:COMMENTED]->(a2:Article)
    WHERE NOT exists((p)-[:COMMENTED]->(a2))
    RETURN a2.title, count(*) AS frequency
    ORDER BY frequency DESC
    LIMIT 5
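
The logic of this recommendation query can be approximated in a few lines of Python. This is a simplified sketch, not a translation: it keeps a single hypothetical set of "interacted with" articles per person instead of separate :WROTE and :COMMENTED relationships, and the names `interactions` and `recommend` are made up for illustration.

```python
from collections import Counter

# Hypothetical data: person -> articles they wrote or commented on.
interactions = {
    "alice": {"A", "B"},
    "bob": {"A", "B", "C"},
    "carol": {"B", "C", "D"},
}


def recommend(person, interactions, limit=5):
    """Rank unseen articles by how many (shared article, peer) paths lead to them."""
    seen = interactions[person]
    frequency = Counter()
    for peer, articles in interactions.items():
        if peer == person:
            continue
        shared = articles & seen           # plays the role of a1 in the query
        for candidate in articles - seen:  # plays the role of a2 in the query
            frequency[candidate] += len(shared)
    return frequency.most_common(limit)


print(recommend("alice", interactions))
```

Peers who share more articles with the person contribute more weight to their other articles, which mirrors how count(*) accumulates one row per matched path.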

https://memegenerator.net/instance/84458650/the-most-interesting-man-in-the-world-without-the-beer-i-recommend-you-build-a-recommendation-engine

• Easy modelling of hierarchical data structures
• Usage for powerful recommendation engines
  ◼ https://www.adamcowley.co.uk/neo4j/wordpress-recommendations-neo4j-part-1-data-modelling/ (WordPress)
• Usage for page-view tracking
  ◼ https://neo4j.com/blog/graph-databases-drupal-neo4j-module-rules-integration/ (Drupal)

• Click-path analysis using Snowplow
  ◼ https://snowplowanalytics.com/blog/2017/07/17/loading-and-analysing-snowplow-event-data-in-Neo4j/
• Structr is based on Neo4j: https://structr.com/
• CMS using GraphQL: https://graphcms.com/

Please send questions to:
Stephan Pirnbaum
buschmais GbR
Owners: Torsten Busch, Frank Schwarz, Dirk Mahler, and Tobias Israel
[email protected]
http://buschmais.de/
Dresden, 11.05.2019
