SlideShare una empresa de Scribd logo
1 de 30
Descargar para leer sin conexión
Neo4j and Apache Hop
Future-proof data orchestration for Neo4j
What is a graph database?
● A graph consists of nodes connected by relationships.
● Graph databases store data as graph structures (nodes and
relationships)
Property Graph Model Components
● Nodes
○ Represent the objects in the graph
○ Can be labeled
CAR
PERSON PERSON
Property Graph Model Components
● Nodes
○ Represent the objects in the graph
○ Can be labeled
● Relationships
○ Relate nodes by type and direction
CAR
DRIVES
LOVES
LOVES
LIVES WITH
O
W
N
S
PERSON PERSON
Property Graph Model Components
● Nodes
○ Represent the objects in the graph
○ Can be labeled
● Relationships
○ Relate nodes by type and direction
● Properties
○ Name-value pairs that can go on
nodes and relationships
CAR
DRIVES
name: “Dan”
born: May 29, 1970
twitter: “@dan”
name: “Ann”
born: Dec 5, 1975
since:
Jan 10, 2011
brand: “Volvo”
model: “V70”
LOVES
LOVES
LIVES WITH
O
W
N
S
PERSON PERSON
Cypher: Graph Query Language
MATCH (:Person { name:"Dan"} ) -[:LOVES]-> (:Person { name:"Ann"} )
LOVES
Dan Ann
LABEL PROPERTY
NODE NODE
LABEL PROPERTY
Neo4j Use Cases
● Social network analysis
● Recommendation engine
● Search & Discovery
● Network & Data Center
● Master Data Management
● Identity & Access Management
● GEO
● Fraud Detection
● …
Blog series: Top 10 Neo4j use cases
Neo4j
● The world’s leading graph database
● June 2021: raised $325M!
● On-premise, in the cloud or hybrid (Aura)
● Graph ML algorithms
● High performance
● Up to trillions of nodes and relationships
Common Approaches
● Neo4j tools:
○ Load CSV
○ APOC
○ Neo4j ETL (RDBMS to graph)
○ Code (Python, Java, Go, …)
Why Data Projects Fail...
What is required?
● Good scope and project management
● Easy to develop, run, debug, troubleshoot (enable citizen developers)
● Error handling, logging, monitoring, lineage
● Testing (unit, integration, regression)
● Repeatable, reproducible
● Life Cycle Management
● Scale with data requirements, volume and project
● Integrate perfectly with DevOps
● Integrate perfectly with Neo4j
Apache Hop
● Recursive acronym: Hop Orchestration Platform
● Orchestration:
○ Data: pipelines and workflows
○ Metadata: editing, handling, management,...
○ Insights: data/execution lineage, logging, …
○ Configurations: handling ecosystem complexity
Apache Hop : background
● Community lead initiative
●
● New scalable GUI
● New metadata back-end
● Simplified toolset
● Code refactored, renamed, trimmed down, ...
● Extra plugins: Projects, Testing, Apache Beam, Debugging, ...
● …
Apache Hop Incubator
● Forked PDI/Kettle 8.2 + WebSpoon + patches + plugins + …
● → Represents 20 years of development!
● Part of the ASF Incubator, close to TLP
● 1.0 early October!
● built continuously
● Active, quickly growing community
https://hop.apache.org
Why Apache Hop?
● A quickly diversifying technological landscape
→ Makes it hard to manage complexity
→ Drives the need for rapid innovation
● Development done independent from a single large corporation
● By and for data orchestration professionals
Guiding principles
Apache Hop aims to make data orchestration better:
● Easy: setup, build, maintenance, deployment, …
● Fast: startup time, supporting Spark, Flink & DataFlow, ...
● Transparent: before, during and after execution
● Predictable: unit and integration testing
● Innovative: need for the latest tech (digital transformation)
● Best practices: support version control, testing, CI/CD, project,
lifecycle management, ...
Apache Hop : key architecture features
● Metadata driven: no code generation
● Modular pluggable architecture: scale back to <30MB
● Fast startup, minimal overhead
● Apache Beam with support for Apache Spark, Apache Flink and GCP
DataFlow runners
● Version controlled documentation
● Ease of use: transparent naming and easy to use tools
● Integration test: critical components are tested daily with integration tests
→ runtime compatibility, stability, ...
Apache Hop : key GUI features
● Pluggable GUI features
● Scalable interface for high DPI displays or visually impaired
● Perspectives for easy fast context switching
● Designed for web browsers and mobile users
→ Single click mode for faster navigation
● Full support for 4 platforms: Windows, OSX, Linux & Web
● Support for “dark mode” themes on Linux and OSX
Apache Hop : key configuration features
● All GUI configuration options have a command line variant
● Single central system configuration JSON file
● Easy project and lifecycle environment configuration
● Configuration and metadata inheritance from other projects
● Standard docker container
● Stateless server supporting multi-tenancy
● Version control friendly setup
Apache Hop : Q4 & 2022 roadmap
● Graduation to Apache Top Level Project (TLP)
● Pluggable field expressions
● Java 11 (mirror Apache Beam)
● new execute/preview/debug GUI
● Improved cloud support
● Airflow runtime support
● Marketplace for 3rd party plugins
● ...
Apache Hop : Community!
● Accepted in the ASF Incubator in Sept ‘20, ready to graduate to TLP
● Apache is a community building organisation
● Great communities deliver great software
● During incubation we are asked to
→ Grow the community
→ Release software the Apache way
Apache Hop : Community
● No single company drives the software forward
● The Hop community is growing fast across all social media channels, chat
server, …
● Anyone is welcome with ideas, code, bug fixes, suggestions, documentation,
translations, …
● No bug is too small or too big to fix.
● No improvement suggestion is too small or too big to consider
Apache Hop and Neo4j
● Best Neo4j (incl Aura!) support of any platform!!
● Functionality:
○ Neo4j logging perspective
○ Neo4j connection type, graph data type
○ 20+ action and transform plugins to write data to Neo4j, run
Cypher, split graph in nodes and relationships etc
Apache Hop : Enterprise support
● Lean With Data wants to help you!
● See: www.leanwithdata.com
→ Support
→ Training
→ Certification
→ Custom development
→ Data orchestration tool migration
Lean Orchestration: professionally supported Apache Hop.
Get the best of both worlds.
Know.bi is a Lean With Data partner
Demo
Thank you for your interest and time!
@ApacheHop, @know_bi
https://www.linkedin.com/company/apachehop
https://www.linkedin.com/company/knowbi
https://chat.project-hop.org

Más contenido relacionado

La actualidad más candente

Graphs & Neo4j - Past Present Future
Graphs & Neo4j - Past Present FutureGraphs & Neo4j - Past Present Future
Graphs & Neo4j - Past Present Future
jexp
 
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas SuravarapuGraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
Neo4j
 

La actualidad más candente (20)

Neo4j Data Loading with Kettle
Neo4j Data Loading with KettleNeo4j Data Loading with Kettle
Neo4j Data Loading with Kettle
 
Neo4j-Databridge: Enterprise-scale ETL for Neo4j
Neo4j-Databridge: Enterprise-scale ETL for Neo4jNeo4j-Databridge: Enterprise-scale ETL for Neo4j
Neo4j-Databridge: Enterprise-scale ETL for Neo4j
 
A whirlwind tour of graph databases
A whirlwind tour of graph databasesA whirlwind tour of graph databases
A whirlwind tour of graph databases
 
Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020Presto @ Zalando - Big Data Tech Warsaw 2020
Presto @ Zalando - Big Data Tech Warsaw 2020
 
GraphTour - Albelli: Running Neo4j on a large scale image platform
GraphTour - Albelli: Running Neo4j on a large scale image platformGraphTour - Albelli: Running Neo4j on a large scale image platform
GraphTour - Albelli: Running Neo4j on a large scale image platform
 
GraphTour - Neo4j Database Overview
GraphTour - Neo4j Database OverviewGraphTour - Neo4j Database Overview
GraphTour - Neo4j Database Overview
 
Introduction to Neo4j and .Net
Introduction to Neo4j and .NetIntroduction to Neo4j and .Net
Introduction to Neo4j and .Net
 
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with GraphsNeo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
 
Hadoop User Group Ireland (HUG) Ireland - Eddie Baggot Presentation April 2016
Hadoop User Group Ireland (HUG) Ireland - Eddie Baggot Presentation April 2016Hadoop User Group Ireland (HUG) Ireland - Eddie Baggot Presentation April 2016
Hadoop User Group Ireland (HUG) Ireland - Eddie Baggot Presentation April 2016
 
How Graph Databases efficiently store, manage and query connected data at s...
How Graph Databases efficiently  store, manage and query  connected data at s...How Graph Databases efficiently  store, manage and query  connected data at s...
How Graph Databases efficiently store, manage and query connected data at s...
 
Graphs & Neo4j - Past Present Future
Graphs & Neo4j - Past Present FutureGraphs & Neo4j - Past Present Future
Graphs & Neo4j - Past Present Future
 
Introduction to Neo4j
Introduction to Neo4jIntroduction to Neo4j
Introduction to Neo4j
 
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
The Evolution of the Fashion Retail Industry in the Age of AI with Kshitij Ku...
 
Kappa Architecture, IoT of the cars - LibreCon 2016
Kappa Architecture, IoT of the cars - LibreCon 2016Kappa Architecture, IoT of the cars - LibreCon 2016
Kappa Architecture, IoT of the cars - LibreCon 2016
 
Graphs for AI & ML, Jim Webber, Neo4j
Graphs for AI & ML, Jim Webber, Neo4jGraphs for AI & ML, Jim Webber, Neo4j
Graphs for AI & ML, Jim Webber, Neo4j
 
Neo4j Import Webinar
Neo4j Import WebinarNeo4j Import Webinar
Neo4j Import Webinar
 
Finding the Needle in a Haystack With Knowledge Graphs
Finding the Needle in a Haystack With Knowledge GraphsFinding the Needle in a Haystack With Knowledge Graphs
Finding the Needle in a Haystack With Knowledge Graphs
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas SuravarapuGraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
GraphConnect Europe 2016 - Faster Lap Times with Neo4j - Srinivas Suravarapu
 
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4j
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4jAdobe Behance Scales to Millions of Users at Lower TCO with Neo4j
Adobe Behance Scales to Millions of Users at Lower TCO with Neo4j
 

Similar a Visual, scalable, and manageable data loading to and from Neo4j with Apache Hop

Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
StampedeCon
 

Similar a Visual, scalable, and manageable data loading to and from Neo4j with Apache Hop (20)

[HKDUG] #20151017 - BarCamp 2015 - Drupal 8 is Coming! Are You Ready?
[HKDUG] #20151017 - BarCamp 2015 - Drupal 8 is Coming! Are You Ready?[HKDUG] #20151017 - BarCamp 2015 - Drupal 8 is Coming! Are You Ready?
[HKDUG] #20151017 - BarCamp 2015 - Drupal 8 is Coming! Are You Ready?
 
Go at uber
Go at uberGo at uber
Go at uber
 
Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
Building a Next-gen Data Platform and Leveraging the OSS Ecosystem for Easy W...
 
GraphQL is actually rest
GraphQL is actually restGraphQL is actually rest
GraphQL is actually rest
 
Dynatech presentation for TSI Career Day
Dynatech presentation for TSI Career DayDynatech presentation for TSI Career Day
Dynatech presentation for TSI Career Day
 
Present and future of unified, portable, and efficient data processing with A...
Present and future of unified, portable, and efficient data processing with A...Present and future of unified, portable, and efficient data processing with A...
Present and future of unified, portable, and efficient data processing with A...
 
Handout: 'Open Source Tools & Resources'
Handout: 'Open Source Tools & Resources'Handout: 'Open Source Tools & Resources'
Handout: 'Open Source Tools & Resources'
 
GlueCon 2015 - How REST APIs can glue all types of devices together
GlueCon 2015 - How REST APIs can glue all types of devices togetherGlueCon 2015 - How REST APIs can glue all types of devices together
GlueCon 2015 - How REST APIs can glue all types of devices together
 
Creating a custom API for a headless Drupal
Creating a custom API for a headless DrupalCreating a custom API for a headless Drupal
Creating a custom API for a headless Drupal
 
Introduction to PHP (SDPHP)
Introduction to PHP   (SDPHP)Introduction to PHP   (SDPHP)
Introduction to PHP (SDPHP)
 
Data Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFixData Science in the Cloud @StitchFix
Data Science in the Cloud @StitchFix
 
Machine learning and big data @ uber a tale of two systems
Machine learning and big data @ uber a tale of two systemsMachine learning and big data @ uber a tale of two systems
Machine learning and big data @ uber a tale of two systems
 
AirBNB's ML platform - BigHead
AirBNB's ML platform - BigHeadAirBNB's ML platform - BigHead
AirBNB's ML platform - BigHead
 
Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
 Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa... Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
Bighead: Airbnb’s End-to-End Machine Learning Platform with Krishna Puttaswa...
 
The Fn Project: A Quick Introduction (December 2017)
The Fn Project: A Quick Introduction (December 2017)The Fn Project: A Quick Introduction (December 2017)
The Fn Project: A Quick Introduction (December 2017)
 
Apache Arrow at DataEngConf Barcelona 2018
Apache Arrow at DataEngConf Barcelona 2018Apache Arrow at DataEngConf Barcelona 2018
Apache Arrow at DataEngConf Barcelona 2018
 
Activity feeds (and more) at mate1
Activity feeds (and more) at mate1Activity feeds (and more) at mate1
Activity feeds (and more) at mate1
 
Design Summit - Technology Vision - Oleg Barenboim and Jason Frey
Design Summit - Technology Vision - Oleg Barenboim and Jason FreyDesign Summit - Technology Vision - Oleg Barenboim and Jason Frey
Design Summit - Technology Vision - Oleg Barenboim and Jason Frey
 
DevOps practices and tools of a small company in love with open source
DevOps practices and tools of a small company in love with open sourceDevOps practices and tools of a small company in love with open source
DevOps practices and tools of a small company in love with open source
 
Red hat infrastructure for analytics
Red hat infrastructure for analyticsRed hat infrastructure for analytics
Red hat infrastructure for analytics
 

Más de Neo4j

Más de Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Último

AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Último (20)

Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 

Visual, scalable, and manageable data loading to and from Neo4j with Apache Hop

  • 1. Neo4j and Apache Hop Future-proof data orchestration for Neo4j
  • 2.
  • 3. What is a graph database? ● A graph consists of nodes connected by relationships. ● Graph databases store data as graph structures (nodes and relationships)
  • 4. Property Graph Model Components ● Nodes ○ Represent the objects in the graph ○ Can be labeled CAR PERSON PERSON
  • 5. Property Graph Model Components ● Nodes ○ Represent the objects in the graph ○ Can be labeled ● Relationships ○ Relate nodes by type and direction CAR DRIVES LOVES LOVES LIVES WITH O W N S PERSON PERSON
  • 6. Property Graph Model Components ● Nodes ○ Represent the objects in the graph ○ Can be labeled ● Relationships ○ Relate nodes by type and direction ● Properties ○ Name-value pairs that can go on nodes and relationships CAR DRIVES name: “Dan” born: May 29, 1970 twitter: “@dan” name: “Ann” born: Dec 5, 1975 since: Jan 10, 2011 brand: “Volvo” model: “V70” LOVES LOVES LIVES WITH O W N S PERSON PERSON
  • 7. Cypher: Graph Query Language MATCH (:Person { name:"Dan"} ) -[:LOVES]-> (:Person { name:"Ann"} ) LOVES Dan Ann LABEL PROPERTY NODE NODE LABEL PROPERTY
  • 8. Neo4j Use Cases ● Social network analysis ● Recommendation engine ● Search & Discovery ● Network & Data Center ● Master Data Management ● Identity & Access Management ● GEO ● Fraud Detection ● … Blog series: Top 10 Neo4j use cases
  • 9. Neo4j ● The world’s leading graph database ● June 2021: raised $325M! ● On-premise, in the cloud or hybrid (Aura) ● Graph ML algorithms ● High performance ● Up to trillions of nodes and relationships
  • 10.
  • 11. Common Approaches ● Neo4j tools: ○ Load CSV ○ APOC ○ Neo4j ETL (RDBMS to graph) ○ Code (Python, Java, Go, …)
  • 12. Why Data Projects Fail...
  • 13.
  • 14. What is required? ● Good scope and project management ● Easy to develop, run, debug, troubleshoot (enable citizen developers) ● Error handling, logging, monitoring, lineage ● Testing (unit, integration, regression) ● Repeatable, reproducible ● Life Cycle Management ● Scale with data requirements, volume and project ● Integrate perfectly with DevOps ● Integrate perfectly with Neo4j
  • 15.
  • 16. Apache Hop ● Recursive acronym: Hop Orchestration Platform ● Orchestration: ○ Data: pipelines and workflows ○ Metadata: editing, handling, management,... ○ Insights: data/execution lineage, logging, … ○ Configurations: handling ecosystem complexity
  • 17. Apache Hop : background ● Community lead initiative ● ● New scalable GUI ● New metadata back-end ● Simplified toolset ● Code refactored, renamed, trimmed down, ... ● Extra plugins: Projects, Testing, Apache Beam, Debugging, ... ● …
  • 18. Apache Hop Incubator ● Forked PDI/Kettle 8.2 + WebSpoon + patches + plugins + … ● → Represents 20 years of development! ● Part of the ASF Incubator, close to TLP ● 1.0 early October! ● built continuously ● Active, quickly growing community https://hop.apache.org
  • 19. Why Apache Hop? ● A quickly diversifying technological landscape → Makes it hard to manage complexity → Drives the need for rapid innovation ● Development done independent from a single large corporation ● By and for data orchestration professionals
  • 20. Guiding principles Apache Hop aims to make data orchestration better: ● Easy: setup, build, maintenance, deployment, … ● Fast: startup time, supporting Spark, Flink & DataFlow, ... ● Transparent: before, during and after execution ● Predictable: unit and integration testing ● Innovative: need for the latest tech (digital transformation) ● Best practices: support version control, testing, CI/CD, project, lifecycle management, ...
  • 21. Apache Hop : key architecture features ● Metadata driven: no code generation ● Modular pluggable architecture: scale back to <30MB ● Fast startup, minimal overhead ● Apache Beam with support for Apache Spark, Apache Flink and GCP DataFlow runners ● Version controlled documentation ● Ease of use: transparent naming and easy to use tools ● Integration test: critical components are tested daily with integration tests → runtime compatibility, stability, ...
  • 22. Apache Hop : key GUI features ● Pluggable GUI features ● Scalable interface for high DPI displays or visually impaired ● Perspectives for easy fast context switching ● Designed for web browsers and mobile users → Single click mode for faster navigation ● Full support for 4 platforms: Windows, OSX, Linux & Web ● Support for “dark mode” themes on Linux and OSX
  • 23. Apache Hop : key configuration features ● All GUI configuration options have a command line variant ● Single central system configuration JSON file ● Easy project and lifecycle environment configuration ● Configuration and metadata inheritance from other projects ● Standard docker container ● Stateless server supporting multi-tenancy ● Version control friendly setup
  • 24. Apache Hop : Q4 & 2022 roadmap ● Graduation to Apache Top Level Project (TLP) ● Pluggable field expressions ● Java 11 (mirror Apache Beam) ● new execute/preview/debug GUI ● Improved cloud support ● Airflow runtime support ● Marketplace for 3rd party plugins ● ...
  • 25. Apache Hop : Community! ● Accepted in the ASF Incubator in Sept ‘20, ready to graduate to TLP ● Apache is a community building organisation ● Great communities deliver great software ● During incubation we are asked to → Grow the community → Release software the Apache way
  • 26. Apache Hop : Community ● No single company drives the software forward ● The Hop community is growing fast across all social media channels, chat server, … ● Anyone is welcome with ideas, code, bug fixes, suggestions, documentation, translations, … ● No bug is too small or too big to fix. ● No improvement suggestion is too small or too big to consider
  • 27. Apache Hop and Neo4j ● Best Neo4j (incl Aura!) support of any platform!! ● Functionality: ○ Neo4j logging perspective ○ Neo4j connection type, graph data type ○ 20+ action and transform plugins to write data to Neo4j, run Cypher, split graph in nodes and relationships etc
  • 28. Apache Hop : Enterprise support ● Lean With Data wants to help you! ● See: www.leanwithdata.com → Support → Training → Certification → Custom development → Data orchestration tool migration Lean Orchestration: professionally supported Apache Hop. Get the best of both worlds. Know.bi is a Lean With Data partner
  • 29. Demo
  • 30. Thank you for your interest and time! @ApacheHop, @know_bi https://www.linkedin.com/company/apachehop https://www.linkedin.com/company/knowbi https://chat.project-hop.org