REAL TIME STREAMING ANALYTICS @ FORD
June 13, 2017
Agenda
•Original Problem Statement
•Architecture Components
•Data Journey
•Challenges
•Live Demo – Streaming from Dearborn
•RTSA Roadmap & Vision
Product Vision / Mission Statement
•Experiments (BDD 2.0)
• No platform to do ‘Streaming’ Experiments
• How do we enable ‘Self-Service’ Streaming?
•Utility Ingestion
• The existing Storm solution would not scale operationally the way it had been implemented.
• Today, application teams develop their own one-off ingestion solutions to deal with proxy and firewall rules. How do we reduce the exposed surface area while handling multiple types of ingest?
SCA-V / BDD BUSINESS VALUE
BDD (Big Data Drive) drives value across the enterprise today and in the future
Pillar 1: Collection
Pillar 2: Configuration
Pillar 3: Edge Analytics
Enables
• Off cycle credit validation
• Intelligent Customer Interactions
• Vehicle performance insights
• Customer specific city solutions
• Fleet based telematics
• Warranty reduction across fleets
• Powertrain fuel efficiency improvement
• Automotive cybersecurity
• High-touch customer / dealer engagement
• Product feature validation
• Vehicle feature deployment
• Product development lifecycle reduction
• Vehicle diagnostic and prognostic enhancements
[Diagram: Multi-Platform Data and Analytics Ecosystem. Labels: SCA-V (Single Complete Actionable Vehicle), SCA-C (Single Complete Actionable Customer), other; Landing Zone; Discovery Zone; Data Supply Chain; Data and Analytics Ecosystem]
BDD 2.0 ACCOMPLISHMENTS: A THIN SLICE
• Development leverages the product team approach, which promotes cross-functional partnerships in FordLabs, PD, IT and GDI&A
• Developed the first edge computing platform, which emulates the fully networked vehicle 1 and 2 (FNV-1/FNV-2) and provides production-grade, web-based software to support this vehicle platform
• Created the first real-time streaming application in the enterprise
• Represents a significant shift toward data-driven decision making by leveraging rich, connected vehicle data. The solution includes Natural Language Search, Real Time Streaming, vehicle architecture agnosticism, software deployable anywhere (ePID2.0, TCU, Sync, ECG), and rapid vehicle data validation processes
• The platform can accommodate a diverse set of vehicles across the fleet
With BDD, we created a cloud-agnostic, Ford-owned and Ford-managed real-time streaming solution
Real Time Streaming Analytics – Conceptual
Real-time streaming is an incremental capability over traditional batch processing to ingest, transform and score individual streams of real-time data.
Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch and stream-processing methods.
[Diagram: Routing, Pub/Sub, Processing, Analyze, Store. Real-Time: the model is executed. Batch: model is trained, optimized and deployed; historical persistence.]
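The Lambda architecture described above can be illustrated with a toy example. The following is a minimal sketch, not the RTSA implementation: the class `LambdaStore`, its method names, and the per-vehicle event count are all hypothetical, and in-memory dictionaries stand in for the real batch and real-time stores.

```python
from collections import defaultdict


class LambdaStore:
    """Toy Lambda architecture: a batch view plus a real-time (speed) view.

    All names here are illustrative assumptions, not part of RTSA.
    """

    def __init__(self):
        self.master_log = []                # append-only history of all events
        self.batch_view = defaultdict(int)  # recomputed from the full log
        self.speed_view = defaultdict(int)  # incremental, recent events only

    def ingest(self, vehicle_id, count=1):
        # Each event is kept for later batch processing and is also
        # applied immediately to the speed view.
        self.master_log.append((vehicle_id, count))
        self.speed_view[vehicle_id] += count

    def run_batch(self):
        # Recompute the batch view from the whole log, then clear the
        # speed view because the batch view now covers those events.
        view = defaultdict(int)
        for vehicle_id, count in self.master_log:
            view[vehicle_id] += count
        self.batch_view = view
        self.speed_view = defaultdict(int)

    def query(self, vehicle_id):
        # A query merges the batch view with the speed view.
        return self.batch_view.get(vehicle_id, 0) + self.speed_view.get(vehicle_id, 0)
```

A query gives the same answer before and after a batch run; the batch run only changes which view holds the data.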
Real Time Streaming Analytics – Conceptual
RTSA – Analytics & Data Flow Life-Cycle
1 Real-time streaming data ingested, routed, transformed
2 Data passed from speed layer to batch/storage layer
3 Analytical apps consuming/producing data in the real-time speed layer
4 Historical data analyzed, models developed and trained
5 Trained analytical models deployed to the real-time speed layer
[Diagram: the conceptual diagram from the previous slide (Routing, Pub/Sub, Processing, Analyze, Store; Real-Time and Batch paths), annotated with steps 1 to 5 and the labels Apps, Data, Analytics, Speed]
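The five life-cycle steps can be sketched in a few lines. This is an illustration under stated assumptions, not the deck's actual models: the threshold "model" (mean plus `k` standard deviations), the function `train_threshold_model`, and the class `SpeedLayer` are all hypothetical, and a Python list stands in for the batch/storage layer.

```python
import statistics


def train_threshold_model(history, k=3.0):
    """Step 4: fit a simple anomaly threshold on historical readings."""
    mean = statistics.fmean(history)
    stdev = statistics.pstdev(history)
    return {"mean": mean, "limit": mean + k * stdev}


class SpeedLayer:
    """Hypothetical real-time layer that scores readings as they arrive."""

    def __init__(self):
        self.model = None
        self.persisted = []  # stand-in for the batch/storage layer

    def deploy(self, model):
        # Step 5: a trained model is deployed to the speed layer.
        self.model = model

    def process(self, reading):
        # Step 1: the reading has been ingested and routed to this layer.
        # Step 2: pass it on to the batch/storage layer.
        self.persisted.append(reading)
        # Step 3: an analytical app scores it, if a model is deployed.
        if self.model is None:
            return None
        return reading > self.model["limit"]
```

For example, training on `[90, 92, 88, 91, 89]` and deploying the result lets `process` flag a reading of 120 while accepting 93. Every reading is persisted regardless, so the next training run sees it.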
Demonstration
BDD Dashboard: http://bdd-vase.apps-q01.pcfqaecc.ford.com/#/
SAS ESP: RTSA
Real Time Streaming Analytics – Physical
Live Demo
Live Demo - Data Flow Narrative
1 Data from OpenXC ingested via Cloud Foundry WebSocket
2 Data routed from Cloud to Ford data center via NiFi
3 Specific data consumed by an analytical app
4 Data published to Kafka on prem
5 Data persisted in Hadoop on prem
[Diagram: Dynamic Stream Routing, annotated with steps 1 to 5. Labels: Vehicle, Intelligent Mobile Apps, EDGE/IoT, Public Internet, Azure CLOUD, WebSocket, EventHub/IoTHub, NiFi (x2), Apps XYZ (x2), Push (x2), Pull*, Firewall (x2), P M M L (x2), Ford Network and Data Center, HDFS, HBase]
*Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today.
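The data flow narrative can be traced with in-memory stand-ins. This sketch makes several assumptions: the class `InMemoryPipeline` and its attributes are hypothetical, the messages are assumed to be simple JSON objects with `name` and `value` fields, and plain Python containers replace the WebSocket endpoint, NiFi, Kafka and Hadoop. It shows only the order of the hops, not how any of those systems are called.

```python
import json
from collections import defaultdict, deque


class InMemoryPipeline:
    """Hypothetical stand-in for the five hops in the live demo narrative."""

    def __init__(self, app_signals):
        self.cloud_buffer = deque()      # step 1: messages received in the cloud
        self.app_inbox = []              # step 3: data consumed by one app
        self.topics = defaultdict(list)  # step 4: stand-in for Kafka topics
        self.storage = []                # step 5: stand-in for Hadoop
        self.app_signals = set(app_signals)

    def ingest(self, raw):
        # Step 1: accept a raw JSON message, as a WebSocket handler would.
        self.cloud_buffer.append(json.loads(raw))

    def route(self):
        # Step 2: move everything buffered in the cloud to the data center.
        moved = 0
        while self.cloud_buffer:
            msg = self.cloud_buffer.popleft()
            # Step 3: only the signals the app asked for reach its inbox.
            if msg["name"] in self.app_signals:
                self.app_inbox.append(msg)
            # Step 4: publish to a topic named after the signal.
            self.topics[msg["name"]].append(msg)
            # Step 5: persist a serialized copy.
            self.storage.append(json.dumps(msg, sort_keys=True))
            moved += 1
        return moved
```

Routing on a message attribute (here, the signal name) is the part that corresponds to "Dynamic Stream Routing" in the diagram; the rule used in the actual demo is not stated in the slides.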
Summary of Key Concepts
RTSA is…
•Fully developed, managed, and deployed by Ford
•We own the data at every step
•Fully cloud and data center agnostic
•Push and pull capable
•No additional Ford Data Center Exposure
•Horizontally scalable
With BDD (Big Data Drive), we created a cloud-agnostic, Ford-owned and Ford-managed real-time streaming solution
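One way to read "push and pull capable" together with "no additional Ford Data Center exposure" is that the data center initiates the transfer and fetches data from the cloud, so no inbound listener has to be opened. That reading is an assumption, and the sketch below is not the NiFi Site-to-Site API: `CloudOutbox` and `DataCenterPuller` are hypothetical names that model only the direction of the connection.

```python
from collections import deque


class CloudOutbox:
    """Cloud-side buffer that holds events until the data center fetches them."""

    def __init__(self):
        self._queue = deque()

    def push(self, event):
        # Producers in the cloud push events into the outbox.
        self._queue.append(event)

    def fetch(self, max_events):
        # Called by the data-center side; returns at most max_events events.
        batch = []
        while self._queue and len(batch) < max_events:
            batch.append(self._queue.popleft())
        return batch


class DataCenterPuller:
    """Data-center-side client that starts every transfer itself."""

    def __init__(self, outbox, batch_size=100):
        self.outbox = outbox
        self.batch_size = batch_size
        self.received = []

    def poll_once(self):
        # The request goes outbound from the data center to the cloud.
        batch = self.outbox.fetch(self.batch_size)
        self.received.extend(batch)
        return len(batch)
```

Because the puller controls the batch size and polling rate, it can also limit how fast data arrives, which a push-only design leaves to the sender.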
Roadmap
Ingestion, Transformation, Processing, and Persistence of Streaming Data in Real-Time
• RTSA product to provide foundational enterprise services:
–Data Ingest
–Data Processing
–Stream Routing
• Including Cloud to On-premise
–Analytics
–Data Persistence On-premise
Foundational services available in production environment Q1 for applications promoted from experiment status.
Other Opportunities
[Diagram: Dynamic Stream Routing architecture, repeated from the Live Demo slide without the step markers]
Other Opportunities
• NY FordHub Cisco Meraki WiFi
• Data started flowing 2/28 via RTSA
• Production infrastructure in Q1
[Diagram: Dynamic Stream Routing architecture as on the Live Demo slide, with a REST label added]
Other Opportunities??
[Diagram: Dynamic Stream Routing architecture with the REST label, as on the previous slide]
[Diagram: Dynamic Stream Routing architecture as on the Live Demo slide, extended with Third Party Data Sources, Third Party Data Consumers (as needed), and the labels REST, WebSocket, REST and MQTT]
Event and/or Streaming Data Made Available to Authorized Third Party Partners as needed
• DPF Regen
• Silver
• Security
• Plant Floor
• ControlTec
• LCV Telematics
• MiniFi
• Cisco Meraki
-Dealer WiFi
-Other Hubs
This Is The End
•Discussion
•Questions
Andrea Siudara
Tom Bryans
Melissa Richards
Kevin Cooper
RTSA Product Owner
Tracy Hewiit
Dan Totten
Core RTSA Organization
RTSA Product Organization
3/11/2017
Laura Churchill
PM
T Young
J Niemiec
G Gwidz
D Hickey
Jill Johnson
PM
Raju Doma
Delivery Supervisor
C Petras
E Ulicny
D Godwin
GDIA
Information Technology
GDIA
Smart Mobility Analytics
Appendix

Real Time Streaming Architecture at Ford

  • 1. REAL TIME STREAMING ANALYTICS @ FORD June 13, 2017 1
  • 2. •Original Problem Statement •Architecture Components •Data Journey •Challenges •Live Demo – Streaming from Dearborn •RTSA RoadMap & Vision Agenda 2
  • 3. 3 Product Vision / Mission Statement •Experiments (BDD 2.0) • No platform to do ‘Streaming’ Experiments • How do we enable ‘Self-Service’ Streaming? •Utility Ingestion • Existing Storm solution would not scale operationally the way it had been implemented. • Today applications developer their own one off ingestion solutions to deal with proxy and firewall rules. How do we reduce the surface area that is exposed while handling multiple types of ingest?
  • 4. SCA-V / BDD BUSINESS VALUE BDD (Big Data Drive) drives value across the enterprise today and in the future Pillar 1 Collection Pillar 2 Configuration Pillar 3 Edge Analytics Enables • Off cycle credit validation • Intelligent Customer Interactions • Vehicle performance insights • Customer specific city solutions • Fleet based telematics • Warranty reduction across fleets • Powertrain fuel efficiency improvement • Automotive cybersecurity • High-touch customer / dealer engagement • Product feature validation • Vehicle feature deployment • Product development lifecycle reduction • Vehicle diagnostic and prognostic enhacements
  • 5. 5 SCA-V (Single Complete Actionable Vehicle Landing Zone Discovery Zone Data Supply Chain Multi-Platform Data and Analytics Ecosystem Data and Analytics Ecosystem SCA-C (Single Complete Actionable Customer) other
  • 6. • Development leverages the product team approach which promotes cross- functional partnerships in FordLabs, PD, IT and GDI&A • Developed the first edge computing platform which emulates the fully networked vehicle-1 and 2 (FNV-1/FNV-2) and provides production grade web based software to support this vehicle platform • Created the first real-time streaming application in the enterprise • Represents a significant shift toward data-driven decision making by leveraging rich, connected vehicle data. The solution includes Natural Language Search, Real Time Streaming, vehicle architecture agnosticism, software deployable anywhere (ePID2.0, TCU, Sync, ECG), and rapid vehicle data validation processes • The platform can accommodate a diverse set of vehicles across the fleet With BDD, we created a cloud agnostic Ford owned and managed real time streaming solution 66 BDD 2.0 ACCOMPLISHMENTS: A THIN SLICE
  • 7. Real Time Streaming Analytics - Conceptual Real Time streaming is an incremental capability over traditional batch processing to ingest, transform and score individual streams of real time data Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch and stream-processing methods. Routing Pub/Sub Processing AnalyzeStore Real-Time Batch Model is trained, optimized and deployedHistorical persistence The model is executed
  • 8. Real Time Streaming Analytics – Conceptual 8 Routing Pub/Sub Processing AnalyzeStore Real-Time Batch Model is trained, optimized and deployedHistorical persistence The model is executed 1 2 3 Real Time Streaming Data ingested, routed, transformed Data passed from speed layer to batch/storage layer Analytical apps consuming/producing data in the real-time speed layer 4 Historical data analyzed, models developed and trained RTSA – Analytics & Data Flow Life-Cycle 5 Trained analytical models deployed to the real-time speed layer 1 2 3 4 5 Apps Data Analytics Speed
  • 10. Vehicle WebSocket NiFi Apps XYZ NiFi Pull* HDFS Push Push Apps XYZ Azure CLOUD *Native NiFi Site-2-Site HTTP Proxy Capability. Fixes Storm Endpoint Scaling Ops problem today. EventHub/IoTHub Ford Network and Data Center Firewall P M M L Firewall P M M L Intelligent Mobile Apps Public Internet EDGE/IoT Dynamic Stream Routing 10 1 2 3 Data from OpenXC ingested via Cloud Foundry WebSocket Data routed from Cloud to Ford data center via NiFi Specific data consumed by an analytical app 4 Data published to Kafka on prem Live Demo - Data Flow Narrative 5 Data persisted in Hadoop on prem 5 1 2 1 3 4 Live Demo Real Time Streaming Analytics – Physical HBase
  • 11. Summary of Key Concepts RTSA is…. •Fully developed, managed, and deployed by Ford •We own the data at every step •Fully cloud and data center agnostic •Push and pull capable •No additional Ford Data Center Exposure •Horizontally scalable 11 With BDD (Big Data Drive), we created a cloud agnostic Ford owned and managed real time streaming solution
  • 12. Roadmap: Ingestion, Transformation, Processing, and Persistence of Streaming Data in Real-Time. • RTSA product to provide foundational enterprise services: – Data ingest – Data processing – Stream routing (including cloud to on-premise) – Analytics – Data persistence (on-premise). Foundational services available in the production environment in Q1 for applications promoted from experiment status.
  • 13. Other Opportunities. [Diagram: the RTSA physical architecture, same labels as slide 10.]
  • 14. Other Opportunities: • NY FordHub Cisco Meraki WiFi • Data started flowing 2/28 via RTSA • Production infrastructure in Q1. [Diagram: the RTSA physical architecture, same labels as slide 10, with a REST label added.]
  • 15. Other Opportunities?? [Diagram: the RTSA physical architecture, same labels as slide 10, with a REST label added.]
  • 16. Third Party Data Sources; Third Party Data Consumers (as needed). Event and/or streaming data made available to authorized third-party partners as needed. • DPF Regen • Silver • Security • Plant Floor • ControlTec • LCV Telematics • MiniFi • Cisco Meraki (Dealer WiFi, Other Hubs). [Diagram: the RTSA physical architecture, same labels as slide 10, with REST, WebSocket and MQTT labels added.]
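With REST, WebSocket and MQTT all shown as ways in, one common approach is to normalize every inbound message into a single envelope before routing. The following is a minimal sketch of that idea, assuming JSON payloads on all three protocols; the `Envelope` type and the three adapter functions are hypothetical names for illustration and do not describe how RTSA actually handles ingest.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Envelope:
    """Protocol-neutral wrapper handed to the routing layer."""
    protocol: str   # "rest", "websocket" or "mqtt"
    source: str     # URL path, client id or MQTT topic
    payload: dict   # decoded JSON body


def from_rest(path, body):
    # REST: the request path identifies the source; body is JSON text.
    return Envelope("rest", path, json.loads(body))


def from_websocket(client_id, text_frame):
    # WebSocket: one JSON text frame per message.
    return Envelope("websocket", client_id, json.loads(text_frame))


def from_mqtt(topic, payload_bytes):
    # MQTT: payloads are raw bytes, so decode before parsing.
    return Envelope("mqtt", topic, json.loads(payload_bytes.decode("utf-8")))
```

Everything downstream of the adapters then deals with one type, so adding a new ingest protocol means adding one adapter rather than touching the routing or persistence code.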
  • 17. This Is The End •Discussion •Questions
  • 18. Andrea Siudara, Tom Bryans, Melissa Richards, Kevin Cooper, RTSA Product Owner, Tracy Hewiit, Dan Totten. Core RTSA Organization. RTSA Product Organization 3/11/2017. Laura Churchill PM, T Young, J Niemiec, G Gwidz, D Hickey, Jill Johnson PM, Raju Doma Delivery Supervisor, C Petras, E Ulicny, D Godwin. GDIA, Information Technology, GDIA Smart Mobility Analytics.

Editor's notes

  1. 1) Intro RTSA: Lambda. 2) BDD was to validate and instantiate the RTSA. 3) Demo: Live Drive; Oldie but goodies; Huey. 4) Vision. 5) Roadmap: production plans; NY Hub; BDD 2, continued support for experiments; Plant floor (FIS); Security; Silver; DPF regen; Dealer WiFi (Meraki).
  2. GDIA is building an enterprise single, complete and actionable data and analytics ecosystem, centered around SCA-C, focused on ingesting and curating Ford’s internal applications and warehouses and providing analytics-as-a-service opportunities. This important work can be accomplished with a shared vision and roadmap with IT. But as we enter the emerging world of connectivity-driven customer experience management and data-driven everything, the data and analytics ecosystem must expand to include other edge nodes, including the car. This integrated multi-platform data analytics ecosystem cannot be delivered by GDIA and IT alone. The partnership needs to be expanded to include PD. Winners are moving through build->measure->learn fastest. Our born into competitors understand this. Just another node. Not part of data analytics ecosystem. Emerging requires shared understanding with PD.
  3. Real-time analytics refers to analytics that can be accessed as data comes into a system. In general, the term analytics describes data patterns that provide meaning to a business or other entity; analysts extract valuable information by sorting through and analyzing that data. Vast amounts of data are flowing at high velocity over the wire today. Organizations that can process and act on this streaming data in real time can dramatically improve efficiencies and differentiate themselves in the market. Some additional bullet points for the ‘what is real time streaming’: Real-time data ingestion and analysis: the speed of today’s processing systems has moved from classical data-warehousing batch reporting to the realm of real-time processing and analytics. Real-time means near-zero latency and access to information whenever it is required.
  4. DPF Regen, Silver, Security, Plant Floor, Cisco Meraki Dealer WiFi, New York Hub