In this talk I will show data engineers and architects how to run real-time TensorFlow Inception Image Recognition on images captured by remote sensors and images in tweets and facebook posts.
In the same flow I will also demonstrate how to apply real-time sentiment analysis and intelligent routing of data to Phoenix, Email and Slack.
I will elaborate on a number of different sentiment analysis frameworks available for use within Apache NiFi including Python NLTK, Stanford CoreNLP, Python SpaCy and Python TextBlob.
This talk will be a deep dive into how to manage complex dataflow pipelines ingesting from multiple streaming sources including social, public open data feeds, logs, drones, RDBMS and IoT with transformations, deep learning, machine learning and business rules.
Data engineers will be shown the power of Apache NiFi for loading diverse sources of data, applying transformations in-stream, routing based on attributes, adding sentiment data to workflows, running deep learning algorithms in stream and
storing data into Apache Phoenix on HBase and Apache Hive as ORC tables.
In this talk, I will walk through each step in the process from ingest of each source, applying filters, performing transformations, converting types, picking and converting fields and finally storing data to Apache Phoenix on HBase.
A quick data analysis to show streaming updates to data will be done in Apache Zeppelin running on HDP 2.x.
This is based on a few talks I have given at the Future of Data - Princeton meetup on various ingestion and processing patterns with Apache NiFi.
Speaker:
TImothy Spann, Solutions Engineer, Hortonworks
Monitor Time
Follow—ups
Q/A at end
Defer additional questions to later, we are short on time
Ingest – multiple options, different types of data (rdbms, streams, files)
HDF, Sqoop, Flume, Kafka
Streaming
Script vs UI + Mgmt. Data Movement tool. Streamlined.
Monitor Time
Follow—ups
Q/A at end
Defer additional questions to later, we are short on time
Ingest – multiple options, different types of data (rdbms, streams, files)
HDF, Sqoop, Flume, Kafka
Streaming
Script vs UI + Mgmt. Data Movement tool. Streamlined.
Monitor Time
Follow—ups
Q/A at end
Defer additional questions to later, we are short on time
Ingest – multiple options, different types of data (rdbms, streams, files)
HDF, Sqoop, Flume, Kafka
Streaming
Script vs UI + Mgmt. Data Movement tool. Streamlined.
Monitor Time
Follow—ups
Q/A at end
Defer additional questions to later, we are short on time
Ingest – multiple options, different types of data (rdbms, streams, files)
HDF, Sqoop, Flume, Kafka
Streaming
Script vs UI + Mgmt. Data Movement tool. Streamlined.
Monitor Time
Follow—ups
Q/A at end
Defer additional questions to later, we are short on time
Ingest – multiple options, different types of data (rdbms, streams, files)
HDF, Sqoop, Flume, Kafka
Streaming
Script vs UI + Mgmt. Data Movement tool. Streamlined.