3. A quick intro to Beaconstac
Beaconstac is a proximity marketing and analytics platform for beacons
Several beacon-specific events are defined to aid proximity marketing
The events include camp-on, beacon exit, region enter, and region exit
The Beaconstac analytics platform makes it easy for managers, marketers, and developers to analyze event data
Components include the Beaconstac iOS/Android SDKs and the Beaconstac portal
4. Why Hadoop?
Collect event logs generated by Beaconstac SDK usage
Needed a system to answer queries like:
o Heat map of beacons by the number of visits received in a specified time interval (see the Python sketch after this list)
o Heat map of beacons by the amount of time spent in a specified time interval
o Average time spent by users near different beacons
o Last seen per user
o Last seen per beacon
o Analyzing data with custom attribute filters
o Path traversed in an area by individual users
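
As a concrete example, the visits heat map maps naturally onto a Hadoop Streaming job. Below is a minimal Python sketch of a mapper and reducer that count visits per beacon; the tab-separated log layout (timestamp, user_id, beacon_id, event_type) and the 'camp_on' event label are illustrative assumptions, not the actual Beaconstac log schema.

#!/usr/bin/env python
# mapper.py - emits "beacon_id<TAB>1" for every camp-on event.
# Assumed (hypothetical) input line: timestamp<TAB>user_id<TAB>beacon_id<TAB>event_type
import sys

for line in sys.stdin:
    fields = line.rstrip('\n').split('\t')
    if len(fields) < 4:
        continue  # skip malformed lines
    timestamp, user_id, beacon_id, event_type = fields[:4]
    # A visit begins when a user camps on a beacon; restricting to a time
    # interval would be an extra check on the timestamp field here.
    if event_type == 'camp_on':
        print('%s\t1' % beacon_id)

#!/usr/bin/env python
# reducer.py - sums the counts per beacon. Hadoop Streaming sorts mapper
# output by key, so all lines for one beacon arrive consecutively.
import sys

current_beacon, count = None, 0
for line in sys.stdin:
    beacon_id, value = line.rstrip('\n').split('\t')
    if beacon_id != current_beacon and current_beacon is not None:
        print('%s\t%d' % (current_beacon, count))
        count = 0
    current_beacon = beacon_id
    count += int(value)
if current_beacon is not None:
    print('%s\t%d' % (current_beacon, count))

The other queries follow the same pattern with different keys and values, e.g. keying on user_id for last seen per user.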
5. Leveraging Amazon's EMR for Beaconstac Analytics
Amazon's Streaming API for writing mapper and reducer functions in Python
Input – Copy the mapper/reducer programs to Amazon S3
Output – Copy the processed output data back to S3
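
With boto3 (the AWS SDK for Python), both copies are one-liners; the bucket and key names below are placeholders:

import boto3

s3 = boto3.client('s3')

# Upload the Streaming scripts so the EMR job can fetch them.
s3.upload_file('mapper.py', 'beaconstac-emr', 'scripts/mapper.py')
s3.upload_file('reducer.py', 'beaconstac-emr', 'scripts/reducer.py')

# After the job finishes, pull an output part file back down.
s3.download_file('beaconstac-emr', 'output/part-00000', 'part-00000')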
Initial tests were run using Amazon's EMR console, where you can define the following:
1) Cluster configuration – Name, termination protection, logging, log location on S3, etc.
2) Software configuration – Hadoop AMI version, applications to be installed on startup, etc.
3) Hardware configuration – Types of nodes: master, core, and task
4) Security keys, allowed users
5) Bootstrap actions – Configure Hadoop, custom actions, etc.
6) Steps – Streaming program, Hive program, Pig program
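
The same six sections can also be supplied programmatically. Below is a hedged boto3 sketch of run_job_flow that mirrors the console configuration; the cluster name, bucket paths, instance types, and key pair are placeholders, and newer EMR clusters take a release label where the console above used a Hadoop AMI version:

import boto3

emr = boto3.client('emr', region_name='us-east-1')

response = emr.run_job_flow(
    # 1) Cluster configuration: name and logging location on S3
    Name='beaconstac-analytics',
    LogUri='s3://beaconstac-emr/logs/',
    # 2) Software configuration (release label instead of a Hadoop AMI version)
    ReleaseLabel='emr-5.36.0',
    # 3) Hardware configuration, plus 4) the EC2 key pair for SSH access
    Instances={
        'MasterInstanceType': 'm4.large',
        'SlaveInstanceType': 'm4.large',
        'InstanceCount': 3,
        'Ec2KeyName': 'my-key-pair',
        'KeepJobFlowAliveWhenNoSteps': False,
        'TerminationProtected': False,  # 1) termination protection
    },
    # 5) Bootstrap actions would go in a BootstrapActions=[...] argument.
    # 6) Steps: one Hadoop Streaming step wiring up the Python scripts
    Steps=[{
        'Name': 'beacon-visit-counts',
        'ActionOnFailure': 'TERMINATE_CLUSTER',
        'HadoopJarStep': {
            'Jar': 'command-runner.jar',
            'Args': [
                'hadoop-streaming',
                '-files', 's3://beaconstac-emr/scripts/mapper.py,'
                          's3://beaconstac-emr/scripts/reducer.py',
                '-mapper', 'mapper.py',
                '-reducer', 'reducer.py',
                '-input', 's3://beaconstac-emr/input/',
                '-output', 's3://beaconstac-emr/output/',
            ],
        },
    }],
    JobFlowRole='EMR_EC2_DefaultRole',
    ServiceRole='EMR_DefaultRole',
)
print(response['JobFlowId'])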
9. How Does AWS Data Pipeline Work?
Pipeline definition – specifies the business logic of your data management.
AWS Data Pipeline web service – interprets the pipeline definition and assigns tasks to workers to move and transform data.
Task runner – polls the AWS Data Pipeline web service for tasks and then performs those tasks.
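
A bare-bones task runner is essentially a poll loop against the web service. The boto3 sketch below shows the shape of that loop; the worker group name and run_task body are hypothetical placeholders, and AWS's own Task Runner additionally reports progress and heartbeats:

import boto3

dp = boto3.client('datapipeline', region_name='us-east-1')
WORKER_GROUP = 'beaconstac-workers'  # hypothetical; must match the pipeline definition

def run_task(task):
    # Placeholder: inspect task['objects'] and perform the copy or
    # transform the pipeline definition describes for this task.
    pass

while True:
    # Long-polls the service; the response has no taskObject when no work is ready.
    resp = dp.poll_for_task(workerGroup=WORKER_GROUP)
    task = resp.get('taskObject')
    if task is None:
        continue
    try:
        run_task(task)
        dp.set_task_status(taskId=task['taskId'], taskStatus='FINISHED')
    except Exception as exc:
        dp.set_task_status(taskId=task['taskId'], taskStatus='FAILED',
                           errorId='TaskFailed', errorMessage=str(exc))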
10. Morpheus version of the data pipeline
The pipeline (shown as a diagram in the deck) chains three stages:
Copy logs from Kafka to S3
o Runs every hour
o Requires a Kafka consumer script (sketched below)
Run EMR jobs
o Runs once every day
o Processes each job and produces output
o Each job comprises mapper and reducer scripts
Copy the output to Elasticsearch
o Runs once every day
o Inserts the output into Elasticsearch
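
For the first stage, the Kafka consumer script can be as small as the sketch below: it drains the topic once per hourly run and writes the batch as one S3 object. It uses the kafka-python client, and the topic, broker, and bucket names are assumptions:

import boto3
from datetime import datetime, timezone
from kafka import KafkaConsumer  # kafka-python client

consumer = KafkaConsumer(
    'beacon-events',                       # hypothetical topic name
    bootstrap_servers=['localhost:9092'],
    group_id='s3-copier',                  # committed offsets let hourly runs resume
    auto_offset_reset='earliest',
    consumer_timeout_ms=10000,             # stop iterating once the topic is drained
)

# Collect whatever has accumulated since the last hourly run.
batch = [msg.value.decode('utf-8') for msg in consumer]

if batch:
    key = 'logs/%s.log' % datetime.now(timezone.utc).strftime('%Y-%m-%d-%H')
    boto3.client('s3').put_object(
        Bucket='beaconstac-event-logs',    # hypothetical bucket
        Key=key,
        Body='\n'.join(batch).encode('utf-8'),
    )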