In this session, learn how the Intel team of software engineers and data scientists, in collaboration with the Michael J Fox Foundation, built a big data analytics platform using Hadoop and other IoT technologies. The solution leverages wearable sensors and smartphone application to monitor PD patient's motor activities, 24/7. The platform collects and processes large stream of data, and enables different analytics services such as activity recognition and different PD related measurements to researchers. These machine learning algorithms are used to detect patterns in the data that can help researchers understand the progression of the disease and develop effective treatments. You leave with a comprehensive view of the tools and platforms from Intel that you can use in building your own applications on AWS. In addition there will be a deeper dive to explain the way this platforms enables near real- time analytics as part of the ingestion process.
Parkinson's Disease-a neuromuscular disease that causes gradually worsening symptoms such as tremors, difficulty in movement, and sleep loss -affects over 5 million people worldwide. Because the symptoms vary from individual to individual, research into the disease is hampered by the lack of objective data. As is typical of many healthcare applications, the collection, storage, and analysis of data is complex, expensive, and time-consuming. Intel is tackling this challenge by building a solution that uses wearable devices to collect data from patients anonymously and store it securely.
Sponsored by Intel.
3. The Hypothesis / Opportunity
The problem –
PD Big-Data is not really available
Solution
•Enable breakthroughs in Parkinson disease research through Big Data analytics
•Small disparate sources of data
•Most data is limited and unavailable
•Instrument PD patients with wearable devices for large scale, continuous 24 X 7 data collection
4. Patients are not able to objectively evaluate their condition
No Objective measure of Parkinson disease symptoms
Cost of trials are in the scales of $M and they take several years to complete
Very small number of patients contribute to research
Researchers can not scale to large N due to technology limitations
5. 30
subjects
5
Days per Subject
0.15TB
Per Subject per Day
500
subjects
30
Days per Subject
1GB
Per Subject per Day
15TB
Every month
1000
subjects
365
Days per Subject
Per Subject per Day
365TB
Every year
41. Configuration & MetaData
Web Services
Data Storage
Analytics
Rule Engine
Web
Site
MQTT
Node.js
Express
Angular.js
Bootstrap
Node.js
Express
MongoDB
Message Broker
Scala
Java
Akka
Spray
Apache Kafka
Apache Phoenix
Spark
R
ElasticLoad Balancing
CDH 5.2
YARN
MapReduce 2
HBase (time series)
InfiniDB for AWS (aggregates)
Cache
Redis
Deployment:
AWS Cloud Formation
OpsCodeChef
Amazon VPC
Monitoring:
Nagios/ Zabbix
Logging:
Logstash
Auto Scaling
42. Intel announced a comprehensive developer program for hobbyists, students and entrepreneurial developers with outreach, training and tools required to rapidly develop, test and deploy applications for the Internet of Things.
•Package of easy to use hardware, software & tools, services
•Global HackathonChallenge with prizes
•20 City IoT Roadshow distributing 5,000 kits
•University Program with courseware and labs starting with Carnegie Mellon
•On-line community for learning, building sharing
See Edison Live at the Intel Booth
43.
44.
45.
46. You can use this platform to collect data and build your own solution with Edison!