5. This is a GPU
(graphical processing unit)
Deep Learning needs Big Data and GPUs
•Lots of data is needed to train
machine learning (ML) models
Big Data
Analytics
Features
DL
16. Data Problem
“Data is the hardest part of ML and the most important
piece to get right. Modelers spend most of their time
selecting and transforming features at training time and
then building the pipelines to deliver those features to
production models. Broken data is the most common cause
of problems in production ML systems”
– Uber
16
Data
Collection
Experimentation Training Serving
Feature
Extraction
Data
Transformation
& Verification
TestFEATURE STORE
17. Hopsworks’ Feature Store
- Reusability of features
between models and
teams
- Automatic backfilling of
features
- Automatic feature
documentation and
analysis
- Feature versioning
- Standardized access of
features between training
and serving
- Feature discovery
17
21. •The Storage Layer: For storing feature data in the feature store
•The Metadata Layer: For storing feature metadata (versioning, feature
analysis, documentation, jobs)
•The Feature Engineering Jobs: For computing features
•The Feature Registry: A user interface to share and discover features
•The Feature Store API: For writing/reading to/from the feature store
Building Blocks of a Feature Store
21
35. Filesystems are not good enough
Uber on Petastorm:
“[Using files] is hard to implement at large scale, especially
using modern distributed file systems such
as HDFS and S3 (these systems are typically optimized for
fast reads of large chunks of data).”
https://eng.uber.com/petastorm/
35
36. NVMe Disks – Game Changer
•HDFS (and S3) are designed around large blocks
(optimized to overcome slow random I/O on disks), while
new NVMe hardware supports orders of magnitude
faster random disk I/O.
•Can we support faster random disk I/O with HDFS?
-Yes with HopsFS.
36
57. Summary
•The future of Deep Learning is Distributed
https://www.oreilly.com/ideas/distributed-tensorflow
•Hopsworks is a new Data and AI Platform with
first-class support for
Python / Deep Learning / ML / Data Governance / GPUs
57