.NET Fest 2019. Olia Gavrysh. Machine Learning for .NET Developers

Talk topic
KYIV 2019
Machine Learning for .NET developers
.NET CONFERENCE #1 IN UKRAINE
Olia Gavrysh

.NET LEVEL UP
About me
Olia Gavrysh
Program Manager
Microsoft, .NET team
twitter: @oliagavrysh

Let me learn something about you…

When you start Machine Learning
without calculus

Agenda
1. Machine Learning crash course
2. Building an ML model for your .NET app
with ML.NET

Machine Learning
crash course

© Microsoft Corporation
Examples
Predict prices for the next month
Identify faces in images and videos
Detect fraudulent transactions
Classify if a customer is at retention risk
Some problems are difficult to solve using traditional algorithms and
procedural programming.
These examples are good candidates for machine learning.

When can you use Machine Learning?
A repetitive decision or process
Solution lacks an explicit definition
A lot of historic data

Machine Learning
“Programming the UnProgrammable”
Inputs x:
rooms, bedrooms, bathrooms
location, view, near school
footage
year built
garage, basement, patio
…
→ f(x)

Many ML Tasks
Is this A or B? → Classification
How much? How many? → Regression
How is this organized? → Clustering
And many more…

How ML works
ŷ = f(x)
Fcost = |y - ŷ| → 0
ŷ – our model's prediction
y – actual values (known answers)
Fcost – shows the difference between your
prediction and the actual values
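The idea of driving the cost towards zero can be sketched in a few lines. This is an illustrative sketch in plain Python, not ML.NET code: the toy data, the one-parameter model f(x) = w · x and the brute-force search over w are all assumptions made for the example.

```python
# Toy illustration of ŷ = f(x) and Fcost = |y - ŷ| (averaged over all examples).
# Hypothetical data: x = trip distance, y = known fare.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]


def predict(w, x):
    """The model f(x): a single weight w times the input."""
    return w * x


def cost(w):
    """Mean absolute difference between actual values y and predictions ŷ."""
    return sum(abs(y - predict(w, x)) for x, y in zip(xs, ys)) / len(xs)


# "Training" here is a brute-force search: try candidate weights from 0.0 to 4.0
# in steps of 0.1 and keep the one with the lowest cost.
candidates = [i / 10 for i in range(0, 41)]
best_w = min(candidates, key=cost)
best_cost = cost(best_w)
```

Real training algorithms replace the brute-force search with something far more efficient, but the goal is the same: find the parameters that make the cost as small as possible.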

Creating ML Model
Build → Train → Evaluate → Use

Building Model
Build
1. Upload Data
2. Prepare Data
3. Choose Algorithm

Training Model
Running the chosen
algorithm on the data.
Train

Evaluating Model
Calculate metrics that show how
good the model is, using test data.
If it is not good enough, go back to the Build phase.
Evaluate
All metrics: https://docs.microsoft.com/dotnet/machine-learning/resources/metrics
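A minimal sketch of the Train and Evaluate steps, written in plain Python rather than ML.NET. The data, the 80/20 split and the helper names are hypothetical; mean absolute error and R-squared are used here simply as two common regression metrics.

```python
# Hold out part of the data for evaluation; the model never sees it during training.
# Hypothetical data: (distance, fare) pairs that follow fare = 3 + 2 * distance.
data = [(float(d), 3.0 + 2.0 * d) for d in range(1, 11)]
train, test = data[:8], data[8:]


def fit_line(points):
    """Ordinary least squares for y = a + b * x on a list of (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    b = sum((x - mean_x) * (y - mean_y) for x, y in points) / sum(
        (x - mean_x) ** 2 for x, _ in points
    )
    return mean_y - b * mean_x, b


def mean_absolute_error(points, a, b):
    """Average size of the prediction error, in the units of y."""
    return sum(abs(y - (a + b * x)) for x, y in points) / len(points)


def r_squared(points, a, b):
    """Share of the variance in y explained by the model (1.0 is a perfect fit)."""
    mean_y = sum(y for _, y in points) / len(points)
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in points)
    ss_tot = sum((y - mean_y) ** 2 for _, y in points)
    return 1.0 - ss_res / ss_tot


a, b = fit_line(train)                  # Train on the training rows only
mae = mean_absolute_error(test, a, b)   # Evaluate on the held-out rows
r2 = r_squared(test, a, b)
```

The toy data is noise-free, so the metrics come out perfect; on real data they will not, and a poor score is the signal to go back to the Build phase.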

Consuming Model
Consume in your client
applications.
Use

Building a model with ML.NET

ML.NET
Machine Learning framework for building custom ML Models
Built for .NET developers
Proven at scale: Azure, Office, Windows
Extensible: TensorFlow, ONNX and Infer.NET
Cross-platform and open-source: runs everywhere
Custom ML made easy with tools: CLI + UI-based tool for building models

In data science 80% of the time is spent
on preparing the data and 20% of the time
is spent on complaining about the need to
prepare the data.

ML.NET Tooling
to help you make decisions
AutoML
Model Builder

How much is the taxi fare for 1 passenger going from Boryspil to Kreschatyk?

Getting started with Machine Learning can be hard
Which features? Distance, Trip time, Car type, Passengers, Time of day, …
Which algorithm? Gradient Boosted, Nearest Neighbors, SGD, Bayesian Regression, LGBM, …
Which parameters? Criterion, Loss, Min Samples Split, Min Samples Leaf, …
Example: Gradient Boosted on Distance, Car type, Passengers → Model: 30%
ML.NET takes care of data prep, feature selection & hyperparameter tuning

Getting started with machine learning can be hard
Iterate: try other features, algorithms and parameters
First attempt: Gradient Boosted (Criterion, Loss, Min Samples Split, Min Samples Leaf, …) on Distance, Car type, Passengers → Model: 30%
Next attempt: Nearest Neighbors (N Neighbors, Weights, Metric, P, …) on Car brand, Year of make, Car type, Passengers, Trip time → Model: 50%

Getting started with machine learning can be hard
Iterate: many more attempts, with scores ranging from 10% to 95%

ML.NET accelerates model development
Input: Enter data, Define goals, Apply constraints
→ Intelligently test multiple models in parallel
→ Optimized model (95%)
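The search that this kind of tooling automates can be sketched as a loop over candidate models. This is a simplified illustration in plain Python, not how ML.NET's AutoML is implemented: the three candidate models, the taxi data and the scoring by mean absolute error are all hypothetical, and the candidates are tried one after another rather than in parallel.

```python
# Hypothetical taxi data: (distance, fare) pairs with fare = 2 + 1.5 * distance.
data = [(float(d), 2.0 + 1.5 * d) for d in range(1, 13)]
train, valid = data[::2], data[1::2]   # alternate rows: training vs. validation


def fit_mean(points):
    """Baseline: always predict the average training fare."""
    mean_y = sum(y for _, y in points) / len(points)
    return lambda x: mean_y


def fit_rate(points):
    """Fare proportional to distance: a single rate with no fixed charge."""
    rate = sum(y for _, y in points) / sum(x for x, _ in points)
    return lambda x: rate * x


def fit_line(points):
    """Least-squares straight line: fixed charge plus a rate per unit of distance."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    b = sum((x - mean_x) * (y - mean_y) for x, y in points) / sum(
        (x - mean_x) ** 2 for x, _ in points
    )
    a = mean_y - b * mean_x
    return lambda x: a + b * x


def mean_absolute_error(model, points):
    return sum(abs(y - model(x)) for x, y in points) / len(points)


candidates = {"mean fare": fit_mean, "rate only": fit_rate, "straight line": fit_line}

# Train every candidate on the same data, score it on rows it has not seen,
# and keep the one with the lowest error.
scores = {name: mean_absolute_error(fit(train), valid) for name, fit in candidates.items()}
best_name = min(scores, key=scores.get)
```

An automated tool additionally searches over features and hyperparameters for each algorithm, which is what makes doing this by hand so tedious.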

Demo
ML.NET

Resources
http://dot.net/ml
http://aka.ms/mlnetsamples
http://aka.ms/mlnetdocs
http://aka.ms/mlnet

Thank you!
twitter: @oliagavrysh

How long to train
Dataset size     Dataset type       Avg. time to train
0 - 10 MB        Numeric and text   10 sec
10 - 100 MB      Numeric and text   10 min
100 - 500 MB     Numeric and text   30 min
500 MB - 1 GB    Numeric and text   60 min
1 GB+            Numeric and text   3 hours+
The exact time to train depends on several factors, such as:
• The number of features (columns) used for prediction
• The type of columns, i.e. text vs. numeric
• The type of machine learning task (e.g. regression vs. classification)
We have tested Model Builder with datasets as large as 1 TB, but building a high-quality
model for a dataset of that size can take up to four days.

1. Supervised and unsupervised learning
2. Types of ML problems (https://docs.microsoft.com/en-us/dotnet/machine-learning/tutorials/index)
   Pictures for different problem types: https://docs.microsoft.com/en-us/dotnet/machine-learning/automate-training-with-model-builder
3. Data prep: https://docs.microsoft.com/en-us/dotnet/machine-learning/how-to-guides/prepare-data-ml-net
4. Parameters, hyperparameters, labels
5. Training set, evaluation set. Cross-validation
6. Success metrics: accuracy, … https://github.com/dotnet/machinelearning-samples/blob/master/modelbuilder/readme.md#evaluate
   All metrics here: https://docs.microsoft.com/en-us/dotnet/machine-learning/resources/metrics
7. How to improve results: more time, more/better data (the latter is where data
   scientists are needed)
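Cross-validation, mentioned in the notes above, can be sketched as follows. This is a generic k-fold illustration in plain Python, not ML.NET's implementation; the function name and the choice of 10 rows and 3 folds are hypothetical.

```python
def k_fold_splits(n_rows, k):
    """Yield (train_indices, test_indices) for each of k folds."""
    indices = list(range(n_rows))
    fold_size, remainder = divmod(n_rows, k)
    start = 0
    for fold in range(k):
        # The first `remainder` folds get one extra row so every row is tested once.
        stop = start + fold_size + (1 if fold < remainder else 0)
        test_idx = indices[start:stop]
        train_idx = indices[:start] + indices[stop:]
        yield train_idx, test_idx
        start = stop


# 10 rows split into 3 folds: each row lands in exactly one test set.
splits = list(k_fold_splits(10, 3))
```

For each split, a model is trained on the training rows and scored on the test rows, and the k scores are averaged. Practical implementations usually shuffle the rows first; that step is left out here to keep the sketch deterministic.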

Price prediction step by step (one-hot encoder, …): https://docs.microsoft.com/en-us/dotnet/machine-learning/tutorials/predict-prices
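One-hot encoding turns a categorical column into numeric 0/1 columns that a learning algorithm can consume. A minimal sketch in plain Python follows; the function name and the payment-type values are hypothetical, and this is not the ML.NET transform itself.

```python
def one_hot_encode(values):
    """Map each categorical value to a 0/1 vector containing a single 1."""
    categories = sorted(set(values))   # fixed, reproducible column order
    index = {category: i for i, category in enumerate(categories)}
    encoded = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1
        encoded.append(row)
    return categories, encoded


# Hypothetical payment-type column from a taxi fare dataset.
categories, encoded = one_hot_encode(["cash", "card", "cash", "voucher"])
```

Each distinct value gets its own column, so the model does not read an artificial ordering into the categories, as it would if they were simply numbered 1, 2, 3.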

Difference between machine learning and AI:
• If it’s written in Python, it’s probably machine learning
• If it’s written in PowerPoint, it’s probably AI
