SlideShare una empresa de Scribd logo
1 de 23
Descargar para leer sin conexión
mlcourse.ai
Open Machine Learning Course
by OpenDataScience
Yury Kashnitskiy (@yorko)
Data Scientist @ KPN, Amsterdam
OpenDataScience. DataFest
OpenDataScience. Kaggle
mlcourse.ai. What we have for you
Syllabus
• 10 lectures
• Basic ML algorithms and their applications
• Assignments and in-class practice
• Competitions
• Individual projects
• Tutorials
More info here https://mlcourse.ai/roadmap
What makes it different
• Lots and lots of practice
• Theoretical understanding of
applied techniques
• Delving into competitions
• Your own projects
• Really vibrant community!
Roadmap/logistics
• All communication in ODS Slack, #mlcourse_ai
• https://mlcourse.ai/roadmap
• 10 assignments – ~10 credits each
• Projects, competitions, tutorials – up to 40 crd. each
• Current rating is here https://goo.gl/TGGr3b
• All materials are stored on GitHub https://github.com/Yorko/
mlcourse.ai and https://mlcourse.ai
• Top-100 participants will be mentioned on a special Wiki page
Toolbox
• Python
• Jupyter notebooks
• GitHub
• Docker (optional)
• Other libs like Vowpal Wabbit & Xgboost
• Instructions https://mlcourse.ai/prerequisites
Lecture 1
• Data analysis with Pandas
• Practice on first steps after
getting data
Lecture 2
• Visual data analysis with
Pandas and Seaborn
• Crucial plots for feature
exploration
• Practice on «drawing»
Lecture 3
• Foundations of Machine
Learning
• Supervised learning
• Decision trees
• k Nearest Neighbours
• Practice: first steps with
Scikit-learn
Lecture 4
• Linear classification models
• Regularization
• Cross-validation
• Practice on logistic regression
for a "real-world" task
Lecture 5
• Ensembles, random forest
• Feature importance
• Practice on random forest and
assessing feature importance
Lecture 6
• Regression task
• Linear and non-linear
regression models
• Practice on grasping core
ideas behind linear regression
Lecture 7
• Unsupervised Learning
• Principal Component Analysis
• Clustering
• Practice: clustering Samsung
Galaxy S3 sensor data into
types of human activity
Lecture 8
• Stochastic Gradient Descent
& Online learning
• Learning with a couple GB of
data
• Vowpal Wabbit
• Extracting simple features
from texts
• Practice: text classification
Lecture 9
• Time series
• Classical and modern
approaches
• Practice: ARIMA model,
Facebook Prophet
Lecture 10
• Gradient boosting: a modern
view
• Theoretical basis for gradient
boosting
• Best implementations
• Practice: beating a baseline in
a Kaggle Inclass competition
Regularization?
Assignments
• Full versions are announced
during course sessions https://
mlcourse.ai/assignments
• Demo versions are found in
course repo https://
github.com/Yorko/mlcourse.ai
• And in a Kaggle Dataset
mlcourse.ai https://
www.kaggle.com/kashnitsky/
mlcourse
Kaggle Inclass
• Alice - tracking visited websites
to distinguish Alice from all others
• Medium - predicting #claps for a
story on Medium
More info here https://mlcourse.ai/roadmap
Individual projects
• Throughout the whole course
• Straightforward instructions
• Your own data or just Kaggle
Datasets
• Peer review
• Very cool experience
More info here https://mlcourse.ai/roadmap
Project "Alice"
• A substitute for an individual
project if you don't have cool
ideas for one
• Clear instructions
• 6 weeks, 6 notebooks to
complete
• In cooperation with Yandex
and MIPT, specialization
"Machine Learning and Data
Analysis"
• Solutions are not shared
Tutorials
• Your own tutorials on pretty
much any topic around ML & DS
• Peer-voted
• Nice way to grasp something
yourself is to write a tutorial
More info here https://mlcourse.ai/roadmap
More info in Slack
#mlcourse.ai, pinned items
Good luck!
https://mlcourse.ai/news

Más contenido relacionado

Similar a mlcourse.ai, introduction, course overview

Next Generation Teaching and Learning
Next Generation Teaching and LearningNext Generation Teaching and Learning
Next Generation Teaching and LearningCharles Severance
 
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...IstvanKoren
 
HASTAC Scholars: Omeka and Digital Archives
HASTAC Scholars: Omeka and Digital ArchivesHASTAC Scholars: Omeka and Digital Archives
HASTAC Scholars: Omeka and Digital Archivesjkmcgrath
 
Getting Started With Omeka (DHSI 2015 Unconference)
Getting Started With Omeka (DHSI 2015 Unconference)Getting Started With Omeka (DHSI 2015 Unconference)
Getting Started With Omeka (DHSI 2015 Unconference)jkmcgrath
 
Building the Next Generation Teaching and Learning Environment
Building the Next Generation Teaching and Learning EnvironmentBuilding the Next Generation Teaching and Learning Environment
Building the Next Generation Teaching and Learning EnvironmentCharles Severance
 
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample TrackingBruce Kozuma
 
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)Digital Tools in The Classroom: Omeka Workshop (Northeastern University)
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)jkmcgrath
 
Rapid TIMES model development using git, agile and dashboards: reflections an...
Rapid TIMES model development using git, agile and dashboards: reflections an...Rapid TIMES model development using git, agile and dashboards: reflections an...
Rapid TIMES model development using git, agile and dashboards: reflections an...IEA-ETSAP
 
Open drupal DrupalCamp Gent 2018
Open drupal DrupalCamp Gent 2018Open drupal DrupalCamp Gent 2018
Open drupal DrupalCamp Gent 2018LimoenGroen
 
Importance of Developers to HE in the UK
Importance of Developers to HE in the UKImportance of Developers to HE in the UK
Importance of Developers to HE in the UKPaul Walk
 
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...GIS in the Rockies
 
Prototyping like it is 2022
Prototyping like it is 2022 Prototyping like it is 2022
Prototyping like it is 2022 Michael Yagudaev
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...Krishna-Kumar
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolElectronic Resources & Libraries
 
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...LIBER Europe
 

Similar a mlcourse.ai, introduction, course overview (20)

Learning Emerging Tech
Learning Emerging TechLearning Emerging Tech
Learning Emerging Tech
 
Course Intro.pdf
Course Intro.pdfCourse Intro.pdf
Course Intro.pdf
 
Next Generation Teaching and Learning
Next Generation Teaching and LearningNext Generation Teaching and Learning
Next Generation Teaching and Learning
 
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...
Requirements Bazaar powered by AngularJS and Polymer - Talk at Google Develop...
 
Hyun joong
Hyun joongHyun joong
Hyun joong
 
Datalake project
Datalake projectDatalake project
Datalake project
 
HASTAC Scholars: Omeka and Digital Archives
HASTAC Scholars: Omeka and Digital ArchivesHASTAC Scholars: Omeka and Digital Archives
HASTAC Scholars: Omeka and Digital Archives
 
Getting Started With Omeka (DHSI 2015 Unconference)
Getting Started With Omeka (DHSI 2015 Unconference)Getting Started With Omeka (DHSI 2015 Unconference)
Getting Started With Omeka (DHSI 2015 Unconference)
 
Building the Next Generation Teaching and Learning Environment
Building the Next Generation Teaching and Learning EnvironmentBuilding the Next Generation Teaching and Learning Environment
Building the Next Generation Teaching and Learning Environment
 
Remarks on MOOC's
Remarks on MOOC'sRemarks on MOOC's
Remarks on MOOC's
 
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking
2019-04-17 Bio-IT World G Suite-Jira Cloud Sample Tracking
 
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)Digital Tools in The Classroom: Omeka Workshop (Northeastern University)
Digital Tools in The Classroom: Omeka Workshop (Northeastern University)
 
Rapid TIMES model development using git, agile and dashboards: reflections an...
Rapid TIMES model development using git, agile and dashboards: reflections an...Rapid TIMES model development using git, agile and dashboards: reflections an...
Rapid TIMES model development using git, agile and dashboards: reflections an...
 
Open drupal DrupalCamp Gent 2018
Open drupal DrupalCamp Gent 2018Open drupal DrupalCamp Gent 2018
Open drupal DrupalCamp Gent 2018
 
Importance of Developers to HE in the UK
Importance of Developers to HE in the UKImportance of Developers to HE in the UK
Importance of Developers to HE in the UK
 
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...
2013 Education Track, Using Badges to Document Competency-Based Geospatial Le...
 
Prototyping like it is 2022
Prototyping like it is 2022 Prototyping like it is 2022
Prototyping like it is 2022
 
Create great cncf user base from lessons learned from other open source com...
Create great cncf user base from   lessons learned from other open source com...Create great cncf user base from   lessons learned from other open source com...
Create great cncf user base from lessons learned from other open source com...
 
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway ProtocolExposing Library Content with the NISO Metasearch XML Gateway Protocol
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
 
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...
Digital Humanities Clinics – Leading Dutch Librarians into DH. Lotte Wilms, N...
 

Más de Yury Kashnitsky

Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPYury Kashnitsky
 
Gender-unbiased BERT-based Pronoun Resolution
Gender-unbiased BERT-based  Pronoun ResolutionGender-unbiased BERT-based  Pronoun Resolution
Gender-unbiased BERT-based Pronoun ResolutionYury Kashnitsky
 
Time series forecasting with ARIMA
Time series forecasting with ARIMATime series forecasting with ARIMA
Time series forecasting with ARIMAYury Kashnitsky
 
Необычные модели Playboy, или про поиск аномалий в данных
Необычные модели Playboy, или про поиск аномалий в данныхНеобычные модели Playboy, или про поиск аномалий в данных
Необычные модели Playboy, или про поиск аномалий в данныхYury Kashnitsky
 

Más de Yury Kashnitsky (6)

Benchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLPBenchmarking transfer learning approaches for NLP
Benchmarking transfer learning approaches for NLP
 
Gender-unbiased BERT-based Pronoun Resolution
Gender-unbiased BERT-based  Pronoun ResolutionGender-unbiased BERT-based  Pronoun Resolution
Gender-unbiased BERT-based Pronoun Resolution
 
mlcourse.ai. Outro
mlcourse.ai. Outromlcourse.ai. Outro
mlcourse.ai. Outro
 
Time series forecasting with ARIMA
Time series forecasting with ARIMATime series forecasting with ARIMA
Time series forecasting with ARIMA
 
mlcourse.ai. Clustering
mlcourse.ai. Clusteringmlcourse.ai. Clustering
mlcourse.ai. Clustering
 
Необычные модели Playboy, или про поиск аномалий в данных
Необычные модели Playboy, или про поиск аномалий в данныхНеобычные модели Playboy, или про поиск аномалий в данных
Необычные модели Playboy, или про поиск аномалий в данных
 

Último

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 

Último (20)

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 

mlcourse.ai, introduction, course overview

  • 1. mlcourse.ai Open Machine Learning Course by OpenDataScience Yury Kashnitskiy (@yorko) Data Scientist @ KPN, Amsterdam
  • 4. mlcourse.ai. What we have for you
  • 5. Syllabus • 10 lectures • Basic ML algorithms and their applications • Assignments and in-class practice • Competitions • Individual projects • Tutorials More info here https://mlcourse.ai/roadmap
  • 6. What makes it different • Lots and lots of practice • Theoretical understanding of applied techniques • Delving into competitions • Your own projects • Really vibrant community!
  • 7. Roadmap/logistics • All communication in ODS Slack, #mlcourse_ai • https://mlcourse.ai/roadmap • 10 assignments – ~10 credits each • Projects, competitions, tutorials – up to 40 crd. each • Current rating is here https://goo.gl/TGGr3b • All materials are stored on GitHub https://github.com/Yorko/ mlcourse.ai and https://mlcourse.ai • Top-100 participants will be mentioned on a special Wiki page
  • 8. Toolbox • Python • Jupyter notebooks • GitHub • Docker (optional) • Other libs like Vowpal Wabbit & Xgboost • Instructions https://mlcourse.ai/prerequisites
  • 9. Lecture 1 • Data analysis with Pandas • Practice on first steps after getting data
  • 10. Lecture 2 • Visual data analysis with Pandas and Seaborn • Crucial plots for feature exploration • Practice on «drawing»
  • 11. Lecture 3 • Foundations of Machine Learning • Supervised learning • Decision trees • k Nearest Neighbours • Practice: first steps with Scikit-learn
  • 12. Lecture 4 • Linear classification models • Regularization • Cross-validation • Practice on logistic regression for a "real-world" task
  • 13. Lecture 5 • Ensembles, random forest • Feature importance • Practice on random forest and assessing feature importance
  • 14. Lecture 6 • Regression task • Linear and non-linear regression models • Practice on grasping core ideas behind linear regression
  • 15. Lecture 7 • Unsupervised Learning • Principal Component Analysis • Clustering • Practice: clustering Samsung Galaxy S3 sensor data into types of human activity
  • 16. Lecture 8 • Stochastic Gradient Descent & Online learning • Learning with a couple GB of data • Vowpal Wabbit • Extracting simple features from texts • Practice: text classification
  • 17. Lecture 9 • Time series • Classical and modern approaches • Practice: ARIMA model, Facebook Prophet
  • 18. Lecture 10 • Gradient boosting: a modern view • Theoretical basis for gradient boosting • Best implementations • Practice: beating a baseline in a Kaggle Inclass competition Regularization?
  • 19. Assignments • Full versions are announced during course sessions https:// mlcourse.ai/assignments • Demo versions are found in course repo https:// github.com/Yorko/mlcourse.ai • And in a Kaggle Dataset mlcourse.ai https:// www.kaggle.com/kashnitsky/ mlcourse
  • 20. Kaggle Inclass • Alice - tracking visited websites to distinguish Alice from all others • Medium - predicting #claps for a story on Medium More info here https://mlcourse.ai/roadmap
  • 21. Individual projects • Throughout the whole course • Straightforward instructions • Your own data or just Kaggle Datasets • Peer review • Very cool experience More info here https://mlcourse.ai/roadmap
  • 22. Project "Alice" • A substitute for an individual project if you don't have cool ideas for one • Clear instructions • 6 weeks, 6 notebooks to complete • In cooperation with Yandex and MIPT, specialization "Machine Learning and Data Analysis" • Solutions are not shared Tutorials • Your own tutorials on pretty much any topic around ML & DS • Peer-voted • Nice way to grasp something yourself is to write a tutorial More info here https://mlcourse.ai/roadmap
  • 23. More info in Slack #mlcourse.ai, pinned items Good luck! https://mlcourse.ai/news