SlideShare a Scribd company logo
1 of 14
機械学習の数理100問
R言語バージョン
大阪大学
鈴木 讓
R研究集会@統数研
2019年12月21日
@Joe_suzuki
鈴木 讓 (すずきじょう, Joe Suzuki)
大阪大学大学院基礎工学研究科 数理科学領域 教授
(基礎工学部情報科学科数理科学コース)
専門: データ科学、機械学習、グラフィカルモデル
開発CRAN パッケージ: BNSL
R言語は、Rstudioではなくコマンドライン
規模が大きくなると、Rcppに移植
user 2015@Aalbolg 参加
数学50% R言語50%
問題番号 分野
1-18 線形回帰
19-31 分類
32-39 リサンプリング
40-48 情報量基準
49-56 正則化
57-68 非線形
69-74 決定木
75-87 サポートベクトルマシン
88-100 教師なし学習
bayesnet.org/100.pdf
解く時間 阪大学部 阪大M1
30分以内 70% 90%
30分以上 2時間以内 25% 10%未満
2時間以上半日以内 5% 若干
丁寧すぎるヒントがついている
理系学部1年の線形代数を仮定
問題どうしつながっている
数学的な導出
ヒント盛りだくさん
R言語での実現
動作確認
数学で正確な判断をしたことが、正当化される
ロジスティック回帰の最尤法の例
講義の演習問題
R言語の学習と統計学の学習の相乗効果
時期 所属 科目
2016年後期 理学部数学科 実験数学 3
2017,2018,2019後期 基礎工数理科学 計算数理 B
基礎工 情報科学科 数理科学コース
(履修前に、R言語と数理の基礎)
G. James, D. Witten, R. Tibshrani, T.Hastie
``Introduction to Statistical Learning with R”
Rを使った統計教育の問題意識
• パッケージにデータを放り込むだけ
頭を使わない人生を奨励している
• 使い方を覚えるだけ
学問は毎日変化・成長している
学生を、明日の化石にしている
講義の目標
学生の頭の中に数理的なロジックを構築できないか
データ関連人材育成関西地区コンソーシアム(duex.jp)
神戸大、奈良先、和歌山大、滋賀大、阪大、阪府大+阪市大
通信教育(e-learning):
100問 + ビデオ教材
ビデオ@vimeo
(機械学習の数理、スパース、ベイズネット、統計学)
書籍としての出版
機械学習の数理100問: 統計的学習 with R (共立出版)
問題番号 分野 問題
0章 線形代数 なし
1章 線形回帰 1-18
2章 分類 19-31
3章 リサンプリング 32-39
4章 情報量基準 40-48
5章 正則化 49-56
6章 非線形 57-68
7章 決定木 69-74
8章 サポートベクトルマシン 75-87
9章 教師なし学習 88-100
bayesnet.org/main.pdf
B5 220ページ
2020年3月中旬発行予定
類似書籍との比較
分量が多い
数学的
コードがない
ロジックより直感
プレゼンがうまい
Rの使い方
しきいが高い プロには物足りない
新書の位置づけ
分量が少ない
Rのソース
数学的な感動
国内の既存の機械学習書籍への問題意識
• 「ななめ読み」を前提
• 「身につける」がない
新書のねらい: 手を動かす
数式を導出
Rコードを組立てる
(パッケージよりスクラッチ)
執筆における苦心: Rによる作図
Library(tikzDevice)
tikz(“fig1-7.tex”,width=4.0,height=3.2)
curve(soft.th(5,x),-5,5)
dev.off
documentclass[dvipdfmx,a4j]{jbook}
usepackage{tikz}
begin{figure}
input{fig1-7}
caption{label{fig1-7} 図を表示}
end{figure}
tikz形式(テキスト)を
fig1-7.texという
ファイル名にはきだす
出力されたtikz形式を
手で修正する
スパース推定と機械学習への応用 100問: 近日公開
数理 + R言語
2018年前期の大学院講義(90名が受講): 129問を提供
2019年11月行動計量学会セミナー(62名が受講): 86問を提供
2019年後期の学部3年, 学年4年, 博士のセミナーで93問を提供
bayesnet.org/sparse.pdf
書籍の将来展望
機械学習の数理100問シリーズ (共立出版)
1. 統計的学習 with R
2. 統計的学習 with Python
3. Lasso with R
4. Lasso with Python
5. 機械学習のためのカーネル with R
6. 機械学習のためのカーネル with Python
7. グラフィカルモデルと因果推論 with R
8. グラフィカルモデルと因果推論 with Python
対応する100問に対するビデオを公開予定
W○○○○○ful Rシリーズ
からの誘いもあったが。。。

More Related Content

More from Joe Suzuki

E-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka UniversityE-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka UniversityJoe Suzuki
 
AMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップAMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップJoe Suzuki
 
CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要Joe Suzuki
 
Forest Learning from Data
Forest Learning from DataForest Learning from Data
Forest Learning from DataJoe Suzuki
 
A Bayesian Approach to Data Compression
A Bayesian Approach to Data CompressionA Bayesian Approach to Data Compression
A Bayesian Approach to Data CompressionJoe Suzuki
 
A Conjecture on Strongly Consistent Learning
A Conjecture on Strongly Consistent LearningA Conjecture on Strongly Consistent Learning
A Conjecture on Strongly Consistent LearningJoe Suzuki
 
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...Joe Suzuki
 
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...Joe Suzuki
 
研究紹介(学生向け)
研究紹介(学生向け)研究紹介(学生向け)
研究紹介(学生向け)Joe Suzuki
 
Bayesian Criteria based on Universal Measures
Bayesian Criteria based on Universal MeasuresBayesian Criteria based on Universal Measures
Bayesian Criteria based on Universal MeasuresJoe Suzuki
 
MDL/Bayesian Criteria based on Universal Coding/Measure
MDL/Bayesian Criteria based on Universal Coding/MeasureMDL/Bayesian Criteria based on Universal Coding/Measure
MDL/Bayesian Criteria based on Universal Coding/MeasureJoe Suzuki
 
The Universal Measure for General Sources and its Application to MDL/Bayesian...
The Universal Measure for General Sources and its Application to MDL/Bayesian...The Universal Measure for General Sources and its Application to MDL/Bayesian...
The Universal Measure for General Sources and its Application to MDL/Bayesian...Joe Suzuki
 
Universal Prediction without assuming either Discrete or Continuous
Universal Prediction without assuming either Discrete or ContinuousUniversal Prediction without assuming either Discrete or Continuous
Universal Prediction without assuming either Discrete or ContinuousJoe Suzuki
 
Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...Joe Suzuki
 
The Universal Bayesian Chow-Liu Algorithm
The Universal Bayesian Chow-Liu AlgorithmThe Universal Bayesian Chow-Liu Algorithm
The Universal Bayesian Chow-Liu AlgorithmJoe Suzuki
 
Bayes Independence Test
Bayes Independence TestBayes Independence Test
Bayes Independence TestJoe Suzuki
 
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...Joe Suzuki
 
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...Joe Suzuki
 

More from Joe Suzuki (20)

E-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka UniversityE-learning Design and Development for Data Science in Osaka University
E-learning Design and Development for Data Science in Osaka University
 
UAI 2017
UAI 2017UAI 2017
UAI 2017
 
AMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップAMBN2017 サテライトワークショップ
AMBN2017 サテライトワークショップ
 
CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要CRAN Rパッケージ BNSLの概要
CRAN Rパッケージ BNSLの概要
 
Forest Learning from Data
Forest Learning from DataForest Learning from Data
Forest Learning from Data
 
A Bayesian Approach to Data Compression
A Bayesian Approach to Data CompressionA Bayesian Approach to Data Compression
A Bayesian Approach to Data Compression
 
A Conjecture on Strongly Consistent Learning
A Conjecture on Strongly Consistent LearningA Conjecture on Strongly Consistent Learning
A Conjecture on Strongly Consistent Learning
 
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...
A Generalization of the Chow-Liu Algorithm and its Applications to Artificial...
 
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...
A Generalization of Nonparametric Estimation and On-Line Prediction for Stati...
 
研究紹介(学生向け)
研究紹介(学生向け)研究紹介(学生向け)
研究紹介(学生向け)
 
Bayesian Criteria based on Universal Measures
Bayesian Criteria based on Universal MeasuresBayesian Criteria based on Universal Measures
Bayesian Criteria based on Universal Measures
 
MDL/Bayesian Criteria based on Universal Coding/Measure
MDL/Bayesian Criteria based on Universal Coding/MeasureMDL/Bayesian Criteria based on Universal Coding/Measure
MDL/Bayesian Criteria based on Universal Coding/Measure
 
The Universal Measure for General Sources and its Application to MDL/Bayesian...
The Universal Measure for General Sources and its Application to MDL/Bayesian...The Universal Measure for General Sources and its Application to MDL/Bayesian...
The Universal Measure for General Sources and its Application to MDL/Bayesian...
 
Universal Prediction without assuming either Discrete or Continuous
Universal Prediction without assuming either Discrete or ContinuousUniversal Prediction without assuming either Discrete or Continuous
Universal Prediction without assuming either Discrete or Continuous
 
Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...Bayesian network structure estimation based on the Bayesian/MDL criteria when...
Bayesian network structure estimation based on the Bayesian/MDL criteria when...
 
The Universal Bayesian Chow-Liu Algorithm
The Universal Bayesian Chow-Liu AlgorithmThe Universal Bayesian Chow-Liu Algorithm
The Universal Bayesian Chow-Liu Algorithm
 
Bayes Independence Test
Bayes Independence TestBayes Independence Test
Bayes Independence Test
 
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...Efficietly Learning Bayesian Network Structuresbased on the B&B Strategy: A ...
Efficietly Learning Bayesian Network Structures based on the B&B Strategy: A ...
 
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
Forest Learning based on the Chow-Liu Algorithm and its Application to Genom...
 
2016 7-13
2016 7-132016 7-13
2016 7-13
 

R集会@統数研