SlideShare una empresa de Scribd logo
1 de 36
初心者向け
AI Safety
ⓒ 2016 UEC Tokyo.
July 29nd, 2016
Kurihara Lab
Xcompass Intelligence Ltd.
Ashihara Yuta
No.Xⓒ 2016 UEC Tokyo.
Let Me Introduce Myself
Name : Ashihara Yuta
Occupation : Researcher(Xcompass Intelligence Ltd.)
Ph.D. Student(UEC Kurihara Lab.)
WBA Future Leaders (Society Branch)
Hobby : Fishing(Not Phishing)
NicoNico Doga (wrestling series, Jikkyo Play)
Motor cycle(Retire This year)
Waching Movie
No.Xⓒ 2016 UEC Tokyo.
Let Me Introduce Myself
No.Xⓒ 2016 UEC Tokyo.
Today’s Topic
Title : “Concrete Problems in AI Safety”
Author : Dario Amodei, Chris Olah, Jacob Steinhardt,
Paul Christiano, John Schulman, Dan Mane
Published : June, 21th, 2016
+人工知能学会全国大会 倫理委員会 公開討論
+人工知能学会 倫理委員会 倫理綱領(案) 
No.Xⓒ 2016 UEC Tokyo.
・ (Loosely) inspired by what (just a little)  
know about the biological brain.
Deep Learning Background ①
No.Xⓒ 2016 UEC Tokyo.
Deep Learning Background ②
・ Lower layers have low level of abstraction
No.Xⓒ 2016 UEC Tokyo.
Deep Learning Background ②
・ Higher layers have high level of abstraction
No.Xⓒ 2016 UEC Tokyo.
Deep Learning Concept
・ DeepLearning の手法では,中間層に
 入力された物体の特徴を得ている
・つまり,物体の認識に必要な情報は
 中間層のどこかにある
No.Xⓒ 2016 UEC Tokyo.
Demo1
No.Xⓒ 2016 UEC Tokyo.
Demo2
?
No.Xⓒ 2016 UEC Tokyo.
Vector Background
・ Word vector compressed 2D vector has 2D shape
  ex) word2vec , LDA , NNLM…
No.Xⓒ 2016 UEC Tokyo.
Vector Background
・ Well compressed word vector sometimes
meaningful
No.Xⓒ 2016 UEC Tokyo.
Vector Background
・ Well compressed word vector sometimes
meaningful
No.Xⓒ 2016 UEC Tokyo.
My ex-Research Theme  
Encoder
Encoder
Encoder
RNN1
RNN3
RNN2
Decoder
No.Xⓒ 2016 UEC Tokyo.
Target
No.Xⓒ 2016 UEC Tokyo.
Target
No.Xⓒ 2016 UEC Tokyo.
Vector Background
・ Well compressed word vector sometimes
meaningful
No.Xⓒ 2016 UEC Tokyo.
Summary
・ Deep Learning : ( Has Ability to Diffuse )
Has Ability to Compress
・ Compressed Information : Useful but…
No.Xⓒ 2016 UEC Tokyo.
AI Safety
No.Xⓒ 2016 UEC Tokyo.
AI Safety
No.Xⓒ 2016 UEC Tokyo.
AI Safety
No.Xⓒ 2016 UEC Tokyo.
AI Safety
No.Xⓒ 2016 UEC Tokyo.
Today’s
Topic ( Repeated )
Title : “Concrete Problems in AI Safety”
Author : Dario Amodei, Chris Olah, Jacob Steinhardt,
Paul Christiano, John Schulman, Dan Mane
Published : June, 21th, 2016
+人工知能学会全国大会 倫理委員会 公開討論
+人工知能学会 倫理委員会 倫理綱領(案) 
No.Xⓒ 2016 UEC Tokyo.
Mind when they make…
・ Avoiding Negative Side Effects
 → Don’t knock over a vase for faster cleaning
・ Avoiding Reward Hacking
 → Don’t game its reward function
・ Scalable Oversight
 → Human Check might have to be relatively infrequent
・ Safe Exploration
 → Putting a wet mop in an electrical outlet is bad idea
・ Robustness to Distributional Shift
 → Factory work floor may be dangerous than Office floor
No.Xⓒ 2016 UEC Tokyo.
AI Safety
Avoiding Negative Side Effects
 ・ Define or Learn an Impact Regularizer
  → Side effects may be similar across tasks than main
goals
 ・ Penalize Influence
  → This idea as written would not quite work
 ・ Multi-Agent Approaches
  → Cooperative Inverse Reinforcement Learning
 ・ Reward Uncertainty
  → Uncertain reward function is better
  
No.Xⓒ 2016 UEC Tokyo.
AI Safety
Avoiding Reward Hacking
 ・ Partially Observed Goals
  → Don’t say “Perfect.” with closing eyes.
 ・ Careful Engineering
  → No comment…
 ・ Multiple Rewards
  → There also call bad behaviors
No.Xⓒ 2016 UEC Tokyo.
AI Safety
Scalable Oversight
 ・ Distant supervision
  → where feedback is more interactive and i.i.d
 ・ Hierarchical reinforcement learning
  → Top -> Middle -> Low
No.Xⓒ 2016 UEC Tokyo.
AI Safety
Safe Exploration
 ・ Use Demonstrations : Simulated Exploration
  → Use simulated environments is less for catastrophe
 ・ Human Oversight
  → But some actions are too fast for humans to judge
No.Xⓒ 2016 UEC Tokyo.
AI Safety
Robustness to Distributional Shift
 ・ Omitted because it is technical…
No.Xⓒ 2016 UEC Tokyo.
AI Safety   Sammary
・ Journey (making AI) is “keep an eye” till making a good
one
・ Does not mean that the end once working the program
No.Xⓒ 2016 UEC Tokyo.
AI Safety(?) in Japan
No.Xⓒ 2016 UEC Tokyo.
AI Safety(?) in Japan
・ 人類への貢献
 →専門家として,安全への脅威を排除する
・ 誠実な振る舞い
 →虚偽や不明瞭な主張を行わない
・ 公正性
 →不公平や格差を生む可能性を認識する
・ 不断の自己研鑽
 →絶え間ない自己研鑽に努める
・ 検証と警鐘
 →潜在的な危険性について警鐘を鳴らす
No.Xⓒ 2016 UEC Tokyo.
AI Safety(?) in Japan
・ 社会の啓蒙
 →社会が誤った認識をしてるときに正す主張をする
・ 法規制の遵守
 →法規制が整合していない場合は倫理的に判断する
・ 他社の尊重
 →他社の情報や財産の損失をしてはならない
・ 他社のプライバシーの尊重
 →個人情報の適正な取り扱いを行う義務を負う
・ 説明責任
 →技術を悪用するものには説明を求め,
   正当でない場合はそれを防止しなければならない
No.Xⓒ 2016 UEC Tokyo.
Japan and America
・ The “manual”
to avoid making bad AI
・ Focus on the
problem
concretely
・ The “manual”
to avoid making bad AI
・ Focus on the
problem
concretely
・研究者,専門家と
して
  ”あるべき姿の“指針
・人類の幸福を目指
す
 人工知能の開発
・研究者,専門家と
して
  ”あるべき姿の“指針
・人類の幸福を目指
す
 人工知能の開発America Japan
どちらも非常に大事な考え方だと思ってい
ます
どちらも非常に大事な考え方だと思ってい
ます
No.Xⓒ 2016 UEC Tokyo.
Think About It … AI
ⓒ 2012 UEC Tokyo.

Más contenido relacionado

Similar a Casual taaaalk july_21th_2016

DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-Automation
DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-AutomationDevSecOps: A Secure SDLC in the Age of DevOps and Hyper-Automation
DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-AutomationAlex Senkevitch
 
Deep sec talk - Addressing the skills gap
Deep sec talk - Addressing the skills gapDeep sec talk - Addressing the skills gap
Deep sec talk - Addressing the skills gapColin McLean
 
AC.Elerator Kick-Off (Day 1) facilitated by Ideenfestival
AC.Elerator Kick-Off (Day 1) facilitated by IdeenfestivalAC.Elerator Kick-Off (Day 1) facilitated by Ideenfestival
AC.Elerator Kick-Off (Day 1) facilitated by IdeenfestivalDavid Lipgens
 
RVASec 2017- Bringing Law and Order to CICD
RVASec 2017- Bringing Law and Order to CICDRVASec 2017- Bringing Law and Order to CICD
RVASec 2017- Bringing Law and Order to CICDTroy Marshall
 
How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021Hiromu Yakura
 

Similar a Casual taaaalk july_21th_2016 (8)

DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-Automation
DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-AutomationDevSecOps: A Secure SDLC in the Age of DevOps and Hyper-Automation
DevSecOps: A Secure SDLC in the Age of DevOps and Hyper-Automation
 
Deep sec talk - Addressing the skills gap
Deep sec talk - Addressing the skills gapDeep sec talk - Addressing the skills gap
Deep sec talk - Addressing the skills gap
 
User Experience Beyond the Screen
User Experience Beyond the ScreenUser Experience Beyond the Screen
User Experience Beyond the Screen
 
AC.Elerator Kick-Off (Day 1) facilitated by Ideenfestival
AC.Elerator Kick-Off (Day 1) facilitated by IdeenfestivalAC.Elerator Kick-Off (Day 1) facilitated by Ideenfestival
AC.Elerator Kick-Off (Day 1) facilitated by Ideenfestival
 
Introduction to AI Governance
Introduction to AI GovernanceIntroduction to AI Governance
Introduction to AI Governance
 
RVASec 2017- Bringing Law and Order to CICD
RVASec 2017- Bringing Law and Order to CICDRVASec 2017- Bringing Law and Order to CICD
RVASec 2017- Bringing Law and Order to CICD
 
How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021How to make good use of AI technologies? @ Tsukuba Conference 2021
How to make good use of AI technologies? @ Tsukuba Conference 2021
 
Codegarden - API economy (NestJs)
Codegarden - API economy (NestJs) Codegarden - API economy (NestJs)
Codegarden - API economy (NestJs)
 

Último

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 

Último (20)

Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 

Casual taaaalk july_21th_2016

  • 1. 初心者向け AI Safety ⓒ 2016 UEC Tokyo. July 29nd, 2016 Kurihara Lab Xcompass Intelligence Ltd. Ashihara Yuta
  • 2. No.Xⓒ 2016 UEC Tokyo. Let Me Introduce Myself Name : Ashihara Yuta Occupation : Researcher(Xcompass Intelligence Ltd.) Ph.D. Student(UEC Kurihara Lab.) WBA Future Leaders (Society Branch) Hobby : Fishing(Not Phishing) NicoNico Doga (wrestling series, Jikkyo Play) Motor cycle(Retire This year) Waching Movie
  • 3. No.Xⓒ 2016 UEC Tokyo. Let Me Introduce Myself
  • 4. No.Xⓒ 2016 UEC Tokyo. Today’s Topic Title : “Concrete Problems in AI Safety” Author : Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mane Published : June, 21th, 2016 +人工知能学会全国大会 倫理委員会 公開討論 +人工知能学会 倫理委員会 倫理綱領(案) 
  • 5. No.Xⓒ 2016 UEC Tokyo. ・ (Loosely) inspired by what (just a little)   know about the biological brain. Deep Learning Background ①
  • 6. No.Xⓒ 2016 UEC Tokyo. Deep Learning Background ② ・ Lower layers have low level of abstraction
  • 7. No.Xⓒ 2016 UEC Tokyo. Deep Learning Background ② ・ Higher layers have high level of abstraction
  • 8. No.Xⓒ 2016 UEC Tokyo. Deep Learning Concept ・ DeepLearning の手法では,中間層に  入力された物体の特徴を得ている ・つまり,物体の認識に必要な情報は  中間層のどこかにある
  • 9. No.Xⓒ 2016 UEC Tokyo. Demo1
  • 10. No.Xⓒ 2016 UEC Tokyo. Demo2 ?
  • 11. No.Xⓒ 2016 UEC Tokyo. Vector Background ・ Word vector compressed 2D vector has 2D shape   ex) word2vec , LDA , NNLM…
  • 12. No.Xⓒ 2016 UEC Tokyo. Vector Background ・ Well compressed word vector sometimes meaningful
  • 13. No.Xⓒ 2016 UEC Tokyo. Vector Background ・ Well compressed word vector sometimes meaningful
  • 14. No.Xⓒ 2016 UEC Tokyo. My ex-Research Theme   Encoder Encoder Encoder RNN1 RNN3 RNN2 Decoder
  • 15. No.Xⓒ 2016 UEC Tokyo. Target
  • 16. No.Xⓒ 2016 UEC Tokyo. Target
  • 17. No.Xⓒ 2016 UEC Tokyo. Vector Background ・ Well compressed word vector sometimes meaningful
  • 18. No.Xⓒ 2016 UEC Tokyo. Summary ・ Deep Learning : ( Has Ability to Diffuse ) Has Ability to Compress ・ Compressed Information : Useful but…
  • 19. No.Xⓒ 2016 UEC Tokyo. AI Safety
  • 20. No.Xⓒ 2016 UEC Tokyo. AI Safety
  • 21. No.Xⓒ 2016 UEC Tokyo. AI Safety
  • 22. No.Xⓒ 2016 UEC Tokyo. AI Safety
  • 23. No.Xⓒ 2016 UEC Tokyo. Today’s Topic ( Repeated ) Title : “Concrete Problems in AI Safety” Author : Dario Amodei, Chris Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dan Mane Published : June, 21th, 2016 +人工知能学会全国大会 倫理委員会 公開討論 +人工知能学会 倫理委員会 倫理綱領(案) 
  • 24. No.Xⓒ 2016 UEC Tokyo. Mind when they make… ・ Avoiding Negative Side Effects  → Don’t knock over a vase for faster cleaning ・ Avoiding Reward Hacking  → Don’t game its reward function ・ Scalable Oversight  → Human Check might have to be relatively infrequent ・ Safe Exploration  → Putting a wet mop in an electrical outlet is bad idea ・ Robustness to Distributional Shift  → Factory work floor may be dangerous than Office floor
  • 25. No.Xⓒ 2016 UEC Tokyo. AI Safety Avoiding Negative Side Effects  ・ Define or Learn an Impact Regularizer   → Side effects may be similar across tasks than main goals  ・ Penalize Influence   → This idea as written would not quite work  ・ Multi-Agent Approaches   → Cooperative Inverse Reinforcement Learning  ・ Reward Uncertainty   → Uncertain reward function is better   
  • 26. No.Xⓒ 2016 UEC Tokyo. AI Safety Avoiding Reward Hacking  ・ Partially Observed Goals   → Don’t say “Perfect.” with closing eyes.  ・ Careful Engineering   → No comment…  ・ Multiple Rewards   → There also call bad behaviors
  • 27. No.Xⓒ 2016 UEC Tokyo. AI Safety Scalable Oversight  ・ Distant supervision   → where feedback is more interactive and i.i.d  ・ Hierarchical reinforcement learning   → Top -> Middle -> Low
  • 28. No.Xⓒ 2016 UEC Tokyo. AI Safety Safe Exploration  ・ Use Demonstrations : Simulated Exploration   → Use simulated environments is less for catastrophe  ・ Human Oversight   → But some actions are too fast for humans to judge
  • 29. No.Xⓒ 2016 UEC Tokyo. AI Safety Robustness to Distributional Shift  ・ Omitted because it is technical…
  • 30. No.Xⓒ 2016 UEC Tokyo. AI Safety   Sammary ・ Journey (making AI) is “keep an eye” till making a good one ・ Does not mean that the end once working the program
  • 31. No.Xⓒ 2016 UEC Tokyo. AI Safety(?) in Japan
  • 32. No.Xⓒ 2016 UEC Tokyo. AI Safety(?) in Japan ・ 人類への貢献  →専門家として,安全への脅威を排除する ・ 誠実な振る舞い  →虚偽や不明瞭な主張を行わない ・ 公正性  →不公平や格差を生む可能性を認識する ・ 不断の自己研鑽  →絶え間ない自己研鑽に努める ・ 検証と警鐘  →潜在的な危険性について警鐘を鳴らす
  • 33. No.Xⓒ 2016 UEC Tokyo. AI Safety(?) in Japan ・ 社会の啓蒙  →社会が誤った認識をしてるときに正す主張をする ・ 法規制の遵守  →法規制が整合していない場合は倫理的に判断する ・ 他社の尊重  →他社の情報や財産の損失をしてはならない ・ 他社のプライバシーの尊重  →個人情報の適正な取り扱いを行う義務を負う ・ 説明責任  →技術を悪用するものには説明を求め,    正当でない場合はそれを防止しなければならない
  • 34. No.Xⓒ 2016 UEC Tokyo. Japan and America ・ The “manual” to avoid making bad AI ・ Focus on the problem concretely ・ The “manual” to avoid making bad AI ・ Focus on the problem concretely ・研究者,専門家と して   ”あるべき姿の“指針 ・人類の幸福を目指 す  人工知能の開発 ・研究者,専門家と して   ”あるべき姿の“指針 ・人類の幸福を目指 す  人工知能の開発America Japan どちらも非常に大事な考え方だと思ってい ます どちらも非常に大事な考え方だと思ってい ます
  • 35. No.Xⓒ 2016 UEC Tokyo. Think About It … AI
  • 36. ⓒ 2012 UEC Tokyo.

Notas del editor

  1. 2000年代後半から,DeepLearningと呼ばれる機械学習の手法が、活発に研究されるようになりました. DeepLearningは、従来のNeuralNetworkの中間層を多層化したネットワークになっています.
  2. 2000年代後半から,DeepLearningと呼ばれる機械学習の手法が、活発に研究されるようになりました. DeepLearningは、従来のNeuralNetworkの中間層を多層化したネットワークになっています.
  3. 2000年代後半から,DeepLearningと呼ばれる機械学習の手法が、活発に研究されるようになりました. DeepLearningは、従来のNeuralNetworkの中間層を多層化したネットワークになっています.
  4. 2000年代後半から,DeepLearningと呼ばれる機械学習の手法が、活発に研究されるようになりました. DeepLearningは、従来のNeuralNetworkの中間層を多層化したネットワークになっています.
  5. 2000年代後半から,DeepLearningと呼ばれる機械学習の手法が、活発に研究されるようになりました. DeepLearningは、従来のNeuralNetworkの中間層を多層化したネットワークになっています.