SlideShare una empresa de Scribd logo
1 de 31
Descargar para leer sin conexión
The AI & I at Work
Tarek Hoteit – PhD, IT Director TR Labs at Thomson Reuters
October 19, 2018 – University of Texas in Dallas MIS Club
http://tarek.computer
Agenda
• Short history of Data Science & AI
• Data Science and AI together 4 ever
• Practical Techniques for data scientists when using AI
• Real demos for work
History of Data Science
1962 John W. Turkey predicted effect of modern-day electronic computing on
data analysis as an empirical science1
History of Data Science
1965 “Programma 101” 1st
commercial programmable desktop
calculator
History of Data Science
1981 IBM released 1st
personal computer,
followed by Apple in 1983
with GUI
Fast forward 20 years later….
History of Data Science
Data Scientists (from Chapter 3: Roles & responsibilities of individuals and institutions )
The interests of data scientists – the information and computer scientists, database and
software engineers and programmers, disciplinary experts, curators and expert annotators,
librarians, archivists, and others, who are crucial to the successful management of a digital
data collection – lie in having their creativity and intellectual contributions fully recognized.
In pursuing these interests, they have the responsibility to:
• conduct creative inquiry and analysis; enhance through consultation, collaboration, and
coordination the ability of others to conduct research and education using digital data
collections;
• be at the forefront in developing innovative concepts in database technology and
information sciences, including methods for data visualization and information
discovery, and applying these in the fields of science and education relevant to the
collection;
• implement best practices and technology; serve as a mentor to beginning or
transitioning investigators, students and others interested in pursuing data science;
• design and implement education and outreach programs that make the benefits of data
collections and digital information science available to the broadest possible range of
researchers, educators, students, and the general public.
2005 National Science Board advocates data science career
History of Data Science
2010 data science takes center stage in computer technology / customers
use more technology devices, social media, mobile & machines become
faster
In the mean time….
1956 -The 1956 Dartmouth summer research
project on artificial intelligence was initiated
August. 31, 1955
proposal authored by:
J. McCarthy, Dartmouth College
M. L. Minsky, Harvard University
N. Rochester, I.B.M. Corporation
C.E. Shannon, Bell Telephone Laboratories
History of AI
1968 – Space Odyssey 2001 by Stanley Kubrick is
released featuring intelligent computer, HAL 9000.
History of AI
1950 – 60s : reasoning AI, prototypes – high interests
1971: winter AI came up
1980s – 1990s: another hype with expert systems, neural networks,
1990s: AI Winter 2
History of AI
Late 90’s 2000’s – hype starts again (Deep Blue beats Kasparov
in chess
2006 – University of Toronto develops deep learning
2011 – Watson wins at Jeopardy
2016 – Alpha Go beats GO champions
History of AI
2017 – Alpha Go Zero beats
Alpha Go 100 to 0 after starting
from scratch
Now everyone is into artificial intelligence
So where does data
science and artificial
intelligence cross its
other?
Data Science, Artificial Intelligence cross path in all
places
We now have two types of Data Scientists
Data Scientists Type A – Analytical
• Focuses on the why
• Heavy on statistics, machine
learning fundamentals, data
wrangling
• Use Python/R, SQL
Data Scientist Type B –
Builder/Machine Learning Engineer
• Focused on creating new
products
• Heavy on machine learning,
software engineering, linear
algebra and differential
equations
• Use Python/Java/Scala, Docker,
cloud computing
Jesse Steinweg-Woods https://www.datascience.com/blog/guide-to-popular-data-science-jobs
Common coding grounds? Python favorable among machine
learning and data science jobs
Based on indeed.com last updated late 2017
Python & Data Science libraries are heavily used for data
analysis
“The number of Data Scientists is
constantly growing and at the moment
the number of Data Scientists is larger
than the number of Web Developers
among Python users.” – JetBrains
2018 “The State of Developer
Ecosystem Survey in 2018”
https://www.jetbrains.com/research/devec
osystem-2018/python/
Note: Java and JavaScript are still the most popular programming
languages for developers but more people continue to learn Python
JetBrains 2018 “The State of
Developer Ecosystem Survey in
2018”
To move from Data Scientist Type A to Type B
You need to build a solid
foundation for your data
and move up the pyramid
Monica Rogati “The AI Hierarchy of Needs” https://hackernoon.com/the-ai-hierarchy-of-needs-18f111fcc007
Some practical coding techniques
for Data Scientists
Jupyter Notebooks for data scientists
• Python Python Python
• Core modules: NumPy, SciPy,
MatplotLib
• Work environment: Jupyter
Notebooks – works with
Python, R, C++, Julia and more
http://jupyter.org/try
• Anaconda or VirtualEnv to
isolate Python work
environment
Complete Coding experience using JupyterLab
• Try Jupyter Labs - next-
generation web-based
user interface
• Pip install jupyterlab or
conda install -c conda-
forge jupyterlab
or cloud based development solutions – Google Collab
Free Google Collab https://colab.research.google.com/ . You can leverage their GPUs
More useful resources for researchers
• Nurture AI – curated summary of research papers
https://nurture.ai/home
• Auto ML https://cloud.google.com/automl/
• Public Data Search https://www.google.com/publicdata/directory
• Google Dataset Search https://toolbox.google.com/datasetsearch
AI & I at Work
Sentiment Analysis on Twitter using Django,
Docker containers, Python & Google NLP
Twitter API using Tweepy
Python Library & Twitter Dev
Account
Local Docker running
PostGresql database
Django & Python to run and
manage the code and data
Google Cloud Natural
Language Processing SDK to
run sentiment analysis
GITHUB Source Code: https://github.com/hoteit/sentiment-tweets
Training Google AutoML for categorizing customer
reviews
Searched for a dataset on
https://toolbox.google.com/data
setsearch
Found “100K+ Scraped Course
Reviews from the Coursera
Website (As of May 2017)
Analyzed the data,
cleaned when necessary
(pretraining step)
Created Google Cloud
AutoML project & activiated
NLP APIs, uploaded data
No AI expertise needed!
Dataset import
Train/Evaluate/Predict model
GitHub Source Code https://github.com/hoteit/coursereviews-automl
Fun time with AWS DeepLens - Deep learning-enabled
video camera
Chose a project template on
https://console.aws.amazon.com/deeplens
Registered Deeplens Device
& Deployed Project
Model configured using SageMaker, in this case:
SSD architecture with a ResNet-50 feature extractor
on S3, accessible via Lambda

Más contenido relacionado

La actualidad más candente

Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBeth Plale
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science clubData Science Club
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI dayMohammed Barakat
 
Data science
Data scienceData science
Data science9diov
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Gabriel Moreira
 
The Evolution of Open Data
The Evolution of Open DataThe Evolution of Open Data
The Evolution of Open DataJeanne Holm
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningJulian Bright
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Datascience and python
Datascience and pythonDatascience and python
Datascience and pythonUmmeSalmaM1
 
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015KISK FF MU
 

La actualidad más candente (17)

Bridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital TextBridging Digital Humanities Research and Big Data Repositories of Digital Text
Bridging Digital Humanities Research and Big Data Repositories of Digital Text
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Lecture #01
Lecture #01Lecture #01
Lecture #01
 
Lecture #02
Lecture #02 Lecture #02
Lecture #02
 
Introduction to Computing Logic Formulation
Introduction to Computing Logic FormulationIntroduction to Computing Logic Formulation
Introduction to Computing Logic Formulation
 
Lecture #03
Lecture #03Lecture #03
Lecture #03
 
Data science presentation 2nd CI day
Data science presentation 2nd CI dayData science presentation 2nd CI day
Data science presentation 2nd CI day
 
data science
data sciencedata science
data science
 
Data science
Data scienceData science
Data science
 
Python for Data Science - TDC 2015
Python for Data Science - TDC 2015Python for Data Science - TDC 2015
Python for Data Science - TDC 2015
 
The Evolution of Open Data
The Evolution of Open DataThe Evolution of Open Data
The Evolution of Open Data
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
 
BigDataCSEKeyNote_2012
BigDataCSEKeyNote_2012BigDataCSEKeyNote_2012
BigDataCSEKeyNote_2012
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Datascience and python
Datascience and pythonDatascience and python
Datascience and python
 
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015
Linda Treude, Sabine Wolf: Features for the Future Library #bcs2015
 

Similar a The Ai & I at Work

Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data ScienceFeyzi R. Bagirov
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...Bohyun Kim
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachMihai Criveti
 
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1ISSIP
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...jybufgofasfbkpoovh
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Enrico Motta
 
Moving forward data centric sciences weaving AI, Big Data & HPC
Moving forward data centric sciences  weaving AI, Big Data & HPCMoving forward data centric sciences  weaving AI, Big Data & HPC
Moving forward data centric sciences weaving AI, Big Data & HPCGenoveva Vargas-Solar
 
A Comprehensive Guide to Data Science Technologies.pdf
A Comprehensive Guide to Data Science Technologies.pdfA Comprehensive Guide to Data Science Technologies.pdf
A Comprehensive Guide to Data Science Technologies.pdfGeethaPratyusha
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving UpPaco Nathan
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Big Data Spain
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Research Data Alliance
 

Similar a The Ai & I at Work (20)

Introduction to Big Data and Data Science
Introduction to Big Data and Data ScienceIntroduction to Big Data and Data Science
Introduction to Big Data and Data Science
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...
Taking on a New Leadership Challenge: Student-Focused Learning in Artificial ...
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
 
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1
 
intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Big Data: Big Issues for IP
Big Data: Big Issues for IPBig Data: Big Issues for IP
Big Data: Big Issues for IP
 
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
Research in Intelligent Systems and Data Science at the Knowledge Media Insti...
 
Moving forward data centric sciences weaving AI, Big Data & HPC
Moving forward data centric sciences  weaving AI, Big Data & HPCMoving forward data centric sciences  weaving AI, Big Data & HPC
Moving forward data centric sciences weaving AI, Big Data & HPC
 
Visualization notes
Visualization notesVisualization notes
Visualization notes
 
A Comprehensive Guide to Data Science Technologies.pdf
A Comprehensive Guide to Data Science Technologies.pdfA Comprehensive Guide to Data Science Technologies.pdf
A Comprehensive Guide to Data Science Technologies.pdf
 
Data science
Data scienceData science
Data science
 
Datascience
DatascienceDatascience
Datascience
 
Data Science in 2016: Moving Up
Data Science in 2016: Moving UpData Science in 2016: Moving Up
Data Science in 2016: Moving Up
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
 
Data science
Data scienceData science
Data science
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
Digital Humanities Workshop
Digital Humanities WorkshopDigital Humanities Workshop
Digital Humanities Workshop
 

Último

Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
ARTERIAL BLOOD GAS ANALYSIS........pptx
ARTERIAL BLOOD  GAS ANALYSIS........pptxARTERIAL BLOOD  GAS ANALYSIS........pptx
ARTERIAL BLOOD GAS ANALYSIS........pptxAneriPatwari
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptxmary850239
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17Celine George
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesVijayaLaxmi84
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 

Último (20)

Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
ARTERIAL BLOOD GAS ANALYSIS........pptx
ARTERIAL BLOOD  GAS ANALYSIS........pptxARTERIAL BLOOD  GAS ANALYSIS........pptx
ARTERIAL BLOOD GAS ANALYSIS........pptx
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx4.9.24 School Desegregation in Boston.pptx
4.9.24 School Desegregation in Boston.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17How to Fix XML SyntaxError in Odoo the 17
How to Fix XML SyntaxError in Odoo the 17
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their uses
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 

The Ai & I at Work

  • 1. The AI & I at Work Tarek Hoteit – PhD, IT Director TR Labs at Thomson Reuters October 19, 2018 – University of Texas in Dallas MIS Club http://tarek.computer
  • 2.
  • 3. Agenda • Short history of Data Science & AI • Data Science and AI together 4 ever • Practical Techniques for data scientists when using AI • Real demos for work
  • 4. History of Data Science 1962 John W. Turkey predicted effect of modern-day electronic computing on data analysis as an empirical science1
  • 5. History of Data Science 1965 “Programma 101” 1st commercial programmable desktop calculator
  • 6. History of Data Science 1981 IBM released 1st personal computer, followed by Apple in 1983 with GUI
  • 7. Fast forward 20 years later….
  • 8. History of Data Science Data Scientists (from Chapter 3: Roles & responsibilities of individuals and institutions ) The interests of data scientists – the information and computer scientists, database and software engineers and programmers, disciplinary experts, curators and expert annotators, librarians, archivists, and others, who are crucial to the successful management of a digital data collection – lie in having their creativity and intellectual contributions fully recognized. In pursuing these interests, they have the responsibility to: • conduct creative inquiry and analysis; enhance through consultation, collaboration, and coordination the ability of others to conduct research and education using digital data collections; • be at the forefront in developing innovative concepts in database technology and information sciences, including methods for data visualization and information discovery, and applying these in the fields of science and education relevant to the collection; • implement best practices and technology; serve as a mentor to beginning or transitioning investigators, students and others interested in pursuing data science; • design and implement education and outreach programs that make the benefits of data collections and digital information science available to the broadest possible range of researchers, educators, students, and the general public. 2005 National Science Board advocates data science career
  • 9. History of Data Science 2010 data science takes center stage in computer technology / customers use more technology devices, social media, mobile & machines become faster
  • 10. In the mean time….
  • 11. 1956 -The 1956 Dartmouth summer research project on artificial intelligence was initiated August. 31, 1955 proposal authored by: J. McCarthy, Dartmouth College M. L. Minsky, Harvard University N. Rochester, I.B.M. Corporation C.E. Shannon, Bell Telephone Laboratories History of AI
  • 12. 1968 – Space Odyssey 2001 by Stanley Kubrick is released featuring intelligent computer, HAL 9000. History of AI
  • 13. 1950 – 60s : reasoning AI, prototypes – high interests 1971: winter AI came up 1980s – 1990s: another hype with expert systems, neural networks, 1990s: AI Winter 2 History of AI
  • 14. Late 90’s 2000’s – hype starts again (Deep Blue beats Kasparov in chess 2006 – University of Toronto develops deep learning 2011 – Watson wins at Jeopardy 2016 – Alpha Go beats GO champions History of AI 2017 – Alpha Go Zero beats Alpha Go 100 to 0 after starting from scratch
  • 15. Now everyone is into artificial intelligence
  • 16. So where does data science and artificial intelligence cross its other?
  • 17. Data Science, Artificial Intelligence cross path in all places
  • 18. We now have two types of Data Scientists Data Scientists Type A – Analytical • Focuses on the why • Heavy on statistics, machine learning fundamentals, data wrangling • Use Python/R, SQL Data Scientist Type B – Builder/Machine Learning Engineer • Focused on creating new products • Heavy on machine learning, software engineering, linear algebra and differential equations • Use Python/Java/Scala, Docker, cloud computing Jesse Steinweg-Woods https://www.datascience.com/blog/guide-to-popular-data-science-jobs
  • 19. Common coding grounds? Python favorable among machine learning and data science jobs Based on indeed.com last updated late 2017
  • 20. Python & Data Science libraries are heavily used for data analysis “The number of Data Scientists is constantly growing and at the moment the number of Data Scientists is larger than the number of Web Developers among Python users.” – JetBrains 2018 “The State of Developer Ecosystem Survey in 2018” https://www.jetbrains.com/research/devec osystem-2018/python/
  • 21. Note: Java and JavaScript are still the most popular programming languages for developers but more people continue to learn Python JetBrains 2018 “The State of Developer Ecosystem Survey in 2018”
  • 22. To move from Data Scientist Type A to Type B You need to build a solid foundation for your data and move up the pyramid Monica Rogati “The AI Hierarchy of Needs” https://hackernoon.com/the-ai-hierarchy-of-needs-18f111fcc007
  • 23. Some practical coding techniques for Data Scientists
  • 24. Jupyter Notebooks for data scientists • Python Python Python • Core modules: NumPy, SciPy, MatplotLib • Work environment: Jupyter Notebooks – works with Python, R, C++, Julia and more http://jupyter.org/try • Anaconda or VirtualEnv to isolate Python work environment
  • 25. Complete Coding experience using JupyterLab • Try Jupyter Labs - next- generation web-based user interface • Pip install jupyterlab or conda install -c conda- forge jupyterlab
  • 26. or cloud based development solutions – Google Collab Free Google Collab https://colab.research.google.com/ . You can leverage their GPUs
  • 27. More useful resources for researchers • Nurture AI – curated summary of research papers https://nurture.ai/home • Auto ML https://cloud.google.com/automl/ • Public Data Search https://www.google.com/publicdata/directory • Google Dataset Search https://toolbox.google.com/datasetsearch
  • 28. AI & I at Work
  • 29. Sentiment Analysis on Twitter using Django, Docker containers, Python & Google NLP Twitter API using Tweepy Python Library & Twitter Dev Account Local Docker running PostGresql database Django & Python to run and manage the code and data Google Cloud Natural Language Processing SDK to run sentiment analysis GITHUB Source Code: https://github.com/hoteit/sentiment-tweets
  • 30. Training Google AutoML for categorizing customer reviews Searched for a dataset on https://toolbox.google.com/data setsearch Found “100K+ Scraped Course Reviews from the Coursera Website (As of May 2017) Analyzed the data, cleaned when necessary (pretraining step) Created Google Cloud AutoML project & activiated NLP APIs, uploaded data No AI expertise needed! Dataset import Train/Evaluate/Predict model GitHub Source Code https://github.com/hoteit/coursereviews-automl
  • 31. Fun time with AWS DeepLens - Deep learning-enabled video camera Chose a project template on https://console.aws.amazon.com/deeplens Registered Deeplens Device & Deployed Project Model configured using SageMaker, in this case: SSD architecture with a ResNet-50 feature extractor on S3, accessible via Lambda

Notas del editor

  1. https://datasciencedegree.wisconsin.edu/blog/history-of-data-science/ http://stat-graphics.org/movies/prim9.html
  2. Photos from https://www.inexhibit.com/case-studies/olivetti-programma-101-at-the-origins-of-the-personal-computer/ Progamma 101 https://en.wikipedia.org/wiki/Programma_101#/media/File:Olivetti_Programma_101_-_Museo_scienza_e_tecnologia_Milano.jpg
  3. Apple Lisa taken from https://i2.wp.com/www.mac-history.net/wp-content/uploads/2007/10/Apple_Lisa_1.jpg?ssl=1
  4. https://www.nsf.gov/pubs/2005/nsb0540/ Full pdf https://www.nsf.gov/pubs/2005/nsb0540/nsb0540.pdf
  5. https://www.economist.com/special-report/2010/02/25/data-data-everywhere https://b-i.forbesimg.com/gilpress/files/2013/05/linkedin-data-science-chart1.png & https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/
  6. https://www.bbc.com/timelines/zq376fr#ztkpycw https://www.youtube.com/watch?v=ARJ8cAGm6JE
  7. https://www.actuaries.digital/2018/09/05/history-of-ai-winters/ https://courses.cs.washington.edu/courses/csep590/06au/projects/history-ai.pdf
  8. https://deepmind.com/blog/alphago-zero-learning-scratch/
  9. Image from http://mattturck.com/wp-content/uploads/2018/07/Matt_Turck_FirstMark_Big_Data_Landscape_2018_Final.png
  10. From https://www.kdnuggets.com/2017/01/data-science-puzzle-revisited.html
  11. F
  12. Link https://www.indeed.com/jobtrends/q-python-and-%22machine-learning%22-q-python-and-%22data-science%22-q-%22data-science%22-and-%22machine-learning%22-q-java-and-%22data-science%22-q-javascript-and-%22data-science%22-q-%22R%22-and-%22data-science%22-q-%22R%22-and-%22machine-learning%22.html