SlideShare a Scribd company logo
1 of 33
What is Data Science
• Data science, also known
as data-driven science, is
an interdisciplinary field
of scientific methods,
processes, algorithms and
systems to extract
knowledge or insights
from data in various forms,
either structured or
unstructured, similar
to datamining.
Need for DataScientists & Job opportunities
• Data volume is increasing in enterprises because of transactional data,
internet and mobile apps
• Decision making will have to be fast and accurate and should be
available at the point of need
• Without analytics its impossible to run large enterprises like Amazon,
Flipkart, Reliance Jio, Airtel, Citi Bank, Unilever, P&G, Google, IBM,
Microsoft, Alibaba, eBay, Tesco, Metro cash, Walmart…
• There is insufficient number of personnel skilled in analytics where as
demand is more
• Opportunities in Startups, IoT, Consumer Goods, eCommerce, KPOs,
BPOs, Telecom, B&F, Logistics, Utilities…
• Just browse through Naukri, Shine, Monster etc
What is Machine Learning?
• Machine learning teaches computers to do what comes naturally to
humans and animals: learn from experience.
• Machine learning algorithms use computational methods to “learn”
information directly from data without relying on a predetermined
equation as a model.
• The algorithms adaptively improve their performance as the number of
samples available for learning increases.
Real-World Applications
 With the rise in big data, machine learning has become particularly important
for solving problems in areas like these:
 Computational finance, for credit scoring and algorithmic trading
 Image processing and computer vision, for face recognition, motion
detection, and object detection
 Computational biology, for tumor detection, drug discovery, and DNA
sequencing
 Energy production, for price and load forecasting
 Automotive, aerospace, and manufacturing, for predictive maintenance
 Natural language processing
How Machine Learning Works
• Machine learning uses
two types of
techniques: supervised
learning, which trains a
model on known input
and output data so
that it can predict
future outputs, and
unsupervised learning,
which finds hidden
patterns or intrinsic
structures in input
data.
How Do You Decide Which Algorithm to Use?
• Algorithm
selection also
depends on
the size and
type of data
you’re
working with,
the insights
you want to
get from the
data, and how
those insights
will be used
When Should You Use Machine Learning?
• you have a complex task or problem involving a large
amount of data and lots of variables, but no existing
formula or equation
• Hand-written rules and equations are too complex—as in
face recognition and speech recognition
• The rules of a task are constantly changing—as in fraud
detection from transaction records.
• The nature of the data keeps changing, and the program
needs to adapt—as in automated trading,energy demand
forecasting, and predicting shopping trends.
Supervised Learning
• The aim of supervised machine learning is to build a model that makes
predictions based on evidence in the presence of uncertainty. A
supervised learning algorithm takes a known set of input data and known
responses to the data (output) and trains a model to generate reasonable
predictions for the response to new data.
• Supervised learning uses classification and regression techniques to
develop predictive models.
• Regression techniques predict continuous responses— for example,
changes in temperature or fluctuations in power demand. Typical
applications include electricity load forecasting and algorithmic trading.
• Classification techniques predict discrete responses—for example,
whether an email is genuine or spam, or whether a tumor is cancerous
or benign. Classification models
• classify input data into categories. Typical applications include medical
imaging, speech recognition, and credit scoring.
Unsupervised Learning
• Unsupervised learning finds hidden patterns or intrinsic structures in
data. It is used to draw inferences from datasets consisting of input
data without labeled responses.
• Clustering is the most common unsupervised learning technique. It is
used for exploratory data analysis to find hidden patterns or groupings
in data. Applications for clustering include gene sequence analysis,
market research, and object recognition.
Common Dimensionality Reduction Techniques
Why R?
• Free Software
• Versatile and crowd sourced for development
• Handle multiple platform
• End to End service in Data Science
• Functionality is divided into a number of packages
• Variety of analytical techniques 7000+ algorithms
• No restriction in length of column
• Integrates with other software
29
Data Types in R
• Vectors
• Matrices
• Arrays
• List
• DataFrame
30
Objects
• character
• numeric (real numbers)
• Integer
• Complex
• logical (True/False)
31
Data Operators
• Arithmetic+-*/%^
• Relational >=,<=,==,!=
• Logical ! and &
• Model Formula D ~ I
• Assignment = or <-
• List Index $
• Sequence :
32
Case Study
• Multiple Linear Regression Model
• Methods: All in, Step by Step, (Forward,Backward, Bi-directional),
Score comparison
• Independent Variables: R&D Spend, Administration, Marketing
Spend
• Dependent Variable: Profit
• Training Data 80% & Test Data 20%
33

More Related Content

Similar to What is Machine Learning.pptx

BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGUmair Shafique
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needGibDevs
 
Operations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressOperations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressElvis Muyanja
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Harbinger Systems - HRTech Builder of Choice
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data AnalyticsYASH GAIKWAD
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer AnalyticsCourse5i
 
Machine learning is the new BI
Machine learning is the new BIMachine learning is the new BI
Machine learning is the new BICycloides
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityAndry Alamsyah
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Joinasmeerana605
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & ApplicationsFazle Rabbi Ador
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxMadhumitha N
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxDr.Shweta
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxChitrachitrap
 
Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptxDr. Amanpreet Kaur
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 

Similar to What is Machine Learning.pptx (20)

BIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNINGBIG DATA AND MACHINE LEARNING
BIG DATA AND MACHINE LEARNING
 
Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
 
DOWLD SLIDES.pptx
DOWLD SLIDES.pptxDOWLD SLIDES.pptx
DOWLD SLIDES.pptx
 
NCCT.pptx
NCCT.pptxNCCT.pptx
NCCT.pptx
 
Operations Research and ICT A Keynote Address
Operations Research and ICT A Keynote AddressOperations Research and ICT A Keynote Address
Operations Research and ICT A Keynote Address
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
 
In-Depth Data Analytics
In-Depth Data AnalyticsIn-Depth Data Analytics
In-Depth Data Analytics
 
Machine Learning in Customer Analytics
Machine Learning in Customer AnalyticsMachine Learning in Customer Analytics
Machine Learning in Customer Analytics
 
Machine learning is the new BI
Machine learning is the new BIMachine learning is the new BI
Machine learning is the new BI
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
 
Machine Learning course in Chandigarh Join
Machine Learning course in Chandigarh JoinMachine Learning course in Chandigarh Join
Machine Learning course in Chandigarh Join
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Data Mining & Applications
Data Mining & ApplicationsData Mining & Applications
Data Mining & Applications
 
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptxINTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
INTRODUCTION TO DATA SCIENCE -CONCEPTS.pptx
 
unit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptxunit 1.2 supervised learning.pptx
unit 1.2 supervised learning.pptx
 
Unit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptxUnit 1-ML (1) (1).pptx
Unit 1-ML (1) (1).pptx
 
Introduction to Machine Learning.pptx
Introduction to Machine Learning.pptxIntroduction to Machine Learning.pptx
Introduction to Machine Learning.pptx
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 

Recently uploaded

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 

Recently uploaded (20)

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 

What is Machine Learning.pptx

  • 1. What is Data Science • Data science, also known as data-driven science, is an interdisciplinary field of scientific methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to datamining.
  • 2. Need for DataScientists & Job opportunities • Data volume is increasing in enterprises because of transactional data, internet and mobile apps • Decision making will have to be fast and accurate and should be available at the point of need • Without analytics its impossible to run large enterprises like Amazon, Flipkart, Reliance Jio, Airtel, Citi Bank, Unilever, P&G, Google, IBM, Microsoft, Alibaba, eBay, Tesco, Metro cash, Walmart… • There is insufficient number of personnel skilled in analytics where as demand is more • Opportunities in Startups, IoT, Consumer Goods, eCommerce, KPOs, BPOs, Telecom, B&F, Logistics, Utilities… • Just browse through Naukri, Shine, Monster etc
  • 3.
  • 4.
  • 5.
  • 6.
  • 7. What is Machine Learning? • Machine learning teaches computers to do what comes naturally to humans and animals: learn from experience. • Machine learning algorithms use computational methods to “learn” information directly from data without relying on a predetermined equation as a model. • The algorithms adaptively improve their performance as the number of samples available for learning increases.
  • 8. Real-World Applications  With the rise in big data, machine learning has become particularly important for solving problems in areas like these:  Computational finance, for credit scoring and algorithmic trading  Image processing and computer vision, for face recognition, motion detection, and object detection  Computational biology, for tumor detection, drug discovery, and DNA sequencing  Energy production, for price and load forecasting  Automotive, aerospace, and manufacturing, for predictive maintenance  Natural language processing
  • 9. How Machine Learning Works • Machine learning uses two types of techniques: supervised learning, which trains a model on known input and output data so that it can predict future outputs, and unsupervised learning, which finds hidden patterns or intrinsic structures in input data.
  • 10. How Do You Decide Which Algorithm to Use? • Algorithm selection also depends on the size and type of data you’re working with, the insights you want to get from the data, and how those insights will be used
  • 11. When Should You Use Machine Learning? • you have a complex task or problem involving a large amount of data and lots of variables, but no existing formula or equation • Hand-written rules and equations are too complex—as in face recognition and speech recognition • The rules of a task are constantly changing—as in fraud detection from transaction records. • The nature of the data keeps changing, and the program needs to adapt—as in automated trading,energy demand forecasting, and predicting shopping trends.
  • 12. Supervised Learning • The aim of supervised machine learning is to build a model that makes predictions based on evidence in the presence of uncertainty. A supervised learning algorithm takes a known set of input data and known responses to the data (output) and trains a model to generate reasonable predictions for the response to new data. • Supervised learning uses classification and regression techniques to develop predictive models. • Regression techniques predict continuous responses— for example, changes in temperature or fluctuations in power demand. Typical applications include electricity load forecasting and algorithmic trading. • Classification techniques predict discrete responses—for example, whether an email is genuine or spam, or whether a tumor is cancerous or benign. Classification models • classify input data into categories. Typical applications include medical imaging, speech recognition, and credit scoring.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. Unsupervised Learning • Unsupervised learning finds hidden patterns or intrinsic structures in data. It is used to draw inferences from datasets consisting of input data without labeled responses. • Clustering is the most common unsupervised learning technique. It is used for exploratory data analysis to find hidden patterns or groupings in data. Applications for clustering include gene sequence analysis, market research, and object recognition.
  • 23.
  • 24.
  • 25.
  • 26.
  • 28.
  • 29. Why R? • Free Software • Versatile and crowd sourced for development • Handle multiple platform • End to End service in Data Science • Functionality is divided into a number of packages • Variety of analytical techniques 7000+ algorithms • No restriction in length of column • Integrates with other software 29
  • 30. Data Types in R • Vectors • Matrices • Arrays • List • DataFrame 30
  • 31. Objects • character • numeric (real numbers) • Integer • Complex • logical (True/False) 31
  • 32. Data Operators • Arithmetic+-*/%^ • Relational >=,<=,==,!= • Logical ! and & • Model Formula D ~ I • Assignment = or <- • List Index $ • Sequence : 32
  • 33. Case Study • Multiple Linear Regression Model • Methods: All in, Step by Step, (Forward,Backward, Bi-directional), Score comparison • Independent Variables: R&D Spend, Administration, Marketing Spend • Dependent Variable: Profit • Training Data 80% & Test Data 20% 33