SlideShare una empresa de Scribd logo
1 de 38
BY :
RAGINI RATRE
EEE 7TH SEMESTER
Introduction Of ML
Different sub topics of AI
Types of Machine Learning
Different Languages use in ML
Advantages of R over other Languages
R Programming
Project on HEART DISEASE ANALYSIS WITH R
Content:
 In computer science machine learning refers to a type of data analysis
that uses algorithms that learn from data. It is a type of artificial
intelligence (AI) that provides systems with the ability to learn without
being explicitly programmed.
What is ML ??
Different sub topics of AI
• Neural Network
• Planning
• Robotics
• ML
• Natural Language processing
• Perception
• Knowledge
• Cognitive system
Types of Machine Learning
• Supervised
• Unsupervised
• Reinforcement
 R programming
 Python
 Java-Family / C-Family
 Matlab
 Mathematica
 Stata etc
Different Languages use in ML
Why We Used R in ML..???
why
why
why
why
why
why
why
why
why
• Reduced Coad length
• Availability(4000+ functions are available)
• Simple and easy to learn
• Free an Open source
ADVANTAGES OF R OVER OTHER LANGUAGES
R Programming
 myString <- "Hello, World!"
 > print ( myString)
 [1] "Hello, World!"
Basic Syntax
Data Type
• Logical v <- TRUE print(class(v))
• Numeric v <- 23.5 print(class(v))
• Integer v <- 2L print(class(v))
• Complex v <- 2+5i print(class(v))
• Character v <- "TRUE" print(class(v))
• Raw v <- charToRaw("Hello") print(class(v))
OPERATORS
• Add v <- c( 2,5.5,6)
• Sub t <- c(8, 3, 4)
• Mul print(v+t)
• Div etc print(v-t)
• print(v*t)
Accessing Vector Elements
R Programming
# Accessing vector elements using position.
t <- c("Sun","Mon","Tue","Wed","Thurs","Fri","Sat")
u <- t[c(2,3,6)];print(u)
OUTPUT: [1] "Mon" "Tue" "Fri"
# Accessing vector elements using logical indexing.
v <- t[c(TRUE,FALSE,FALSE,FALSE,FALSE,TRUE,FALSE)]
print(v)
[1] "Sun" "Fri"
# Accessing vector elements using negative indexing. x <-
t[c(-2,-5)] print(x)
[1] "Sun" "Tue" "Wed" "Fri" "Sat"
R Programming
# Create the predictor and response variable.
x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131)
y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48)
relation <- lm(y~x)
# Give the chart file a name.
png(file = "linearregression.png")
# Plot the chart.
plot(y,x,col = "blue",main = "Height & Weight Regression",
abline(lm(x~y)),cex = 1.3,pch = 16,xlab = "Weight in Kg",ylab = "Height in cm")
# Save the file.
dev.off()
Regression
R Programming
# Load the party package. It will automatically load other # dependent packages.
library(party)
# Create the input data frame.
input.dat <- readingSkills[c(1:105),]
# Give the chart file a name.
png(file = "decision_tree.png")
# Create the tree.
output.tree <- ctree( nativeSpeaker ~ age + shoeSize + score, data = input.dat)
# Plot the tree.
plot(output.tree)
# Save the file.
dev.off()
DECSION TREE
R Programming
OUTPUT
null device 1
Loading required package: methods
Loading required package: grid
Loading required package: mvtnorm
Loading required package: modeltools
Loading required package: stats4
Loading required package: strucchange
Loading required package: zoo
Attaching package: ‘zoo’
The following objects are masked from ‘package:base’:
as.Date,as.Date.numeric
Loading required package: sandwich
R Programming
Graphs and Plots
# Create data for the graph.
x <- c(21, 62, 10, 53)
labels <- c("London", "New York", "Singapore", "Mumbai")
# Give the chart file a name.
png(file = "city.jpg")
# Plot the chart. pie(x,labels)
# Save the file.
dev.off()
Pi CHART
R Programming
Graphs and Plots
# Give the chart file a name.
png(file = "boxplot.png")
# Plot the chart.
boxplot(mpg ~ cyl, data = mtcars, xlab =
"Number of Cylinders", ylab = "Miles Per
Gallon", main = "Mileage Data")
# Save the file. dev.off()
BOX PLOT
R Programming
Graphs and Plots
# Create the data for the chart
H <- c(7,12,28,3,41)
# Give the chart file a name
png(file = "barchart.png")
# Plot the bar chart
barplot(H)
# Save the file
dev.off()
BAR PLOTS
R Programming
Graphs and Plots
# Create the data for the chart.
v <- c(7,12,28,3,41)
# Give the chart file a name.
png(file = "line_chart.jpg")
# Plot the bar chart. plot(v,type = "o")
# Save the file.
dev.off()
LINE GRAPHS
 Image Recognition
 Speech Recognition
 Medical Diagnosis
 Statistical Arbitrage
 Prediction
APPLICATIONs OF ML
FEW STEPS TO HANDLE THE DATA
• Data Selection
Consider what data is available, what data is missing and what data can be
removed.
• Data Preprocessing
Organize your selected data by formatting, cleaning and sampling from it.
• Data Transformation
Transform preprocessed data ready for machine learning by engineering
features using scaling, attribute decomposition and attribute aggregation.
 Identify and handle missing values
 Data FORMATTING
 DATA NORMALIZATION (CENTERING/SCALING)
 DATA BINNING
PRE-PROCESSING DATA IN R
 Missing value occur when no data value is stored for a variable(feature) in an
observation
 Could be represented as “?” , “N/A” , 0 or Just blank cell
Drop the missing value:
 Drop the variable
 Drop the data entry
Replace the missing value:
 Replace it with an average of similar datapoint
 Replace it by frequency
 Replace it based on other function
Identify and handle missing values
 Applying calculation to an entire column
 Like conversion of data like from meter to kilometer
Data FORMATTING
 My making the range consistent between variables , normalization
enables a fairer comparison between features
 Few methods:
1. Simple Feature Scaling x(new)=x(old)/x(max)
2. Min-Max x(new) = (x(old)-x(min))/(x(max)-x(min))
DATA NORMALIZATION (CENTERING/SCALING)
 Binning Grouping of values into “bins”
 Converts numeric into categorical variables
 Grouping a set of numerical values into subset “bins”
DATA BINNING
Data preparation / transformation is a large subject that can involve a lot of
iterations, exploration and analysis. Getting good at data preparation will
make you a master at machine learning.
DATA TRANSFORMATION
PROJECT ON Heart disease
• First we need .csv file here csv stands for comma separated values
>>> getwd()
>>> setwd("C:UsersVinitDesktopMachineLearning")
>>> table1<-read.table("heart_disease_male.csv",header=T,sep=",")
• We can also use read.csv() function in place of read.table() as it is a csv file
Output:
age chest_pain rest_bpress blood_sugar rest_electro max_heart_rate exercice_angina disease
1 43 asympt 140 f normal 135 yes 1
2 39 atyp_angina 120 f normal 160 yes 0
3 39 non_anginal 160 t normal 160 no 0
4 42 non_anginal 160 f normal 146 no 0
5 49 asympt 140 f normal 130 no
0
6 50 asympt 140 f normal 135 no 0
>>>head(table1)
table(is.na(table1$AGE))
table(is.na(table1$CHEST_PAIN))
table(is.na(table1$REST_BPRESS))
table(is.na(table1$BLOOD_SUGAR))
table(is.na(table1$REST_ELECTRO))
table(is.na(table1$MAX_HEART_RATE))
table(is.na(table1$EXERCICE_ANGINA))
table(is.na(table1$DISEASE))
HANDLE MISSING DATA
OUTPUT:
FALSE
209
So, there is no need to
handle data as there is no
missing values
DATA visualization
>>>library(ggplot2)
>>> x<-ggplot(table1)
>>> x+geom_bar(aes(x=age))
>>> x+geom_bar(aes(x=blood_sugar))
>>> x+geom_density(aes(max_heart_rate))
Output:
>>>x<-ggplot(table1,aes(y=max_heart_rate,x=disease))
>>>x+geom_violin()+geom_jitter()
Output:
 model1<- glm(disease~.,family="binomial") #take all the columns in the model
summary(model1)
Prediction
 model2=glm(disease~chest_pain+exercise_angina+max_heart_rate,
training,family=”binomial”)
 summary(model2)
 In conclusion, Machine learning is an incredible breakthrough in the field
of artificial intelligence. While it does have some frightening implications
when you think about it, these Machine Learning Applications are several
of the many ways this technology can improve our lives.
Conclusion
PPT ON MACHINE LEARNING by Ragini Ratre

Más contenido relacionado

La actualidad más candente

Grouping & Summarizing Data in R
Grouping & Summarizing Data in RGrouping & Summarizing Data in R
Grouping & Summarizing Data in RJeffrey Breen
 
Is there a perfect data-parallel programming language? (Experiments with More...
Is there a perfect data-parallel programming language? (Experiments with More...Is there a perfect data-parallel programming language? (Experiments with More...
Is there a perfect data-parallel programming language? (Experiments with More...Julian Hyde
 
Move your data (Hans Rosling style) with googleVis + 1 line of R code
Move your data (Hans Rosling style) with googleVis + 1 line of R codeMove your data (Hans Rosling style) with googleVis + 1 line of R code
Move your data (Hans Rosling style) with googleVis + 1 line of R codeJeffrey Breen
 
Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Guy Lebanon
 
Introduction to Pandas and Time Series Analysis [PyCon DE]
Introduction to Pandas and Time Series Analysis [PyCon DE]Introduction to Pandas and Time Series Analysis [PyCon DE]
Introduction to Pandas and Time Series Analysis [PyCon DE]Alexander Hendorf
 
R-programming-training-in-mumbai
R-programming-training-in-mumbaiR-programming-training-in-mumbai
R-programming-training-in-mumbaiUnmesh Baile
 
3 R Tutorial Data Structure
3 R Tutorial Data Structure3 R Tutorial Data Structure
3 R Tutorial Data StructureSakthi Dasans
 
Python for R Users
Python for R UsersPython for R Users
Python for R UsersAjay Ohri
 
5 R Tutorial Data Visualization
5 R Tutorial Data Visualization5 R Tutorial Data Visualization
5 R Tutorial Data VisualizationSakthi Dasans
 
Data Analysis with Python Pandas
Data Analysis with Python PandasData Analysis with Python Pandas
Data Analysis with Python PandasNeeru Mittal
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyrRomain Francois
 
Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Milind Bhandarkar
 
Data profiling with Apache Calcite
Data profiling with Apache CalciteData profiling with Apache Calcite
Data profiling with Apache CalciteJulian Hyde
 
Introduction to r studio on aws 2020 05_06
Introduction to r studio on aws 2020 05_06Introduction to r studio on aws 2020 05_06
Introduction to r studio on aws 2020 05_06Barry DeCicco
 
Big Data Mining in Indian Economic Survey 2017
Big Data Mining in Indian Economic Survey 2017Big Data Mining in Indian Economic Survey 2017
Big Data Mining in Indian Economic Survey 2017Parth Khare
 
R reference card
R reference cardR reference card
R reference cardHesher Shih
 

La actualidad más candente (20)

Grouping & Summarizing Data in R
Grouping & Summarizing Data in RGrouping & Summarizing Data in R
Grouping & Summarizing Data in R
 
R programming language
R programming languageR programming language
R programming language
 
Is there a perfect data-parallel programming language? (Experiments with More...
Is there a perfect data-parallel programming language? (Experiments with More...Is there a perfect data-parallel programming language? (Experiments with More...
Is there a perfect data-parallel programming language? (Experiments with More...
 
Move your data (Hans Rosling style) with googleVis + 1 line of R code
Move your data (Hans Rosling style) with googleVis + 1 line of R codeMove your data (Hans Rosling style) with googleVis + 1 line of R code
Move your data (Hans Rosling style) with googleVis + 1 line of R code
 
Data Analysis with R (combined slides)
Data Analysis with R (combined slides)Data Analysis with R (combined slides)
Data Analysis with R (combined slides)
 
Introduction to Pandas and Time Series Analysis [PyCon DE]
Introduction to Pandas and Time Series Analysis [PyCon DE]Introduction to Pandas and Time Series Analysis [PyCon DE]
Introduction to Pandas and Time Series Analysis [PyCon DE]
 
R-programming-training-in-mumbai
R-programming-training-in-mumbaiR-programming-training-in-mumbai
R-programming-training-in-mumbai
 
R programming by ganesh kavhar
R programming by ganesh kavharR programming by ganesh kavhar
R programming by ganesh kavhar
 
3 R Tutorial Data Structure
3 R Tutorial Data Structure3 R Tutorial Data Structure
3 R Tutorial Data Structure
 
Python for R Users
Python for R UsersPython for R Users
Python for R Users
 
5 R Tutorial Data Visualization
5 R Tutorial Data Visualization5 R Tutorial Data Visualization
5 R Tutorial Data Visualization
 
Data Analysis with Python Pandas
Data Analysis with Python PandasData Analysis with Python Pandas
Data Analysis with Python Pandas
 
Machine Learning in R
Machine Learning in RMachine Learning in R
Machine Learning in R
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyr
 
Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011Modeling with Hadoop kdd2011
Modeling with Hadoop kdd2011
 
Data profiling with Apache Calcite
Data profiling with Apache CalciteData profiling with Apache Calcite
Data profiling with Apache Calcite
 
Map/Reduce intro
Map/Reduce introMap/Reduce intro
Map/Reduce intro
 
Introduction to r studio on aws 2020 05_06
Introduction to r studio on aws 2020 05_06Introduction to r studio on aws 2020 05_06
Introduction to r studio on aws 2020 05_06
 
Big Data Mining in Indian Economic Survey 2017
Big Data Mining in Indian Economic Survey 2017Big Data Mining in Indian Economic Survey 2017
Big Data Mining in Indian Economic Survey 2017
 
R reference card
R reference cardR reference card
R reference card
 

Similar a PPT ON MACHINE LEARNING by Ragini Ratre

R Programming - part 1.pdf
R Programming - part 1.pdfR Programming - part 1.pdf
R Programming - part 1.pdfRohanBorgalli
 
Hands on Mahout!
Hands on Mahout!Hands on Mahout!
Hands on Mahout!OSCON Byrum
 
Data Analytics with R and SQL Server
Data Analytics with R and SQL ServerData Analytics with R and SQL Server
Data Analytics with R and SQL ServerStéphane Fréchette
 
RDataMining slides-r-programming
RDataMining slides-r-programmingRDataMining slides-r-programming
RDataMining slides-r-programmingYanchang Zhao
 
The Very ^ 2 Basics of R
The Very ^ 2 Basics of RThe Very ^ 2 Basics of R
The Very ^ 2 Basics of RWinston Chen
 
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners Jen Stirrup
 
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)Serban Tanasa
 
Slides on introduction to R by ArinBasu MD
Slides on introduction to R by ArinBasu MDSlides on introduction to R by ArinBasu MD
Slides on introduction to R by ArinBasu MDSonaCharles2
 
Data analysis in R
Data analysis in RData analysis in R
Data analysis in RAndrew Lowe
 
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptxfINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptxdataKarthik
 
Unit 2 - Data Manipulation with R.pptx
Unit 2 - Data Manipulation with R.pptxUnit 2 - Data Manipulation with R.pptx
Unit 2 - Data Manipulation with R.pptxMalla Reddy University
 
R for Pirates. ESCCONF October 27, 2011
R for Pirates. ESCCONF October 27, 2011R for Pirates. ESCCONF October 27, 2011
R for Pirates. ESCCONF October 27, 2011Mandi Walls
 
Unit 4_Working with Graphs _python (2).pptx
Unit 4_Working with Graphs _python (2).pptxUnit 4_Working with Graphs _python (2).pptx
Unit 4_Working with Graphs _python (2).pptxprakashvs7
 

Similar a PPT ON MACHINE LEARNING by Ragini Ratre (20)

R Introduction
R IntroductionR Introduction
R Introduction
 
R workshop
R workshopR workshop
R workshop
 
R Programming - part 1.pdf
R Programming - part 1.pdfR Programming - part 1.pdf
R Programming - part 1.pdf
 
Hands on Mahout!
Hands on Mahout!Hands on Mahout!
Hands on Mahout!
 
Data Analytics with R and SQL Server
Data Analytics with R and SQL ServerData Analytics with R and SQL Server
Data Analytics with R and SQL Server
 
RDataMining slides-r-programming
RDataMining slides-r-programmingRDataMining slides-r-programming
RDataMining slides-r-programming
 
The Very ^ 2 Basics of R
The Very ^ 2 Basics of RThe Very ^ 2 Basics of R
The Very ^ 2 Basics of R
 
e_lumley.pdf
e_lumley.pdfe_lumley.pdf
e_lumley.pdf
 
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
 
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
Get up to Speed (Quick Guide to data.table in R and Pentaho PDI)
 
17641.ppt
17641.ppt17641.ppt
17641.ppt
 
Slides on introduction to R by ArinBasu MD
Slides on introduction to R by ArinBasu MDSlides on introduction to R by ArinBasu MD
Slides on introduction to R by ArinBasu MD
 
17641.ppt
17641.ppt17641.ppt
17641.ppt
 
Data analysis in R
Data analysis in RData analysis in R
Data analysis in R
 
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptxfINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
fINAL Lesson_5_Data_Manipulation_using_R_v1.pptx
 
Aggregate.pptx
Aggregate.pptxAggregate.pptx
Aggregate.pptx
 
Unit 2 - Data Manipulation with R.pptx
Unit 2 - Data Manipulation with R.pptxUnit 2 - Data Manipulation with R.pptx
Unit 2 - Data Manipulation with R.pptx
 
R for Pirates. ESCCONF October 27, 2011
R for Pirates. ESCCONF October 27, 2011R for Pirates. ESCCONF October 27, 2011
R for Pirates. ESCCONF October 27, 2011
 
R programmingmilano
R programmingmilanoR programmingmilano
R programmingmilano
 
Unit 4_Working with Graphs _python (2).pptx
Unit 4_Working with Graphs _python (2).pptxUnit 4_Working with Graphs _python (2).pptx
Unit 4_Working with Graphs _python (2).pptx
 

Último

KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Mater
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 

Último (20)

KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)Ahmed Motair CV April 2024 (Senior SW Developer)
Ahmed Motair CV April 2024 (Senior SW Developer)
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 

PPT ON MACHINE LEARNING by Ragini Ratre

  • 1. BY : RAGINI RATRE EEE 7TH SEMESTER
  • 2. Introduction Of ML Different sub topics of AI Types of Machine Learning Different Languages use in ML Advantages of R over other Languages R Programming Project on HEART DISEASE ANALYSIS WITH R Content:
  • 3.  In computer science machine learning refers to a type of data analysis that uses algorithms that learn from data. It is a type of artificial intelligence (AI) that provides systems with the ability to learn without being explicitly programmed. What is ML ??
  • 4. Different sub topics of AI • Neural Network • Planning • Robotics • ML • Natural Language processing • Perception • Knowledge • Cognitive system
  • 5.
  • 6. Types of Machine Learning • Supervised • Unsupervised • Reinforcement
  • 7.  R programming  Python  Java-Family / C-Family  Matlab  Mathematica  Stata etc Different Languages use in ML
  • 8. Why We Used R in ML..??? why why why why why why why why why
  • 9. • Reduced Coad length • Availability(4000+ functions are available) • Simple and easy to learn • Free an Open source ADVANTAGES OF R OVER OTHER LANGUAGES
  • 10. R Programming  myString <- "Hello, World!"  > print ( myString)  [1] "Hello, World!" Basic Syntax Data Type • Logical v <- TRUE print(class(v)) • Numeric v <- 23.5 print(class(v)) • Integer v <- 2L print(class(v)) • Complex v <- 2+5i print(class(v)) • Character v <- "TRUE" print(class(v)) • Raw v <- charToRaw("Hello") print(class(v)) OPERATORS • Add v <- c( 2,5.5,6) • Sub t <- c(8, 3, 4) • Mul print(v+t) • Div etc print(v-t) • print(v*t)
  • 11. Accessing Vector Elements R Programming # Accessing vector elements using position. t <- c("Sun","Mon","Tue","Wed","Thurs","Fri","Sat") u <- t[c(2,3,6)];print(u) OUTPUT: [1] "Mon" "Tue" "Fri" # Accessing vector elements using logical indexing. v <- t[c(TRUE,FALSE,FALSE,FALSE,FALSE,TRUE,FALSE)] print(v) [1] "Sun" "Fri" # Accessing vector elements using negative indexing. x <- t[c(-2,-5)] print(x) [1] "Sun" "Tue" "Wed" "Fri" "Sat"
  • 12. R Programming # Create the predictor and response variable. x <- c(151, 174, 138, 186, 128, 136, 179, 163, 152, 131) y <- c(63, 81, 56, 91, 47, 57, 76, 72, 62, 48) relation <- lm(y~x) # Give the chart file a name. png(file = "linearregression.png") # Plot the chart. plot(y,x,col = "blue",main = "Height & Weight Regression", abline(lm(x~y)),cex = 1.3,pch = 16,xlab = "Weight in Kg",ylab = "Height in cm") # Save the file. dev.off() Regression
  • 13. R Programming # Load the party package. It will automatically load other # dependent packages. library(party) # Create the input data frame. input.dat <- readingSkills[c(1:105),] # Give the chart file a name. png(file = "decision_tree.png") # Create the tree. output.tree <- ctree( nativeSpeaker ~ age + shoeSize + score, data = input.dat) # Plot the tree. plot(output.tree) # Save the file. dev.off() DECSION TREE
  • 14. R Programming OUTPUT null device 1 Loading required package: methods Loading required package: grid Loading required package: mvtnorm Loading required package: modeltools Loading required package: stats4 Loading required package: strucchange Loading required package: zoo Attaching package: ‘zoo’ The following objects are masked from ‘package:base’: as.Date,as.Date.numeric Loading required package: sandwich
  • 15. R Programming Graphs and Plots # Create data for the graph. x <- c(21, 62, 10, 53) labels <- c("London", "New York", "Singapore", "Mumbai") # Give the chart file a name. png(file = "city.jpg") # Plot the chart. pie(x,labels) # Save the file. dev.off() Pi CHART
  • 16. R Programming Graphs and Plots # Give the chart file a name. png(file = "boxplot.png") # Plot the chart. boxplot(mpg ~ cyl, data = mtcars, xlab = "Number of Cylinders", ylab = "Miles Per Gallon", main = "Mileage Data") # Save the file. dev.off() BOX PLOT
  • 17. R Programming Graphs and Plots # Create the data for the chart H <- c(7,12,28,3,41) # Give the chart file a name png(file = "barchart.png") # Plot the bar chart barplot(H) # Save the file dev.off() BAR PLOTS
  • 18. R Programming Graphs and Plots # Create the data for the chart. v <- c(7,12,28,3,41) # Give the chart file a name. png(file = "line_chart.jpg") # Plot the bar chart. plot(v,type = "o") # Save the file. dev.off() LINE GRAPHS
  • 19.  Image Recognition  Speech Recognition  Medical Diagnosis  Statistical Arbitrage  Prediction APPLICATIONs OF ML
  • 20. FEW STEPS TO HANDLE THE DATA • Data Selection Consider what data is available, what data is missing and what data can be removed. • Data Preprocessing Organize your selected data by formatting, cleaning and sampling from it. • Data Transformation Transform preprocessed data ready for machine learning by engineering features using scaling, attribute decomposition and attribute aggregation.
  • 21.  Identify and handle missing values  Data FORMATTING  DATA NORMALIZATION (CENTERING/SCALING)  DATA BINNING PRE-PROCESSING DATA IN R
  • 22.  Missing value occur when no data value is stored for a variable(feature) in an observation  Could be represented as “?” , “N/A” , 0 or Just blank cell Drop the missing value:  Drop the variable  Drop the data entry Replace the missing value:  Replace it with an average of similar datapoint  Replace it by frequency  Replace it based on other function Identify and handle missing values
  • 23.  Applying calculation to an entire column  Like conversion of data like from meter to kilometer Data FORMATTING
  • 24.  My making the range consistent between variables , normalization enables a fairer comparison between features  Few methods: 1. Simple Feature Scaling x(new)=x(old)/x(max) 2. Min-Max x(new) = (x(old)-x(min))/(x(max)-x(min)) DATA NORMALIZATION (CENTERING/SCALING)
  • 25.  Binning Grouping of values into “bins”  Converts numeric into categorical variables  Grouping a set of numerical values into subset “bins” DATA BINNING
  • 26. Data preparation / transformation is a large subject that can involve a lot of iterations, exploration and analysis. Getting good at data preparation will make you a master at machine learning. DATA TRANSFORMATION
  • 27. PROJECT ON Heart disease • First we need .csv file here csv stands for comma separated values >>> getwd() >>> setwd("C:UsersVinitDesktopMachineLearning") >>> table1<-read.table("heart_disease_male.csv",header=T,sep=",") • We can also use read.csv() function in place of read.table() as it is a csv file
  • 28. Output: age chest_pain rest_bpress blood_sugar rest_electro max_heart_rate exercice_angina disease 1 43 asympt 140 f normal 135 yes 1 2 39 atyp_angina 120 f normal 160 yes 0 3 39 non_anginal 160 t normal 160 no 0 4 42 non_anginal 160 f normal 146 no 0 5 49 asympt 140 f normal 130 no 0 6 50 asympt 140 f normal 135 no 0 >>>head(table1)
  • 30. DATA visualization >>>library(ggplot2) >>> x<-ggplot(table1) >>> x+geom_bar(aes(x=age)) >>> x+geom_bar(aes(x=blood_sugar)) >>> x+geom_density(aes(max_heart_rate))
  • 32.
  • 35.  model1<- glm(disease~.,family="binomial") #take all the columns in the model summary(model1) Prediction
  • 37.  In conclusion, Machine learning is an incredible breakthrough in the field of artificial intelligence. While it does have some frightening implications when you think about it, these Machine Learning Applications are several of the many ways this technology can improve our lives. Conclusion