SlideShare una empresa de Scribd logo
1 de 24
PRESENT BY: ABDUL AHAD ABRO
1
Data Science & Predictive Analytics
Computer Engineering Department, Ege University, Turkey
Presentations 2
Veri Bilimi ve Tahmin Edici
Analizler
May 16-2017
 Veri Bilimi ( Data Science )
 Tahmin Edici Analizler ( Predictive Analytics )
 Makine öğrenme ( Machine Learning )
 Makine Öğrenme Algoritması ( Machine Learning Algorithm )
 Regression & Classification
 Microsoft Azure Machine Learning Studio
 Regression with Microsoft Excel
 Academic Studies / Articles / Publications
Agenda
2
Or
Data Science is an umbrella that contain many other fields like Machine learning, Data
Mining, big Data, statistics, Data visualization and data analytics.
What is Data Science ? Veri Bilimi nedir
Data science also known as data-driven science, is
an interdisciplinary field about scientific methods,
processes and systems to extract knowledge from
data in various forms, either structured or
unstructured, similar to Knowledge Discovery in
Databases (KDD) [4].
3
Predictive Analytics Tahmin Edici Analizler
Predictive analytics is the branch of the advanced analytics which is
used to make predictions about unknown future events. Predictive
analytics uses many techniques from data mining, statistics,
modeling, machine learning, and artificial intelligence to analyze
current data to make predictions about future. It uses a number of
data mining, and analytical techniques to bring together the
management, information technology, and modeling business
process to make predictions about future. The patterns found in
historical and transactional data can be used to identify risks and
opportunities for future.
4
Prediction (Making Decisions)
5
Machine Learning Makine öğrenme
A branch of artificial intelligence, concerned with the design and development of
algorithms that allow computers to evolve behaviors based on empirical data.
Construction and study of systems that can learn from data.
6
Machine Learning Algorithm
Supervised learning ---
Predicting the Future..
Learn from the past example to predict future.
Unsupervised learning ---
Understanding the past
Making Sense of Data.
Learning Structure of Data
Compressing data for consumption.
Semi-supervised learning --- A mix of supervised and unsupervised
learning.
Reinforcement learning --- Allows the machine or software agent to
learn its behavior based on feedback from the environment. This
behavior can be learnt once and for all, or keep on adapting as time
goes by.
7
8
Regression:: In supervised learning target variables in regression must be continuous.
Categorical target variables are modelled in classification.
Regression has less or even no emphasis on using probability to describe the random
variation between the predictor and the target.
Regression is used to predict continuous values.
9
Classification:: Classification is used to predict which class a data point is part of (discrete value).
The Training data (observations, measurements, etc.) are accompanied by labels
indicating the class of observations.
New data is classified based on the training set.
10
Abstract
The use of the term Data Science is becoming increasingly common. The Machine
learning specifically supervised learning, classification and regression process state with
Microsoft Azure Machine learning studio and resultant compare with Microsoft excel
classification and regression for sophistication and accuracy of result. In this mean
process classification and linear regression result in both cases consider probably same
as in equally data set utilized..
11
Keywords
“Data Science”, “Machine Learning”, “Predictive Analysis”,
“Future Predict”, “Data Mining”.
12
Introduction
Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights
from data in various forms. Data Science is not just Machine Learning. Data science is a concept to
unify statistics, data analysis and their related methods in order to understand and analyze actual
phenomena with data.
13
Microsoft Azure Machine Learning Studio
Model of Linear Regression
14
Microsoft Azure Machine Learning Studio
Result of Linear Regression
15
Methodology
Supervised machine learning methods for regression and classification have been designed mostly for
data types that lie in vector spaces i.e. response (i.e. output) and/or predictor (i.e. input) variables are
often arranged into vectors of predefined dimensionality [2]. In this methodology the machine
learning specifically classification and linear regression have used. For sake of that research
methodology data sets is required for performing the classification and linear regression methods.
Many data sets available online resources but the most sophisticated, accurate and authentic data
sets utilized in this research which got from Microsoft Azure Machine learning studio data sets. The
data sets named as Automobile with multivariate data type includes 205 Instances with 26 attributes.
In this data sets various specification of have discussed regarding automobiles like fuel type,
aspiration, number of doors, engine-location, length, width, height, horsepower, city-mpg, highway-
mpg, peak-rpm, price and so on.
16
Methodology (continue)
The linear regression and classification process take place over Microsoft azure machine learning
studio, over attributes which is based on dependent and independent variables of data sets where
built in saved data sets, trained model, machine learning, score model and evaluation model are
present for ready to use. Moreover for perfection and comparing the result of Microsoft azure
machine learning studio, Microsoft excel program has used for linear regression by utilizing the same
dataset and comparing the final result.
17
Microsoft Excel
Dataset (Attributes & Instances)
Linear Regression
Academic Studies
18
Regression and classification using extreme learning machine based
on L1-norm and L2-norm. (Least Absolute Deviations and Least Squares)
Extreme learning machine (ELM) is a very simple machine learning algorithm and it can achieve a good
generalization performance with extremely fast speed. Therefore it has practical significance for data analysis in
real-world application. At the information stage, there has been a growing interest in the study of data analysis
techniques. Techniques of data analysis can extract previously unknown, hidden, but potentially useful
information and knowledge from original data, which is helpful to provide suggestions or decisions for future
actions .
Data analysis plays a huge guidance role for making future plans in practical applications. In this paper, a novel
algorithm called L1–L2-ELM was proposed as an effective technology in data analysis. It can deal with multiple-
output regression and multiple-class classification problems in a unified framework.
Xiong Luo, Xiaohui Chang, Xiaojuan Ban
Academic Studies
19
Minimal Learning Machine: A novel supervised distance-based approach for
Regression and classification.
Supervised machine learning methods for regression and classification have been designed mostly for data types
that lie in vector spaces, i.e. response (i.e output) and/or predictor (i.e. input)variables are often arranged in to
vectors of predefined dimensionality. There are other types of data, however such as graphs, sequences, shapes,
images, trees and covariance matrices, which are less amenable to being treated within standard
regression/classification frameworks. These data types usually do not lie in a natural vector space, but rather in a
metric space.
For regression tasks, there are some prior works in which the response and/or the predictor variables are
expressed as distance (i.e. dissimilarity) matrices. A comprehensive set of computer experiments illustrates that
the proposed method achieves accuracies that are comparable to more traditional machine learning methods for
regression and classification thus offering a computationally valid alternative to such approaches.
Amauri Holanda de Souza Júnior, Francesco Corona, Guilherme A. Barreto b,Yoan Miche, Amaury Lendasse
Python Code
20
# Required Packages
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from sklearn import datasets, linear_model
# Function to get data
def get_data(file_name):
data = pd.read_csv(file_name)
x_parameter = []
y_parameter = []
for single_square_feet ,single_price_value in
zip(data['square_feet'],data['price']):
x_parameter.append([float(single_square_feet)])
y_parameter.append(float(single_price_value))
return x_parameter,y_parameter
x,y = get_data('input_data.csv')
print x
print y
Python Code
21
# Function for Fitting data to Linear model
def
linear_model_main(X_parameters,Y_parameters,predict_
value):
# Create linear regression object
regr = linear_model.LinearRegression()
regr.fit(X_parameters, Y_parameters)
predict_outcome = regr.predict(predict_value)
predictions = {}
predictions['intercept'] = regr.intercept_
predictions['coefficient'] = regr.coef_
predictions['predicted_value'] = predict_outcome
return predictions
x,y = get_data('input_data.csv')
predict_value = 700
result = linear_model_main(x,y,predict_value)
print "Intercept value " , result['intercept']
print "coefficient" , result['coefficient']
print "Predicted value: ",result['predicted_value']
# Function to show the resutls of linear fit model
def show_linear_line(X_parameters,Y_parameters):
# Create linear regression object
regr = linear_model.LinearRegression()
regr.fit(X_parameters, Y_parameters)
plt.scatter(X_parameters,Y_parameters,color='blue')
plt.plot(X_parameters,regr.predict(X_parameters),color='red',li
newidth=4)
plt.xticks(())
plt.yticks(())
plt.show()
show_linear_line(X,Y)
References
[1] Xiong Luo, Xiaohui Chang, Xiaojuan Ban (2015). Regression and classification using extreme
learning machine based on L1-norm and L2-norm. (Least Absolute Deviations and Least Squares).
[2] Amauri Holanda de Souza Júnior, Francesco Corona, Guilherme A. Barreto b,Yoan Miche, Amaury
Lendasse (2015). Minimal Learning Machine: A novel supervised distance-based approach for
regression and classification.
[3] Sumit Mund (2015). Microsoft Azure Machine Learning.
[4] https://en.wikipedia.org/wiki/Data_science
22
23
24

Más contenido relacionado

La actualidad más candente

Comparative Analysis: Effective Information Retrieval Using Different Learnin...
Comparative Analysis: Effective Information Retrieval Using Different Learnin...Comparative Analysis: Effective Information Retrieval Using Different Learnin...
Comparative Analysis: Effective Information Retrieval Using Different Learnin...RSIS International
 
Trending Topics in Machine Learning
Trending Topics in Machine LearningTrending Topics in Machine Learning
Trending Topics in Machine LearningTechsparks
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Usama Fayyaz
 
Machine learning ICT
Machine learning ICTMachine learning ICT
Machine learning ICTMaheenDilawar
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsDinusha Dilanka
 
Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive ModellingAmit Kumar
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETEditor IJMTER
 
A study on rough set theory based
A study on rough set theory basedA study on rough set theory based
A study on rough set theory basedijaia
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistancePhD Assistance
 
MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXmlaij
 
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Jerrin George
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsVidya sagar Sharma
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt Poojamanic
 

La actualidad más candente (18)

Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Comparative Analysis: Effective Information Retrieval Using Different Learnin...
Comparative Analysis: Effective Information Retrieval Using Different Learnin...Comparative Analysis: Effective Information Retrieval Using Different Learnin...
Comparative Analysis: Effective Information Retrieval Using Different Learnin...
 
Trending Topics in Machine Learning
Trending Topics in Machine LearningTrending Topics in Machine Learning
Trending Topics in Machine Learning
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
 
Machine learning ICT
Machine learning ICTMachine learning ICT
Machine learning ICT
 
Performance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning AlgorithmsPerformance Comparision of Machine Learning Algorithms
Performance Comparision of Machine Learning Algorithms
 
Internship project report,Predictive Modelling
Internship project report,Predictive ModellingInternship project report,Predictive Modelling
Internship project report,Predictive Modelling
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 
A study on rough set theory based
A study on rough set theory basedA study on rough set theory based
A study on rough set theory based
 
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - PhdassistanceSelecting the Right Type of Algorithm for Various Applications - Phdassistance
Selecting the Right Type of Algorithm for Various Applications - Phdassistance
 
MACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOXMACHINE LEARNING TOOLBOX
MACHINE LEARNING TOOLBOX
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
 
Mis End Term Exam Theory Concepts
Mis End Term Exam Theory ConceptsMis End Term Exam Theory Concepts
Mis End Term Exam Theory Concepts
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt
 
Eckovation Machine Learning
Eckovation Machine LearningEckovation Machine Learning
Eckovation Machine Learning
 
Machine learning
 Machine learning Machine learning
Machine learning
 

Similar a Regression with Microsoft Azure & Ms Excel

Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduceVarad Meru
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationAnkit Gupta
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerIJERA Editor
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...IRJET Journal
 
An Overview of Supervised Machine Learning Paradigms and their Classifiers
An Overview of Supervised Machine Learning Paradigms and their ClassifiersAn Overview of Supervised Machine Learning Paradigms and their Classifiers
An Overview of Supervised Machine Learning Paradigms and their ClassifiersIJAEMSJORNAL
 
A machine learning model for predicting innovation effort of firms
A machine learning model for predicting innovation effort of  firmsA machine learning model for predicting innovation effort of  firms
A machine learning model for predicting innovation effort of firmsIJECEIAES
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learningJohnson Ubah
 
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET Journal
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYEditor IJMTER
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applicationsBenjaminlapid1
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning BasicsSuresh Arora
 
Regression and Artificial Neural Network in R
Regression and Artificial Neural Network in RRegression and Artificial Neural Network in R
Regression and Artificial Neural Network in RDr. Vaibhav Kumar
 
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET Journal
 

Similar a Regression with Microsoft Azure & Ms Excel (20)

Machine Learning.pptx
Machine Learning.pptxMachine Learning.pptx
Machine Learning.pptx
 
Data clustering using map reduce
Data clustering using map reduceData clustering using map reduce
Data clustering using map reduce
 
Intro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning PresentationIntro/Overview on Machine Learning Presentation
Intro/Overview on Machine Learning Presentation
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
 
An Overview of Supervised Machine Learning Paradigms and their Classifiers
An Overview of Supervised Machine Learning Paradigms and their ClassifiersAn Overview of Supervised Machine Learning Paradigms and their Classifiers
An Overview of Supervised Machine Learning Paradigms and their Classifiers
 
A machine learning model for predicting innovation effort of firms
A machine learning model for predicting innovation effort of  firmsA machine learning model for predicting innovation effort of  firms
A machine learning model for predicting innovation effort of firms
 
introduction to machine learning
introduction to machine learningintroduction to machine learning
introduction to machine learning
 
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
IRJET - Comparative Analysis of GUI based Prediction of Parkinson Disease usi...
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
Supervised learning techniques and applications
Supervised learning techniques and applicationsSupervised learning techniques and applications
Supervised learning techniques and applications
 
Machine Learning Basics
Machine Learning BasicsMachine Learning Basics
Machine Learning Basics
 
Internshipppt.pptx
Internshipppt.pptxInternshipppt.pptx
Internshipppt.pptx
 
Regression and Artificial Neural Network in R
Regression and Artificial Neural Network in RRegression and Artificial Neural Network in R
Regression and Artificial Neural Network in R
 
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
IRJET- Prediction of Crime Rate Analysis using Supervised Classification Mach...
 
Ew36913917
Ew36913917Ew36913917
Ew36913917
 
Machine Learning - Deep Learning
Machine Learning - Deep LearningMachine Learning - Deep Learning
Machine Learning - Deep Learning
 
Lecture-6-7.pptx
Lecture-6-7.pptxLecture-6-7.pptx
Lecture-6-7.pptx
 

Más de Dr. Abdul Ahad Abro

Artificial intelligence - AI Complete Concept
Artificial intelligence - AI Complete ConceptArtificial intelligence - AI Complete Concept
Artificial intelligence - AI Complete ConceptDr. Abdul Ahad Abro
 
Edge Coloring & K-tuple coloring
Edge Coloring & K-tuple coloringEdge Coloring & K-tuple coloring
Edge Coloring & K-tuple coloringDr. Abdul Ahad Abro
 
Shortest-Path Problems - Graph Theory in Computer Applications
Shortest-Path Problems - Graph Theory in Computer ApplicationsShortest-Path Problems - Graph Theory in Computer Applications
Shortest-Path Problems - Graph Theory in Computer ApplicationsDr. Abdul Ahad Abro
 
Connectivity - Graph Theory in Computer Applications
Connectivity - Graph Theory in Computer ApplicationsConnectivity - Graph Theory in Computer Applications
Connectivity - Graph Theory in Computer ApplicationsDr. Abdul Ahad Abro
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationDr. Abdul Ahad Abro
 
Expert System - Artificial intelligence
Expert System - Artificial intelligenceExpert System - Artificial intelligence
Expert System - Artificial intelligenceDr. Abdul Ahad Abro
 

Más de Dr. Abdul Ahad Abro (10)

DBMS & RDBMS
DBMS & RDBMSDBMS & RDBMS
DBMS & RDBMS
 
Outlier Detection
Outlier DetectionOutlier Detection
Outlier Detection
 
AI vs Human
AI vs HumanAI vs Human
AI vs Human
 
Artificial intelligence - AI Complete Concept
Artificial intelligence - AI Complete ConceptArtificial intelligence - AI Complete Concept
Artificial intelligence - AI Complete Concept
 
Edge Coloring & K-tuple coloring
Edge Coloring & K-tuple coloringEdge Coloring & K-tuple coloring
Edge Coloring & K-tuple coloring
 
Graph Coloring
Graph ColoringGraph Coloring
Graph Coloring
 
Shortest-Path Problems - Graph Theory in Computer Applications
Shortest-Path Problems - Graph Theory in Computer ApplicationsShortest-Path Problems - Graph Theory in Computer Applications
Shortest-Path Problems - Graph Theory in Computer Applications
 
Connectivity - Graph Theory in Computer Applications
Connectivity - Graph Theory in Computer ApplicationsConnectivity - Graph Theory in Computer Applications
Connectivity - Graph Theory in Computer Applications
 
Data mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, ClassificationData mining , Knowledge Discovery Process, Classification
Data mining , Knowledge Discovery Process, Classification
 
Expert System - Artificial intelligence
Expert System - Artificial intelligenceExpert System - Artificial intelligence
Expert System - Artificial intelligence
 

Último

Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterMateoGardella
 

Último (20)

Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 

Regression with Microsoft Azure & Ms Excel

  • 1. PRESENT BY: ABDUL AHAD ABRO 1 Data Science & Predictive Analytics Computer Engineering Department, Ege University, Turkey Presentations 2 Veri Bilimi ve Tahmin Edici Analizler May 16-2017
  • 2.  Veri Bilimi ( Data Science )  Tahmin Edici Analizler ( Predictive Analytics )  Makine öğrenme ( Machine Learning )  Makine Öğrenme Algoritması ( Machine Learning Algorithm )  Regression & Classification  Microsoft Azure Machine Learning Studio  Regression with Microsoft Excel  Academic Studies / Articles / Publications Agenda 2
  • 3. Or Data Science is an umbrella that contain many other fields like Machine learning, Data Mining, big Data, statistics, Data visualization and data analytics. What is Data Science ? Veri Bilimi nedir Data science also known as data-driven science, is an interdisciplinary field about scientific methods, processes and systems to extract knowledge from data in various forms, either structured or unstructured, similar to Knowledge Discovery in Databases (KDD) [4]. 3
  • 4. Predictive Analytics Tahmin Edici Analizler Predictive analytics is the branch of the advanced analytics which is used to make predictions about unknown future events. Predictive analytics uses many techniques from data mining, statistics, modeling, machine learning, and artificial intelligence to analyze current data to make predictions about future. It uses a number of data mining, and analytical techniques to bring together the management, information technology, and modeling business process to make predictions about future. The patterns found in historical and transactional data can be used to identify risks and opportunities for future. 4
  • 6. Machine Learning Makine öğrenme A branch of artificial intelligence, concerned with the design and development of algorithms that allow computers to evolve behaviors based on empirical data. Construction and study of systems that can learn from data. 6
  • 7. Machine Learning Algorithm Supervised learning --- Predicting the Future.. Learn from the past example to predict future. Unsupervised learning --- Understanding the past Making Sense of Data. Learning Structure of Data Compressing data for consumption. Semi-supervised learning --- A mix of supervised and unsupervised learning. Reinforcement learning --- Allows the machine or software agent to learn its behavior based on feedback from the environment. This behavior can be learnt once and for all, or keep on adapting as time goes by. 7
  • 8. 8
  • 9. Regression:: In supervised learning target variables in regression must be continuous. Categorical target variables are modelled in classification. Regression has less or even no emphasis on using probability to describe the random variation between the predictor and the target. Regression is used to predict continuous values. 9 Classification:: Classification is used to predict which class a data point is part of (discrete value). The Training data (observations, measurements, etc.) are accompanied by labels indicating the class of observations. New data is classified based on the training set.
  • 10. 10 Abstract The use of the term Data Science is becoming increasingly common. The Machine learning specifically supervised learning, classification and regression process state with Microsoft Azure Machine learning studio and resultant compare with Microsoft excel classification and regression for sophistication and accuracy of result. In this mean process classification and linear regression result in both cases consider probably same as in equally data set utilized..
  • 11. 11 Keywords “Data Science”, “Machine Learning”, “Predictive Analysis”, “Future Predict”, “Data Mining”.
  • 12. 12 Introduction Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights from data in various forms. Data Science is not just Machine Learning. Data science is a concept to unify statistics, data analysis and their related methods in order to understand and analyze actual phenomena with data.
  • 13. 13 Microsoft Azure Machine Learning Studio Model of Linear Regression
  • 14. 14 Microsoft Azure Machine Learning Studio Result of Linear Regression
  • 15. 15 Methodology Supervised machine learning methods for regression and classification have been designed mostly for data types that lie in vector spaces i.e. response (i.e. output) and/or predictor (i.e. input) variables are often arranged into vectors of predefined dimensionality [2]. In this methodology the machine learning specifically classification and linear regression have used. For sake of that research methodology data sets is required for performing the classification and linear regression methods. Many data sets available online resources but the most sophisticated, accurate and authentic data sets utilized in this research which got from Microsoft Azure Machine learning studio data sets. The data sets named as Automobile with multivariate data type includes 205 Instances with 26 attributes. In this data sets various specification of have discussed regarding automobiles like fuel type, aspiration, number of doors, engine-location, length, width, height, horsepower, city-mpg, highway- mpg, peak-rpm, price and so on.
  • 16. 16 Methodology (continue) The linear regression and classification process take place over Microsoft azure machine learning studio, over attributes which is based on dependent and independent variables of data sets where built in saved data sets, trained model, machine learning, score model and evaluation model are present for ready to use. Moreover for perfection and comparing the result of Microsoft azure machine learning studio, Microsoft excel program has used for linear regression by utilizing the same dataset and comparing the final result.
  • 17. 17 Microsoft Excel Dataset (Attributes & Instances) Linear Regression
  • 18. Academic Studies 18 Regression and classification using extreme learning machine based on L1-norm and L2-norm. (Least Absolute Deviations and Least Squares) Extreme learning machine (ELM) is a very simple machine learning algorithm and it can achieve a good generalization performance with extremely fast speed. Therefore it has practical significance for data analysis in real-world application. At the information stage, there has been a growing interest in the study of data analysis techniques. Techniques of data analysis can extract previously unknown, hidden, but potentially useful information and knowledge from original data, which is helpful to provide suggestions or decisions for future actions . Data analysis plays a huge guidance role for making future plans in practical applications. In this paper, a novel algorithm called L1–L2-ELM was proposed as an effective technology in data analysis. It can deal with multiple- output regression and multiple-class classification problems in a unified framework. Xiong Luo, Xiaohui Chang, Xiaojuan Ban
  • 19. Academic Studies 19 Minimal Learning Machine: A novel supervised distance-based approach for Regression and classification. Supervised machine learning methods for regression and classification have been designed mostly for data types that lie in vector spaces, i.e. response (i.e output) and/or predictor (i.e. input)variables are often arranged in to vectors of predefined dimensionality. There are other types of data, however such as graphs, sequences, shapes, images, trees and covariance matrices, which are less amenable to being treated within standard regression/classification frameworks. These data types usually do not lie in a natural vector space, but rather in a metric space. For regression tasks, there are some prior works in which the response and/or the predictor variables are expressed as distance (i.e. dissimilarity) matrices. A comprehensive set of computer experiments illustrates that the proposed method achieves accuracies that are comparable to more traditional machine learning methods for regression and classification thus offering a computationally valid alternative to such approaches. Amauri Holanda de Souza Júnior, Francesco Corona, Guilherme A. Barreto b,Yoan Miche, Amaury Lendasse
  • 20. Python Code 20 # Required Packages import matplotlib.pyplot as plt import numpy as np import pandas as pd from sklearn import datasets, linear_model # Function to get data def get_data(file_name): data = pd.read_csv(file_name) x_parameter = [] y_parameter = [] for single_square_feet ,single_price_value in zip(data['square_feet'],data['price']): x_parameter.append([float(single_square_feet)]) y_parameter.append(float(single_price_value)) return x_parameter,y_parameter x,y = get_data('input_data.csv') print x print y
  • 21. Python Code 21 # Function for Fitting data to Linear model def linear_model_main(X_parameters,Y_parameters,predict_ value): # Create linear regression object regr = linear_model.LinearRegression() regr.fit(X_parameters, Y_parameters) predict_outcome = regr.predict(predict_value) predictions = {} predictions['intercept'] = regr.intercept_ predictions['coefficient'] = regr.coef_ predictions['predicted_value'] = predict_outcome return predictions x,y = get_data('input_data.csv') predict_value = 700 result = linear_model_main(x,y,predict_value) print "Intercept value " , result['intercept'] print "coefficient" , result['coefficient'] print "Predicted value: ",result['predicted_value'] # Function to show the resutls of linear fit model def show_linear_line(X_parameters,Y_parameters): # Create linear regression object regr = linear_model.LinearRegression() regr.fit(X_parameters, Y_parameters) plt.scatter(X_parameters,Y_parameters,color='blue') plt.plot(X_parameters,regr.predict(X_parameters),color='red',li newidth=4) plt.xticks(()) plt.yticks(()) plt.show() show_linear_line(X,Y)
  • 22. References [1] Xiong Luo, Xiaohui Chang, Xiaojuan Ban (2015). Regression and classification using extreme learning machine based on L1-norm and L2-norm. (Least Absolute Deviations and Least Squares). [2] Amauri Holanda de Souza Júnior, Francesco Corona, Guilherme A. Barreto b,Yoan Miche, Amaury Lendasse (2015). Minimal Learning Machine: A novel supervised distance-based approach for regression and classification. [3] Sumit Mund (2015). Microsoft Azure Machine Learning. [4] https://en.wikipedia.org/wiki/Data_science 22
  • 23. 23
  • 24. 24