SlideShare una empresa de Scribd logo
1 de 23
Python and Data AnalyticsPython and Data Analytics
•Understand the problem By Understanding the Data
•Predictive Model Building: Balancing Performance, Complexity,
and theBig Data
Machine learningMachine learning
machine learning explores the study and
construction of algorithms that can learn
from and make predictions on data.
Predictive model buildingPredictive model building
The process of building a predictive model is called
training.
 Attributes: the variables being used to make predictions is known as:
◦ Predictors.
◦ Features
◦ Independent variables
◦ Input
 Labels are also known as,
◦ Outcomes
◦ Targets
◦ Dependent variables
◦ Responses
A machine learning project may not be
linear, but it has a number of well known
steps:
Define Problem.
Prepare Data.
Evaluate Algorithms.
Improve Results.
Present Results.
the iris dataset has followingthe iris dataset has following
structurestructure
Attributes are numeric so you have to figure out
how to load and handle data.
It is a classification problem, allowing you to
practice with perhaps an easier type of supervised
learning algorithm.
It is a multi-class classification problem (multi-
nominal) that may require some specialized
handling.
It only has 4 attributes and 150 rows, meaning it is
small and easily fits into memory.
All of the numeric attributes are in the same units
and the same scale, not requiring any special scaling
or transforms to get started.
Machine Learning in Python:Machine Learning in Python:
Step-By-StepStep-By-Step
Installing the Python and SciPy
platform.
Loading the dataset.
Summarizing the dataset.
Visualizing the dataset.
Evaluating some algorithms.
Making some predictions.
Basic library in pythonBasic library in python
NumPy‘s array type augments the Python language
with an efficient data structure useful for numerical
work, e.g., manipulating matrices. NumPy also
provides basic numerical routines, such as tools for
finding eigenvectors.
SciPy contains additional routines needed in
scientific work: for example, routines for computing
integrals numerically, solving differential equations,
optimization, and sparse matrices.
The matplotlib module produces high quality plots.
With it you can turn your data or your models into
figures for presentations or articles. No need to do
the numerical work in one program, save the data,
and plot it with another program.
 The Pandas module is a massive collaboration of many
modules along with some unique features to make a very
powerful module.
 Pandas is great for data manipulation, data analysis, and data
visualization.
 The Pandas modules uses objects to allow for data analysis
at a fairly high performance rate in comparison to typical
Python procedures. With it, we can easily read and write
from and to CSV files, or even databases.
 From there, we can manipulate the data by columns, create
new columns, and even base the new columns on other
column data.
 The scikit library used for
 Simple and efficient tools for data mining and data analysis
 Accessible to everybody, and reusable in various contexts
 Built on NumPy, SciPy, and matplotlib
 Open source, commercially usable
NumPy: Base n-dimensional array
package
SciPy: Fundamental library for scientific
computing
Matplotlib: Comprehensive 2D/3D
plotting
IPython: Enhanced interactive console
Sympy: Symbolic mathematics
Pandas: Data structures and analysis
1. Downloading, Installing and Starting
Python SciPy
1.1 Install SciPy Libraries
There are 5 key libraries that you will need to
install. Below is a list of the Python SciPy
libraries required for this tutorial:
scipy
numpy
matplotlib
pandas
sklearn
http://machinelearningmastery.com/machi
ne-learning-in-python-step-by-step/

Más contenido relacionado

La actualidad más candente

Python Programming Language
Python Programming LanguagePython Programming Language
Python Programming Language
Laxman Puri
 
Python quick guide1
Python quick guide1Python quick guide1
Python quick guide1
Kanchilug
 

La actualidad más candente (19)

Python for the Mobile and Web
Python for the Mobile and WebPython for the Mobile and Web
Python for the Mobile and Web
 
20120314 changa-python-workshop
20120314 changa-python-workshop20120314 changa-python-workshop
20120314 changa-python-workshop
 
Python presentation
Python presentationPython presentation
Python presentation
 
Python course syllabus
Python course syllabusPython course syllabus
Python course syllabus
 
Python Programming
Python ProgrammingPython Programming
Python Programming
 
PYTHON CURRENT TREND APPLICATIONS- AN OVERVIEW
PYTHON CURRENT TREND APPLICATIONS- AN OVERVIEWPYTHON CURRENT TREND APPLICATIONS- AN OVERVIEW
PYTHON CURRENT TREND APPLICATIONS- AN OVERVIEW
 
Python Programming Language
Python Programming LanguagePython Programming Language
Python Programming Language
 
Python quick guide1
Python quick guide1Python quick guide1
Python quick guide1
 
introduction of python in data science
introduction of python in data scienceintroduction of python in data science
introduction of python in data science
 
Introduction to python
 Introduction to python Introduction to python
Introduction to python
 
Python
PythonPython
Python
 
Presentation on java
Presentation on javaPresentation on java
Presentation on java
 
Python Summer Internship
Python Summer InternshipPython Summer Internship
Python Summer Internship
 
Python Crash Course
Python Crash CoursePython Crash Course
Python Crash Course
 
Seminar report on python 3 course
Seminar report on python 3 courseSeminar report on python 3 course
Seminar report on python 3 course
 
Introduction to python
Introduction to pythonIntroduction to python
Introduction to python
 
About Python Programming Language | Benefit of Python
About Python Programming Language | Benefit of PythonAbout Python Programming Language | Benefit of Python
About Python Programming Language | Benefit of Python
 
IRJET- Python: Simple though an Important Programming Language
IRJET- Python: Simple though an Important Programming LanguageIRJET- Python: Simple though an Important Programming Language
IRJET- Python: Simple though an Important Programming Language
 
Python Usefulness
Python UsefulnessPython Usefulness
Python Usefulness
 

Similar a Python and data analytics

Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
Simplilearn
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Rohit Dubey
 

Similar a Python and data analytics (20)

Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...Data Science With Python | Python For Data Science | Python Data Science Cour...
Data Science With Python | Python For Data Science | Python Data Science Cour...
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabsMachine learning at scale - Webinar By zekeLabs
Machine learning at scale - Webinar By zekeLabs
 
Python ml
Python mlPython ml
Python ml
 
Adarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptxAdarsh_Masekar(2GP19CS003).pptx
Adarsh_Masekar(2GP19CS003).pptx
 
Data science
Data scienceData science
Data science
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
 
Introduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-LearnIntroduction to Machine Learning with SciKit-Learn
Introduction to Machine Learning with SciKit-Learn
 
Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)Building Data Products with Python (Georgetown)
Building Data Products with Python (Georgetown)
 
Python for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive GuidePython for Data Science: A Comprehensive Guide
Python for Data Science: A Comprehensive Guide
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
 
Basic of python for data analysis
Basic of python for data analysisBasic of python for data analysis
Basic of python for data analysis
 
Real world machine learning with Java for Fumankaitori.com
Real world machine learning with Java for Fumankaitori.comReal world machine learning with Java for Fumankaitori.com
Real world machine learning with Java for Fumankaitori.com
 
Predictive Analytics Project in Automotive Industry
Predictive Analytics Project in Automotive IndustryPredictive Analytics Project in Automotive Industry
Predictive Analytics Project in Automotive Industry
 
BSSML16 L10. Summary Day 2 Sessions
BSSML16 L10. Summary Day 2 SessionsBSSML16 L10. Summary Day 2 Sessions
BSSML16 L10. Summary Day 2 Sessions
 
UNIT_5_Data Wrangling.pptx
UNIT_5_Data Wrangling.pptxUNIT_5_Data Wrangling.pptx
UNIT_5_Data Wrangling.pptx
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
 

Más de Shree M.L.Kakadiya MCA mahila college, Amreli

Más de Shree M.L.Kakadiya MCA mahila college, Amreli (20)

Machine Learning by Rj
Machine Learning by RjMachine Learning by Rj
Machine Learning by Rj
 
Listeners and filters in servlet
Listeners and filters in servletListeners and filters in servlet
Listeners and filters in servlet
 
Servlet unit 2
Servlet unit 2 Servlet unit 2
Servlet unit 2
 
Servlet by Rj
Servlet by RjServlet by Rj
Servlet by Rj
 
Networking in python by Rj
Networking in python by RjNetworking in python by Rj
Networking in python by Rj
 
Jsp in Servlet by Rj
Jsp in Servlet by RjJsp in Servlet by Rj
Jsp in Servlet by Rj
 
Motion capture by Rj
Motion capture by RjMotion capture by Rj
Motion capture by Rj
 
Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
 
Text processing by Rj
Text processing by RjText processing by Rj
Text processing by Rj
 
Multithreading by rj
Multithreading by rjMultithreading by rj
Multithreading by rj
 
Django by rj
Django by rjDjango by rj
Django by rj
 
Database programming
Database programmingDatabase programming
Database programming
 
CGI by rj
CGI by rjCGI by rj
CGI by rj
 
Adv. python regular expression by Rj
Adv. python regular expression by RjAdv. python regular expression by Rj
Adv. python regular expression by Rj
 
Seminar on Project Management by Rj
Seminar on Project Management by RjSeminar on Project Management by Rj
Seminar on Project Management by Rj
 
Spring by rj
Spring by rjSpring by rj
Spring by rj
 
Python by Rj
Python by RjPython by Rj
Python by Rj
 
Leadership & Motivation
Leadership & MotivationLeadership & Motivation
Leadership & Motivation
 
Event handling
Event handlingEvent handling
Event handling
 
Layout manager
Layout managerLayout manager
Layout manager
 

Último

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 

Python and data analytics

  • 1. Python and Data AnalyticsPython and Data Analytics •Understand the problem By Understanding the Data •Predictive Model Building: Balancing Performance, Complexity, and theBig Data
  • 2. Machine learningMachine learning machine learning explores the study and construction of algorithms that can learn from and make predictions on data.
  • 3.
  • 4. Predictive model buildingPredictive model building The process of building a predictive model is called training.  Attributes: the variables being used to make predictions is known as: ◦ Predictors. ◦ Features ◦ Independent variables ◦ Input  Labels are also known as, ◦ Outcomes ◦ Targets ◦ Dependent variables ◦ Responses
  • 5. A machine learning project may not be linear, but it has a number of well known steps: Define Problem. Prepare Data. Evaluate Algorithms. Improve Results. Present Results.
  • 6. the iris dataset has followingthe iris dataset has following structurestructure Attributes are numeric so you have to figure out how to load and handle data. It is a classification problem, allowing you to practice with perhaps an easier type of supervised learning algorithm. It is a multi-class classification problem (multi- nominal) that may require some specialized handling. It only has 4 attributes and 150 rows, meaning it is small and easily fits into memory. All of the numeric attributes are in the same units and the same scale, not requiring any special scaling or transforms to get started.
  • 7. Machine Learning in Python:Machine Learning in Python: Step-By-StepStep-By-Step Installing the Python and SciPy platform. Loading the dataset. Summarizing the dataset. Visualizing the dataset. Evaluating some algorithms. Making some predictions.
  • 8. Basic library in pythonBasic library in python NumPy‘s array type augments the Python language with an efficient data structure useful for numerical work, e.g., manipulating matrices. NumPy also provides basic numerical routines, such as tools for finding eigenvectors. SciPy contains additional routines needed in scientific work: for example, routines for computing integrals numerically, solving differential equations, optimization, and sparse matrices. The matplotlib module produces high quality plots. With it you can turn your data or your models into figures for presentations or articles. No need to do the numerical work in one program, save the data, and plot it with another program.
  • 9.  The Pandas module is a massive collaboration of many modules along with some unique features to make a very powerful module.  Pandas is great for data manipulation, data analysis, and data visualization.  The Pandas modules uses objects to allow for data analysis at a fairly high performance rate in comparison to typical Python procedures. With it, we can easily read and write from and to CSV files, or even databases.  From there, we can manipulate the data by columns, create new columns, and even base the new columns on other column data.  The scikit library used for  Simple and efficient tools for data mining and data analysis  Accessible to everybody, and reusable in various contexts  Built on NumPy, SciPy, and matplotlib  Open source, commercially usable
  • 10. NumPy: Base n-dimensional array package SciPy: Fundamental library for scientific computing Matplotlib: Comprehensive 2D/3D plotting IPython: Enhanced interactive console Sympy: Symbolic mathematics Pandas: Data structures and analysis
  • 11. 1. Downloading, Installing and Starting Python SciPy 1.1 Install SciPy Libraries There are 5 key libraries that you will need to install. Below is a list of the Python SciPy libraries required for this tutorial: scipy numpy matplotlib pandas sklearn
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.