SlideShare una empresa de Scribd logo
1 de 44
AnIntroductionto
DATASCIENCE
By: Shweta Saraswat
01
Definitions and Examples
02
What exactly data science and
data scientist do?
03
Relation between artificial
ineliigence and data science
04
To become a data scientist one
should know the various
techniques.
TABLEOFCONTENTS
DataScience&Importance DataScience Process
AIandDataScience Prerequisites forDS
WhatisDataScience?
Data science is an
interdisciplinary field that uses
algorithms, procedures, and
processes to examine large
amounts of data in order to
uncover hidden patterns,
generate insights, and direct
decision making.
Importanceof
DataScience
01
Career Opportunities
"The rise of Data Science needs will create roughly 11.5 million job openings by
2026" US Bureau of Labour Statistics
"By 2026, Data Scientists and Analysts will become the number one emerging
role in the world." World Economic Forum
Data Science and Artificial Intelligence are amongst the hottest fields of the
21st century that will impactall segments of daily life by 2025, from transport
and logistics to healthcare and customer service.
Examples:
Oil giant Shell, for instance, used data science to anticipate machine failure at
facilities across the world.
Agricultural company Cargill developed a mobile data-tracking app that helps
shrimp farmers reduce mortality rates.
Dr. Pepper Snapple Group analyzed data with machine learning to glean more
details about beverage sales and vendors.
And freight company Pitt Ohio used historical data and predictive analytics to
estimate delivery time with 99 percent accuracy.
Facts on Data Generation
Facts on Data Genaration
Statistics show that more than 500 terabytes of new data are
entered into the databases of the social networking site
Facebook every day.
•A single Jet engine can generate over 10 terabytes of data in
30 minutes of flight time. With several thousand flights per day,
data generation reaches several petabytes.
•Stock Exchange is also an example of big data that generates
about a terabyte of new trade data per day
HowdoesData
ScienceWork?
02
CollectData
Raw data is gathered
from various sources that
explain the business
problem
Using various statistical
analysis, and machine
learning approaches, data
modeling is performed to
get the optimum solutions
that best explain the
business problem.
Actionable insights that
will serve as a solution for
the business problems
gathered through data
science.
How does Data Science Work?
AnalyzeData Insights
CollectData
Gather the previous data
on the sales that were
closed.
Use statistical analysis to
find out the patterns that
were followed by the leads
that were closed.
Use machine learning to
get actionable insights for
finding out potential leads.
Consider an Example!
AnalyzeData Insights
Suppose there is an organization that is working
towards finding out potential leads for their sales
team. They can follow the following approach to
get an optimal solution using Data Science:
Letscheckrelationship betweenAIandDataScience
“In above example we saw machine
learning is required for insights”
AIandDataScience
03
Data science and artificial intelligence are not the same.
“Data science and artificial intelligence are two technologies
that are transforming the world. While artificial intelligence powers data
science operations, data science is not completely dependent on AI.
Data Science is leading the fourth industrial revolution. ”
Datascience alsorequires machinelearningalgorithms, which results in
dependencyonAI.
Comparison BetweenAIandDataScience
• Data science jobs require the knowledge of ML languages
like R and Python to perform various data operations and
computer science expertise.
• Data science uses more tools apart from AI. This is because
data science involves multiples steps to analyze data and
generate insights.
• Data science models are built for statistical insights
whereas AI is used to build models that mimic cognition and
human understanding.
Comparison BetweenAIandDataScience
• Today’s industries require both, data science and
artificial intelligence. Data science will help them
make necessary data-driven decisions and assess
their performance in the market, while artificial
intelligence will help industries work with
smarter devices and software that will minimize
workload and optimize all the processes for
improves innovation.
Comparison BetweenAIandDataScience
PrerequisitesforData
Science
04
Computer Science MachineLearning
Statistics
DataScienceisasciencewhichuses:
Visualization
To collect ,clean, Integrate, analyze, visualize, interact with
data to create data products.
HumanComputer Interaction
RoadmapofDataScience
Commonly used
languages are python
and R
Involves charts,graphs etc.
We have already two libraries
in Python for this i.e Seaborn,
matplotlib
Like mean, median ,
mode, standard
deviation etc.
Know about basic algorithms
of ML like Linear regression,
Logistic regression, Decision
tree, SVM algorithm, KNN
algorithm , Random forest
algorithm.
1.LearnProgramming
Language
3.DataVisualization
2.Mathematics &Statistics
4.MachineLearning
5.Project
Try to make projects with
the help of Kaggle
Python:
• Python is also most popular programming language among data scientists
these days.
• Python is very versatile and can be used in almost all the processes in Data
Science.
• Be it data mining or running embedded systems, python can do everything, and
because of this, 40% of the people that participated in a survey by O’Reilly
said that they used Python most often.
• Pandas, a python library, is used for data analysis and can do anything from
plotting data with histograms, to importing data from spreadsheets.
• Python can take data in various formats and import SQL tables to your code
easily.
• The python packages you need to master are Numpy, Matplotlib, PyTorch,
Pandas, Scikit-Learn, and Seaborn in Python.
RprogrammingLanguage
•Following Python, the next skill in the list was R Programming, mentioned in 32% of the job
postings. R is a language specifically designed for Data Science.
• It can be used to solve any Data Science related problem that you might encounter.
•It is the most popular language among Data Scientists.
•Infact, 43% of data scientists prefer to use R for solving statistical problems.
• It is one of the most important Data Science Prerequisites. However, the learning curve is steep. It
is difficult to master, especially if you already have an expertise in any other programming language.
•R can implement ML algorithms to give us a vast variety of statistical and graphical techniques like
time-series analysis, clustering, classical statistical tests etc.
•It is used for calculations and data manipulation. Tidyverse, Ggplot2, Stringr, Dplyr and Caret are
some of the things to master in R.
Mathematics and Statistics
Mathematics is one of the very popular Prerequisites for Data Science.
Probability and Statistics are used for data imputation, visualization of
features, feature transformation, model evaluation, dimensionality reduction,
feature engineering and data preprocessing.
Multivariable Calculus is used to build Machine Learning Models. For Model
Evaluation, Data Preprocessing and Data Transformation, we use linear
Algebra. A matrix is used to represent a Data set.
Mathematics and Statistics
There is no defined syllabus for what you need to learn in Mathematics and Statistics but
here are a few topics you should be familiar with –
•Mean, Median, Mode, Variance, Standard Deviation, and Percentiles
•Bayes Theorem And Probability Distribution (Normal, Poisson, and Binomial)
•Covariance Matrix and Correlation Coefficient
•Mean Square Error and R2 Score
•Statistical tests like p-value, hypothesis Testing, and chi-square
•Multivariate Functions, Cost Functions and Maxima and Minima of a Function
•Step Function, Sigmoid Function, Logit function and Rectified Linear Unit
•Vectors and Matrices; Transpose, Inverse, and Determinant of a matrix
•Dot Product, Cross Product, Eigenvalues, and Eigenvectors
Data Visualization
Data Visualization is a very important Prerequisite for Data Science. In
simple words, data visualization is a representation of data visually, through
graphs and charts. A data scientist should be able to represent data
graphically, using charts, graphs, maps, etc.
Data Visualization
There are multiple components in a good data visualization –
•Data Component – The first step in visualizing data is understanding the type of data, for example, it
could be continuous data, discrete data, or categorical data.
•Geometric Component – This means deciding what kind of visualization will best suit your data-
histograms, bar plots, heatmaps, scatter plots, pair plots, line graphs, etc.
•Mapping Component – In this component, you decide what variable you should use as the x-variable
(independent variable) and y-variable (dependent variable). This is especially important for multi-
dimensional datasets.
•Labels Component – This has the axes labels, legends, titles, font size, etc.
•Scale Component – Here you decide which scale you will be using- log scale, linear scale, etc.
•Ethical Component – This is to make sure that the visualization you have done, tells the true story,
and doesn’t mislead the audience.
Machine Learning and Artificial Intelligence –
•ML helps in analyzing large amounts of data using algorithms. Using Machine Learning, major
parts of a data scientist’s jobs can be automated.
•Only a small percentage of Data Scientists are proficient with advanced machine learning
techniques like adversarial learning, neural networks, reinforcement learning, Outlier Detection,
Time Series etc.
•The most skilled data scientists are highly familiar with advanced machine learning techniques
such as recommendation engines and Natural Language Processing.
•If you want to stand out from the crowd and be one at the top tier, knowledge of machine
learning techniques such as logistic regression, supervised machine learning, decision trees,
Survival Analysis, Computer Vision, etc., is a must.
Start making small Data Science projects
with the help of Kaggle
InsideKaggleyou’llfindallthecode&datayouneedto
doyourdatasciencework.
Useover50,000publicdatasetsand400,000
publicnotebookstoconqueranyanalysisinnotime.
Introduction to Data Science.pptx

Más contenido relacionado

La actualidad más candente

Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Edureka!
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecasesSreenatha Reddy K R
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Venkata Reddy Konasani
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Simplilearn
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data ScienceSpotle.ai
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceEdureka!
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientistVijayMohan Vasu
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data sciencebhavesh lande
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analyticsSSaudia
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with PythonDavis David
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine LearningKnoldus Inc.
 

La actualidad más candente (20)

Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Data science
Data scienceData science
Data science
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Data science
Data science Data science
Data science
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data analytics
Data analyticsData analytics
Data analytics
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
 
Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Exploratory data analysis with Python
Exploratory data analysis with PythonExploratory data analysis with Python
Exploratory data analysis with Python
 
Feature Engineering in Machine Learning
Feature Engineering in Machine LearningFeature Engineering in Machine Learning
Feature Engineering in Machine Learning
 
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
 

Similar a Introduction to Data Science.pptx

Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute PoojaPatidar11
 
Data Analytics & Visualization (Introduction)
Data Analytics & Visualization (Introduction)Data Analytics & Visualization (Introduction)
Data Analytics & Visualization (Introduction)Dolapo Amusat
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxsumitkumar600840
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career pathRubikal
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Rohit Dubey
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfData Science Council of America
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxShanmugasundaram M
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorialAakashdata
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAjaved75
 
Top Rated Dissertation Data Analysis Services | PhD Assistance
Top Rated Dissertation Data Analysis Services | PhD AssistanceTop Rated Dissertation Data Analysis Services | PhD Assistance
Top Rated Dissertation Data Analysis Services | PhD AssistancePHDAssistance2
 
Real-Time Data Analytics Examples
Real-Time Data Analytics ExamplesReal-Time Data Analytics Examples
Real-Time Data Analytics ExamplesPHDAssistance2
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceSrishti44
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?Aspire Techsoft Academy
 
What is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfWhat is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfRoshni Sharma
 
KIT-601 Lecture Notes-UNIT-1.pdf
KIT-601 Lecture Notes-UNIT-1.pdfKIT-601 Lecture Notes-UNIT-1.pdf
KIT-601 Lecture Notes-UNIT-1.pdfDr. Radhey Shyam
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data ScientistRohit Dubey
 

Similar a Introduction to Data Science.pptx (20)

Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute
 
DataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdfDataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdf
 
Data Analytics & Visualization (Introduction)
Data Analytics & Visualization (Introduction)Data Analytics & Visualization (Introduction)
Data Analytics & Visualization (Introduction)
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
Data Analytics Career Paths
Data Analytics Career PathsData Analytics Career Paths
Data Analytics Career Paths
 
Data analytics career path
Data analytics career pathData analytics career path
Data analytics career path
 
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
Data Science Job ready #DataScienceInterview Question and Answers 2022 | #Dat...
 
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdfThe Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
The Simple 5-Step Process for Creating a Winning Data Pipeline.pdf
 
Untitled document.pdf
Untitled document.pdfUntitled document.pdf
Untitled document.pdf
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorial
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATA
 
Top Rated Dissertation Data Analysis Services | PhD Assistance
Top Rated Dissertation Data Analysis Services | PhD AssistanceTop Rated Dissertation Data Analysis Services | PhD Assistance
Top Rated Dissertation Data Analysis Services | PhD Assistance
 
Real-Time Data Analytics Examples
Real-Time Data Analytics ExamplesReal-Time Data Analytics Examples
Real-Time Data Analytics Examples
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?
 
What is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdfWhat is the difference between Data Science and Data Analytics.pdf
What is the difference between Data Science and Data Analytics.pdf
 
KIT-601 Lecture Notes-UNIT-1.pdf
KIT-601 Lecture Notes-UNIT-1.pdfKIT-601 Lecture Notes-UNIT-1.pdf
KIT-601 Lecture Notes-UNIT-1.pdf
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
 

Más de Vrishit Saraswat (20)

Artificial intelligence-in-radiology
Artificial intelligence-in-radiologyArtificial intelligence-in-radiology
Artificial intelligence-in-radiology
 
The thyroid gland
The thyroid gland The thyroid gland
The thyroid gland
 
The scrotum
The scrotumThe scrotum
The scrotum
 
The peripheral veins
The peripheral veinsThe peripheral veins
The peripheral veins
 
The peripheral arteries
The peripheral arteries   The peripheral arteries
The peripheral arteries
 
Nuchal translucency
Nuchal translucencyNuchal translucency
Nuchal translucency
 
Ist trimester ultrasound
Ist trimester ultrasoundIst trimester ultrasound
Ist trimester ultrasound
 
Head trauma
Head traumaHead trauma
Head trauma
 
Follicular study
Follicular studyFollicular study
Follicular study
 
Fetal urogenital usg
Fetal urogenital usgFetal urogenital usg
Fetal urogenital usg
 
Fetal head & neck usg
Fetal head & neck usgFetal head & neck usg
Fetal head & neck usg
 
Fetal brain usg 3
Fetal brain usg  3Fetal brain usg  3
Fetal brain usg 3
 
Fetal brain usg 2
Fetal brain usg 2Fetal brain usg 2
Fetal brain usg 2
 
Fetal brain usg 1
Fetal brain usg   1Fetal brain usg   1
Fetal brain usg 1
 
The fetal musculoskeletal system
The fetal musculoskeletal systemThe fetal musculoskeletal system
The fetal musculoskeletal system
 
The fetal heart
The fetal heartThe fetal heart
The fetal heart
 
Role of color doppler and mr breast in
Role of color doppler and mr breast inRole of color doppler and mr breast in
Role of color doppler and mr breast in
 
Patttern of lung disease on chest x ray
Patttern of lung disease on   chest x rayPatttern of lung disease on   chest x ray
Patttern of lung disease on chest x ray
 
NSIP
NSIPNSIP
NSIP
 
Ivp in urinary tract diseases
Ivp in urinary tract diseasesIvp in urinary tract diseases
Ivp in urinary tract diseases
 

Último

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 

Último (20)

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 

Introduction to Data Science.pptx

  • 2. 01 Definitions and Examples 02 What exactly data science and data scientist do? 03 Relation between artificial ineliigence and data science 04 To become a data scientist one should know the various techniques. TABLEOFCONTENTS DataScience&Importance DataScience Process AIandDataScience Prerequisites forDS
  • 3. WhatisDataScience? Data science is an interdisciplinary field that uses algorithms, procedures, and processes to examine large amounts of data in order to uncover hidden patterns, generate insights, and direct decision making.
  • 5. Career Opportunities "The rise of Data Science needs will create roughly 11.5 million job openings by 2026" US Bureau of Labour Statistics "By 2026, Data Scientists and Analysts will become the number one emerging role in the world." World Economic Forum Data Science and Artificial Intelligence are amongst the hottest fields of the 21st century that will impactall segments of daily life by 2025, from transport and logistics to healthcare and customer service.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Examples: Oil giant Shell, for instance, used data science to anticipate machine failure at facilities across the world. Agricultural company Cargill developed a mobile data-tracking app that helps shrimp farmers reduce mortality rates. Dr. Pepper Snapple Group analyzed data with machine learning to glean more details about beverage sales and vendors. And freight company Pitt Ohio used historical data and predictive analytics to estimate delivery time with 99 percent accuracy.
  • 21. Facts on Data Generation
  • 22. Facts on Data Genaration Statistics show that more than 500 terabytes of new data are entered into the databases of the social networking site Facebook every day. •A single Jet engine can generate over 10 terabytes of data in 30 minutes of flight time. With several thousand flights per day, data generation reaches several petabytes. •Stock Exchange is also an example of big data that generates about a terabyte of new trade data per day
  • 24. CollectData Raw data is gathered from various sources that explain the business problem Using various statistical analysis, and machine learning approaches, data modeling is performed to get the optimum solutions that best explain the business problem. Actionable insights that will serve as a solution for the business problems gathered through data science. How does Data Science Work? AnalyzeData Insights
  • 25. CollectData Gather the previous data on the sales that were closed. Use statistical analysis to find out the patterns that were followed by the leads that were closed. Use machine learning to get actionable insights for finding out potential leads. Consider an Example! AnalyzeData Insights Suppose there is an organization that is working towards finding out potential leads for their sales team. They can follow the following approach to get an optimal solution using Data Science:
  • 26. Letscheckrelationship betweenAIandDataScience “In above example we saw machine learning is required for insights”
  • 28. Data science and artificial intelligence are not the same. “Data science and artificial intelligence are two technologies that are transforming the world. While artificial intelligence powers data science operations, data science is not completely dependent on AI. Data Science is leading the fourth industrial revolution. ”
  • 29. Datascience alsorequires machinelearningalgorithms, which results in dependencyonAI.
  • 30. Comparison BetweenAIandDataScience • Data science jobs require the knowledge of ML languages like R and Python to perform various data operations and computer science expertise. • Data science uses more tools apart from AI. This is because data science involves multiples steps to analyze data and generate insights. • Data science models are built for statistical insights whereas AI is used to build models that mimic cognition and human understanding.
  • 31. Comparison BetweenAIandDataScience • Today’s industries require both, data science and artificial intelligence. Data science will help them make necessary data-driven decisions and assess their performance in the market, while artificial intelligence will help industries work with smarter devices and software that will minimize workload and optimize all the processes for improves innovation.
  • 34. Computer Science MachineLearning Statistics DataScienceisasciencewhichuses: Visualization To collect ,clean, Integrate, analyze, visualize, interact with data to create data products. HumanComputer Interaction
  • 35. RoadmapofDataScience Commonly used languages are python and R Involves charts,graphs etc. We have already two libraries in Python for this i.e Seaborn, matplotlib Like mean, median , mode, standard deviation etc. Know about basic algorithms of ML like Linear regression, Logistic regression, Decision tree, SVM algorithm, KNN algorithm , Random forest algorithm. 1.LearnProgramming Language 3.DataVisualization 2.Mathematics &Statistics 4.MachineLearning 5.Project Try to make projects with the help of Kaggle
  • 36. Python: • Python is also most popular programming language among data scientists these days. • Python is very versatile and can be used in almost all the processes in Data Science. • Be it data mining or running embedded systems, python can do everything, and because of this, 40% of the people that participated in a survey by O’Reilly said that they used Python most often. • Pandas, a python library, is used for data analysis and can do anything from plotting data with histograms, to importing data from spreadsheets. • Python can take data in various formats and import SQL tables to your code easily. • The python packages you need to master are Numpy, Matplotlib, PyTorch, Pandas, Scikit-Learn, and Seaborn in Python.
  • 37. RprogrammingLanguage •Following Python, the next skill in the list was R Programming, mentioned in 32% of the job postings. R is a language specifically designed for Data Science. • It can be used to solve any Data Science related problem that you might encounter. •It is the most popular language among Data Scientists. •Infact, 43% of data scientists prefer to use R for solving statistical problems. • It is one of the most important Data Science Prerequisites. However, the learning curve is steep. It is difficult to master, especially if you already have an expertise in any other programming language. •R can implement ML algorithms to give us a vast variety of statistical and graphical techniques like time-series analysis, clustering, classical statistical tests etc. •It is used for calculations and data manipulation. Tidyverse, Ggplot2, Stringr, Dplyr and Caret are some of the things to master in R.
  • 38. Mathematics and Statistics Mathematics is one of the very popular Prerequisites for Data Science. Probability and Statistics are used for data imputation, visualization of features, feature transformation, model evaluation, dimensionality reduction, feature engineering and data preprocessing. Multivariable Calculus is used to build Machine Learning Models. For Model Evaluation, Data Preprocessing and Data Transformation, we use linear Algebra. A matrix is used to represent a Data set.
  • 39. Mathematics and Statistics There is no defined syllabus for what you need to learn in Mathematics and Statistics but here are a few topics you should be familiar with – •Mean, Median, Mode, Variance, Standard Deviation, and Percentiles •Bayes Theorem And Probability Distribution (Normal, Poisson, and Binomial) •Covariance Matrix and Correlation Coefficient •Mean Square Error and R2 Score •Statistical tests like p-value, hypothesis Testing, and chi-square •Multivariate Functions, Cost Functions and Maxima and Minima of a Function •Step Function, Sigmoid Function, Logit function and Rectified Linear Unit •Vectors and Matrices; Transpose, Inverse, and Determinant of a matrix •Dot Product, Cross Product, Eigenvalues, and Eigenvectors
  • 40. Data Visualization Data Visualization is a very important Prerequisite for Data Science. In simple words, data visualization is a representation of data visually, through graphs and charts. A data scientist should be able to represent data graphically, using charts, graphs, maps, etc.
  • 41. Data Visualization There are multiple components in a good data visualization – •Data Component – The first step in visualizing data is understanding the type of data, for example, it could be continuous data, discrete data, or categorical data. •Geometric Component – This means deciding what kind of visualization will best suit your data- histograms, bar plots, heatmaps, scatter plots, pair plots, line graphs, etc. •Mapping Component – In this component, you decide what variable you should use as the x-variable (independent variable) and y-variable (dependent variable). This is especially important for multi- dimensional datasets. •Labels Component – This has the axes labels, legends, titles, font size, etc. •Scale Component – Here you decide which scale you will be using- log scale, linear scale, etc. •Ethical Component – This is to make sure that the visualization you have done, tells the true story, and doesn’t mislead the audience.
  • 42. Machine Learning and Artificial Intelligence – •ML helps in analyzing large amounts of data using algorithms. Using Machine Learning, major parts of a data scientist’s jobs can be automated. •Only a small percentage of Data Scientists are proficient with advanced machine learning techniques like adversarial learning, neural networks, reinforcement learning, Outlier Detection, Time Series etc. •The most skilled data scientists are highly familiar with advanced machine learning techniques such as recommendation engines and Natural Language Processing. •If you want to stand out from the crowd and be one at the top tier, knowledge of machine learning techniques such as logistic regression, supervised machine learning, decision trees, Survival Analysis, Computer Vision, etc., is a must.
  • 43. Start making small Data Science projects with the help of Kaggle InsideKaggleyou’llfindallthecode&datayouneedto doyourdatasciencework. Useover50,000publicdatasetsand400,000 publicnotebookstoconqueranyanalysisinnotime.