SlideShare una empresa de Scribd logo
1 de 37
Descargar para leer sin conexión
Webinar for Aspirants of Data Analytics
Data Analytics
Prakash Pimpale
Joint Director, C-DAC Mumbai
A premier R&D organization of the Ministry of Electronics
and Information Technology (MeitY), Govt. of India
Introduction Applications Career Prospects
Data
Data are a set of values of
qualitative or quantitative
variables about one or more
entities.
Analytics
Analytics is the systematic
computational analysis of data.
It is used for the discovery,
interpretation, and
communication of meaningful
patterns in data. It also entails
applying data patterns towards
effective decision-making.
Science
Systematic enterprise that
builds and organizes knowledge
in the form of testable
explanations and predictions
about the universe
Derived Def
• Data Analytics
• Analytics is the
systematic computational analysis
of set of values of qualitative or
quantitative variables about one or
more entities.
• Data Science
• Systematic enterprise that builds
and organizes knowledge in the
form of testable explanations and
predictions about the universe set
of values of qualitative or
quantitative variables about one or
more entities.
Data Science
Data Analytics
Data Scientist
Data Engineer
Data Wrangler
Analytics Engineer
Visualization Engineer
Data Analytics
is not 'just'
Computer Programming
Visualization
Machine Learning
Includes integrated applications of above
and lots of Domain Understanding.
Data Analytics Constitutes
Data
Existing Digitised System or Data –
eCommerce site/app transactions,
banking applications with customer
and transaction records, survey
results, etc.
Data Infrastructure
Storage, Processing and Management
– Databases, transformation tools,
bigdata infra (like Apache Spark,
Cassandra, HDFS), etc.
Data Analytics/Science Tools
Algorithms, visualizations and
programming tools – statistical
analysis, classification, prediction
algorithms, charting libraries, Python,
R, etc.
Why now
The Data
And this is just the social
applications!
There is additional data
being generated through
Enterprise and Consumer
applications!
DOMO
Why now
The Infrastructure
Why now
The Machine Learning
Mobidev
Data Analytics Pipeline
Source
1
Store and Manage
2
Pre-process
3
Explore
4
Analyse/Model/Predict/Present
5
A lot must happen prior to the real analytics
An Important Pillar
A non-Machine Learning Based Solutions
Non-ML System
Rules
Input
Output
A Developer
With package 8 lpa
ML System (Train) Rules
Input
Output
A Data Science/Analytics Developer
ML System
(Production)
New
Inputs
Output
Machine Learning Based Solutions
An Example of
Machine Learning
Analysis
Visualization
Swine flu mortality
comparison for major Indian
states for the first and second
waves
Visualization
Enron email dataset visualized
Visualization
• Cambridge Intelligence
Major Challenges
Lack of Skills how do we do all of these?
Domain Knowledge what does 'customer churn' mean, which of these products are for male and which of these for female?
Problem Formulation and choice
of techniques
which one of classification, prediction, numeric, non-numeric or just exploratory, etc.?
Cleaning of the Data what is that we don't need from this or what is misleading in it?
Integration of the data from
multiple data sources
if it's not in single place, how do we integrate these with minimum error and maximum data?
Choice of the Right Data and
Data Sources
which of the data is useful and what is the right place to get it?
Availability of Data is data for what you want to solve available?
Some more
challenges and
resolutions
More at https://analyticsindiamag.com/7-challenges-
faced-by-data-scientists-in-data-processing-in-2020/​
Applications
Data is the new Oil!
Applications
Applications
Applications
Applications
Prospect
Survey by Harvard Business Review
Analytics Services on lessons learned from
early adopter of this group of
technologies
Prospect Another survey by
Accenture "Closing the
Data Value Gap"
Prospect
The same survey mentions one of the
major reasons behind the gap – lack
of skills or skilled resources in Data
Analytics
How can you
get into this..
Build Skills.
Practitioners Speak
What do you expect from Fresh Data Analytics Developer of Data Scientist in your team?
I will speak for the projects I have been part of. I have always expected my developers to
be good at Problem Solving and Python. Familiarity of the Data Analytics tool stack is
something that will be an add on, but good problem-solving skills is a must!
Prasad Pawar
Senior Data Scientist working with TCS.
Practitioners Speak
What kinds of role a fresh Data Scientist will get to
perform?
When I got into this industry, I was expected to just create
models given data. I was not much worried about where it
came from. But now in some of my projects the analytics
developers are expected to know everything in the data
analytics stack. When the project has small team this is de
facto requirement. But yes, being 'really good' at some part
of the stack will make you visible and go longer.
Nitin Agarwal
Senior Data Scientist
from AI@Scale Team, Fractal
Practitioners
Speak
What do you do as a Data Analyst?
I joined as Data Analytics developer recently. I am
working on a cloud-based analytics tool. The tool was
not taught in the course but the concepts, lifecycle of
data analytics projects, functioning of various tools
which was taught to me in the course helps me a lot.
I am confident that with that foundational
understanding I can explore, learn and work with any
tool that may be required.
Rahul Shilpakar
Data Analyst working for a MNC
& an alumni of PG-DBDA, C-DAC
Thank you!

Más contenido relacionado

La actualidad más candente

Python and BIG Data analytics | Python Fundamentals | Python Architecture
Python and BIG Data analytics | Python Fundamentals | Python ArchitecturePython and BIG Data analytics | Python Fundamentals | Python Architecture
Python and BIG Data analytics | Python Fundamentals | Python ArchitectureSkillspeed
 
Array computing and the evolution of SciPy, NumPy, and PyData
Array computing and the evolution of SciPy, NumPy, and PyDataArray computing and the evolution of SciPy, NumPy, and PyData
Array computing and the evolution of SciPy, NumPy, and PyDataTravis Oliphant
 
Scaling Python to CPUs and GPUs
Scaling Python to CPUs and GPUsScaling Python to CPUs and GPUs
Scaling Python to CPUs and GPUsTravis Oliphant
 
First NL-HUG: Large-scale data processing at SARA with Apache Hadoop
First NL-HUG: Large-scale data processing at SARA with Apache HadoopFirst NL-HUG: Large-scale data processing at SARA with Apache Hadoop
First NL-HUG: Large-scale data processing at SARA with Apache HadoopEvert Lammerts
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridEvert Lammerts
 
Large-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with HadoopLarge-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with HadoopEvert Lammerts
 
Analyzing Data With Python
Analyzing Data With PythonAnalyzing Data With Python
Analyzing Data With PythonSarah Guido
 
Data Analysis and Visualization using Python
Data Analysis and Visualization using PythonData Analysis and Visualization using Python
Data Analysis and Visualization using PythonChariza Pladin
 
Python for Big Data Analytics
Python for Big Data AnalyticsPython for Big Data Analytics
Python for Big Data AnalyticsEdureka!
 
Notes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop MapreduceNotes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop MapreduceEvert Lammerts
 
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Maurice Nsabimana
 
Study of Various Tools for Data Science
Study of Various Tools for Data ScienceStudy of Various Tools for Data Science
Study of Various Tools for Data ScienceIRJET Journal
 
Intro to Python Data Analysis in Wakari
Intro to Python Data Analysis in WakariIntro to Python Data Analysis in Wakari
Intro to Python Data Analysis in WakariKarissa Rae McKelvey
 
High Performance Python - Marc Garcia
High Performance Python - Marc GarciaHigh Performance Python - Marc Garcia
High Performance Python - Marc GarciaMarc Garcia
 
Ted Willke, Intel Labs MLconf 2013
Ted Willke, Intel Labs MLconf 2013Ted Willke, Intel Labs MLconf 2013
Ted Willke, Intel Labs MLconf 2013MLconf
 
Beyond Kaggle: Solving Data Science Challenges at Scale
Beyond Kaggle: Solving Data Science Challenges at ScaleBeyond Kaggle: Solving Data Science Challenges at Scale
Beyond Kaggle: Solving Data Science Challenges at ScaleTuri, Inc.
 
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedMaking Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedTuri, Inc.
 
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Mario Cho
 
Data science in ruby is it possible? is it fast? should we use it?
Data science in ruby is it possible? is it fast? should we use it?Data science in ruby is it possible? is it fast? should we use it?
Data science in ruby is it possible? is it fast? should we use it?Rodrigo Urubatan
 
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...MLconf
 

La actualidad más candente (20)

Python and BIG Data analytics | Python Fundamentals | Python Architecture
Python and BIG Data analytics | Python Fundamentals | Python ArchitecturePython and BIG Data analytics | Python Fundamentals | Python Architecture
Python and BIG Data analytics | Python Fundamentals | Python Architecture
 
Array computing and the evolution of SciPy, NumPy, and PyData
Array computing and the evolution of SciPy, NumPy, and PyDataArray computing and the evolution of SciPy, NumPy, and PyData
Array computing and the evolution of SciPy, NumPy, and PyData
 
Scaling Python to CPUs and GPUs
Scaling Python to CPUs and GPUsScaling Python to CPUs and GPUs
Scaling Python to CPUs and GPUs
 
First NL-HUG: Large-scale data processing at SARA with Apache Hadoop
First NL-HUG: Large-scale data processing at SARA with Apache HadoopFirst NL-HUG: Large-scale data processing at SARA with Apache Hadoop
First NL-HUG: Large-scale data processing at SARA with Apache Hadoop
 
Hadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG Grid
 
Large-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with HadoopLarge-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with Hadoop
 
Analyzing Data With Python
Analyzing Data With PythonAnalyzing Data With Python
Analyzing Data With Python
 
Data Analysis and Visualization using Python
Data Analysis and Visualization using PythonData Analysis and Visualization using Python
Data Analysis and Visualization using Python
 
Python for Big Data Analytics
Python for Big Data AnalyticsPython for Big Data Analytics
Python for Big Data Analytics
 
Notes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop MapreduceNotes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop Mapreduce
 
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
Using Crowdsourced Images to Create Image Recognition Models with Analytics Z...
 
Study of Various Tools for Data Science
Study of Various Tools for Data ScienceStudy of Various Tools for Data Science
Study of Various Tools for Data Science
 
Intro to Python Data Analysis in Wakari
Intro to Python Data Analysis in WakariIntro to Python Data Analysis in Wakari
Intro to Python Data Analysis in Wakari
 
High Performance Python - Marc Garcia
High Performance Python - Marc GarciaHigh Performance Python - Marc Garcia
High Performance Python - Marc Garcia
 
Ted Willke, Intel Labs MLconf 2013
Ted Willke, Intel Labs MLconf 2013Ted Willke, Intel Labs MLconf 2013
Ted Willke, Intel Labs MLconf 2013
 
Beyond Kaggle: Solving Data Science Challenges at Scale
Beyond Kaggle: Solving Data Science Challenges at ScaleBeyond Kaggle: Solving Data Science Challenges at Scale
Beyond Kaggle: Solving Data Science Challenges at Scale
 
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and DistributedMaking Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and Distributed
 
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기 Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
Koss Lab 세미나 오픈소스 인공지능(AI) 프레임웍파헤치기
 
Data science in ruby is it possible? is it fast? should we use it?
Data science in ruby is it possible? is it fast? should we use it?Data science in ruby is it possible? is it fast? should we use it?
Data science in ruby is it possible? is it fast? should we use it?
 
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
Jean-François Puget, Distinguished Engineer, Machine Learning and Optimizatio...
 

Similar a Data Analytics Webinar for Aspirants

Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Simplilearn
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)NirupamNishant2
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfSujata Gupta
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?Aspire Techsoft Academy
 
Real World End to End machine Learning Pipeline
Real World End to End machine Learning PipelineReal World End to End machine Learning Pipeline
Real World End to End machine Learning PipelineSrivatsan Srinivasan
 
Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute PoojaPatidar11
 
integrating-cognitive-services-into-your-devops-strategy
integrating-cognitive-services-into-your-devops-strategyintegrating-cognitive-services-into-your-devops-strategy
integrating-cognitive-services-into-your-devops-strategyKarthik Jaganathan
 
Integrating cognitive services in to your devops strategy
Integrating cognitive services in to your devops strategyIntegrating cognitive services in to your devops strategy
Integrating cognitive services in to your devops strategyAspire Systems
 
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxUnlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxAPTRON Solutions Noida
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data ScienceSanghamitra Deb
 
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfData+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfneelakandan2001kpm
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learningSandeep Garg
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data ScientistRohit Dubey
 
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...Agile Testing Alliance
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningEng Teong Cheah
 
Board Infinity Data Science Brochure - data science learning path
Board Infinity Data Science Brochure -  data science learning pathBoard Infinity Data Science Brochure -  data science learning path
Board Infinity Data Science Brochure - data science learning pathBoard Infinity
 
Machine Tool And How You Can Work around It.pdf
Machine Tool And How You Can Work around It.pdfMachine Tool And How You Can Work around It.pdf
Machine Tool And How You Can Work around It.pdfLenore Industries
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Edureka!
 

Similar a Data Analytics Webinar for Aspirants (20)

Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
Brochure data science learning path board-infinity (1)
Brochure   data science learning path board-infinity (1)Brochure   data science learning path board-infinity (1)
Brochure data science learning path board-infinity (1)
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
 
What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?What is Data analytics? How is data analytics a better career option?
What is Data analytics? How is data analytics a better career option?
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Real World End to End machine Learning Pipeline
Real World End to End machine Learning PipelineReal World End to End machine Learning Pipeline
Real World End to End machine Learning Pipeline
 
Data analytics presentation- Management career institute
Data analytics presentation- Management career institute Data analytics presentation- Management career institute
Data analytics presentation- Management career institute
 
integrating-cognitive-services-into-your-devops-strategy
integrating-cognitive-services-into-your-devops-strategyintegrating-cognitive-services-into-your-devops-strategy
integrating-cognitive-services-into-your-devops-strategy
 
Integrating cognitive services in to your devops strategy
Integrating cognitive services in to your devops strategyIntegrating cognitive services in to your devops strategy
Integrating cognitive services in to your devops strategy
 
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptxUnlocking Insights_ The Power of Data Analytics in the Modern World.pptx
Unlocking Insights_ The Power of Data Analytics in the Modern World.pptx
 
From Rocket Science to Data Science
From Rocket Science to Data ScienceFrom Rocket Science to Data Science
From Rocket Science to Data Science
 
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdfData+Science+in+Python+-+Data+Prep+&+EDA.pdf
Data+Science+in+Python+-+Data+Prep+&+EDA.pdf
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
 
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...
#ATAGTR2021 Presentation : "Use of AI and ML in Performance Testing" by Adolf...
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Board Infinity Data Science Brochure - data science learning path
Board Infinity Data Science Brochure -  data science learning pathBoard Infinity Data Science Brochure -  data science learning path
Board Infinity Data Science Brochure - data science learning path
 
Machine Tool And How You Can Work around It.pdf
Machine Tool And How You Can Work around It.pdfMachine Tool And How You Can Work around It.pdf
Machine Tool And How You Can Work around It.pdf
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 

Más de Prakash Pimpale

Data Science - a brief keynote
Data Science - a brief keynoteData Science - a brief keynote
Data Science - a brief keynotePrakash Pimpale
 
Technology Entrepreneurship for Students
Technology Entrepreneurship for StudentsTechnology Entrepreneurship for Students
Technology Entrepreneurship for StudentsPrakash Pimpale
 
Collaboration tools in education
Collaboration tools in educationCollaboration tools in education
Collaboration tools in educationPrakash Pimpale
 
Entrepreneurship and Startups - Introduction
Entrepreneurship and Startups - IntroductionEntrepreneurship and Startups - Introduction
Entrepreneurship and Startups - IntroductionPrakash Pimpale
 

Más de Prakash Pimpale (7)

Data Science - a brief keynote
Data Science - a brief keynoteData Science - a brief keynote
Data Science - a brief keynote
 
Technology Entrepreneurship for Students
Technology Entrepreneurship for StudentsTechnology Entrepreneurship for Students
Technology Entrepreneurship for Students
 
Genetic Algorithms
Genetic AlgorithmsGenetic Algorithms
Genetic Algorithms
 
NLTK introduction
NLTK introductionNLTK introduction
NLTK introduction
 
Collaboration tools in education
Collaboration tools in educationCollaboration tools in education
Collaboration tools in education
 
Entrepreneurship and Startups - Introduction
Entrepreneurship and Startups - IntroductionEntrepreneurship and Startups - Introduction
Entrepreneurship and Startups - Introduction
 
Support Vector Machines
Support Vector MachinesSupport Vector Machines
Support Vector Machines
 

Último

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 

Último (20)

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 

Data Analytics Webinar for Aspirants

  • 1. Webinar for Aspirants of Data Analytics Data Analytics Prakash Pimpale Joint Director, C-DAC Mumbai A premier R&D organization of the Ministry of Electronics and Information Technology (MeitY), Govt. of India
  • 3. Data Data are a set of values of qualitative or quantitative variables about one or more entities.
  • 4. Analytics Analytics is the systematic computational analysis of data. It is used for the discovery, interpretation, and communication of meaningful patterns in data. It also entails applying data patterns towards effective decision-making.
  • 5. Science Systematic enterprise that builds and organizes knowledge in the form of testable explanations and predictions about the universe
  • 6. Derived Def • Data Analytics • Analytics is the systematic computational analysis of set of values of qualitative or quantitative variables about one or more entities. • Data Science • Systematic enterprise that builds and organizes knowledge in the form of testable explanations and predictions about the universe set of values of qualitative or quantitative variables about one or more entities.
  • 7. Data Science Data Analytics Data Scientist Data Engineer Data Wrangler Analytics Engineer Visualization Engineer
  • 8. Data Analytics is not 'just' Computer Programming Visualization Machine Learning Includes integrated applications of above and lots of Domain Understanding.
  • 9. Data Analytics Constitutes Data Existing Digitised System or Data – eCommerce site/app transactions, banking applications with customer and transaction records, survey results, etc. Data Infrastructure Storage, Processing and Management – Databases, transformation tools, bigdata infra (like Apache Spark, Cassandra, HDFS), etc. Data Analytics/Science Tools Algorithms, visualizations and programming tools – statistical analysis, classification, prediction algorithms, charting libraries, Python, R, etc.
  • 10. Why now The Data And this is just the social applications! There is additional data being generated through Enterprise and Consumer applications! DOMO
  • 12. Why now The Machine Learning Mobidev
  • 13. Data Analytics Pipeline Source 1 Store and Manage 2 Pre-process 3 Explore 4 Analyse/Model/Predict/Present 5
  • 14. A lot must happen prior to the real analytics
  • 16. A non-Machine Learning Based Solutions Non-ML System Rules Input Output A Developer With package 8 lpa
  • 17. ML System (Train) Rules Input Output A Data Science/Analytics Developer ML System (Production) New Inputs Output Machine Learning Based Solutions
  • 20. Visualization Swine flu mortality comparison for major Indian states for the first and second waves
  • 23. Major Challenges Lack of Skills how do we do all of these? Domain Knowledge what does 'customer churn' mean, which of these products are for male and which of these for female? Problem Formulation and choice of techniques which one of classification, prediction, numeric, non-numeric or just exploratory, etc.? Cleaning of the Data what is that we don't need from this or what is misleading in it? Integration of the data from multiple data sources if it's not in single place, how do we integrate these with minimum error and maximum data? Choice of the Right Data and Data Sources which of the data is useful and what is the right place to get it? Availability of Data is data for what you want to solve available?
  • 24. Some more challenges and resolutions More at https://analyticsindiamag.com/7-challenges- faced-by-data-scientists-in-data-processing-in-2020/​
  • 30. Prospect Survey by Harvard Business Review Analytics Services on lessons learned from early adopter of this group of technologies
  • 31. Prospect Another survey by Accenture "Closing the Data Value Gap"
  • 32. Prospect The same survey mentions one of the major reasons behind the gap – lack of skills or skilled resources in Data Analytics
  • 33. How can you get into this.. Build Skills.
  • 34. Practitioners Speak What do you expect from Fresh Data Analytics Developer of Data Scientist in your team? I will speak for the projects I have been part of. I have always expected my developers to be good at Problem Solving and Python. Familiarity of the Data Analytics tool stack is something that will be an add on, but good problem-solving skills is a must! Prasad Pawar Senior Data Scientist working with TCS.
  • 35. Practitioners Speak What kinds of role a fresh Data Scientist will get to perform? When I got into this industry, I was expected to just create models given data. I was not much worried about where it came from. But now in some of my projects the analytics developers are expected to know everything in the data analytics stack. When the project has small team this is de facto requirement. But yes, being 'really good' at some part of the stack will make you visible and go longer. Nitin Agarwal Senior Data Scientist from AI@Scale Team, Fractal
  • 36. Practitioners Speak What do you do as a Data Analyst? I joined as Data Analytics developer recently. I am working on a cloud-based analytics tool. The tool was not taught in the course but the concepts, lifecycle of data analytics projects, functioning of various tools which was taught to me in the course helps me a lot. I am confident that with that foundational understanding I can explore, learn and work with any tool that may be required. Rahul Shilpakar Data Analyst working for a MNC & an alumni of PG-DBDA, C-DAC