SlideShare a Scribd company logo
1 of 43
Data Skills for Digital Era
The Top Data Skills You Need To Get Hired
Main Focus
Data Science Business Intelligence
Big Data Data Engineering
Mohtat@ut.ac.ir 2
Data Science
Math & Statistics
Computer Science
Subject Matter Expertise
Mohtat@ut.ac.ir 4
Data Science is an
interdisciplinary field about
processes and systems to
extract knowledge or
insights from data, which is
a continuation of some of
the data analysis fields such
as statistics, data mining,
and predictive analytics,
similar to Knowledge
Discovery in
Databases (KDD).
Types of Analytics
Descriptive
Diagnostic
Prescriptive
Predictive
Mohtat@ut.ac.ir 6
Data
Science
Technology
Application
Mohtat@ut.ac.ir 8
Critical Skills for Data Scientists
Python
R
SQL
Data Mining Tools
Knime , RapidMiner,
IBM SPSS Modeler
Excel
BI Tools
Tableau, Power BI, Qlik
Mohtat@ut.ac.ir 9
Top Python Libraries in Data Science
TensorFlow
“TensorFlow is an open source
software library for numerical
computation using data flow graphs.
PyTorch
“PyTorch is a Python package that
provides Deep neural networks built
on a tape-based autograd system
Numpy
“NumPy is the fundamental
package needed for scientific
computing with Python.
Scikit-Learn
“scikit-learn is a Python module for
machine learning built on NumPy,
SciPy and matplotlib.
Keras
“Keras is a high-level neural networks
API, written in Python and capable of
running on top of TensorFlow, CNTK,
or Theano.
Scipy
“SciPy is open-source software for
mathematics, science, and engineering.
Pandas
“pandas is a Python package providing
fast, flexible, and expressive data
structures designed to make working
with "relational" or "labeled" data both
easy and intuitive
Matplotlib
“Matplotlib is a Python 2D plotting
library which produces publication-
quality figures in a variety of
hardcopy formats and interactive
environments across platforms.
Scrapy
“Scrapy is a fast high-level web crawling
and web scraping framework, used to
crawl websites and extract structured
data from their pages.
Mohtat@ut.ac.ir 10
Top Skills every Data Scientist needs to Master
TensorFlow Keras Hadoop Spark Hive Java Matlab
Mohtat@ut.ac.ir 11
Most Essential Skills for Data Scientists
Complex Problem Solving
Team Working
Emotional Intelligence
Creativity
Critical Thinking
Negotiation
Mohtat@ut.ac.ir 12
Applied Data Science with Python
Michigan University(Coursera)
Basic Data Visualization Machine Learning Text Mining SNA
Applied Text Mining in Python
Introduction to Data Science in Python
Applied Plotting, Charting & Data
Representation in Python
Applied Machine Learning in Python Applied Social Network Analysis in
Python
Mohtat@ut.ac.ir 13LOGO HERE
Data Science Books
14
The Long Road To Become a Data Scientist
Business Intelligence
encompasses a wide variety of
tools, applications and
methodologies that enable
organizations to collect data
from internal systems and
external sources; prepare it for
analysis; develop and run
queries against that data; and
create reports, dashboards and
data visualizations to make the
analytical results available to
corporate decision-makers, as
well as operational workers.
BI
Mohtat@ut.ac.ir 17
Business Skills
Link to Business Strategy
Define Priorities
Define BI Vision
Lead Organization / BPR
Analytics Skills
Data Mining
Social BI
IT Skills
Infrastructure
Build Technology
Data Integration & Quality
Business
Intelligence
Architect
Simple is what it needs in business
Top Business Intelligence Skills
SQL
Data Warehousing
Data Analysis
Tableau
ETL
23%
85%
28%
41%
65%
Mohtat@ut.ac.ir 20
28%
Top Business Intelligence Skills
Business Analyst
Oracle
SQL Server BI
Business Process
Data Modeling 17%
85%
19%
21%
22%
Mohtat@ut.ac.ir 21
19%
Top Business Intelligence Tools
Tableau Power BI Qlik
Your Choice Is Clear
Mohtat@ut.ac.ir 22
Big Data
Volume
Terabyte
Distribute
Big Table
Velocity
Real-time
Stream Processing
Variety
Structured
Unstructured
Text, Image, Video
Mohtat@ut.ac.ir 27
Big data is a term used to
refer to data sets that are
too large or complex for
traditional data-processing
application software to
adequately deal with.
It’s what organizations do
with the data that matters.
Big data can be analyzed
for insights that lead to
better decisions and
strategic business moves.
Hadoop Ecosystem
3 Types of Big Data Jobs
1 2
3
Big Data Developer
Big Data Administration
Big Data Analytics
Mohtat@ut.ac.ir 29
Top Big Data Programming Languages
Not only Hadoop, many other big data analysis tools like Storm,
Spark, and Kafka are written in Java and run on the JVM
Java
Python is a simple, open-source, general-purpose language.
Hence, it is easy to learn Python for anyone.. With its rich set
of utilities and libraries and easy-to-use features, it works
wonder for big data processing and analysis.
Python
Scala is a rival of Java and Python in the world of Data Science
and becoming more and more popular due to extensive use of
Apache Spark in Big data Hadoop industry.
Scala
Mohtat@ut.ac.ir 30
Pathway to Success
Success
Apache Hadoop
Apache Spark
Start
NoSQL Database
Data Analytics
Data Visualization
Mohtat@ut.ac.ir 31
Big Data Companies & Vendors
Cloudera, Inc. is a US-based
software company that
provides a software platform
for data engineering, data
warehousing, machine
learning and analytics that
runs in the cloud or on
premises
Cloudera
MapR is a business software
company headquartered in
Santa Clara, California. MapR
provides access to a variety of
data sources from a single
computer cluster, including big
data workloads
MapR
Hortonworks is a data software
company based in Santa Clara,
California that develops,
supports, and provides expertise
on a set of open-source software
designed to manage data and
processing for things such as IOT,
single view of X, and advanced
analytics and machine learning
Hortonworks
34
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
35
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
Big Data Specialization
Michigan University(Coursera)
Introduction to Big Data
Big Data Modeling and
Management Systems
Big Data Integration and Processing
Machine Learning With Big Data
Graph Analytics for Big Data
Mohtat@ut.ac.ir 36LOGO HERE
Apache Spark
Berkeley University
Mohtat@ut.ac.ir 37LOGO HERE
Big Data Book
38
Data Scientist VS Data Engineer
Mohtat@ut.ac.ir 40
Dolor sit ametis
Data Engineering
Data Scientist
Data Pipelines
Visualization & Storytelling
Programming
Modeling & Advance Analytics
Math & Statistics
System Implementation
Data Engineering
Data engineers develop, maintain,
test and evaluate data solutions
within organizations. ... A data
engineer builds large-scale data
processing systems, is an expert in
data warehousing solutions and
should be able to work with the
latest (NoSQL) database
technologies.
Clean and wrangle data
into a usable state
Mohtat@ut.ac.ir 41
How To Become A Data Engineer
Linux
NoSQL & SQL
Python / Java / Scala
Agile Development
Data Ingestion
Processing Frameworks
Mohtat@ut.ac.ir 42
Best Data Processing Frameworks
MapReduce is a programming model
and an associated implementation for
processing and generating big data
sets with a parallel, distributed
algorithm on a cluster
Apache Spark is an open-
source distributed
general-purpose cluster-
computing framework.
Apache Storm is a free
and open source
distributed realtime
computation system.
The core of Apache Flink
is a distributed streaming
dataflow engine written in
Java and Scala
43
Cassandra
Best NoSQL Database
Mohtat@ut.ac.ir 44
Data Ingestion Tools
Apache Kafka
SSIS & ODI
Apache NiFi
Logstash
Mohtat@ut.ac.ir 45
Mohtat@ut.ac.ir
https://www.linkedin.com/in/mohtat
https://www.t.me/DataAnalysis
Contact Us
Thank You

More Related Content

What's hot

Big data analytics
Big data analyticsBig data analytics
Big data analyticsRavi Teja
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsRavi Teja
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Databricks
 
It Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsIt Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsOntotext
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overviewColleen Farrelly
 
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"Dataconomy Media
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...vinayiqbusiness
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersMelinda Thielbar
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systemsMultisoft Systems
 
Research Topics on Data Mining
Research Topics on Data MiningResearch Topics on Data Mining
Research Topics on Data MiningPhdtopiccom
 
Big data deep learning: applications and challenges
Big data deep learning: applications and challengesBig data deep learning: applications and challenges
Big data deep learning: applications and challengesfazail amin
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Edureka!
 
Project Topics in Data Mining
Project Topics in Data MiningProject Topics in Data Mining
Project Topics in Data MiningPhdtopiccom
 
Data Science : Make Smarter Business Decisions
Data Science : Make Smarter Business DecisionsData Science : Make Smarter Business Decisions
Data Science : Make Smarter Business DecisionsEdureka!
 

What's hot (20)

Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
Building a Knowledge Graph with Spark and NLP: How We Recommend Novel Drugs t...
 
It Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got SemanticsIt Don’t Mean a Thing If It Ain’t Got Semantics
It Don’t Mean a Thing If It Ain’t Got Semantics
 
Big data and data science overview
Big data and data science overviewBig data and data science overview
Big data and data science overview
 
Apouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-dataApouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-data
 
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
Katharine Jarmul, Founder at Kjamistan - "Learn Data Wrangling with Python"
 
Data science
Data science Data science
Data science
 
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
What is Data Science? |Role of Data Science in Big Data, Hadoop & Machine Lea...
 
Tools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl WintersTools and Methods for Big Data Analytics by Dahl Winters
Tools and Methods for Big Data Analytics by Dahl Winters
 
Data Science Project Lifecycle and Skill Set
Data Science Project Lifecycle and Skill SetData Science Project Lifecycle and Skill Set
Data Science Project Lifecycle and Skill Set
 
Data mining
Data miningData mining
Data mining
 
Data science using r multisoft systems
Data science using r  multisoft systemsData science using r  multisoft systems
Data science using r multisoft systems
 
Big data road map
Big data road mapBig data road map
Big data road map
 
Research Topics on Data Mining
Research Topics on Data MiningResearch Topics on Data Mining
Research Topics on Data Mining
 
Big data deep learning: applications and challenges
Big data deep learning: applications and challengesBig data deep learning: applications and challenges
Big data deep learning: applications and challenges
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples
 
Project Topics in Data Mining
Project Topics in Data MiningProject Topics in Data Mining
Project Topics in Data Mining
 
Data Science : Make Smarter Business Decisions
Data Science : Make Smarter Business DecisionsData Science : Make Smarter Business Decisions
Data Science : Make Smarter Business Decisions
 
Myths of Data Science
Myths of Data ScienceMyths of Data Science
Myths of Data Science
 

Similar to Top Data Skills and Tools for Digital Era Careers

Data Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ایData Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ایHosseinieh Ershad Public Library
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...phdAssistance1
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistancephdAssistance1
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxAbderrahmanABID2
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 
2019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 42019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 4Ferdin Joe John Joseph PhD
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Simplilearn
 
Data Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | DotechtalkData Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | DotechtalkDOTECHTALK
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...Denodo
 
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceIntroduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceFerdin Joe John Joseph PhD
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Bhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystemBhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystemVijayananda Mohire
 
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyay
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyayCareer guidance talk it makaut_ppt_sabyasachi mukhopadhyay
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyaySabyasachi Mukhopadhyay
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data ScienceDataWorks Summit
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdfCIOWomenMagazine
 

Similar to Top Data Skills and Tools for Digital Era Careers (20)

Data Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ایData Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ای
 
Python para Manual de Ciência de Dados
Python para Manual de Ciência de DadosPython para Manual de Ciência de Dados
Python para Manual de Ciência de Dados
 
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
Coding‌ ‌Software‌ ‌and‌ ‌Tools‌ ‌used‌ ‌for‌ ‌Data‌ ‌Science‌ ‌Management‌ ‌...
 
Coding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - PhdassistanceCoding software and tools used for data science management - Phdassistance
Coding software and tools used for data science management - Phdassistance
 
Ch1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
2019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 42019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 4
 
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
 
Data Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | DotechtalkData Mining Tools for Your Business | Dotechtalk
Data Mining Tools for Your Business | Dotechtalk
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data ScienceIntroduction to Data Science - Week 4 - Tools and Technologies in Data Science
Introduction to Data Science - Week 4 - Tools and Technologies in Data Science
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Bhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystemBhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystem
 
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyay
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyayCareer guidance talk it makaut_ppt_sabyasachi mukhopadhyay
Career guidance talk it makaut_ppt_sabyasachi mukhopadhyay
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017 Proposed Talk Outline for Pycon2017
Proposed Talk Outline for Pycon2017
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
12 Pro Predictive Analysis Tools to Look Out for in 2024.pdf
 

Recently uploaded

How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 

Recently uploaded (20)

How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 

Top Data Skills and Tools for Digital Era Careers

  • 1. Data Skills for Digital Era The Top Data Skills You Need To Get Hired
  • 2. Main Focus Data Science Business Intelligence Big Data Data Engineering Mohtat@ut.ac.ir 2
  • 3.
  • 4. Data Science Math & Statistics Computer Science Subject Matter Expertise Mohtat@ut.ac.ir 4 Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights from data, which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytics, similar to Knowledge Discovery in Databases (KDD).
  • 7. Critical Skills for Data Scientists Python R SQL Data Mining Tools Knime , RapidMiner, IBM SPSS Modeler Excel BI Tools Tableau, Power BI, Qlik Mohtat@ut.ac.ir 9
  • 8. Top Python Libraries in Data Science TensorFlow “TensorFlow is an open source software library for numerical computation using data flow graphs. PyTorch “PyTorch is a Python package that provides Deep neural networks built on a tape-based autograd system Numpy “NumPy is the fundamental package needed for scientific computing with Python. Scikit-Learn “scikit-learn is a Python module for machine learning built on NumPy, SciPy and matplotlib. Keras “Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. Scipy “SciPy is open-source software for mathematics, science, and engineering. Pandas “pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive Matplotlib “Matplotlib is a Python 2D plotting library which produces publication- quality figures in a variety of hardcopy formats and interactive environments across platforms. Scrapy “Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. Mohtat@ut.ac.ir 10
  • 9. Top Skills every Data Scientist needs to Master TensorFlow Keras Hadoop Spark Hive Java Matlab Mohtat@ut.ac.ir 11
  • 10. Most Essential Skills for Data Scientists Complex Problem Solving Team Working Emotional Intelligence Creativity Critical Thinking Negotiation Mohtat@ut.ac.ir 12
  • 11. Applied Data Science with Python Michigan University(Coursera) Basic Data Visualization Machine Learning Text Mining SNA Applied Text Mining in Python Introduction to Data Science in Python Applied Plotting, Charting & Data Representation in Python Applied Machine Learning in Python Applied Social Network Analysis in Python Mohtat@ut.ac.ir 13LOGO HERE
  • 13. The Long Road To Become a Data Scientist
  • 14.
  • 15. Business Intelligence encompasses a wide variety of tools, applications and methodologies that enable organizations to collect data from internal systems and external sources; prepare it for analysis; develop and run queries against that data; and create reports, dashboards and data visualizations to make the analytical results available to corporate decision-makers, as well as operational workers. BI Mohtat@ut.ac.ir 17 Business Skills Link to Business Strategy Define Priorities Define BI Vision Lead Organization / BPR Analytics Skills Data Mining Social BI IT Skills Infrastructure Build Technology Data Integration & Quality
  • 17. Top Business Intelligence Skills SQL Data Warehousing Data Analysis Tableau ETL 23% 85% 28% 41% 65% Mohtat@ut.ac.ir 20 28%
  • 18. Top Business Intelligence Skills Business Analyst Oracle SQL Server BI Business Process Data Modeling 17% 85% 19% 21% 22% Mohtat@ut.ac.ir 21 19%
  • 19. Top Business Intelligence Tools Tableau Power BI Qlik Your Choice Is Clear Mohtat@ut.ac.ir 22
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. Big Data Volume Terabyte Distribute Big Table Velocity Real-time Stream Processing Variety Structured Unstructured Text, Image, Video Mohtat@ut.ac.ir 27 Big data is a term used to refer to data sets that are too large or complex for traditional data-processing application software to adequately deal with. It’s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.
  • 26. 3 Types of Big Data Jobs 1 2 3 Big Data Developer Big Data Administration Big Data Analytics Mohtat@ut.ac.ir 29
  • 27. Top Big Data Programming Languages Not only Hadoop, many other big data analysis tools like Storm, Spark, and Kafka are written in Java and run on the JVM Java Python is a simple, open-source, general-purpose language. Hence, it is easy to learn Python for anyone.. With its rich set of utilities and libraries and easy-to-use features, it works wonder for big data processing and analysis. Python Scala is a rival of Java and Python in the world of Data Science and becoming more and more popular due to extensive use of Apache Spark in Big data Hadoop industry. Scala Mohtat@ut.ac.ir 30
  • 28. Pathway to Success Success Apache Hadoop Apache Spark Start NoSQL Database Data Analytics Data Visualization Mohtat@ut.ac.ir 31
  • 29. Big Data Companies & Vendors Cloudera, Inc. is a US-based software company that provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises Cloudera MapR is a business software company headquartered in Santa Clara, California. MapR provides access to a variety of data sources from a single computer cluster, including big data workloads MapR Hortonworks is a data software company based in Santa Clara, California that develops, supports, and provides expertise on a set of open-source software designed to manage data and processing for things such as IOT, single view of X, and advanced analytics and machine learning Hortonworks
  • 32. Big Data Specialization Michigan University(Coursera) Introduction to Big Data Big Data Modeling and Management Systems Big Data Integration and Processing Machine Learning With Big Data Graph Analytics for Big Data Mohtat@ut.ac.ir 36LOGO HERE
  • 35.
  • 36. Data Scientist VS Data Engineer Mohtat@ut.ac.ir 40 Dolor sit ametis Data Engineering Data Scientist Data Pipelines Visualization & Storytelling Programming Modeling & Advance Analytics Math & Statistics System Implementation
  • 37. Data Engineering Data engineers develop, maintain, test and evaluate data solutions within organizations. ... A data engineer builds large-scale data processing systems, is an expert in data warehousing solutions and should be able to work with the latest (NoSQL) database technologies. Clean and wrangle data into a usable state Mohtat@ut.ac.ir 41
  • 38. How To Become A Data Engineer Linux NoSQL & SQL Python / Java / Scala Agile Development Data Ingestion Processing Frameworks Mohtat@ut.ac.ir 42
  • 39. Best Data Processing Frameworks MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster Apache Spark is an open- source distributed general-purpose cluster- computing framework. Apache Storm is a free and open source distributed realtime computation system. The core of Apache Flink is a distributed streaming dataflow engine written in Java and Scala 43
  • 41. Data Ingestion Tools Apache Kafka SSIS & ODI Apache NiFi Logstash Mohtat@ut.ac.ir 45
  • 42.