This document discusses big data, data science, and career opportunities in these growing fields. It notes that data science involves using large datasets and tools like machine learning and predictive analytics. Popular technologies include Hadoop, Spark, Java, Python and R. Careers in back-end roles like systems administration and front-end roles like data scientist and analyst are increasing. Skills in areas like Linux, Hadoop, databases, Python and R are in demand. The document provides an overview of applications, industries and references for further information.
whole genome sequencing new and its types including shortgun and clone by clone
Dan D'Urso Discusses Big Data, Data Science Growth Opportunities, Applications and Required Skills
1.
2. DAN D’URSO
• OWNER ORANGE COAST DATABASE ASSOCIATES, INC.
• IT INSTRUCTOR AT UNIVERSITY OF PHOENIX
• 25 YEARS SOFTWARE MANAGER AT DIGITAL EQUIPMENT (NOW HEWLETT-PACKARD); SOFTWARE
ENGINEER COMPILERS AND OPERATING SYSTEMS AT BURROUGHS CORPORATION
• MBA QUANTITATIVE METHODS , MS COMPUTER SCIENCE
4/22/2017IT IN BIG DATA AND DATA SCIENCE
3. BIG DATA & DATA
SCIENCE
GROWTH OPPORTUNITIES
BIG DATA
DATA SCIENCE
APPLICATIONS
TECHNOLOGIES
CAREERS
SKILLS REQUIRED
4/22/2017IT IN BIG DATA AND DATA SCIENCE
4. GROWTH OPPORTUNITIES
• BIG DATA AND DATA SCIENCE ARE RELATED FIELDS – DATA SCIENCE HAS BEEN CALLED THE MARRIAGE OF
BIG DATA AND STATISTICS
• SEXIEST JOB OF THE 21ST CENTURY (HARVARD BUSINESS REVIEW, 2016)
• BY2018 WE WILL NEED 1.5 MILLION MANAGERS AND ANALYSTS AT LEAST FAMILIAR WITH DATA
OPERATIONS ALONG WITH 140 TO 190 THOUSAND DATA SCIENTISTS
• DATA SCIENTIST WILL BE 3RD HIGHEST PAYING OCCUPATION (LYNDA.COM, 2016)
4/22/2017IT IN BIG DATA AND DATA SCIENCE
5. WHAT IS BIG DATA
HUGE VOLUMES
EXTREME VELOCITY
WIDE VARIETY
4/22/2017IT IN BIG DATA AND DATA SCIENCE
8. BIG DATA VARIETY
4/22/2017IT IN BIG DATA AND DATA SCIENCE
• NEW DATA SOURCES
MAKING DATA ANALYSIS
JOB MORE COMPLEX
9. DATA SCIENCE
USES BIG DATA FROM DATABASES AND
SOCIAL MEDIA
APPLIES STATISTICAL AND VISUALIZATION
TOOLS
CURRENT USES INCLUDE PREDICTIVE
ANALYTICS AND MACHINE LEARNING
4/22/2017IT IN BIG DATA AND DATA SCIENCE
10. APPLICATIONS
• ELECTRONIC HEALTH RECORDS, PATIENT MONITORING
• REAL TIME PRODUCT OFFERS
• PREDICTIVE ANALYTICS USING SOCIAL MEDIA DATA (CROSS-SELLING, MARKET PRICES)
• FRAUD DETECTION
• ONLINE AD PLACEMENT
• MANUFACTURING PROCESS IMPROVEMENTS
• ANALYSIS OF SCIENTIFIC DATA: ASTRONOMY, PHYSICS, BIOLOGY (GENOMICS)
4/22/2017IT IN BIG DATA AND DATA SCIENCE
11. BASE TECHNOLOGIES
• UNIX/LINUX
• HADOOP/SPARK – PARALLEL PROCESSING USING
HUNDREDS OF COMPUTERS
• JAVA /SCALA – MAIN PROGRAMMING
LANGUAGES FOR ABOVE
• PYTHON – SWISS ARMY KNIFE PROGRAMMING
LANGUAGE
• R STATISTICS LANGUAGE
• ANALYTICAL TOOLS – EXCEL, TABLEAU, BI TOOLS,
SAS/SPSS, ETC.
4/22/2017IT IN BIG DATA AND DATA SCIENCE
12. CAREERS
BACK END
• SYSTEMS ADMINISTRATOR (UNIX/LINUX)
• DATA BASE ADMINISTRATOR
• NETWORK ADMINISTRATOR
FRONT END
• DATA SCIENTIST
• STATISTICIAN
• DATA ANALYST
• BUSINESS ANALYST
• BUSINESS INTELLIGENCE ANALYST
4/22/2017IT IN BIG DATA AND DATA SCIENCE
13. INDUSTRIES
POTENTIAL VARIES BY INDUSTRY
MOST WILL MAKE INCREASING USE OF
BIG DATA AND DATA SCIENTISTS
4/22/2017IT IN BIG DATA AND DATA SCIENCE
14. IT TECHNOLOGY SKILLS
DATA ENGINEERING
• UNIX/LINUX
• HADOOP/SPARK
• DATABASES (ESP. NOSQL)
• JAVA
• PYTHON
DATA ANALYSIS/DATA SCIENCE
• DATABASE QUERYING/BI TOOLS
• PYTHON
• R (LEADING STATISTICS LANGUAGE)
• YES, EXCEL, TABLEAU. ETC.
• UNIX/LINUX COMMAND LINE MAY BE HELPFUL
4/22/2017IT IN BIG DATA AND DATA SCIENCE
16. REFERENCES
• DAVENPORT T., & PATIL, D. (2012, OCTOBER). DATA SCIENTIST: THE SEXIEST JOB OF THE 21ST CENTURY. HARVARD BUSINESS REVIEW. RETRIEVED FROM HTTPS://HBR.ORG/2012/10/DATA-
SCIENTIST-THE-SEXIEST-JOB-OF-THE-21ST-CENTURY
• DAVIS, P. (2013, JULY 23). MCKINSEY REPORTHIGHLIGHTS THE IMPENDING DATA SCIENTIST SHORTAGE. PIVOTAL. RETRIEVED FROM: HTTPS://BLOG.PIVOTAL.IO/DATA-SCIENCE-
PIVOTAL/NEWS/MCKINSEY-REPORT-HIGHLIGHTS-THE-IMPENDING-DATA-SCIENTIST-SHORTAGE
• HURWITZ, J., NUGENT, A., HALPER, DR. F. & KAUFMAN, M. (2013). BIG DATA FOR DUMMIES. HOBOKEN, NJ: JOHN WILEY & SONS, INC.
• KING, J. (2014, DECEMBER 4). 2014 DATA SCIENCE SALARY SURVEY. RADAR. RETRIEVED FROM: HTTP://RADAR.OREILLY.COM/2014/12/2014-DATA-SCIENCE-SALARY-SURVEY.HTML
• LANEY, D. (2001, FEBRUARY 6). 3D DATA MANAGEMENT: CONTROLLINGDATA VOLUME, VELOCITY AND VARIETY. META GROUP. RETRIEVED FROM HTTP://BLOGS.GARTNER.COM/DOUG-
LANEY/FILES/2012/01/AD949-3D-DATA-MANAGEMENT-CONTROLLING-DATA-VOLUME-VELOCITY-AND-VARIETY.PDF
• LYNDA.COM. (2016). INTRODUCTION TO DATA SCIENCE. RETRIEVED FROM: HTTPS://WWW.LYNDA.COM/BIG-DATA-TUTORIALS/INTRODUCTION-DATA-SCIENCE/420305-2.HTML
• MANYIKA, J., CHUI, M., BROWN B., BUGHIN J., DOBBS R., ROXBURGH, C., BEYERS, A. (2011, MAY). BIG DATA: THE NEXT FRONTIERFOR INNOVATION, COMPETITION AND PRODUCTIVITY. REPORT –
MCKINSEY GLOBAL INSTITUTE. RETRIEVED FROM HTTP://WWW.MCKINSEY.COM/BUSINESS-FUNCTIONS/DIGITAL-MCKINSEY/OUR-INSIGHTS/BIG-DATA-THE-NEXT-FRONTIER-FOR-
INNOVATION
• PROVOST, F., & FAWCETT, T. (2013). DATA SCIENCE FOR BUSINESS. SEBASTOPOL, CA: O’REILLY MEDIA, INC.
4/22/2017IT IN BIG DATA AND DATA SCIENCE