SlideShare una empresa de Scribd logo
1 de 27
Google Cloud
Certifications &
Machine Learning
TensorFlow-NorthWest Meetup
January 29. 2020
GCP Reference ML Architecture
What’s Needed to Deploy ML to
Production
● ML centric
○ Data collection
○ Data
transformation
○ Feature
engineering
○ Model building
and evaluation
● Production centric
○ Data storage
○ Scalable transformation
processes
○ Workflows
○ Integration and
deployment
○ Monitoring
○ Security and Compliance
Source: https://www.7wdata.be/data-science/whats-the-difference-between-a-data-engineer-a-data-analyst-and-a-data-scientist/
Specifically, what do we need to know?
Know What Certifications Test
Know What Certifications Test
Customer
Implementation
Technical
requirements
Business
requirements
Technical
requirements
Customer
Implementation
Professional
Certification
Scope of GCP Certifications
Associate
Cloud Engineer
Professional Level
Certification
Technical
requirements
Business
requirements
Design
Plan
Ensure Feasibility
Optimize
Build
Deploy
Manage
Customer
Implementation
Build
Deploy
Manage
Technical
requirements
Build
Deploy
Manage
Who Benefits?
● Practitioners - salaries 15% more than
non-certified (IDC/Microsoft )
● Organizations - new hire productivity,
hire advancement, streamlined hiring
● Google - supports efforts to expand
market share by creating a pool of
knowledgeable professionals
Google Cloud Professional Data
Engineer Exam
○ Designing data processing and storage systems
○ Migrating data warehouse
○ Operationalizing storage, processing infrastructure, and
pipeline
○ Operationalizing ML models
○ Pre-built models
○ ML architecture e.g. edge computing
○ Security and compliance
○ Scalability and portability
Designing Data Processing & Storage
Systems
● Data modeling
● Latency, throughput,
transactions
● Fault tolerance
● Distributed systems
● Batch and stream
processing
● Job automation and
orchestration
● Event processing
Operationalizing Processing and
Storage
● Storage costs and performance
● Data cleansing
● Data lifecycle management
● Provisioning resources
● Monitoring and adjusting pipelines
● Effective use of managed services
Operationalizing Machine Learning
Models
● Pre-built ML models as a service
● Ingesting data
● Training machine learning models
● Training and serving infrastructure
● Hardware accelerators
● ML terminology
Options for Operational ML
● Compute Engine
○ GPU or TPUs
○ Deep Learning VM
○ C2-standard-60 60 vCPUs 240GB 257TB
● Kubernetes Engine
○ Supports GPUs and TPUs
○ Containers with TF, PyTorch, and R
○ Job and deployment APIs
● AI Platform
○ Serverless option
○ Train, evaluate, tune models
○ TensorFlow, Scikit Learn, XGBoost
Study Strategy
● Follow Certification Exam Guide
○ High level domains & detailed tasks
○ https://cloud.google.com/certification/guides/data-engineer/
● Take Practice Exam
○ Good assessment but the actual test is more difficult
● Identify weakest areas
○ We often focus on some, not all domains in our work
○ You will be tested on all domains
● Perform tasks using Cloud Console and Cloud Shell
Exam Taking Strategy
● Timed test, know your remaining time
○ 50 multiple choice questions
○ 2 hours
○ Mark questions for review
● Read questions carefully
○ Identify key services and software
○ Identify technical requirements
● Focus on how to choose between likely options or near misses
Services to Know in Depth
How to Choose Between
● Cloud SQL
● Cloud Spanner
● Cloud Firestore
● Cloud Bigtable
● BiqQuery
● Cloud Storage
Learning Resources
Certification Guides
Professional
Data Engineer
April, 2020
Final Thoughts ...
● Certifications help
define scope of
knowledge needed
for a particular role
● They are a measure
of competence, not
expertise
● Continuous learning
What you
know
What you think
you know
Circle of Competence
Image Sources
Slide 2” https://blog.jupyter.org/jupyterlab-is-ready-for-users-5a6f039b8906.
Slide 3: https://commons.wikimedia.org/wiki/File:Proto-Labs-Manufacturing-Injection-Molding.jpg
Slide 4: https://cloud.google.com/blog/products/ai-machine-learning/solution-implementing-industrial-predictive-maintenance-part-iii
Slide 8: https://www.7wdata.be/data-science/whats-the-difference-between-a-data-engineer-a-data-analyst-and-a-data-scientist/
Slide 10: https://irecusa.org/wp-content/uploads/2013/06/Task-Analysis-word-cloud.jpg
Slide 11 : https://irecusa.org/2013/06/job-task-analyses-at-the-very-heart-of-credentialing/
Slide 13 https://miro.medium.com/max/2048/0*zb4dcNV0gFVgFTJm.png

Más contenido relacionado

Similar a Google Cloud Certifications & Machine Learning

Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsDaniel Zivkovic
 
Google cloud certification
Google cloud certificationGoogle cloud certification
Google cloud certificationabilashiniN
 
MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCgdgsurrey
 
Kubernetes for machine learning
Kubernetes for machine learningKubernetes for machine learning
Kubernetes for machine learningAkash Agrawal
 
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送Google Cloud Platform - Japan
 
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...Andrejs Prokopjevs
 
[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification
[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification
[Study Guide] Google Professional Cloud Architect (GCP-PCA) CertificationAmaaira Johns
 
Platform Engineering
Platform EngineeringPlatform Engineering
Platform EngineeringOpsta
 
Google cloud big data summit master gcp big data summit la - 10-20-2015
Google cloud big data summit   master gcp big data summit la - 10-20-2015Google cloud big data summit   master gcp big data summit la - 10-20-2015
Google cloud big data summit master gcp big data summit la - 10-20-2015Raj Babu
 
Cloud Manthn Software Solutions Pvt Ltd - What we do ?
Cloud Manthn Software Solutions Pvt Ltd - What we do ?Cloud Manthn Software Solutions Pvt Ltd - What we do ?
Cloud Manthn Software Solutions Pvt Ltd - What we do ?amodkadam
 
Rsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AIRsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AISanjana Chowdhury
 
Multiplier Effect: Case Studies in Distributions for Publishers
Multiplier Effect: Case Studies in Distributions for PublishersMultiplier Effect: Case Studies in Distributions for Publishers
Multiplier Effect: Case Studies in Distributions for PublishersJon Peck
 
SaaS startups - Software Engineering Challenges
SaaS startups - Software Engineering ChallengesSaaS startups - Software Engineering Challenges
SaaS startups - Software Engineering ChallengesMalinda Kapuruge
 
Implementing and Extending Oracle PLM Cloud for Gibson Overseas
Implementing and Extending Oracle PLM Cloud for Gibson OverseasImplementing and Extending Oracle PLM Cloud for Gibson Overseas
Implementing and Extending Oracle PLM Cloud for Gibson OverseasJade Global
 
Columbus AWS Meetup: AWS Certifications
Columbus AWS Meetup: AWS CertificationsColumbus AWS Meetup: AWS Certifications
Columbus AWS Meetup: AWS CertificationsAndrewMay59
 
Accelerating Digital Transformation: It's About Digital Enablement
Accelerating Digital Transformation:  It's About Digital EnablementAccelerating Digital Transformation:  It's About Digital Enablement
Accelerating Digital Transformation: It's About Digital EnablementJoshua Gossett
 
Sunrun slide for informatica summit - Harish Ramachandraiah
Sunrun slide for informatica summit - Harish RamachandraiahSunrun slide for informatica summit - Harish Ramachandraiah
Sunrun slide for informatica summit - Harish RamachandraiahHarish Ramachandraiah
 
City & County of Denver's MuleSoft Journey
City & County of Denver's MuleSoft JourneyCity & County of Denver's MuleSoft Journey
City & County of Denver's MuleSoft JourneySandeep Deshmukh
 

Similar a Google Cloud Certifications & Machine Learning (20)

Google Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification TipsGoogle Associate Cloud Engineer Certification Tips
Google Associate Cloud Engineer Certification Tips
 
Google cloud certification
Google cloud certificationGoogle cloud certification
Google cloud certification
 
MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDC
 
Kubernetes for machine learning
Kubernetes for machine learningKubernetes for machine learning
Kubernetes for machine learning
 
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送
[Cloud OnAir] Talks by DevRel Vol.4 データ管理とデータ ベース 2020年8月27日 放送
 
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...
Oracle EBS Journey to the Cloud - What is New in 2022 (UKOUG Breakthrough 22 ...
 
[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification
[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification
[Study Guide] Google Professional Cloud Architect (GCP-PCA) Certification
 
Platform Engineering
Platform EngineeringPlatform Engineering
Platform Engineering
 
Google cloud big data summit master gcp big data summit la - 10-20-2015
Google cloud big data summit   master gcp big data summit la - 10-20-2015Google cloud big data summit   master gcp big data summit la - 10-20-2015
Google cloud big data summit master gcp big data summit la - 10-20-2015
 
DevOps Days Rockies MLOps
DevOps Days Rockies MLOpsDevOps Days Rockies MLOps
DevOps Days Rockies MLOps
 
Cloud Manthn Software Solutions Pvt Ltd - What we do ?
Cloud Manthn Software Solutions Pvt Ltd - What we do ?Cloud Manthn Software Solutions Pvt Ltd - What we do ?
Cloud Manthn Software Solutions Pvt Ltd - What we do ?
 
Rsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AIRsqrd AI: From R&D to ROI of AI
Rsqrd AI: From R&D to ROI of AI
 
Multiplier Effect: Case Studies in Distributions for Publishers
Multiplier Effect: Case Studies in Distributions for PublishersMultiplier Effect: Case Studies in Distributions for Publishers
Multiplier Effect: Case Studies in Distributions for Publishers
 
SRE & Kubernetes
SRE & KubernetesSRE & Kubernetes
SRE & Kubernetes
 
SaaS startups - Software Engineering Challenges
SaaS startups - Software Engineering ChallengesSaaS startups - Software Engineering Challenges
SaaS startups - Software Engineering Challenges
 
Implementing and Extending Oracle PLM Cloud for Gibson Overseas
Implementing and Extending Oracle PLM Cloud for Gibson OverseasImplementing and Extending Oracle PLM Cloud for Gibson Overseas
Implementing and Extending Oracle PLM Cloud for Gibson Overseas
 
Columbus AWS Meetup: AWS Certifications
Columbus AWS Meetup: AWS CertificationsColumbus AWS Meetup: AWS Certifications
Columbus AWS Meetup: AWS Certifications
 
Accelerating Digital Transformation: It's About Digital Enablement
Accelerating Digital Transformation:  It's About Digital EnablementAccelerating Digital Transformation:  It's About Digital Enablement
Accelerating Digital Transformation: It's About Digital Enablement
 
Sunrun slide for informatica summit - Harish Ramachandraiah
Sunrun slide for informatica summit - Harish RamachandraiahSunrun slide for informatica summit - Harish Ramachandraiah
Sunrun slide for informatica summit - Harish Ramachandraiah
 
City & County of Denver's MuleSoft Journey
City & County of Denver's MuleSoft JourneyCity & County of Denver's MuleSoft Journey
City & County of Denver's MuleSoft Journey
 

Más de Dan Sullivan, Ph.D.

How to Design a Modern Data Warehouse in BigQuery
How to Design a Modern Data Warehouse in BigQueryHow to Design a Modern Data Warehouse in BigQuery
How to Design a Modern Data Warehouse in BigQueryDan Sullivan, Ph.D.
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?Dan Sullivan, Ph.D.
 
Getting Started with BigQuery ML
Getting Started with BigQuery MLGetting Started with BigQuery ML
Getting Started with BigQuery MLDan Sullivan, Ph.D.
 
Unstructured text to structured data
Unstructured text to structured dataUnstructured text to structured data
Unstructured text to structured dataDan Sullivan, Ph.D.
 
A first look at tf idf-pdx data science meetup
A first look at tf idf-pdx data science meetupA first look at tf idf-pdx data science meetup
A first look at tf idf-pdx data science meetupDan Sullivan, Ph.D.
 
ACID vs BASE in NoSQL: Another False Dichotomy
ACID vs BASE in NoSQL: Another False DichotomyACID vs BASE in NoSQL: Another False Dichotomy
ACID vs BASE in NoSQL: Another False DichotomyDan Sullivan, Ph.D.
 
Big data, bioscience and the cloud biocatalyst june 2015 sullivan
Big data, bioscience and the cloud   biocatalyst june 2015 sullivanBig data, bioscience and the cloud   biocatalyst june 2015 sullivan
Big data, bioscience and the cloud biocatalyst june 2015 sullivanDan Sullivan, Ph.D.
 
Tools and Techniques for Analyzing Texts: Tweets to Intellectual Property
Tools and Techniques for Analyzing Texts: Tweets to Intellectual PropertyTools and Techniques for Analyzing Texts: Tweets to Intellectual Property
Tools and Techniques for Analyzing Texts: Tweets to Intellectual PropertyDan Sullivan, Ph.D.
 
Modeling with Document Database: 5 Key Patterns
Modeling with Document Database: 5 Key PatternsModeling with Document Database: 5 Key Patterns
Modeling with Document Database: 5 Key PatternsDan Sullivan, Ph.D.
 
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2Dan Sullivan, Ph.D.
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesDan Sullivan, Ph.D.
 
Limits of RDBMS and Need for NoSQL in Bioinformatics
Limits of RDBMS and Need for NoSQL in BioinformaticsLimits of RDBMS and Need for NoSQL in Bioinformatics
Limits of RDBMS and Need for NoSQL in BioinformaticsDan Sullivan, Ph.D.
 

Más de Dan Sullivan, Ph.D. (13)

How to Design a Modern Data Warehouse in BigQuery
How to Design a Modern Data Warehouse in BigQueryHow to Design a Modern Data Warehouse in BigQuery
How to Design a Modern Data Warehouse in BigQuery
 
With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?With Automated ML, is Everyone an ML Engineer?
With Automated ML, is Everyone an ML Engineer?
 
Getting Started with BigQuery ML
Getting Started with BigQuery MLGetting Started with BigQuery ML
Getting Started with BigQuery ML
 
Unstructured text to structured data
Unstructured text to structured dataUnstructured text to structured data
Unstructured text to structured data
 
A first look at tf idf-pdx data science meetup
A first look at tf idf-pdx data science meetupA first look at tf idf-pdx data science meetup
A first look at tf idf-pdx data science meetup
 
Text mining meets neural nets
Text mining meets neural netsText mining meets neural nets
Text mining meets neural nets
 
ACID vs BASE in NoSQL: Another False Dichotomy
ACID vs BASE in NoSQL: Another False DichotomyACID vs BASE in NoSQL: Another False Dichotomy
ACID vs BASE in NoSQL: Another False Dichotomy
 
Big data, bioscience and the cloud biocatalyst june 2015 sullivan
Big data, bioscience and the cloud   biocatalyst june 2015 sullivanBig data, bioscience and the cloud   biocatalyst june 2015 sullivan
Big data, bioscience and the cloud biocatalyst june 2015 sullivan
 
Tools and Techniques for Analyzing Texts: Tweets to Intellectual Property
Tools and Techniques for Analyzing Texts: Tweets to Intellectual PropertyTools and Techniques for Analyzing Texts: Tweets to Intellectual Property
Tools and Techniques for Analyzing Texts: Tweets to Intellectual Property
 
Modeling with Document Database: 5 Key Patterns
Modeling with Document Database: 5 Key PatternsModeling with Document Database: 5 Key Patterns
Modeling with Document Database: 5 Key Patterns
 
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2
Sullivan GBCB Seminar Fall 2014 - Limits of RDMS for Bioinformatics v2
 
Text Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious DiseasesText Mining for Biocuration of Bacterial Infectious Diseases
Text Mining for Biocuration of Bacterial Infectious Diseases
 
Limits of RDBMS and Need for NoSQL in Bioinformatics
Limits of RDBMS and Need for NoSQL in BioinformaticsLimits of RDBMS and Need for NoSQL in Bioinformatics
Limits of RDBMS and Need for NoSQL in Bioinformatics
 

Último

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 

Último (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Google Cloud Certifications & Machine Learning

  • 1. Google Cloud Certifications & Machine Learning TensorFlow-NorthWest Meetup January 29. 2020
  • 2.
  • 3.
  • 4. GCP Reference ML Architecture
  • 5.
  • 6.
  • 7. What’s Needed to Deploy ML to Production ● ML centric ○ Data collection ○ Data transformation ○ Feature engineering ○ Model building and evaluation ● Production centric ○ Data storage ○ Scalable transformation processes ○ Workflows ○ Integration and deployment ○ Monitoring ○ Security and Compliance
  • 9. Specifically, what do we need to know?
  • 12. Customer Implementation Technical requirements Business requirements Technical requirements Customer Implementation Professional Certification Scope of GCP Certifications Associate Cloud Engineer Professional Level Certification Technical requirements Business requirements Design Plan Ensure Feasibility Optimize Build Deploy Manage Customer Implementation Build Deploy Manage Technical requirements Build Deploy Manage
  • 13.
  • 14. Who Benefits? ● Practitioners - salaries 15% more than non-certified (IDC/Microsoft ) ● Organizations - new hire productivity, hire advancement, streamlined hiring ● Google - supports efforts to expand market share by creating a pool of knowledgeable professionals
  • 15. Google Cloud Professional Data Engineer Exam ○ Designing data processing and storage systems ○ Migrating data warehouse ○ Operationalizing storage, processing infrastructure, and pipeline ○ Operationalizing ML models ○ Pre-built models ○ ML architecture e.g. edge computing ○ Security and compliance ○ Scalability and portability
  • 16. Designing Data Processing & Storage Systems ● Data modeling ● Latency, throughput, transactions ● Fault tolerance ● Distributed systems ● Batch and stream processing ● Job automation and orchestration ● Event processing
  • 17. Operationalizing Processing and Storage ● Storage costs and performance ● Data cleansing ● Data lifecycle management ● Provisioning resources ● Monitoring and adjusting pipelines ● Effective use of managed services
  • 18. Operationalizing Machine Learning Models ● Pre-built ML models as a service ● Ingesting data ● Training machine learning models ● Training and serving infrastructure ● Hardware accelerators ● ML terminology
  • 19. Options for Operational ML ● Compute Engine ○ GPU or TPUs ○ Deep Learning VM ○ C2-standard-60 60 vCPUs 240GB 257TB ● Kubernetes Engine ○ Supports GPUs and TPUs ○ Containers with TF, PyTorch, and R ○ Job and deployment APIs ● AI Platform ○ Serverless option ○ Train, evaluate, tune models ○ TensorFlow, Scikit Learn, XGBoost
  • 20. Study Strategy ● Follow Certification Exam Guide ○ High level domains & detailed tasks ○ https://cloud.google.com/certification/guides/data-engineer/ ● Take Practice Exam ○ Good assessment but the actual test is more difficult ● Identify weakest areas ○ We often focus on some, not all domains in our work ○ You will be tested on all domains ● Perform tasks using Cloud Console and Cloud Shell
  • 21. Exam Taking Strategy ● Timed test, know your remaining time ○ 50 multiple choice questions ○ 2 hours ○ Mark questions for review ● Read questions carefully ○ Identify key services and software ○ Identify technical requirements ● Focus on how to choose between likely options or near misses
  • 22. Services to Know in Depth
  • 23. How to Choose Between ● Cloud SQL ● Cloud Spanner ● Cloud Firestore ● Cloud Bigtable ● BiqQuery ● Cloud Storage
  • 26. Final Thoughts ... ● Certifications help define scope of knowledge needed for a particular role ● They are a measure of competence, not expertise ● Continuous learning What you know What you think you know Circle of Competence
  • 27. Image Sources Slide 2” https://blog.jupyter.org/jupyterlab-is-ready-for-users-5a6f039b8906. Slide 3: https://commons.wikimedia.org/wiki/File:Proto-Labs-Manufacturing-Injection-Molding.jpg Slide 4: https://cloud.google.com/blog/products/ai-machine-learning/solution-implementing-industrial-predictive-maintenance-part-iii Slide 8: https://www.7wdata.be/data-science/whats-the-difference-between-a-data-engineer-a-data-analyst-and-a-data-scientist/ Slide 10: https://irecusa.org/wp-content/uploads/2013/06/Task-Analysis-word-cloud.jpg Slide 11 : https://irecusa.org/2013/06/job-task-analyses-at-the-very-heart-of-credentialing/ Slide 13 https://miro.medium.com/max/2048/0*zb4dcNV0gFVgFTJm.png

Notas del editor

  1. Brian
  2. You’ll need to know how to choose a storage system based on the structure of the data, the volume of data, latency requirements, and query patterns. In genearl, BigQuery is the go to servcie for data warehousing and in some cases ML. Bigtable is well suited to high volume, low latency use cases, like IoT. Cloud Firestore is a document database and a managed service subsitiute for MongoDB.