SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
AWS Cloudwatch
Felipe Almeida
Rio Cloud Computing Meetup October / 2016
Monitoring your AWS Services with Metrics
and Alarms
Structure
● Intro
● Usage
● Metrics - Builtin
● Metrics - Custom
● Alarms
● Events
● Logs
● Third-party services
● Keep in mind
2
Intro
● CloudWatch is the AWS metric collection service
● Metrics are individual measurements of some quantity of interest,
at a given point in time:
Examples:
● At 23:45:29 GMT, on 23 February 2016, CPU usage for
instance XPTO was 56%
3
Intro
● Nearly all AWS services can be monitored using CloudWatch
● Metrics provide visibility into your applications
● They enable you to make informed decisions such as
○ Whether to downscale a machine that’s not being used too
much (save $$)
○ Whether to scale up a machine that’s close to full capacity
(avoid failures)
○ Decide what caused an application to fail (debug)
4
Usage
● You can use CloudWatch in multiple ways:
○ Using the AWS Console
○ Via other services that have monitoring functionality
○ Using the CloudWatch API through the AWS SDK
5
Usage - Via other services
● Via other services
○ Nearly all services have a monitoring or metrics tab
6EMR
Usage - Via other services
7Elastic Beanstalk
Usage - Via other services
8DynamoDB Table
Usage - Via other services
● When you open the monitoring tab for a service, what you’re
looking at are actually CloudWatch Metrics:
9
Monitoring Tab for an
Elastic Beanstalk
Environment uses
CloudWatch Metrics
Usage - Console
● Via cloudwatch itself (on the console)
10
Usage - Console
● You can build a Dashboard with metrics you choose
11
Usage - API
● CloudWatch (like most AWS Services) can also be used
programatically
○ I.e. it has an API that you can use via the AWS SDK (library for
Java, .NET, Python, etc)
12
Metrics - Builtin
● Services usually have a set of builtin metrics
○ These metrics are collected by default
13
Metrics - Builtin
● For example, the following metrics (not exhaustive list) are
collected by default for EC2 Instances:
○ CPUUtilization (percent)
○ DiskReadBytes (bytes)
○ DiskWriteBytes (bytes)
○ NetworkIn (bytes)
○ NetworkOut (bytes)
14
Metrics - Custom
● You can also use CloudWatch for your own metrics.
● You can then do anything you can for regular (builtin) metrics,
such as:
○ View them on the AWS console
○ Create Dashboards for them
○ Set up alerts
15
Metrics - Custom
● For some types of custom metrics, there are community-provided
scripts to help you publish them to CloudWatch, such as:
○ Memory utilization for EC2 Instances
○ Swap Utilization for EC2 Instances
○ Disk Space Usage for EC2 Instances
● These metrics are controlled by the O.S. and therefore cannot be
accessed automatically by AWS.
16
Alarms
● You can also create alarms for any individual metric on
CloudWatch, which are triggered if the criteria you have defined are
met
● For example, you can have AWS send you an e-mail if
FreeStorageSpace metric for a ElasticSearch cluster you have
becomes lower than 10GB.
17
Alarms
● In addition to notifying you when something happens, you can also
configure AutoScaling Actions to be taken if some criteria are
met:
○ For example, if the CPUUtilization metric reaches 90% for
all machines in a AutoScaling Group, add extra instances to
that environment
18
Events
● CloudWatch Events are a new addition in CloudWatch
● They enable you to execute custom actions in response to State
Changes such as
○ An EC2 instance has been started
○ An EC2 instance has been terminated
● Custom actions include running a Lambda Function, publishing the
event to Kinesis, etc.
19
Logs
● CloudWatch Logs refers to Agents you can install on your
instances to have them send application logs to CloudWatch
● You can filter your logs and set up Alarms when certain criteria are
met, such as:
○ There have been more than 10 Errors in Apache in the last
minute
○ There have been 5 log messages matching
“IllegalArgumentException” in the last hour
20
Related Services (Third-Party)
● There are tons of third-party services that build on top of or
otherwise leverage CloudWatch metrics, such as:
○ SignalFX (extends and sends custom metrics to CloudWatch)
○ AppDynamics (sends all sorts of low-level and
application-level metrics to CloudWatch and displays them in
an unified manner; also provides event correlation)
○ NewRelic (sends data to CLoudWatch)
○ Grafana (uses CloudWatch as a Data source)
○ Logstash (you can use CloudWatch as an output)
○ Nagios (consumes CloudWatch metrics)
21
Keep in mind
● Metrics last 2 weeks by default
● Metrics are per-region
● Some services (e.g. Elastic BeanStalk) have extra metrics that
need to be explicitly enabled before they can be used.
● CloudWatch can also be used to monitor AWS Costs (Billing), even
though it’s not a service per se
22
Keep in mind
● In order to publish custom Amazon CloudWatch metrics, the
instances in your environment need permission to use
CloudWatch (see your instance profile for more information).
● CloudWatch is not free so you may want to track only relevant
metrics
○ In addition, if you use CloudWatch via the AWS SDK, try to
collect high-level metrics to keep cost down
■ E.g. track BeanStalk metrics rather than metrics for
individual instances
23
Links
● Monitoring Scripts for Linux EC2 Instances
● All Builtin Metrics for all supported AWS services
● AWS CloudWatch Pricing
24

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

AWS 101
AWS 101AWS 101
AWS 101
 
AWS SQS SNS
AWS SQS SNSAWS SQS SNS
AWS SQS SNS
 
Top 10 AWS Identity and Access Management (IAM) Best Practices (SEC301) | AWS...
Top 10 AWS Identity and Access Management (IAM) Best Practices (SEC301) | AWS...Top 10 AWS Identity and Access Management (IAM) Best Practices (SEC301) | AWS...
Top 10 AWS Identity and Access Management (IAM) Best Practices (SEC301) | AWS...
 
Introduction to EC2
Introduction to EC2Introduction to EC2
Introduction to EC2
 
AWS Webcast - Introduction to EBS
AWS Webcast - Introduction to EBS AWS Webcast - Introduction to EBS
AWS Webcast - Introduction to EBS
 
Introduction to AWS Cost Management
Introduction to AWS Cost ManagementIntroduction to AWS Cost Management
Introduction to AWS Cost Management
 
AWS Storage services
AWS Storage servicesAWS Storage services
AWS Storage services
 
AWS Cloud trail
AWS Cloud trailAWS Cloud trail
AWS Cloud trail
 
Amazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for KubernetesAmazon EKS - Elastic Container Service for Kubernetes
Amazon EKS - Elastic Container Service for Kubernetes
 
Amazon CloudWatch Tutorial | AWS Certification | Cloud Monitoring Tools | AWS...
Amazon CloudWatch Tutorial | AWS Certification | Cloud Monitoring Tools | AWS...Amazon CloudWatch Tutorial | AWS Certification | Cloud Monitoring Tools | AWS...
Amazon CloudWatch Tutorial | AWS Certification | Cloud Monitoring Tools | AWS...
 
AWS Security Hub
AWS Security HubAWS Security Hub
AWS Security Hub
 
AWS core services
AWS core servicesAWS core services
AWS core services
 
Intro to Amazon ECS
Intro to Amazon ECSIntro to Amazon ECS
Intro to Amazon ECS
 
Aws VPC
Aws VPCAws VPC
Aws VPC
 
AWS for Backup and Recovery
AWS for Backup and RecoveryAWS for Backup and Recovery
AWS for Backup and Recovery
 
Introduction to Amazon Relational Database Service
Introduction to Amazon Relational Database ServiceIntroduction to Amazon Relational Database Service
Introduction to Amazon Relational Database Service
 
Intro to Amazon S3
Intro to Amazon S3Intro to Amazon S3
Intro to Amazon S3
 
AWS 기반의 마이크로 서비스 아키텍쳐 구현 방안 :: 김필중 :: AWS Summit Seoul 20
AWS 기반의 마이크로 서비스 아키텍쳐 구현 방안 :: 김필중 :: AWS Summit Seoul 20AWS 기반의 마이크로 서비스 아키텍쳐 구현 방안 :: 김필중 :: AWS Summit Seoul 20
AWS 기반의 마이크로 서비스 아키텍쳐 구현 방안 :: 김필중 :: AWS Summit Seoul 20
 
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
사례로 알아보는 Database Migration Service : 데이터베이스 및 데이터 이관, 통합, 분리, 분석의 도구 - 발표자: ...
 
Amazon Virtual Private Cloud (VPC): Networking Fundamentals and Connectivity ...
Amazon Virtual Private Cloud (VPC): Networking Fundamentals and Connectivity ...Amazon Virtual Private Cloud (VPC): Networking Fundamentals and Connectivity ...
Amazon Virtual Private Cloud (VPC): Networking Fundamentals and Connectivity ...
 

Destacado

SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
Amazon Web Services
 

Destacado (20)

Cloudwatch: Monitoring your Services with Metrics and Alarms
Cloudwatch: Monitoring your Services with Metrics and AlarmsCloudwatch: Monitoring your Services with Metrics and Alarms
Cloudwatch: Monitoring your Services with Metrics and Alarms
 
Using AWS CloudWatch Custom Metrics and EC2 Auto Scaling -VSocial Infrastructure
Using AWS CloudWatch Custom Metrics and EC2 Auto Scaling -VSocial InfrastructureUsing AWS CloudWatch Custom Metrics and EC2 Auto Scaling -VSocial Infrastructure
Using AWS CloudWatch Custom Metrics and EC2 Auto Scaling -VSocial Infrastructure
 
Deployment and Management on AWS:
 A Deep Dive on Options and Tools
Deployment and Management on AWS:
 A Deep Dive on Options and ToolsDeployment and Management on AWS:
 A Deep Dive on Options and Tools
Deployment and Management on AWS:
 A Deep Dive on Options and Tools
 
Transparency and Control with AWS CloudTrail and AWS Config
Transparency and Control with AWS CloudTrail and AWS ConfigTransparency and Control with AWS CloudTrail and AWS Config
Transparency and Control with AWS CloudTrail and AWS Config
 
Revolutionising Cloud Operations with AWS Config, AWS CloudTrail and AWS Clou...
Revolutionising Cloud Operations with AWS Config, AWS CloudTrail and AWS Clou...Revolutionising Cloud Operations with AWS Config, AWS CloudTrail and AWS Clou...
Revolutionising Cloud Operations with AWS Config, AWS CloudTrail and AWS Clou...
 
AWS re:Invent 2016: 5 Security Automation Improvements You Can Make by Using ...
AWS re:Invent 2016: 5 Security Automation Improvements You Can Make by Using ...AWS re:Invent 2016: 5 Security Automation Improvements You Can Make by Using ...
AWS re:Invent 2016: 5 Security Automation Improvements You Can Make by Using ...
 
Cloud watchingwithcloudwatch
Cloud watchingwithcloudwatchCloud watchingwithcloudwatch
Cloud watchingwithcloudwatch
 
Cloudreach Voices AWS CloudWatch and Smart Monitoring
Cloudreach Voices AWS CloudWatch and Smart MonitoringCloudreach Voices AWS CloudWatch and Smart Monitoring
Cloudreach Voices AWS CloudWatch and Smart Monitoring
 
Tackling a 1 billion member social network
Tackling a 1 billion member social networkTackling a 1 billion member social network
Tackling a 1 billion member social network
 
Aws cost optimization: lessons learned, strategies, tips and tools
Aws cost optimization: lessons learned, strategies, tips and toolsAws cost optimization: lessons learned, strategies, tips and tools
Aws cost optimization: lessons learned, strategies, tips and tools
 
CloudWatch Custom Metrics
CloudWatch Custom MetricsCloudWatch Custom Metrics
CloudWatch Custom Metrics
 
Modern Monitoring - devops roadtrip
Modern Monitoring - devops roadtripModern Monitoring - devops roadtrip
Modern Monitoring - devops roadtrip
 
SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
SRG302 Archiving in the Cloud using Amazon Glacier - AWS re: Invent 2012
 
Site24x7 PHP Monitoring for DevOps
Site24x7 PHP Monitoring for DevOpsSite24x7 PHP Monitoring for DevOps
Site24x7 PHP Monitoring for DevOps
 
Cloud watch
Cloud watchCloud watch
Cloud watch
 
Manage Security & Compliance of Your AWS Account using CloudTrail
Manage Security & Compliance of Your AWS Account using CloudTrailManage Security & Compliance of Your AWS Account using CloudTrail
Manage Security & Compliance of Your AWS Account using CloudTrail
 
Monitoring and Alerting
Monitoring and AlertingMonitoring and Alerting
Monitoring and Alerting
 
RMG203 Cloud Infrastructure and Application Monitoring with Amazon CloudWatch...
RMG203 Cloud Infrastructure and Application Monitoring with Amazon CloudWatch...RMG203 Cloud Infrastructure and Application Monitoring with Amazon CloudWatch...
RMG203 Cloud Infrastructure and Application Monitoring with Amazon CloudWatch...
 
Monitoring 改造計畫:流程觀點
Monitoring 改造計畫:流程觀點Monitoring 改造計畫:流程觀點
Monitoring 改造計畫:流程觀點
 
Elasticsearch for Data Analytics
Elasticsearch for Data AnalyticsElasticsearch for Data Analytics
Elasticsearch for Data Analytics
 

Similar a Cloudwatch: Monitoring your AWS services with Metrics and Alarms

004 - Logging in the Cloud -- hide01.ir.pptx
004 - Logging in the Cloud  --  hide01.ir.pptx004 - Logging in the Cloud  --  hide01.ir.pptx
004 - Logging in the Cloud -- hide01.ir.pptx
nitinscribd
 

Similar a Cloudwatch: Monitoring your AWS services with Metrics and Alarms (20)

Application & Account Monitoring in AWS
Application & Account Monitoring in AWSApplication & Account Monitoring in AWS
Application & Account Monitoring in AWS
 
re:Invent re:Peat
re:Invent re:Peatre:Invent re:Peat
re:Invent re:Peat
 
004 - Logging in the Cloud -- hide01.ir.pptx
004 - Logging in the Cloud  --  hide01.ir.pptx004 - Logging in the Cloud  --  hide01.ir.pptx
004 - Logging in the Cloud -- hide01.ir.pptx
 
Build a custom metrics on aws cloud
Build a custom metrics on aws cloudBuild a custom metrics on aws cloud
Build a custom metrics on aws cloud
 
Kks sre book_ch10
Kks sre book_ch10Kks sre book_ch10
Kks sre book_ch10
 
Cloud Computing- Easy to use and Affordable
Cloud Computing- Easy to use and AffordableCloud Computing- Easy to use and Affordable
Cloud Computing- Easy to use and Affordable
 
Container world 2019 Canary Release
Container world 2019 Canary ReleaseContainer world 2019 Canary Release
Container world 2019 Canary Release
 
How to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data PlatformsHow to Develop and Operate Cloud First Data Platforms
How to Develop and Operate Cloud First Data Platforms
 
How to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsHow to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and Applications
 
ENT203 Monitoring and Autoscaling, a Match Made in Heaven
ENT203 Monitoring and Autoscaling, a Match Made in HeavenENT203 Monitoring and Autoscaling, a Match Made in Heaven
ENT203 Monitoring and Autoscaling, a Match Made in Heaven
 
Scaling Monitoring At Databricks From Prometheus to M3
Scaling Monitoring At Databricks From Prometheus to M3Scaling Monitoring At Databricks From Prometheus to M3
Scaling Monitoring At Databricks From Prometheus to M3
 
Amazon WorkSpaces-Virtual Desktops in Cloud
Amazon WorkSpaces-Virtual Desktops in CloudAmazon WorkSpaces-Virtual Desktops in Cloud
Amazon WorkSpaces-Virtual Desktops in Cloud
 
Ghost Environment
Ghost EnvironmentGhost Environment
Ghost Environment
 
Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'Avanttic tech dates - de la monitorización a la 'observabilidad'
Avanttic tech dates - de la monitorización a la 'observabilidad'
 
Webinar Monitoring in era of cloud computing
Webinar Monitoring in era of cloud computingWebinar Monitoring in era of cloud computing
Webinar Monitoring in era of cloud computing
 
Sprint 73
Sprint 73Sprint 73
Sprint 73
 
Adaptive Scaling of Microgateways on Kubernetes
Adaptive Scaling of Microgateways on KubernetesAdaptive Scaling of Microgateways on Kubernetes
Adaptive Scaling of Microgateways on Kubernetes
 
OSMC 2017 | Monitoring Challenges in a World of Automation by Anthony Goddard
OSMC 2017 | Monitoring Challenges in a World of Automation by Anthony GoddardOSMC 2017 | Monitoring Challenges in a World of Automation by Anthony Goddard
OSMC 2017 | Monitoring Challenges in a World of Automation by Anthony Goddard
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1
 
Citi Tech Talk: Monitoring and Performance
Citi Tech Talk: Monitoring and PerformanceCiti Tech Talk: Monitoring and Performance
Citi Tech Talk: Monitoring and Performance
 

Más de Felipe

Más de Felipe (16)

Aula rotulação automática - Automatic tagging
Aula rotulação automática - Automatic taggingAula rotulação automática - Automatic tagging
Aula rotulação automática - Automatic tagging
 
First steps with Keras 2: A tutorial with Examples
First steps with Keras 2: A tutorial with ExamplesFirst steps with Keras 2: A tutorial with Examples
First steps with Keras 2: A tutorial with Examples
 
Word embeddings introdução, motivação e exemplos
Word embeddings  introdução, motivação e exemplosWord embeddings  introdução, motivação e exemplos
Word embeddings introdução, motivação e exemplos
 
Cloud Certifications - Overview
Cloud Certifications - OverviewCloud Certifications - Overview
Cloud Certifications - Overview
 
Online Machine Learning: introduction and examples
Online Machine Learning:  introduction and examplesOnline Machine Learning:  introduction and examples
Online Machine Learning: introduction and examples
 
Exemplos de uso de apache spark usando aws elastic map reduce
Exemplos de uso de apache spark usando aws elastic map reduceExemplos de uso de apache spark usando aws elastic map reduce
Exemplos de uso de apache spark usando aws elastic map reduce
 
Pré processamento de grandes dados com Apache Spark
Pré processamento de grandes dados com Apache SparkPré processamento de grandes dados com Apache Spark
Pré processamento de grandes dados com Apache Spark
 
Hadoop MapReduce and Apache Spark on EMR: comparing performance for distribut...
Hadoop MapReduce and Apache Spark on EMR: comparing performance for distribut...Hadoop MapReduce and Apache Spark on EMR: comparing performance for distribut...
Hadoop MapReduce and Apache Spark on EMR: comparing performance for distribut...
 
Boas práticas no desenvolvimento de software
Boas práticas no desenvolvimento de softwareBoas práticas no desenvolvimento de software
Boas práticas no desenvolvimento de software
 
Rachinations
RachinationsRachinations
Rachinations
 
Ausgewählte preußische Tugenden
Ausgewählte preußische TugendenAusgewählte preußische Tugenden
Ausgewählte preußische Tugenden
 
Short intro to scala and the play framework
Short intro to scala and the play frameworkShort intro to scala and the play framework
Short intro to scala and the play framework
 
Conceitos e exemplos em versionamento de código
Conceitos e exemplos em versionamento de códigoConceitos e exemplos em versionamento de código
Conceitos e exemplos em versionamento de código
 
DevOps Series: Extending vagrant with Puppet for configuration management
DevOps Series: Extending vagrant with Puppet for configuration managementDevOps Series: Extending vagrant with Puppet for configuration management
DevOps Series: Extending vagrant with Puppet for configuration management
 
DevOps Series: Defining and Sharing Testable Machine Configurations with vagrant
DevOps Series: Defining and Sharing Testable Machine Configurations with vagrantDevOps Series: Defining and Sharing Testable Machine Configurations with vagrant
DevOps Series: Defining and Sharing Testable Machine Configurations with vagrant
 
D3.js 30-minute intro
D3.js   30-minute introD3.js   30-minute intro
D3.js 30-minute intro
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Cloudwatch: Monitoring your AWS services with Metrics and Alarms

  • 1. AWS Cloudwatch Felipe Almeida Rio Cloud Computing Meetup October / 2016 Monitoring your AWS Services with Metrics and Alarms
  • 2. Structure ● Intro ● Usage ● Metrics - Builtin ● Metrics - Custom ● Alarms ● Events ● Logs ● Third-party services ● Keep in mind 2
  • 3. Intro ● CloudWatch is the AWS metric collection service ● Metrics are individual measurements of some quantity of interest, at a given point in time: Examples: ● At 23:45:29 GMT, on 23 February 2016, CPU usage for instance XPTO was 56% 3
  • 4. Intro ● Nearly all AWS services can be monitored using CloudWatch ● Metrics provide visibility into your applications ● They enable you to make informed decisions such as ○ Whether to downscale a machine that’s not being used too much (save $$) ○ Whether to scale up a machine that’s close to full capacity (avoid failures) ○ Decide what caused an application to fail (debug) 4
  • 5. Usage ● You can use CloudWatch in multiple ways: ○ Using the AWS Console ○ Via other services that have monitoring functionality ○ Using the CloudWatch API through the AWS SDK 5
  • 6. Usage - Via other services ● Via other services ○ Nearly all services have a monitoring or metrics tab 6EMR
  • 7. Usage - Via other services 7Elastic Beanstalk
  • 8. Usage - Via other services 8DynamoDB Table
  • 9. Usage - Via other services ● When you open the monitoring tab for a service, what you’re looking at are actually CloudWatch Metrics: 9 Monitoring Tab for an Elastic Beanstalk Environment uses CloudWatch Metrics
  • 10. Usage - Console ● Via cloudwatch itself (on the console) 10
  • 11. Usage - Console ● You can build a Dashboard with metrics you choose 11
  • 12. Usage - API ● CloudWatch (like most AWS Services) can also be used programatically ○ I.e. it has an API that you can use via the AWS SDK (library for Java, .NET, Python, etc) 12
  • 13. Metrics - Builtin ● Services usually have a set of builtin metrics ○ These metrics are collected by default 13
  • 14. Metrics - Builtin ● For example, the following metrics (not exhaustive list) are collected by default for EC2 Instances: ○ CPUUtilization (percent) ○ DiskReadBytes (bytes) ○ DiskWriteBytes (bytes) ○ NetworkIn (bytes) ○ NetworkOut (bytes) 14
  • 15. Metrics - Custom ● You can also use CloudWatch for your own metrics. ● You can then do anything you can for regular (builtin) metrics, such as: ○ View them on the AWS console ○ Create Dashboards for them ○ Set up alerts 15
  • 16. Metrics - Custom ● For some types of custom metrics, there are community-provided scripts to help you publish them to CloudWatch, such as: ○ Memory utilization for EC2 Instances ○ Swap Utilization for EC2 Instances ○ Disk Space Usage for EC2 Instances ● These metrics are controlled by the O.S. and therefore cannot be accessed automatically by AWS. 16
  • 17. Alarms ● You can also create alarms for any individual metric on CloudWatch, which are triggered if the criteria you have defined are met ● For example, you can have AWS send you an e-mail if FreeStorageSpace metric for a ElasticSearch cluster you have becomes lower than 10GB. 17
  • 18. Alarms ● In addition to notifying you when something happens, you can also configure AutoScaling Actions to be taken if some criteria are met: ○ For example, if the CPUUtilization metric reaches 90% for all machines in a AutoScaling Group, add extra instances to that environment 18
  • 19. Events ● CloudWatch Events are a new addition in CloudWatch ● They enable you to execute custom actions in response to State Changes such as ○ An EC2 instance has been started ○ An EC2 instance has been terminated ● Custom actions include running a Lambda Function, publishing the event to Kinesis, etc. 19
  • 20. Logs ● CloudWatch Logs refers to Agents you can install on your instances to have them send application logs to CloudWatch ● You can filter your logs and set up Alarms when certain criteria are met, such as: ○ There have been more than 10 Errors in Apache in the last minute ○ There have been 5 log messages matching “IllegalArgumentException” in the last hour 20
  • 21. Related Services (Third-Party) ● There are tons of third-party services that build on top of or otherwise leverage CloudWatch metrics, such as: ○ SignalFX (extends and sends custom metrics to CloudWatch) ○ AppDynamics (sends all sorts of low-level and application-level metrics to CloudWatch and displays them in an unified manner; also provides event correlation) ○ NewRelic (sends data to CLoudWatch) ○ Grafana (uses CloudWatch as a Data source) ○ Logstash (you can use CloudWatch as an output) ○ Nagios (consumes CloudWatch metrics) 21
  • 22. Keep in mind ● Metrics last 2 weeks by default ● Metrics are per-region ● Some services (e.g. Elastic BeanStalk) have extra metrics that need to be explicitly enabled before they can be used. ● CloudWatch can also be used to monitor AWS Costs (Billing), even though it’s not a service per se 22
  • 23. Keep in mind ● In order to publish custom Amazon CloudWatch metrics, the instances in your environment need permission to use CloudWatch (see your instance profile for more information). ● CloudWatch is not free so you may want to track only relevant metrics ○ In addition, if you use CloudWatch via the AWS SDK, try to collect high-level metrics to keep cost down ■ E.g. track BeanStalk metrics rather than metrics for individual instances 23
  • 24. Links ● Monitoring Scripts for Linux EC2 Instances ● All Builtin Metrics for all supported AWS services ● AWS CloudWatch Pricing 24