SlideShare una empresa de Scribd logo
1 de 54
Descargar para leer sin conexión
Modern Radiology for
  Distributed Systems



  Dietrich Featherston
  @d2fn
Thursday, October 11, 12
This is a talk about
                           monitoring




Thursday, October 11, 12
But not just any kind of
                           monitoring

                           Non-invasive monitoring



Thursday, October 11, 12
non-invasive monitoring

                           measures taken to describe the
                           state of a system with minimal
                           changes to the system being
                           monitored




Thursday, October 11, 12
Insight


                           Radiographic
                             Imagery




                                          Invasiveness

Thursday, October 11, 12
preventative care

                           measures taken to prevent
                           diseases or injuries rather than
                           curing them or treating their
                           symptoms




Thursday, October 11, 12
Non-invasive monitoring
                           techniques focus primarily
                           on host-based metrics

                           Why is this a problem?



Thursday, October 11, 12
Because applications are
                                 distributed




Thursday, October 11, 12
Information emitted
                           about nodes in the network


                                     n                  Information emitted
                                                            about edges
                                                           in the network



                                                            n²
 Network size
Thursday, October 11, 12
We analyze cell-structure
                           because we can’t envision
                              the whole organism

                           We react to disease and
                           injury because we lack
                              preventative care


Thursday, October 11, 12
We lack preventative care for
                      applications because our
                      non-invasive monitoring
                      techniques are growing less
                      and less meaningful



Thursday, October 11, 12
Radiology is useful in
                           illuminating non-invasive
                           monitoring of distributed
                           systems



Thursday, October 11, 12
Thursday, October 11, 12
Thursday, October 11, 12
Thursday, October 11, 12
Context is
                           everything



Thursday, October 11, 12
How do we use
                             context?


Thursday, October 11, 12
!!!   Context




                  Your Big
                 Dumb Data
Thursday, October 11, 12
Human
                                         brain
                           Diagnoses
                                           +
                                       med school




     Radiographic
       Imagery
Thursday, October 11, 12
E.T.     Signal
                                  Processing




              VLA Output
Thursday, October 11, 12
Application Topology
                                         Signal Processing
                                         Expert Brain

                           Application
                            Behavior




                              Network
Thursday, October 11, 12
                               Data
dimensions (11)       measurements (8)
     epoch seconds         egress packets
     epoch minutes         egress octets
     epoch hours           ingress packets
     node id               ingress octets
     source ip             retransmits
     source port
                           errors
     dest ip
     dest port             app-rtt
     interface             handshake-rtt
     country
     network/asn


Thursday, October 11, 12
Case Study #1

                           GC-Death of a distributed
                           JVM application




Thursday, October 11, 12
Thursday, October 11, 12
Case Study #2

                Symptoms:
                - Latent Riak handoff
                - Cluster throughput bottoming out



Thursday, October 11, 12
Thursday, October 11, 12
busy_dist_port




Thursday, October 11, 12
+zdbbl 8192




Thursday, October 11, 12
Thursday, October 11, 12
Case Study #3

                           Bringing a dead riak node
                           back online




Thursday, October 11, 12
Thursday, October 11, 12
Thursday, October 11, 12
Thursday, October 11, 12
Case Study #4

                           Retransmits 10% of total
                           network throughput




Thursday, October 11, 12
Thursday, October 11, 12
var put: HttpPut = null
      try {
        // ... put data
      }
      catch {
        case e: Exception =>
          // ... handle exception
      }
      finally {
        if(put != null) {
          put.abort()
        }
      }


Thursday, October 11, 12
var put: HttpPut = null
      try {
        // ... put data
      }
      catch {
        case e: Exception =>
          // ... handle exception
      }
      finally {
        if(put != null) {
          put.abort()
        }
      }


Thursday, October 11, 12
Source: http://hc.apache.org/httpcomponents-client-ga/httpclient/apidocs/org/apache/http/client/methods/HttpRequestBase.html#abort()




  abort
  public void abort()
  Description copied from interface: HttpUriRequest
  Aborts execution of the request.




                                                               THANKS
Thursday, October 11, 12
129    public void abort() {
   130        ClientConnectionRequest localRequest;
   131        ConnectionReleaseTrigger localTrigger;
   132        
   133        this.abortLock.lock();
   134        try {
   135            if (this.aborted) {
   136                return;
   137            }            
   138            this.aborted = true;
   139            
   140            localRequest = connRequest;
   141            localTrigger = releaseTrigger;
   142        } finally {
   143            this.abortLock.unlock();
   144        }        
   145
   146        // Trigger the callbacks outside of the lock, to prevent
   147        // deadlocks in the scenario where the callbacks have
   148        // their own locks that may be used while calling
   149        // setReleaseTrigger or setConnectionRequest.
   150        if (localRequest != null) {
   151            localRequest.abortRequest();
   152        }
   153        if (localTrigger != null) {
   154            try {
   155                localTrigger.abortConnection();
   156            } catch (IOException ex) {
   157                // ignore
   158            }
   159        }
   160    }

Thursday, October 11, 12
Thursday, October 11, 12
augmented intelligence
                           precedes
                           artificial intelligence




Thursday, October 11, 12
95
18




     Wilhelm Röntgen
     discovers X-Rays
     First medical use of x-rays in
     human imaging takes place one
     month later




Thursday, October 11, 12
95

                             05
18

                           19


                           First English text on
                           chest radiography


     Wilhelm Röntgen
     discovers X-Rays
     First medical use of x-rays in
     human imaging takes place one
     month later




Thursday, October 11, 12
20
   95

                             05
18




                                                                    19
                           19


                           First English text on
                           chest radiography

                                            Society of Radiographers formed
     Wilhelm Röntgen
     discovers X-Rays
     First medical use of x-rays in
     human imaging takes place one
     month later




Thursday, October 11, 12
Recognition of radiology as
                           a formal medical discipline
                           was a cultural problem, not
                              a technology problem



                                    http://www.bshr.org.uk/page13.html
Thursday, October 11, 12
If you want to talk to me about the
        query language used to ask questions
        of the network data we collect at
        Boundary talk to me after or hit me up
        on twitter.


                     @d2fn
                     github.com/dietrichf

Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12
Find 45 minutes get volume_1s_meter_ip [
                     meter in {1, 2, 226, 301};
  of total traffic   epochMillis from -18h for 45m;
 seen on meters categorize
                   ]

 1, 2, 226, & 301 sum(ingress) as ingress,
                     sum(egress) as egress,
starting 18 hours sum(ingressPackets +
    ago broken            egressPackets) as packets,
                     sum(retransmits) as retransmits,
 down by peer ip bymean(appRttUsec/1000) as appRttMs
 retain top 10 by epochMillis, ip
                   retain
    the ratio of     top 10
  retransmits to on retransmits/packets
                     per epochMillis

      packets
Thursday, October 11, 12

Más contenido relacionado

Destacado (7)

Cad cam
Cad camCad cam
Cad cam
 
Invokedynamic in 45 Minutes
Invokedynamic in 45 MinutesInvokedynamic in 45 Minutes
Invokedynamic in 45 Minutes
 
PDM - Precedence Diagram Method
PDM - Precedence Diagram MethodPDM - Precedence Diagram Method
PDM - Precedence Diagram Method
 
Solid modelling cg
Solid modelling cgSolid modelling cg
Solid modelling cg
 
Introduction to solid modeling
Introduction to solid modelingIntroduction to solid modeling
Introduction to solid modeling
 
Netflix Architecture Tutorial at Gluecon
Netflix Architecture Tutorial at GlueconNetflix Architecture Tutorial at Gluecon
Netflix Architecture Tutorial at Gluecon
 
Solid modeling
Solid modelingSolid modeling
Solid modeling
 

Último

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Distributed systems-radiology

  • 1. Modern Radiology for Distributed Systems Dietrich Featherston @d2fn Thursday, October 11, 12
  • 2. This is a talk about monitoring Thursday, October 11, 12
  • 3. But not just any kind of monitoring Non-invasive monitoring Thursday, October 11, 12
  • 4. non-invasive monitoring measures taken to describe the state of a system with minimal changes to the system being monitored Thursday, October 11, 12
  • 5. Insight Radiographic Imagery Invasiveness Thursday, October 11, 12
  • 6. preventative care measures taken to prevent diseases or injuries rather than curing them or treating their symptoms Thursday, October 11, 12
  • 7. Non-invasive monitoring techniques focus primarily on host-based metrics Why is this a problem? Thursday, October 11, 12
  • 8. Because applications are distributed Thursday, October 11, 12
  • 9. Information emitted about nodes in the network n Information emitted about edges in the network n² Network size Thursday, October 11, 12
  • 10. We analyze cell-structure because we can’t envision the whole organism We react to disease and injury because we lack preventative care Thursday, October 11, 12
  • 11. We lack preventative care for applications because our non-invasive monitoring techniques are growing less and less meaningful Thursday, October 11, 12
  • 12. Radiology is useful in illuminating non-invasive monitoring of distributed systems Thursday, October 11, 12
  • 16. Context is everything Thursday, October 11, 12
  • 17. How do we use context? Thursday, October 11, 12
  • 18. !!! Context Your Big Dumb Data Thursday, October 11, 12
  • 19. Human brain Diagnoses + med school Radiographic Imagery Thursday, October 11, 12
  • 20. E.T. Signal Processing VLA Output Thursday, October 11, 12
  • 21. Application Topology Signal Processing Expert Brain Application Behavior Network Thursday, October 11, 12 Data
  • 22. dimensions (11) measurements (8) epoch seconds egress packets epoch minutes egress octets epoch hours ingress packets node id ingress octets source ip retransmits source port errors dest ip dest port app-rtt interface handshake-rtt country network/asn Thursday, October 11, 12
  • 23. Case Study #1 GC-Death of a distributed JVM application Thursday, October 11, 12
  • 25. Case Study #2 Symptoms: - Latent Riak handoff - Cluster throughput bottoming out Thursday, October 11, 12
  • 30. Case Study #3 Bringing a dead riak node back online Thursday, October 11, 12
  • 34. Case Study #4 Retransmits 10% of total network throughput Thursday, October 11, 12
  • 36. var put: HttpPut = null try {   // ... put data } catch {   case e: Exception =>     // ... handle exception } finally {   if(put != null) {     put.abort()   } } Thursday, October 11, 12
  • 37. var put: HttpPut = null try {   // ... put data } catch {   case e: Exception =>     // ... handle exception } finally {   if(put != null) {     put.abort()   } } Thursday, October 11, 12
  • 38. Source: http://hc.apache.org/httpcomponents-client-ga/httpclient/apidocs/org/apache/http/client/methods/HttpRequestBase.html#abort() abort public void abort() Description copied from interface: HttpUriRequest Aborts execution of the request. THANKS Thursday, October 11, 12
  • 39. 129    public void abort() { 130        ClientConnectionRequest localRequest; 131        ConnectionReleaseTrigger localTrigger; 132         133        this.abortLock.lock(); 134        try { 135            if (this.aborted) { 136                return; 137            }             138            this.aborted = true; 139             140            localRequest = connRequest; 141            localTrigger = releaseTrigger; 142        } finally { 143            this.abortLock.unlock(); 144        }         145 146        // Trigger the callbacks outside of the lock, to prevent 147        // deadlocks in the scenario where the callbacks have 148        // their own locks that may be used while calling 149        // setReleaseTrigger or setConnectionRequest. 150        if (localRequest != null) { 151            localRequest.abortRequest(); 152        } 153        if (localTrigger != null) { 154            try { 155                localTrigger.abortConnection(); 156            } catch (IOException ex) { 157                // ignore 158            } 159        } 160    } Thursday, October 11, 12
  • 41. augmented intelligence precedes artificial intelligence Thursday, October 11, 12
  • 42. 95 18 Wilhelm Röntgen discovers X-Rays First medical use of x-rays in human imaging takes place one month later Thursday, October 11, 12
  • 43. 95 05 18 19 First English text on chest radiography Wilhelm Röntgen discovers X-Rays First medical use of x-rays in human imaging takes place one month later Thursday, October 11, 12
  • 44. 20 95 05 18 19 19 First English text on chest radiography Society of Radiographers formed Wilhelm Röntgen discovers X-Rays First medical use of x-rays in human imaging takes place one month later Thursday, October 11, 12
  • 45. Recognition of radiology as a formal medical discipline was a cultural problem, not a technology problem http://www.bshr.org.uk/page13.html Thursday, October 11, 12
  • 46. If you want to talk to me about the query language used to ask questions of the network data we collect at Boundary talk to me after or hit me up on twitter. @d2fn github.com/dietrichf Thursday, October 11, 12
  • 47. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 48. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 49. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 50. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 51. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 52. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 53. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12
  • 54. Find 45 minutes get volume_1s_meter_ip [ meter in {1, 2, 226, 301}; of total traffic epochMillis from -18h for 45m; seen on meters categorize ] 1, 2, 226, & 301 sum(ingress) as ingress, sum(egress) as egress, starting 18 hours sum(ingressPackets + ago broken egressPackets) as packets, sum(retransmits) as retransmits, down by peer ip bymean(appRttUsec/1000) as appRttMs retain top 10 by epochMillis, ip retain the ratio of top 10 retransmits to on retransmits/packets per epochMillis packets Thursday, October 11, 12