SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
                 20-APR-2012
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

                                     2
Outline

●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   3
                               (c) with Histograms, 3 - Conclusions
Basic Information
Execution Environments
●   Personal Laptop
    ●   Ubuntu 11.10, 64-bit
    ●   Intel Quad Core i5
    ●   4GB RAM


●   Boada Server
    ●   Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
    ●   12 Cores with HT support
    ●   24 GΒ RAM
     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   4
                                (c) with Histograms, 3 - Conclusions
Basic Information
NAS Parallel Benchmark
●   Evaluate the performance of parallel supercomputers

●   Several Applications                                   MG – MPI Version
    ●    IS, EP, CG, MG                                   Multi-Grid on a sequence
    ●    FT, BT, SP, LU                                              of meshes



●   Extrae → Produce traces
●   Paraver → Analyse traces
        1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   5
                                   (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   6
                               (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   7
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   8
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   9
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Initialization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   10
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Execution




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   11
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Finalization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   12
                             (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   13
                                (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Instructions




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   14
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Cycles




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   15
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
IPC: Instructions Per Cycle




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   16
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
L1 Cache Misses




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   17
                              (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   18
                              (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Time
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   19
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   20
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   21
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   22
                            (c) with Histograms, 3 - Conclusions
Conclusions
●   Scalability
    ●   In laptop: No way!
    ●   In Boada: Yes!

●   #Processors Increase
        → L1 Cache Misses Increase


●   Useful information very fast → Histograms!

     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   23
                               (c) with Histograms, 3 - Conclusions
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
               20-APR-2012
                                           24

Más contenido relacionado

Destacado

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...Maria Stylianou
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your SecretsMaria Stylianou
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based SchedulingMaria Stylianou
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareMaria Stylianou
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Maria Stylianou
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBconfluent
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformconfluent
 

Destacado (10)

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your Secrets
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
Erlang in 10 minutes
Erlang in 10 minutesErlang in 10 minutes
Erlang in 10 minutes
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based Scheduling
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
 
Google's Dremel
Google's DremelGoogle's Dremel
Google's Dremel
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 

Similar a Instrumenting the MG applicaiton of NAS Parallel Benchmark

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRAssuser58d6dc2
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmajayrampelli
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...South Tyrol Free Software Conference
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013Elsa von Licy
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Salvatore La Bua
 

Similar a Instrumenting the MG applicaiton of NAS Parallel Benchmark (8)

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
SRA final project
SRA final projectSRA final project
SRA final project
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRA
 
Dongliang_Slides
Dongliang_SlidesDongliang_Slides
Dongliang_Slides
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...
 

Último

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 

Último (20)

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Instrumenting the MG applicaiton of NAS Parallel Benchmark

  • 1. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012
  • 2. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 2
  • 3. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 3 (c) with Histograms, 3 - Conclusions
  • 4. Basic Information Execution Environments ● Personal Laptop ● Ubuntu 11.10, 64-bit ● Intel Quad Core i5 ● 4GB RAM ● Boada Server ● Intel(R) Xeon(R) CPU E5645 @ 2.40GHz ● 12 Cores with HT support ● 24 GΒ RAM 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 4 (c) with Histograms, 3 - Conclusions
  • 5. Basic Information NAS Parallel Benchmark ● Evaluate the performance of parallel supercomputers ● Several Applications MG – MPI Version ● IS, EP, CG, MG Multi-Grid on a sequence ● FT, BT, SP, LU of meshes ● Extrae → Produce traces ● Paraver → Analyse traces 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 5 (c) with Histograms, 3 - Conclusions
  • 6. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 6 (c) with Histograms, 3 - Conclusions
  • 7. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 7 (c) with Histograms, 3 - Conclusions
  • 8. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 8 (c) with Histograms, 3 - Conclusions
  • 9. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 9 (c) with Histograms, 3 - Conclusions
  • 10. Instrumentation by Observation Initialization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 10 (c) with Histograms, 3 - Conclusions
  • 11. Instrumentation by Observation Execution 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 11 (c) with Histograms, 3 - Conclusions
  • 12. Instrumentation by Observation Finalization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 12 (c) with Histograms, 3 - Conclusions
  • 13. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 13 (c) with Histograms, 3 - Conclusions
  • 14. Instrumentation using Performance Counters Instructions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 14 (c) with Histograms, 3 - Conclusions
  • 15. Instrumentation using Performance Counters Cycles 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 15 (c) with Histograms, 3 - Conclusions
  • 16. Instrumentation using Performance Counters IPC: Instructions Per Cycle 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 16 (c) with Histograms, 3 - Conclusions
  • 17. Instrumentation using Performance Counters L1 Cache Misses 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 17 (c) with Histograms, 3 - Conclusions
  • 18. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 18 (c) with Histograms, 3 - Conclusions
  • 19. Instrumentation using Histograms Time Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 19 (c) with Histograms, 3 - Conclusions
  • 20. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 20 (c) with Histograms, 3 - Conclusions
  • 21. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 21 (c) with Histograms, 3 - Conclusions
  • 22. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 22 (c) with Histograms, 3 - Conclusions
  • 23. Conclusions ● Scalability ● In laptop: No way! ● In Boada: Yes! ● #Processors Increase → L1 Cache Misses Increase ● Useful information very fast → Histograms! 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 23 (c) with Histograms, 3 - Conclusions
  • 24. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012 24