SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
                 20-APR-2012
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

                                     2
Outline

●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   3
                               (c) with Histograms, 3 - Conclusions
Basic Information
Execution Environments
●   Personal Laptop
    ●   Ubuntu 11.10, 64-bit
    ●   Intel Quad Core i5
    ●   4GB RAM


●   Boada Server
    ●   Intel(R) Xeon(R) CPU E5645 @ 2.40GHz
    ●   12 Cores with HT support
    ●   24 GΒ RAM
     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   4
                                (c) with Histograms, 3 - Conclusions
Basic Information
NAS Parallel Benchmark
●   Evaluate the performance of parallel supercomputers

●   Several Applications                                   MG – MPI Version
    ●    IS, EP, CG, MG                                   Multi-Grid on a sequence
    ●    FT, BT, SP, LU                                              of meshes



●   Extrae → Produce traces
●   Paraver → Analyse traces
        1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   5
                                   (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   6
                               (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   7
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   8
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation




1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   9
                           (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Initialization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   10
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Execution




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   11
                             (c) with Histograms, 3 - Conclusions
Instrumentation by Observation
Finalization




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   12
                             (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   13
                                (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Instructions




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   14
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
Cycles




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   15
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
IPC: Instructions Per Cycle




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   16
                              (c) with Histograms, 3 - Conclusions
Instrumentation
            using Performance Counters
L1 Cache Misses




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   17
                              (c) with Histograms, 3 - Conclusions
Outline
●   Basic Information

●   Instrumentation
    ●   By observation
    ●   Using Performance Counters
    ●   Using Histograms

●   Conclusions

    1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   18
                              (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Time
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   19
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   20
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   21
                            (c) with Histograms, 3 - Conclusions
Instrumentation using Histograms
Percentage
Histogram




  1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   22
                            (c) with Histograms, 3 - Conclusions
Conclusions
●   Scalability
    ●   In laptop: No way!
    ●   In Boada: Yes!

●   #Processors Increase
        → L1 Cache Misses Increase


●   Useful information very fast → Histograms!

     1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters,   23
                               (c) with Histograms, 3 - Conclusions
34324 - Measurement Tools and Techniques



              Instrumenting
            the MG application
        of NAS Parallel Benchmark


               Maria Stylianou
             marsty5@gmail.com
               20-APR-2012
                                           24

Más contenido relacionado

Destacado

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...Maria Stylianou
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your SecretsMaria Stylianou
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based SchedulingMaria Stylianou
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareMaria Stylianou
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Maria Stylianou
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBconfluent
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformconfluent
 

Destacado (10)

A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
A Survey on Large-Scale Decentralized Storage Systems to be used by Volunteer...
 
How Companies Learn Your Secrets
How Companies Learn Your SecretsHow Companies Learn Your Secrets
How Companies Learn Your Secrets
 
Pregel - Paper Review
Pregel - Paper ReviewPregel - Paper Review
Pregel - Paper Review
 
Erlang in 10 minutes
Erlang in 10 minutesErlang in 10 minutes
Erlang in 10 minutes
 
Automatic Energy-based Scheduling
Automatic Energy-based SchedulingAutomatic Energy-based Scheduling
Automatic Energy-based Scheduling
 
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication MiddlewareSPARJA: a Distributed Social Graph Partitioning and Replication Middleware
SPARJA: a Distributed Social Graph Partitioning and Replication Middleware
 
Google's Dremel
Google's DremelGoogle's Dremel
Google's Dremel
 
Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...Performance Analysis of multithreaded applications based on Hardware Simulati...
Performance Analysis of multithreaded applications based on Hardware Simulati...
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Apache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platformApache kafka-a distributed streaming platform
Apache kafka-a distributed streaming platform
 

Similar a Instrumenting the MG applicaiton of NAS Parallel Benchmark

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRAssuser58d6dc2
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmajayrampelli
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...South Tyrol Free Software Conference
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013Elsa von Licy
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Salvatore La Bua
 

Similar a Instrumenting the MG applicaiton of NAS Parallel Benchmark (8)

Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
SRA final project
SRA final projectSRA final project
SRA final project
 
Incheon National University - EATED SRA
Incheon National University - EATED SRAIncheon National University - EATED SRA
Incheon National University - EATED SRA
 
Dongliang_Slides
Dongliang_SlidesDongliang_Slides
Dongliang_Slides
 
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithmReal time-image-processing-applied-to-traffic-queue-detection-algorithm
Real time-image-processing-applied-to-traffic-queue-detection-algorithm
 
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
SFScon21 - Alex Bojeri - Artificial Intelligence Algorithms for Automatic Seg...
 
Mi rna data analysis 2013
Mi rna data analysis 2013Mi rna data analysis 2013
Mi rna data analysis 2013
 
Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...Design and Implementation of Modules for the Extraction of Biometric Paramete...
Design and Implementation of Modules for the Extraction of Biometric Paramete...
 

Último

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Último (20)

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Instrumenting the MG applicaiton of NAS Parallel Benchmark

  • 1. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012
  • 2. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 2
  • 3. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 3 (c) with Histograms, 3 - Conclusions
  • 4. Basic Information Execution Environments ● Personal Laptop ● Ubuntu 11.10, 64-bit ● Intel Quad Core i5 ● 4GB RAM ● Boada Server ● Intel(R) Xeon(R) CPU E5645 @ 2.40GHz ● 12 Cores with HT support ● 24 GΒ RAM 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 4 (c) with Histograms, 3 - Conclusions
  • 5. Basic Information NAS Parallel Benchmark ● Evaluate the performance of parallel supercomputers ● Several Applications MG – MPI Version ● IS, EP, CG, MG Multi-Grid on a sequence ● FT, BT, SP, LU of meshes ● Extrae → Produce traces ● Paraver → Analyse traces 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 5 (c) with Histograms, 3 - Conclusions
  • 6. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 6 (c) with Histograms, 3 - Conclusions
  • 7. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 7 (c) with Histograms, 3 - Conclusions
  • 8. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 8 (c) with Histograms, 3 - Conclusions
  • 9. Instrumentation by Observation 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 9 (c) with Histograms, 3 - Conclusions
  • 10. Instrumentation by Observation Initialization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 10 (c) with Histograms, 3 - Conclusions
  • 11. Instrumentation by Observation Execution 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 11 (c) with Histograms, 3 - Conclusions
  • 12. Instrumentation by Observation Finalization 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 12 (c) with Histograms, 3 - Conclusions
  • 13. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 13 (c) with Histograms, 3 - Conclusions
  • 14. Instrumentation using Performance Counters Instructions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 14 (c) with Histograms, 3 - Conclusions
  • 15. Instrumentation using Performance Counters Cycles 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 15 (c) with Histograms, 3 - Conclusions
  • 16. Instrumentation using Performance Counters IPC: Instructions Per Cycle 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 16 (c) with Histograms, 3 - Conclusions
  • 17. Instrumentation using Performance Counters L1 Cache Misses 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 17 (c) with Histograms, 3 - Conclusions
  • 18. Outline ● Basic Information ● Instrumentation ● By observation ● Using Performance Counters ● Using Histograms ● Conclusions 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 18 (c) with Histograms, 3 - Conclusions
  • 19. Instrumentation using Histograms Time Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 19 (c) with Histograms, 3 - Conclusions
  • 20. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 20 (c) with Histograms, 3 - Conclusions
  • 21. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 21 (c) with Histograms, 3 - Conclusions
  • 22. Instrumentation using Histograms Percentage Histogram 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 22 (c) with Histograms, 3 - Conclusions
  • 23. Conclusions ● Scalability ● In laptop: No way! ● In Boada: Yes! ● #Processors Increase → L1 Cache Misses Increase ● Useful information very fast → Histograms! 1 - Basic Info, 2 - Instrumentation (a) by Observation, (b) with Performance Counters, 23 (c) with Histograms, 3 - Conclusions
  • 24. 34324 - Measurement Tools and Techniques Instrumenting the MG application of NAS Parallel Benchmark Maria Stylianou marsty5@gmail.com 20-APR-2012 24