SlideShare una empresa de Scribd logo
1 de 33
IBM GLOBAL SERVICES New Orleans, LA P12 Brett Allison Leveraging Open Source to Manage  SAN Performance  July 25-29, 2005 ©  IBM Corporation 2005
Trademarks & Disclaimer ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Table of Contents ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What Is Open Source Software? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
What is a SAN?  ISL’s Core Switch - A Core Switch - B Fabric Edge Switch - A Edge Switch - B Links Links Storage Switch - A Storage Switch - B Servers Storage Servers
What Can We Measure on the Attached Server?  Server View SAN Storage Fabric LUN PATH A PATH B Read Kbytes/sec, Write Kbytes/sec, I/Os per second, Reads/sec, Writes/sec, End-to-End Response Time Physical Volume  Read Kbytes/sec, Write Kbytes/sec, I/Os per second, Reads/sec, Writes/sec Virtual Path/LUN Read Kbytes/sec, Write Kbytes/sec, I/Os per second Adapter Metrics Component HBA HBA
What SAN Fabric Components Can We Measure? Fabric Edge Switch - A Edge Switch - B ISL’s Core Switch - A Core Switch - B Links Links Storage Switch - A Storage Switch - B What can be measured?
What Can We Measure on the Storage Server? KB/sec, RT Physical NVS Delays Cache Hits Logical Volume: Reads, Writes, Sequential I/Os, KB/sec,  I/O Time Physical
Problem Definition and Constraints ,[object Object],[object Object],[object Object],[object Object],[object Object]
What is the Solution? See Appendix H for Requirements sar,  iostat,  filemon MRTG SNMP TSE, DB2,  PERL Collect Post Process PERL, PHP PERL, PHP Extract/Show MYSQL Store RRDTOOL MYSQL Mixed OSS Legend OSS+Glue Server ESS Switch Apache, PHP Browser
Server Collection/Post-processing/Store ,[object Object],[object Object],[object Object],[object Object],[object Object],Appendix A,B Collect Post Process Appendix C Appendix D Store Server Extract/Show Apache, PHP Browser
SAN Fabric – Collection with MRTG/SNMP ,[object Object],[object Object],[object Object],[object Object],[object Object],Switch MRTG SNMP Collect
SAN Fabric Data Storage with MRTG/RRDTool ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Switch MRTG SNMP Collect RRDtool Store
SAN Fabric View with RRDTOOL/PERL/Apache ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Switch MRTG SNMP Collect RRDTOOL Apache, PHP Browser Store Extract/Show
Sample Port Throughput Chart
Storage Server – Collection ,[object Object],[object Object],[object Object],[object Object],[object Object]
Storage Server – Post Process and Import ,[object Object],Array Configuration ,[object Object],[object Object],Array Performance 2 B 3 1 72.8 10000 1024 12288 800 023-12345 23.59 0 20050121 DISK GROUP LOOP ADAPTER CLUSTER SIZE GB RPM NVS PER CLUSTER CACHE PER CLUSTER MODEL ESS SN END TIME START TIME START DATE 7 0.23 308 14919.43 43.05 94.69 25.37 1076.54 21.59 98 6.21 1046 34.41 rank10  MAX NVS FULL AVG NVS FULL CACHE HOLD MIN CACHE HOLD AVG SEQ PCT READ PCT AVG WRITEKBYTESRATE AVG READKBYTESRATE AVG ARRAY RT MS DISK UTIL MAX DISK UTIL AVG MAX IO Rate AVG IO RATE ARRAY
Storage Server – View – Define Reports/Charts MYSQL Capacity Health Check Server Component Customer Exceptions Business Logic,  SQL Queries
Storage Server – View – Define Forms 1) Select ESS Reports, then “continue” 2) Click to select the ESS, or hold the ctrl key to select multiples
Storage Server – View ESS Array Summary Report
Storage Server – Chart Array Exceptions Based on the exception table in the previous slide we can drill down by clicking on the exception and chart the exceptions
Storage Server – ESS Health Check Customer View ,[object Object],[object Object]
Appendix A - Measure End-to-End Host Disk I/O Response Time The iostat package for Linux is only valid with a 2.4 & 2.6 kernel See Appendix B for links to more information Avg. Disk sec/Read Physical Disk perfmon NT/Wintel svctm (ms) iostat –d 2 5 *iostat Linux iostat –xcn 2 5 sar –d  filemon -o /tmp/filemon.log -O all  Command/Object iostat sar filemon Native Tool svc_t (ms) Solaris avserv (ms) HP-UX read time (ms) write time (ms) AIX Metric(s) OS
Appendix B:  Getting LUN Serial Numbers for ESS Devices Note :  ESS Utilities for AIX/HP-UX/Solaris are available at:  http://www-1.ibm.com/servers/storage/support/disk/2105/downloading.html Host config. -  http://www.redbooks.ibm.com/abstracts/tips0553.html Device Name LUN SN lsvpcfg SDD Linux SDD ESS Util Tool Device Name Serial Datapath query device Wintel VG, hostname, Connection, hdisk LUN SN lsvp –a AIX, HP-UX, Solaris Other Metrics Key Command OS
Appendix C:  Format ‘ lsvp –a ’ and ‘ filemon ’ (Logic) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Appendix D:  Import Data into MYSQL - Logic ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Appendix E:  DB2 Query Wrapper - Logic ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],### PERL Code snippet #### This is not a fully functional script, it is just an example open (KSHFILE, "> $kshfile") || &msg("die","Could not write $kshfile! $!"); # Open shell script print KSHFILE "db2 connect to $remote user $db2user using $db2pass"; # Print db2 connect info to shell print KSHFILE "db2 -tf $runfile"; # Print command to run query file print KSHFILE "db2 connect reset"; # Reset connection close(KSHFILE);  # Close the shell script system("chmod +x $kshfile"); # Modify shell script to have execute perms exec("$kshfile"); # Execute shell script
Appendix F:  DB2 Query for Array Performance Data Note :  This information is relevant only if you have the TotalStorage Expert installed and access to the DB2 command line on the TSE server. SELECT DISTINCT A.*, B.M_CARD_NUM, B.M_LOOP_ID, B.M_GRP_NUM FROM DB2ADMIN.VPCRK A, DB2ADMIN.VPCFG B WHERE ( ( A.PC_DATE_B >= '%STARTDATE' AND A.PC_DATE_E <= '%ENDDATE' AND A.PC_TIME_B >= '%STARTTIME' AND A.PC_TIME_E <= '%ENDTIME' AND A.M_MACH_SN = '%ESSID' AND A.M_MACH_SN = B.M_MACH_SN AND A.M_ARRAY_ID = B.M_ARRAY_ID AND A.P_TASK = B.P_TASK ) ) ORDER BY A.M_ARRAY_ID, A.PC_DATE_B, A.PC_DATE_E with ur;
Appendix G:  DB2 Query for Array Configuration Data Note :  This information is relevant only if you have the TotalStorage Expert installed and access to the DB2 command line on the TSE server. SELECT  DISTINCT A.M_MACH_SN, A.M_MODEL_N, A.M_CLUSTER_N, A.M_RAM, A.M_NVS, C.I_DDM_RPM, C.I_DDM_GB_CAPACITY FROM  DB2ADMIN.VPVPD A, DB2ADMIN.VMPDX B, DB2ADMIN.VcMDDM C WHERE ( (  A.M_MACH_SN = B.I_VSM_SN AND B.I_VSM_IDX = C.I_VSM_IDX ) ) ORDER BY A.M_MACH_SN, A.M_CLUSTER_N;
Appendix H:  Requirements - URLs ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Appendix I:  Array Summary Report Logic ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Appendix J:  Useful Links Slide ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Biography Brett Allison has been doing distributed systems performance related work since 1997 including J2EE application analysis, UNIX/NT, and Storage technologies.  His current role is Performance and Capacity Management team lead ITDS.  He has developed tools, processes, and service offerings to support storage performance and capacity.  He has spoken at a number of conferences and is the author of several White Papers on performance

Más contenido relacionado

La actualidad más candente

Hadoop Interacting with HDFS
Hadoop Interacting with HDFSHadoop Interacting with HDFS
Hadoop Interacting with HDFSApache Apex
 
Arcserve udp recovery point server and global deduplication 12-2014
Arcserve udp   recovery point server and global deduplication 12-2014Arcserve udp   recovery point server and global deduplication 12-2014
Arcserve udp recovery point server and global deduplication 12-2014Gina Tragos
 
Hadoop basic commands
Hadoop basic commandsHadoop basic commands
Hadoop basic commandsbispsolutions
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Andrew Underwood
 
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...inside-BigData.com
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introductionacogoluegnes
 
Hadoop infrastructure scaling with the Dell PowerEdge FX2
Hadoop infrastructure scaling with the Dell PowerEdge FX2 Hadoop infrastructure scaling with the Dell PowerEdge FX2
Hadoop infrastructure scaling with the Dell PowerEdge FX2 Principled Technologies
 
The Open Source and Cloud Part of Oracle Big Data Cloud Service for Beginners
The Open Source and Cloud Part of Oracle Big Data Cloud Service for BeginnersThe Open Source and Cloud Part of Oracle Big Data Cloud Service for Beginners
The Open Source and Cloud Part of Oracle Big Data Cloud Service for BeginnersEdelweiss Kammermann
 
Hadoop Summit 2012 | HBase Consistency and Performance Improvements
Hadoop Summit 2012 | HBase Consistency and Performance ImprovementsHadoop Summit 2012 | HBase Consistency and Performance Improvements
Hadoop Summit 2012 | HBase Consistency and Performance ImprovementsCloudera, Inc.
 
Technical track 2: arcserve UDP for virtualization & cloud
Technical track 2: arcserve UDP for virtualization & cloudTechnical track 2: arcserve UDP for virtualization & cloud
Technical track 2: arcserve UDP for virtualization & cloudarcserve data protection
 
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisStorage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisSameer Tiwari
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Programinside-BigData.com
 
Apache Hadoop YARN, NameNode HA, HDFS Federation
Apache Hadoop YARN, NameNode HA, HDFS FederationApache Hadoop YARN, NameNode HA, HDFS Federation
Apache Hadoop YARN, NameNode HA, HDFS FederationAdam Kawa
 
3 installation-setup-of-r3
3 installation-setup-of-r33 installation-setup-of-r3
3 installation-setup-of-r3sanganiraju
 

La actualidad más candente (20)

Hadoop Interacting with HDFS
Hadoop Interacting with HDFSHadoop Interacting with HDFS
Hadoop Interacting with HDFS
 
Bluestore
BluestoreBluestore
Bluestore
 
Arcserve udp recovery point server and global deduplication 12-2014
Arcserve udp   recovery point server and global deduplication 12-2014Arcserve udp   recovery point server and global deduplication 12-2014
Arcserve udp recovery point server and global deduplication 12-2014
 
Hadoop basic commands
Hadoop basic commandsHadoop basic commands
Hadoop basic commands
 
Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016Dell Lustre Storage Architecture Presentation - MBUG 2016
Dell Lustre Storage Architecture Presentation - MBUG 2016
 
Technical track 1: arcserve UDP deep dvie
Technical track 1: arcserve UDP deep dvieTechnical track 1: arcserve UDP deep dvie
Technical track 1: arcserve UDP deep dvie
 
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...
Performance Comparison of Intel Enterprise Edition Lustre and HDFS for MapRed...
 
HDFS_Command_Reference
HDFS_Command_ReferenceHDFS_Command_Reference
HDFS_Command_Reference
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 
HDFS Internals
HDFS InternalsHDFS Internals
HDFS Internals
 
Hadoop File System Shell Commands,
Hadoop File System Shell Commands,Hadoop File System Shell Commands,
Hadoop File System Shell Commands,
 
Hadoop infrastructure scaling with the Dell PowerEdge FX2
Hadoop infrastructure scaling with the Dell PowerEdge FX2 Hadoop infrastructure scaling with the Dell PowerEdge FX2
Hadoop infrastructure scaling with the Dell PowerEdge FX2
 
The Open Source and Cloud Part of Oracle Big Data Cloud Service for Beginners
The Open Source and Cloud Part of Oracle Big Data Cloud Service for BeginnersThe Open Source and Cloud Part of Oracle Big Data Cloud Service for Beginners
The Open Source and Cloud Part of Oracle Big Data Cloud Service for Beginners
 
Hadoop Summit 2012 | HBase Consistency and Performance Improvements
Hadoop Summit 2012 | HBase Consistency and Performance ImprovementsHadoop Summit 2012 | HBase Consistency and Performance Improvements
Hadoop Summit 2012 | HBase Consistency and Performance Improvements
 
Understanding Hadoop
Understanding HadoopUnderstanding Hadoop
Understanding Hadoop
 
Technical track 2: arcserve UDP for virtualization & cloud
Technical track 2: arcserve UDP for virtualization & cloudTechnical track 2: arcserve UDP for virtualization & cloud
Technical track 2: arcserve UDP for virtualization & cloud
 
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - RedisStorage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
Storage Systems for big data - HDFS, HBase, and intro to KV Store - Redis
 
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral ProgramBig Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
Big Lab Problems Solved with Spectrum Scale: Innovations for the Coral Program
 
Apache Hadoop YARN, NameNode HA, HDFS Federation
Apache Hadoop YARN, NameNode HA, HDFS FederationApache Hadoop YARN, NameNode HA, HDFS Federation
Apache Hadoop YARN, NameNode HA, HDFS Federation
 
3 installation-setup-of-r3
3 installation-setup-of-r33 installation-setup-of-r3
3 installation-setup-of-r3
 

Destacado

The Next Leap Forward LTO-7 - Spectra Logic
The Next Leap Forward LTO-7 - Spectra LogicThe Next Leap Forward LTO-7 - Spectra Logic
The Next Leap Forward LTO-7 - Spectra Logicspectralogic
 
TS1150 Webinar Slides
TS1150 Webinar SlidesTS1150 Webinar Slides
TS1150 Webinar Slidesspectralogic
 
Ibm tivoli storage manager in a clustered environment sg246679
Ibm tivoli storage manager in a clustered environment sg246679Ibm tivoli storage manager in a clustered environment sg246679
Ibm tivoli storage manager in a clustered environment sg246679Banking at Ho Chi Minh city
 
Presentation oracle on power power advantages and license optimization
Presentation   oracle on power power advantages and license optimizationPresentation   oracle on power power advantages and license optimization
Presentation oracle on power power advantages and license optimizationsolarisyougood
 
Visual studio 2008 overview
Visual studio 2008 overviewVisual studio 2008 overview
Visual studio 2008 overviewsagaroceanic11
 
Sparc t4 systems customer presentation
Sparc t4 systems customer presentationSparc t4 systems customer presentation
Sparc t4 systems customer presentationsolarisyougood
 
Ibm tivoli storage manager bare machine recovery for aix with sysback - red...
Ibm tivoli storage manager   bare machine recovery for aix with sysback - red...Ibm tivoli storage manager   bare machine recovery for aix with sysback - red...
Ibm tivoli storage manager bare machine recovery for aix with sysback - red...Banking at Ho Chi Minh city
 
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762Banking at Ho Chi Minh city
 
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...VibrantGroup
 
Overview of v cloud case studies
Overview of v cloud case studiesOverview of v cloud case studies
Overview of v cloud case studiessolarisyougood
 
2.ibm flex system manager overview
2.ibm flex system manager overview2.ibm flex system manager overview
2.ibm flex system manager overviewsolarisyougood
 
RHT Upgrading to vSphere 5
RHT Upgrading to vSphere 5RHT Upgrading to vSphere 5
RHT Upgrading to vSphere 5virtualsouthwest
 
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...Circling Cycle
 
CTI Group- Blue power technology storwize technical training for customer - p...
CTI Group- Blue power technology storwize technical training for customer - p...CTI Group- Blue power technology storwize technical training for customer - p...
CTI Group- Blue power technology storwize technical training for customer - p...Tri Susilo
 
V mware v center orchestrator 5.5 knowledge transfer kit
V mware v center orchestrator 5.5 knowledge transfer kitV mware v center orchestrator 5.5 knowledge transfer kit
V mware v center orchestrator 5.5 knowledge transfer kitsolarisyougood
 
Virtual san hardware guidance &amp; best practices
Virtual san hardware guidance &amp; best practicesVirtual san hardware guidance &amp; best practices
Virtual san hardware guidance &amp; best practicessolarisyougood
 

Destacado (20)

The Next Leap Forward LTO-7 - Spectra Logic
The Next Leap Forward LTO-7 - Spectra LogicThe Next Leap Forward LTO-7 - Spectra Logic
The Next Leap Forward LTO-7 - Spectra Logic
 
TS1150 Webinar Slides
TS1150 Webinar SlidesTS1150 Webinar Slides
TS1150 Webinar Slides
 
Ibm tivoli storage manager in a clustered environment sg246679
Ibm tivoli storage manager in a clustered environment sg246679Ibm tivoli storage manager in a clustered environment sg246679
Ibm tivoli storage manager in a clustered environment sg246679
 
Presentation oracle on power power advantages and license optimization
Presentation   oracle on power power advantages and license optimizationPresentation   oracle on power power advantages and license optimization
Presentation oracle on power power advantages and license optimization
 
AIX 5L Differences Guide Version 5.3 Edition
AIX 5L Differences Guide Version 5.3 EditionAIX 5L Differences Guide Version 5.3 Edition
AIX 5L Differences Guide Version 5.3 Edition
 
Visual studio 2008 overview
Visual studio 2008 overviewVisual studio 2008 overview
Visual studio 2008 overview
 
IBMRedbook
IBMRedbookIBMRedbook
IBMRedbook
 
Accelerate Return on Data
Accelerate Return on DataAccelerate Return on Data
Accelerate Return on Data
 
Sparc t4 systems customer presentation
Sparc t4 systems customer presentationSparc t4 systems customer presentation
Sparc t4 systems customer presentation
 
Ibm tivoli storage manager bare machine recovery for aix with sysback - red...
Ibm tivoli storage manager   bare machine recovery for aix with sysback - red...Ibm tivoli storage manager   bare machine recovery for aix with sysback - red...
Ibm tivoli storage manager bare machine recovery for aix with sysback - red...
 
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762
Proof of concept guide for ibm tivoli storage manager version 5.3 sg246762
 
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...
Aix admin course provider Navi Mumbai | AIX Admin Course Training Navi Mumbai...
 
Overview of v cloud case studies
Overview of v cloud case studiesOverview of v cloud case studies
Overview of v cloud case studies
 
RHT Design for Security
RHT Design for SecurityRHT Design for Security
RHT Design for Security
 
2.ibm flex system manager overview
2.ibm flex system manager overview2.ibm flex system manager overview
2.ibm flex system manager overview
 
RHT Upgrading to vSphere 5
RHT Upgrading to vSphere 5RHT Upgrading to vSphere 5
RHT Upgrading to vSphere 5
 
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
HP-UX Dynamic Root Disk vs Solaris Live Upgrade vs AIX Multibos by Dusan Balj...
 
CTI Group- Blue power technology storwize technical training for customer - p...
CTI Group- Blue power technology storwize technical training for customer - p...CTI Group- Blue power technology storwize technical training for customer - p...
CTI Group- Blue power technology storwize technical training for customer - p...
 
V mware v center orchestrator 5.5 knowledge transfer kit
V mware v center orchestrator 5.5 knowledge transfer kitV mware v center orchestrator 5.5 knowledge transfer kit
V mware v center orchestrator 5.5 knowledge transfer kit
 
Virtual san hardware guidance &amp; best practices
Virtual san hardware guidance &amp; best practicesVirtual san hardware guidance &amp; best practices
Virtual san hardware guidance &amp; best practices
 

Similar a Leveraging OSS to Manage SAN Performance

Ds8000 Practical Performance Analysis P04 20060718
Ds8000 Practical Performance Analysis P04 20060718Ds8000 Practical Performance Analysis P04 20060718
Ds8000 Practical Performance Analysis P04 20060718brettallison
 
Mainframe Architecture & Product Overview
Mainframe Architecture & Product OverviewMainframe Architecture & Product Overview
Mainframe Architecture & Product Overviewabhi1112
 
IBM SAN Volume Controller Performance Analysis
IBM SAN Volume Controller Performance AnalysisIBM SAN Volume Controller Performance Analysis
IBM SAN Volume Controller Performance Analysisbrettallison
 
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...Alluxio, Inc.
 
C Cure Users Group Presentation Final 4
C Cure Users Group Presentation Final 4C Cure Users Group Presentation Final 4
C Cure Users Group Presentation Final 4halgig
 
Ceph Day Bring Ceph To Enterprise
Ceph Day Bring Ceph To EnterpriseCeph Day Bring Ceph To Enterprise
Ceph Day Bring Ceph To EnterpriseAlex Lau
 
Oracle Exec Summary 7000 Unified Storage
Oracle Exec Summary 7000 Unified StorageOracle Exec Summary 7000 Unified Storage
Oracle Exec Summary 7000 Unified StorageDavid R. Klauser
 
Linux Desktop Automation
Linux Desktop AutomationLinux Desktop Automation
Linux Desktop AutomationRui Lapa
 
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System z
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System zSHARE.ORG in Boston Aug 2013 RHEL update for IBM System z
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System zFilipe Miranda
 
Android porting for dummies @droidconin 2011
Android porting for dummies @droidconin 2011Android porting for dummies @droidconin 2011
Android porting for dummies @droidconin 2011pundiramit
 
Ceph Day Tokyo - Bring Ceph to Enterprise
Ceph Day Tokyo - Bring Ceph to Enterprise Ceph Day Tokyo - Bring Ceph to Enterprise
Ceph Day Tokyo - Bring Ceph to Enterprise Ceph Community
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephDanielle Womboldt
 
Ceph Day Beijing - SPDK in Ceph
Ceph Day Beijing - SPDK in CephCeph Day Beijing - SPDK in Ceph
Ceph Day Beijing - SPDK in CephCeph Community
 
Planning for-high-performance-web-application
Planning for-high-performance-web-applicationPlanning for-high-performance-web-application
Planning for-high-performance-web-applicationNguyễn Duy Nhân
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Community
 
Problem Determination Tools
Problem Determination ToolsProblem Determination Tools
Problem Determination ToolsCICS ROADSHOW
 

Similar a Leveraging OSS to Manage SAN Performance (20)

Ds8000 Practical Performance Analysis P04 20060718
Ds8000 Practical Performance Analysis P04 20060718Ds8000 Practical Performance Analysis P04 20060718
Ds8000 Practical Performance Analysis P04 20060718
 
Mainframe Architecture & Product Overview
Mainframe Architecture & Product OverviewMainframe Architecture & Product Overview
Mainframe Architecture & Product Overview
 
IBM SAN Volume Controller Performance Analysis
IBM SAN Volume Controller Performance AnalysisIBM SAN Volume Controller Performance Analysis
IBM SAN Volume Controller Performance Analysis
 
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...Building a high-performance data lake analytics engine at Alibaba Cloud with ...
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
 
C Cure Users Group Presentation Final 4
C Cure Users Group Presentation Final 4C Cure Users Group Presentation Final 4
C Cure Users Group Presentation Final 4
 
Ceph Day Bring Ceph To Enterprise
Ceph Day Bring Ceph To EnterpriseCeph Day Bring Ceph To Enterprise
Ceph Day Bring Ceph To Enterprise
 
Oracle Exec Summary 7000 Unified Storage
Oracle Exec Summary 7000 Unified StorageOracle Exec Summary 7000 Unified Storage
Oracle Exec Summary 7000 Unified Storage
 
Linux Desktop Automation
Linux Desktop AutomationLinux Desktop Automation
Linux Desktop Automation
 
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System z
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System zSHARE.ORG in Boston Aug 2013 RHEL update for IBM System z
SHARE.ORG in Boston Aug 2013 RHEL update for IBM System z
 
optimizing_ceph_flash
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flash
 
Sun Web Server Brief
Sun Web Server BriefSun Web Server Brief
Sun Web Server Brief
 
Android porting for dummies @droidconin 2011
Android porting for dummies @droidconin 2011Android porting for dummies @droidconin 2011
Android porting for dummies @droidconin 2011
 
Ceph Day Tokyo - Bring Ceph to Enterprise
Ceph Day Tokyo - Bring Ceph to Enterprise Ceph Day Tokyo - Bring Ceph to Enterprise
Ceph Day Tokyo - Bring Ceph to Enterprise
 
Ceph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for CephCeph Day Beijing - SPDK for Ceph
Ceph Day Beijing - SPDK for Ceph
 
Ceph Day Beijing - SPDK in Ceph
Ceph Day Beijing - SPDK in CephCeph Day Beijing - SPDK in Ceph
Ceph Day Beijing - SPDK in Ceph
 
CSL_Cochin_c
CSL_Cochin_cCSL_Cochin_c
CSL_Cochin_c
 
Planning for-high-performance-web-application
Planning for-high-performance-web-applicationPlanning for-high-performance-web-application
Planning for-high-performance-web-application
 
Sun Web Server Brief
Sun Web Server BriefSun Web Server Brief
Sun Web Server Brief
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK
 
Problem Determination Tools
Problem Determination ToolsProblem Determination Tools
Problem Determination Tools
 

Leveraging OSS to Manage SAN Performance

  • 1. IBM GLOBAL SERVICES New Orleans, LA P12 Brett Allison Leveraging Open Source to Manage SAN Performance July 25-29, 2005 © IBM Corporation 2005
  • 2.
  • 3.
  • 4.
  • 5. What is a SAN? ISL’s Core Switch - A Core Switch - B Fabric Edge Switch - A Edge Switch - B Links Links Storage Switch - A Storage Switch - B Servers Storage Servers
  • 6. What Can We Measure on the Attached Server? Server View SAN Storage Fabric LUN PATH A PATH B Read Kbytes/sec, Write Kbytes/sec, I/Os per second, Reads/sec, Writes/sec, End-to-End Response Time Physical Volume Read Kbytes/sec, Write Kbytes/sec, I/Os per second, Reads/sec, Writes/sec Virtual Path/LUN Read Kbytes/sec, Write Kbytes/sec, I/Os per second Adapter Metrics Component HBA HBA
  • 7. What SAN Fabric Components Can We Measure? Fabric Edge Switch - A Edge Switch - B ISL’s Core Switch - A Core Switch - B Links Links Storage Switch - A Storage Switch - B What can be measured?
  • 8. What Can We Measure on the Storage Server? KB/sec, RT Physical NVS Delays Cache Hits Logical Volume: Reads, Writes, Sequential I/Os, KB/sec, I/O Time Physical
  • 9.
  • 10. What is the Solution? See Appendix H for Requirements sar, iostat, filemon MRTG SNMP TSE, DB2, PERL Collect Post Process PERL, PHP PERL, PHP Extract/Show MYSQL Store RRDTOOL MYSQL Mixed OSS Legend OSS+Glue Server ESS Switch Apache, PHP Browser
  • 11.
  • 12.
  • 13.
  • 14.
  • 16.
  • 17.
  • 18. Storage Server – View – Define Reports/Charts MYSQL Capacity Health Check Server Component Customer Exceptions Business Logic, SQL Queries
  • 19. Storage Server – View – Define Forms 1) Select ESS Reports, then “continue” 2) Click to select the ESS, or hold the ctrl key to select multiples
  • 20. Storage Server – View ESS Array Summary Report
  • 21. Storage Server – Chart Array Exceptions Based on the exception table in the previous slide we can drill down by clicking on the exception and chart the exceptions
  • 22.
  • 23. Appendix A - Measure End-to-End Host Disk I/O Response Time The iostat package for Linux is only valid with a 2.4 & 2.6 kernel See Appendix B for links to more information Avg. Disk sec/Read Physical Disk perfmon NT/Wintel svctm (ms) iostat –d 2 5 *iostat Linux iostat –xcn 2 5 sar –d filemon -o /tmp/filemon.log -O all Command/Object iostat sar filemon Native Tool svc_t (ms) Solaris avserv (ms) HP-UX read time (ms) write time (ms) AIX Metric(s) OS
  • 24. Appendix B: Getting LUN Serial Numbers for ESS Devices Note : ESS Utilities for AIX/HP-UX/Solaris are available at: http://www-1.ibm.com/servers/storage/support/disk/2105/downloading.html Host config. - http://www.redbooks.ibm.com/abstracts/tips0553.html Device Name LUN SN lsvpcfg SDD Linux SDD ESS Util Tool Device Name Serial Datapath query device Wintel VG, hostname, Connection, hdisk LUN SN lsvp –a AIX, HP-UX, Solaris Other Metrics Key Command OS
  • 25.
  • 26.
  • 27.
  • 28. Appendix F: DB2 Query for Array Performance Data Note : This information is relevant only if you have the TotalStorage Expert installed and access to the DB2 command line on the TSE server. SELECT DISTINCT A.*, B.M_CARD_NUM, B.M_LOOP_ID, B.M_GRP_NUM FROM DB2ADMIN.VPCRK A, DB2ADMIN.VPCFG B WHERE ( ( A.PC_DATE_B >= '%STARTDATE' AND A.PC_DATE_E <= '%ENDDATE' AND A.PC_TIME_B >= '%STARTTIME' AND A.PC_TIME_E <= '%ENDTIME' AND A.M_MACH_SN = '%ESSID' AND A.M_MACH_SN = B.M_MACH_SN AND A.M_ARRAY_ID = B.M_ARRAY_ID AND A.P_TASK = B.P_TASK ) ) ORDER BY A.M_ARRAY_ID, A.PC_DATE_B, A.PC_DATE_E with ur;
  • 29. Appendix G: DB2 Query for Array Configuration Data Note : This information is relevant only if you have the TotalStorage Expert installed and access to the DB2 command line on the TSE server. SELECT DISTINCT A.M_MACH_SN, A.M_MODEL_N, A.M_CLUSTER_N, A.M_RAM, A.M_NVS, C.I_DDM_RPM, C.I_DDM_GB_CAPACITY FROM DB2ADMIN.VPVPD A, DB2ADMIN.VMPDX B, DB2ADMIN.VcMDDM C WHERE ( ( A.M_MACH_SN = B.I_VSM_SN AND B.I_VSM_IDX = C.I_VSM_IDX ) ) ORDER BY A.M_MACH_SN, A.M_CLUSTER_N;
  • 30.
  • 31.
  • 32.
  • 33. Biography Brett Allison has been doing distributed systems performance related work since 1997 including J2EE application analysis, UNIX/NT, and Storage technologies. His current role is Performance and Capacity Management team lead ITDS. He has developed tools, processes, and service offerings to support storage performance and capacity. He has spoken at a number of conferences and is the author of several White Papers on performance

Notas del editor

  1. Scope - The primary focus of this presentation is how to leverage open source software to help in managing Shared Storage performance. The storage server will be the focus with particular emphasis on ESS. This solution is a small one-off solution.
  2. “ Shared storage” typically refers to the storage shared on a SAN. This includes the s torage Area Network Switches and other Fabric components (ISL’s, routers, etc) We can measure many of the components in the SAN including but not limited to: Server HBA’s, Switch ports, and Storage Server I/O components. Link information includes Tput, packets/sec, errors
  3. From the point of view of the server storage is allocated to the physical disks. These disks are accessed via Host Bus Adapters (HBA). Throughput statistics are available on most systems at the HBA level. If a multi-pathing software is implemented, the virtual path typically corresponds to the storage allocation unit on the storage server (LUN). Most servers provide throughput information to the virtual paths. When multi-pathing software is implemented, more than 1 host physical volume will point to the same virtual path. In addition to throughput information, most servers provide end-to-end response time for the physical volumes. This provides the ability to identify if 1 path is performing better than another path (ie – Fabric congestion). All of this information can be measured on most servers using native utilities. There is typically very little visibility into the storage server or network pathing from the server’s view point. In some cases vendor specific server based utilities can provide configuration information that can be used to summarize server based performance data for things like I/Os to a certain storage server component. In any case, these views do not provide visibility outside the server.
  4. We can measure many of the components in the SAN including but not limited to: Server HBA’s, Switch ports, and Storage Server I/O components. The switch ports can provide information such as Kbytes/sec Throughput, packets/sec, and errors/sec
  5. Kbytes/sec and response time for the Ports on the HA side are only available via the API, and the CLI. They are not available in the TSE, MDM, or TPC-DiskT The TSE has 2 performance tables: VPCRK (Array/DG) and VPCCH (LUN). Higher level components do not have measurement, however, you can roll up the data to higher levels such as ESS, Cluster, Adapter, and Loop. In addition to the raw data, several important fields are included in the array level data: Array Avg RT ms, Avg Disk Utilization, % Sequential, % Read The NVS and Cache hit stats are stored at a LUN level.
  6. At this point I have not found a single OSS or Vendor Tool that provides all the necessary infromation to manage the performance of the SAN. There are a number of vendors that seem to have tools that will do this including IBM/Tivoli but either the tools are not compatible with our environment, or they are not ready able to provide all the features we require at this point.
  7. Attached server configuration data is important and the commands used are dependent on the OS type and the Storage Server type. There are a number of options for monitoring ESS including TSE, MDM, TPC for Disk, CLI, API -&gt; CIM agent/CIMOM and several 3 rd party products, and they all have their advantages and disadvantages. The TSE/MDM/TPC-Disk all offer similar data stored in DB2 tables. The CLI provides the information in a formatted report that would require significant formatting prior to analysis. The API is a potential option but I simply did not have the time to create my own collector. As a result, I am currently using a number of different tools to manage the performance of the environment. Collectively these tools provide the essentials for managing the environment. Open source is used in some cases but not all as seen in the slide. Requirements Monitor SAN components Utilize existing data sources: TotalStorage Expert for ESS EFCM log files for Fabric – SNMP in future with MRTG Native distributed server utilities for attached servers Monitor frequently enough to be useful Store data for historical purposes and trending Correlate disparate components
  8. To collect mrtg you can run it as a daemon or run it from cron (Most of the standard linux distributions have mrtg) What type of Data: Port level – Octets in/out, uptime SNMP can be used to pull other information, wwpn, error counts, firmware level, switch rebooted. PERL::SNMP module OR NET::SNMP binary distribution contains SNMP library and PERL MIB module that ties into binary. The problem with PERL::SNMP it does not load the MIB modules CRICKET is another way to do this; http://cricket.sourceforge.net/
  9. mrtgrrd is a CGI script that will query RRD files – takes about a seconds. Included in the set of contributed files. Easy to set up. Set a couple vars and point to the mrtg files to read RRDTool is OS specific so when moving between OS’s you must exported as XML Standard import script for XML formatted data to RRDTool is called: rrdtool dump, and rrdrestore
  10. Not generating graphs automatically is a Good Thing – Processing time big time!
  11. This slide provides a high level description of the key components required for collecting the data. As a side note, the queries used against the TSE db2 database are very similar to what would be ran against a TPC for Disk or MDM DB2 database. It is assumed that a directory structure is already configured. The script to execute the SQL query does 2 things: Takes a query template and replaces key parameters such as start and end date and time with the correct values, It then creates a small shell script that executes the SQL query The queries provided in the appendices listed above are used to gather array level configuration and performance data
  12. Array level data is used because it provides physical measurements for the arrays. Using the configuration information gathered it can also provide a summary of ESS performance at various higher level components. These components include Cluster and Disk Adapter. This report should inlclude a calculated metric for scoring the health of the arrays. The VPCRK table where the array level data is extracted from provides some cluster level information: cache, nvs. These should be save in the Exception table as
  13. Exception charts inlcude the array level and cluster level exceptions that were created and imported in the previous step from the array level data. The healthcheck reports should provide a high level summary of the health of ESS at both a server level, a component level (Cluster, Adapter, Array) as well as a summary of all ESS’s for a given customer. For instances of shared ESS’s, the customer does not reflect a single customer’s perspective. In this step it is necessary to define the data necessary to complete the report. The easiest way to do this is to design the report, and then map the data from the report to the data in the DB. During this step or before the next step you should define your SQL queries required for each report and any business logic required
  14. Forms should provide a means for the user (you) to select the report Type: Healtcheck (Server Level), Rank Report, as well as the required parameters for the SQL statement that will pull the data required for the specific report: Parms include: Start Date, End Date, Start Time, End Time, ESS, Report Type
  15. Forms should provide a means for the user (you) to select the report Type: Healtcheck (Server Level), Rank Report, as well as the required parameters for the SQL statement that will pull the data required for the specific report: Parms include: Start Date, End Date, Start Time, End Time, ESS, Report Type
  16. Forms should provide a means for the user (you) to select the report Type: Healtcheck (Server Level), Rank Report, as well as the required parameters for the SQL statement that will pull the data required for the specific report: Parms include: Start Date, End Date, Start Time, End Time, ESS, Report Type
  17. Forms should provide a means for the user (you) to select the report Type: Healtcheck (Server Level), Rank Report, as well as the required parameters for the SQL statement that will pull the data required for the specific report: Parms include: Start Date, End Date, Start Time, End Time, ESS, Report Type
  18. Generally speaking the I/O response time is the amount of time it takes from the point where the I/O request hits the device driver until the I/O is returned from the device driver
  19. For IBM’ers I have a sample script that I can make available. For external customers I would advise you to contact your local IBM AIX field reps to see if they have anything or roll your own script.
  20. For IBM’ers I have a sample script that I can make available. For external customers I would advise you to contact your local IBM AIX field reps to see if they have anything or roll your own script.
  21. For IBM’ers I have a sample script that I can make available. For external customers I would advise you to contact your local IBM AIX field reps to see if they have anything or roll your own script.