This document discusses virtualizing tier 1 applications. It begins by showing how virtualization adoption has increased significantly for mission critical applications. It then discusses specific steps and considerations for virtualizing tier 1 applications, including:
1. Ensuring the platform (hardware, virtualization software, etc.) can adequately support the application.
2. Ensuring the people and processes are in place to design, implement, operate and troubleshoot the virtualized application. This includes discussing skills, support models, change management and monitoring.
3. Reviewing the application itself and existing reference architectures to understand virtualization best practices and sizing for that application. The goal is to virtualize at the application layer rather than physical server layer.
4. The Trend Is Clear… % of customers running appsin productionon VMware 56% 53% 50% 41% 34% 36% 27% 24% MS Exchange MS SharePoint MS SQL Oracle Middleware Oracle DB IBM WebSphere IBM DB2 SAP Source: VMware customer survey, September 2008, sample size 1038 Data: Within subset of VMware customers running a specific app, % that have at least one instance of that app in production in a VM In a recent Gartner poll, 73% of customers claimed to use x86 virtualization for mission critical applications in production Source: Gartner IOM Conference (June 2008) “Linux and Windows Server Virtualization Is Picking Up Steam” (ID Number: G00161702)
6. VMware VMmark Use VMmark as the basis for CPU selection It is the official benchmark for VMware, and it uses multiple workload Other benchmark are not run on vSphere, and typically test 1 workload Use it as a guide only You need head room and HA. How it’s done 1 Tile = 6 VM. Each is 1 vCPU. MS Exchange, MySQL, Apache, J2EE, File Server, Idle VM Result page: www.vmware.com/products/vmmark/results.html VMmark 1.1 results is directly comparable to VMmark 1.0 results. The underlying virtual hardware definitions and load levels for each workload have not changed. 1 Tile = 6 VM Each 2 CPUs 2GB RAM
7. Sample Results (as at Feb 2010. Major update in May) VMware does not recommend 1 OEM over the other. I use 1 OEM here so we don’t compare between OEM Look at this number This number is when comparing with same #Tiles Opteron 8439, 24 cores Xeon 5570, 8 cores Opteron 2435, 12 cores Xeon 5470, 8 cores This tells us that DL380 G6can run 17 Tiles, at 100% utilisation. Each Tile has 6 VM, but 1 is idle. 17 x 5 VM = 85 active VM in 1 box! At 70% ESX host utilisation, that’s around 59 VM. DL 385 G6 runs 11 tiles. The 2 boxes have similar date submission, around mid 2009.
8. Xeon 5500 improvement over previous generation Confidential 8 Around 2:1 improvement Details Servers were Dell M600 and Dell 2950 Storage were a few Dell EqualLogic PS5000XV iSCSI arrays. MS Exchange Load Generator (LoadGen) was used to run the tests. The VM had 10GB of RAM and installed with Windows 2008 x64 Enterprise Edition and the Exchange mailbox role.
11. Performance App Delivery Oracle Licensing App QoS Consolidation Match Native Performance Even in Consolidation Scenarios 95%+ Oracle instances match native performance on VMware Virtualization-Friendly Licensing Increase utilization of Oracle licenses Reduce HW costs by >50% Consolidate servers by 4X – 20X Accelerate Application Delivery Provision On-Demand (production and test/dev) Increase Application Quality of Service Scale dynamically Built-in High Availability and simple Disaster Recovery Why Deploy Oracle Products on VMware?
13. Designing for Tier 1 How does your Architecture….. lend itself for troubleshooting? ensure high availability? integrate with DR? integrate with security? minimise risk during vSphere update/upgrade? encourage the expert own their respective area? Storage, Network, Server help Application team monitor/troubleshoot? Is vSphere alone sufficient? Products from VMware Products from non VMware that deliver superior technology Next few slides show an example of an architecture.
21. Root cause analysis and faster troubleshootingApplication Performance Monitoring
22. Value add over vSphere AppSpeed provides application-awareness to vSphere. Answer questions like: What applications are running in my data center? Are the users getting good response times? When the application is slow, is it at all my VMware fault? Where is the bottleneck? Am I meeting my SLA? What are the different transactions a user makes on an application and how are they performing? What is the relationship between application performance and resource utilization?
31. The “people” side Knowledge Design knowledge Troubleshooting knowledge Virtual environment is easy to install & configure. Troubleshooting is a different ball game. Practice helps. Read the logs (fastest way to put your kids to sleep ) VCP & VTSP are Level 100. Consider Advance VCP. Process vCOE Proper Support TAM Business Critical Support
32. Design Knowledge Design Process Overview ESX/ESXi Host Design vSphere Virtual DC Design vSphere Network Design vSphere Storage Design Virtual Machine Design Management and Monitoring Design
33. Troubleshooting Knowledge ESXi Troubleshooting ESX, ESXi, and vCenter Log Files Network Troubleshooting Management Troubleshooting Storage Troubleshooting vMotion & storage vMotion Troubleshooting DRS, HA, DPM, VM Troubleshooting
37. Common Questions What’s in it for me? Is my application supported? Who owns Vmware, Windows or UNIX? Who’s going to pay for these LUNs? Trunks into a server? No way! How are we going to secure this environment? How do we monitor and who do we call?
38. Business Critical Support Designated Support Team Direct Routing to Senior Level Engineers Customer Site Profile Proactive Health Check Root Cause Analysis Support for Migration and Updates Monthly Support Review Meeting http://www.vmware.com/support/services/bcs.html
41. Common Questions How do we charge for this? What should we track, and how do we do it? You mean it automatically changes? What kind of changes are there and what are the risks? How do we manage releases? When do we expand? How do we react to alerts?
42. Virtualization Capability Assessment Before doing Tier 1…. take an honest look at your organisation capability for Tier 1? no need to be long & detailed assessment.
44. VCA Methodology - Assessing the VI Ecosystem Does the environment have the appropriate number of staff and/or skill set to implement, operate and manage the VMware Infrastructure? Do the appropriate procedures exist to allow the organization to effectively manage the VMware Infrastructure at the required service level? Do the necessary tools exist to support the processes and operate the virtualization environment? Can the current VMware Infrastructure design (represented in the existing implementation) provide the desired service levels or operational behavior (i.e. automation, etc.)?
47. Application Review the Ref Arch Desktop MS Exchange 2007 SAP Do Application-sizing, not physical sizing. 1000 SAP SD users, not 8 core Xeon 55890 Go Horizontal if possible. Consolidation Ratio may be lower
Source: Intel preso at Vmworld.Workingclosely with Vmware and our OEMs, VMmark performs very efficiently on Xeon 5500 series.As you can see all the major OEMs has published Vmmark scores on Nehalem based servers – super efficient virtualization is one of the requirements for cloud computing; Xeon 5500 series is the platform of choice
This is another example of how VMware enables higher performance from physical hosts.
The good news with SQL server consolidation – and Oracle consolidaiton – is that you can realize very significant cost savings not just on the infrastructure side, but also on the licensing side. Today, most SQL / ORCL database licenses are used at just 5% - 10% on average. By consolidating, you not only increase infrastructure utilization, but also license utilization. In this example, we are going from 8 individual DB each on a dedicated host, to a couple of shared host running 8 DB instances. With Enterprise edition, just need to license physical processors, so you effectively consolidate your licenses in addition to servers.This leads to a very significant reduciton in licensing costs, especially since software licenses tend to dominate the cost structure for SQL deployments.
Failback requires some setup _after_ fail-over is performed.Unique featuresSite Recovery Manager2 main featuresAutomated DRProof to Audit or Business that IT has done the DR properly with minimal hit on IT resources.It is possible because we have a standard API that many storage vendors have written to. They allow us to control their array (e.g. discover LUN, stop replication)Global ViewBrowse and Search across multiple data center. Scale up to 10 data centers worldwide.
Technically, you maybe able to hit 120 VM or more per cluster. But from best practice point of view, we don’t recommend such high ratio. 120 VM in 7 hosts (1 is assumed to be unavailable) is > 17:1 ratio
Infra VMs (vCenter, Update Manager, etc) connect to Management LAN. Since they are on the same physical NIC, we need to create VLAN.
In most customer environments, early virtualization deployments often focus on virtuliazation “candidates” that are considered lower risk – perhaps file and print servers or applications that are not considered mission critical. As customers seek to expand the virtualization footprint, by definition they want to virtualize business-critical apps – and vCenter AppSpeedis an excellent tool to help them achieve that with confidence. There are two main use cases for AppSpeed: 1. Guarantee application SLA for virtualized multi-tier apps 2. Perform an “Assured Migration”Guarantee application SLAAppSpeed allows you to continually monitor the performance of an application ensuring SLAs are met. The product automatically discovers and maps applications and components all the way down to specific database tables…even if the database or any app tier is still on a physical system. The product will then monitor real-time user end user experience for all transactions – breaking down performance into its component parts (e.g. network, application and database latency). Any time an application does not meet its assigned threshold, the user is alerted, and can quickly analyze the root cause of the problem. Users can quickly identify and dive into the application’s dependencies to pin-point the source of the problem. Perform “Assured Migration”One of the top inhibitors to Virtualizing applications is the concern expressed by app owners that applications may not perform as well in virtualized environments. The answer is to measure performance on the physical app pre virtualization, and again after it has been virtualized. In both use cases, AppSpeed, provides a shared view of the application that be understood by both IT and application owners, engendering better communication about application performance overall, and reducing fingerpointing and enable faster troubleshooting and resolution if and when trouble arises. In short, With vCenter AppSpeed, customers can be sure that their business critical applications will run as well or better in virtual mode.