May 2010

You are currently browsing the monthly archive for May 2010.

Some Vblock Posts

The Vblock is starting to get a bit more attention, which is a good thing. Part of it is driven by an increased awareness of the Vblock as a solution; I’d like to think of that as something Ed Saipetch and I had a little to do with after our Gestalt IT Tech Field Day presentation in Boston, but I’m not so naive as to actually believe that. However, I am confident that part of the increased visibility and discussion around Vblock is due to the continued investment and continued persistence by all three members of the VCE Coalition. Regardless of the reason, I’ve collected a few Vblock posts here for your continued enjoyment:

The Case for the Vblock « Jason Nash’s Blog
Vblocks – StorageNerve
VCE VBlock – Perspective for the Technical Decision Maker
VCE Vblock – Alignment of Technology and Operations
UCS, VCE, Acadia, Vblocks and the Journey to the Cloud
What makes up a VCE Vblock 0? Overview Video with Scott Lowe & Mike Foley
Cisco, VMware, EMC Detail Ambitious Vblock Expansion

Tags: , , , , ,

VM6 Software has released VMex 2.1, a new solution built on top of Microsoft Hyper-V that blends high availability, virtual storage, simplified management, and a virtual desktop session manager into an “all-in-one software stack”. Some have even compared it to the hardware stacks emerging from the VCE Coalition and others.

It’s a interesting solution aimed squarely at the SMB market, where VMware has so far been relatively unsuccessful. VM6 doesn’t appear to harbor any misconceptions that their solution competes directly with VMware vSphere; they are focusing almost exclusively on cost and ease of use in their solution.

For more information on the VMex 2.1 release, see the VM6 Software press release.

Tags: , ,

It has been a busy week since I last posted, and the blogging/micro-blogging world has been quite busy. I’ve gathered quite the collection of links and posts over the last week or so; here are a few that caught my eye. Welcome to Virtualization Short Take #41!

  • About a month ago Rick Vanover posted a quick note about the use of Disk.SchedNumReqOutstanding as a potential performance tweak. As Rick mentions at the end of his article, it’s important to “test before and after with an intensive workload”; otherwise, you could find yourself actually hurting performance. Like so many performance tweaks, it really depends upon your specific environment and your specific workloads. Definitely refer to some of the linked resources at the end of Rick’s article (Duncan’s stuff is always helpful) for more details.
  • Speaking of Duncan, his post on vCPU limits is a great read and helps dispel a common misconception about vCPU limits. Remember the definition of a hertz folks—300 million cycles per second (300MHz) means 300 million cycles per second. It doesn’t mean 500 million cycles per second, or 700 million cycles per second.
  • Frank Denneman’s article on memory reservations and resource pools is also a really good read.
  • Kenneth van Ditmarsch has a good post on using datastore permissions to help ensure that VMs are properly placed based on SLAs. This is the kind of operational advice that I think many organizations still need.
  • Continuing our theme of resource allocation, here’s a good post on the effect of shares.
  • If you’re interested in an early look at some of the features targeted for inclusion in VMware View 4.5, have a look at Matthijs Haverink’s post on View 4.5 expected features. If Matthijs’ information is accurate, it looks like VMware has some good stuff planned.
  • I had a URL in my Yojimbo collection for part 5 of the series on Hyper-V dynamic memory, but it doesn’t seem to work anymore. I think the blog post was pulled. If anyone has a working link (yes, I’ve already checked Google), feel free to post it in the comments.
  • Jeremy Waldrop of Varrow brings to light a potential issue with the Cisco “Palo” adapter (now called the Virtual Interface Controller, or VIC) and PowerPath/VE. There is a workaround that fixes the problem. It’s important to note that the Cisco VIC isn’t fully vetted or validated for Vblock yet; that’s still in progress.
  • As a follow up from my mention of this issue in VST #40, I have more information on the Changed Block Tracking (CBT) issue. This post from VMware has more information on the specific conditions needed to produce the problem. I have to say, it looks like a pretty specific set of circumstances. I’m curious to know your thoughts: is this a corner case, or a really significant problem? Personally, I’m leaning toward the former.
  • EMC virtual appliances are really taking off; Chad unwrapped the FMA virtual appliance and fellow vSpecialist team member Nick Weaver unveiled v2 of the “Uber” Celerra VSA as well. I haven’t had the chance to play with the FMA virtual appliance yet, but I’m traveling tonight so maybe I’ll mess around with it on my laptop tonight from the hotel. (Yes, I’m a geek. What can I say?)
  • Following Citrix’s announcement of XenClient, their bare metal client hypervisor, and VMware’s response that perhaps the bare metal client hypervisor’s use cases are more limited than many might think, Citrix has responded by explaining XenClient to VMware. Bare metal hypervisors, unmanaged type 2 hypervisors, and policy-managed type 2 hypervisors all have value in the desktop virtualization space. Perhaps VMware should write a response to Citrix explaining the idea behind check-out/check-in of policy-controlled VMs? While I’m sure that I won’t be very popular with VMware for saying this, I do have to agree with Citrix here: discounting the value of bare metal client hypervisors on the basis of a single use case is a bit disingenuous, especially when you’ve been promoting client hypervisors for a while.
  • Looking to stay sharp and stay relevant in today’s changing IT landscape? Mike DiPetrillo offers some suggestions for skills that IT folks should embrace.
  • Kevin Goodman shared some information here on consolidation ratios with his Cisco UCS environment. He admits he is constrained by RAM, which is common in many data centers today. There are two answers to that problem today: full-width UCS blades with support for massive amounts of RAM; or expensive, high-capacity RAM modules to drive memory capacity higher. It also looks like the Nehalem EX chipset is going to help address that problem with support for more memory buses and more memory slots. Once again I find it interesting that virtualization is helping to drive hardware development.
  • Forbes Guthrie has published v5 of his connections and ports diagram for VMware ESX/ESXi. Definitely a useful resource!
  • This VMware KB article helps clarify the behavior of TPS with Intel Xeon 5500 (Nehalem)-based systems. This isn’t new information (I believe Duncan might have pointed it out first?), but it’s nice to see clarification of the behavior.
  • OK, I’m probably showing my ignorance here (I haven’t had the opportunity to spend as much time with View Manager as I would like), but who knew View Manager had a command-line tool?

I guess that will wrap things up for this issue of Virtualization Short Takes. I hope you’ve found something useful!

Tags: , , , , , , ,

The third annual Carolina VMware Users Summit is fast approaching, and this year looks awesome! Joining Mike Laverick—who is attending the event for the third year in a row—is NetApp’s Vaughn Stewart. Both Chad Sakac and I, both with EMC Corporation, are also going to be there.

The current conference agenda is available on the Charlotte VMUG Regional User Conference home page. You can also register for the event there.

If you are anywhere near the Charlotte, NC area on Friday, June 11, 2010, I highly encourage you to make the time to stop by the event. You won’t be disappointed!

Tags: ,

Welcome to the 40th post in the Virtualization Short Take series, where I share with you various virtualization-related links, thoughts, and news tidbits. (Occasionally, I throw in some stuff that’s not virtualization related just to see if you are paying attention.) Enjoy!

  • There have been a couple of posts now discussing Storage IO Control, a new feature that is possibly slated for inclusion in a future release of VMware vSphere. Storage IO Control extends the disk shares model cluster-wide, allowing administrators to properly shape access to back-end storage resources. The inimitable Scott Drummonds discussed it on Pivot Point (his blog), and Craig Stewart also recently published an article about Storage IO Control over at Gestalt IT. There’s a fair amount of duplication between the two articles (Craig based his article partly on Scott’s), but both are worth a read if you need to come up to speed on this new feature. (Quick disclaimer: I’m discussing Storage IO Control here only because it’s been mentioned elsewhere by others. As to whether or not this feature will or will not appear in a future release, or when that future release might be, I know nothing. OK?)
  • I don’t know why, but I saw this virtual appliance on VMware’s web site earlier today and it has triggered a nagging feeling in the back of my head. Is the future of computing found in simple “scale out” building blocks like this?
  • This article on “VM stall” by Andi Mann just re-confirms something I’ve been saying for a while: there are still too many companies out there that aren’t taking full advantage of virtualization. If you’re one of the almost 54% of companies that is still less than 30% virtualized, what’s holding you back?
  • Among all the other announcements from EMC World last week, this little tidbit might have gotten overlooked. Chad blogged about a fix contained in FLARE 30—the updated version announced at the conference—that addresses a problem with iSCSI initiators and the CLARiiON arrays. Good work, EMC engineering!
  • Over on VMware’s ESXi Chronicles blog, Charu Chaubal recently published a two-part series on hardware monitoring via CIM (part 1 and part 2).
  • A reader dropped me an e-mail about an issue uncovered in their environment while trying to automate the VMware Tools installation. Apparently, the VMware Tools installation relies on 8.3 file naming conventions. Normally, this wouldn’t be a problem, but in environments where 8.3 file name creation is disabled…well, you can see where there might be a problem. No workaround has yet been found. Any wizards out there who have suggestions are welcome to add them to the comments of this post.
  • Two posts popped up in the last couple of weeks regarding the default number of ports on a Nexus 1000V port profile: this post by Kevin Goodman and this post by Jason Nash. Fortunately, it’s a quick process to increase the default maximum of 32 ports.
  • Running your VMware vSphere environment on NFS? Have a look at this document from VMware.
  • Didier Pironet of DeinosCloud, the same gentleman who showed us how to increase the number of VMware HA primary nodes, posted a guide on adjusting the memory usage of Tomcat (the engine behind VMware vCenter’s web services). This is most likely an unsupported configuration change, but it might be handy in test/development environments.
  • Kevin Goodman also had a good post on configuring EMC PowerPath on Linux on Cisco UCS. I know, I know: this isn’t strictly related to virtualization, but it’s close enough in my book.
  • Alastair of demitasse.co.nz finally declares that the emperor has no clothes when he states why he believes users don’t want a client hypervisor. Personally, I tend to agree with him; I think a hosted hypervisor is far more valuable on the client-side space (especially in the BYOPC scenario). Just because you can run a bare metal hypervisor on your laptop doesn’t necessarily mean that you should run a bare metal hypervisor on your laptop.
  • There seems to be a fair amount of confusion around the vStorage APIs; perhaps this is due to the different subsets of the vStorage APIs. There are the vStorage APIs for Array Integration (VAAI); these were discussed in some detail last week at EMC World. There are also the vStorage APIs for Multipathing (VAMP), which serve to support multipathing plugins like PowerPath/VE. Finally, there are the vStorage APIs for Data Protection (VADP), which are the APIs that serve to replace VMware Consolidated Backup. If you’d like to know more about VADP in particular, this VMware KB article has a list of frequently asked questions about VADP.
  • Tom Howarth has brought to light a potentially serious problem with Changed Block Tracking (CBT), a key part of the vStorage APIs for Data Protection (VADP) that enables lots of backup and recovery applications.
  • While reviewing one of the weekly VMware KB digests, I came across this VMware KB article in which virtual NICs are sometimes detected as removable hardware; this can, in turn, cause the virtual NIC to disappear from the virtual machine. It appears that the only workaround for this behavior is to disable HotPlug.
  • Xsigo recently posted a comparison of their I/O virtualization solution vs. other I/O virtualization solutions. They include FCoE as an I/O virtualization solution, but as I’ve said in the past I don’t consider FCoE an I/O virtualization solution. To include FCoE in this sort of comparison is kind of like saying that an apple is a poor orange because it lacks a thick outer skin. FCoE wasn’t designed to do I/O virtualization—it was designed to carry Fibre Channel traffic over Ethernet. Despite the liberty with which comparative technologies are selected, the article is worth reading nevertheless.

And to round out this issue of Virtualization Short Takes, here are a few “bonus links” I found:

UCS with disjointed L2 domains
The “Mini-Rack” Approach to Blade Server Design
Hot adding or removing a Cisco 3750 from a stack
EMC World Cubed – Here’s all the Video
ESXTOP, My understanding

That’s it for now. I hope you found something useful. Feel free to share more useful links in the comments, and thanks for reading!

Tags: , , , , , , ,

The announcement of VPLEX last week at EMC World 2010 in Boston has introduced lots of new possibilities in how virtualized data centers should be designed and deployed. Of particular interest to a number of people is how VPLEX interacts with VMware Site Recovery Manager (SRM).

Jason Nash of Varrow mulls over VPLEX and VMware SRM in this post, where he asks if VMware SRM is even necessary:

If money was no object, or if you needed VPLEX anyway, where does that leave VMware’s Site Recovery Manager?

While many of Jason’s points are absolutely valid, there are some other very important things to keep in mind:

  1. First, and perhaps most importantly, VPLEX and VMware SRM is not an “OR” discussion, it’s an “AND” discussion. As noted by Chad Sakac’s response, there is a VPLEX SRA currently planned. There are some issues to work through—such as the current requirement by VMware SRM for two vCenter Server instances—but I’m confident these issues will be resolved.
  2. Second, the behavior of VPLEX in the event of an unplanned outage lends itself well to VMware SRM-like behavior. In the event that the VPLEX cluster in Site A (your primary site) loses connectivity to the VPLEX cluster in Site B (your secondary/DR/failover site), a set of rules defined by the user control which cluster will continue to have read/write access to distributed devices. The other cluster will suspend I/O to distributed devices until a user manually resumes I/O. This makes VPLEX behavior in an unplanned outage act a lot like VMware SRM already, and is probably one of the reasons why a VPLEX SRA is under development. As long as there are steps that can be automated in some programmatic way, there is continued value for VMware SRM.
  3. Third, it’s important to keep in mind that the requirements for vMotion over distance include spanned Layer 2 VLANs (using something like Overlay Transport Virtualization from Cisco); VMware SRM has no such requirement. Further, VPLEX is currently limited to synchronous distances; VMware SRM is limited only by the underlying replication mechanisms. This means that VMware SRM continues to be a very valid deployment option even in organizations that may also deploy VPLEX.

Jason does make one point that I absolutely agree with:

But, to me, this yet again adds more reasons to virtualize all the applications that you can. Virtualizing the tier 1 apps reduces your DR plan by a large amount with SRM and this would make it even simpler.

The more applications you virtualize, the more you will be able to take advantage of products like VMware SRM and VPLEX. So what are you waiting for? Get virtualizing!

Tags: , , , , , ,

I’ll have my own write-up on VPLEX soon—I’m taking the time to prepare a thoughtful response rather than just repeating the same things that have been said elsewhere—but until then here are some of the things that are being said elsewhere (note that I’ve included negative coverage as well, to show that I welcome all viewpoints):

Your Virtual Machine Teleporter is ready Are you?
EMC creates VMware-inspired storage system
VPLEX’ing their way to Distributed Data Centers, How EMC stole the show (Part 1-Hardware)
VPLEX undressed
Enterprise Computing: VPLEX – A Dreary Storage Cluster?
Hyper-V + VPLEX = Goodness!
Implementing EMC VPLEX and Microsoft Hyper-V and SQL Server with Enhanced Failover Clustering Support
VPLEX – The buzz from EMC World
VPLEX: Redefining the Boundaries of the Data Center
This is VPLEX
vMotion over Distance support with EMC VPLEX Metro

As I prepare my own full-length post on VPLEX, feel free to post a comment letting me know what you’d most like to know about VPLEX. I’ll do what I can to help you get the information you need!

Tags: , ,

This is the fourth and final post in my series on creating Cisco Unified Computing System (UCS) service profiles. What I’ve tried to do so far in this series is define the various elements that I recommend you create or define before you start trying to create a service profile or service profile template. Certainly you could start creating service profiles right off the bat, but I’ve found it easier to have all these other elements defined first. In part 1 of the series, I showed you what networking-related elements to define. In part 2 of the series, I described the storage-related elements, and in part 3 of the series I covered the remaining elements.

Creating a Service Profile Template

Now that all the prerequisite elements have been defined, follow these steps to create a service profile template (creating an actual service profile would follow the same basic steps):

  1. Right-click on Service Profile Templates and select Create Service Profile Template.
  2. On the first screen, supply a name for the new service profile template and make the template an Updating template. (This will ensure that changes made to the template are propagated to service profiles cloned from this template. You can always unbind the service profile from the template to prevent changes from propagating.)
  3. Select the UUID pool created earlier.
  4. Supply a description for the service profile template. Click Next to continue.
  5. On the second screen, select the desired local disk configuration policy.
  6. Do not select a scrub policy.
  7. Select Expert for the method of configuring SAN connectivity.
  8. Select the WWNN pool created earlier.
  9. Near the bottom of the screen (you might need to scroll down to see it), click Add (with the green plus symbol) to add a vHBA.
  10. Specify a name for the vHBA, such as “vHBA-A”. This name should match the name specified in the boot policy defined earlier.
  11. Select Use SAN Connectivity Template.
  12. Select the vHBA template created earlier and, optionally, an adapter performance profile. Click OK to return to the service profile template wizard.
  13. Repeat to add a second vHBA (if a dual fabric configuration). When using blades that have the Cisco Virtual Interface Controller (VIC) mezzanine card (aka “Palo”), the number of vHBAs can be greater than two.
  14. When you are done adding vHBAs, click Next to continue.
  15. On the third screen, do not select a dynamic vNIC connection policy.
  16. Select Expert for how you would like to configure LAN connectivity.
  17. Click Add (with the green plus symbol) to add a vNIC.
  18. Specify a name for the vNIC (like “vNIC-A”) and select Use LAN Connectivity Template. If you intend to use network boot, the name specified here must match the vNIC name specified in the boot policy.
  19. Select the vNIC template you created earlier and, optionally, select an adapter performance profile.
  20. Click OK to return to the service profile template wizard.
  21. Repeat steps 17 through 20 for each desired vNIC. When using blades that have the Cisco VIC mezzanine card the number of vNICs can be greater than two.
  22. When you are finished adding vNICs, click Next to continue.
  23. Do not make any changes to the vNIC/vHBA placement. Click Next to continue.
  24. Select the boot policy you created earlier. Be sure that the local disk configuration policy selected in step 5 is consistent with the boot policy you just selected. (This means that if the local disk configuration policy is set for No Local Disk, then your boot policy should not specify local disk as a boot method.)
  25. When prompted for server assignment, choose Assign Later and click Next.
  26. At the final screen, choose the IPMI profile you created earlier and the serial-over-LAN policy you created earlier.
  27. Click Finish to complete the creation of the service profile template.

And that’s it! What you should have found in walking through these steps is that the process of actually creating the service profile template was really just a matter of selecting pools and policies that you’d already defined. By doing all the legwork up front, you dramatically simplify the creation of the service profile or service profile template.

Of course, one of the advantages of UCS is its flexibility; meaning you don’t have to do things this way. This is just one way that I’ve found quite useful and, to be honest, a way that makes sense to me.

Courteous comments are always welcome! Feel free to add any clarifications, corrections, or other thoughts in the comments below.

Tags: , ,

Here are a few notes from another Symmetrix session I attended Monday afternoon. It’s not a complete session blog (the presenter was presenting so much information I couldn’t capture it all).

Auto-provisioning groups include storage groups, port groups, and initiator groups. A storage group is just a collection of storage resources that are presented to a cluster of hosts. Similarly, a port group is a collection of front-end ports, and initiator groups are just groups of hosts.

These three groups are combined together into a masking view. The masking view ensures that the desired initiators (in an initiator group) have access to the desired storage resources (in a storage group) through the desired ports (in a port group).

The SYMCLI symaccess command perform all the necessary auto-provisioning functions. You can also use SMC to perform the tasks.

A storage group is simply a collection of Symmetrix logical volumes that are used by an application, a server, or a collection of servers. Storage groups are not only used for presenting storage to hosts, but also for Fully Automated Storage Tiering (FAST) policies.

A port group is just a collection of front-end director ports that are used together.

An initiator group is just a collection of host bus adapters (HBAs) that are used together. This could be all the HBAs on a server, all the HBAs in a group of servers, or some other arbitrary collection of initiators. Initiator groups can contain other initiator groups (called cascaded initiator groups).

Masking views aren’t only for provisioning storage; they are also helpful in making changes to storage presentation. Changes to groups contained in a masking view are automatically reflected in the masking view. Adding a storage device to the storage group, adding a port to the port group, or adding an initiator to the initiator group automatically updates the masking view.

This greatly simplifies storage allocation and de-allocation.

Cascaded initiator groups (initiator groups that contain other initiator groups) can be useful in presenting storage to clustered resources. For example, you might have an initiator group for all the HBAs in a server. This initiator group can then be used to present a boot LUN. The same host-specific initiator group can then be combined with other host-specific initiator groups into a cascaded group, and the cascaded group can be put into a masking view to present storage resources to the entire cluster at once.

Groups (and masking views) can combine single objects (a single HBA, a single front-end director port, a single storage resource) or multiple objects. Either approach is valid.

There is a compatibility mode with auto-provisioning groups that allows continued use of the symmask command. Even with compatibility mode, there are still changes that will likely need to be made to existing scripts.

Tags: , , ,

This is a liveblog of the session titled “Performance Improvements and Enhancements of Enginuity 5874 for Symmetrix VMAX: What’s New Since Last Year”. The presenter’s name is John Aurin. (Note: the product branding is changing from “V-Max” to “VMAX”.)

The session starts with a recap of the Symmetrix VMAX architecture: the directors, front-end ports, the Virtual Matrix connectivity for sharing data between directors, and the various expansion cabinets and drives that are supported. VMAX supports EFDs (200 or 400GB), 15K Fibre Channel drives (146GB to 600GB), 10K Fibre Channel drives (400GB to 600GB), and SATA drives (1TB).

The discussion now shifts to front-end connectivity on the VMAX. For mainframe connectivity, the VMAX supports zHPF for high-performance FICON connectivity. The use of zHPF for high-performance FICON connectivity improves performance (up to 214% vs. the previous-generation DMX4). 8Gb Fibre Channel support is also available with the latest VMAX software.

Some best practices for front end port layout:

  • Go wide across the “A” ports before going deep across the “B” ports.
  • Mix different systems across the A and B ports.
  • Be careful to provide enough ports when doing frame consolidation.
  • Use general recommendations for FICON on VMAX on a port level.

Next the speaker moves into SRDF software compression. This is off by default, but can be enabled and can be turned on for all SRDF modes. It’s typically not recommended for SRDF/S, but is recommended for SRDF/A or Adaptive Copy. The choice to leave software compression off by default is typically because organizations deploy network optimization technologies. It cannot be enabled via SYMCLI; it must be enabled via inline command only.

Cloning is the next topic that the speaker discusses. The latest release of the code supports asynchronous copy on first write with virtual provisioning. This offers a performance improvement versus the previous behavior.

Fully Automated Storage Tiering (FAST) is the next topic to be discussed. The current version of FAST works only on full LUNs and is based on Symmetrix Optimizer. FAST uses multiple algorithms to make the swap/move decisions. It is supported for mainframes on the VMAX. FAST does lock the frame briefly, but it does not hold the lock for the duration of the move/swap. The presenter shows a few graphs and reports that show how FAST handles hot drives or high drive utilization by leveraging EFDs or fast Fibre Channel drives for more demanding workloads.

The latest Symmetrix code release supports up to four daisy chain bays added with four engines. Two engines can now support 1200 drives; four engines can support up to 2400 drives. There is no additional latency on the extra drive bays.

Now the discussion moves to zero reclamation. This is only applicable in a virtual provisioning environment. Zero reclamation is a mechanism to return (de-allocate) thin extents that contain all zeros. Zero reclamation is administrator-initiated via SYMCLI. Zero reclamation is probably a one-time event, unless the host OS on the systems attached to the VMAX have the ability to write zeros (like Sdelete).

Both zero reclamation and write leveling are background tasks and don’t affect replication, snaps, front-end workloads, etc.

Write leveling is used to rebalance allocation and usage across devices when new devices are added to a virtual provisioning pool.

The session wraps up with some general Enginuity performance enhancements from GA to the current code release:

  • 20% improvement in throughput between 5874 GA and 5874 2Q10 SR (these improvements are due to increased use of cache locality)
  • Significant enhancements in rebuild times for RAID 5 (3+1), RAID 5 (7+1), and RAID 6 (6+2) configurations from GA to latest code release

With that, the speaker wrapped up the session with a summary of the various topics discussed: SRDF software compression, FAST, 8Gg FC/FICON support, Zero Reclaim, Write Leveling, and Enginuity improvements (faster rebuild, more snaps, enhanced throughput).

Tags: , , ,

« Older entries