I’m relatively new to NetApp deduplication (formerly A-SIS), so this article won’t be an advanced treatise on NetApp deduplication or its deep inner workings. Instead, this is intended to be a quick guide to setting up NetApp deduplication for others, like myself, who may be familiar with Data ONTAP but not necessarily deduplication.
Obviously, the first step will be to ensure that your NetApp storage system is licensed for deduplication. As of March 10, NetApp made the NearStore option, which was a prerequisite for deduplication, free. Yes, you read that right: free. Since NearStore is a prerequisite, you’ll need to be sure to license that first:
license add <Code for NearStore>
license add <Code for Deduplication>
Once deduplication is licensed, then you can enable it on a per-volume basis using the “sis on” command:
sis on /vol/<volname>
Note, however, that the volume cannot exceed a certain size, based on the storage system model, in order for deduplication to work. These volume size limits are laid out in TR3505. Note that the volume must never have been any bigger than the size limits described, so this means you can’t size it down to the limits set forth and then run deduplication.
Once it’s running, you can check the status with:
sis status /vol/<volname>
After it’s finished running, you can see your space savings like this:
df -s /vol/<volname>
After running deduplication on a small NFS volume that housed only three VMs, the “df -s” command showed a space savings of 64%. That’s pretty impressive!
Moving forward, deduplication will run automatically every night at midnight, as shown by this command:
sis config /vol/<volname>
That should be enough to get most everyone started. Feel free to post comments or corrections below.
UPDATE: I’ve corrected the broken TR-3505 link. Thanks, Tom!
-
Scott,
Nearstore is free, but how much does the Deduplication license ballpark for, any idea, just a rough estimate is all?
Thanks,
Ron
-
Cool posting, Scott!
When I was working at the university, we beta’d A-SIS about a year ago. We were using VMFS datastores on FCP luns, so the savings were not apparent to the VirtualCenter Admin as far as free space goes that he might use. (It turns out that it is possible to use the extra volume free space to store extra snapshots or more headroom for SnapMirror, but you may get some real WOW from the VM admins to show it off live on NFS stores!)
Are you seeing the NFS volume free space increase after you run the A-SIS process, or is a management/vpxa service restart necessary? Also, was this filesystem ACTIVE during your A-SIS deduplication processing window? What was the CPU impact, if so? Just curious… (we were seeing up to 83% by separating Windows temp/swap to a VMFS datastore on a volume where snapshots were disabled - no sense trying to dedupe that stuff)
Wes
-
Correct. Both licenses are free.
-
Indeed, I am. Funny thing is, that stuff is VERY well implemented internally. I hear you’re out here on occasion, so look me up, or contact me via email to follow up.
wes
-
Is this functionality for OnTAP 7.3 only, or can we still get it with 7.2.3? We have a FAS3020 that I’m thinking this should work on, but I’m having a hard time coming up with the “free” license codes.
I thought I’d just contact support, plug the codes in and forge ahead, but that’s not working out the way I thought it would.
-
After dededuplicating existing data with ASIS the “old” stuff is usually still hidden in snapshots. These snapshots have to expire (if they were automatic) and then space savings will increase even more. I started a series with our experiences with VMware / ASIS on my personal blog (http://21stcenturystorage.cebis.net/).
BTW great work, Scott - your blog is a valuable source for many admins in our company
-
Any clues if Ontap 7.3 will remove/change any of the asis/flexvol limitations?
The limits according to tr-3505 is just plain artificial and silly how low. They would force us into extra managment work just to be able to use ASIS. Being used to Datadomains, the Netapp-numbers seems like “buy bigger that you need” practise from the company.
-
Scott,
This post was invaluable in letting me know that A-SIS licensing was now free and how to get it set up. I had recently upgraded our FAS3020’s to OnTAP 7.2.4 and our ESX servers to 3.5.0 so I was ready to rock. On Friday I created my first A-SIS volumes and started to Storage VMotion VMs into them. I’m using iSCSI and I’m turning off Space Reservation for the LUNs (and experimenting with (file Space Reservation for the volumes) to reap the storage savings as well as give the A-SIS metadata the room it needs to work. I have all VMware workload on these, so of course I’m seeing a great return rate (40+% so far). Thanks!
-
Hey Scott,
First off, I echo several others comments; this is a great resource for all of us NetApp devotees out there.Can you provide or point me to any resources that detail the affect SIS has on *existing* SnapMirror operations? I’ve found plenty about SIS + SM = good, but most sites talk about turning on SM after SIS.
We are in the process of reorganizing several sets of data via SM that I can’t afford to stop and restart. However, because of savings we’ve seen (up to 71% - and that’s not even VMware!), we’d like to turn it on everywhere.
Theory says it should work just fine and begin transferring the deduped blocks with a big hit in the snap reserve until the exising snaps expire, but unfortunately, I don’t have the capacity to test that theory.
-
Good post. Thanks for the help.
-
good post, just wanted to update the broken TR3505 link in the middle of the article.
here’s the updated link: http://media.netapp.com/documents/tr-3505.pdf
-
Hi Scott,
recently our storage provider in the Netherlands upgraded our Ontap to version 7.2.6 on our NetApp NearStore R200 in order to activate de-duplication.
We use the R200 as our backup storage for Windows data , using Acronis v9 software (in fact it is creating an image file of each disk drive).
Each month a full Acronis backup (which is a block based backup ) of all Windows servers is (C: , D: and E: drive are written to the R200 Windows volume (using CIFS) and subsequently the incrementals starts untill the beginning of the next month then the cyclus restarts again with a full backup.I would assume that specially the C: drives (only Windows OS) of each backup do not change a lot, so a high dedup ratio was expected.
Unfortunately I find on all volumes a space saving of less than 1 percent
Could you provide me a reason for this poor de-duplication ratio ?Thanks in advace, Stefan Kaihatu
-
Stefan, this may be a bit late, and I’ll admit to not knowing how Acronis stores backups.
However, if the backups are compressed to save space, this will mean that it will pretty much not use dedupe at all. Even a very small change to the data could mean a completely different (no identical blocks) compressed backup file.




20 comments
Comments feed for this article
Trackback link: http://blog.scottlowe.org/2008/03/31/quick-guide-to-setting-up-netapp-deduplication/trackback/