How I Learned to Stop Worrying and Love Storage Automation
The first storage performance horseman is spindles: If you don’t have enough disk units, performance will suffer. I have been laying out storage on enterprise arrays since the dark ages, and one of the first lessons I learned was allocating data to avoid hotspots. I remember spending hours back in the 1990’s hunched over custom Excel spreadsheets trying to get my storage layout just right, balancing the workload across every available disk.

This is how we used to avoid hotspots in 1998: Carefully planning every detail of the storage layout.
Each disk drive consists of a spindle of spinning platters with read/write heads move back and forth. Each time you access a piece of data that’s not in cache, the drive moves its arm over the platter to access the correct piece of data. Since each drive can only access one piece of data at once, and since caches can only hold so much data, tuning a system to minimize the number of requests per drive is essential.
Manual storage array layout was an art, but we never fooled ourselves into thinking our designs were optimal. There were just too many intractable problems, so we had to compromise at every turn:
- We usually had no performance data to base our layout decisions on, so we had to rely on guesses and rules of thumb
- Workloads tend to change over time and manual layouts are painful to modify
- The smallest unit of allocation was an entire LUN or drive, so even the best disk layout mixed hot and rarely-accessed data everywhere
- Much of the allocated space was unused, so we used expensive disks to store nothing
One might think that, 10 years later, advances in technology would have solved these basic issues. But for many people using many of the so-called modern mainstream enterprise storage systems, these problems remain. Continue Reading »
Tags: 3PAR, Compellent, CX, deduplication, Dell, DMX, DMX-5, EMC, EqualLogic, flash, HDS, HP, IBM, LeftHand, NetApp, optimization, Optimizer, performance, RAID, SSD, SVC, thin provisioning, tiered storage, USP-V, V-Series, virtual provisioning, wide striping