A colleague – Isilon’s Rob Peglar – pointed out an interesting paper written by researchers at the University of Toronto in collaboration with Microsoft. The paper makes a case for using an HDD to cache writes to an SSD to improve storage system performance.
“Wait a minute!” you say. “An HDD as a cache for an SSD? This can’t be possible!”
According to the paper: Extending SSD Lifetimes with Disk-Based Write Caches (Soundararajan et al) an HDD can be used to both extend the life of an SSD and to improve its write speed. Even the paper states that this is: “somewhat contrary to intuition.”
The research tapped into the HDD’s high sequential write speed, a specification that is as good as most SSDs, to write a journal file that would be loaded into the SSD according to certain carefully designed rules. Writes are coalesced, and repetitive writes to the same block are collapsed into a single write. Data is written from the log into the SSD sequentially, which speeds this operation while reducing the SSD’s write amplification.
With reduced write traffic the SSD performs fewer background operations, which improves the performance of not only writes but also reads.
Of course, there is a serious penalty if the system tries to read data whose most recent copy is in the HDD rather than the SSD. The management program, dubbed “Griffin,” is tuned to minimize the occurrence of such events.
The university measured a reduction of 52% in the number of writes presented to the SSD, with average I/O latency reduced by 56%.
Objective Analysis has performed its own tests of performance in systems based on SSDs and HDDs together, and has determine not only that SSDs are an adjunct to HDDs but not a replacement, but we also find that SSDs pose more of a threat to DRAM than they do to HDDs. See How PC NAND will Undermine DRAM at the Objective Analysis website Reports page.
Hi Jim – Great website – I’ve created a link on our Link Exchange page at http://www.futurestorage.co.uk/article/LinkExchange – also check out our blog at http://www.blog.solidstatediskshop.com/2011/the-virtualisation-journey/ on the GreenBytes HA-3000 which uses 2x SSDs and 13x HDDs to achieve 150,000 IOPS – the SSDs do more than just cache as they hold a database of the location of files on the HDDs that allow inline compression and deduplication – I wonder if this technology will trickle down to the desktop… interesting technology advances..
Best Regards
Phil, Thanks! That GreenBytes system sounds like a tiger.
We have a lot to learn about how best to use HDDs and SSDs together both in the data center as well as in the PC, but this is an approach that just might catch on!