Elsevier

Information Sciences

Exploiting the performance gains of modern disk drives by enhancing data locality

Abstract

Due to the widening performance gap between RAM and disk drives, a large number of I/O optimization methods have been proposed and designed to alleviate the impact of this gap. One of the most effective approaches of improving disk access performance is enhancing data locality . This is because the method could increase the hit ratio of disk cache and reduce the seek time and rotational latency. Disk drives have experienced dramatic development since the first disk drive was announced in 1956. This paper investigates some important characteristics of modern disk drives. Based on the characteristics and the observation that data access on disk drives is highly skewed, the frequently accessed data blocks and the correlated data blocks are clustered into objects and moved to the outer zones of a modern disk drive. The idea attempts to enhance spatial locality, improve the efficiency of aggressive sequential prefetch, and take advantage of Zoned Bit Recording (ZBR). An experimental simulation is employed to investigate the performance gains generated by the enhanced data locality. The performance gains are analyzed by breaking down the disk access time into seek time, rotational latency, data transfer time, and hit ratio of the disk cache. Experimental results provide useful insights into the performance behaviours of a modern disk drive with enhanced data locality.

Introduction

The storage hierarchy in current computer architectures is designed to take advantage of data access locality to improve overall performance. Each level of the hierarchy has higher speed, lower latency, and smaller size than lower levels. For decades, the hierarchical arrangement has suffered from significant bandwidth and latency gaps among processor, RAM, and disk drive [27], [32], [38]. The performance gap between processor and RAM has been alleviated by fast cache memories. However, the performance gap of RAM to disk drive has been widened to six orders of magnitude in 2000 and will continue to widen by about 50% per year [38].

Since the first disk drive was announced in 1956, disk drives have grown by over six orders of magnitude in density and over four orders in performance [29]. Over the last decade, areal density, track density and linear density have achieved 100%, 50%, and 30% growth respectively. Revolutions Per Minute (RPM) has been increased from 3600 in 1981 to 15,000 in 2000. Due to the significant growth in both the linear density and RPM, Internal Data Rate (IDR) has been growing at an exponential rate of 40% each year over the past 15 years [11]. Unfortunately, the basic mechanical architecture in disk drive has not changed too much. Slowed by the mechanical delays, disk access time was improved only about 8% per year [12]. Therefore, the disk I/O subsystem is repeatedly identified as a major bottleneck to system performance in many computing systems. The widening gap will be more serious when disk drives reach its physical limits due to the super paramagnetic effect.

The increasing performance gap between RAM and disk drive has long been a primary obstacle to improve overall system performance. To alleviate the impact of this widening gap, a lot of research efforts have been invested to improve disk access time. We only mention some of them which are related to our work in this paper. Ruemmler and Wilkes [36] employed disk shuffling to move frequently accessed data into the centre of a disk drive and organize the data into an organ pipe to reduce mean seek distances. They constructed a repeatable simulation environment across a range of workloads and disk drive types for comparing different shuffling algorithms. Their research indicated that the benefits are small to moderate, but are likely to be much larger with file systems that do not do a good initial data placement. Akyürek and Salem [1] presented an adaptive technique to copy a small number of frequently referenced disk blocks from their original locations to a reserved space near the middle of the disk. Their experiments showed that seek times are reduced 30–85% (depending on workloads) by adaptively rearranging about 3% of the data on the disk drive. FS2 [16] dynamically places multiple copies of data in file system's free blocks according to the disk access patterns observed at runtime to reduce the head positioning latencies. Because one or more replicas can be accessed in addition to their original data block, choosing the nearest replica that provides the fastest access can significantly improve disk I/O performance.

The above methods can substantially reduce seek time or rotational latency. However, those approaches may not be effective on modern disk drives due to evolutional disk drive technologies. First of all, disk access time mainly consists of seek time, rotational latency and data transfer time. Due to the advance of Voice Coil Motors (VCM) electric drive and the increasing track density, long distance seeks of modern disk drives may not involve enormously more overhead than short ones. This results in a significant proportion decrease of seek time in disk access time. Secondly, due to geometric features, outer tracks on disk platters are much larger than the inner tracks. Modern disk drives employ a technique called Zoned Bit Recording (ZBR) to take advantage of its geometric features to increase disk capacity by varying the number of sectors per track with the distance from the spindle [30]. This characteristic results in much higher data transfer rate of outer zones than that of inner zones. Finally, the organ pipe is formed by placing the most frequently accessed cylinder in the middle of the disk drive, the next most frequently accessed cylinders on either side of the middle cylinder, and so on. This arrangement is provably optimal for independent disk accesses [36]. However, block correlations are common semantic patterns in storage systems, so the organ pipe arrangement could destroy the original block correlations [21]. The combination of these three reasons significantly counteracts the performance improvement of data reorganization.

This paper explores the performance gains of data reorganization based on a modern disk drive. Some new characteristics of modern disk drives, which are related to data reorganization, are reviewed. Based on the characteristics, blocks, which are correlated to the frequently accessed block, are clustered into objects, and the objects are moved to the outer zones of the disk drive. The aggressive sequential prefetch of the modern disk drive is enhanced and the block correlations remain due to the frequency based objects. Disk access time is broken into four basic components: seek time, rotational latency, data transfer time, and hit ratio of disk cache, each one is analyzed separately to determine its actual performance gains. Experimental results provide useful insights into the performance behaviour of block reorganization of a modern disk drive.

The remainder of this paper is organized as follows. An overview of modern disk drives is introduced in Section 2. Section 3 describes some important features of workload. The motivations of this paper are depicted in Section 4. Section 5 illustrates how to implement the block reorganization. The simulation environment and experimental validation are depicted in Section 6. Section 7 concludes the paper with remarks on main contributions and indications.

Section snippets

Modern disk drive overview

Disk access time T access is mainly composed of seek time T seek , rotational latency T rotate and data transfer time T transfer . The seek time measures the time for the disk head to move to a specified track. When the disk head arrives at the required track, the time spent on rotating the required sector to appear underneath the disk head is called rotational latency. The data transfer time is the amount of data divided by data transfer rate. T access is expressed as follows: T access = T seek + T rotate + T

Data access locality

Data locality is a measure of how well data can be selected, retrieved, compactly stored, and reused for subsequent accesses. In general, there are two basic types of data locality: temporal, and spatial. The temporal locality denotes that a data is accessed at one point in time will be accessed in the near future. The temporal locality relies on the access patterns of different applications and can therefore change dynamically. The spatial locality defines that the probability of accessing a

Motivations

Due to the highly skewed access patterns, if the frequently accessed blocks are distributed across the whole disk drive, long seek distances from each other may be involved. Most modern disk drives employ aggressive sequential prefetch to exploit spatial locality and improve I/O access performance, but the effect of the prefetch could be significantly reduced in the above scenario, because a large number of blocks with low access frequency could be prefetched to disk cache, thus reducing the

Implementation

Our Implementation is composed of a frequency tracker, a mapping table, and a data migration mechanism. The three major components are illustrated in Fig. 2. The frequency tracker monitors the stream of I/O requests. Periodically, it produces or updates a mapping table which contains a list of frequently accessed objects ordered by frequency. A newly incoming request is compared against the mapping table. The request will be redirected to a new location if the request is hit in the table.

Experimental validation

A real implementation of the comprehensive and complicated system would be difficult and take an extremely long time. Trace-driven simulation is a principal approach to evaluate the effectiveness of our proposed design, because it is much easier to change parameters and configurations in comparison with a real implementation. The trace driven simulation is a form of event driven simulation in which the events are taken from a real system that operates under conditions similar to the ones being

Discussion and conclusion

Since the first disk drive was announced in 1956, disk drives have experienced dramatic development, but still lagged far behind the performance improvement of processor and RAM. A lot of I/O optimization methods have been devised to alleviate the gap between RAM and disk driver. One of the most effective approaches of improving disk access performance is increasing the hit ratio of disk cache and thus reducing the number of physical disk I/Os.

We reviewed some important characteristics of

Acknowledgements

I would like to thank the anonymous reviewers for helping me refine this paper. Their constructive comments and suggestions are very helpful. Thanks also to my friends Michael Sandling and Andrew Clemens who polished the language of this article. In addition, I am grateful to Prof. Witold Pedrycz for giving me the opportunity to clarify my thoughts.

References (43)

  • et al.

    Kernel class-wise locality preserving projection

    Information Sciences

    (2008)

  • A.J.T. Lee et al.

    An efficient algorithm for mining frequent inter-transaction patterns

    Information Sciences

    (2007)

  • C. Hsu et al.

    Hierarchical clustering of mixed data based on distance hierarchy

    Information Sciences

    (2007)

  • Y. Deng et al.

    EED: energy efficient disk drive architecture

    Information Sciences

    (2008)

  • S. Akyürek et al.

    Adaptive block rearrangement

    ACM Transactions on Computer Systems

    (1995)

  • M. Baker, J. Hartman, M. Kupfer, K. Shirriff, J. Ousterhout, Measurements of a distributed file system, in: Proceedings...
  • A.J.C. Bik, Reshaping access patterns for improving data locality, in: Proceedings of the Sixth Workshop on Compilers...
  • J.S. Bucy, G.R. Ganger, The DiskSim simulation environment version 3.0 reference manual, Technical Report...
  • Cello 1999 traces, Storage Systems Program HP Laboratories....
  • Y. Deng et al.

    Optimal clustering size of small file access in network attached storage device

    Parallel Processing Letters

    (2006)

  • G.R. Ganger, B.L. Worthington, R.Y. Hou, Y.N. Patt, Disk subsystem load balancing: disk striping vs. conventional data...
  • G. Ganger, Blurring the line between oses and storage devices, Technical report, Carnegie Mellon University, December...
  • G.R. Ganger, M.F. Kaashoek, Embedded inodes and explicit grouping: exploiting disk bandwidth for small files, in:...
  • Hitachi Global Storage Technologies – HDD Technology Overview Charts....
  • W.W. Hsu et al.

    The performance impact of I/O optimizations and disk improvements

    IBM Journal of Research and Development

    (2004)

  • W.W. Hsu et al.

    The automatic improvement of locality in storage systems

    ACM Transactions on Computer Systems

    (2005)

  • W.W. Hsu et al.

    Characteristics of I/O traffic in personal computer and server workloads

    IBM Systems Journal

    (2003)

  • H. Huang, W. Hung, K.G. Shin, FS2: dynamic data replication in free disk space for improving disk performance and...
  • R. Karedla et al.

    Caching strategies to improve disk system performance

    Computer

    (1994)

  • Y. Kim, S. Gurumurthi, A. Sivasubramaniam, Understanding the performance–temperature interactions in disk I/O of server...
  • S.H. Kim et al.

    Zoned-RAID

    ACM Transactions on Storage

    (2007)

  • Cited by (15)

    View full text