Robust benchmarking for archival storage tiers

Dongjin Lee Lee*, Michael O'Sullivan, Cameron Walker, Monique MacKenzie

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

Until recently archival storage tiers have consisted of tape-based devices with a large storage capacity, but limited I/O performance for data retrieval. However, the growing capacity and shrinking cost of disk-based devices means that disk-based systems are now a realistic option for enterprise archival storage tiers. Given the increasingly diverse options for archival storage, robust benchmarking of possible technologies for archival storage tiers is vital for reducing risk before deployment. This paper investigates benchmarks that utilize archival workloads developed from an analysis of historical file size distributions. These benchmarks not only provide more appropriate measurements of system performance as an archive than traditional approaches, but we also incorporate the variation observed in the historical data to provide "best" and "worst" case workloads for benchmarking. By considering not only the usual workload, but also workloads at either end of the archival workload spectrum, our benchmarking is robust. It provides measures of performance for the envelope of typical archival workload observed from empirical data.

Original languageEnglish
Title of host publicationPDSW'11 - Proceedings of the 6th Parallel Data Storage Workshop, Co-located with SC'11
Pages1-6
Number of pages6
DOIs
Publication statusPublished - 1 Dec 2011
Event6th Parallel Data Storage Workshop, PDSW'11, Co-located with SC'11 - Seattle, WA, United States
Duration: 13 Nov 201113 Nov 2011

Publication series

NamePDSW'11 - Proceedings of the 6th Parallel Data Storage Workshop, Co-located with SC'11

Conference

Conference6th Parallel Data Storage Workshop, PDSW'11, Co-located with SC'11
Country/TerritoryUnited States
CitySeattle, WA
Period13/11/1113/11/11

Keywords

  • archival
  • benchmark
  • disk
  • distribution
  • file size
  • storage system
  • workload

Fingerprint

Dive into the research topics of 'Robust benchmarking for archival storage tiers'. Together they form a unique fingerprint.

Cite this