Available at: http://digitalcommons.calpoly.edu/theses/768
Date of Award
MS in Computer Science
This thesis explores hardware command capture as a viable means of analyzing real world hard drive usage. Hardware command capture provides insight into the IO stack where current tools fail to reach. A software platform is presented which provides trace conversion and analysis capabilities. This platform is written in Python and designed to handle traces of arbitrary size while being easily extensible for future projects to build upon. A novel Sequential Stream Detection algorithm built upon the software platform is then presented. This algorithm detects application level sequential streams and provides interesting insight into the sequential nature of the applications analyzed. The software platform and Sequential Stream Detector were validated and run against a range of workloads including video playback, large project compilations, and synthetic benchmarks. Where applicable, each workload was run on multiple file systems (ext2, ext3, ext4, Btrfs) to compare the effects of stream allocation across file systems. It is shown that stream allocation is consistent across file systems suggesting stream detection may be a valuable workload identification tool.