Preprint version. Published in Proceedings fromt he International Conference on World Wide Web: Lyon, France, April 16, 2012, pages 13-22.
Copyright © ACM 2012. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in WWW 2012 Companion,(April 16, 2012) and is available at http://dx.doi.org/10.1145/2187980.2187983.
Over the past couple of years, Netflix has significantly expanded its online streaming offerings, which now encompass multiple delivery platforms and thousands of titles available for instant view. This paper documents the design and development of an outage detection system for the online services provided by Netflix. Unlike other internal quality control measures used at Netflix, this system uses only publicly available information: the tweets, or Twitter posts, that mention the word “Netflix,” and has been developed and deployed externally, on servers independent of the Netflix infrastructure. This paper discussed the system and provides assessment of the accuracy of its real-time detection and alert mechanisms.