Back to Search Start Over

Leveraging Website Popularity Differences to Identify Performance Anomalies

Authors :
Chadi Barakat
Mark Crovella
Renata Teixeira
Giulio Grassi
Middleware on the Move (MIMOVE)
Inria de Paris
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
Design, Implementation and Analysis of Networking Architectures (DIANA)
Inria Sophia Antipolis - Méditerranée (CRISAM)
Computer Science Department [Boston] (Boston University)
Boston University [Boston] (BU)
Inria Project Lab BetterNet BetterNet
This work was supported by the French National ResearchAgency under grant BottleNet no. ANR-15-CE25-0013 and by Inria within the Project Lab BetterNet. Mark Crovella was supported by NSF grant CNS-1618207, and by grants from Inria Paris, U. Sorbonne-Pierre et Marie Curie LIP6, and the Laboratory of Information, Networking and Communication Sciences (LINCS)
ANR-15-CE25-0013,BottleNet,Comprendre et diagnostiquer les dégradations des communications de bout en bout dans l'Internet(2015)
Source :
INFOCOM 2021-IEEE International Conference on Computer Communications, INFOCOM 2021-IEEE International Conference on Computer Communications, May 2021, Vancouver / Virtual, Canada. ⟨10.1109/INFOCOM42981.2021.9488832⟩, INFOCOM, HAL
Publication Year :
2021
Publisher :
HAL CCSD, 2021.

Abstract

International audience; Web performance anomalies (e.g. time periods when metrics like page load time are abnormally high) have significant impact on user experience and revenues of web service providers. Existing methods to automatically detect web performance anomalies focus on popular websites (e.g. with tens of thousands of visits per minute). Across a wider diversity of websites, however, the number of visits per hour varies enormously, and some sites will only have few visits per hour. Low rates of visits create measurement gaps and noise that prevent the use of existing methods. This paper develops WMF, a web performance anomaly detection method applicable across a range of websites with highly variable measurement volume. To demonstrate our method, we leverage data from a website monitoring company, which allows us to leverage cross-site measurements. WMF uses matrix factorization to mine patterns that emerge from a subset of the websites to "fill in" missing data on other websites. Our validation using both a controlled website and synthetic anomalies shows that WMF's F1-score is more than double that of the state-of-the-art method. We then apply WMF to three months of web performance measurements to shed light on performance anomalies across a variety of 125 small to medium websites.

Details

Language :
English
Database :
OpenAIRE
Journal :
INFOCOM 2021-IEEE International Conference on Computer Communications, INFOCOM 2021-IEEE International Conference on Computer Communications, May 2021, Vancouver / Virtual, Canada. ⟨10.1109/INFOCOM42981.2021.9488832⟩, INFOCOM, HAL
Accession number :
edsair.doi.dedup.....6377dd7a545c32b73207775471823913
Full Text :
https://doi.org/10.1109/INFOCOM42981.2021.9488832⟩