

Abstract

  Dependable complex systems often operate under variable and non-stationary conditions, which call for efficient and extensive monitoring and error detection solutions. Among the many available solutions, this paper focuses on anomaly detection techniques, which monitor the evolution of specific indicators over time to identify anomalies, i.e. deviations from the expected operational behavior. In dependable, fault-tolerant systems, the timely identification of anomalies makes it possible to detect errors in the services promptly and react appropriately. In this paper, we investigate whether the evolution of indicators over time can be monitored using the random walk model, applied to indicators belonging to the Operating System, specifically Linux Red Hat EL5 in our study. The approach is based on the experimental evaluation of a large set of heterogeneous indicators, acquired under different operating conditions, in terms of both workload and faultload, on an air traffic management target system. The statistical analysis follows a best-fitting approach that aims to minimize the integral distance between the empirical data distribution and a set of reference distributions. The outcomes of the analysis show that adopting a random walk model to develop an anomaly detection monitor that operates at Operating System level for critical systems is promising. Moreover, standard distributions such as Laplace and Cauchy, rather than Normal, should be used to set the thresholds of the monitor. Further studies involving a new application, a different Operating System and a new layer (an Application Server) will make it possible to verify whether the approach generalizes to other fault-tolerant systems, monitored layers and sets of indicators.
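
The pipeline summarized in the abstract (random walk increments, best fit by integral distance, thresholds from a heavy-tailed distribution) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the simple parameter estimators (median, mean absolute deviation, half the interquartile range), the midpoint approximation of the integral distance between cumulative distribution functions, and the `alpha` significance level are all assumptions made for illustration, and the data are synthetic rather than real Operating System indicators.

```python
import math
import random
import statistics


def increments(series):
    """Step-to-step differences; under a random walk these are the noise terms."""
    return [b - a for a, b in zip(series, series[1:])]


def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def laplace_cdf(x, mu, b):
    z = (x - mu) / b
    return 0.5 * math.exp(z) if z < 0 else 1.0 - 0.5 * math.exp(-z)


def cauchy_cdf(x, x0, gamma):
    return 0.5 + math.atan((x - x0) / gamma) / math.pi


def fit_candidates(steps):
    """Fit each reference distribution with simple estimators (an assumption).

    Assumes the steps are not all identical, so every scale is positive.
    """
    med = statistics.median(steps)
    q1, _, q3 = statistics.quantiles(steps, n=4)
    mu, sigma = statistics.fmean(steps), statistics.pstdev(steps)
    b = statistics.fmean([abs(s - med) for s in steps])  # Laplace scale
    gamma = (q3 - q1) / 2.0                              # Cauchy scale
    return {
        "normal": lambda x: normal_cdf(x, mu, sigma),
        "laplace": lambda x: laplace_cdf(x, med, b),
        "cauchy": lambda x: cauchy_cdf(x, med, gamma),
    }


def integral_distance(steps, cdf):
    """Approximate the integral of |F_empirical - F_reference| over the sample range."""
    xs = sorted(steps)
    n = len(xs)
    total = 0.0
    for i in range(n - 1):
        mid = 0.5 * (xs[i] + xs[i + 1])
        # The empirical CDF equals (i + 1) / n between consecutive sorted samples.
        total += abs((i + 1) / n - cdf(mid)) * (xs[i + 1] - xs[i])
    return total


def best_fit(steps):
    """Return the name of the closest reference distribution and all distances."""
    distances = {name: integral_distance(steps, cdf)
                 for name, cdf in fit_candidates(steps).items()}
    return min(distances, key=distances.get), distances


def laplace_thresholds(steps, alpha=0.01):
    """Two-sided band that a Laplace-distributed step leaves with probability alpha."""
    med = statistics.median(steps)
    b = statistics.fmean([abs(s - med) for s in steps])
    half_width = b * math.log(1.0 / alpha)  # from P(|X - med| > t) = exp(-t / b)
    return med - half_width, med + half_width


if __name__ == "__main__":
    rng = random.Random(0)
    value, series = 100.0, [100.0]
    for _ in range(2000):
        # The difference of two exponential draws is Laplace-distributed.
        value += rng.expovariate(1.0) - rng.expovariate(1.0)
        series.append(value)
    steps = increments(series)
    name, distances = best_fit(steps)
    low, high = laplace_thresholds(steps)
    print(name, distances)
    print("step of 50 flagged as anomalous:", not (low <= 50.0 <= high))
```

In such a monitor, each new indicator sample would be compared with the previous one, and the step would be flagged when it falls outside the band returned by `laplace_thresholds`. A Normal-based band fitted to the same heavy-tailed steps would place its thresholds differently, which is one way to read the abstract's recommendation to prefer Laplace or Cauchy.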


Original document

The different versions of the original document can be found in:

https://api.elsevier.com/content/article/PII:S0263224115005965?httpAccept=text/xml
http://dx.doi.org/10.1016/j.measurement.2015.11.010 under the license https://www.elsevier.com/tdm/userlicense/1.0/
https://flore.unifi.it/handle/2158/1015007
http://rcl.dsi.unifi.it/publication/show/795-2
https://core.ac.uk/display/160049274
https://academic.microsoft.com/#/detail/2212321128

Document information

Published on 01/01/2016

Volume 2016, 2016
DOI: 10.1016/j.measurement.2015.11.010
Licence: Other
