Fault detection and diagnosis from the logging and bookkeeping data
Xiangliang Zhang, Cecile Germain and Michele Sebag
In: second Enabling Grids for E-sciencE (EGEE) User Forum, May 9-11, 2007, Manchester, UK.
Autonomic Computing (AC) is defined as “computing systems that manage themselves in accordance with high-level objectives from humans”. AC is now a well-established scientific domain and a priority for industry. Automated detection, diagnosis, and ultimately management of software and hardware problems define autonomic dependability. This paper reports on applying state-of-the-art autonomic dependability methods to the Logging and Bookkeeping data, with promising results on detection.