|
The Essentials of Reliability-Centered Maintenance Part II John E. Skog P.E. Doble Consultant
Editors Note: Part I of The Essentials of RCM was printed in the last edition of the Doble Exchange. Included was a discussion of Functions and Failure Modes. In this edition, an overview of Failure Effects, Failure Causes and Task Selection will be presented. Introduction Traditional scheduled maintenance is based on the premise that every item in a complex system has a "right age" at which a complete overhaul is necessary to ensure safety and operation reliability. Historically, we have discovered that many types of failures can not be prevented or their frequency reduced by such maintenance activities no matter how intensively maintenance is performed. In response to this problem, design engineers have mitigated the consequences of such failures by making their designs "failure-tolerant". The recognition that the relation between aging and maintenance is not simple or straight forward and the implementation of improved equipment designs has forced maintenance personnel to re-evaluate the concepts of traditional scheduled maintenance. Reliability Centered Maintenance (RCM) is a process that was developed as a result of the above conditions. The process was so named to emphasize the role that reliability theory and practice plays in properly focusing PM activities on the retention of the equipment’s original design reliability. RCM allows one to obtain the full design operating ability of the equipment. It does not necessarily identify a new series of maintenance tasks. It identifies those tasks, which are most applicable and ineffective, and at the same time provides a framework for developing an optimal preventive maintenance program. Failure Effects and Consequences It is the consequences of a failure that generally determines if preventive maintenance should be considered. If a failure takes place, and no operational, economic or safety consequence results, the need for preventive maintenance is extremely difficult to justify. Conversely, if significant consequences result, effective preventive maintenance procedures must be identified and implemented.
When the loss of a function occurs, the effects of the failure may different depending on one’s frame of reference. In order to understand the consequences of a failure, one must understand the effects that result on a local, system and remote basis. Local effects are the consequences observed at the failure site. Local effects include interactions between the failed component and the surrounding equipment. As an example the loss of oil from a transformer may have negative effects on the local environment if it contaminates other equipment or enters a storm water drainage system. System effects are those impacts that the loss of function poses to the substation or electrical system. As a general rule, all failures that impact the functionality of the system being analyzed are system effects. Continuing with our previous transformer example, the loss of cooling may result in a system consequence of reduced transformer output. Remote affects are those effects on equipment and systems outside the boundaries being analyzed with the RCM process. Remote effects have broader impacts on the substation and utility and may be beyond the immediate scope of the RCM analysis. Continuing with our previous transformer example, the loss of cooling may result in the need to overload other transformers in adjacent substations.
Failure Causes When a failure mode results in a significant failure consequence, the failure mode is considered critical and preventive maintenance justified. In order to prevent the failure from occurring, the specific events that lead to the loss of function must be determined. It is the cause of failure, not the failure mode that must be addressed by preventive maintenance. When the exact cause of a functional failure is determined, an appropriate and cost-effective PM task can then be identified. Generally, only when the failure cause is dominant, will a PM task be considered. If the cause is not dominant, the chances of the PM task being effective are small. The resulting preventive maintenance tasks are focused on predicting, preventing or resolving the cause of the failure. Task Selection Task selection involves the identification of appropriate tasks to address the causes of critical failure modes identified during the RCM analysis. In a pure sense, selecting tasks involves the identification of technically and cost effective maintenance actions that are best suited at predicting, preventing or mitigating the effect of a functional failure. In a broader sense, task selection also includes redesign, failure finding procedures and corrective maintenance. The RCM process for identifying effective PM tasks is structured and rigorous. The Keys of RCM The five features discussed above are the essentials of RCM. Identifying and understanding those functions critical to the operation of the substation or electrical system is the key to RCM. Once understood, continuous availability of these critical and important functions then becomes the foremost goal of the maintenance program. By fully understanding the events or causes that lead to the loss of these critical functions, one can develop focused maintenance activities that predict, prevent or resolve these failures. Finally, by determining the operational effects resulting from the loss of these critical functions, one can then quantify the value of preventing the loss and thus ensure that all preventive and mitigating actions are cost effective. In the Next Doble Exchange: Task Selection Logic
Editors Note: John Skog is a Doble Consultant in the area of Maintenance Management. He introduced the Client Group to RCM in 1993. John performs RCM training and consulting services through Doble and is available to assist clients in the refinement of their current maintenance programs. |