Human reliability

Human reliability is related to the field of human factors engineering, and refers to the reliability of humans in fields such as manufacturing, transportation, the military, or medicine. Human performance can be affected by many factors such as age, circadian rhythms, state of mind, physical health, attitude, emotions, propensity for certain common mistakes, errors and cognitive biases, etc.

Human reliability is very important due to the contributions of humans to the resilience of systems and to possible adverse consequences of human errors or oversights, especially when the human is a crucial part of the large socio-technical systems as is common today. User-centered design and error-tolerant design are just two of many terms used to describe efforts to make technology better suited to operation by humans.

Human Reliability Analysis Techniques
A variety of methods exist for Human Reliability Analysis (HRA) (see Kirwan and Ainsworth, 1992; Kirwan, 1994). Two general classes of methods are those based on probabilistic risk assessment (PRA) and those based on a cognitive theory of control.

PRA-Based Techniques
One method for analyzing human reliability is a straightforward extension of probabilistic risk assessment (PRA): in the same way that equipment can fail in a plant, so can a human operator commit errors. In both cases, an analysis (functional decomposition for equipment and task analysis for humans) would articulate a level of detail for which failure or error probabilities can be assigned. This basic idea is behind the Technique for Human Error Rate Prediction (THERP) (Swain & Guttman, 1983). THERP is intended to generate human error probabilities that would be incorporated into a PRA. The Accident Sequence Evaluation Program (ASEP) Human Reliability Procedure is a simplified form of THERP; an associated computational tool is Simplified Human Error Analysis Code (SHEAN) (Wilson, 1993). More recently, the US Nuclear Regulatory Commission has published the Standardized Plant Analysis Risk (SPAR) human reliability analysis method also because of human eror (SPAR-H) (Gertman et al, 2005).

Cognitive Control Based Techniques
Erik Hollnagel has developed this line of thought in his work on the Contextual Control Model (COCOM) (Hollnagel, 1993) and the Cognitive Reliability and Error Analysis Method (CREAM) (Hollnagel, 1998). COCOM models human performance as a set of control modes -- strategic (based on long-term planning), tactical (based on procedures), opportunistic (based on present context), and scrambled (random) -- and proposes a model of how transitions between these control modes occur. This model of control mode transition consists of a number of factors, including the human operator's estimate of the outcome of the action (success or failure), the time remaining to accomplish the action (adequate or inadequate), and the number of simultaneous goals of the human operator at that time. CREAM is a human reliability analysis method that is based on COCOM.

Related Techniques
Related techniques in safety engineering and reliability engineering include Failure mode and effects analysis, Hazop, Fault tree, and SAPHIRE: Systems Analysis Programs for Hands-on Integrated Reliability Evaluations.

Human Error
Human error has been cited as a cause or contributing factor in disasters and accidents in industries as diverse as nuclear power (e.g., Three Mile Island accident), aviation (see pilot error), space exploration (e.g., Space Shuttle Challenger Disaster), and medicine (see medical error). It is also important to stress that "human error" mechanisms are the same as "human performance" mechanisms; performance later categorized as 'error' is done so in hindsight (Reason, 1991; Woods, 1990): thereofore actions later termed "human error" are actually part of the ordinary spectrum of human behaviour. The study of absent-mindedness in everyday life provides ample documentation and categorization of such aspects of behavior. Recently, human error has been reconceptualized as resiliency to emphasize the positive aspects that humans bring to the operation of technical systems (see Hollnagel, Woods and Leveson, 2006).

Categories of Human Error
There are many ways to categorize human error (see Jones, 1999).


 * exogenous versus endogenous (i.e., originating outside versus inside the individual) (Senders and Moray, 1991)
 * situation assessment versus response planning (e.g., Roth et al, 1994) and related distinctions in
 * errors in problem detection (also see signal detection theory)
 * errors in problem diagnosis (also see problem solving)
 * errors in action planning and execution (Sage, 1992) (for example: slips or errors of execution versus mistakes or errors of intention; see Norman, 1988; Reason, 1991)
 * By level of analysis; for example, perceptual (e.g., optical illusions) versus cognitive versus communication versus organizational.

The cognitive study of human error is a very active research field, including work related to limits of memory and attention and also to decision making strategies such as the availability heuristic and other cognitive biases. Such heuristics and biases are strategies that are useful and often correct, but can lead to systematic patterns of error.

Misunderstandings as a topic in human communication have been studied in Conversation Analysis, such as the examination of violations of the Cooperative principle and Gricean maxims.

Organizational studies of error or dysfunction have included studies of safety culture. One technique for organizational analysis is the Management Oversight Risk Tree (MORT) (Kirwan and Ainsworth, 1992; also search for MORT on the FAA Human Factors Workbench.

Human Factors Analysis and Classification System (HFACS)

 * See Human Factors Analysis and Classification System in Main article: National Fire Fighter Near-Miss Reporting System

The Human Factors Analysis and Classification System (HFACS) was developed initially as a framework to understand "human error" as a cause of aviation accidents (Shappell and Wiegmann, 2000; Wiegmann and Shappell, 2003). It is based on James Reason's Swiss cheese model of human error in complex systems. HFACS distinguishes between the "active failures" of unsafe acts, and "latent failures" of preconditions for unsafe acts, unsafe supervision, and organizational influences. These categories were developed empirically on the basis of many aviation accident reports.

Unsafe acts are performed by the human operator "on the front line" (e.g., the pilot, the air traffic controller, the driver). Unsafe acts can be either errors (in perception, decision making or skill-based performance) or violations (routine or exceptional). The "errors" here are similar to the above discussion. Violations are the deliberate disregard for rules and procedures. As the name implies, routine violations are those that occur habitually and are usually tolerated by the organization or authority. Exceptional violations are unusual and often extreme. For example, driving 60 mph in a 55-mph zone speed limit is a routine violation, but driving 130 mph in the same zone is exceptional.

There are two types of preconditions for unsafe acts: those that relate to the human operator's internal state and those that relate to the human operator's practices or ways of working. Adverse internal states include those related to physiology (e.g., illness) and mental state (e.g., mentally fatigued, distracted). A third aspect of 'internal state' is really a mismatch between the operator's ability and the task demands; for example, the operator may be unable to make visual judgments or react quickly enough to support the task at hand. Poor operator practices are another type of precondition for unsafe acts. These include poor crew resource management (issues such as leadership and communication) and poor personal readiness practices (e.g., violating the crew rest requirements in aviation).

Four types of unsafe supervision are: Inadequate supervision; Planned inappropriate operations; Failure to correct a known problem; and Supervisory violations.

Organizational influences include those related to resources management (e.g., inadequate human or financial resources), organizational climate (structures, policies, and culture), and organizational processes (such as procedures, schedules, oversight).

Controversies
Some researchers have argued that the dichotomy of human actions as "correct" or  "incorrect"  is a harmful oversimplification of a complex phenomena (see Hollnagel and Amalberti, 2001). A focus on the variability of human performance and how human operators (and organizations) can manage that variability may be a more fruitful approach. Furthermore, as noted above, the concept of "resiliency" highlights the positive roles that humans can play in complex systems.

Standards and Guidance Documents

 * IEEE Standard 1082 (1997): IEEE Guide for Incorporating Human Action Reliability Analysis for Nuclear Power Generating Stations

Tools

 * Eurocontrol Human Error Tools
 * EPRI HRA Calculator

Research Labs

 * Erik Hollnagel at the Cognitive Systems Engineering Laboratory at Linkoping University
 * Human Reliability Analysis at the US Sandia National Laboratories
 * Center for Human Reliability Studies at the US Oak Ridge National Laboratory
 * | Flight Cognition Laboratory at NASA Ames Research Center
 * David Woods at the Cognitive Systems Engineering Laboratory at The Ohio State University

Media coverage

 * “Human Reliability. We break down just like machines“ Industrial Engineer - November 2004, 36(11): 66

인간요인오류 ヒューマンエラー Human Error