AccessMyLibrary provides FREE access to over 30 million articles from top publications available through your library.

System of system failures: meta methodology for IT engineering safety.(Research Paper)(information technology)(Report)

Systems Research and Behavioral Science

| January 01, 2009 | Nakamura, Takafumi; Kijima, Kyoich | COPYRIGHT 2009 John Wiley & Sons, Inc. This material is published under license from the publisher through the Gale Group, Farmington Hills, Michigan.  All inquiries regarding rights should be directed to the Gale Group. (Hide copyright information)Copyright

INTRODUCTION

The purpose of this paper is to propose a meta methodology to promote engineering safety by learning from previous system failures. The predominant worldview in IT engineering is that systems failures can be prevented at the design phase. This worldview is obvious if we examine mainstream, current methodologies for managing system failures. These methodologies use a reductionist approach and are based on a static model (Nakamura and Kijima, 2007, 2008). It is often pointed out that most such methodologies have difficulty coping with emergent properties in a proactive manner and preventing the introduction of various side effects from quick (i.e. temporary) fixes, which leads to repeating failures of similar type. The main reason for this situation is that current methodologies tend to identify a system failure as a single, static event, so organizational learning tends to be limited to a single loop rather than a double loop in rectifying the model of the model (i.e. the meta model) of action (i.e. the operating norm). This indicates that we need a meta methodology that can manage the dynamic aspects of system failure, by ensuring the efficacy of its countermeasures through the promotion of double loop learning.

In this paper, we propose a meta methodology called System of System Failures (SOSF), along with a system diagnostic failure flow, in order to overcome the current methodologies' shortcomings. We also demonstrate this meta methodology's efficacy through an application in IT engineering.

In the following section, we explain the features of current troubleshooting techniques and limitations with respect to certain aspects of system failures. In the next section, the three key features required in order to overcome these limitations, as well as SOSF, which actually overcomes the limitations are described. In the subsequent section, we discuss the dynamic aspects of system failures and their side effects, which is followed by tools (such as the diagnostic system failure flow) that fully utilize SOSF in the actual application phase to promote double loop learning. Furthermore, we give an application example of a server problem, and finally followed by a concluding discussion of the efficacy of SOSF.

LIMITATIONS OF CURRENT TROUBLESHOOTING TECHNIQUES

The predominant technology of current IT troubleshooting is based on a predefined goal-seeking model, van Gigch (1991) points out the main shortcomings of system improvement in this model, as follows: (1) Engineers look for causes of malfunctions within the system boundary. The rationale of system improvement tends to justify systems as ends in themselves, without considering that a system exists only to satisfy the requirements of larger systems in which it is included. (2) Engineers seek to restore systems back to normal. A lasting solution cannot result from an improvement in the operation of a present system. An improvement in operations is not a lasting improvement. (3) Engineers tend to hold incorrect, obsolete assumptions and goals. It is not difficult to find organizations in which the formulation of assumptions and goals has not been explicit. Fostering system improvement in this context is senseless. (4) Engineers act as 'planner followers' rather than as 'planner leaders'. Another manifestation of the problem of holding incorrect assumptions and pursuing the wrong goals can be traced to different concepts of planning and of the planner's role. In the context of system design, the planner must be a planner leader, planning to influence trends, instead of a planner follower, planning to satisfy trends.

This paper focuses on system failure aspects that current methodologies cannot manage properly in the sense pointed out by van Gigch. To summarize, these aspects are soft, systemic, emergent and dynamic (i.e. they accommodate multiple stakeholders' worldviews).

Related articles from newspapers, magazines, journals, and more
Understanding information system failures from the complexity perspective.
Magazine article from: Journal of Social Sciences Mukherjee, Indranil October 1, 2008 700+ words
...complexity framework to Information System failures. The paper is organized as follows...necessary to discuss about information system failures is first developed. Different research...utilized in the study of information system failures are introduced next. Fundamental...
Serious quality system failures found at Aurora.
Newspaper article from: Validation Times Heitman, Evelyn January 1, 2005 700+ words
...Dedicated Breast MRI System device, FDA found serious quality system failures, including problems with the quality policy and system...company's recognition of the seriousness of its quality system failures. The firm did not respond to a request for an update prior...
Hospitals, offices have similar rates of medical errors. (System Failures...
Magazine article from: OB GYN News McNamara, Damian December 1, 2002 700+ words
...The crux of his presentation was that errors are caused by system failures. For example, getting two look-alike medicines confused...can point to inefficiencies in an office setup. Both are system failures. "There is a lot of slippage in our systems for processing...
System failures cost 125m [pounds sterling]. (News Round-Up).(global securites...
Magazine article from: Financial Management (UK) December 1, 2002 700+ words
...securities industry 800 million [pounds sterling] a year, 125 million [pounds sterling] of which is lost on security system failures. These occur when systems cannot provide codes for trades with multiple listings or when they fail to find codes fast enough...
Testing the plausibility of psi-mediated computer system failures.
Magazine article from: The Journal of Parapsychology Radin, Dean I. March 1, 1990 700+ words
TESTING THE PLAUSIBILITY OF PSI-MEDIATED COMPUTER SYSTEM FAILURES
California Skirts Energy Disaster with Near-System Failures.(Knight...
Newspaper article from: Knight Ridder/Tribune Business News Devera, Michelle December 8, 2000 700+ words
...crisis week by declaring its fifth consecutive Stage 2 emergency Friday, keeping energy officials in a tailspin of near-system failures. The Stage 2 alert comes on the heels of Thursday night's first-ever Stage 3 emergency, when the state skinned past...
LAN analyzer diagnoses Ethernet faults before they cause system failures....
Magazine article from: EDN Harold, Peter March 31, 1988 700+ words
LAN analyzer diagnoses Ethernet faults before they cause system failures Because the OSI (Open Systems Interconnection) layer-4 protocols used in IEEE-802.3 (Ethernet) LANs automatically retransmit...
Winternals Recovery Manager Delivers Dramatic Reductions in Recovery Time...
Press release article from: Business Wire March 31, 2003 700+ words
...network, with no data loss. "An important shift in 2003 will be in perceiving improvements in the time to recover following system failures as being more important than the time it takes to back up data," said Raymond Paquet, vice president and research director...
For more facts and information, see all results
©2009 Gale, a part of Cengage Learning. All rights reserved.
About us | FAQs | Contact us | Privacy policy | Terms and conditions
Other Gale sites: Encyclopedia.com | HighBeam Research | Acquire Content | Books & Authors | Goliath | MovieRetriever | Smart QandA