A Tale of Two Stories
National Patient Safety Foundation

  Report from a Workshop on
Assembling the Scientific Basis
for Progress on Patient Safety

Appendix B
 

 
Incident Reporting Systems in Medicine and Experience With the
Aviation Safety Reporting System

 



Charles Billings, MD
Institute for Ergonomics
Ohio State University

This appendix contains the talk by Charles Billings, MD, Chief Scientist (retired), NASA Ames, on the lessons learned from incident reporting in aviation. Dr. Billings designed, started and managed the Aviation Safety Reporting System 22 years ago when at NASA's Ames Research Center. His talk framed the discussion on the second day of the workshop. The lessons he abstracted from the aviation experiences represent the best guidance available on incident reporting. Medicine is quite different from aviation in many ways. What proved successful in aviation is not likely to transfer directly and literally to medicine. However, the lessons Dr. Billings has derived are generic and can serve as a guide to develop successful systems in medicine.
 

This is only a brief digest of what I think are some of the most compelling and important issues regarding formalized incident reporting. I hesitate to use the term systems to describe the many different approaches to incident reporting. To call them systems would dignify them unjustifiably, at least at this point. But I acknowledge that there are various requirements that, in the past, have shaped the systems during design and implementation. The experience with the aviation safety reporting system, in which I developed the ideas that form the basis for this presentation, exemplifies these requirements. The first and most critical requirement for a successful incident reporting system is a demonstrated, tangible, widely agreed upon need for more and better information. If a substantial portion of a community believes that it already knows what needs to be known about incidents, then it is unlikely to give more than lip service to finding out what incident reporting can discover. Strong, widely held consensus that more and better information is needed, is essential for the development of successful incident reporting. The second requirement is for a respected body, one independent of the influences of other stakeholders, to conduct the collection and analysis of data. This is an absolute requirement. Some disagree with this, but many incident reporting schemes have come to grief over the years by being installed in that body which was charged with oversight or in some other body that was subservient to the body charged with oversight, of the community whose activities were being assessed by the incident reporting system.

Two other factors have led to incident reporting schemes coming to grief in the recent past. One is adequate funding to permit expertise to be brought to bear. These systems cannot be run with a couple of clerks and a keypunch operator. For any useful degree of understanding of the reports, incident reporting requires expertise at the South end equal to that which was on the North end, that is, there must be expertise used in evaluating the reports as they are obtained. The other factor necessary is time. There must be adequate time to establish the system, to gather data and, more importantly to sell the system, its input, data, information, reports, and conclusions, to those authorities who represent the ultimate recipients of its products.

To some, these prerequisite requirements may seem obvious, but the simple fact is that incident reporting schemes have sometimes failed for want of them.

Let me address very briefly the question of whether incident reporting should be mandated. Those of you who have read the two volumes of materials that were sent in preparation for the conference (and who were still reading carefully by the time you got to the end of them!), will have recognized that the New York hospital incident reporting system is a mandatory system. But the other documents in these volumes you find in several points the observation that incidents are underreported. Well, that will come as a less than an overwhelming surprise to most. Some claim that mandatory reporting is necessary.

They say that if you don't have mandatory reporting everyone will protect themselves. I add only that if you do have mandatory reporting, everyone else will protect themselves. Despite mandatory reports in some locales, the number of incidents reported is small. For example, according to the materials, Carolina had 15 reports in its first year and Colorado had 17 reports in 2 years. I leave it to you to decide whether you think that is adequate or not. I will here make the claim that, in some form, in one way or another, all incident reporting becomes voluntary. It either becomes voluntary because of inertia on the part of reporters, or it becomes voluntary because of constraints within the establishment and the environment, or it becomes voluntary because hospitals (and there are at least one or two in your two source books) decide that they are not required to report this particular event because of the fine print in that particular incident reporting regulation or statute.

Underreporting is a recognized problem. But I'm not at all sure that that is the critical problem. Staying within the medical context and referring to the materials from your source books, there are enough reports of mishaps with potassium chloride, lidocaine, vincristine and other drugs and devices to have made it very clear that a problem with these exists. The information that these events occur is already present. We may well ask what it is that keeps us from making progress on safety, given that we already know about the existence of these problems. What is added by more formal, elaborate (and expensive) incident reporting? We may hypothesize that if the events had become known to a central organization, there would have been discovery of trends and systematic evaluation of what was going on that, in turn, would have led to some unspecified but important change. But it is clear, even at this stage, that the hypothetical, safe, non-punitive incident reporting system is not the primary obstacle to making progress on safety in medicine. Existing agencies can take action once a problem is adequately defined, explored, and explained (and, I would have to add, publicized). A central question facing us is not really how many there are, but how many is enough. That is to say, there are already many signals that point to a variety of failures. Part of the consensus that needs to be formed for successful incident reporting is consensus about what is a sufficiently strong signal to warrant action. The sourcebook materials suggest that such a consensus remains to be developed.

Stakeholders influence the likelihood of developing consensus. Now once again I've dipped into the sourcebook to the CEO's retrospective report on the Ben Kolb case, which is volume 1 tab 4. I've added two here because they were spoken of elsewhere but I found that the CEO listed a rather substantial number of stakeholders in that two-page report.

The CEO of the organization
The governing body
The staff of the organization
The staff directly involved
Physicians and nurses
Support personnel
The patient
The family
"The government(s)"
Risk management department
The insurer(s)
The media
The medical examiners
Outside consultants
Accrediting organizations
The union(s)
Advocacy groups
The attorneys

That, if you will forgive me, is a lot of people, organizations, interests, stakes and conflicting goals. If I am prepared to argue to you that incident reporting requires consensus, it seems reasonable to ask whether consensus can be reached among these stakeholders on anything including whether it is day or night, whether the sun is moving to the west or the east.

It will surely be asked by many why consensus among these stakeholders is necessary. What does it buy if you have it? First, it buys a substantial number of people and organizations as advocates for incident reporting and the system that supports it. This is essential to keeping the system a working, living entity in a contentious, politically driven environment. Second, it buys the participation of those people who can, if they wish, do something with the information you produce.

It might be useful to turn the question of consensus on its head. What does it cost you if you don't have it? At this point I will diverge just slightly to answer Richard Cook's question, posed at the beginning of this conference. He asked what critical failures we now recognize after a lifetime of research. This led me to think back on the history of the aviation safety reporting system and to identify the most significant failure in its development. The NASA Aviation Safety Reporting System was established in response to the cries of virtually everybody in the community, directly at the order of the FAA Administrator working through the Administrator of the National Aeronautics and Space Administration. NASA was chosen because we represented a respected and presumably objective third party. We were given one month to establish the system, guide it through about as many stakeholders as are shown in the table, and present it to the Administrator as a done deal. We (or, rather, I), in my naiveté made an assumption. I assumed that since the chief of the FAA was asking for it, the FAA wanted it. This was a bad mistake.

Over the past 21 years, the NASA Aviation Safety Reporting System has had the support of virtually everybody in the United States aviation community to a greater or lesser extent with one notable exception. The exception is the people under the FAA Administrator-and there are roughly 22,000 of them-whom he did not consult before he came to NASA and asked for an incident reporting system. We do well to remember that a primary issue is who may be hurt by reporting. This is especially of concern where use of immunity (and, originally in the ASRS, transactional immunity as well) is a prominent feature.

What was the largest segment of the FAA aside from the air traffic control system? Divisions involving regulations and enforcement. Whose enmity did we earn the day this thing was announced? Those who had to make it work in the community. That is the worst mistake I made in 40 years.

So the absence of consensus about the need for and characteristics of incident reporting was a critical flaw in the development of ASRS. What does it cost you if you don't have consensus? It costs you in passivity; resistance to acting on a lot of the recommendations derived from data received; delays in implementing; even ridicule. "You know it isn't a problem," was one form this passivity and resistance took. Another was "You guys just don't understand." Consensus is critical and it must include all the stakeholders, not just a few or a special team or a division or an agency or a company.

I also want to point out that consensus isn't enough. It's necessary but it is not sufficient. Incident reporting also requires understanding and that is even tougher to establish. Incident reports are unique sets of data. Each incident is unique and not easily classified or pigeonholed. Generalizations may be possible in retrospect, given enough detailed data and enough understanding of the data. But this means understanding details of the task, the context, the environment, and its constraints. This is why you have got to have experts looking at reports. Simply constructing taxonomies is grossly insufficient and it permits only counting of incidents that fall under phrase a, b or c of the taxonomy.

Counting incidents is a waste of time. Why? Because incident reporting is inherently voluntary. Because the population from which the sample is drawn is unknown and therefore can not be characterized, and because you lose too much information and gain too little in the process of condensing and indexing these reports unless you do what we were fortunate enough to do blindly, and that is keep all the narratives. Every ASRS report is in fact a narrative rather than a categorization. And the ASRS keeps every word, except those necessary to be discarded to de-identify the reports. The evaluation of incidents in such a system requires an understanding of all that. A deep understanding.

Let me give you an example. We got a report about 2 years after the ASRS started which, paraphrased, said "I had a frightening experience this morning. I took my airplane, a Lockheed 1011, off at Los Angeles headed for Vancouver, and it took full right aileron to keep it in the air. The airplane wanted very badly to roll sharply to the left." Fortunately, it was a cold day at Los Angeles, the load in the airplane was light, the pilot, carrying full right aileron all the way and with help from the first officer, managed to struggle around the airport and get it back on the ground at Los Angeles and no one was hurt. Whereupon it was found by the flight engineer that the two outboard spoilers on the right wing were both fully extended and had been since the airplane left the ramp.

Why was this not caught? Neither of those two spoilers carries sensors for spoiler deflection, so the crew, once the airplane was powered up, had no indication in the cockpit that those devices were extended. You can't see them from the cockpit. The airplane was powered down at the time the first officer made the walk around, and in an all-hydraulic airplane, controls can be in any position whatever when there is no hydraulic power on. We had three other similar reports after that, but we really didn't wait for a second report. We immediately got why that happened and reported it, de-identified, to those people we thought needed to know about it. As a matter of fact, the reporting back was conducted by telephone and that incident was handled initially within 24 hours.

Counts are not what you are after. You can't, in a voluntary system, determine the incidence or prevalence of a problem. But if you get two or three reports like that, you can be pretty sure that you have a problem, and that is what you are after in this kind of a system. On the other hand, having extolled the kinds of things incident reporting can do for you more briefly than I usually do, incident reporting is not the whole answer. I think this is particularly germane to your considerations here. In aviation accident reporting is mandatory, and accidents are very precisely defined. Yesterday we discussed the problems associated with defining and characterizing what an accident is. In aviation we have a much easier job, and the distinction between an accident, which must be reported, and an incident is easier to make than is the case in health care. Keep in mind that incident reporting in aviation is voluntary under most circumstances. Incident reporting and accident reporting are not substitutes one for the other. They are complementary.

Incident reports, properly interpreted, provide new knowledge. It is important to remember that this is all they provide. They are not a panacea. They only provide knowledge about what is going on in a particular domain or area of purview. The incident reports themselves tell you little. They generally do not tell you how it could be done better. The analysts, looking at many incidents, may or may not be able to tell you how it may be done better. They are more likely to know, in many cases, more than the incident reporters. Certainly they are more likely, over time, through reading these things, to gain some experience and understanding. But like descriptive epidemiology anywhere else, incident reports can provide only descriptions of phenomena. Analytic studies and other research remain critical to a full understanding of the phenomena.

Acting on the new knowledge that comes from combining incident reporting with analytical studies and detailed research is not and should not be within the purview of an incident reporting system. I emphasize this because too many people have thought that incident reporting was the core and primary component of what was needed. These people thought that simply from the act of collecting incidents, solutions and fixes would be generated sui generis and that this would make safety better. Although much is unclear about incident reporting systems, this one fact is quite clear: incident reporting is only one component of what you need. Using new knowledge gained from these systems must be the responsibility of the stake-holders I have listed. This says that the use of this knowledge to spur new analysis, new research, to guide regulation, to inform management decision making, to change performance, must be the responsibility of everybody else in the domain. It must not be the job of the people who run the incident reporting system, who have got to remain objective and disinterested in order to do this job as it needs to be done.

The incident reporting system must remain an objective and disinterested third party to retain its effectiveness. Otherwise, sooner or later, it will be accused, properly, of bias. There are enough biases we cannot get rid of; it clearly does not make sense to add more. So, I ask you to consider among other things, what should be reported. Is harm required? How much harm? What kind of harm? Is an adverse event-and that is the term Kaplan and his colleagues are using in Texas-determined by processes or by outcomes? Can incidents be differentiated from accidents in these settings? The difference is very critical with respect to legal liability. Unfortunately as we see the increasing trend, referred to in your sourcebook, toward criminal prosecution as well as civil liability, this question becomes increasingly important. It becomes increasingly important because it is entirely possible that if someone decides to go after a criminal indictment following such an incident or accident or mishap, you may be harboring evidence. That is not a particularly desirable position for the hapless expert working in one of these outfits to be in. I believe the legal term for it is obstruction of justice. This was a serious problem in Canada, where a violation of what in the US is called an air navigation order is not only a civil violation, it is also a criminal offense. The people who work within the Canadian confidential air safety reporting system had to be specifically immunized by Parliment to allow them to do this task. That would be true in any nation that was governed by the Napoleonic code. It is increasingly a potential threat in the common-law nations as well, given what I said yesterday about corporate manslaughter, corporate crimes involving damage or injury to people. So does it matter? Twenty years ago I would certainly have said, "Probably not." Now I would say, "Yes, I think it probably does." I am not sure yet of the way to get around this, and I'm doing a fairly extensive study at the present time to try to figure out how much it matters. But there is certainly, in aviation at least, an increasing trend toward imprisoning or invoking other criminal sanctions against people for offenses committed in the course of their employment in this industry.

Should records ever be protected? There are certainly going to be differences of opinion with respect to that. Can they ever be protected? One of the most treasured bits of data within the aviation industry, are the cockpit voice recorder (CVR) tapes. The police in New Zealand, after an accident there in 1995, subpoenaed, then sought a court order, then went to trial to get the CVR tape from that accident in connection with prosecution of the pilot and co-pilot, one of whom lived and the other of whom did not. The police prevailed at the appellate level and were given the CVR tape as criminal evidence. That has led to a very substantial effort in New Zealand to decide what are the limits of evidence gathering in this environment. And, as a matter of fact, the New Zealand Law Commission is looking into that at the present moment. This is a very serious question. Another question, related to it, is should the analytical records of an incident reporting system be protected? Sufficiently detailed, these analytical records may make it almost unnecessary to get to the raw data. In this country, thus far, raw data, from the aviation safety reporting system once de-identified has been considered hearsay and therefore inadmissible in the vast majority of cases. That may be the balm that can be spread upon the wounds in some of these other nations within the common-law states but in any event, this question remains an important one.

And finally, is an incident reporting system possible in this, that is to say, the medical environment? Thinking of the number of stakeholders involved, thinking of the enormity of the hypothesized problem, I think that we all need to reserve judgment. We need to have more evidence, gathered by different means-converging evidence to bring to bear on this question. Until this problem is scoped, neither the National Patient Safety Foundation nor anyone else is going to be able to figure out how to tackle it, so I think that that should clearly be one of your front-end research objectives. But however large the problem, the diversity amongst the stakeholders is very worrying. Now in the supplementary sourcebook, I've included some view graphs describing the NASA Aviation Safety Reporting System. A couple of those deal with our advisory committee, which represents a broad segment of the large, complex, diverse, aviation industry. But the size, complexity, and diversity of aviation are simply not on the scale of medicine; in fact, aviation is nowhere near. I would offer you some hope that it may be possible to define areas within medicine in which there is a somewhat smaller constituency, a somewhat smaller group of stakeholders and within which, therefore, the problem may be slightly more tractable than it will be if you decide that your purview is all of medicine. I am not sure I would know at this point, even after 20-plus years of experience with this business, how to design a system for medicine. I think I could perhaps design a system for some subsegments of medicine, that are more tightly circumscribed, but I hasten to add that I am not even sure of that.

These are some of the questions you are going to have to face as you begin to think about incident reporting systems in this particular domain. They make success in medicine much less certain, in contrast with the undoubted success that incident reporting systems have had in the aviation environment and a very few others. There are many other detailed issues to consider in developing incident reporting within medicine, issues that you will need to address in detail, should you chose to pursue this line of work. All these other questions deserve attention. But you will find, I think, that the ones I have presented here today are among the largest, most important ones that will be critical to the success or failure of any new incident reporting system in medicine. I think those kinds of questions need to be thought about very carefully, deliberately, intelligently, and thoroughly before beginning.

Question 1: Would you comment about your efforts over the years to keep the ASRS alive?

Well, that was the first thing we had to do. In some ways, this was actually fairly straightforward because the industry, by and large, recognized that they had a problem. The Administrator of the FAA had a political problem, but this is not particularly germane in our discussion today. The larger aviation industry, including all the stakeholders, had a real problem and recognized it. So this particular seed fell upon reasonably fertile ground. We got along reasonably well initially until we began to recognize the depths of the disinterest of the FAA working staff in this program. We did everything we could to interest them. We went out and we talked to as many of them as we could get to. We worked within FAA headquarters to try to get them to understand what they had put us into.

This was not entirely successful, but we got along by and by fairly well, until political problems arose concerning a freedom of information act request asking for an evaluation and discussion of air traffic congestion in terminal areas. The ASRS was obligated to respond to this, and it did. We put together a report whose principal finding of concern was that near mid-air collisions appeared from our data to be most likely in the highest density terminal control areas in the United States. The next most likely areas for near mid-air collisions were what were called Airport Radar Service Areas. These were the terminal areas around secondary but still busy airports. Near mid-air collisions were less likely to occur anywhere else.

This would probably not surprise you all much and it did not surprise us. But, it was of course a direct affront to the agency which regulates air space in the United States, and which had been saying, as part of its campaign to establish some 10 or 12 new terminal control areas, that they were the safest place in the air. The internal political response to this report was complex but amounted to an effort to remove the legal immunity that protected reports to the ASRS. Although this was forestalled by the intervention of Congress, it is clear in retrospect that the survival of the system as a working entity depended on the consensus within the community that had been reached early in the development of the system. It is one example of the importance of stakeholders who are directly and actively involved.

Question 2: Does the fact that pilots are at personal risk when flying have something to do with the success of the system? Is that a reason that you think this system might work much better in aviation than it might be made to work in medicine?

Well, it is certainly true that pilots are normally the first people at the scene of an accident. But the answer in my view is no, it is not the sense of personal risk that has driven pilots to participate in the system. I read the first 30,000 of these reports that came in. I have not read the ensuing 340,000 or so. One is continually amazed by the number of people who do far more than they need in order to qualify for immunity under this system. People write pages of descriptions, they send tapes, they volunteer to come to our offices. They make clear in a number of ways that they want us to understand, in all its rich detail, the complexity of the incident in which they had been involved and which they had already reported by mail. One indication of the sincerity and dedication of these people is that some have had such close interaction that they have become personal friends. There is no question about the motivation of the pilot community in general with respect to safety issues, none whatever. The reports are not grudging acknowledgement or pro forma filings but rather quite rich and human descriptions of troubling, often frightening events. I believe that the reporting to this system is motivated not by the sense of personal risk that attaches to flying but rather from two major factors: (1) the sincere interest in improving safety by identifying hazards and (2) the sincere (and, in my view, well grounded) belief that the system to which they are reporting uses that information productively and deliberately to improve safety rather than simply as a means of counting failures.

Parenthetically, I know that the system has been "gamed" in the course of adjudication of labor issues. I think it would be very surprising if it were not. But that becomes relatively easy to pick out; one notices it. One occasionally gets some very good information out of those reports incidentally, because some of that gaming is over issues that are real and substantive. The fatigue and jet lag issue is one that I think of immediately although that is by no means the only one. But once again you have to have a human-a smart human and a motivated human-who understands and has been there at the sharp end of the process from which reports come as well as at the analytical end. And I do not believe there is any substitute for that.

Question 3: Could you comment on expertise issues in the procedure for examining incidents and looking for patterns?

Pilots get rusty after about so long; as you know, many of our analytical people are retired pilots. We have instituted a set of rotations for these people, whereby they would come and work for us for a period of time and then they would move on and we would replace them with others coming out of the cockpit. Now that has been in place for probably fifteen years now, and it is also true with respect to our air traffic controllers. No one can stay adequately familiar with the niceties in an industry this complex unless he or she has been actively working in it until comparatively recently.

Interestingly, although we have been overwhelmed at times by the number of reports, the system for handling those reports has changed remarkably little. They are all hand-read by an expert who is appropriate to the field of consideration in the report. The reports are still identified at that stage and are being handled as classified material, so that if more data is required we can get it directly from the reporter. We still have the pilot's name, phone numbers, things of that sort. We will attempt to call the pilot back and discuss in more detail the incident that was reported and sometimes secondary issues as well, before the reports are de-identified. We do classification on a number of classification fields. The reports are processed, the narratives are de-identified with respect to person, flight numbers, and things of that sort, and then those narrative reports are keyed directly into a data base as well as of 60 or 70 fields of coded data. So the classification system, which is primarily used for indexing, provides access to the narrative. It is also possible to search the narrative fields by words, by phrases, and considerable amount of research has been done using that, as well. We would have lost all of that had we destroyed the narrative.

Question 4: What does it cost to run this system now?

To the best of my knowledge it's about two million dollars per year for about 35,000 to 40,000 reports.

Question 5: Currently in the medical field, legal requirements for discoverability drive access to these incident reports. In Louisiana and Arizona, for instance, between 50% and 100% of incident reports are discoverable in civil cases. This has created a substantial incentive to make these reports narrow and short. There is a strong emphasis, stated explicitly in most cases as directions to the reporting individual, to limit what is described to objective rather than subjective information. For example, regarding a fall in the hospital, what people are instructed to record and report is the bare fact that someone slipped and fell. If the individual said, "I slipped on water" the incident reporter can put that in quotes but needs to avoid concluding that they slipped on water. Thus these reports are limited to observations of the scene, noting the times, locations and so on. Are the ASRS reports specifically limited in those ways-that is, narrow, succinct, objective and without any opinions about the incident or how it occurred? Will such a system produce results similar to those obtained via the ASRS?

Absolutely not. We rely on preservation of this material as classified material initially, on rapid de-identification, and on blurring enough of the details of such a report so that it would at least be difficult to introduce it as evidence. But we also need the reports of people, of pilots, who we understand are experts in their field and whose opinions, thoughts, and observations we value. We do not try to limit them or channel their descriptions in a particular way. Indeed, it would surely limit the value of the reports if we were to artificially constrain the narratives to eliminate any conclusions or opinions from the pilots. But these reports are data, and we all recognize that data has biases and limits and the important thing is to try to understand these. This goes back to the problem of throwing away data when you throw away the narratives-counting doesn't work, that is not what you are after in incident reporting. Limiting the narratives in the way you describe would be throwing away the data even before you got it, a serious mistake in my view.

Now with respect to de-identifying the events in the database-that can be a bit hard. I think particularly of this city in which we find ourselves, and arriving here early one morning shortly before I retired from NASA, to see a 747 sitting ingloriously in the dirt, between Runway 10-Right and the taxiway adjacent to it. Now that was not, as it happened, an accident. The monetary value of it was not high enough to require defining it that way, partly because it was soft mud and there was little damage to the plane. But I would have hated to try to de-identify that report sufficiently so that it would be in the database but unidentifiable. A 747 is one big moose when it is sitting out there where everyone can see it.

 

  Appendix C: List of Sourcebook Materials

Table of Contents


 

Copyright 1998 National Patient Safety Foundation at the AMA

Prepared for Web publication by
Annenberg Center for Health Sciences