Assessing the Generalizability of Prognostic Information

  1. Amy C. Justice, MD, PhD;
  2. Kenneth E. Covinsky, MD, MPH; and
  3. Jesse A. Berlin, ScD
  1. From Pittsburgh Veterans Affairs Health Care System and University of Pittsburgh, Pittsburgh, Pennsylvania; University of California, San Francisco, and San Francisco Veterans Affairs Medical Center, San Francisco, California; and School of Medicine and Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania, Philadelphia, Pennsylvania.

    Abstract

    Physicians are often asked to make prognostic assessments but often worry that their assessments will prove inaccurate. Prognostic systems were developed to enhance the accuracy of such assessments. This paper describes an approach for evaluating prognostic systems based on the accuracy (calibration and discrimination) and generalizability (reproducibility and transportability) of the system's predictions. Reproducibility is the ability to produce accurate predictions among patients not included in the development of the system but from the same population. Transportability is the ability to produce accurate predictions among patients drawn from a different but plausibly related population. On the basis of the observation that the generalizability of a prognostic system is commonly limited to a single historical period, geographic location, methodologic approach, disease spectrum, or follow-up interval, we describe a working hierarchy of the cumulative generalizability of prognostic systems.

    This approach is illustrated in a structured review of the Dukes and Jass staging systems for colon and rectal cancer and applied to a young man with colon cancer. Because it treats the development of the system as a “black box” and evaluates only the performance of the predictions, the approach can be applied to any system that generates predicted probabilities. Although the Dukes and Jass staging systems are discrete, the approach can also be applied to systems that generate continuous predictions and, with some modification, to systems that predict over multiple time periods. Like any scientific hypothesis, the generalizability of a prognostic system is established by being tested and being found accurate across increasingly diverse settings. The more numerous and diverse the settings in which the system is tested and found accurate, the more likely it will generalize to an untested setting.

    Article and Author Information

    • Grant Support: This work is derived in part from a doctoral dissertation on health systems by Dr. Justice for the Graduate Group in Managerial Science and Applied Economics, University of Pennsylvania (1996), which was supported by Agency for Health Care Policy and Research National Research Service Award #5 T32 HS00009 from the University of Pennsylvania.

    • Requests for Reprints: Amy C. Justice, MD, PhD, Pittsburgh Veterans Affairs Health Care System, Section of General Internal Medicine (11E), University Drive C, Pittsburgh, PA 15240.

    • Current Author Addresses: Dr. Justice: Pittsburgh Veterans Affairs Health Care System, Section of General Internal Medicine (11E-120), University Drive C, Pittsburgh, PA 15240.

    • Dr. Covinsky: Division of Geriatrics, 111G, Veterans Affairs Medical Center, 4150 Clement Street, San Francisco, CA 94121.

    • Dr. Berlin: Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania, 423 Guardian Drive, Room 815, Blockley Hall, Philadelphia, PA 19104-6021.

    « Previous | Next Article »Table of Contents