Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Noninvasive medical imaging often is essential to cancer management at multiple times in the course of the illness.
Imaging currently is used for screening to detect cancer, characterizing lesions, performing locoregional and systemic staging, providing prognostic information, assessing response during and after therapy, restaging after treatment, performing follow-up of patients for recurrence, and precisely guiding biopsies and therapies such as external beam or systemic radiation, brachytherapy, or thermal and other ablations.
More invasive interventional radiologic procedures also can guide and be used to monitor and deliver vascular or intraluminal delivery of treatments such as radioactive microspheres, embolic materials, radiofrequency ablation or cryoablation, and therapeutic drugs.
Imaging methods range from the traditional anatomic methods—radiography, computed tomography (CT), and ultrasonography—to the more functional methods of magnetic resonance imaging (MRI) and nuclear medicine methods, including positron emission tomography (PET), single-photon emission computed tomography (SPECT), and planar nuclear imaging. Hybrid methods combining PET and CT, SPECT and CT, and PET and MRI are growing in importance. Optical imaging is promising but remains limited, by limited penetration of light through tissues, to superficial structures in most cases.
Plain films and mammography remain useful techniques, with mammography (including digital mammography and tomosynthesis) being the main imaging method and clearly proven capable of reducing cancer deaths when applied in the screening setting. A downside of screening imaging is the small but real risk of “overdiagnosis” and “overtreatment”—detection and aggressive treatment of cancers that may not be life-threatening.
CT remains a cornerstone technology for most oncologic imaging, and CT technology allowing for rapid-sequence angiography is finding new applications, as is three-dimensional reconstruction of CT data sets. Screening data with CT colonography continues to improve, and in some studies it has been found to be comparable to traditional colonoscopy for colon cancer screening. CT scanning for lung cancer screening reduces lung cancer death rates when applied to high-risk populations. The radiation dose from CT is a concern, and major efforts to reduce the dose from CT scanning have been implemented in newer CT systems.
MRI is the imaging tool of choice for central nervous system, spinal, and musculoskeletal neoplasms, as well as for assessment of vascular and some hepatobiliary and pelvic lesions. MRI also can be used to detect breast cancers, especially in women with dense breasts. Concerns regarding gadolinium-associated nephrogenic systemic fibrosis (NSF) have led to caution in the use of MRI contrast medium in patients with impaired renal function. There are also minor concerns regarding gadolinium accumulation in the brain, although these are of no known clinical significance. Newer MRI techniques such as diffusion imaging complement diffusion contrast enhancement (DCE) MRI and appear promising in assessing response to tumor treatment. Multiparametric MRI sequences are being used more frequently to characterize the prostate to determine if and where biopsies should be performed and to help in conducting active surveillance, delaying or reducing prostatectomies.
Bone scans using single-photon methods (technetium-99m [ 99m Tc] methylene diphosphonate) remain the dominant procedure for detecting suspected bone metastases; however, the PET agent fluorine-18 ( 18 F) sodium fluoride is increasingly applied since its approval by the US Food and Drug Administration (FDA). These techniques may be less sensitive for marrow involvement than MRI and other PET techniques for detecting bone metastases of many tumors.
PET and PET-CT technology using 18 F-fluorodeoxyglucose (FDG) continues to grow in a wide variety of applications, and its use is becoming increasingly routine in the management of patients with cancer at varying states of the disease process. PET is used with increasing frequency in the staging and follow-up of lung, colorectal, and head and neck cancers, as well as lymphomas and other types of tumors, and is now a routine tool in lymphoma management at several points in the disease. PET with non-FDG tracers is a promising research area with growing clinical applications. There has been particular progress in imaging of prostate cancer with several imaging agents including FDA-approved C-11 choline.
The fusion of anatomic and functional images to create hybrid “anatomolecular images” with software or dedicated instruments such as PET-CT, SPECT-CT, or the newer PET-MRI devices also is seeing rapid growth in applications in cancer imaging. Fully diagnostic CT scans coupled with PET imaging in the form of PET-CT often provide valuable composite imaging for cancer management. PET-MRI is an evolving technology, but fully integrated simultaneous systems are being used much more widely.
Imaging management for staging lung cancer and characterizing solitary pulmonary nodules often includes FDG-PET in addition to CT when the technology is available because PET-CT has higher accuracy in lung cancer assessments compared with CT.
Imaging management of suspected recurrences of colorectal cancer, head and neck cancer, lymphoma, and many other cancers often now includes the use of PET in addition to CT. Negative FDG-PET scans of the neck after chemoradiotherapy can obviate the need for surgery in patients with head and neck cancer. Response criteria for FDG-avid lymphomas are now mainly PET based, and PET assessments of treatment response are increasingly applied. Use of PET at earlier stages in the workup is becoming increasingly common, as is the use of PET in early assessments of the efficacy of cancer therapies. Adapting treatments based on the response seen on PET-CT is also increasingly applied with treatment deintensification approaches for rapid responders, and treatment intensification for poor responders being informed by PET. Neuroendocrine tumors are now diagnosed with gallium-68 ( 68 Ga) radiopeptides binding to the somatostatin receptors, and these tumors can be treated with radiopeptides labeled with lutetium-177 ( 177 Lu), among other therapeutic agents.
In prostate cancer, available imaging methods remain suboptimal for detection of primary tumor and early determination of local or systemic tumor spread. MRI nodal contrast agents are promising but not yet routinely available, and magnetic resonance spectroscopy has had only limited success in the prostate. A variety of MRI sequences including T2 images, diffusion images, and DCE MRI appear to improve on purely anatomic MRI approaches for lesion detection and detection of extracapsular involvement. More standardized reading systems such as Prostate Imaging–Reporting and Data System (PI-RADS) may lead to more consistent use of this technology to help differentiate low- and high-risk tumors from each other and inform potential active surveillance choices. A variety of innovative radiotracers for PET show promise for detecting disease recurrence, and C-11 choline is now FDA approved in the United States for use in prostate cancer, as is the synthetic amino acid fluciclovine. 68 Ga- or 18 F-radiolabeled peptides with binding to the prostate cancer prostate-specific membrane antigen (PSMA) molecule show great promise diagnostically and can guide radiopeptide therapies.
Visceral angiography for diagnostic purposes has been substantially supplanted by CT and MRI methods; however, it remains important as a tool for intravascular delivery of therapies such as chemotherapy, coils, or radioactive microspheres.
CT, ultrasonography (including with ultrasonographic contrast media), fluoroscopy, and innovative MRI systems can guide interventional procedures such as thermal and cryotherapeutic lesion ablations. Bifunctional methods including a radionuclide probe and an optical probe may guide diagnosis and surgery.
Highly specific probe-reporter systems are being developed to allow for optical and radionuclide imaging of transfected gene biodistribution and function. These approaches face major regulatory challenges when being translated to humans.
Combined anatomic and functional information is being applied to allow for more precise planning of external beam radiation therapy, including intensity-modulated radiation therapy (IMRT) and conformal therapy, methods that potentially allow for increasing dose escalation and minimization of toxicity to normal tissues.
Emerging imaging methods are proving increasingly useful in providing information on the physiology and molecular characteristics of lesions. This means that a multiparametric biologic imaging phenotype for tumors can be obtained and makes it possible to display heterogeneities in tumors. Given the complexity and multidimensionality of these imaging data, artificial intelligence (AI) approaches are being applied to augment radiologist performance. “Radiomics” is the emerging field of linking detailed analysis of quantitative images to tumor genetics and biology. Imaging phenotypes can more precisely guide individualized tumor treatment to yield a higher probability of success without excessive toxicity for treatment of the selected neoplastic process.
Noninvasive imaging is of fundamental and increasing importance in the daily management of the patient with cancer. Although physical examination and laboratory diagnosis remain key for planning treatment, for solid tumor management, imaging tests represent a major objective metric of disease presence or absence and activity and may be used at different times during the course of the disease to monitor the efficacy (or lack of efficacy) of treatment. Imaging to determine tumor size is an objective end point in disease management used to compare different types of cancer treatment and treatment across institutions. The use of imaging also is increasing in the drug development process and in developing new cancer therapies.
Specific clinical questions addressed by imaging include screening for the presence of cancer (e.g., breast, lung, or colorectal), characterizing anatomic lesions as malignant or benign, and staging a neoplasm—that is, determining the size and local extent of a primary lesion and determining whether it is localized or locoregionally or systemically metastatic. Such studies are essential for determining whether the patient is a candidate for surgical resection, identifying the extent of the field for radiation therapy, and determining whether systemic chemotherapy is appropriate. Initial staging of tumor size and extent also can provide important prognostic data. During the course of treatment, imaging is used to determine response of the cancer. Imaging also is often used to follow patients for recurrence or development of second malignancies. Imaging is being used more often as a method to assist in delivery of minimally invasive therapeutic procedures to ablate cancers, to guide radiation therapy, and to guide the dosage of therapeutic drugs, including radiopharmaceuticals, more precisely.
Imaging often is the best means of noninvasively identifying and assessing tumors. With information gleaned from imaging studies, the prognosis can be established and treatment decisions made with greater certainty. Before discussing the varying imaging methods available for patients with cancer, this chapter considers some general principles that are applicable to all imaging tests.
The major roles of imaging in the current and evolving practice of cancer management are shown in Table 16.1 .
|
90% SENSITIVE AND SPECIFIC TEST |
|
90% SENSITIVE AND SPECIFIC TEST |
|
But these are the same test! |
Noninvasive imaging is used to perform a wide variety of important tasks. Although the best way to determine the medical usefulness of a diagnostic test can be argued, a few key concepts are required to understand and compare diagnostic tests. These can be applied to one of the most basic tasks (i.e., determining whether tumor is present) and also to the ability of imaging to predict resectability or response to treatment.
Sensitivity describes how often the imaging test would give a “positive result” in a patient with cancer (i.e., true-positive finding). Ideally, the test precisely detects and locates one or multiple cancers in a given patient. Thus
Sensitivity can be calculated on a per-patient basis or a per–malignant lesion basis. The per-patient basis most commonly is used in screening studies for early diagnosis, whereas the per-lesion basis may be used in patients expected to have multiple sites of tumor. Per-lesion detection analyses can be misleading because they can be heavily biased by a single patient's results if that patient has multiple tumor foci.
It can sometimes be difficult to judge how “good” a test is by reading the literature. Sensitivity is supposed to be substantially independent of study population composition, but as discussed in the next section, certain imaging tests may be insensitive for some very early-stage disease, but very, very sensitive for more advanced disease. Each imaging test has a limit of detection threshold below which tumors cannot be detected because they are not distinguishable from the background tissues. Thus the patient population and very often the tumor burden and average tumor size can make a difference in the sensitivity of a test for detection of cancer. Virtually all noninvasive imaging tests are less sensitive for small-volume disease than for large-volume disease. For example, if an imaging test is used in a patient population in which patients have advanced disease before seeking medical attention (e.g., they are symptomatic at presentation), the imaging test may have far greater sensitivity than if it were used in patients with earlier-stage, smaller tumors. For example, positron emission tomography (PET) with Fluorine-18 ( 18 F)–fluorodeoxyglucose (FDG) has been reported to be more than 90% sensitive for detecting metastatic melanoma, but it is less than 20% sensitive in detecting early (low tumor volume) nodal metastases of melanoma at initial surgical resection. Mammography has higher sensitivity in women with more radiolucent than radiodense breasts. A test with high sensitivity has a low number of false-negative results. The false-negative fraction usually is expressed as 1 – sensitivity (in this case, sensitivity being rated on a 0–1 scale).
Specificity is the frequency with which a test result is negative if no disease is present, or the true-negative ratio. As a percentage, specificity is
Specificity can be calculated on a per-patient, per-lesion, or per-region basis. The per-patient calculations commonly are performed in the screening setting. They also can be done per region of the body (e.g., Is the liver free of tumor? Are the draining lymph nodes free of tumor?). It is technically difficult and sometimes impossible to know exactly how many tumor foci are present because this depends on the reference gold standard. It is not possible to perform “whole-body” biopsies antemortem, so some very small tumor foci may not be known to be present when disease is diagnosed. Specificity can be affected substantially if the imaging test is used in a population that has a characteristic that can result in false-positive results for the imaging test. For example, inflammatory and infectious lung disease, such as active tuberculosis (TB) or sarcoidosis, if present in a patient population, can result in false-positive findings on PET or computed tomography (CT) scans or other imaging methods. In this situation, the specificity of FDG-PET, and likely of CT, for staging the mediastinum for cancer would vary. Thus the specificity of PET for assessing mediastinal lymph nodes may be much lower in areas of the world with endogenous TB than in developed areas without it. Therefore an imaging test that is very useful in one part of the world may be far less useful in another part of the world. A highly specific test has a low frequency of false-positive results (i.e., a low frequency of positive test results in the patient population that does not have the disease). The ideal imaging test has both high sensitivity and high specificity, although none of our current imaging tests have perfect sensitivity and specificity.
For detection of disease, a binary, yes-or-no answer as to whether disease is present or absent is desirable. When such binary answers can be provided, it is simple to mathematically provide an accuracy value for a diagnostic imaging test. Thus accuracy is
where TP = true positive, TN = true negative, FP = false positive, and FN = false negative.
In this case, the number of patients with each finding is recorded. A highly accurate test is one with a low prevalence of false-positive and false-negative results.
Sensitivity and specificity define a test reasonably well, but its performance in a specific patient is affected by the characteristics of the population from which the patient is drawn. A physician normally wants to know whether an individual patient has cancer and whether the tumor is localized or metastatic (and where). The correct answer is binary in most cases, but imaging does not always reveal the true status of the individual patient. Thus the statistical likelihood of the accuracy of the result might be conveyed in the clinical imaging test report. This statistical likelihood will be related to the accuracy of the test as well as to the patient population characteristics. Therefore the positive predictive value often is of considerable clinical relevance. For example, the positive predictive value of a test with 90% sensitivity and 90% specificity will vary markedly, depending on the frequency of disease in the population. With these test performance characteristics, the clinician could reach two different conclusions regarding the practical value of the same imaging test ( Table 16.1 ).
A test that is effective in a patient population with a high prevalence of a disease may be far less valuable in a patient population with a lower prevalence of the same disease because there would be far too many false-positive results. The most effective use of imaging technology is in groups of patients in whom the imaging characteristics are expected to be robust enough to allow for predictions in individual patients. These challenges are particularly apparent when a test that was developed and validated in a patient population with disease is used to evaluate individuals with a lower prevalence of tumor (i.e., screening), or smaller tumors. In this situation, the number of false-positive findings may rise dramatically, sometimes nearly completely negating the value of the test.
Costs associated with a high false-positive rate can include excessive biopsies, with both considerable economic and personal costs, as well as an increased radiation dose in the population tested. Higher radiation doses in a population may lead to a risk of an increased prevalence of cancer. A clear balance must be achieved between when imaging tests are applied, especially in patient groups with a low prevalence of cancer, to minimize generating risks and maximize disease detection with the test. Screening programs may have specific challenges discussed later in this chapter. They may reveal disease that may be clinically irrelevant, resulting in “overdiagnosis” and “overtreatment.” However, proving overtreatment is challenging and ultimately requires randomized trials. That said, high-level evidence exists that screening for breast cancer and lung cancer in appropriate populations reduces cancer death rates. In addition, strong data also exist to support virtual colonoscopy for colorectal cancer screening.
Cancer imaging tests are interpreted by imaging specialists, often radiologists. As with all of medicine, there is considerable science involved in image interpretation, but the human element, or “art,” as it is referred to in some settings, also is involved. In developed countries, medical specialty boards have been established to ensure that practitioners have a basal level of training and knowledge, thereby providing some level of uniformity to image interpretations. However, even with board certification and extensive training, not all imaging specialists interpret a given imaging study in the same manner. Thus although the goal of an imaging test often is a simple binary “yes, there is tumor” or “no, there is not tumor” answer, there are varying degrees of certainty in the interpretation of an image in most instances. Some readers read with high sensitivity, whereas others read with high specificity. Unless a test is very robust, it is hard to achieve both high sensitivity and high specificity.
An example of a receiver operating characteristic (ROC) curve is shown in Fig. 16.1 . This set of curves reflects the performance of PET imaging in detecting axillary metastases in patients with newly diagnosed breast cancer. The axes of the curves are the true-positive fraction (sensitivity/100), which forms the y-axis, and the false-positive fraction (1 – specificity/100), which forms the x-axis, on a scale of 0 to 1. A perfect diagnostic test would yield no false-positive or false-negative results. The greater the area under an ROC curve, the greater the accuracy of the test.
The results shown in Fig. 16.1 are from three readers who graded PET scans using a five-point certainty scale (i.e., not a simple yes/no, but a continuum from definitely abnormal to definitely normal). The three readers had similar ROC curves, indicating that they were of generally comparable accuracy. For the same test, however, two readers may be reading at different points on the ROC curve, meaning that one is more sensitive and one is more specific, but both are of equal accuracy.
An excellent reader may have a greater area under the ROC curve (AUC) than a less skilled reader, meaning that the more experienced (and hopefully more capable) reader is both more sensitive and more specific than a less experienced (and presumably less capable) reader. However, virtually none of our imaging tests is perfect, and varying “cut points” between disease and normalcy often are made, affecting the overall performance of the test. In this study, the AUC of 0.7 to 0.76 was not viewed as sufficiently good for the task of nodal detection of metastatic cancer spread to the axilla. Despite this, a very high sensitivity or a very high specificity can be achieved depending on which part of the curve one operates in. A higher, hypothetical curve, with an AUC of 0.9, is shown for a more robust test, such as a higher-resolution PET system devoted to imaging the axilla.
In practice, sentinel node sampling, often guided by imaging or a radionuclide-sensitive probe system, is assuming a very important role in this area of tumor staging. Practically, if a rather insensitive test has a high positive predictive value, then the test may be of value if the result is positive, but of little value if negative. For example, a strongly positive PET scan for axillary metastases may obviate the need for a pretreatment axillary dissection in a patient with newly diagnosed advanced breast cancer in whom neoadjuvant chemotherapy could be given.
Although sensitivity, specificity, and accuracy commonly are used to characterize the tumor detection process, other metrics may be of greater importance. For example, some studies have focused on how often imaging substantially changes management. This kind of study is of great practical interest, but the optimal methods to assess such changes in treatment decisions are evolving. Ideally, one would like to show that the use of imaging, especially a new imaging technology, when applied randomly to half of the study population provided a reduction in the number of adverse events in the imaged population, improved survival, or had comparable outcomes at lower costs than standard treatments. As an example, a reduction in the number of “futile thoracotomies” has been used as a metric of success for PET versus CT in planning the treatment of newly diagnosed lung cancer. Ideally, randomization of patients to imaged versus not imaged groups can be shown to improve survival. Performance of randomized trials in which a portion of the patients undergo imaging and the other patients do not (or they obtain different kinds of imaging), with an end point of survival, will be of great interest. Unfortunately, such studies are complex or impossible, because management of patients after imaging may be altered markedly based on the imaging results. Therefore it can be difficult to separate the imaging study effect from the treatment effect. Ultimately, however, for some imaging studies to be adopted, such evaluations of survival will be needed. This point is particularly relevant to screening, as discussed later.
Registry data have been applied with substantial benefit to determine if planned or actual patient management is altered through the use of imaging tests. The National Oncologic PET Registry (NOPR) has provided a great deal of information on the use of FDG-PET imaging in the management of patients with a variety of cancers. The NOPR collected questionnaire data from referring physicians on intended patient management before and after PET. After 1 year, the cohort included data from 22,975 studies (83.7% PET-CT) from 1178 centers. Overall, physicians changed their intended management in 36.5% (95% confidence interval [CI], 35.9–37.2) of cases after PET, supporting the usefulness of PET for cancer imaging in registry cases, which are from a wide range of sources.
Screening programs for cancer often have taken the form of laboratory tests such as the Papanicolaou (Pap) smear, or, more recently, blood tests for tumor markers. The success of the Pap smear in reducing mortality rates from cervical cancer is incontrovertible. The use of imaging in screening for cancer is an example of success and is of considerable interest, but also is a source of considerable controversy. As discussed in detail in the chapters on breast cancer and lung cancer, screening mammography programs have been shown to be capable of saving lives in women older than 50 years. These programs also may save lives in women 40 to 50 years of age, but the data are less compelling.
Studies have been initiated in which CT scanning is used in an attempt to detect early lung cancer. Screening high-risk populations with CT imaging was proven to reduce lung cancer–specific mortality in the National Lung Screening Trial (NLST). This followed results from the Early Lung Cancer Action Project (ELCAP), a large study screening patients at increased risk of lung cancer with low-dose CT, which reported promising results in 1999. The ELCAP showed that lung cancers are detected at a smaller size and that patients whose cancers are detected by screening live longer following diagnosis than patients whose tumors are not detected by screening. Whether this translates into longer-term survival for the screened population remains unclear. The ELCAP study further evaluated 31,567 asymptomatic persons at risk for lung cancer with use of low-dose CT from 1993 through 2005, and from 1994 through 2005; 27,456 repeated screenings were performed. A diagnosis of lung cancer was made in 484 participants based on screening. Of particular note, 412 patients (85%) had clinical stage I lung cancer. Ten-year survival approached 90% in this group. This study demonstrated that annual spiral CT screening can detect lung cancer that is curable.
The results of another large randomized trial of lung cancer screening, the NLST, were reported in 2011. In this trial, 53,454 persons at high risk for lung cancer were enrolled from over 30 US sites. Study participants were randomly assigned to undergo three annual screenings with either low-dose CT (26,722 participants) or single-view posteroanterior chest radiography (26,732); 24.2% and 6.9%, respectively, of the CT-screened and chest radiography–screened groups had positive screening studies at some point. There was a high false-positive screening rate: 96.4% of the positive screening results in the low-dose CT group and 94.5% in the radiography group. The incidence of lung cancer was significantly higher (1060 versus 941 cancers) in the low-dose CT group, as compared with the chest radiography group, There were 247 deaths from lung cancer per 100,000 person-years in the low-dose CT group and 309 deaths per 100,000 person-years in the radiography group, a relative reduction in mortality from lung cancer with low-dose CT screening of 20.0% (95% CI, 6.8–26.7; P = .004). The rate of death from any cause was reduced in the low-dose CT group as compared with the radiography group by 6.7% (95% CI, 1.2–13.6; P = .02).
This exciting trial has raised some controversies: the vast majority of small pulmonary nodules identified are not malignant, the costs of the screening and of the medical care for incidentally detected lesions are substantial, and a substantial radiation burden, which in principle could be carcinogenic, is delivered to patients undergoing the screening. However, there is considerable hope that screening programs can achieve a reduction in lung cancer mortality as the radiation is given to patients later in their lives, when risks of radiation-induced cancers are reduced. Indeed, the evidence supporting lung cancer screening with CT was of sufficiently high quality to be included as a covered benefit in the United States under insurance policies consistent with the Affordable Care Act (ACA).
Based on the findings of the NLST, and the approval by US Medicare of screening, many centers have offered lung cancer screening clinics for high-risk patients. A risk, however, for lower-risk patients is that screening may have an unacceptably high false-positive rate. It is important to note that the promising results of NLST are for a higher-risk population, and extrapolations of benefit to lower-risk populations may be highly problematic.
As discussed later in this chapter, CT colonography holds excellent promise as a screening method for colon cancer, as an alternative to optical colonoscopy, and is increasingly being paid for by health insurers based on its accuracy.
Other areas in which screening by noninvasive imaging has been performed include colorectal cancer, where virtual colonoscopy can be used to look for early colon cancers, and in the pelvis in women who are at risk for ovarian cancer. More recently, screening CT centers offering a virtual evaluation of the entire body have become available, and even more recently, magnetic resonance imaging (MRI) and PET screening have been offered in some locales. The growth of these centers has been driven emotionally and economically; these tests are not yet well founded scientifically in the screening setting. In some cultures, screening imaging is done more commonly than in others. The risk of false positives in patients at low cancer risk is substantial.
Screening carries challenges, risks, and costs that are beyond the scope of this overview chapter. However, several key points apply to all screening approaches, including those using noninvasive imaging. These points include (1) whether a screening program is reasonable to consider; (2) lead-time bias; (3) length bias; (4) the overall economic cost implications of screening, especially the costs of investigating false-positive results, and (5) the risk that radiation used in screening may include subsequent cancers.
The requirements that a screening program must meet to be considered “reasonable” may differ substantially based on the specific society's values and a specific individual's perception of risk. However, in general, the following characteristics are important for cancer screening:
The cancer must have a considerable public health effect.
The disease must have an asymptomatic period in which detection by imaging is possible.
A therapeutic intervention that should lead to better survival or quality of life must be available.
The prevalence of the disease must be sufficient in the population being screened to justify screening (especially the cost). Low prevalence of disease lowers the positive predictive value of positive scans.
Medical, surgical, or other treatment must be available for the early-stage cancer identified by screening.
The screening test itself must not cause disease at a significant rate.
There must be a high likelihood that the patients in whom early cancer is identified by image-based screening will go on to undergo a suitable therapeutic intervention.
Furthermore, the imaging test itself must be acceptable to patients (in terms of level of discomfort, cost, and radiation burden), and it must be sufficiently sensitive to identify cancer often and sufficiently specific to minimize false-positive results. Finally, the costs of the screening process, and attendant costs related to false-positive results, must be compatible with the society's or the individual's economic resources, and the screening procedure must pose little or no risk to the patient.
Another important consideration in screening programs is lead-time bias. This concept, simply stated, indicates that if the natural history of a disease is unchanged but the diagnosis is made earlier in the course of the illness, the apparent survival will be improved. For example, let us assume that a tumor has a 6-year natural history from its beginning until the death of the patient, and that treatment is ineffective. The disease might become clinically detectable after 4 years and lead to death in 6 years, a 2-year survival after diagnosis. With screening, if the tumor is detected 3 years after the onset of disease and no improvement in treatment occurs, then the survival in the screened population would appear to increase from 2 to 3 years after diagnosis. This illusion of improved survival in the screened population is a considerable concern and can lead to inappropriate enthusiasm for screening programs.
Another important consideration in screening is the possibility of length bias. This is a more complex concept, but it may be related to the types of cancer that can be detected by screening programs. A possibility is that very rapidly growing and presumably highly lethal cancers are less likely to be detected by annual screening programs, whereas more slowly growing cancers, which have an intrinsically better prognosis, may be detected more frequently by screening. In fact, some of the early cancers discovered by screening may not be biologically relevant at all. If so, the patients with cancers identified in the screened population could appear to have a better survival than the patients with cancers identified in the unscreened population. This concern exists for prostate cancer screening by prostate-specific antigen (PSA), for example, wherein there are major concerns regarding overdiagnosis of slow-growing, and presumed indolent, cancers that might not need surgery. This concern also exists for breast cancer, wherein some argue that a small to a more substantial portion of the cancers diagnosed would not have proven harmful to patients had they been left undiagnosed and untreated. That said, we do not fully know which cancers need no treatment, and an issue existing with “overdiagnosis” is that it really is “overtreatment.” Better screening tests that identify biologically relevant cancers would be ideal.
A third factor is the selection bias that is difficult to control for in retrospective observational studies: how the patients and their referring physicians determined to have a scan performed. Selection bias occurs when there are unintended differences between the groups observed that, although they are associated with the variable used to sort the groups—for example, exposure in case-control studies and outcome in cohort studies—affect measurement of the study variable. For instance, in a case-control study of the effects on disease-specific mortality of a particular screening program, investigators would examine records from patients who have died from the disease in question versus those who have not, then determine the rates of the screening intervention in these two populations.
It is possible that those likeliest to have sought screening were the ones at highest risk for the disease. Results for the screening's effect on mortality could be underestimated in such a scenario. Selection bias also can work in the opposite direction, wherein the high-risk or poor-prognosis individuals are less likely to seek screening. With the relative absence of large randomized prospective trials, as is commonly the case when evaluating imaging as a screening tool, one may be tempted to base conclusions on the findings of cohort or case-control studies. However, the most accurate conclusions would be drawn from a prospective randomized study design.
Collectively, lead-time bias and length bias and, at times, selection bias can make screening programs appear to improve the survival of patients with cancer. Because of these major biases intrinsic to screening, very large, randomized, studies are required to show that overall cancer-specific mortality (and ideally mortality from all causes) declines as a result of screening programs.
All-cause mortality is a critical parameter. If the treatment of a presumed tumor discovered by a screening imaging test carries with it a risk of death or morbidity, the screening program could lower cancer-specific mortality but not all-cause mortality. If the screened population is very young, late adverse effects of screening may be difficult to detect, such as slightly increased risks of cancer due to radiation.
The advent of imaging and other screening tools that uncover a tumor long before it is symptomatic also brings up the need for biomarker discovery to help physicians to determine whether to treat what has been discovered through screening. The experience with PSA screening in prostate cancer serves to illustrate the point that not all cancers identified by screening eventually lead to death. This is likely also the case for some early-stage breast cancers of low proliferative rate and potential. Discovering and validating biomarkers that can help to distinguish reliably between lethal and nonlethal tumor types will be of great help in reducing unnecessary treatment in patients with less active disease. Identifying and phenotyping tumors with the greatest risk for progression is of great importance because it is clear that not all cancers carry the same risk of mortality if left untreated.
The determination of whether a screening program is valuable to society is often, in part, based on its cost and benefit. The concept of quality-adjusted life-years (QALYs) often is applied. This concept is defined as the economic cost to society required to result in 1 additional year of good-quality life for a member of the society. In many Western countries, a figure of $50,000 has been considered a useful guide (although many affluent countries are willing to spend far more per QALY), with QALYs costing less than this amount considered cost-effective. Such a guideline, however, does not necessarily apply when individuals make their own determinations as to whether to pay for a screening test. For example, it is reasonable to expect that those with greater disposable income would be willing to pay more per QALY than those with less disposable income. Thus it can be difficult to generalize about the cost efficacy of screening procedures.
Even when people choose to undergo a screening test at their own expense, considerable costs can be transferred to society as a result of the screening program. For example, if a screening test has a high rate of false-positive results, a substantial number of follow-up biopsies or procedures will be performed and can cost a great deal of money. Such costs can dramatically raise the total cost per QALY. Invasive procedures also can increase the likelihood of morbidity or death as a result of additional investigations. To determine the true cost per QALY associated with screening, these additional costs and risks must be considered. Thus screening remains an area of great promise, but also of considerable controversy.
Screening beyond breast imaging must overcome major hurdles before it is likely to be accepted. There has been considerable progress for both virtual colonoscopy as an alternative to optical colonoscopy and for CT scanning of the lungs in patients at high risk for lung cancer. In both these populations, the Centers for Medicare and Medicaid Services (CMS) pays for screening, as do ACA-compatible private insurance policies. It is almost certainly the case, however, that in high-risk patient groups, such as those with a family history of cancer or major carcinogen exposure with a high penetrance or early onset of disease, screening may prove of greater value than in the general population. As an example, MRI screening in patients at high risk for breast cancer is increasingly accepted as appropriate.
Noninvasive imaging methods in humans cannot enable detection and localization of a single malignant cell, although flow cytometric methods are very sensitive for finding a very few cancer cells in a patient's blood, and there is a great interest in assays for circulating tumor cells and for tumor DNA and RNA in the blood. Imaging methods are improving, however, and detection of a much smaller number of cells is possible in small-animal models. It has been estimated that by the time a tumor reaches 3 to 5 mm in diameter, which is the lower limit in size for detection by the best current noninvasive methods in humans, the tumor has undergone more than 25 doublings and contains 0.1 to 1 billion cells, depending on their size. In contrast, a cytologist on a very good day or with use of flow cytometric methods may be able to identify a single cell as malignant by using a microscope.
Realistically, even for histologic assessment of malignancy, typically a group of tumor cells must be present before cancer is diagnosed. However, light microscopy and more sensitive techniques such as immunohistochemistry and polymerase chain reaction studies mean that pathologic techniques potentially will be more sensitive than imaging methods. One very important proviso is that for light microscopy pathologic methods to be effective, actual examination of the malignant cells is required; the sample containing the tumor must be cut appropriately and viewed under a microscope. This task may be impossible, because the 8-µm sections that are used for pathologic examination typically are taken only from a small portion of a tumor or lymph node, whereas most of the tumor or node will be unexamined (e.g., to assess a 1-cm node by using 8-µm-thick sections, approximately 125 slices would be required). This large number of sections is not typically obtained.
Despite the markedly superior sensitivity of histologic methods over noninvasive imaging, there is a major sampling error issue for histologic sampling, and, paradoxically, imaging in some diseases potentially may be more sensitive than histologic examination for cancer. This situation can arise in mammography, in which small tumor foci can be seen on the mammogram but can be missed on cytologic sampling, possibly because of sampling error. When tumors are imaged with noninvasive methods, the entire tumor is visualized, not just a small portion. So, paradoxically, imaging, despite limited resolution, can be more sensitive than cytology or pathology. However, if exhaustive and thorough sampling is performed, with microscopic examination of tissue there usually is greater sensitivity for tumor than there is with current noninvasive imaging techniques.
With the advent of serum protein tumor markers, circulating tumor cell assays, and DNA- and RNA-based methods, it is possible that imaging will not always be a primary tool for cancer detection. These other methods may be more sensitive than imaging, although imaging has the distinct advantage of both detection and physical localization of tumor foci. In addition, noninvasive imaging can detect heterogeneity in cancers that cannot be identified with circulating cells or DNA. Such information is potentially more “actionable” medically than simply detecting whether tumor is present or absent.
One of the major goals of noninvasive imaging is to stage the tumor precisely to allow the clinician to best choose treatment and determine the prognosis. The evolving concepts of tumor staging are discussed elsewhere in this text, but improvements in detection technology can change the understanding of the natural history of a given stage of disease.
Patients with small or microscopic lung cancer metastases to the mediastinum are likely to do better than patients with bulky metastases, but both may have the same stage of disease. As the sensitivity for detecting small lesions improves, it becomes possible to identify more patients with small primary tumors and small metastases to the lymph nodes or small systemic foci of metastatic disease.
When primary tumors are detected at ever earlier and less advanced stages as imaging and other detection methods improve, patients are assigned to a higher stage than was used historically. Their inclusion in the advanced-stage group, as opposed to the localized disease group, appears to improve survival and outcome. The outcome of the lower-staged group also may improve because a subset of patients with now-detectable tumors has been removed from that group. The outcome of the overall group will not have changed because the numbers of patients in each group will have been altered.
One has to be very cautious in extrapolating historical survival data in a given advanced cancer stage based on an insensitive staging method to that observed with a more sensitive method, which might move some patients to a higher stage, so-called stage migration.
Broadly stated, cancer imaging can be performed with anatomic or functional (“molecular”) imaging methods. The traditional imaging of the patient with cancer, and the most established methods, are based on anatomic imaging. However, interest is increasing in more functional methods in cancer imaging. Furthermore, several anatomic imaging methods offer functional components that complement the anatomic method. Hybrid images, derived from and displaying both functional and anatomic data, also are becoming more widely available, often coming from the same hybrid imaging machine, such as in PET-CT. Imaging data are almost exclusively digital or digitized and suitable for postprocessing and image exchange. The major imaging modalities are discussed in the following section.
The traditional radiograph remains an important part of cancer imaging. It commonly is used to detect bone tumors and can be used to detect lung cancers in the thorax. The method displays mainly water, air, fat, and calcium density and is affected by overlapping tissue in front of or behind the lesion. Radiographs have become increasingly digitized in the last few years, with the introduction of film digitization and solid-state image screen capture devices.
Plain radiographs offer exceptional resolution but provide relatively little image contrast if there is not much calcium present. The radiation dose from a plain film radiograph depends on which portion of the body is being examined. In most centers, plain film radiographs for cancer management are being used less often, with CT scanning increasingly replacing radiographs in the abdomen and MRI in the brain and extremities.
Mammography is a specialized form of plain radiograph. Very high-resolution images of the breast are obtained using specialized x-ray sources and devices optimized for breast cancer detection. Digital mammography is now available and offers greater flexibility of image display because of the digital image format. A limitation of the digital format, however, is the field of view of the imaging phosphor, which may be too small to fully include some breast tissue. In the last several years, there has been a major shift toward digital mammography.
The Digital Mammographic Imaging Screening Trial (DMIST), which included nearly 50,000 women, showed comparable overall accuracy between film-screen and digital mammography in the overall study. A higher accuracy for cancer detection in women younger than 50, women with radiodense breasts, and premenopausal and perimenopausal women was seen with digital mammography as compared with film-screen mammography. Despite this higher accuracy, the reported sensitivity for both techniques for cancers was only 41%, with a positive predictive value of 12%, albeit with 98% specificity. This is far from ideal performance for a screening test. The move to digital mammography has occurred in part to increase diagnostic accuracy but also to allow digital images to be viewed on a picture archiving and communication system (PACS), as is the case with virtually all other diagnostic imaging methods in more and more imaging centers.
A newer technique, tomosynthesis, is under evaluation; in this technique, a number of “slices” of the breast are generated during a mammographic image acquisition that involves a moving x-ray source. This approach offers considerable promise going forward, but is early in its evolution technically. It has received FDA approval and will likely improve on the performance of mammography. Data suggest that tomosynthesis may help identify the 15% to 30% of breast cancers not identified by full-field digital mammography (FFDM). The specificity of tomosynthesis may be better than that of FFDM, as well. However, there are challenges with the large volumes of data and the potential failure to detect micrometastatic disease. The cost efficacy of such an approach remains in evolution, and larger multicenter studies will inform medical decisions regarding the precise role of digital tomosynthesis—for example, whether it can totally replace mammography.
Two emerging technologies in breast imaging—contrast-enhanced breast imaging and molecular breast imaging (MBI)—may ultimately play an important role. The former involves intravenous administration of contrast agent to identify areas of altered blood vessel density and permeability (often seen in cancer). The latter uses a radioactive tracer to identify areas rich in mitochondrial density and flow, such as cancer. Both are showing better performance than mammography alone in some settings, with performance approaching that of MRI.
CT is now established as the dominant imaging technique for cancer detection and follow-up. Currently, tumor response and staging criteria very commonly are based on tumor size as measured on CT. CT scanners acquire images by using an x-ray source and digital detector elements. The x-ray source rotates rapidly around the patient, usually with a single scan level taken in 0.5 seconds or less. Faster and faster rotation speeds of the scanners, along with multiple simultaneous detectors capable of imaging multiple slice thicknesses in a rapid spiral motion, are being used.
Scanners with 16 and 64 simultaneous slices are commonly in use, with some scanners with 256 or more slices now in use. Large–field-of-view detectors may allow even more of the body to be evaluated nearly instantaneously. Current fast scanners can potentially scan the entire body in a fraction of a minute. Although such evaluations provide key information about lesion size, some lesions may elude detection unless contrast medium is given intravenously, orally, or both. With such devices, it also is possible to capture contrast in arteries or veins to achieve superior visualization of these structures, which can then be displayed three dimensionally or in a volume-rendered fashion.
Although more slices in a CT scanner make for faster CT scans, it is not clear that having more slices always results in a superior diagnostic-quality scan. More slices and faster scans do mean that whole-body scans in a single breath hold can be obtained, which is advantageous because it can reduce the frequency of breathing artifacts.
A clear disadvantage of CT is its cost, both technically and in terms of the radiation dose. In the past several years, increased concern has developed regarding the total radiation dose being delivered by CT, particularly in children, because of the potential risks of carcinogenesis. Great efforts have been undertaken to reduce the dose from CT, especially with use of more advanced software methods for reconstruction.
Although CT is an exceptional technique, it remains a predominantly anatomic imaging method. Information can be derived based on the timing of intravenous CT contrast enhancement, such as the ability to estimate tumor blood flow, but it is not easily extracted without a substantial radiation dose from repeated images. Thus only a limited number of post-intravenous contrast images are obtained with CT, to limit radiation dose. All CT images are digital. Another major challenge with CT is the large amount of image data generated for analysis, data that can take the radiologist a long time to interpret fully. Newer dual-energy or multispectral CT scanners generate even more data and can provide virtual contrast-enhanced and non–contrast-enhanced images from a post–contrast administration acquisition. The large amount of data in a series of CT scans is of interest for purposes of further characterization beyond a visual one. Radiomic features can be extracted and analyzed, which may add information beyond tumor size and location. In addition, artificial intelligence approaches may be of growing importance to characterize large data sets.
Historically, angiography has been performed after intravascular insertion of catheters into arteries, followed by rapid injection of iodinated contrast media, along with rapid-sequence filming of the images. The improving ability of rapid-sequence CT scanning to show the vascular anatomy (CT angiography) has caused it to rapidly replace angiography for diagnostic purposes. Angiography still can be used to produce the most precise maps of vascular anatomy before organ transplantation or radical cancer surgery. Most angiography now is performed with digital image-capture devices, known as digital angiography.
Angiographic delivery of therapy is, however, an important area. This form of imaging can allow for therapeutic delivery of embolization materials such as coils, delivery of chemotherapeutic agents regionally, or delivery of radioactive microspheres, for example.
Angiography has high resolution but typically delivers a high dose of radiation energy to the patient. The use of angiography remains essential for studies to evaluate gastrointestinal (GI) bleeding, but with CT angiography continuing to improve in quality, the use of diagnostic angiography has become less frequent in routine clinical practice while cancer therapies by catheter have grown in application.
Become a Clinical Tree membership for Full access and enjoy Unlimited articles
If you are a member. Log in here