Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening

David S. Gierada, Thomas K. Pilgram, Melissa Ford, Richard M. Fagerstrom, Timothy R. Church, Hrudaya Nath, Kavita Garg, Diane C. Strollo

Research output: Contribution to journalArticlepeer-review

99 Scopus citations


Purpose: To evaluate agreement among radiologists on the interpretation of pulmonary findings at low-dose computed tomographic (CT) screening examinations for lung cancer. Materials and Methods: Institutional review board approval and informed consent were obtained. HIPAA guidelines were followed. Sixteen radiologists from the 10 National Lung Screening Trial screening centers of the National Cancer Institute's Lung Screening Study network reviewed image subsets from 135 baseline low-dose screening CT examinations in 135 trial participants (89 men, 46 women; mean age, 62.7 years ± 5.4 [standard deviation]). Interpretations were classified into one of four of the following categories: non-calcified nodule 4 mm or larger in greatest transverse dimension (positive screening result); noncalcified nodule smaller than 4 mm in greatest transverse dimension (negative screening result); calcified, benign nodule (negative screening result); or no nodule (negative screening result). A recommendation for follow-up evaluation was obtained for each case. Interobserver agreement was evaluated by using the multirater κ statistic and by using response frequencies and descriptive statistics. Results: Multirater κ values ranged from 0.58 (for agreement among all four classifications; 95% confidence interval: 0.55, 0.61) to 0.64 (for agreement on classification as a positive or negative screening result; 95% confidence interval: 0.62, 0.65). The average percentage of reader pairs in agreement on the screening result per case (percentage agreement) was 82%. There was wide variation in the total number of abnormalities detected and classified as pulmonary nodules, with differences of up to more than twofold among radiologists. For cases classified as positive, multirater κ for follow-up recommendations was 0.35. Conclusion: Interobserver agreement was moderate to substantial; potential for considerable improvement exists.

Original languageEnglish
Pages (from-to)265-272
Number of pages8
Issue number1
StatePublished - Jan 2008


Dive into the research topics of 'Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening'. Together they form a unique fingerprint.

Cite this