| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Original Article |
1 Department of Surgery, Stavanger University Hospital, P.O. Box 8100, 4068, Stavanger, Norway
2 Institute of Surgical Sciences, University of Bergen, Bergen, Norway
Correspondence: Address correspondence and reprint requests to: Hartwig Körner, MD, PhD; E-mail: koerner{at}online.no
| ABSTRACT |
|---|
|
|
|---|
Methods: One hundred ninety-four consecutive patients surgically treated with curative intent for CRC between July 1996 and June 1999 had systematic follow-up for five years. Follow-up included imaging, coloscopy and serial CEA measurements. Complete data including CEA measurements were available from 153 patients. ROC analysis of CEA was done with regard to detection of recurrent disease.
Results: Depending on the chosen cut-off value of CEA, DA varied widely within the normal range (CEA
10 U/ml). CEA >4 U/ml provided the highest sensitivity (0.78) and specificity (0.91), compared to a sensitivity and specificity at the upper normal range (CEA = 10 U/ml) of 0.51 and 0.99, respectively. Thirty-three patients (24%) developed recurrence. Among 11 (5%) asymptomatic patients diagnosed by elevated CEA levels, only two patients (1.5%) were amenable to secondary curative surgery. A threefold increase of CEA in an individual patient had the same DA as the best cut-off value (>4 U/ml).
Conclusions: Diagnostic accuracy of CEA in follow-up after curative surgery for CRC is influenced by the chosen cut-off value. A threefold increase of CEA may indicate recurrent disease. The value of serial measurement of CEA was limited.
Key Words: Colorectal cancer Follow-up CEA Recurrence ROC analysis Diagnostic accuracy
Abbreviations: CI, confidence interval CEA, carcinoembryonic antigen CRC, colorectal cancer DA, diagnostic accuracy PPV, positive predictive value NPV, negative predictive value LR, likelihood ratio ROC, receiver operator characteristic analysis AUC, area under the curve
| INTRODUCTION |
|---|
|
|
|---|
CEA is a diagnostic test with a continuous spectrum of test results. Therefore, diagnostic properties expressed by sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) or likelihood ratio (LR) depend on the chosen cut-off value used to differentiate between normal and pathologic values.
Receiver operating characteristic curve (ROC) analysis is an appropriate statistical method for assessing the diagnostic accuracy (DA) of a test with a continuous spectrum of results. The area under the curve (AUC) is a measure of the overall DA of the test, and the cut-off value providing the highest sensitivity and specificity is calculated.1921
While most studies have analyzed CEA at a single cut-off level (e.g., the upper limit of the normal range or at a particular specificity level),12,18,22 ROC analysis of CEA has not been used with regard to follow-up after colorectal cancer according to the literature.2326 The purpose of this study was to analyze the DA of CEA by means of ROC analysis in patients undergoing systematic surveillance for recurrent disease after curative surgery for CRC in a large, unselected, population-based cohort.
| PATIENTS AND METHODS |
|---|
|
|
|---|
Preoperative serum-CEA was analyzed routinely. Patients <75 years of age (n = 194, 62%), had a systematic postoperative follow-up schedule according to the guidelines of the Norwegian GastroIntestinal Cancer Group (NGICG).27 The follow-up schedule included serial CEA measurementsin addition to imaging studies of the liver, lungs and colonoscopyuntil the end of the follow-up period (five years) or until a diagnosis of recurrence was made, Table 1
.5,27 Complete CEA values for analysis of DA were available in 153 patients (79%). This study cohort was similar to all the 194 patients with regard to age, gender and tumor characteristics.
|
The reference standard for the accuracy of CEA measurement was either the diagnosis of recurrence established by biopsy and/or imaging studies (positive target patient) or a disease-free interval of 60 months or more without proof of recurrence (negative target patient; end of follow-up).
Sensitivity was defined as the fraction of all diseased patients with a positive test (true positives). Specificity was defined as the fraction of all healthy patients with a negative test (true negatives). Likelihood ratio for a positive test (LR+) was defined as the ratio of the fractions of the true positives and false positives (sensitivity/1specificity). Likelihood ratio for a negative test (LR) was defined as the fraction of the false negatives and true negatives (1sensitivity/specificity).28,29
Pre-test probability was defined as the probability that a patient suffers from recurrence, i.e., prevalence of recurrent disease after curative resection. Post-test probability was defined as the probability of recurrent disease in a patient with a certain test result of CEA. Thus, the LR+ is a measurement of the increase in probability of disease given a certain test result. LR+ >10 or LR <0.1 were considered to be excellent test results (LR+ >10: a positive test is ten times more likely to occur in a patient with recurrence; LR <0.1: a false-negative test result in less than one out of ten patients with a negative test).28
Laboratory Analysis of CEA
CEA was analyzed with the immunoassay kit from Abbot Diagnostic Division, Abbot Park, IL, USA. The upper limit of normal values was <10 U/ml.
Statistical Analysis
Categorical data were analyzed by frequency tables and the chi-square test. Continuous variables were tested for normality by KolmogorovSmirnov test. CEA data were found to have non-normal distributions (KolmogorovSmirnov test P < 0.001) for both raw data and after log transformation. Central tendency was expressed by median with a 95% con-fidence interval (CI), and variables were compared by MannWhitney U-test.
For the analysis of diagnostic accuracy, the CEA value at evaluation (i.e., CEA value at last follow-up or at the diagnosis of recurrence) was compared with first postoperative CEA value of the follow-up schedule. We adopted this approach from our clinical practice to follow serial CEA measurements as a marker of potential recurrence, by comparing the most recent measurement with the immediate postoperative value. Increased CEA may indicate both locoregional recurrence and distant spread, but might also occur when a metachronous CRC develops. Consequently, CEA was analyzed with regard to all events. Data were stratified with regard to Dukes classification. The ratio of the CEA value at evaluation to the postoperative CEA (hereafter referred to as the "CEA ratio") was calculated to assess the increase in CEA, which indicates recurrence of CRC.
We performed ROC analysis of the CEA value at the time of evaluation, and of the CEA ratio values at evaluation. The area under the ROC curve (AUC) with 95% CI was calculated to express the overall diagnostic accuracy (DA) of the test. Cut-off values with the highest sensitivity and specificity, PPV, NPV, as well as positive and negative likelihood ratio (LR+ and LR) were calculated. Time-dependent events were analyzed by KaplanMeier statistics and compared by log-rank test.
Statistical analysis was performed with MedCalc Statistical Software v. 8.1.1 (Mariakerke, Belgium).
| RESULTS |
|---|
|
|
|---|
10 U/ml). The first postoperative CEA measurement was done within six months after surgery.5 The distribution pattern pre- and postoperatively as well as at evaluation was similar with regard to recurrence (KruskalWallis test; P = 0.68), but some of the patients with Dukes A had recurrence associated with increased CEA measurements, while they had normal CEA values at the time of primary diagnosis. Eighteen (49%) of 37 patients with recurrence had CEA values within the normal range (
10 U/ml).
|
|
|
|
|
|
|
4 U/ml to excellent in the upper half of the normal range (CEA 510 U/ml). A three-fold increase of CEA was associated with a 20-fold increase in the probability of recurrence as compared to the pre-test probability. LR performed best in the lower end of the normal range. ROC analysis was also done for both variables with regard to Dukes stages. The best cut-off value for the CEA measurement changed within the normal range of 10 U/ml (>3 U/ml in Dukes A, >8 U/ml in Dukes B, and >4 U/ml in Dukes C).
The cut-off value for the ratio of CEA at evaluation and postoperatively was the same in all Dukes stages (ratio of 3). The results from ROC analysis with regard to AUC, sensitivity, specificity, likelihood ratios, and predictive values were similar within the Dukes stage.
Clinical Outcome
Seventeen of the 37 patients (49%; 11% of all patients) had an asymptomatic recurrence. Fifteen patients (40%; 10% of all patients) had curative surgery for recurrent disease (Table 6
). Eleven asymptomatic patients (30%; 7% of all patients) were diagnosed solely due to increased CEA level. Only two of these patients (5%; 1.3% of all patients) were amenable to curative resection of recurrence.
|
| DISCUSSION |
|---|
|
|
|---|
ROC analysis of the ratio CEA revealed that a more than threefold increase was as accurate as the absolute CEA value for detection of recurrence. This observation shows that the absolute value of CEA is not sufficient for a correct interpretation of the test result, but has to be interpreted within the context of previous tests. Increase of CEA within the normal range (e.g., from 3 to 9 U/ml) was associated with recurrence in some patients, which strengthens the observation. In the various Dukes stages, where the best cut-off values were different within the normal range, the ratio CEA remained unchanged. A twofold increase of CEA was used in a study protocol for recurrent colorectal cancer, without an explained statistical rationale.10 As far as we know, the factor of increase of CEA has not been addressed extensively in the literature. However, half of the patients with recurrent disease had no CEA increase, and therefore no benefit from serial CEA measurement.
The results for PPV and NPV have to be interpreted with caution, because they depend directly on the prevalence of the disease.28 The figures in our study are based on a recurrence rate of 24% in an unselected population-based group of patients, and our results should apply well to a general Western population. In contrast, the use of LR avoids this problem.28 Our study showed LR+ figures of 9 for CEA >4 U/ml and 20 for the ratio of CEA, which indicate a nine- and 20-fold increase in the probability of disease with a positive test result, as compared to the pre-test probability. In contrast, the LR was only 0.27 and 0.24, respectively, which translates into one out of four patients with recurrence being falsely classified as healthy. Thus, a negative test result was not able to rule out recurrence, as a negative LR lower than 0.10 is generally considered to be a clinically useful test.21,31 Truly, this reflects the fact that many patients will have normal CEA values in spite of recurrent disease.
The CEA values are not corrected for smoking habits of the patients. It is known that CEA values can be falsely elevated in smokers.32 However, CEA measurements were taken serially in the individual patients preoperatively and during follow-up, and the individual patients are regarded to be their own controls. The event of a cancer disease might change smoking habits in some patients, but we do not think that this could influence our results. We are not aware of any study reporting different reference limits for smokers and nonsmokers. It is important to note that results from our analysis apply to the particular immunoassay used during the study period at our institution.
The ideal situation for evaluating a diagnostic test would be to have a reference standard which reflects the truth, and that the results from this particular test would not influence the reference standard.29 In the present study, recurrence was confirmed by biopsy, endoscopy and/or by imaging. In addition, a long follow-up time of at least five years ensured that the patients could be classified clearly at evaluation as either healthy or diseased. It might be possible that CEA changes prompted further examinations and thus led to a work-up bias and a falsely high DA. However, as CEA was used with other diagnostic tools according to the follow-up schedule (Table 1
), we believe a work-up bias is less likely. A study where clinicians are blinded for CEA results and possible clues for recurrent disease would be ethically controversial and difficult to perform. We think therefore that our data from a large population-based patient group are highly valid.
CEA is one of the most frequently used tests in follow-up after CRC, and a large number of studies on it have been published since the 1970s.1014,30,33 Several authors conclude that serial measurement of CEA is of limited value in the follow-up of colorectal cancer,11,30 while others found CEA measurements to be crucial in the management of these patients, even when they reported that only between 2% and 3% could benefit from a curative resection of recurrence based on CEA measurement.15,18 In our study, recurrence was diagnosed in eleven asymptomatic patients out of 153 (7%) by CEA monitoring, and only two of them (1.3%) could undergo curative surgery.
In conclusion, serial measurement of CEA during follow-up after curative surgery was of limited value. The choice of the cut-off value is of great importance when interpreting the test results. Many patients will never have increased CEA levels, even when incurable recurrence occurs. These aspects are, in our opinion, of great importance for the clinical use of CEA and for future studies of CEA relating to the follow-up of patients with colorectal cancer.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Received for publication January 9, 2006. Accepted for publication March 11, 2006.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. Soreide Receiver-operating characteristic curve analysis in diagnostic, prognostic and predictive biomarker research J. Clin. Pathol., January 1, 2009; 62(1): 1 - 5. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |