| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Original Article |
1 First Department of Surgery, Division of Digestive and General Surgery, Niigata University Graduate School of Medical and Dental Sciences, 1-757 Asahimachi Dori, Niigata City 951-8510, Japan
2 Department of Otolaryngology, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Japan
3 Third Department of Anatomy, Division of Microscopic Anatomy and Bioimaging, Niigata University Graduate School of Medical and Dental Sciences, Niigata City, Japan
Correspondence: Address correspondence and reprint requests to: Vladimir A. Valera, MD, PhD; E-mail: vvalerar{at}med.niigata-u.ac.jp
| ABSTRACT |
|---|
|
|
|---|
Methods: A cohort of 106 surgically treated CRC patients was used for analysis. The expression of the cell-cycle-related Ki-67 protein in tumor samples was evaluated by immunohistochemistry. A score was assigned as the percentage of positive tumor cell staining, denoted as proliferation index (PI), and was used in a multivariate analysis using a recursive partitioning algorithm referred to as classification and regression tree (CART) to characterize the long-term survival after surgery.
Results: Of the covariates selected for their prognostic value, PI contributed most to the classification of survival status of patients. However, CART analysis selected the presence of distant metastasis as the best first split-up factor for predicting 5-year survival. CART then selected the following covariates for building up subgroups at risk for death: (1) PI; (2) pathological lymph node metastasis; (3) tumor size. Seven terminal subgroups were formed, with an overall misclassification rate of 16%.
Conclusions: These analyses demonstrated that a Ki-67-protein-based tumor proliferation index appeared as an independent prognostic variable that was consistently applied by the CART algorithm to classify patients into groups with similar clinical features and survival.
Key Words: Colorectal carcinomas Proliferation index Ki-67 CART analysis Prognosis
| INTRODUCTION |
|---|
|
|
|---|
The ability to predict tumor behavior based on factors other than tumor extent, identifying curable cancers or advanced-stage tumors with more favourable course, may significantly influence the selection of patients for additional therapy, potentially modifying pre or postoperative treatment plans, and finally may help to improve CRC-specific survival.3 However, the factors and the optimal strategies for selecting those patients have yet to be defined.
This has led to the massive current research on gene and protein expression analysis for the identification of possible prognostic factors. Individual molecular markers and patterns of markers are successfully subdividing traditional tumor classes into subsets that behave differently from each other.4 In this context, cell proliferation, as a result of several molecular pathways occurring in cancer cells, has been shown to correlate with tumor progression and patients survival in several human solid neoplasms, as estimated by the expression of a cell-cycle related product, the protein Ki-67.5
Common survival analyses applied in those prognostic studies usually employ a form of logistic regression for the estimation of the risk for death after surgery, including the Cox proportional hazards model. However, the assumptions of the method regarding the linearity of the relationship between covariates and outcome are not always met, and the visualization and clinical use of the results are sometimes difficult to interpret by clinicians.6 An alternative method that overcome these issues is known as classification and regression tree (CART) analysis. CART analysis belongs to a family of nonparametric regression methods based on recursive partitioning of data that builds a decision tree structure and classifies subjects, finally generating groups of patients with similar clinical features and survival times.7 It can be used to explore the data, identify possible high-risk subgroups, and uncover interactions or effect changes among prognostic factors. Unlike logistic regression, CART analysis does not assume a multiplicative risk model or a specific parametric probability, does not require a specification of the risk function, and is not affected by outlying observations. Most importantly, the results of CART analysis are presented as a decision tree, which is intuitive, easier to understand than the results of many other statistical methods, and facilitates the visualization and interpretation of the interactions between factors related to survival, with a flow-chart form to find group allocation.8,9
In this study, we applied this statistical method to model CRC patients data in order to evaluate the prognostic capabilities of the cell-proliferation-related Ki-67 protein, to explore the interactions between tumor cell proliferation and other clinicopathologic variables and their impact on survival and to illustrate explicitly how these covariates interact.
| METHODS |
|---|
|
|
|---|
Immunohistochemistry
Formalin-fixed, paraffin-embedded tissue blocks obtained from pathology archives were used for immunohistochemical staining. The procedure consisted in a double staining sequence, initially for the cell-cycle related protein Ki-67 followed by staining with an anticytokeratin antibody to differentiate proliferating epithelial cells from proliferating non-epithelial cells, as described previously.5 Only homogeneous brown nuclear staining for Ki-67 was considered positive. Negative controls resulted always in negative staining.
Assessment of proliferation index
Tumor samples were examined by two investigators without knowledge of the clinical or pathological data. A minimum of two double-immunostained sections per patient were evaluated microscopically at a total magnification of 400x. A proliferation index (PI) was calculated for each sample as the percentage of Ki-67-positive tumor cell nuclei in relation to all anticytokeratin-stained epithelial cells in at least three high power fields including 1,000 tumor cells (range: 1,0021,748), and was expressed as a mean value. In all cases, interobserver agreement as a measure of method reproducibility was 92% (intraclass correlation coefficient = 0.92).
Statistical Analysis
CART was implemented using 11 common clinical and pathological variables, including proliferation index, and the model was fit with the tree-structured survival analysis (TSSA) as described previously.10 This variant is based in the same recursive partitioning algorithm adapted to censored data and uses the log-rank test statistic for splitting. To assess the amount of overfitting, 1,000 10-fold cross-validation experiments were performed as suggested.11 In each of those 1,000 experiments, the data set was randomly split into smaller data sets and a pruning method was used to choose the best number of nodes for the tree. A restriction was imposed on the tree construction such that terminal subgroups resulting from any given split must have at least five patients.
Patient survival was measured from the time of surgery, and the survival distribution for each terminal group was estimated using the product limit method of KaplanMeier. Multivariate analysis of survival was also performed using Cox proportional hazards regression analysis. Statistical significance for the differences in survival was tested by the log-rank test based on a 500 permutation-based bootstrap, and defined by a resulting P value <0.05.
The sensitivity and specificity of the resulting tree analysis were evaluated using the above-mentioned 10-fold cross validation. CART results for the prediction of death were also compared with the results derived from a model based on logistic regression using the area under the curve (AUC) from the receiver operator characteristic curves. Analyses were performed using S-PLUS software (Version 6.0, Insightful Corp., Seattle, WA, USA).
| RESULTS |
|---|
|
|
|---|
|
CART Survival Analysis
The default tree was generated by allowing the program to determine the variable with the optimal first split. The results from the best tree indicated that distant metastatic involvement was chosen as the initial split 76% of the times, PI was selected with a 27% probability, and lymph node involvement was selected with a 23% probability. The next highest probability was 3%. The default tree, therefore, had an initial split on distant metastasis, and seven terminal nodes or subgroups were formed, with an overall misclassification rate of 16% (17 of 106). The complete model was found to have an overall sensitivity of 84% and a specificity of 82%.
The variables determining the structure of the tree included distant metastasis, PI, lymph node involvement and tumor size, with PI being chosen more than once for splitting other covariates. The structure of the default tree is presented in Fig. 1
, and the corresponding survival curves from four of the seven terminal groups generated are presented in Fig. 2
. The longest surviving subgroup (group 1) included 21 (19.8%) CRC patients without meta-static organ sites and a PI less than 25%, with no deaths occurring within this group. A second subgroup (group 2) with a relatively long median survival of 5.5 years included 22 patients without distant metastasis, with PI between 25 and 50% and without lymph node involvement. The shortest surviving subgroup (group 7) included CRC patients with liver metastases and a PI higher than 50%. These patients had a median survival of only 3.6 months (95% CI: 07.3 months). A subgroup of 24 patients (group 5) without distant metastasis, PI higher than 25%, positive lymph node metastasis and a primary tumor larger than 5 cm also had a short median survival of 36 months.
|
|
The Cox proportional hazards model confirmed the findings of the Ki-67-based PI as an independent factor of prognosis in the dataset (Table 2
).
|
| DISCUSSION |
|---|
|
|
|---|
In this study, a cellular marker of proliferation, the protein Ki-67, was evaluated by immunohistochemistry in CRC samples from patients who underwent surgical treatment. Classification and regression tree (CART) analysis was then used to assess the value of the protein in predicting patient survival and to explore the interaction with other common covariates.
Using a straightforward and relatively inexpensive immunohistochemical technique to evaluate tumor cell proliferation, the results demonstrated not only that the Ki-67 protein is useful for predicting outcome after surgery for colorectal cancer, but also identified previously unappreciated patient subsets inside the more established TNM classification based on the interaction between important common clinicopathologic covariates and this biological marker.
The results on proliferation index are in agreement with earlier observations demonstrating higher proliferative activity in more advanced colorectal tumors.14 While a direct comparison of studies is limited, the proliferation indices reported here lie within the 794% reported for colorectal carcinomas in the literature.15
Cell proliferation, as a summarizing feature of an altered cell cycle and underlying cell signaling in cancer, has been shown to be a powerful indicator of tumor behavior without the need of analyzing every factor involved in the pathway.16 This has been demonstrated in several solid tumors including colorectal adenocarcinomas, where an increased proliferating fraction of tumor cells, as identified by the protein Ki-67, was related with an increase in the risk of death after surgery.5
Proliferation index appeared not only as a significant independent factor of prognosis, but also was used more than once by the partitioning algorithm to select groups with well-defined differences in outcome. The main groups identified resulted in patients with distant metastasis (stage IV) and mid-or low-proliferation indices (PI<50%) whose long-term survival could be defined as "less poor", and whom may potentially beneficiate of more aggressive therapies. Another subgroup was formed by patients without positive lymph nodes (stage II) and tumors with PI over 25% that can be considered to have lower survival probabilities and could possibly beneficiate from adjuvant therapy in clinical trials settings, since adjuvant therapy for this stage II group is not currently indicated.17 A third group that is important to mention was formed by patients without distant metastasis, with positive lymph nodes (stage III), PI between 25 and 50%, and small tumors (<5 cm) with better 5-year survival, that may eventually receive less intensive or modified adjuvant treatment.
CART algorithm has been successfully applied in medicine since its conception in the early 1990s, but its use in predicting outcome in cancer seems to have been subutilized. In oncology, the technique has been useful for classifying patients with unknown primary carcinomas,18 for predicting long-term survival after surgery for melanoma,19 for the detection of prostate cancer,20 and for predicting tumor response to preoperative radiotherapy in rectal tumors.21 Other applications in medicine include the prediction of hip fractures,22 outcome after acute liver failure,23 in-hospital mortality in acutely decompensated heart failure,24 and outcome after severe head injury.25
One of the main advantages of the method in comparison with more established survival analysis methods is the ease for the visualization of the interaction between survival covariates, without being less accurate in prediction as demonstrated by the results. Both CART- and logistic regression-based models were comparable in their effectiveness in discriminating good or poor 5-year outcome after surgery.
The use of the tree-structured survival analysis is intended to extend the current application of the standard statistical methods to offer a better understanding on how the complex long-term outcome after surgery for CRC can be modeled and thus changed by intervention. In this sense and based in our results, CART appeared as a useful tool for dissecting complex clinical situations and identifying homogeneous patient populations that can assist clinicians in making a further treatment decisions and prognostic inference in CRC patients for future clinical trials.
| ACKNOWLEDGMENTS |
|---|
Received for publication May 17, 2006. Accepted for publication June 2, 2006.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
I Zlobec and A Lugli Prognostic and predictive factors in colorectal cancer Postgrad. Med. J., August 1, 2008; 84(994): 403 - 411. [Abstract] [Full Text] [PDF] |
||||
![]() |
I Zlobec and A Lugli Prognostic and predictive factors in colorectal cancer J. Clin. Pathol., May 1, 2008; 61(5): 561 - 569. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |