| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Original Article |
1 Clinica Chirurgica II, Dipartimento di Scienze Oncologiche e Chirurgiche, Istituto Oncologico Veneto IRCCS and University of Padova, Padova, Italy
2 Surgical Oncology, Centro di Riferimento Oncologico IRCCS, Aviano, Italy
3 University Centre of Statistics for the Biomedical Sciences, Vita-Salute San Raffaele University, Milan, Italy
4 Research & Innovation (R&I) Company, Padova, Italy
Correspondence: Address correspondence and reprint requests to: Donato Nitti, MD; E-mail: donato.nitti{at}unipd.it
| ABSTRACT |
|---|
|
|
|---|
Methods: The gene expression profile was evaluated in frozen tumor samples obtained from 32 patients with primary gastric adenocarcinomas. The array consisted of a duplicated spot panel of 5,541 human genes. To classify node-positive (N+) and node-negative (N) cases, a logistic regression model was fitted optimizing the Akaike Information Criteria after a stepwise gene selection. The accuracy was evaluated by means of leave-one-out cross validation.
Results: All patients underwent radical gastrectomy and extended lymphadenectomy. Of all the cases, 21 were N+ and 11 demonstrated no lymph node involvement (N). After quality filtering, the analysis of variance selected a set of 136 genes potentially correlated with nodal involvement (P value <.05). Of these 136 genes, 5 were differentially expressed (adjusted P value <.05). After a stepwise gene selection, only three genes (Bik, aurora kinase B, eIF5A2) were retained in the logistic model, which could correctly predict lymph node status in 30 of 32 cases.
Conclusions: If our findings were confirmed, the identified gene pattern might be used to tailor the extent of lymph node dissection on a single patient basis.
Key Words: Gastric cancer Gene expression profile Lymph node status Prognostic markers
| INTRODUCTION |
|---|
|
|
|---|
In recent years, investigators have been proficient in elucidating the cascade of molecular events leading to cancer progression12, and some authors have reported a significant association between the altered expression of single genes/proteins and gastric cancer prognosis1315 and lymph node status1619. However, as tumor progression is a process involving the dys-regulation of several genes, it is unlikely that the abnormal expression of single genes or proteins might sustain tumor aggressiveness andin particularpredict tumor spread to lymph nodes.
Unlike the traditional molecular analyses, which support a reductionist approach to research, high-throughput technologies (e.g., SAGE, DNA micro-array) allow one to test in the same experiment not only multiple hypotheses but also multiple combinations of hypotheses20. Among these techniques, DNA microarrays have become prominent because they are easier to use and do not require large-scale DNA sequencing. Using this genome-wide approach, investigators have recently identified a set of genes that significantly correlate with pathological features of tumors and accurately predict the risk of both disease recurrence and tumor-specific death21,22. Moreover, specific gene clusters sorted out of micro-array experiments have been shown to correlate with lymphatic vessel invasion and lymph node status22,23.
In this study, we analyzed the gene expression profile of the primary tumor from 32 patients who underwent radical surgery for gastric carcinoma. The genetic signature of these tumors was then correlated with the lymph node status, and a statistical model was fitted to predict lymph node metastasis based on the molecular profile of the primary tumors.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
According to the TNM classification, 8 cases were T2a (25%), 13 were T2b (40.6%) and 11 were T3 (34.4%). Moreover, at pathological examination, no lymph node metastasis was found in 11 (34.4%) cases (N); whereas, in 21 cases (65.6%) lymph node involvement was present (N+). In the latter group, 6 were N1 (18.7%) and 15 were N2 (46.9%). According to the TNM staging system, 9 patients were classified in stage Ib (28.1%), 11 in stage II (34.4%), 5 in stage IIIa (15.6%), 6 in stage IIIB (18.8%), and 1 in stage IV (3.1%; T3N2).
With regard to tumor grading, 1 (3.1%) tumor was well differentiated, 15 (46.9%) were moderately differentiated, 13 (40.6%) were poorly differentiated, and 3 (9.4%) were undifferentiated. For gene profiling purposes, one bulk tumor tissue sample of about 5x 5 mm was obtained from each surgical specimen. Biopsies were snap-frozen in liquid nitrogen immediately after excision using RNase-free vials without other protective solutions. Samples were stored in liquid nitrogen until use.
Array construction
Duplicated cDNA arrays comprising 5,541 human genes were assembled onto mirrored aminosilane type-7 STAR slides (Amersham-Pharmacia Biotech, Little Chalfont, UK) by a Lucidea Spotter (Amersham). The DNAs were purchased in the form of purified, sequence-verified PCR products from RZPD (Berlin, Germany) and were diluted in 50% DMSO. The final concentrations of the DNA samples ranged between 100 fmol/µl and 400 fmol/µl. The gene panel was selected with particular regard to the relevance for molecular events related to tumor progression, and included genes controlling apoptosis and cell cycle as well as genes encoding for adhesion molecules and factors involved in cell differentiation. After deposition, slides were briefly exposed to a brief pulse of UVC light to stabilize DNA attachment to the slide surface.
RNA extraction, cDNA synthesis, fluorescent labeling and image visualization
These methods have been described by us in detail previously24. Briefly, total RNA was extracted from tumor biopsies using standard methods (TRIzol, Invitrogen). RNA integrity was assessed using an Agilent 2100 Bioanalyzer (Agilent Technologies). A dendrimer-based labeling system (Array50 version 2, Genisphere Inc.) was used for the preparation of fluorescent cDNA probes from tumor samples and slide hybridization. Briefly, 10 µg of total RNA was reverse transcribed into cDNA using 5' tagged oligo (dT) primers. The tagged cDNAs were hybridized overnight at 50°C with the arrays in a humidified chamber, and the slides were sequentially washed as follows: 2X SSC+0.2% SDS at 50°C, 2X SSC at room temperature and 0.2X SSC at room temperature (10 min per washing step). The slides were then incubated with DNA dendrimers containing cyanine dyes and including sequences complementary to the cDNA tags. As a reference sample, a pool of total RNAs from ten different cell lines was used (Universal Human Reference RNA, Stratagene). Cy3 and Cy5 dyes were used for tumor and reference sample labeling, respectively. After washing and drying, fluorescent signals were generated by laser excitation with a Gen III Laser Scanner (Amersham-Pharmacia/ Molecular Dynamics). Images were visualized and signals quantified by Array Vision (Imaging Research Inc.).
Quantitative real-time PCR
The amount of starting RNA was normalized using 18S ribosomal RNA as a control transcript. To this end, a QuantumRNA 18S internal standard kit (Ambion) was utilized, followed by quantification of the electrophoretic bands by ImageQuant (Molecular Dynamics). Primers for quantitative real-time PCR were designed with Primer Express 2.0 (Applied Biosystems). Primer and probe sequences were as follows: Bik, 5'-CCT GGA ACC CCC GAC CAT-3' (forward), 5'-CAC TGC CCT CCA TGC ATT C-3' (reverse), 5'-AGG ACC TGG ACC CTA TGG AGG ACT TCG-3' (probe); aurora kinase B, 5'-ACG CGG CAC TTC ACA ATT GA-3' (forward), 5'-GAG CGC CAC GAT GAA ATG G-3' (reverse) and 5'-TTG GAA ACG TGT ACT TGG CTC GGG A-3' (probe); eIF5A2, 5'-AAG CAG GCC ATT TCA GCA T-3' (forward), 5'-TCA TTA ACC CCA GTT TAT TGA ATC-3' (reverse) and 5'-AGG CAA GTG GCT GGA TGG TAT TCG AA-3' (probe).
Probes were labeled with 6-FAM at the 5'-end and TAMRA at the 3'-end. For the amplification, the qPCR core kit was utilized (Applied Biosystem). PCR conditions were as specified by the manufacturer. A threshold level of fluorescence within the log phase was chosen, and the relative levels of RNA were calculated as a function of the number of amplification cycles required to reach the threshold.
Data analysis
After nonparametric normalization of the logarithmic transformation of raw data, only spots with intensity above the background in both duplicate arrays of each slide and in all samples in each group were considered for further analysis. Denoting the transformed relative intensity as y, we considered the linear model ygijk = µg + Ni + Tj +
gijk, where µg is the mean level of intensity for gene g (g = 1,...,G), and Ni(i = ,+) and Tj(j = 2a, 2b, 3) capture the variations among patients grouped by lymph node status (N versus N+) and tumor depth (T2a versus T2b versus T3). The error term
gijk was used to describe the residual variability associated to the k-th subject.
Statistical tests were based on the F statistic of the ANOVA relative to the above linear model. P values were obtained by means of permutations and then adjusted using the procedure described by Reiner et al.25. Genes were ranked based on their adjusted P values. We defined as informative any genes with a raw P value of less than .05; those with an adjusted P value of less than .05 were defined as differentially expressed genes.
In order to predict lymph node status, we fitted a logistic model given a subset of genes (x1··· xGs) : p(N + |x1, ··· xGs ) = exp (
Gsi=1
ixi)/
1 + exp(
Gsi=1
ixi)
. Since, a number of genes contained in the microarray slides might be irrelevant or redundant with respect to the T and the N stages, the performance of the statistical model might be impaired if a high number of non-informative genes were included in the analysis. To address this issue, we identified a subset of Gs genes by performing a stepwise selection that optimized the Akaike Information Criterion. We, thus, considered as the starting reference set all the genes resulted to be informative in the ANOVA.
Finally, we compared the prediction effciency of the logistic regression model with that obtained with the logitboost model based on the same set of selected genes26.
The accuracy of the fitted models was computed by means of leave-one-out cross validation (LOOCV).
| RESULTS |
|---|
|
|
|---|
|
Lymph node status prediction model
Considering all the informative genes (i.e., those related to both N and T stages) as a starting set, after a stepwise gene selection the logistic regression model identified three genes (Bcl2-interacting killer [Bik], aurora kinase B/serine-threonine kinase-12 [AIK2/ STK12], and eukaryotic translation initiation factor 5A2 [eIF5A2]) that were strongly associated with lymph node status (Table 2
).
|
|
Quantitative real-time PCR
Quantitative real-time PCR results are shown in Table 4
. Of the three genes (Bik, aurora kinase B, eIF5A2) we tested, only Bik was differentially expressed in the two study groups (N+ versus N). In particular, the transcriptional levels of this pro-apoptotic gene were higher in node-negative than in node-positive patients.
|
| DISCUSSION |
|---|
|
|
|---|
D2 lymphadenectomy might be better accepted if performed in subgroups of patients at higher risk of lymph node metastasis. By focusing on this subset of patients, investigators might definitively assess whether or not this type of lymph node dissection has a therapeutic impact on the management of patients with gastric cancer. Although pathological features (e.g., T stage) and single molecular markers (e.g., VEGF, c-Met) are associated with different risks of lymph node metastasis1619, none of them is reliable enough to be implemented in the clinical setting to predict lymph node status on a single patient basis.
In the search for novel prognostic markers or combinations of them, high-throughput gene micro-array provides investigators with a powerful tool to screen the whole genome20. Using this approach, Weiss et al. reported that gene expression profiles of primary tumors (n=35) can identify groups of patients with statistically different risks of lymph node metastasis23. However, the accuracy (i.e., the percentage of cases correctly classified) and the negative predictive value (i.e., the probability that a patient has no lymph node metastasis when he/she is classified as node negative) were not encouraging (74.2% and 40%, respectively). Moreover, no data were reported on the list of informative genes, precluding any consideration on their biological significance with respect to lymph node status. By utilizing a k-nearest neighbor classifier, in their series (n=54) Teramoto et al. obtained a better accuracy (91.4%)22, but no data were reported on the predictive values of their model.
In our exploratory study, we confirmed that high-throughput gene microarray is an effective method to screen the genome in search of a gene set correlated with lymph node status in patients with gastric cancer. After adjustment for multiplicity, only five genes in N+ cases (DAG kinase alpha, HMMR, ARL1, CRTAP and EST moderately similar to ZRF1) displayed differential expression when compared with N cases. For two of these five, data exist on their potential role in cancer development and progression. DAG kinase alphawhich belongs to a family of nine kinases (alpha to jota)phosphorylates the lipid second messenger DAG to produce phosphatidic acid; thus, by influencing the intracellular levels of DAG, DAG kinase alpha can contribute to the regulation of activity of target proteins that are activated by DAG and have an established role in cancer biology (e.g., protein kinase C)30. As regards HMMR, preclinical models indicate that this receptor plays a key role in cell migration and, thus, might contribute to the metastatic potential of malignant cells; moreover, recent evidence suggests that this gene is involved in Ras and Hedgehog signaling pathways31, which are well known to be of relevance in cancer development and progression32,33.
Interestingly, when we used microarray-generated data to build a predictive model, the three genes most "informative" to predict lymph node status (Bik, aurora kinase B, eIF5A2) were not among those differentially expressed after adjustment for multiplicity. These genes code for proteins with a demonstrated role in tumor biology. Bik is a BH3-only member of the Bcl-2 intracellular protein family, which includes Bim, Bmf, Bik, Bad, Bid, Puma, Noxa and Hrk34. These proteins mediate many developmentally programmed and induced cytotoxic signals, and compounds mimicking them are promising anti-cancer agents. When activated, these death ligands engage anti-apoptotic Bcl-2-like proteins via the BH3 domain, inactivating their function and promoting apoptosis. Remarkably, although Bik was not among the genes identified by Teramoto et al., in that work, most genes selected for optimal prediction of lymph node status were apoptosis-correlated (e.g., survivin, clusterin, caspase-8, DPP4)22. Aurora kinases (A to C) are closely related kinases that have been implicated in tumorigenesis as they are important regulators of diverse cell cycle events, ranging from the entry into mitosis, centrosome function, mitotic spindle formation, chromosome biorientation and segregation, and cytokinesis35. Finally, eIF5A2which functions in the initiation of ribosome-mediated translation of mRNA into a polypeptidehas been found overexpressed in certain human cancer cells (in contrast to its weak normal expression limited to human testis and brain), suggesting a potential role as an oncogene36.
Although quantitative real-time PCR showed that among these three genes only Bik was differentially expressed in node positive when compared with node negative patients, the high accuracy (93.796.8%) and negative predictive value (95.295.4%) of our prediction models suggest that the combination of the expression levels of these three genes might be biologically more important than the average transcriptional abundance of each single gene. However, the functional interplay among these three genes is purely hypothetical and further research is warranted to biologically substantiate this microarray-generated finding.
Taken together, our results support the strategy of using high-throughput technologies coupled with appropriate statistical models for predicting lymph node status in patients with gastric cancer. However, larger series of patients need to be evaluated before the analysis of the molecular profile of primary tumors might be implemented in the clinical setting to guide surgeons in the decision-making process for the therapeutic management of gastric carcinoma.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Received for publication June 5, 2006. Accepted for publication June 5, 2006.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
W. Chen, J.-H. Luo, W.-F. Hua, F.-J. Zhou, M. C. Lin, H.-F. Kung, Y.-X. Zeng, X.-Y. Guan, and D. Xie Overexpression of EIF-5A2 Is an Independent Predictor of Outcome in Patients of Urothelial Carcinoma of the Bladder Treated with Radical Cystectomy Cancer Epidemiol. Biomarkers Prev., February 1, 2009; 18(2): 400 - 408. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |