Research Papers:

A common molecular signature of intestinal-type gastric carcinoma indicates processes related to gastric carcinogenesis

PDF |  HTML  |  Supplementary Files  |  How to cite

Oncotarget. 2018; 9:7359-7371. https://doi.org/10.18632/oncotarget.23670

Metrics: PDF 1937 views  |   HTML 2658 views  |   ?  

Renata Binato _, Everton Cruz Santos, Mariana Boroni, Samia Demachki, Paulo Assumpção and Eliana Abdelhay


Renata Binato1,2,*, Everton Cruz Santos1,2,*, Mariana Boroni3, Samia Demachki4, Paulo Assumpção4 and Eliana Abdelhay1,2

1Laboratório de Célula tronco, Centro de Transplante de Medula Óssea (CEMO), Instituto Nacional de Câncer (INCA), Rio de Janeiro, RJ, Brazil

2Instituto Nacional de Ciência e Tecnologia Para o Controle do Câncer (INCT), Rio de Janeiro, RJ, Brazil

3Laboratório de Bioinformática e Biologia Computacional, Instituto Nacional de Câncer (INCA), Rio de Janeiro, RJ, Brazil

4Núcleo de Pesquisas em Oncologia, Universidade Federal do Pará (UFPA), Belém, PA, Brazil

*These authors contributed equally to this work

Correspondence to:

Renata Binato, email: [email protected]

Keywords: molecular signature; intestinal-type gastric carcinoma; brazilian molecular profile; common molecular signature worldwide

Received: March 22, 2017     Accepted: December 11, 2017     Published: December 27, 2017


Gastric carcinoma (GC) is one of the most aggressive cancers and the second leading cause of cancer death in the world. According to the Lauren classification, this adenocarcinoma is divided into two subtypes, intestinal and diffuse, which differ in their clinical, epidemiological and molecular features. Several studies have attempted to delineate the molecular signature of gastric cancer to develop new and non-invasive screening tests that improve diagnosis and lead to new treatment strategies. However, a consensus signature has not yet been identified for each condition. Thus, this work aimed to analyze the gene expression profile of Brazilian intestinal-type GC tissues using microarrays and compare the results to those of non-tumor tissue samples. Moreover, we compared our intestinal-type gastric carcinoma profile with those obtained from populations worldwide to assess their similarity. The results identified a molecular signature for intestinal-type GC and revealed that 38 genes differentially expressed in Brazilian intestinal-type gastric carcinoma samples can successfully distinguish gastric tumors from non-tumor tissue in the global population. These differentially expressed genes participate in biological processes important to cell homeostasis. Furthermore, Kaplan-Meier analysis suggested that 7 of these genes could individually be able to predict overall survival in intestinal-type gastric cancer patients.


Gastric cancer, one of the most aggressive cancers and the second leading cause of cancer-related death worldwide, is a multifactorial disease affected by lifestyle, aging, socioeconomic factors, dietary behavior and infection [13].

The majority of gastric cancers are associated with infections agents, including Helicobacter pylori (present in 65% to 80% of cases) and Epstein-Barr virus (present in 6% to 10% of cases) [45]. Although the role of Helicobacter pylori in the emergence of gastric cancer has already been proposed, the role of Epstein-Barr virus is not yet clear because only a small group of patients harbor this infection [2, 68].

Most gastric cancers are adenocarcinomas that are divided into two subtypes according to the Lauren classification, intestinal and diffuse [9], which differ in their clinical and epidemiological features. Moreover, most cases are sporadic and occur as a result of acquired genetic abnormalities, such as microsatellite instability, changes in the epigenetic landscape, somatic gene mutation or single nucleotide polymorphisms (SNPs) within key candidate genes [5, 1013].

Late disease detection due to the nonspecific symptomatology in early stages remains a significant problem in gastric cancer that is associated with poor prognosis and a 5-year survival rate of approximately 20%. Moreover, surgical resection and chemotherapy have a limited value for treating patients in the advanced stages [2, 4, 5, 7]. Therefore, many studies have attempted to elucidate gastric cancer biology to develop new non-invasive screening tests that improve diagnosis and facilitate the development of new treatment strategies.

Many innovative technologies have been used in the past five years to identify changes in cell biology associated with gastric cancer. Several genetic abnormalities, such as aberrant genes, copy number variation, microRNAs and long non-coding RNAs, were identified as possible biomarkers in these studies [1416]. However, the molecular mechanisms leading to gastric cancer and those responsible for its progression remain poorly understood.

Bessède and co-workers [17] suggested that long-term chronic infection that damages the gastric mucosa and recruits bone marrow mesenchymal cells may cause gastric cancer. However, even this model depends on additional epigenetic and mutational events for carcinogenesis.

In addition to the large number of genomic analyses that identified most known mutations related to gastric cancer [18], several studies have attempted to define the gene expression signature of gastric cancer [1921]. Although these studies successfully correlated some changes in gene expression to specific conditions and resultant abnormalities in cellular processes, a consensus signature has not yet been clearly identified for the two subtypes of gastric cancer.

The distribution of gastric cancer worldwide is heterogeneous, and much of the information in the literature does not apply across all populations. In fact, almost all studies were conducted on populations from Asia or Central America. Therefore, we attempted to broaden the applicability of current findings by analyzing microarrays and comparing the Brazilian intestinal-type gastric carcinoma profile to that of other populations. To address this hypothesis, we used chip arrays to compare the gene expression profiles of tumor samples from Brazilian patients with intestinal-type gastric carcinoma with those of non-tumor tissue from the same patient (control). Specifically, our study identified a molecular signature for Brazilian intestinal-type gastric carcinoma that distinguishes tumor from non-tumor tissue. Moreover, we compared this profile with the ones obtained from other populations to assess the similarity of our intestinal-type gastric carcinoma profile with the profiles observed in the global population. To this end, an unsupervised analysis compared microarrays from different studies worldwide and this Brazilian molecular signature, which revealed that 38 genes from the Brazilian intestinal-type gastric carcinoma molecular signature successfully distinguished intestinal gastric tumors from non-tumor tissue in patients worldwide. Among then, seven genes could individually predict overall survival in intestinal-type gastric cancer patients.


Differential gene expression: fifty-seven genes define a Brazilian intestinal-type gastric carcinoma molecular profile

The Lauren classification of intestinal gastric cancer has been extensively studied over the years. Several works related to microarrays and gene expression profiles have already been described for this disease, but significant differences in incidence exist between continents. Although the incidence of this disease is highest in men in northeast Asia (Japan, Korea and China), its incidence is low in North America, Africa, south Asia and Oceania. South America (including Brazil) and Europe are classified as intermediate incidence regions [22]. Therefore, most studies related to this disease focus on northeast Asia. Because the Brazilian population is extremely heterogeneous, identifying relationships between the Lauren classification and gene expression patterns as well as the similarity of these patterns to those in other studies worldwide is challenging.

To identify a global gene expression pattern using tumor tissue from patients with intestinal-type gastric carcinoma and compare this pattern with that of non-tumor control tissue, we performed a comparative transcriptome analysis using an expression chip array assay using a pool of 2 samples in each array.

In this assay, 8 samples from patients with different stages of intestinal-type gastric carcinoma were used and compared with the non-tumor control tissue. In total, we have 16 samples, 8 from the tumor region that were divided into 4 arrays containing a pool of two samples in each array and 8 samples from the non-tumor region from the same patients, also divided into 4 arrays containing a pool of two samples in each array. Using a ≥ 5-fold change as the cut-off to define overexpression or downregulation, fifty-seven genes were found to be differentially expressed in all tumor tissue chip array assays. The hierarchical clustering of these differentially expressed genes shown in Figure 1 suggests that a common molecular signature exists for all intestinal-type gastric carcinoma tumors compared to non-tumor control tissues. Interestingly, 16 of these 57 genes were overexpressed in tumor tissues, whereas 41 of these genes were downregulated, indicating a global decrease in gene expression in intestinal gastric cancer tumor tissues (Supplementary Table 1).


Figure 1: Hierarchical clustering of the 57 differentially expressed genes identified by the chip array assay. The results showed a common molecular signature for tumor tissues from intestinal gastric cancer compared to non-tumor control tissues. NN- Non-tumor.

RT-qPCR assay confirmed the Brazilian molecular signature of intestinal-type gastric carcinoma

To confirm the obtained chip array results, quantitative PCR (RT-qPCR) was performed for selected overexpressed (MMP7, SPARC and TIMP1) and downregulated (CHGA, KRT20, GIF, AKR1C2 and PGA4) genes by comparing tumor and non-tumor tissues in a larger subset of Brazilian patients (n = 17). These genes were selected because they had all been previously related to gastric cancer or other cancers.

The RT-qPCR results presented in Figure 2 confirmed the obtained chip array assay results.


Figure 2: RT-qPCR to validate the chip array assay results. To confirm the obtained chip array results, RT-qPCR was used to analyze selected differentially expressed genes using a larger number of Brazilian patient samples to determine changes in mRNA expression levels after normalization to Actin and GAPDH. RT-qPCR analyses of MMP7 (A), SPARC (B) and TIMP1 (C) (overexpressed in patients with intestinal gastric cancer) and PGA4 (D), KRT20 (E), AKR1C2 (F), GIF (G) and CHGA (H) (downregulated in patients with intestinal gastric cancer) confirmed the chip array assay results and the common molecular signature that was able to discriminate all tumor tissues from all intestinal-type gastric carcinoma patients from non-tumor control tissue. *p < 0.05; **p < 0.01.

An unsupervised analysis revealed a common molecular signature for intestinal gastric cancer worldwide

To assess the ability of this molecular signature identified in Brazilian patients with intestinal-type gastric carcinoma to discriminate non-tumor and tumor tissues in samples from intestinal-type gastric carcinoma patients of other nationalities, we performed an unsupervised analysis of 190 non-tumors and 312 tumor samples from different studies representing several countries (Table 1). After the integration of all expression data, only 38 of the 57 differentially expressed genes identified in our dataset were common to all different platforms and could be used in this analysis. An unsupervised, hierarchical clustering of samples based on the expression of these 38 selected genes (Figure 3) successfully distinguished tumor and non-tumor samples. Based on similarities in the expression of this gene panel, the 502 samples separated into two large clusters that extensively differed in terms of disease status (tumor or non-tumor). A small set of tumor samples produced a separate cluster due to the upregulation of most selected genes, suggesting that the tumors can be divided into two types based on this set of 38 significant genes.


Figure 3: Unsupervised analysis of differentially expressed genes found in Brazilian patients with intestinal-type gastric carcinoma in different populations samples. Hierarchical clustering of samples using 38 genes differentially expressed between non-tumor and tumor samples from different studies. Each row represents a gene, and each column represents a sample. The expression level of each gene in a single sample is relative to its median abundance across all samples and is depicted according to a color scale shown at the right. Red and green indicate expression levels above and below the median, respectively. The magnitude of deviation from the median is represented by the color saturation. Dendrograms of samples (above matrix) and genes (to the left of matrix) represent overall similarities in gene expression profiles. For samples, blue boxes represent non-tumor tissue (n = 190), and red boxes represent cancerous tissue (n = 312). Colored boxes represent datasets from different studies showed in Table 1.

Table 1: Microarray data from other studies

Study-GEO acession



Histologycal type


Affymetrix Human Genome U133A Array

United Kingdom



Affymetrix Human Genome U133 Plus 2.0 Array




Affymetrix Human Genome U133 Plus 2.0 Array


Non Tumor


Affymetrix Human Genome U133 Plus 2.0 Array




Affymetrix Human Genome U133A Array


Intestinal/Non tumor


Affymetrix Human Genome U133A Array

Several Cohorts



Affymetrix Human Genome U133 Plus 2.0 Array




Affymetrix Human Genome U95 Version 2 Array




Affymetrix Human Genome U133 Plus 2.0 Array




Affymetrix Human Genome U133 Plus 2.0

Asian Cancer Research Group cohort


Overall, the results confirmed that this molecular signature can distinguish intestinal-type gastric carcinoma tissue from non-tumor tissue and suggested a common molecular signature for intestinal-type gastric carcinoma, independent of geographic origin of the patient.

Pathways and processes related to the 38 differentially expressed genes

An in silico analysis of the 38 genes defined as the common molecular signature was conducted using the Metacore™ software (GeneGO Inc., Encinitas, CA). This tool categorized the input genes to produce representative pathway maps. As shown in Table 2, the most representative processes that the 38 common differentially expressed genes participated in, were related to matrix alterations, adhesion, gastric mucosa modification, and inflammation. Some overlapping genes, e.g., TIMP1, MMP7 and FN1, appeared in two or more of these processes and may be involved in cross-talk between these pathways. The upregulated genes were primarily involved in extracellular matrix remodeling, whereas the downregulated genes were involved in pathways associated with the differentiation and normal function of the gastric mucosa in tumor tissues.

Table 2: Processes related to the 38 common genes differentially expressed in intestinal-type gastric carcinoma

Functional Enrichment Analysisa




Extracellular Matrix Remodeling


Gastrin in differentiation of the gastric mucosa


Stimulation of gastric acid secretion


Cell adhesion_Cell-matrix interactions






aEnrichment analysis was performed using MetacoreTM.

bGene symbols from 38 genes found in our unsupervised analysis which were identified to be significantly up- or down-regulated in pathway maps.

The prognostic value of the genes from molecular signature

In order to analyze the impact of high expression of the differentially expressed genes found in intestinal-type gastric cancer on overall patient survival we have performed Kaplan-Meier analysis on two validation cohorts of intestinal-type patients that provided overall survival information, one from Microarray data used in our unsupervised analysis and the other one from RNA-seq data from TCGA Stomach adenocarcinoma (TCGA-STAD) dataset [27]. Kaplan-Meier analyses demonstrated that patients with tumors expressing high levels of PSCA(HR, 3.05; 95% CI, 1.26–7.37), SPARC (HR, 3.56; 95% CI, 1.31–9.66), THBS2 (HR, 2.47; 95% CI, 1.03–5.927) and THY1 (HR, 3.40; 95% CI, 1.34–11.89) genes had a significantly poor overall survival while high levels of CXCL9 (HR, 0.42; 95% CI, 0.21–0.83), HMGCS2 (HR, 0.48; 95% CI, 0.23–0.98), SULT1B1 (HR, 0.50; 95% CI, 0.26–1.00) genes has a protective effect (p < 0.05 by the log-rank test) (Figure 4). Altogether these results suggested that these genes could be intestinal-type gastric cancer survival predictors.


Figure 4: Overall survival of patients stratified according to gene expression. Kaplan-Meier analyses showed that patients with high levels of PSCA, SPARC, THBS2 and THY1 genes had a significantly poor overall survival while high levels of CXCL9, HMGCS2 and SULT1B1 genes has a protective effect (p < 0.05).


Many factors synergistically contribute to cancer development, such as infection, environment and heredity. Although the diagnostic capabilities and therapeutic methods for gastric cancer have improved, the prognosis of patients with gastric cancer remains poor, especially in the advanced stages.

Several groups have used genome and transcriptome profiling to identify genes that could be related to gastric cancer. However, the majority of these studies use samples from populations in which the disease incidence is highest, and few studies have examined populations in which the incidence of this disease is lower [1821, 23, 24].

In this study, we compared the gene expression profiles of tumor tissue from Brazilian patients with intestinal-type gastric carcinoma and their corresponding non-tumor control tissue using a transcriptome analysis. The molecular profiles of these samples revealed that 57 genes that were differentially expressed compared with non-tumor tissue could differentiate intestinal tumor tissue from non-tumor tissue, suggesting that these genes constituted an intestinal-type gastric carcinoma molecular signature. A RT-qPCR analysis confirmed that this molecular signature can distinguish intestinal tumor tissue from non-tumor control tissue. Thus, this molecular signature may serve as an important molecular marker to identify patients with intestinal gastric cancer in Brazil.

Because the Brazilian population is extremely heterogeneous and the incidence of gastric cancer in our population is intermediate [22], we assessed the similarity of this expression profile to that of other populations worldwide. To this end, we performed an unsupervised analysis using the molecular signature found from Brazilian intestinal gastric cancer and verified the ability of this signature to discriminate non-tumor and tumor samples from other nationalities. Our results show that the 38 genes identified in the Brazilian population are sufficient to discriminate tumor and non-tumor region in patients with intestinal gastric cancers, irrespective of region.

An in silico analysis of the 38 differentially expressed genes defined as the common signature of intestinal gastric cancer showed important processes that may be involved in the development or progression of gastric cancer, including extracellular matrix (ECM) remodeling and alterations in cell adhesion, gastric mucosa modification, gastric acid secretion and inflammation.

Pathways that affect the ECM also interact with cell adhesion molecules. This balance between cell adhesion and extracellular molecules is essential for normal cell survival, and imbalance among these pathways results in the detachment of cells from the extracellular matrix and consequently promotes metastasis [25]. Changes in the ECM and cell adhesion processes have been identified in several cancers, suggesting that it plays an essential role in cancer biology. We herein identified a large number of genes associated with the ECM and cell adhesion to be differentially expressed in intestinal gastric cancer, including TIMP1, MMP7, FN1, SPARC, LUM and BGN, which were upregulated.

MMP7 is a matrix metalloprotease gene that is involved in the degradation of all components of basement membranes under physiological conditions. Under pathological conditions, MMP7 overexpression has been associated with cancer-cell invasion and metastasis, and MMP7 regulates cancer-associated processes, such as the inhibition of apoptosis, the degradation of cell-cell contact and cellular proliferation [2628]. In gastric cancer, MMP7 was previously identified to be overexpressed, and Koskensalo and co-workers suggested that this gene may be an independent prognostic marker [29]. Moreover, the SPARC gene encodes a matrix-associated protein that is required for the calcification of collagen in bone but is also involved in extracellular matrix synthesis and changes in cell shape. Its gene product has been correlated with metastasis based on changes in cell shape, which can promote tumor cell invasion [30]. Specifically, the expression of SPARC is higher in advanced gastric cancer compared to the early stages, and high SPARC expression significantly correlated with lymph node metastasis, lymphatic invasion and perineural invasion [31]. Furthermore, TIMP1 is a metallopeptidase inhibitor 1 gene that is involved in the control of the proteolytic activities of MMPs during the degradation of the extracellular matrix. TIMP1 can also induce cell proliferation and has an anti-apoptotic effect, and its overexpression has been associated with a poor prognosis in several types of cancer [3234]. TIMP1 has been reported to be overexpressed in gastric cancer cells and in the inflammatory cells of the stromal element of the tumor, and high levels of this protein are associated with poor outcome [35, 36]. The FN1 gene encodes fibronectin, a ubiquitous ECM protein related to many important normal biologic processes, such as cell adhesion and migration. In several cancers, FN1 is a key mediator of disease progression and metastasis [3740]. In gastric cancer tissue, FN1 expression was found to be upregulated and related to invasion and migration [41, 42]. Moreover, the lumican gene (LUM) is also a component of the ECM that participates in important regulatory processes, such as cell proliferation, migration and adhesion. Additionally, LUM has been associated with the aggressiveness of lung adenocarcinoma and squamous cell carcinoma and was found to be overexpressed in gastric cancer [43, 44]. Biglycan (BGN) is expressed in the ECM, and its upregulation was associated with several types of cancer, including colon tumor, pancreatic cancer and gastric cancer [4548]. GC cells secrete BGN into the tumor stroma and promote GC progression [48, 49]. This protein may also regulate inflammation and innate immunity.

Other processes that seem to be important in intestinal gastric cancer are gastric acid secretion and the differentiation of gastric mucosa. We identified REG1A, CHGA, TFF2, ATP4A and ATP4B to be downregulated in intestinal gastric cancer, and Rajkumar and co-workers found ATP4A and ATP4B to be downregulated in gastric tumor tissues compared to normal tissues [44]. ATPases are the most critical component of the ion transport system in parietal cells, which mediate acid secretion in the stomach and the inhibition of ATPase activity cause epithelial cell proliferation and suppress their differentiation [50]. TFF genes play a regulatory role in the mammalian digestive system, specifically in mucosal protection and epithelial cell reconstruction, tumor suppression or promotion, signal transduction and the regulation of proliferation and apoptosis. TFF2 expression is high in the normal gastric mucosa, and several studies have shown that TFF2 expression is downregulated in gastric cancer compared with normal tissue and that this downregulation may be associated with promoter hypermethylation [51]. The REG1A gene encodes a protein that is secreted by the exocrine pancreas [52] and is expressed in the normal colorectal mucosa and tumors, such colorectal cancer, pancreatic ductal adenocarcinoma [5356]. Zhang and co-workers used an RNA-seq approach to identify that REG1A was downregulated in gastric carcinoma [23]. Chromogranin A (CHGA) belongs to the granins (acidic glycoproteins) family, which is related to the family of neuroendocrine secretory proteins, and it is crucial for the exocytosis of secretory vesicles in neuroendocrine cells, including the gastrointestinal endocrine system [57, 58]. Signet ring cells (SRC) were found to be derived from neuroendocrine cells, indicating that SRC-gastric carcinomas may be of neuroendocrine origin [59]. CHGA expression correlates with better prognosis in SRC-gastric carcinoma [60]. However, the expression of this gene in intestinal-type GC has not yet been described.

Inflammation is also dysregulated in intestinal gastric cancer. The relationship between inflammation and cancer was first discovered in 1863 by Rudolf Virchow, who suggested that cancer may originate at sites of inflammation. Chronic inflammation may increase the risk of developing cancer; for instance, esophagitis or gastritis may lead to the development of esophageal or gastric cancer, respectively [61]. In the common gene signature identified in this study, genes related to inflammation were both up- (CXCL9 and FN1) and downregulated (REG3A) in our analysis. The REG3A gene has been reported to be downregulated in gastric cancers and may be involved in cell adhesion and protection from oxidative stress-induced apoptosis. REG3A has also been reported to bind fibronectin (FN1) and is implicated in cell-cell interaction, differentiation and metastasis [62]. CXCL9 is a C-X-C Motif chemokine ligand that encodes a protein thought to be involved in T cell trafficking [61].

Interestingly, a common expression pattern of 38 genes was consistently associated with intestinal-type gastric carcinoma worldwide, irrespective of the incidence of the disease or heterogeneity of the population. This molecular signature includes genes that participate in processes important to cell homeostasis. This common signature may be useful as a molecular profile of intestinal gastric cancers and warrant exploration since our data indicate a reproducible worldwide framework for this histological type.

Moreover, among these 38 genes, CXCL9, HMGCS2, SULT1B1, PSCA, SPARC, THBS2 and THY1 could predict overall survival. This new gene panel may help guide investigations of new targets to develop novel therapies and customize treatment to improve the overall survival of patients with intestinal gastric cancer.


Patient samples

All tumor tissues and non-tumor control tissues were obtained from patients diagnosed with gastric adenocarcinoma intestinal type by the Lauren classification at the Hospital João de Barros Barreto, Universidade Federal do Pará (Belém, PA, Brazil). The 44 samples from 22 patients obtained were characterized as shown in Table 3. These patients were stratified into two cohorts: chip array cohort (n = 8) and RT-qPCR cohort (n = 17). All samples were obtained in accordance with the guidelines of the local Ethics Committee and the Helsinki Declaration. The procedures were previously approved by the institutional review board, and all participants signed informed consent forms. This study was approved by the National Ethics Committee (Conselho Nacional de Ética em Pesquisa–CONEP) and the local institutional committee.

Table 3: List of Brazilian patients with intestinal-type gastric carcinoma that participated in this study

Sample laboratory code

TNM classification

Chiparray cohort

RT-qPCR confirmation cohor









































































































































Expression chip array data analysis

An RNeasy Mini kit (Qiagen, CA, USA) was used to obtain total RNA from intestinal-type gastric carcinoma and non-tumor control tissues according to the manufacturer’s instructions. One hundred nanograms (100 ng) of total RNA were used to synthesize biotinylated cRNA using a GeneChip Whole Transcription (WT) Sense Target Labeling Assay Kit (Affymetrix, CA, USA). The biotinylated cRNA was then hybridized to GeneChip Human Exon 1.0 ST Arrays (Affymetrix, CA, USA), washed and stained according to the manufacturer’s protocols. The GeneChip arrays were scanned using a GeneChip® Scanner 3000. The Affymetrix Expression Console software version 1.0 was used to create summarized expression values (CHP-files), and the robust multichip analysis (RMA) algorithm was applied. The data were analyzed using the Partek® software (http://www.partek.com) [63], and a ≥5-fold change in expression was defined as differential overexpression or downregulation. The pathway analysis and related processes were obtained using the MetaCoreTM software (http://thomsonreuters.com/metacore/).

Quantitative PCR (RT-qPCR)

RT-qPCR analyses were performed using 2 μg of mRNA treated with amplification-grade DNase I (Invitrogen, CA, USA) and reverse transcribed with Superscript III Reverse transcriptase® (Invitrogen, CA, USA). Each reaction was performed with 5 μL of SYBR Green PCR Master Mix® (Applied Biosystems, CA, USA), 2.5 μL of cDNA (10 ng of cDNA) and 2 μM of each primer. The mRNA levels were quantified using the Rotor-Gene 6000 Series software (Corbett, Australia). The reactions were performed in a Rotor-Gene 6000 thermocycler (Corbett, Australia) using the following program: 95°C for 5 min, followed by 45 cycles at 95°C for 15 s with a final extension at 62°C for 40 s. A dissociation curve analysis was used to demonstrate that the amplification efficiency of a specific PCR products for all primers used in this study was equal and that products were specific. The fold-change in expression was calculated using the DDCt method according to Livak and Schmittgen [64]. The expression levels were estimated in triplicate, and Actin and GAPDH were used to normalize gene expression. The following primers were used: TIMP1 Fw (5′-CATC CTGTTGTTGCTGTGGCTGA-3′) and Rev (5′-GGTGG TCTGGTTGACTTCTGGTGT-3′); PGA4 Fw (5′-GCCCA GGATTTCACCGTCGTCTT-3′) and Rev (5′-ACTGTCT CGCTGGTGGACTGGTA-3′); GIF Fw (5′- ATCTAAC CATTGGGCAGCTCGGC-3′) and Rev (5′-GGCCCATAG AAGGCTGATGCTTCAG-3′); KRT20 Fw (5′-AGCAGT GGTACGAAACCAACGC-3′) and Rev (5′- CAGGACAC ACCGAGCATTTTGCA-3′); CHGA Fw (5′-GCTCCCT GTGAACAGCCCTATGAA-3′) and Rev (5′-GGCTTGGA AAGTGTGTCGGAGATG-3′); MMP7 Fw (5′-TGCAGA AGCCCAGATGTGGAGTG-3′) and Rev (5′-CGATCCT GTAGGTGACCACTTTGG-3′); SPARC Fw (5′-TGCCTG ATGAGACA GAGGTGGT-3′) and Rev (5′-CGGTTT CCTCTGCACCATCA TCAA-3′); AKR1C2 Fw (5′-AAGCTCTAGAGGCCGTCAAATTGG-3′) and Rev (5′-CTC TGGTCGATGGGAATTGCTCC-3′) GAPDH Fw (5′-GT CAACGGATTTGGTC GTATTG-3′) and Rev (5′-TGGAA GATGGTGATGGGATTT-3′), Actin Fw (5′-TTCCTTC CTGGGCATGGAGTC-3′) and Rev (5′-AGACAGC ACTGTGTTGGCGTA-3′). The results were compared using the Mann–Whitney test. The GraphPad PrismTM software (GraphPad Software Inc., CA, USA) was used for the statistical analysis and to prepare graphs.

Unsupervised analysis

Cell intensity (CEL) files storing probe-level intensity data were downloaded from NCBI’s Gene Expression Omnibus (GEO); accession numbers are described in Table 2. The simpleAffy Bioconductor R package was used to preprocess all raw data files. The extraction of probe level data, background correction, normalization using the robust multi-array average (RMA) algorithm, and the mapping of probes to genes were performed for each individual experiment to summarize gene-levels of expression. The datasets were then merged to obtain complete expression data. Non-biological experimental variation or batch effects were adjusted using a parametric empirical Bayes framework using the ComBat function implemented on the sva Bioconductor R package [65].

In the two-dimensional cluster analysis, gene clustering and sample clustering were independently performed using an unsupervised hierarchical clustering algorithm. For gene clustering, pairwise similarity metrics among genes were calculated based on expression ratio measurements across all samples (average linkage clustering using Pearson’s correlation as similarity metric). Similarly, for sample clustering, pairwise similarity measures among samples were calculated using the Euclidean distance based on expression ratio measurements across all significant genes.

TCGA data

Public available RNA-Seq and clinical data from 158 intestinal-type gastric cancer and 15 normal tissues samples from The Cancer Genome Atlas (TCGA) project was downloaded from the NCI’s Genomic Data Commons (GDC) [66] using CGAbiolinks Bioconductor R package [67]. The downloaded files correspond to the clinical data and the HTSeq - counts (gene expression quantification - transcriptome profiling) from the “TCGA Stomach adenocarcinoma (TCGA-STAD) dataset [68]. HTSeq counts were normalized using DESeq2 [69].

Survival analysis

For survival analysis of the 38 individual marker genes, tumor samples were stratified into quartiles according to the expression of each marker: the lower quartile was named Low expression group and the upper, High expression group. The survival curves were computed using the method of Kaplan-Meier and Cox proportional hazards models (survival and survminer R package). Statistical significance was determined using the log-rank test.

Statistical analysis

All experiments were carried out in triplicate, and the data are expressed as the mean ± standard error of the mean. The results were compared using an unpaired Mann–Whitney test, and a p-value <0.05 was considered significant (*p < 0.05, **p < 0.01). The GraphPad PrismTM software (GraphPad Software Inc., CA, USA) was used for statistical analyses and to generate graphs.

Author contributions

Contribution: R.B and E.C.S performed experiments, analyzed the results and prepared the figures. M.B. performed the unsupervised analysis. S.D and P.A provided and characterized the samples. R.B and E.A designed the study and wrote the paper.


The authors have no conflicts of interest to declare.


This work was financially supported by CNPq, FINEP and FAPERJ.


1. Forman D, Burley V. Gastric cancer: global pattern of the disease and an overview of environmental risk factors. Best Pract Res Clin Gastroenterol. 2006; 20:633–649.

2. Karimi P, Islami F, Anandasabapathy S, Freedman ND, Kamangar F. Gastric Cancer: Descriptive Epidemiology, Risk Factors, Screening, and Prevention. Cancer Epidemiology, Biomarkers & Prevention. 2014; 23:700–713.

3. http://globocan.iarc.fr/Pages/fact_sheets_population.aspx.

4. Nagini S. Carcinoma of the stomach: A review of epidemiology, pathogenesis, molecular genetics and chemoprevention. World J Gastrointest Oncol. 2012; 4:156–169.

5. Kumar RK, Raj SS, Shankar EM, Ganapathy E, Ebrahim AS, Farooq SM. Gastric Carcinoma: A Review on Epidemiology, Current Surgical and Chemotherapeutic Options. Gastric Carcinoma - New Insights into Current Management. 2013; 12.

6. Chang MS, Kim WH. Epstein-Barr Virus in Human Malignancy: A Special Reference to Epstein-Barr Virus associated Gastric Carcinoma. Cancer Res Treat. 2005; 37:257–267.

7. Yong X, Tang B, Li BS, Xie R, Hu CJ, Luo G, Qin Y, Dong H, Yang SM. Helicobacter pylori virulence factor CagA promotes tumorigenesis of gastric cancer via multiple signaling pathways. Cell Commun Signal. 2015; 13:30.

8. Cho J, Kang MS, Kim KM. Epstein-Barr Virus-Associated Gastric Carcinoma and Specific Features of the Accompanying Immune Response. J Gastric Cancer. 2016; 16:1–7. Review.

9. Lauren P. The two histological main types of gastric carcinoma: diffuse and so-called intestinal-type carcinoma. an attempt at a histo-clinical classification. Acta Pathol Microbiol Scand. 1965; 64:31–49.

10. El-Omar EM, Carrington M, Chow WH, McColl KE, Bream JH, Young HA, Herrera J, Lissowska J, Yuan CC, Rothman N, Lanyon G, Martin M, Fraumeni JF Jr, et al. Interleukin-1 polymorphisms associated with increased risk of gastric cancer. Nature. 2000; 404:398–402.

11. Henson DE, Dittus C, Younes M, Nguyen H, Albores-Saavedra J. Differential trends in the intestinal and diffuse types of gastric carcinoma in the United States, 1973–2000: increase in the signet ring cell type. Arch Pathol Lab Med. 2004; 128:765–770.

12. Vauhkonen M, Vauhkonen H, Sipponen P. Pathology and molecular biology of gastric cancer. Best Pract Res Clin Gastroenterol. 2006; 20:651–674.

13. Carcas LP. Gastric cancer review. J Carcinog. 2014; 13:14.

14. Pinheiro DR, Ferreira WA, Barros MB, Araújo MD, Rodrigues-Antunes S, Borges BN. Perspectives on new biomarkers in gastric cancer: diagnostic and prognostic applications. World J Gastroenterol. 2014; 20:11574–85.

15. Liu HS, Xiao HS. MicroRNAs as potential biomarkers for gastric cancer. World J Gastroenterol. 2014; 20:12007–12017.

16. Dang Y, Lan F, Ouyang X, Wang K, Lin Y, Yu Y, Wang L, Wang Y, Huang Q. Expression and clinical significance of long non-coding RNA HNF1A-AS1 in human gastric cancer. World J Surg Oncol. 2015; 13:302–308.

17. Bessède E, Dubus P, Mégraud F, Varon C. Helicobacter pylori infection and stem cells at the origin of gastric cancer. Oncogene. 2015; 34:2547–2555.

18. McLean MH, El-Omar EM. Genetics of gastric cancer. Nat Rev Gastroenterol Hepatol. 2014; 11:664–674.

19. Jiang HB, Yang TJ, Lu P, Ma YJ. Gene expression profiling of gastric cancer. Eur Rev Med Pharmacol Sci. 2014; 18:2109–1215.

20. Zhang FG, He ZY, Wang Q. Transcriptome profiling of the cancer and normal tissues from gastric cancer patients by deep sequencing. Tumour Biol. 2014; 35:7423–7427.

21. D’Angelo G, Di Rienzo T, Ojetti V. Microarray analysis in gastric cancer: a review. World J Gastroenterol. 2014; 20:11972–11976.

22. Hartgrink HH, Jansen EP, van Grieken NC, van de Velde CJ. Gastric cancer. Lancet. 2009; 374:477–490.

23. Zhang J, Huang JY, Chen YN, Yuan F, Zhang H, Yan FH, Wang MJ, Wang G, Su M, Lu G, Huang Y, Dai H, Ji J, et al. Whole genome and transcriptome sequencing of matched primary and peritoneal metastatic gastric carcinoma. Sci Rep. 2015; 5:13750–13760.

24. Min L, Zhao Y, Zhu S, Qiu X, Cheng R, Xing J, Shao L, Guo S, Zhang S. Integrated Analysis Identifies Molecular Signatures and Specific Prognostic Factors for Different Gastric Cancer Subtypes. Transl Oncol. 2017; 10:99–107.

25. Krupp M, Maass T, Marquardt JU, Staib F, Bauer T, König R, Biesterfeld S, Galle PR, Tresch A, Teufel A. The functional cancer map: a systems-level synopsis of genetic deregulation in cancer. BMC Med Genomics. 2011; 4:53–62.

26. Egeblad M, Werb Z. New functions for the matrix metalloproteinases in cancer progression. Nat Rev Cancer. 2002; 2:161–74.

27. Wielockx B, Libert C, Wilson C. Matrilysin (matrix metalloproteinase-7): a new promising drug target in cancer and inflammation? Cytokine Growth Factor Rev. 2004; 15:111–115.

28. Soleyman-Jahi S, Nedjat S, Abdirad A, Hoorshad N, Heidari R, Zendehdel K. Prognostic significance of matrix metalloproteinase-7 in gastric cancer survival: a meta-analysis. PLoS One. 2015; 10:e0122316.

29. Koskensalo S, Mrena J, Wiksten JP, Nordling S, Kokkola A, Hagström J, Haglund C. MMP-7 overexpression is an independent prognostic marker in gastric cancer. Tumour Biol. 2010; 31:149–155.

30. Wang Z, Hao B, Yang Y, Wang R, Li Y, Wu Q. Prognostic role of SPARC expression in gastric cancer: a meta-analysis. Arch Med Sci. 2014; 10:863–869.

31. Wang CS, Lin KH, Chen SL, Chan YF, Hsueh S. Overexpression of SPARC gene in human gastric carcinoma and its clinic-pathologic significance. Br J Cancer. 2004; 91:1924–1930.

32. Peng L, Yanjiao M, Ai-guo W, Pengtao G, Jianhua L, Ju Y, Hongsheng O, Xichen Z. A fine balance between CCNL1 and TIMP1 contributes to the development of breast cancer cells. Biochem Biophys Res Commun. 2011; 409:344–349.

33. Bjerre C, Vinther L, Belling KC, Würtz SØ, Yadav R, Lademann U, Rigina O, Do KN, Ditzel HJ, Lykkesfeldt AE, Wang J, Nielsen HB, Brünner N, et al. TIMP1 overexpression mediates resistance of MCF-7 human breast cancer cells to fulvestrant and down-regulates progesterone receptor expression. Tumour Biol. 2013; 34:3839–3851.

34. Song G, Xu S, Zhang H, Wang Y, Xiao C, Jiang T, Wu L, Zhang T, Sun X, Zhong L, Zhou C, Wang Z, Peng Z, et al. TIMP1 is a prognostic marker for the progression and metastasis of colon cancer through FAK-PI3K/AKT and MAPK pathway. J Exp Clin Cancer Res. 2016; 35:148–159.

35. Grunnet M, Mau-Sørensen M, Brünner N. Tissue inhibitor of metalloproteinase 1 (TIMP-1) as a biomarker in gastric cancer: a review. Scand J Gastroenterol. 2013; 48:899–905.

36. Alpízar-Alpízar W, Laerum OD, Christensen IJ, Ovrebo K, Skarstein A, Høyer-Hansen G, Ploug M, Illemann M. Tissue Inhibitor of Metalloproteinase-1 Is Confined to Tumor-Associated Myofibroblasts and Is Increased with Progression in Gastric Adenocarcinoma. J Histochem Cytochem. 2016; 64:483–494.

37. Wang F, Song G, Liu M, Li X, Tang H. miRNA-1 targets fibronectin1 and suppresses the migration and invasion of the HEp2 laryngeal squamous carcinoma cell line. FEBS Lett. 2011; 585:3263–3269.

38. Dooley TP, Reddy SP, Wilborn TW, Davis RL. Biomarkers of human cutaneous squamous cell carcinoma from tissues and cell lines identified by DNA microarrays and qRT-PCR. Biochem Biophys Res Commun. 2003; 306:1026–36.

39. Chen SH, Lin CY, Lee LT, Chang GD, Lee PP, Hung CC, Kao WT, Tsai PH, Schally AV, Hwang JJ, Lee MT. Up-regulation of fibronectin and tissue transglutaminase promotes cell invasion involving increased association with integrin and MMP expression in A431 cells. Anticancer Res. 2010; 30:4177–4186.

40. Howe EN, Cochrane DR, Richer JK. Targets of miR-200c mediate suppression of cell motility and anoikis resistance. Breast Cancer Res. 2011; 13:R45.

41. Chang L, Guo F, Wang Y, Lv Y, Huo B, Wang L, Liu W. MicroRNA-200c regulates the sensitivity of chemotherapy of gastric cancer SGC7901/DDP cells by directly targeting RhoE. Pathol Oncol Res. 2014; 20:93–98.

42. Xu TP, Huang MD, Xia R, Liu XX, Sun M, Yin L, Chen WM, Han L, Zhang EB, Kong R, De W, Shu YQ. Decreased expression of the long non-coding RNA FENDRR is associated with poor prognosis in gastric cancer and FENDRR regulates gastric cancer cell metastasis by affecting fibronectin1 expression. J Hematol Oncol. 2014; 7:63–77.

43. Matsuda Y, Yamamoto T, Kudo M, Kawahara K, Kawamoto M, Nakajima Y, Koizumi K, Nakazawa N, Ishiwata T, Naito Z. Expression and roles of lumican in lung adenocarcinoma and squamous cell carcinoma. Int J Oncol. 2008; 33:1177–85.

44. Rajkumar T, Vijayalakshmi N, Gopal G, Sabitha K, Shirley S, Raja UM, Ramakrishnan SA. Identification and validation of genes involved in gastric tumorigenesis. Cancer Cell Int. 2010; 10:45–56.

45. Zhu YH, Yang F, Zhang SS, Zeng TT, Xie X, Guan XY. High expression of biglycan is associated with poor prognosis in patients with esophageal squamous cell carcinoma. Int J Clin Exp Pathol. 2013; 6:2497–2505.

46. Mikula M, Rubel T, Karczmarski J, Goryca K, Dadlez M, Ostrowski J. Integrating proteomic and transcriptomic high-throughput surveys for search of new biomarkers of colon tumors. Funct Integr Genomics. 2011; 11:215–224.

47. Weber CK, Sommer G, Michl P, Fensterer H, Weimer M, Gansauge F, Leder G, Adler G, Gress TM. Biglycan is overexpressed in pancreatic cancer and induces G1-arrest in pancreatic cancer cell lines. Gastroenterology. 2001; 121:657–667.

48. Hu L, Duan YT, Li JF, Su LP, Yan M, Zhu ZG, Liu BY, Yang QM. Biglycan enhances gastric cancer invasion by activating FAK signaling pathway. Oncotarget. 2014; 5:1885–96. https://doi.org/10.18632/oncotarget.1871.

49. Hu L, Zang MD, Wang HX, Li JF, Su LP, Yan M, Li C, Yang QM, Liu BY, Zhu ZG. Biglycan stimulates VEGF expression in endothelial cells by activating the TLR signaling pathway. Mol Oncol. 2016; 10:1473–1484.

50. Raja UM, Gopal G, Rajkumar T. Intragenic DNA methylation concomitant with repression of ATP4B and ATP4A gene expression in gastric cancer is a potential serum biomarker. Asian Pac J Cancer Prev. 2012; 13:5563–5568.

51. Xiao P, Ling H, Lan G, Liu J, Hu H, Yang R. Trefoil factors: Gastrointestinal-specific proteins associated with gastric cancer. Clin Chim Acta. 2015; 450:127–134.

52. Terazono K, Yamamoto H, Takasawa S, Shiga K, Yonemura Y, Tochino Y, Okamoto H. A novel gene activated in regenerating islets. J Biol Chem. 1988; 263:2111–4.

53. Watanabe T, Yonekura H, Terazono K, Yamamoto H, Okamoto H. Complete nucleotide sequence of human reg gene and its expression in normal and tumoral tissues. The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene. J Biol Chem. 1990; 265:7432–7439.

54. Astrosini C, Roeefzaad C, Dai YY, Dieckgraefe BK, Jöns T, Kemmner W. REG1A expression is a prognostic marker in colorectal cancer and associated with peritoneal carcinomatosis. Int J Cancer. 2008; 123:409–413.

55. Kimura M, Naito H, Tojo T, Itaya-Hironaka A, Dohi Y, Yoshimura M, Nakagawara K, Takasawa S, Taniguchi S. REG Iα gene expression is linked with the poor prognosis of lung adenocarcinoma and squamous cell carcinoma patients via discrete mechanisms. Oncol Rep. 2013; 30:2625–2631.

56. Li Q, Wang H, Zogopoulos G, Shao Q, Dong K, Lv F, Nwilati K, Gui XY, Cuggia A, Liu JL, Gao ZH. Reg proteins promote acinar-to-ductal metaplasia and act as novel diagnostic and prognostic markers in pancreatic ductal adenocarcinoma. Oncotarget. 2016; 7:77838–53. https://doi.org/10.18632/oncotarget.12834.

57. Rindi G, Buffa R, Sessa F, Tortora O, Solcia E. Chromogranin A, B and C immunoreactivities of mammalian endocrine cells. Distribution, distinction from costored hormones/prohormones and relationship with the argyrophil component of secretory granules. Histochemistry. 1986; 85:19–28.

58. Taupenot L, Harper KL, O’Connor DT. The chromogranin-secretogranin family. N Engl J Med. 2003; 348:1134–1149.

59. Bakkelund K, Fossmark R, Nordrum I, Waldum H. Signet ring cells in gastric carcinomas are derived from neuroendocrine cells. J Histochem Cytochem. 2006; 54:615–621.

60. Fujiyoshi Y, Eimoto T. Chromogranin A expression correlates with tumour cell type and prognosis in signet ring cell carcinoma of the stomach. Histopathology. 2008; 52:305–313.

61. Verbeke H, Geboes K, Van Damme J, Struyf S. The role of CXC chemokines in the transition of chronic inflammation to esophageal and gastric cancer. Biochim Biophys Acta. 2012; 1825:117–129.

62. Choi B, Suh Y, Kim WH, Christa L, Park J, Bae CD. Downregulation of regenerating islet-derived 3 alpha (REG3A) in primary human gastric adenocarcinomas. Exp Mol Med. 2007; 39:796–804.

63. Partek® Discovery SuiteTM. Version 6.3. St. Louis, MO: Partek. Inc. 2008.

64. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001; 25:402–408.

65. Leek JT, Johnson WE, Parker HS, Fertig EJ, Jaffe AE, Storey JD. sva: Surrogate Variable Analysis. R package version 3.24.0.

66. Grossman RL, Heath AP, Ferretti V, Varmus HE, Lowy DR, Kibbe WA, Staudt LM. Toward a Shared Vision for Cancer Genomic Data. N Engl J Med. 2016; 375: 1109–1112.

67. Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, Sabedot TS, Malta TM, Pagnotta SM, Castiglioni I, Ceccarelli M, Bontempi G, Noushmehr H. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2016; 44:e71.

68. Bass AJ, Thorsson V, Shmulevich I, Reynolds SM, Miller M, Bernard B, Hinoue T, Laird PW, Curtis C, Shen H, Weisenberger DJ, Schultz N, Shen R, et al, and Cancer Genome Atlas Research Network. Comprehensive molecular characterization of gastric adenocarcinoma. Nature. 2014; 513:202–9.

69. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15:550.

Creative Commons License All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 23670