Research Papers:

Genetic variants in regulatory regions of microRNAs are associated with lung cancer risk

PDF |  HTML  |  Supplementary Files  |  Order a Reprint

Oncotarget. 2016; 7:47966-47974. https://doi.org/10.18632/oncotarget.10299

Metrics: PDF 1110 views  |   HTML 903 views  |   ?  

Kaipeng Xie, Cheng Wang, Na Qin, Jianshui Yang, Meng Zhu, Juncheng Dai, Guangfu Jin, Hongbing Shen, Hongxia Ma and Zhibin Hu _


Kaipeng Xie1,2,*, Cheng Wang1,2,*, Na Qin1,2, Jianshui Yang1,2, Meng Zhu1,2, Juncheng Dai1,2, Guangfu Jin1,2, Hongbing Shen1,2, Hongxia Ma1,2, Zhibin Hu1,2

1Department of Epidemiology and Biostatistics, Collaborative Innovation Center of Cancer Medicine, School of Public Health, Nanjing Medical University, Nanjing, 211166, China

2Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, School of Public Health, Nanjing Medical University, Nanjing, 211166, China

*These authors contributed equally to this work

Correspondence to:

Zhibin Hu, email: zhibin_hu@njmu.edu.cn

Hongxia Ma, email: hongxiama@njmu.edu.cn

Keywords: non-small cell lung cancer, microRNA, susceptibility, survival

Received: February 18, 2016     Accepted: May 28, 2016     Published: June 25, 2016


Genetic variants in regulatory regions of some miRNAs might be associated with lung cancer risk and survival. We performed a case-control study including 1341 non-small cell lung cancer (NSCLC) cases and 1982 controls to evaluate the associations of 7 potentially functional polymorphisms in several differently expressed miRNAs with NSCLC risk. Each SNP was also tested for the association with overall survival of 1001 NSCLC patients. We identified that rs9660710 in miR-200b/200a/429 cluster and rs763354 in miR-30a were significantly associated with NSCLC risk [odds ratio (OR) = 1.17, 95% confidence interval (CI) = 1.06–1.30, P = 0.002; OR = 0.88, 95% CI = 0.80–0.98, P = 0.017; respectively]. However, no significant association between variants and NSCLC death risk was observed in survival analysis. Functional annotation showed that both rs9660710 and rs763354 were located in regulatory elements in lung cancer cells. Compared to normal tissues, miR-200a-3p, miR-200a-5p, miR-200b-3p, miR-200b-5p and miR-429 were significantly increased in The Cancer Genome Atlas (TCGA) Lung Adenocarcinoma (LUAD) tumors, whereas miR-30a-3p and miR-30a-5p were significantly decreased in tumors (all P < 0.05). Furthermore, we observed that rs9660710 is an expression quantitative trait locus (eQTL) or methylation eQTL for miR-429 expression in TCGA normal tissues. Our results indicated that rs9660710 in miR-200b/200a/429 cluster and rs763354 in miR-30a might modify the susceptibility to NSCLC.

Genetic variants in regulatory regions of microRNAs are associated with lung cancer risk | Xie | Oncotarget


Non-small cell lung cancer (NSCLC) accounts for over 80% of lung cancer, which remains the leading cause of cancer-related death worldwide [1, 2]. Although tobacco consumption is a major risk factor for lung cancer, accumulating evidence suggests that genetic variations may play a vital role in tumor development and progression [36]. Therefore, identification of these genetic biomarkers may enhance risk prediction and improve clinical outcomes of NSCLC patients.

Emerging evidence has suggested that some key microRNAs (miRNAs) are consistently dysregulated in lung cancer tissues and function as oncogenes or tumor suppressor genes in the initiation and progression of lung cancer [710]. Notably, Guan et al. synthesized fourteen microarray-based human lung cancer miRNA expression profiling studies including a total of 585 tumor and 519 non-cancerous control samples, and employed a vote-counting strategy to identify several consistent differentially miRNAs, such as up-regulated miR-210, miR-21, miR-183, miR-182, miR-200b, and down-regulated miR-30a, miR-145, miR-143 and miR-451a [8]. Furthermore, using a meta-analysis with more than 1100 lung cancer and noncancerous samples from 20 original studies, another study also confirmed these abnormally expressed miRNAs [9]. Meanwhile, previous studies have suggested that the high expression of miR-210, miR-21, miR-183, miR-182, and low expression of miR-30a, miR-145, miR-143 were associated with poor prognosis of lung cancer [1116]. These studies provided the promising candidates for further researches on miRNAs in lung cancer development and progression.

Interestingly, the biological function of these miRNAs has also been explored in lung cancer. For example, miR-21 located in the region of 17q23.1 promotes lung tumorigenesis through inhibition of negative regulators of the Ras/MEK/ERK pathway and apoptosis [17]. MiR-210 localizes in chromosome 11p15.5 and acts as an important regulator of the cellular response to hypoxia [18]. MiR-182 and miR-183 belong to the members of the miR-183/96/182 cluster, the overexpression of which has been identified as a potential risk factor for lung cancer development [19]. MiR-200b/200a/429 clustered at chromosome 1p36 and miR-30a located at 6q13 are key regulators of epithelial-mesenchymal transition (EMT) and thus affect the cell migration, invasion and metastasis in lung cancer [2022]. MiR-451a inhibits ras-related protein 14 (RAB14) expression and functions as a tumor suppressor in human non-small cell lung cancer [10]. MiR-145 and miR-143 located in 5q33.1 inhibit NSCLC cell growth and metastasis by targeting epidermal growth factor receptor (EGFR) and nucleoside diphosphate linked moiety X-type motif 1 (NUDT1) or LIM domain kinase 1 (Limk1) [23, 24]. Taken together, these miRNAs have been identified to play the important role in the NSCLC tumorigenesis and progression.

In recent years, comprehensive analyses based on genome-wide association studies (GWAS) have indicated that disease/trait-associated loci are enriched in the genomic regulatory regions [2527]. Thus, it is biologically plausible that single nucleotide polymorphisms (SNPs) in the regulatory regions of miRNA may also contribute to phenotypic variations in the human population [25]. Therefore, we evaluate the effect of potentially functional polymorphisms within the 10 kb upstream region of the above-mentioned key miRNAs cluster or single miRNA (miR-143/145, miR-183/96/182 cluster, miR-21, miR-210, miR-200b/200a/429 cluster, miR-30a, miR-451a) on the NSCLC risk and survival in a Chinese Han population.


Subjects characteristics

Characteristics of all subjects (1341 cases and 1982 controls) have been described in Supplementary Table S1. The age and gender were comparable between cases and controls (Page = 0.972, Pgender = 0.179). The cases have a higher rate of smoking (61.1%) compared with the controls (48.5%), and most cases (64.1%) are diagnosed with adenocarcinoma.

Associations of 7 SNPs with NSCLC risk

Based on the bioinformatics methods (see Materials and Methods), 7 potentially functional SNPs (rs3733846 in miR-143/145, rs12538588 in miR-183/96/182, rs1292060 in miR-21, rs763354 in miR-30a, rs9660710 in miR-200b/200a/429, rs12286521 in miR-210 and rs901975 in miR-451a) were genotyped in our study. The genotype distributions of these SNPs between cases and controls are shown in Table 1. After adjusting for age, gender, and smoking, the minor allele of rs9660710 (C > A) in miR-200b/miR200a/429 demonstrated a significant association with an increased risk of NSCLC [additive model: Odds ratios (OR) = 1.17, 95% confidence intervals (CI) = 1.06–1.30, P = 0.002], while the variant allele of rs763354 (G > A) in miR-30a was associated with a decreased risk (additive model: adjusted OR = 0.88, 95%CI = 0.80–0.98, P = 0.017). No significant association was observed between other five SNPs and NSCLC risk.

Table 1: Primary information of 7 potentially functional SNPs and associations with NSCLC risk

aMajor/minor allele.

bMajor homozygote/heterozygote/minor homozygote in cases and controls.

cMinor allele frequency among cases/controls.

dLogistic regression with adjustment for age, gender and smoking.

ehet: heterozygote versus major homozygote; hom: minor homozygote versus major homozygote.

fP for additive model.

Bold values are statistically significant.

Furthermore, we performed stratification analysis on the associations of rs9660710 and rs763354 with NSCLC risk by age, gender, smoking and histological subtype (Table 2). The heterogeneity test showed that the difference was significant in the subgroup of age for rs763354 (P = 0.040), and the protective effect of rs763354 A allele in subjects aged ≥ 60 years (adjusted OR = 0.80, 95%CI = 0.70–0.92, P = 0.002) was prominent as compared with subjects aged < 60 years (adjusted OR = 0.99, 95%CI = 0.85–1.15, P = 0.847).

Table 2: Stratification analysis of rs9660710 and rs763354 genotypes associated with NSCLC risk

aMajor homozygote/heterozygote/minor homozygote.

bAdjusted for age, gender and smoking where appropriate in additive model.

cP values were from heterogeneity test based on χ2-based Q test.

Bold values are statistically significant.

Relationships between 7 SNPs and NSCLC survival

We further investigated the relationships between these SNPs and NSCLC survival. The clinical characteristics of 1001 NSCLC patients are described in Supplementary Table S2. We found that some factors, including gender, smoking, surgery, clinical stage and chemotherapy or radiotherapy had significant influences on patient prognosis (log-rank P < 0.05). Associations of these SNPs with overall survival in different genetic models are shown in Supplementary Table S3. Log-rank test indicated that no variant showed a significant association with overall survival time in additive, dominant or co-dominant models (all P > 0.05). The results were similar after adjusting for age, gender, smoking, clinical stage, chemotherapy or radiotherapy, surgery status and histological types (all P > 0.05, Supplementary Table S4).

In silico analysis of promising variants and miRNAs

We annotated two significantly risk-associated variants (rs763354, rs9660710) in regulatory elements cataloged in Encyclopedia of DNA Elements (ENCODE) project (https://www.encodeproject.org/), and HaploReg (http://compbio.mit.edu/HaploReg), respectively. Visual inspection of the SNPs and histone modification peaks showed that rs763354 is situated within the enhancer element (H3K4me1and H3K27ac histone mark) in both normal lung cell line (Normal Human Lung Fibroblasts, NHLF) and lung cancer cell line (A549), and fall into the promoter element (H3K4me3) in A549 cell line (Figure 1A). Rs9660710 is situated in the enhancer element (H3K27ac) in A549 cell line (Figure 1B). Furthermore, based on the HaploReg data, we found that rs763354, rs9660710 and their correlated variants within a LD block are positioned in regulatory regions (histone promoter or enhancer marks in several tissues), and alter multiple regulatory motifs or disturb some proteins bound (Supplementary Tables S5–S6). Taken together, our results indicated that these SNPs were probably involved in the regulation of miRNA expression.

Chromatin features of risk-associated SNPs.

Figure 1: Chromatin features of risk-associated SNPs. Functional annotation in proximity to miR-30a rs763354 location (A) and miR-200b/200a/429 cluster rs9660710 location (B) in A549 and NHLF cells. ChIP-seq tracks for promoter histone marks (H3K4me3) and enhancer histone marks (H3K4me1, H3K27ac) in A549 and NHLF cells are present along with DNAse hypersensitivity tracks from ENCODE data. The black vertical lines guide the position of risk-associated SNPs. The blue stripe represents the normal lung fibroblasts (NHLF). The red stripe represents the lung cancer cell line (A549). The green stripe represents the CpG island in the region.

To further explore the influence of rs763354 and rs9660710 on their related miRNAs (miR-30a-3p, miR-30-5p, miR-200a-3p, miR-200a-5p, miR-200b-3p, miR-200b-5p and miR-429) in NSCLC development, we used publicly available TCGA Lung Adenocarcinoma (LUAD) datasets and firstly evaluated the association of rs763354 and rs9660710 genotypes with their mature miRNAs expression in 38 adjacent normal tissues. We observed that rs9660710 (C > A) was positively associated with miR-429 expression (rho = 0.388, P = 0.016), indicating that rs9660710 may be an expression quantitative trait locus (eQTL) for miR-429. However, no significant association was observed between this SNP and the expression of miR-200b (PmiR-200b-3p = 0.059, PmiR-200b-5p = 0.190) and miR-200a (PmiR-200a-3p = 0.093, PmiR-200a-5p = 0.120). Additionally, there was no significant association between rs763354 and miR-30a expression (PmiR-30a-5p = 0.833, PmiR-30a-3p = 0.648). Subsequently, we evaluated these miRNA expression levels in 39 paired tissues. Consistent with previous studies, miR-30a-3p and miR-30a-5p were significantly decreased in tumor tissues in comparison with paired normal tissues while others (miR-200a-3p, miR-200a-5p, miR-200b-3p, miR-200b-5p and miR-429) were significantly increased in tumor tissues (Figure 2). Taken together, the minor allele of rs9660710 may increase the miR-429 expression and thus contribute to lung tumorigenesis.

Expression levels of candidate miRNAs in 39 paired TCGA LUAD tissues.

Figure 2: Expression levels of candidate miRNAs in 39 paired TCGA LUAD tissues. miR-200a-3p (A), miR-200a-5p (B), miR-200b-3p (C), miR-200b-5p (D), miR-429 (E) were significantly increased in tumor tissues in comparison with paired normal tissues, while miR-30a-5p (F), miR-30a-3p (G) were significantly decreased in tumor tissues. The statistical analysis was performed using paired student’s t-test.

Interestingly, rs9660710 (C > A) located in the CpG island in the promoter of miR-200b/200a/429 can lead to the loss of the CpG site (Figure 1B), which may prevent the CpG site from being methylated and increasing the miR-429 expression. To support the hypothesis, we extracted the TCGA LUAD DNA methylation data, which covers 51 CpG sites in the 10kb 5′ upstream region of miR-200b/200a/429 cluster. We observed that rs9660710 was negatively associated with the methylation level of cg00152708 in 32 normal tissues (rho = –0.427, P = 0.017). Meanwhile, the methylation level of cg00152708 was significantly lower in 29 tumor tissues compared to the adjacent normal tissues (0.457± 0.135 vs 0.553 ± 0.063, P = 0.003). However, due to the limited normal tissues (6 samples) with both DNA methylation and miRNA expression information, we cannot directly analyze the relationship between the methylation level of cg00152708 and miR-429 expression. Therefore, we speculated that rs9660710 (C > A) may decrease cg00152708 methylation level and thus increase the miR-429 expression in lung carcinogenesis, indicating a role of rs9660710 as a methylation quantitative trait locus (meQTL) for miR-429 expression.


In the present study, we explored the associations of 7 SNPs in regulatory regions of candidate miRNAs with the NSCLC risk and survival. Our findings suggested that rs9660710 in miR-200b/200a/429 cluster was associated with an increased NSCLC risk, and miR-30a rs763354 was associated with a decreased NSCLC risk. However, no variant was associated with NSCLC prognosis.

Previous studies demonstrated that abnormal expression of miR-200b/200a/429 cluster and miR-30a could contribute to lung tumorigenesis, and may be potential diagnostic biomarkers for NSCLC. For example, one recent study reported that upregulation of miR-200 could active several key components of the PI3K pathway to promoter lung cancer cells growth and contribute to early lung tumorigenesis [28]. Lang et al. observed that the expression of miR-429 increased in primary NSCLS tissues and cell lines, and overexpression of miR-429 promoted NSCLC cells proliferation through inhibiting PTEN, RASSF8 and TIMP2 expression [29]. For miR-30a, its overexpression was indicated to suppress NSCLC cell proliferation and trigger cell cycle arrest by inhibiting cell cycle regulators (CDK2, CDK4, Cyclin A2, Cyclin D1) [30]. Consistent with these previous findings, we found that miR-200a-3p, miR-200a-5p, miR-200b-3p, miR-200b-5p and miR-429 in our study were significantly overexpressed in tumors compared with adjacent normal tissues in TCGA LUAD data, whereas the expression levels of miR-30a-3p and miR-30a-5p were suppressed in tumors.

To date, only few studies have focused on the associations of above miRNAs polymorphisms with cancer development and prognosis. Leng et al. identified the G allele of rs61768479, a tagSNP located in the promoter region of miR-200a/200b/429, was associated with a 50% reduced risk of lung cancer in Caucasians; however, this association was not confirmed in the validation stage [31]. Wu et al. reported that rs1045385 in the 3′ UTR of AP-2α suppressed the binding of miR-200b/200c/429 to AP-2α, and might be a potential prognostic marker for cisplatin treatment [32]. Another study showed that miR-143/145 rs3733845 confers protective effect on the risk of colorectal cancer in Chinese population [33]. In the present study, we failed to analyze the associations between these two SNPs (rs61768479 and rs1045385) and NSCLC risk and survival, owing to the low frequency in Chinese Han population and the reason that rs1045385 is located in the target gene. But for rs3733845, we did not observed the significant association of this SNP with lung cancer risk in our study, which may be due to the complex effects of polymorphisms on different cancer types. Our study showed for the first time that rs9660710 in miR-200b/200a/429 cluster conferred the increased risk of NSCLC in Chinese population, but miR-30a rs763354 showed a protective effect on NSCLC risk.

In silico analysis suggested that rs763354 and rs9660710 were situated in different histone modification patterns between normal and lung cancer cells, highlighting the possibility that these loci may regulate the transcription of miRNAs. Functional annotations from HaploReg indicated that rs763354 might increase the binding of Sox2, which might bind to the promoter regions of miRNAs [34]. However, we did not observe significant association between rs763354 and miR-30a-3p or miR-30a-5p expression in 38 TCGA adjacent normal tissues. The variant allele of rs9660710 (C > A) is predicted to strengthen the affinity motifs for BRCA1 (Supplementary Table S5), which participates in transcriptional regulation by interacting with a large amount of different transcription factors, including TP53 and CMYC [35]. In our study, we observed that the variant allele of rs9660710 is an eQTL for miR-429 expression and significantly increase the miR-429 expression in normal tissues. Moreover, rs9660710 is a CpG site related SNP (cgSNP) [36] and meQTL for miR-429 expression. Consequently, we speculated that rs9660710 may disrupt transcription factor response elements or DNA methylation to promote the expression of miR-429, respectively. However, further functional characterizations are warranted to validate the genotype-phenotype correlation.

In conclusion, our study provided evidence that SNPs in miRNAs regulatory regions were significantly associated with NSCLC risk, indicating the importance of the functional variants of miRNAs in NSCLC tumorigenesis. Larger studies with functional investigations are needed to validate the findings.


Study subjects

All subjects in this study have been described previously [37]. In brief, 1341 patients with NSCLC were newly diagnosed and recruited from the First Affiliated Hospital of Nanjing Medical University and the Cancer Hospital of Jiangsu Province since 2003. All patients were histopathologically or cytologically confirmed NSCLC. The 1982 controls were selected from healthy individuals participated in screening for non-infectious diseases during the same time period. The controls were frequency-matched to cases on age and gender. After having signed the informed consent, each participant was face-to-face interviewed by trained interviewers to collect personal information on demographics data. Furthermore, we collected patients’ clinical information from patients’ medical records and subsequent personal or family contacts, including the date of diagnosis, surgery status, clinical stage, chemotherapy or radiotherapy, histological types and the date of death during follow-up (last follow-up in August 2013). Finally, a total of 1001 cases (74.6%) had sufficient information about survival time to permit statistical analysis with the median survival time (MST) of 26.0 months. This study was approved by the institutional review board of Nanjing Medical University.

SNPs selection and genotyping

Based on the UCSC Genome Browser database (http://genome.ucsc.edu/) and HapMap SNP database (phase II + III Feb 09, on NCBI B36 assembly, dbSNP b126), 38 common SNPs (MAF ≥ 0.05 in Chinese Han population) were selected in the 10kb 5’ upstream region of each pre-miRNA. If the candidate miRNA is located in a cluster, we then screened the upstream region of the cluster instead of the single miRNA (miR-143/145, miR-183/96/183, miR-21, miR-210, miR-200b/200a/429, miR-30a, miR-451a). The potentially functional SNPs were predicted in silico by using the SNPinfo web server (http://snpinfo.niehs.nih.gov/). Among 16 potentially functional SNPs that might disturb the binding of transcription factors, 9 SNPs were excluded because of high linkage disequilibrium (LD) analyzed by (r2 > 0.8). As a result, 7 SNPs (rs3733846 in miR-143/145, rs12538588 in miR-183/96/182, rs1292060 in miR-21, rs763354 in miR-30a, rs9660710 in miR-200b/200a/429, rs12286521 in miR-210 and rs901975 in miR-451a) were genotyped in our study.

Genomic DNA was isolated from a leukocyte pellet by proteinase K digestion, followed by phenol-chloroform extraction and ethanol precipitation. All SNPs were genotyped by Illumina Infinium® Human Exome BeadChip (Illumina Inc., San Diego, CA). All SNPs were successfully genotyped with call rates > 95%.

In silico analysis

In order to evaluate whether promising SNPs harbor regulatory elements, we downloaded Chromatin Immunoprecipitation sequencing (ChIP-seq) and DNaseI Hypersensitivity (DNaseI HS) data in normal lung cell line (Normal Human Lung Fibroblasts, NHLF) and lung cancer cell line (A549) from Encyclopedia of DNA Elements (ENCODE) project (https://www.encodeproject.org/), and visualized these regulatory elements using Integrative Genomics Viewer (IGV) (http://www.broadinstitute.org/igv/). We also used the HaploReg resource V2 (http://compbio.mit.edu/HaploReg) to examine the risk loci and variants in high LD (r2 > 0.8 in Asian from the 1000 Genomes Project) for functional elements available from the Epigenome roadmap project [38].

To assess miRNA expression, level 3 miRNA isoform quantification data (reads per million of total reads mapping to a mature microRNA) in LUAD tumors and adjacent normal tissues were retrieved from TCGA portal on date 07/15/2014 (http://cancergenome.nih.gov/). We filtered out cross-mapped regions and then summed over the reads per million miRNAs mapped (RPM) values for each mature miRNA according to the previous study [37]. After normalization by using the EdgeR package [39], the expression data of interest miRNAs were extracted and were log2-transformed for further analysis.

In order to bridge the gap between the genotype and phenotype, we have applied for the Affymetrix SNP 6.0 Level 1 data of TCGA LUAD samples to extract genotypes of rs9660710 and rs763354. Because the chip didn’t include the genotype information of rs9660710, we further imputed the genotypes of rs9660710 for TCGA samples by using IMPUTE2 (1000 Genomes Phase I integrated variant set b37 March 2012 release). The level 3 TCGA methylation data (Illumina’s Infinium HumanMethylation450 Beadchip) for 32 LUAD normal tissues and paired 29 tumor tissues were downloaded from the TCGA data portal on date 07/15/2014.

Statistical analysis

Differences between cases and controls in the distribution of demographic characteristics were evaluated by the χ2 test (for categorical variables) or Student’s t-test (for continuous variables). Genotype frequencies in control group were tested for the Hardy-Weinberg equilibrium by using goodness-of-fit χ2 test. OR and their 95% CI were calculated by using logistic regression analysis to evaluate the associations between genotypes and lung cancer risk with adjustment for age, gender and smoking. For survival analysis, the distribution of demographic and clinical variables according to death status was tested by the χ2 test. Log-rank test was applied to compare the survival time in different subgroups categorized by patient characteristics and genotypes. Univariate and multivariate Cox proportional hazard regression analysis were performed to estimate the crude or adjusted hazard ratio (HR) and 95%CI with adjustment of age, gender, smoking, clinical stage, chemotherapy or radiotherapy, surgery status and histological types. The Chi-square-based Q test was applied to test the heterogeneity of associations between subgroups. The paired two-sample Student’s t-test was used to compare the miRNA expression or methylation levels of CpG sites in paired tumor and adjacent normal tissues from TCGA dataset. Spearman’s rank correlation was used to evaluate genotype-expression and genotype-methylation associations. All of the statistical analyses were performed with R software (version 3.2.2; The R Foundation for Statistical Computing). A two-sided P value of < 0.05 was considered statistically significant.


We thank the ENCODE and TCGA Projects for making the data public.


The authors declare that they have no competing interests.


This work was supported by the National Science Foundation for Distinguished Young Scholars of China (81225020, 81270044, 81521004, 81230067), Jiangsu Specially-Appointed Professor project, National Program for Support of Top-notch Young Professionals from the Organization Department of the CPC Central Committee, Jiangsu Province Clinical Science and Technology Projects (BL2012008) and the Priority Academic Program for the Development of Jiangsu Higher Education Institutions (Public Health and Preventive Medicine).


1. Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, Jemal A, Yu XQ, He J. Cancer statistics in China, 2015. CA Cancer J Clin. 2016; 66:115–132

2. Siegel R, Naishadham D, Jemal A. Cancer statistics, 2013. CA Cancer J Clin. 2013; 63:11–30.

3. Cao S, Wang C, Ma H, Yin R, Zhu M, Shen W, Dai J, Shu Y, Xu L, Hu Z, Shen H. Genome-wide Association Study on Platinum-induced Hepatotoxicity in Non-Small Cell Lung Cancer Patients. Scientific reports. 2015; 5:11556.

4. Chu M, Zhang R, Zhao Y, Wu C, Guo H, Zhou B, Lu J, Shi Y, Dai J, Jin G, Ma H, Dong J, Wei Y, et al. A genome-wide gene-gene interaction analysis identifies an epistatic gene pair for lung cancer susceptibility in Han Chinese. Carcinogenesis. 2014; 35:572–577.

5. Dong J, Hu Z, Wu C, Guo H, Zhou B, Lv J, Lu D, Chen K, Shi Y, Chu M, Wang C, Zhang R, Dai J, et al. Association analyses identify multiple new lung cancer susceptibility loci and their interactions with smoking in the Chinese population. Nature genetics. 2012; 44:895–899.

6. Tang S, Pan Y, Wang Y, Hu L, Cao S, Chu M, Dai J, Shu Y, Xu L, Chen J, Jin G, Hu Z, Ma H, et al. Genome-wide association study of survival in early-stage non-small cell lung cancer. Annals of surgical oncology. 2015; 22:630–635.

7. Boeri M, Verri C, Conte D, Roz L, Modena P, Facchinetti F, Calabro E, Croce CM, Pastorino U, Sozzi G. MicroRNA signatures in tissues and plasma predict development and prognosis of computed tomography detected lung cancer. Proceedings of the National Academy of Sciences of the United States of America. 2011; 108:3713–3718.

8. Guan P, Yin Z, Li X, Wu W, Zhou B. Meta-analysis of human lung cancer microRNA expression profiling studies comparing cancer tissues with normal tissues. Journal of experimental & clinical cancer research. 2012; 31:54.

9. Vosa U, Vooder T, Kolde R, Vilo J, Metspalu A, Annilo T. Meta-analysis of microRNA expression in lung cancer. International journal of cancer. 2013; 132:2884–2893.

10. Wang R, Wang ZX, Yang JS, Pan X, De W, Chen LB. MicroRNA-451 functions as a tumor suppressor in human non-small cell lung cancer by targeting ras-related protein 14 (RAB14). Oncogene. 2011; 30:2644–2658.

11. Eilertsen M, Andersen S, Al-Saad S, Richardsen E, Stenvold H, Hald SM, Al-Shibli K, Donnem T, Busund LT, Bremnes RM. Positive prognostic impact of miR-210 in non-small cell lung cancer. Lung cancer (Amsterdam, Netherlands). 2014; 83:272–278.

12. Gao W, Yu Y, Cao H, Shen H, Li X, Pan S, Shu Y. Deregulated expression of miR-21, miR-143 and miR-181a in non small cell lung cancer is related to clinicopathologic characteristics or patient prognosis. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie. 2010; 64:399–408.

13. Stenvold H, Donnem T, Andersen S, Al-Saad S, Busund LT, Bremnes RM. Stage and tissue-specific prognostic impact of miR-182 in NSCLC. BMC cancer. 2014; 14:138.

14. Tang R, Liang L, Luo D, Feng Z, Huang Q, He R, Gan T, Yang L, Chen G. Downregulation of MiR-30a is Associated with Poor Prognosis in Lung Cancer. Medical science monitor. 2015; 21:2514–2520.

15. Xia W, Chen Q, Wang J, Mao Q, Dong G, Shi R, Zheng Y, Xu L, Jiang F. DNA methylation mediated silencing of microRNA-145 is a potential prognostic marker in patients with lung adenocarcinoma. Scientific reports. 2015; 5:16901.

16. Xu F, Zhang H, Su Y, Kong J, Yu H, Qian B. Up-regulation of microRNA-183-3p is a potent prognostic marker for lung adenocarcinoma of female non-smokers. Clinical & translational oncology. 2014; 16:980–985.

17. Hatley ME, Patrick DM, Garcia MR, Richardson JA, Bassel-Duby R, van Rooij E, Olson EN. Modulation of K-Ras-dependent lung tumorigenesis by MicroRNA-21. Cancer cell. 2010; 18:282-293.

18. Grosso S, Doyen J, Parks SK, Bertero T, Paye A, Cardinaud B, Gounon P, Lacas-Gervais S, Noel A, Pouyssegur J, Barbry P, Mazure NM, Mari B. MiR-210 promotes a hypoxic phenotype and increases radioresistance in human lung cancer cell lines. Cell death & disease. 2013; 4:e544.

19. Zhu W, Liu X, He J, Chen D, Hunag Y, Zhang YK. Overexpression of members of the microRNA-183 family is a risk factor for lung cancer: a case control study. BMC cancer. 2011; 11:393.

20. Gill BJ, Gibbons DL, Roudsari LC, Saik JE, Rizvi ZH, Roybal JD, Kurie JM, West JL. A synthetic matrix with independently tunable biochemistry and mechanical properties to study epithelial morphogenesis and EMT in a lung adenocarcinoma model. Cancer research. 2012; 72:6013–6023.

21. Kumarswamy R, Mudduluru G, Ceppi P, Muppala S, Kozlowski M, Niklinski J, Papotti M, Allgayer H. MicroRNA-30a inhibits epithelial-to-mesenchymal transition by targeting Snai1 and is downregulated in non-small cell lung cancer. International journal of cancer. 2012; 130:2044–2053.

22. Pacurari M, Addison JB, Bondalapati N, Wan YW, Luo D, Qian Y, Castranova V, Ivanov AV, Guo NL. The microRNA-200 family targets multiple non-small cell lung cancer prognostic markers in H1299 cells and BEAS-2B cells. International journal of oncology. 2013; 43:548–560.

23. Cho WC, Chow AS, Au JS. MiR-145 inhibits cell proliferation of human lung adenocarcinoma by targeting EGFR and NUDT1. RNA biology. 2011; 8:125–131.

24. Xia H, Sun S, Wang B, Wang T, Liang C, Li G, Huang C, Qi D, Chu X. miR-143 inhibits NSCLC cell growth and metastasis by targeting Limk1. International journal of molecular sciences. 2014; 15:11973–11983.

25. Bulik-Sullivan B, Selitsky S, Sethupathy P. Prioritization of genetic variants in the microRNA regulome as functional candidates in genome-wide association studies. Human mutation. 2013; 34:1049–1056.

26. Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, Reynolds AP, Sandstrom R, Qu H, Brody J, Shafer A, Neri F, Lee K, et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (New York, NY). 2012; 337:1190–1195.

27. Schaub MA, Boyle AP, Kundaje A, Batzoglou S, Snyder M. Linking disease associations with regulatory information in the human genome. Genome research. 2012; 22:1748–1759.

28. Guo L, Wang J, Yang P, Lu Q, Zhang T, Yang Y. MicroRNA-200 promotes lung cancer cell growth through FOG2-independent AKT activation. IUBMB life. 2015.

29. Lang Y, Xu S, Ma J, Wu J, Jin S, Cao S, Yu Y. MicroRNA-429 induces tumorigenesis of human non-small cell lung cancer cells and targets multiple tumor suppressor genes. Biochemical and biophysical research communications. 2014; 450:154–159.

30. Wen XP, Ma HL, Zhao LY, Zhang W, Dang CX. MiR-30a suppresses non-small cell lung cancer progression through AKT signaling pathway by targeting IGF1R. Cellular and molecular biology (Noisy-le-Grand, France). 2015; 61:78–85.

31. Leng S, Bernauer AM, Zhai R, Tellez CS, Su L, Burki EA, Picchi MA, Stidley CA, Crowell RE, Christiani DC, Belinsky SA. Discovery of common SNPs in the miR-205/200 family-regulated epithelial to mesenchymal transition pathway and their association with risk for non-small cell lung cancer. International journal of molecular epidemiology and genetics. 2011; 2:145–155.

32. Wu Y, Xiao Y, Ding X, Zhuo Y, Ren P, Zhou C, Zhou J. A miR-200b/200c/429-binding site polymorphism in the 3’ untranslated region of the AP-2alpha gene is associated with cisplatin resistance. PloS one. 2011; 6:e29043.

33. Li L, Pan X, Li Z, Bai P, Jin H, Wang T, Song C, Zhang L, Gao L. Association between polymorphisms in the promoter region of miR-143/145 and risk of colorectal cancer. Human immunology. 2013; 74:993–997.

34. Vencken SF, Sethupathy P, Blackshields G, Spillane C, Elbaruni S, Sheils O, Gallagher MF, O’Leary JJ. An integrated analysis of the SOX2 microRNA response program in human pluripotent and nullipotent stem cell lines. BMC genomics. 2014; 15:711.

35. Mullan PB, Quinn JE, Harkin DP. The role of BRCA1 in transcriptional regulation and cell cycle control. Oncogene. 2006; 25:5854–5863.

36. Zhou D, Li Z, Yu D, Wan L, Zhu Y, Lai M, Zhang D. Polymorphisms involving gain or loss of CpG sites are significantly enriched in trait-associated SNPs. Oncotarget. 2015; 6:39995–40004. doi: 10.18632/oncotarget.5650.

37. Xie K, Ma H, Liang C, Wang C, Qin N, Shen W, Gu Y, Yan C, Zhang K, Dai N, Zhu M, Wu S, Wang H, et al. A functional variant in miR-155 regulation region contributes to lung cancer risk and survival. Oncotarget. 2015; 6:42781–42792. doi: 10.18632/oncotarget.5840.

38. Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic acids research. 2012; 40:D930–934.

39. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics (Oxford, England). 2010; 26:139–140.

Creative Commons License All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 3.0 License.
PII: 10299