Multiple functional SNPs in differentially expressed genes modify risk and survival of non-small cell lung cancer in Chinese female non-smokers
Metrics: PDF 1802 views | HTML 1820 views | ?
Xue Fang1,2, Zhihua Yin1,2, Xuelian Li1,2, Lingzi Xia 1,2, Xiaowei Quan1,2, Yuxia Zhao3, Baosen Zhou1,2
1Department of Epidemiology, School of Public Health, China Medical University, Shenyang, China
2Key Laboratory of Cancer Etiology and Prevention, China Medical University, Liaoning Provincial Department of Education, Liaoning, China
3Department of Radiotherapy, The Fourth Affiliated Hospital of China Medical University, Shenyang, China
Baosen Zhou, email: [email protected]
Keywords: differentially expressed genes, functional single nucleotide polymorphism, non-small cell lung cancer, risk, survival
Received: December 02, 2016 Accepted: January 11, 2017 Published: January 27, 2017
DNA genotype can affect gene expression, and gene expression can influence the onset and progression of diseases. Here we conducted a comprehensive study, we integrated analysis of gene expression profile and single nucleotide polymorphism (SNP) microarray data in order to scan out the critical genetic changes that participate in the onset and development of non-small cell lung cancer (NSCLC). Gene expression profile datasets were downloaded from the GEO database. Firstly, differentially expressed genes (DEGs) between NSCLC samples and adjacent normal samples were identified. Next, by STRING database, protein-protein interaction (PPI) network was constructed. At the same time, hub genes in PPI network were identified. Then, some functional SNPs in hub genes that may affect gene expression have been annotated. Finally, we carried a study to explore the relationship between functional SNPs and NSCLC risk and overall survival in Chinese female non-smokers. A total of 488 DEGs were identified in our study. There are 29 proteins with a higher degree of connectivity in the PPI network, including FOS, IL6 and MMP9. By using database annotation, we got 8 candidate functional SNPs that may affect the expression level of hub proteins. In the case-control study, we found that rs4754-T allele, rs959173-C allele and rs2239144-G allele were the protective allele of NSCLC risk. In dominant model, rs4754-CT+TT genotype were associated with a shorter survival time. In general, our study provides a novel research direction in the field of multi-omic data integration, and helps us find some critical genetic changes in disease.
All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.