Research Papers:
Systemically identifying and prioritizing risk lncRNAs through integration of pan-cancer phenotype associations
PDF | HTML | Supplementary Files | How to cite
Metrics: PDF 2287 views | HTML 2323 views | ?
Abstract
Chaohan Xu1,*, Rui Qi1,*, Yanyan Ping1,*, Jie Li1, Hongying Zhao1, Li Wang1, Michael Yifei Du3, Yun Xiao1,2, Xia Li1
1College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang, China
2Key Laboratory of Cardiovascular Medicine Research, Harbin Medical University, Ministry of Education, China
3Weston High School of Massachusetts, Massachusetts, USA
*These authors have contributed equally to this work
Correspondence to:
Xia Li, email: [email protected]
Yun Xiao, email: [email protected]
Keywords: disease phenotype association, risk lncRNA, pan cancer, identification and prioritization
Received: July 15, 2016 Accepted: December 12, 2016 Published: January 05, 2017
ABSTRACT
LncRNAs have emerged as a major class of regulatory molecules involved in normal cellular physiology and disease, our knowledge of lncRNAs is very limited and it has become a major research challenge in discovering novel disease-related lncRNAs in cancers. Based on the assumption that diverse diseases with similar phenotype associations show similar molecular mechanisms, we presented a pan-cancer network-based prioritization approach to systematically identify disease-specific risk lncRNAs by integrating disease phenotype associations. We applied this strategy to approximately 2800 tumor samples from 14 cancer types for prioritizing disease risk lncRNAs. Our approach yielded an average area under the ROC curve (AUC) of 80.66%, with the highest AUC (98.14%) for medulloblastoma. When evaluated using leave-one-out cross-validation (LOOCV) for prioritization of disease candidate genes, the average AUC score of 97.16% was achieved. Moreover, we demonstrated the robustness as well as the integrative importance of this approach, including disease phenotype associations, known disease genes and the numbers of cancer types. Taking glioblastoma multiforme as a case study, we identified a candidate lncRNA gene SNHG1 as a novel disease risk factor for disease diagnosis and prognosis. In summary, we provided a novel lncRNA prioritization approach by integrating pan-cancer phenotype associations that could help researchers better understand the important roles of lncRNAs in human cancers.
All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 14510