CancerDiscover: an integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data

Akram Mohammed; Greyson Biegert; Jiri Adamec; Tomáš Helikar

doi:10.18632/oncotarget.23511

Oncotarget

Oncotarget (a primarily oncology-focused, peer-reviewed, open access journal) aims to maximize research impact through insightful peer-review; eliminate borders between specialties by linking different fields of oncology, cancer research and biomedical sciences; and foster application of basic and clinical science.

Its scope is unique. The term "oncotarget" encompasses all molecules, pathways, cellular functions, cell types, and even tissues that can be viewed as targets relevant to cancer as well as other diseases. The term was introduced in the inaugural Editorial, Introducing Oncotarget.

As of January 1, 2022, Oncotarget has shifted to a continuous publishing model. Papers will now be published continuously within yearly volumes in their final and complete form and then quickly released to Pubmed.

Subscribe to receive alerts once a paper has been published by Oncotarget.

Impact Journals, LLC is the publisher of Oncotarget: www.impactjournals.com.

Impact Journals is a member of the Wellcome Trust List of Compliant Publishers.

Impact Journals is a member of the Society for Scholarly Publishing.

On December 23, 2022, Oncotarget server experienced a DDoS attack. As a result, Oncotarget site was inaccessible for a few hours. Oncotarget team swiftly dealt with the situation and took it under control. This malicious action will be reported to the FBI.

Research Papers:

CancerDiscover: an integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data

Akram Mohammed, Greyson Biegert, Jiri Adamec and Tomáš Helikar _

PDF | HTML | Supplementary Files | How to cite

Oncotarget. 2018; 9:2565-2573. https://doi.org/10.18632/oncotarget.23511

Metrics: PDF 1176 views | HTML 2610 views | ?

Abstract

Akram Mohammed1,*, Greyson Biegert1,*, Jiri Adamec1 and Tomáš Helikar1

1Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America

*These authors have contributed equally to this work

Correspondence to:

Tomáš Helikar, email: [email protected]

Keywords: open-source; cancer classification; gene expression; machine learning; cancer biomarker

Received: September 28, 2017 Accepted: December 09, 2017 Published: December 20, 2017

ABSTRACT

Accurate identification of cancer biomarkers and classification of cancer type and subtype from High Throughput Sequencing (HTS) data is a challenging problem because it requires manual processing of raw HTS data from various sequencing platforms, quality control, and normalization, which are both tedious and time-consuming. Machine learning techniques for cancer class prediction and biomarker discovery can hasten cancer detection and significantly improve prognosis. To date, great research efforts have been taken for cancer biomarker identification and cancer class prediction. However, currently available tools and pipelines lack flexibility in data preprocessing, running multiple feature selection methods and learning algorithms, therefore, developing a freely available and easy-to-use program is strongly demanded by researchers. Here, we propose CancerDiscover, an integrative open-source software pipeline that allows users to automatically and efficiently process large high-throughput raw datasets, normalize, and selects best performing features from multiple feature selection algorithms. Additionally, the integrative pipeline lets users apply different feature thresholds to identify cancer biomarkers and build various training models to distinguish different types and subtypes of cancer. The open-source software is available at https://github.com/HelikarLab/CancerDiscover and is free for use under the GPL3 license.

All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 23511

Publication Alerts

Research Papers:

CancerDiscover: an integrative pipeline for cancer biomarker and cancer class prediction from high-throughput sequencing data

Abstract