iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC

Jianhua Jia; Zi Liu; Xuan Xiao; Bingxiang Liu; Kuo-Chen Chou

doi:10.18632/oncotarget.9148

Oncotarget

Oncotarget (a primarily oncology-focused, peer-reviewed, open access journal) aims to maximize research impact through insightful peer-review; eliminate borders between specialties by linking different fields of oncology, cancer research and biomedical sciences; and foster application of basic and clinical science.

Its scope is unique. The term "oncotarget" encompasses all molecules, pathways, cellular functions, cell types, and even tissues that can be viewed as targets relevant to cancer as well as other diseases. The term was introduced in the inaugural Editorial, Introducing Oncotarget.

As of January 1, 2022, Oncotarget has shifted to a continuous publishing model. Papers will now be published continuously within yearly volumes in their final and complete form and then quickly released to Pubmed.

Subscribe to receive alerts once a paper has been published by Oncotarget.

Impact Journals, LLC is the publisher of Oncotarget: www.impactjournals.com.

Impact Journals is a member of the Wellcome Trust List of Compliant Publishers.

Impact Journals is a member of the Society for Scholarly Publishing.

On December 23, 2022, Oncotarget server experienced a DDoS attack. As a result, Oncotarget site was inaccessible for a few hours. Oncotarget team swiftly dealt with the situation and took it under control. This malicious action will be reported to the FBI.

Research Papers:

iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC

Jianhua Jia _, Zi Liu, Xuan Xiao, Bingxiang Liu and Kuo-Chen Chou

PDF | HTML | Supplementary Files | How to cite

Oncotarget. 2016; 7:34558-34570. https://doi.org/10.18632/oncotarget.9148

Metrics: PDF 1368 views | HTML 3426 views | ?

Abstract

Jianhua Jia1,2, Zi Liu3, Xuan Xiao1,2, Bingxiang Liu1, Kuo-Chen Chou2,4

1Computer Department, Jing-De-Zhen Ceramic Institute, Jing-De-Zhen 333403 China

2Gordon Life Science Institute, Boston, MA 02478, USA

3School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China

4Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah 21589, Saudi Arabia

Correspondence to:

Jianhua Jia, email: [email protected]

Xuan Xiao, email: [email protected]

Kuo-Chen Chou, email: [email protected]

Keywords: carbonylation, sequence-coupling model, PseAAC, Monte Carlo sampling, random forest algorithm

Received: March 06, 2016 Accepted: April 09, 2016 Published: May 03, 2016

ABSTRACT

Carbonylation is a posttranslational modification (PTM or PTLM), where a carbonyl group is added to lysine (K), proline (P), arginine (R), and threonine (T) residue of a protein molecule. Carbonylation plays an important role in orchestrating various biological processes but it is also associated with many diseases such as diabetes, chronic lung disease, Parkinson’s disease, Alzheimer’s disease, chronic renal failure, and sepsis. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of K, P, R, or T, which ones can be carbonylated, and which ones cannot? To address this problem, we have developed a predictor called iCar-PseCp by incorporating the sequence-coupled information into the general pseudo amino acid composition, and balancing out skewed training dataset by Monte Carlo sampling to expand positive subset. Rigorous target cross-validations on a same set of carbonylation-known proteins indicated that the new predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iCar-PseCp has been established at http://www.jci-bioinfo.cn/iCar-PseCp, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the formulation and approach presented here can also be used to analyze many other problems in computational proteomics.

All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 4.0 License.
PII: 9148

Publication Alerts

Research Papers:

iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC

Abstract