NcPred for Accurate Nuclear Protein Prediction Using n-mer Statistics with Various Classification Algorithms
File version
Author(s)
Kabir, Alaol
Sakib, Kazi
Hossain, Md Alamgir
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Rocha, MP
Rodriguez, JMC
FdezRiverola, F
Valencia, A
Date
Size
File type(s)
Location
Salamanca, SPAIN
License
Abstract
Prediction of nuclear proteins is one of the major challenges in genome annotation. A method, NcPred is described, for predicting nuclear proteins with higher accuracy exploiting n − mer statistics with different classification algorithms namely Alternating Decision (AD) Tree, Best First (BF) Tree, Random Tree and Adaptive (Ada) Boost. On BaCello dataset [1], NcPred improves about 20% accuracy with Random Tree and about 10% sensitivity with Ada Boost for Animal proteins compared to existing techniques. It also increases the accuracy of Fungal protein prediction by 20% and recall by 4% with AD Tree. In case of Human protein, the accuracy is improved by about 25% and sensitivity about 10% with BF Tree. Performance analysis of NcPred clearly demonstrates its suitability over the contemporary in-silico nuclear protein classification research.
Journal Title
Conference Title
5TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS (PACBB 2011)
Book Title
Edition
Volume
93
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© 2011 Springer-Verlag Berlin Heidelberg. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. The original publication is available at www.springerlink.com
Item Access Status
Note
Access the data
Related item(s)
Subject
Bioinformatics and computational biology not elsewhere classified
Data mining and knowledge discovery
Artificial intelligence not elsewhere classified