Nair, Smitha Sunil Kumaran and Subbareddy , NV and Hareesha , K S and Balaji, S (2013) A Diverse Assimilation of Sequence and Structure Dependent Features for Amyloid Plaque Prediction Using Random Forests Amyloid Plaque Prediction Using Random Forests. Current Proteomics, 10 (1). pp. 38-44. ISSN 1875-6247
![]() |
PDF
Nair_CSE_MIT.pdf - Published Version Restricted to Registered users only Download (231kB) | Request a copy |
Abstract
The failure of proteins to fold correctly result in amyloidosis. Therefore, amyloid plaque prediction has become significant to narrow down the exploration of anti- amyloidosis and related drugs. In this research article, we propose a unique hybrid approach to computationally predict the formation of amyloid plaques by exploiting diversity in the feature vector extracted from protein sequences and structures. The diversity in the sequence of feature space is exploited using structure dependent features besides the physico-chemical information from amino acid chemistry and frequency spectrum based parameters. We explored the prediction capability with independent and integrated feature vectors by an ensemble machine learning classifier, Random Forests. Computational analysis evidence that the assimilation of diverse feature set outperform individual feature array with a balanced prediction accuracy of 0.830 and Receiver Characteristic Curve area of 0.918 on stratified10-fold cross-validation test.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Amyloid plaque, fibrillogenesis, frequency spectrum parameters, physico-chemical properties, random forests, structure dependent features |
Subjects: | Engineering > MIT Manipal > Computer Science and Engineering |
Depositing User: | MIT Library |
Date Deposited: | 11 Jul 2013 11:27 |
Last Modified: | 11 Jul 2013 11:27 |
URI: | http://eprints.manipal.edu/id/eprint/136506 |
Actions (login required)
![]() |
View Item |