Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine

The field of computational structural biology these days has become advanced especially in the continued development of new high-throughput methods for predicting enzyme sub-functional classes. Prior knowledge of enzyme subfunctional classes has been applied in numerous important predictive tasks th...

Full description

Saved in:
Bibliographic Details
Main Author: Guramad Singh, Sharon Kaur
Format: Thesis
Language:English
Published: 2013
Subjects:
Online Access:http://eprints.utm.my/id/eprint/40668/5/SharonKaurGuramadSinghMFSKSM201.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.40668
record_format uketd_dc
spelling my-utm-ep.406682017-06-21T01:46:00Z Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine 2013-11 Guramad Singh, Sharon Kaur QA75 Electronic computers. Computer science The field of computational structural biology these days has become advanced especially in the continued development of new high-throughput methods for predicting enzyme sub-functional classes. Prior knowledge of enzyme subfunctional classes has been applied in numerous important predictive tasks that address structural and functional features of enzymes. However, issues on insufficient sequence-structure knowledge, lack of known enzyme sub-functional class, low-identity sequences have caused inaccurate feature representation and imbalance distribution of enzyme sub-functional class which has contributed to low prediction results. Thus, the research proposed a derivative features vector through the consolidation of amino acid composition; dipeptide composition; hydrophobicity and hydrophilicity known as APH which is based on multi-biological knowledge. The Support Vector Machine assigns and classifies every protein sequence into its respective vector. This process would enhance the sequence-structure knowledge and overcome inaccurate feature representation. Besides that, the Twin Support Vector Machine classifies the enzyme sub-functional class and solves the imbalance distribution of enzyme sub-functional class. In this study, bio-inspired kernel function was introduced to improve the overall enzyme sub-functional class prediction. The overall results were evaluated based on accuracy, sensitivity, specificity and Matthew’s Correlation Coefficient value. Statistical and biological validation using t-test and Gene Ontology showed that the experimental results achieved an accuracy of more than 98%. Findings from the research have shown that the proposed method could assist in the prediction of the enzyme biological function, protein structure and function, protein structural class and hence provide guidance in the designing of novel drugs to cure disease 2013-11 Thesis http://eprints.utm.my/id/eprint/40668/ http://eprints.utm.my/id/eprint/40668/5/SharonKaurGuramadSinghMFSKSM201.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computing Faculty of Computing
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA75 Electronic computers
Computer science
spellingShingle QA75 Electronic computers
Computer science
Guramad Singh, Sharon Kaur
Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
description The field of computational structural biology these days has become advanced especially in the continued development of new high-throughput methods for predicting enzyme sub-functional classes. Prior knowledge of enzyme subfunctional classes has been applied in numerous important predictive tasks that address structural and functional features of enzymes. However, issues on insufficient sequence-structure knowledge, lack of known enzyme sub-functional class, low-identity sequences have caused inaccurate feature representation and imbalance distribution of enzyme sub-functional class which has contributed to low prediction results. Thus, the research proposed a derivative features vector through the consolidation of amino acid composition; dipeptide composition; hydrophobicity and hydrophilicity known as APH which is based on multi-biological knowledge. The Support Vector Machine assigns and classifies every protein sequence into its respective vector. This process would enhance the sequence-structure knowledge and overcome inaccurate feature representation. Besides that, the Twin Support Vector Machine classifies the enzyme sub-functional class and solves the imbalance distribution of enzyme sub-functional class. In this study, bio-inspired kernel function was introduced to improve the overall enzyme sub-functional class prediction. The overall results were evaluated based on accuracy, sensitivity, specificity and Matthew’s Correlation Coefficient value. Statistical and biological validation using t-test and Gene Ontology showed that the experimental results achieved an accuracy of more than 98%. Findings from the research have shown that the proposed method could assist in the prediction of the enzyme biological function, protein structure and function, protein structural class and hence provide guidance in the designing of novel drugs to cure disease
format Thesis
qualification_level Master's degree
author Guramad Singh, Sharon Kaur
author_facet Guramad Singh, Sharon Kaur
author_sort Guramad Singh, Sharon Kaur
title Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
title_short Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
title_full Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
title_fullStr Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
title_full_unstemmed Enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
title_sort enzyme sub-functional class prediction using multibiological knowledge feature respresentation and twin support vector machine
granting_institution Universiti Teknologi Malaysia, Faculty of Computing
granting_department Faculty of Computing
publishDate 2013
url http://eprints.utm.my/id/eprint/40668/5/SharonKaurGuramadSinghMFSKSM201.pdf
_version_ 1747816570155958272