Neural network for prediction of cysteine disulphide bridge connectivity in proteins

The goal of this thesis is to develop a computational method based on machine learning techniques for predicting disulfide-bonding states of Cysteine residues in proteins, which is a sub-problem of the bigger and yet unsolved problem of protein structure prediction. First, we preprocessed the datase...

Description complète

Détails bibliographiques
Auteur principal: Bostan, Hamed
Format: Thèse
Langue:anglais
Publié: 2010
Sujets:
Accès en ligne:http://eprints.utm.my/18275/1/HamedBostanMFSKSM2010.pdf
_version_ 1846215675334361088
author Bostan, Hamed
author_facet Bostan, Hamed
author_sort Bostan, Hamed
description The goal of this thesis is to develop a computational method based on machine learning techniques for predicting disulfide-bonding states of Cysteine residues in proteins, which is a sub-problem of the bigger and yet unsolved problem of protein structure prediction. First, we preprocessed the datasets from Protein Data Bank (PDB) and filtered mutations and low resolution files out. A number of descriptors in two dimensional (2D) protein sequences are studied. These descriptors are based on local feature values of adjacent amino acid to Cystein residue, namely encoded, propensity value and averaged propensity value. We have used Artificial Neural Network (ANN) as a machine learning technique to develop our prediction method. We use ‘trainlm’, ‘trainrp’ and ‘trainscg’ training functions for training out network and also a 5-fold validation is implemented. Our results show that we can predict the state of Cystein disulphide bond formation. It shows that using propensity valued descriptor and ‘trainscg’ training function is better to be used for Cystein bond state prediction compared to the other training functions and descriptors in this study. The accuracy of prediction in this study is 80.85% on a propensity value descriptor dataset which had been trained by ‘trainscg’ with a dataset of over than 400 thousand protein patterns. Results of this work will have direct implications in site directed mutational studies of protein, protein engineering and the problem of protein folding.
format Thesis
id uthm-18275
institution Universiti Teknologi Malaysia
language English
publishDate 2010
record_format eprints
spelling uthm-182752017-09-18T04:26:19Z http://eprints.utm.my/18275/ Neural network for prediction of cysteine disulphide bridge connectivity in proteins Bostan, Hamed QA75 Electronic computers. Computer science The goal of this thesis is to develop a computational method based on machine learning techniques for predicting disulfide-bonding states of Cysteine residues in proteins, which is a sub-problem of the bigger and yet unsolved problem of protein structure prediction. First, we preprocessed the datasets from Protein Data Bank (PDB) and filtered mutations and low resolution files out. A number of descriptors in two dimensional (2D) protein sequences are studied. These descriptors are based on local feature values of adjacent amino acid to Cystein residue, namely encoded, propensity value and averaged propensity value. We have used Artificial Neural Network (ANN) as a machine learning technique to develop our prediction method. We use ‘trainlm’, ‘trainrp’ and ‘trainscg’ training functions for training out network and also a 5-fold validation is implemented. Our results show that we can predict the state of Cystein disulphide bond formation. It shows that using propensity valued descriptor and ‘trainscg’ training function is better to be used for Cystein bond state prediction compared to the other training functions and descriptors in this study. The accuracy of prediction in this study is 80.85% on a propensity value descriptor dataset which had been trained by ‘trainscg’ with a dataset of over than 400 thousand protein patterns. Results of this work will have direct implications in site directed mutational studies of protein, protein engineering and the problem of protein folding. 2010 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/18275/1/HamedBostanMFSKSM2010.pdf Bostan, Hamed (2010) Neural network for prediction of cysteine disulphide bridge connectivity in proteins. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems.
spellingShingle QA75 Electronic computers. Computer science
Bostan, Hamed
Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title_full Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title_fullStr Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title_full_unstemmed Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title_short Neural network for prediction of cysteine disulphide bridge connectivity in proteins
title_sort neural network for prediction of cysteine disulphide bridge connectivity in proteins
topic QA75 Electronic computers. Computer science
url http://eprints.utm.my/18275/1/HamedBostanMFSKSM2010.pdf
url-record http://eprints.utm.my/18275/
work_keys_str_mv AT bostanhamed neuralnetworkforpredictionofcysteinedisulphidebridgeconnectivityinproteins