Comparative study of feature selection method of microarray data for gene classification

Recent advances in biotechnology such as microarray, offer the ability to measure the levels of expression of thousands of genes in parallel. Analysis of microarray data can provide understanding and insight into gene function and regulatory mechanisms. This analysis is crucial to identify and class...

Description complète

Détails bibliographiques
Auteur principal: Ghazali, Nurulhuda
Format: Thèse
Langue:anglais
Publié: 2009
Sujets:
Accès en ligne:http://eprints.utm.my/11502/6/NurulhudaGhazaliMFSKSM2009.pdf
_version_ 1846215398255493120
author Ghazali, Nurulhuda
author_facet Ghazali, Nurulhuda
author_sort Ghazali, Nurulhuda
description Recent advances in biotechnology such as microarray, offer the ability to measure the levels of expression of thousands of genes in parallel. Analysis of microarray data can provide understanding and insight into gene function and regulatory mechanisms. This analysis is crucial to identify and classify cancer diseases. Recent technology in cancer classification is based on gene expression profile rather than on morphological appearance of the tumor. However, this task is made more difficult due to the noisy nature of microarray data and the overwhelming number of genes. Thus, it is an important issue to select a small subset of genes to represent thousands of genes in microarray data which is referred as informative genes. These informative genes will then be classified according to its appropriate classes. To achieve the best solution to the classification issue, we proposed an approach of minimum Redundancy-Maximum Relevance feature selection method together with Probabilistic Neural Network classifier. The minimum Redundancy- Maximum Relevance feature selection method is used to select the informative genes while the Probabilistic Neural Network classifier acts as the classifier. This approach has been tested on a well-known cancer dataset which is Leukemia. The results achieved shows that the gene selected had given high classification accuracy. This reduction of genes helps take out some burdens from biologist and better classification accuracy can be used widely to detect cancer in early stage.
format Thesis
id uthm-11502
institution Universiti Teknologi Malaysia
language English
publishDate 2009
record_format eprints
spelling uthm-115022017-09-20T10:00:12Z http://eprints.utm.my/11502/ Comparative study of feature selection method of microarray data for gene classification Ghazali, Nurulhuda QA75 Electronic computers. Computer science RC0254 Neoplasms. Tumors. Oncology (including Cancer) Recent advances in biotechnology such as microarray, offer the ability to measure the levels of expression of thousands of genes in parallel. Analysis of microarray data can provide understanding and insight into gene function and regulatory mechanisms. This analysis is crucial to identify and classify cancer diseases. Recent technology in cancer classification is based on gene expression profile rather than on morphological appearance of the tumor. However, this task is made more difficult due to the noisy nature of microarray data and the overwhelming number of genes. Thus, it is an important issue to select a small subset of genes to represent thousands of genes in microarray data which is referred as informative genes. These informative genes will then be classified according to its appropriate classes. To achieve the best solution to the classification issue, we proposed an approach of minimum Redundancy-Maximum Relevance feature selection method together with Probabilistic Neural Network classifier. The minimum Redundancy- Maximum Relevance feature selection method is used to select the informative genes while the Probabilistic Neural Network classifier acts as the classifier. This approach has been tested on a well-known cancer dataset which is Leukemia. The results achieved shows that the gene selected had given high classification accuracy. This reduction of genes helps take out some burdens from biologist and better classification accuracy can be used widely to detect cancer in early stage. 2009-10 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/11502/6/NurulhudaGhazaliMFSKSM2009.pdf Ghazali, Nurulhuda (2009) Comparative study of feature selection method of microarray data for gene classification. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems.
spellingShingle QA75 Electronic computers. Computer science
RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Ghazali, Nurulhuda
Comparative study of feature selection method of microarray data for gene classification
title Comparative study of feature selection method of microarray data for gene classification
title_full Comparative study of feature selection method of microarray data for gene classification
title_fullStr Comparative study of feature selection method of microarray data for gene classification
title_full_unstemmed Comparative study of feature selection method of microarray data for gene classification
title_short Comparative study of feature selection method of microarray data for gene classification
title_sort comparative study of feature selection method of microarray data for gene classification
topic QA75 Electronic computers. Computer science
RC0254 Neoplasms. Tumors. Oncology (including Cancer)
url http://eprints.utm.my/11502/6/NurulhudaGhazaliMFSKSM2009.pdf
url-record http://eprints.utm.my/11502/
work_keys_str_mv AT ghazalinurulhuda comparativestudyoffeatureselectionmethodofmicroarraydataforgeneclassification