A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model

The accurate prediction and characterization of small open reading frames (smORFs) are critical for understanding their functional roles in gene regulation and cellular processes. This study presents the development and evaluation of a novel hybrid machine learning algorithm that integrates the stre...

Full description

Bibliographic Details
Main Author: Ouwabunmi, Babalola AbdulHafeez
Format: Thesis
Language:English
Published: 2025
Subjects:
Online Access:http://eprints.usm.my/62965/
Abstract Abstract here
_version_ 1854969821212442624
author Ouwabunmi, Babalola AbdulHafeez
author_facet Ouwabunmi, Babalola AbdulHafeez
author_sort Ouwabunmi, Babalola AbdulHafeez
description The accurate prediction and characterization of small open reading frames (smORFs) are critical for understanding their functional roles in gene regulation and cellular processes. This study presents the development and evaluation of a novel hybrid machine learning algorithm that integrates the strengths of Random Forest and Gradient Boosting models to enhance the prediction of smORFs. The performance of the hybrid algorithm was rigorously assessed and compared to the standalone models using comprehensive evaluation metrics, including accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC). Results demonstrated that the hybrid model achieved superior performance, with an accuracy of 0.998, a sensitivity of 0.998, and a specificity of 1.00, significantly outperforming both the Random Forest and Gradient Boosting models individually. Additionally, transcriptomic data from Mycobacterium tuberculosis were utilized to validate the predictions, highlighting the biological relevance and potential applications of the proposed approach in computational biology. This study underscores the importance of combining machine learning techniques to improve prediction accuracy and provides a robust framework for advancing smORF discovery. While the focus was on comparing standalone and hybrid models, the study identifies opportunities for future benchmarking against external tools to further validate its contributions. The findings contribute to both computational and biological research, offering insights into smORF prediction methodologies and their applications.
first_indexed 2026-01-21T21:14:40Z
format Thesis
id usm-62965
institution Universiti Sains Malaysia
language English
last_indexed 2026-01-21T21:14:40Z
publishDate 2025
record_format eprints
record_pdf Abstract
spelling usm-629652026-01-19T04:13:48Z http://eprints.usm.my/62965/ A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model Ouwabunmi, Babalola AbdulHafeez R Medicine RA Public aspects of medicine The accurate prediction and characterization of small open reading frames (smORFs) are critical for understanding their functional roles in gene regulation and cellular processes. This study presents the development and evaluation of a novel hybrid machine learning algorithm that integrates the strengths of Random Forest and Gradient Boosting models to enhance the prediction of smORFs. The performance of the hybrid algorithm was rigorously assessed and compared to the standalone models using comprehensive evaluation metrics, including accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC). Results demonstrated that the hybrid model achieved superior performance, with an accuracy of 0.998, a sensitivity of 0.998, and a specificity of 1.00, significantly outperforming both the Random Forest and Gradient Boosting models individually. Additionally, transcriptomic data from Mycobacterium tuberculosis were utilized to validate the predictions, highlighting the biological relevance and potential applications of the proposed approach in computational biology. This study underscores the importance of combining machine learning techniques to improve prediction accuracy and provides a robust framework for advancing smORF discovery. While the focus was on comparing standalone and hybrid models, the study identifies opportunities for future benchmarking against external tools to further validate its contributions. The findings contribute to both computational and biological research, offering insights into smORF prediction methodologies and their applications. 2025-08 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/62965/1/BABALOLA%20ABDULHAFEEZ%20OLUWABUNMI-THESIS-E.pdf Ouwabunmi, Babalola AbdulHafeez (2025) A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model. Masters thesis, Universiti Sains Malaysia.
spellingShingle R Medicine
RA Public aspects of medicine
Ouwabunmi, Babalola AbdulHafeez
A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title_full A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title_fullStr A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title_full_unstemmed A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title_short A deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
title_sort deep reinforcement learning hybrid algorithm for the computational discovery and characterization of small proteins utilizing mycobacterium tuberculosis as a model
topic R Medicine
RA Public aspects of medicine
url http://eprints.usm.my/62965/
work_keys_str_mv AT ouwabunmibabalolaabdulhafeez adeepreinforcementlearninghybridalgorithmforthecomputationaldiscoveryandcharacterizationofsmallproteinsutilizingmycobacteriumtuberculosisasamodel
AT ouwabunmibabalolaabdulhafeez deepreinforcementlearninghybridalgorithmforthecomputationaldiscoveryandcharacterizationofsmallproteinsutilizingmycobacteriumtuberculosisasamodel