A stylometry approach for blind linguistic steganalysis model against translation-based steganography

Steganography is the art of hiding information in ways that prevent the detection of a secret message. In Translation-based Steganography (TBS), the secret messages are encoded in the “noise” made via translation of natural language text programmed. The adversarial technique to extract the secret me...

Full description

Bibliographic Details
Main Author: Mohd Lokman, Syiham
Format: Thesis
Language:English
English
English
Published: 2023
Subjects:
Online Access:http://eprints.uthm.edu.my/10995/
Abstract Abstract here
_version_ 1855520935802568704
author Mohd Lokman, Syiham
author_facet Mohd Lokman, Syiham
author_sort Mohd Lokman, Syiham
description Steganography is the art of hiding information in ways that prevent the detection of a secret message. In Translation-based Steganography (TBS), the secret messages are encoded in the “noise” made via translation of natural language text programmed. The adversarial technique to extract the secret message is called steganalysis, which can be categorized into two types; targeted vs. blind. While targeted steganalysis is designed to attack a specific embedding algorithm, blind steganalysis use features extracted or selection from the medium to detect any anomalies that indicate a possibility that a secret data has been embedded within the medium. However, accuracy of blind steganalysis algorithms highly depend on the features selected from the input data especially when attacking embedding techniques in TBS. This thesis explore the potential of using stylometry or linguistic style to improve the representation of characteristics among the word distribution in distinguishing the stego text from the cover text for TBS. This is because all translated in TBS text have an intrinsic structural styles that can be used to improve the performance of a blind steganalysis model. The proposed stylometry-based blind steganalysis model consists of two stages, which are stylometric feature selection and classification. The proposed stylometric features selected from a set of cover text are categorized into two group features; lexical and syntactic features before implemented into the model Support Vector Machine (SVM) as the classifier. The performance of the stylometry-based blind steganalysis model is then evaluated based on all false rate, missing rate and accuracy rate and compared against three other standard classifiers in steganalysis; Naive Bayes (NB), k-Nearest Neighbor (k-NN), and Decision Tree (J48). The results showed that the stylometric features are impactful to a blind steganalysis model by giving higher detection performance. Meanwhile, SVM is the best classifier for stego text detection with significantly low processing time performance
format Thesis
id uthm-10995
institution Universiti Tun Hussein Onn Malaysia
language English
English
English
publishDate 2023
record_format EPrints
record_pdf Restricted
spelling uthm-109952024-05-20T01:36:54Z http://eprints.uthm.edu.my/10995/ A stylometry approach for blind linguistic steganalysis model against translation-based steganography Mohd Lokman, Syiham T Technology (General) Steganography is the art of hiding information in ways that prevent the detection of a secret message. In Translation-based Steganography (TBS), the secret messages are encoded in the “noise” made via translation of natural language text programmed. The adversarial technique to extract the secret message is called steganalysis, which can be categorized into two types; targeted vs. blind. While targeted steganalysis is designed to attack a specific embedding algorithm, blind steganalysis use features extracted or selection from the medium to detect any anomalies that indicate a possibility that a secret data has been embedded within the medium. However, accuracy of blind steganalysis algorithms highly depend on the features selected from the input data especially when attacking embedding techniques in TBS. This thesis explore the potential of using stylometry or linguistic style to improve the representation of characteristics among the word distribution in distinguishing the stego text from the cover text for TBS. This is because all translated in TBS text have an intrinsic structural styles that can be used to improve the performance of a blind steganalysis model. The proposed stylometry-based blind steganalysis model consists of two stages, which are stylometric feature selection and classification. The proposed stylometric features selected from a set of cover text are categorized into two group features; lexical and syntactic features before implemented into the model Support Vector Machine (SVM) as the classifier. The performance of the stylometry-based blind steganalysis model is then evaluated based on all false rate, missing rate and accuracy rate and compared against three other standard classifiers in steganalysis; Naive Bayes (NB), k-Nearest Neighbor (k-NN), and Decision Tree (J48). The results showed that the stylometric features are impactful to a blind steganalysis model by giving higher detection performance. Meanwhile, SVM is the best classifier for stego text detection with significantly low processing time performance 2023-02 Thesis NonPeerReviewed text en http://eprints.uthm.edu.my/10995/1/24p%20SYIHAM%20MOHD%20LOKMAN.pdf text en http://eprints.uthm.edu.my/10995/2/SYIHAM%20MOHD%20LOKMAN%20COPYRIGHT%20DECLARATION.pdf text en http://eprints.uthm.edu.my/10995/3/SYIHAM%20MOHD%20LOKMAN%20WATERMARK.pdf Mohd Lokman, Syiham (2023) A stylometry approach for blind linguistic steganalysis model against translation-based steganography. Masters thesis, Universiti Tun Hussein Onn Malaysia.
spellingShingle T Technology (General)
Mohd Lokman, Syiham
A stylometry approach for blind linguistic steganalysis model against translation-based steganography
thesis_level Master
title A stylometry approach for blind linguistic steganalysis model against translation-based steganography
title_full A stylometry approach for blind linguistic steganalysis model against translation-based steganography
title_fullStr A stylometry approach for blind linguistic steganalysis model against translation-based steganography
title_full_unstemmed A stylometry approach for blind linguistic steganalysis model against translation-based steganography
title_short A stylometry approach for blind linguistic steganalysis model against translation-based steganography
title_sort stylometry approach for blind linguistic steganalysis model against translation based steganography
topic T Technology (General)
url http://eprints.uthm.edu.my/10995/
work_keys_str_mv AT mohdlokmansyiham astylometryapproachforblindlinguisticsteganalysismodelagainsttranslationbasedsteganography
AT mohdlokmansyiham stylometryapproachforblindlinguisticsteganalysismodelagainsttranslationbasedsteganography