Detrecting malicious PDF document using support vector machine supervised learning algorithm

Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software inst...

पूर्ण विवरण

ग्रंथसूची विवरण
मुख्य लेखक: Dabiranzohouri, Miranda
स्वरूप: थीसिस
प्रकाशित: 2014
विषय:
_version_ 1846216547035512832
author Dabiranzohouri, Miranda
author_facet Dabiranzohouri, Miranda
author_sort Dabiranzohouri, Miranda
description Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software installed on their computers. The expressiveness of the PDF format, furthermore, enables attackers to evade detection with little effort. Apart from traditional antivirus products, which are always a step behind attackers, few methods are known that can be deployed for protection of end-user systems. This thesis proposes a machine learning based method for detecting of malicious PDF documents which, instead of analyzing JavaScript or any other content, makes use of essential differences in the structural properties of malicious and benign PDF files. Support Vector Machine is used in order to testify and recognize the benign and malicious PDF file. The collected dataset consists of 2190 instance which 404 of them are malicious and 1786 instance are benign. The experimental results shows that SVM gives better result in limited number of feature compared to MLP and BayesNet method.
format Thesis
id uthm-41612
institution Universiti Teknologi Malaysia
publishDate 2014
record_format eprints
spelling uthm-416122017-08-17T01:51:40Z http://eprints.utm.my/41612/ Detrecting malicious PDF document using support vector machine supervised learning algorithm Dabiranzohouri, Miranda Q Science Malicious PDF files remain a real threat, in cyber world. In practice, it can affect badly masses of computer users, even after several high-profile security incidents. In spite of a series of a security patches issued by Adobe and other vendors, many users still have vulnerable client software installed on their computers. The expressiveness of the PDF format, furthermore, enables attackers to evade detection with little effort. Apart from traditional antivirus products, which are always a step behind attackers, few methods are known that can be deployed for protection of end-user systems. This thesis proposes a machine learning based method for detecting of malicious PDF documents which, instead of analyzing JavaScript or any other content, makes use of essential differences in the structural properties of malicious and benign PDF files. Support Vector Machine is used in order to testify and recognize the benign and malicious PDF file. The collected dataset consists of 2190 instance which 404 of them are malicious and 1786 instance are benign. The experimental results shows that SVM gives better result in limited number of feature compared to MLP and BayesNet method. 2014 Thesis NonPeerReviewed Dabiranzohouri, Miranda (2014) Detrecting malicious PDF document using support vector machine supervised learning algorithm. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computing.
spellingShingle Q Science
Dabiranzohouri, Miranda
Detrecting malicious PDF document using support vector machine supervised learning algorithm
title Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_full Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_fullStr Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_full_unstemmed Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_short Detrecting malicious PDF document using support vector machine supervised learning algorithm
title_sort detrecting malicious pdf document using support vector machine supervised learning algorithm
topic Q Science
url-record http://eprints.utm.my/41612/
work_keys_str_mv AT dabiranzohourimiranda detrectingmaliciouspdfdocumentusingsupportvectormachinesupervisedlearningalgorithm