Preventing Spam Blogs Using Content Analysis and User Behaviour Model

Spam blog is a subset of blog which contains nothing more than stolen materials and inauthentic text designed to gain profit from various type of advertisements. Splogs have become a nuisance in the blogosphere because it pollutes search engine results and blog update servers. This paper discusses t...

全面介绍

书目详细资料
主要作者: Mohammad Hafiz, Ismail
格式: Thesis
语言:英语
英语
出版: 2007
主题:
在线阅读:https://etd.uum.edu.my/21/1/mohammad_hafiz.pdf
https://etd.uum.edu.my/21/2/mohammad_hafiz.pdf
实物特征
总结:Spam blog is a subset of blog which contains nothing more than stolen materials and inauthentic text designed to gain profit from various type of advertisements. Splogs have become a nuisance in the blogosphere because it pollutes search engine results and blog update servers. This paper discusses the similarity between spam blogs and email spams and the techniques used to identify them. The paper also propose the development of a prototype blog update server that implements content analysis and user behaviour model to filter splogs before they are indexed into blog search engine.