Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

Time efficient spam e-mail filtering for Turkish

Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Güngör, Tunga.
dc.contributor.author Çıltık, Ali.
dc.date.accessioned 2023-03-16T10:06:06Z
dc.date.available 2023-03-16T10:06:06Z
dc.date.issued 2006.
dc.identifier.other CMPE 2006 C55
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12492
dc.description.abstract In the present thesis, we propose spam e-mail filtering methods having high accuracies and low time complexities. The methods are based on the n-gram approach and a heuristics which is referred to as the first n-words heuristics. Though the main concern of the research is studying the applicability of these methods on Turkish e-mails, they were also applied to English e-mails. A data set for both languages was compiled. Tests were performed with different parameters. Success rates above 95% for Turkish e-mails and around 98% for English e-mails were obtained. In addition, it has been shown that the time complexities can be reduced significantly without sacrificing from success. We also propose a combined perception refinement (CPR) which improves baseline success rates around 2%, where development set is used in the first step of the CPR to find out the parameters used in the second step. Free word order is another characteristic of Turkish language; we will make an attempt to implement free word order aspect of Turkish.
dc.format.extent 30cm.
dc.publisher Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2006.
dc.relation Includes appendices.
dc.relation Includes appendices.
dc.subject.lcsh Spam filtering (Electronic mail)
dc.title Time efficient spam e-mail filtering for Turkish
dc.format.pages x, 48 leaves;


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım