usenix conference policies
Improving Spam Detection Based on Structural Similarity
We propose a new spam detection algorithm that uses structural relationships between senders and recipients of email as the basis for spam detection. A unifying representation of users and receivers in the vectorial space of their contacts is constructed, that leads to a natural definition of similarity between them. This similarity is then used to group email senders and recipients into clusters. Historical information about the messages sent and received by the clusters is obtained by forwarding messages to an auxiliary spam detection algorithm and this information is used to reclassify messages. In the framework proposed, our algorithm aims at correcting misclassifications from an auxiliary algorithm. A simulation is performed based on actual data collected from an SMTP server from a large University. We show that our approach is able reduce false positives, produced by the auxiliary classification algorithm, up to about 60%.
author = {Luiz H. Gomes and Fernando D. O. Castro and Virg{\'\i}lio A. F. Almeida and Jussara M. Almeida and Rodrigo B. Almeida and Luis M. A. Bettencourt},
title = {Improving Spam Detection Based on Structural Similarity},
booktitle = {Steps to Reducing Unwanted Traffic on the Internet Workshop (SRUTI 05)},
year = {2005},
address = {Cambridge, MA},
url = {https://www.usenix.org/conference/sruti-05/improving-spam-detection-based-structural-similarity},
publisher = {USENIX Association},
month = jul
}
connect with us