Email spam filtering

dc.contributor.authorPuertas Sanz, Enrique
dc.contributor.authorGómez Hidalgo, José María
dc.contributor.authorCortizo Pérez, José Carlos
dc.date.accessioned2016-07-20T17:40:46Z
dc.date.available2016-07-20T17:40:46Z
dc.date.issued2008
dc.description.abstractIn recent years, email spam has become an increasingly important problem, with a big economic impact in society. In this work, we present the problem of spam, how it affects us, and how we can fight against it. We discuss legal, economic, and technical measures used to stop these unsolicited emails. Among all the technical measures, those based on content analysis have been particularly effective in filtering spam, so we focus on them, explaining how they work in detail. In summary, we explain the structure and the process of different Machine Learning methods used for this task, and how we can make them to be cost sensitive through several methods like threshold optimization, instance weighting, or MetaCost. We also discuss how to evaluate spam filters using basic metrics, TREC metrics, and the receiver operating characteristic convex hull method, that best suits classification problems in which target conditions are not known, as it is the case. We also describe how actual filters are used in practice. We also present different methods used by spammers to attack spam filters and what we can expect to find in the coming years in the battle of spam filters against spammers.spa
dc.description.filiationUEMspa
dc.description.impact0.267 JCR (2008) Q4, 42/45 Computer science, hardware & architecture, 82/86 Computer science, software engineering.spa
dc.description.sponsorshipSin financiaciónspa
dc.identifier.citationPuertas Sanz, E., Gómez Hidalgo, J. M., & Cortizo Pérez, J. C. (2008). Email spam filtering. Advances in computers, 74, 45-114.spa
dc.identifier.doi10.1016/S0065-2458(08)00603-7
dc.identifier.isbn9780123744265
dc.identifier.issn00652458
dc.identifier.urihttp://hdl.handle.net/11268/5429
dc.language.isoengspa
dc.peerreviewedSispa
dc.rights.accessRightsrestricted accessen
dc.subject.uemCorreo electrónico-Protecciónspa
dc.subject.unescoCorreo electrónicospa
dc.subject.unescoProtección de datosspa
dc.titleEmail spam filteringspa
dc.typejournal articlespa
dspace.entity.typePublication
relation.isAuthorOfPublication001b7f40-b837-4929-82ca-df26041a995a
relation.isAuthorOfPublication76a395e8-090d-4187-9a3c-420063e1f44f
relation.isAuthorOfPublicatione1ae5b27-3248-41df-ac24-a38ed621e0f9
relation.isAuthorOfPublication.latestForDiscovery001b7f40-b837-4929-82ca-df26041a995a

Files