Evaluating cost-sensitive unsolicited bulk email categorization

dc.contributor.authorGómez Hidalgo, José María
dc.date.accessioned2016-09-13T15:22:38Z
dc.date.available2016-09-13T15:22:38Z
dc.date.issued2002
dc.description.abstractIn the recent years, Unsolicited Bulk Email has became an increasingly important problem, with a big economic impact. In this paper, we discuss cost-sensitive Text Categorization methods for UBE filtering. In concrete, we have task (C4.5, Naive Bayes, PART. Support Vector Machines and Rocchio), made cost sensitive through several methods (Threshold Optimization, Instance Weighting, and Meta-Cost). We have used the Receiver Operating Characteristic Convex Hull method for the evaluation, that best suits classification problems in which target conditions are not known, as it is the case. Our results do not show a dominant algorithm nor method for making algorithms cost-sensitive, but are the best reported on the test collection used, and approach real-world hand-crafted classifiers accuracy.spa
dc.description.filiationUEMspa
dc.description.impact0.213 SJR (2002) Q3, 231/333 Softwarespa
dc.description.sponsorshipSin financiaciónspa
dc.identifier.citationGómez Hidalgo, J. M. (2002). Evaluating cost-sensitive unsolicited bulk email categorization. In Proceedings of the 2002 ACM symposium on Applied computing, March 11-14 (pp. 615-620). Madrid: ACM.spa
dc.identifier.doi10.1145/508791.508911
dc.identifier.isbn1581134452
dc.identifier.urihttp://hdl.handle.net/11268/5750
dc.language.isoengspa
dc.peerreviewedSispa
dc.publisherACMspa
dc.relation.publisherversionhttps://doi.org/10.1145/508791.508911spa
dc.rights.accessRightsopen accessen
dc.subject.uemInteligencia artificialspa
dc.subject.uemSistemas informáticosspa
dc.subject.unescoInformática y desarrollospa
dc.subject.unescoInteligencia artificialspa
dc.titleEvaluating cost-sensitive unsolicited bulk email categorizationspa
dc.typeconference outputspa
dspace.entity.typePublication
relation.isAuthorOfPublication76a395e8-090d-4187-9a3c-420063e1f44f
relation.isAuthorOfPublication.latestForDiscovery76a395e8-090d-4187-9a3c-420063e1f44f

Files