Compartir
Título
Tracking Concept Drift at Feature Selection Stage in SpamHunting: An Anti-spam Instance-Based Reasoning System
Autor(es)
Palabras clave
Computer Science
Fecha de publicación
2006/09
Editor
Springer Science + Business Media
Citación
Advances in Case-Based Reasoning Lecture Notes in Computer Science. Lecture Notes in Computer Science. Volumen 4106, pp. 504-518.
Resumen
In this paper we propose a novel feature selection method able to handle concept drift problems in spam filtering domain. The proposed technique is applied to a previous successful instance-based reasoning e-mail filtering system called SpamHunting. Our achieved information criterion is based on several ideas extracted from the well-known information measure introduced by Shannon. We show how results obtained by our previous system in combination with the improved feature selection method outperforms classical machine learning techniques and other well-known lazy learning approaches. In order to evaluate the performance of all the analysed models, we employ two different corpus and six well-known metrics in various scenarios.
URI
ISBN
978-3-540-36843-4 (Print) / 978-3-540-36846-5 (Online)
ISSN
0302-9743 (Print) / 1611-3349 (Online)
Collections
- BISITE. Congresos [288]