| dc.contributor.author | Méndez, Jose R. | |
| dc.contributor.author | Iglesias, E. L. | |
| dc.contributor.author | Fernández Riverola, Florentino | |
| dc.contributor.author | Díaz Gómez, Fernando | |
| dc.contributor.author | Corchado Rodríguez, Juan Manuel | |
| dc.date.accessioned | 2017-09-06T09:16:07Z | |
| dc.date.available | 2017-09-06T09:16:07Z | |
| dc.date.issued | 2006 | |
| dc.identifier.citation | Current Topics in Artificial Intelligence. 11th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2005, Santiago de Compostela, Spain, November 16-18, 2005, Revised Selected Papers. Lecture Notes in Computer Science, Volume 4177, pp. 449-458. | |
| dc.identifier.isbn | 978-3-540-45914-9 (Print) / 978-3-540-45915-6 (Online) | |
| dc.identifier.issn | 0302-9743 (Print) / 1611-3349 (Online) | |
| dc.identifier.uri | http://hdl.handle.net/10366/135055 | |
| dc.description.abstract | Junk e-mail detection and filtering can be considered a cost-sensitive classification problem. However, the preprocessing methods and noise reduction strategies used to improve computational efficiency in text classification are not necessarily as effective in e-mail filtering. This is demonstrated here through a comparative study of stopword removal, stemming and different tokenising schemes. The final goal is to preprocess the training e-mail corpora of several content-based techniques for spam filtering (machine learning approaches and case-based systems). Sound conclusions are drawn from experiments in which different scenarios are taken into consideration. | |
| dc.format.mimetype | application/pdf | |
| dc.language.iso | en | |
| dc.publisher | Springer Science + Business Media | |
| dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Unported | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ | |
| dc.subject | Computer Science | |
| dc.title | Tokenising, Stemming and Stopword Removal on Anti-spam Filtering Domain | |
| dc.type | info:eu-repo/semantics/conferenceObject | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |