Assessing Classification Accuracy in the Revision Stage of a CBR Spam Filtering System
Fecha de publicación
Springer Science + Business Media
Case-Based Reasoning Research and Development Lecture Notes in Computer Science. Lecture Notes in Computer Science. Volumen 4626, pp. 374-388.
In this paper we introduce a quality metric for characterizing the solutions generated by a successful CBR spam filtering system called SpamHunting. The proposal is denoted as relevant information amount rate and it is based on combining estimations about relevance and amount of information recovered during the retrieve stage of a CBR system. The results obtained from experimentation show how this measure can successfully be used as a suitable complement for the classifications computed by our SpamHunting system. In order to evaluate the performance of the quality estimation index, we have designed a formal benchmark procedure that can be used to evaluate any accuracy metric. Finally, following the designed test procedure, we show the behaviour of the proposed measure using two well-known publicly available corpus.
978-3-540-74138-1 (Print) / 978-3-540-74141-1 (Online)
0302-9743 (Print) / 1611-3349 (Online)
- BISITE. Congresos 
Files in this item