REINA at CLEF 2006 Robust Task : local query expansion using term windows for robust retrieval
Expansión de la consulta
Recuperación de la información
Fecha de publicación
Zazo Rodríguez, A.F., García Figuerola, L.C. y Alonso Berrocal, J.L. (2006). REINA at CLEF 2006 Robust Task : local query expansion using term windows for robust retrieval. En Nardi, A., Peters, C. y Vicedo, J.L. (Eds.) "WORKING NOTES CLEF 2006 workshop, 20-22 September, Alicante, Spain"
This paper describes our work at CLEF 2006 Robust task. This task is an ad-hoc task that explores methods for stable retrieval by focusing on poorly performing topics. We have realized experiments for all subtask: monolingual (EN, ES, FR and IT), bilingual (IT!ES) and multilingual (ES![EN ES FR IT]) retrieval.For monolingual retrieval we have focused our work on local query expansion, i.e.using only the information from retrieved documents. External corpora, such as theWeb, were not used. Our document retrieval system is simple; it is based on vector space model. Some local expansion techniques were applied for training topics. The best improvement was achieved using association thesauri, which were constructed employing co-occurrence relations in term windows, not in complete document. Thistechnique is effective and can be easily implemented without tuning some parameters. Our mandatory runs (title+description topic fields) have obtained good positions in all monolingual subtasks we participate.For bilingual retrieval two machine translation programs were used to translate the topics from Italian into Spanish. Both translations were joined before searching. The same expansion technique was also applied. Our mandatory run has got the top rank in the bilingual subtask. For multilingual research we used the same procedure to obtain the retrieval list for each target language, and we combined them with the MAX-MIN data fusion method. In this subtask, our mandatory run has been in the lower part of the ranking of runs.
Se describe el trabajo presentado en el fórum CLEF 2006 basado en la recuperación estable de la información, experimentando con recuperación monolingüe, bilingüe y multilingüe