2024-03-28T12:42:43Z
https://gredos.usal.es/oai/request
oai:gredos.usal.es:10366/56155
2022-02-07T15:37:29Z
com_10366_4558
com_10366_4512
com_10366_3823
col_10366_4564
Figuerola, Carlos G.
Alonso Berrocal, José Luis
Zazo Rodríguez, Ángel Francisco
Rodríguez Vázquez de Aldana, Emilio
2006
Describes the participation of the REINA Research Group of the University of Salamanca in the WebCLEF 2006 forum. This year the group participates with work on the Spanish monolingual mixed subtask.
This paper describes the participation of the REINA Research Group of the University of Salamanca at WebCLEF 2006. The task in which we participated this year is the Monolingual Mixed Task in Spanish. To select the Spanish web pages of the EuroGov collection, the whole collection was processed with a language guesser, searching for pages in Spanish. All pages in the .es domain were also pre-selected. Our focus this year is to test pre-retrieval ways of mixing fields or elements of information in web pages, as well as to test the retrieval capacity of these fields. In retrieval systems based on the vector space model, terms from several sources can be mixed in a single index by operating on the term frequency in the document, provided a tf x idf weighting scheme is used. The BODY field is the most powerful from the point of view of retrieval, but the ANCHORS of backlinks add a considerable improvement. META fields, nevertheless, contribute little to the improvement in retrieval.
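The idea of mixing fields in a single index by operating on term frequency can be illustrated with a minimal sketch. It is not the authors' implementation: the field names follow the abstract, but the weight values, the whitespace tokenizer, the plain log(N/df) idf, and the function names `mixed_tf` and `tfidf_index` are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical field weights; the record does not state any values.
FIELD_WEIGHTS = {"body": 1.0, "anchors": 2.0, "meta": 0.5}


def mixed_tf(doc_fields, weights=FIELD_WEIGHTS):
    """Merge per-field term counts into one weighted term-frequency map."""
    tf = Counter()
    for field, text in doc_fields.items():
        w = weights.get(field, 0.0)  # unknown fields contribute nothing
        for term in text.lower().split():  # naive tokenizer (assumption)
            tf[term] += w
    return tf


def tfidf_index(docs, weights=FIELD_WEIGHTS):
    """Build a single tf x idf index over the mixed term frequencies."""
    tfs = {doc_id: mixed_tf(fields, weights) for doc_id, fields in docs.items()}
    # Document frequency: number of documents where the mixed tf is positive.
    df = Counter()
    for tf in tfs.values():
        df.update(t for t, v in tf.items() if v > 0)
    n = len(docs)
    return {
        doc_id: {t: v * math.log(n / df[t]) for t, v in tf.items() if v > 0}
        for doc_id, tf in tfs.items()
    }


# Toy documents, invented for the example.
docs = {
    "d1": {"body": "tax forms", "anchors": "tax"},
    "d2": {"body": "forms office"},
}
index = tfidf_index(docs)
```

Under these assumptions, a term found in both the BODY and an ANCHOR of the same page receives a mixed frequency equal to the sum of the field weights before the idf factor is applied, so field influence is controlled before retrieval rather than by merging result lists afterwards.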
application/pdf
http://hdl.handle.net/10366/56155
eng
REINA at WebCLEF 2006 : Mixing fields to improve retrieval
info:eu-repo/semantics/article
7 p.
TEXT
Gredos. Repositorio Documental de la Universidad de Salamanca
Hispana