dc.contributor.author | García de Figuerola Paniagua, Luis Carlos | es_ES |
dc.contributor.author | Gómez Díaz, Raquel | es_ES |
dc.contributor.author | López de San Román, Eva | es_ES |
dc.date.accessioned | 2009-03-12 | es_ES |
dc.date.accessioned | 2009-10-15T10:25:37Z | |
dc.date.available | 2009-10-15T10:25:37Z | |
dc.date.issued | 2000 | es_ES |
dc.identifier.citation | Figuerola García, L.C., Gómez Díaz, R. y López de San Román, E. (2000). Stemming and n-grams in Spanish: an evaluation of their impact on information retrieval. "Journal of Information Science", 26 (6), 461-467. | es_ES |
dc.identifier.uri | http://hdl.handle.net/10366/56126 | |
dc.description | Analyzes the models and techniques used to count the frequencies of words that appear both in documents and in the queries submitted to Information Retrieval systems. Describes tests carried out on documents in Spanish, which involved some techniques used for English as well as the use of n-grams, and compares the results. | es_ES |
dc.description.abstract | At some stage, most of the models and techniques implemented in IR use frequency counts of the terms appearing in documents and in queries. However, many words, since they are derived from the same stem, have very close semantic contents. This makes a grouping of such variants under a single term advisable. Otherwise, dispersal occurs in the calculation of frequency of these terms, and it also becomes difficult to compare queries and documents. On the other hand, there are notable differences between different languages in the way of forming derivatives and inflected forms, so that the application of specific techniques can produce unequal results according to the language of the documents and queries. A description is given of the tests carried out for documents in Spanish, which involved some stemming techniques widely used in English, as well as the application of n-grams, and the results are compared. | es_ES |
dc.format.extent | 17 p. | es_ES |
dc.format.mimetype | application/pdf | es_ES |
dc.language | English | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | North-Holland | es_ES |
dc.relation.requires | Adobe Acrobat | es_ES |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Unported | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/3.0/ | |
dc.subject | Recuperación de la información | es_ES |
dc.subject | S-stemmer | es_ES |
dc.subject | N-gramas | es_ES |
dc.subject | Information retrieval | es_ES |
dc.subject | Stemming | es_ES |
dc.subject | N-grams | es_ES |
dc.title | Stemming and n-grams in Spanish: an evaluation of their impact on information retrieval | es_ES |
dc.type | info:eu-repo/semantics/article | es_ES |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es_ES |
This item appears in the following collection(s)
- REINA. Artículos [15]