Organización Automática de Documentos. Técnicas K-Means

Pradales Gallego, Sergio

Título

dc.contributor.advisor	García de Figuerola Paniagua, Luis Carlos
dc.contributor.author	Pradales Gallego, Sergio
dc.date.accessioned	2018-07-26T08:43:17Z
dc.date.available	2018-07-26T08:43:17Z
dc.date.issued	2017-07-20
dc.identifier.citation	Pradales Gallego, S. (2017, julio 20). Organización Automática de Documentos. Técnicas K-Means. Trabajo de Fin de Máster en Sistemas de Información Digital, 2016-2017. Recuperado a partir de https://gredos.usal.es/jspui/handle/10366/138123
dc.identifier.uri	http://hdl.handle.net/10366/138123
dc.description	Trabajo de Fin de Máster en Sistemas de Información Digital, 2016-2017.	es_ES
dc.description.abstract	[ES] En los últimos años hemos experimentado el crecimiento progresivo de los documentos, sobre todo con la aparición de internet, la globalización y las nuevas tecnologías de la información y comunicación, que han dado lugar a una gran cantidad de documentos en formato digital. Toda esta información era necesario organizarla, y para ello existen diversos sistemas de filtrado de documentos automáticos, siendo uno de los más utilizados el clustering, que a través en este caso del algoritmo de K-Means y el software de Scikit-learn, permite la recuperación y clasificación de documentos afines. Con esta idea se pretendió comprobar la evolución de un conjunto de noticias de ciencia y tecnología extraídas de la prensa digital española, y la utilidad de este sistema de clasificación. Los resultados reflejan que es un buen sistema aunque tiene algunas carencias que dependen sobre todo de factores humanos. [EN] In recent years we have experienced the progressive growth of documents, especially with the emergence of the internet, globalization and new information and communication technologies, which have resulted in a large number of documents in digital format. All this information was necessary to organize it, and for this purpose there are several automatic document filtering systems, one of the most used being clustering, which in this case, the K-Means algorithm and the Scikit-learn software, Retrieval and classification of related documents. With this idea it was tried to verify the evolution of a set of science and technology news extracted from the Spanish digital press, and the usefulness of this classification system. The results reflect that it is a good system although it has some deficiencies that depend mainly on human factors.	es_ES
dc.format.mimetype	application/pdf
dc.language.iso	spa	es_ES
dc.rights	Attribution-NonCommercial-ShareAlike 4.0 International
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.subject	Clasificación automática	es_ES
dc.subject	Prensa digital	es_ES
dc.subject	España	es_ES
dc.subject	Noticias	es_ES
dc.subject	Ciencia	es_ES
dc.subject	K-Means	es_ES
dc.subject	Scikit-learn	es_ES
dc.subject	Automatic clustering	en
dc.subject	Digital press	en
dc.subject	Spain	en
dc.title	Organización Automática de Documentos. Técnicas K-Means	es_ES
dc.type	info:eu-repo/semantics/masterThesis	es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es_ES