| dc.contributor.author | López Sánchez, Daniel | |
| dc.contributor.author | González Arrieta, María Angélica | |
| dc.contributor.author | Corchado Rodríguez, Juan Manuel | |
| dc.date.accessioned | 2024-04-04T14:35:49Z | |
| dc.date.available | 2024-04-04T14:35:49Z | |
| dc.date.issued | 2019 | |
| dc.identifier.citation | López-Sánchez, D., Arrieta, A. G., & Corchado, J. M. (2019). Visual content-based web page categorization with deep transfer learning and metric learning. Neurocomputing, 338, 418-431. https://doi.org/10.1016/J.NEUCOM.2018.08.086 | es_ES |
| dc.identifier.issn | 0925-2312 | |
| dc.identifier.uri | http://hdl.handle.net/10366/157119 | |
| dc.description.abstract | [EN]The growing amounts of online multimedia content challenge the current search, recommendation and
information retrieval systems. Information in the form of visual elements is highly valuable in a range of
web mining tasks. However, the mining of these resources is a difficult task due to the complexity and
variability of images, and the cost of collecting big enough datasets to successfully train accurate deep
learning models. This paper proposes a novel framework for the categorization of web pages on the basis
of their visual content. This is achieved by exploring the joint application of a transfer learning strategy
and metric learning techniques to build a Deep Convolutional Neural Network (DCNN) for feature extrac-
tion, even when training data is scarce. The obtained experimental results evidence that the proposed
approach outperforms the state-of-the-art handcrafted image descriptors and achieves a high categoriza-
tion accuracy. In addition, we address the problem of over-time learning, so the proposed framework can
learn to identify new web page categories as new labeled images are provided at test time. As a result,
prior knowledge of the complete set of possible web categories is not necessary in the initial training
phase. | es_ES |
| dc.language.iso | eng | es_ES |
| dc.publisher | Elsevier | es_ES |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
| dc.subject | Web page categorization | es_ES |
| dc.subject | Metric learning | es_ES |
| dc.subject | Transfer learning | es_ES |
| dc.subject | Deep learning | es_ES |
| dc.title | Visual content-based web page categorization with deep transfer learning and metric learning. | es_ES |
| dc.type | info:eu-repo/semantics/article | es_ES |
| dc.relation.publishversion | https://doi.org/10.1016/j.neucom.2018.08.086 | es_ES |
| dc.subject.unesco | 1203.17 Informática | es_ES |
| dc.identifier.doi | 10.1016/J.NEUCOM.2018.08.086 | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | es_ES |
| dc.journal.title | Neurocomputing | es_ES |
| dc.volume.number | 338 | es_ES |
| dc.page.initial | 418 | es_ES |
| dc.page.final | 431 | es_ES |
| dc.type.hasVersion | info:eu-repo/semantics/publishedVersion | es_ES |