| dc.contributor.author | Cardona-Mendoza, Andrés Felipe | |
| dc.contributor.author | Gil González, Ana Belén | |
| dc.contributor.author | Perdomo Lara, Sandra Janneth | |
| dc.contributor.author | Rossi Labianca, Lautaro | |
| dc.contributor.author | Marcos Recio, Sandra | |
| dc.contributor.author | Barrero Bueno, Andrés | |
| dc.coverage.spatial | Colombia – Bogotá D.C. and 11 branches of Liga Colombiana Contra el Cáncer, lat=40.705781, long=-74.015334 | es_ES |
| dc.coverage.temporal | start=2023-07 and end=2025-02 | es_ES |
| dc.date.accessioned | 2026-04-07T12:30:25Z | |
| dc.date.available | 2026-04-07T12:30:25Z | |
| dc.date.issued | 2026 | |
| dc.identifier.citation | Cardona-Mendoza, A., Gil-González, A.B., et al. (2025). Labeled Patches Dataset for Semi-supervised YOLO Training on Cervical Cytology WSI. GREDOS Repository, University of Salamanca. | es_ES |
| dc.identifier.uri | http://hdl.handle.net/10366/170872 | |
| dc.description | Here is the English version of the abstract, optimized for international repositories (like Zenodo, IEEE Dataport, or Kaggle) using standard terminology in Digital Pathology and Computer Vision:
📝 Dataset Abstract
Title Suggestion: Automated Cervical Cytology Screening: A Labeled WSI Patch Dataset for Deep Learning-Based Object Detection.
Abstract:
The automated detection of cellular structures using Deep Learning models represents a key strategy to optimize cervical cancer screening by reducing clinical workload and inter-observer variability. However, analyzing Whole Slide Images (WSI) presents critical challenges, including the scarcity of high-quality annotations, high morphological complexity, and significant class imbalance.
This dataset addresses these limitations by providing a collection of 640×640px image patches extracted from conventional Papanicolaou (Pap) tests from a Colombian clinical cohort. The data was processed through a semi-automated pipeline integrating manual annotation in QuPath, automated patch extraction, and YOLO label generation via Groovy and Python scripts. To ensure high diagnostic reliability, a web-based expert validation interface was implemented for final label verification. This resource supports the development of robust computer vision models aimed at enhancing precision and efficiency in digital cytological diagnosis. | es_ES |
| dc.description.abstract | [EN]This dataset contains labeled image patches extracted from Colombian Whole Slide Images (WSIs) of conventional Papanicolaou (Pap) tests. It supports the training and validation of object detection models (e.g., YOLO) in automated cervical cytology diagnosis. The patches (640×640px JPGs) were extracted and labeled using a semi-automated pipeline combining manual annotation in QuPath and automated patch extraction and YOLO label generation via Groovy and Python scripts. A web-based expert validation interface was used to ensure label accuracy. | es_ES |
| dc.description.sponsorship | This work was supported by the Secretaría Distrital de Salud de Bogotá (Colombia) and the ATENEA Agency (grant number 368-2022), in collaboration with the Spanish Ministry for Digital Transformation and Civil Service through the University-Enterprise Chair Call (Cátedras ENIA 2022) (Grant TSI-100933-2023-1), co-funded by the European Union NextGenerationEU/PRTR. | es_ES |
| dc.description.tableofcontents | 1. File List:
✓ JPG patches (640x640)
✓ YOLO-format labels (.txt)
✓ Visual validated patches (.jpg with bounding boxes)
✓ Metadata snapshots (.csv)
✓ Logs of processed annotations (.txt)
2. Relationship between files, if important:
Each patch corresponds to a label file. Patches and labels are grouped by class. Visual validated images are derived from the patch-label pair.
3. File format:
✓ Images: .jpg
✓ Labels: .txt (YOLO format)
✓ Logs/Snapshots: .csv, .txt | es_ES |
| dc.language.iso | eng | es_ES |
| dc.publisher | Universidad de Salamanca | es_ES |
| dc.relation.isreferencedby | Andrés, CM., Ana-Belén, GG., Hortua, H.J., Lautaro, R.L., Sandra, PL. (2026). Prototype of a Comprehensive System for Automated Generation and Expert Validation of Labeled Patches on Papanicolaou Test WSI Images for Semi-supervised Training of YOLO Models in Automated Cervical Cytology Diagnosis. In: Fdez-Riverola, F., et al. Practical Applications of Computational Biology and Bioinformatics, 19th International Conference (PACBB 2025). PACBB2025 2025. Lecture Notes in Networks and Systems, vol 1720. Springer, Cham. https://doi.org/10.1007/978-3-032-10634-6_1 | es_ES |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | es_ES |
| dc.subject | Cervical cytology | es_ES |
| dc.subject | Papanicolaou test | es_ES |
| dc.subject | YOLO | es_ES |
| dc.subject | Whole Slide Image | es_ES |
| dc.subject | object detection | es_ES |
| dc.subject | patch dataset | es_ES |
| dc.subject | digital pathology | es_ES |
| dc.subject | label validation | es_ES |
| dc.subject.mesh | Cytodiagnosis | * |
| dc.subject.mesh | Papanicolaou Test | * |
| dc.subject.mesh | Uterine Cervical Neoplasms | * |
| dc.subject.mesh | Cervix Uteri | * |
| dc.title | Labeled Patches Dataset for Semi-supervised YOLO Training on Cervical Cytology WSI [Dataset] | es_ES |
| dc.type | info:eu-repo/semantics/dataset | es_ES |
| dc.subject.unesco | 1203.04 Inteligencia Artificial | es_ES |
| dc.subject.unesco | 2407.04 Citología | es_ES |
| dc.subject.unesco | 2209.09 Radiación Infrarroja | es_ES |
| dc.subject.unesco | 2209.90 Tratamiento Digital. Imágenes | es_ES |
| dc.identifier.doi | 10.71636/g7s3-1n94 | |
| dc.relation.projectID | grantAgreement/TSI-100933-2023-1 | es_ES |
| dc.rights.accessRights | info:eu-repo/semantics/embargoedAccess | es_ES |
| dc.type.hasVersion | info:eu-repo/semantics/publishedVersion | es_ES |
| dc.subject.decs | cuello del útero | * |
| dc.subject.decs | neoplasias del cuello uterino | * |
| dc.subject.decs | citodiagnóstico | * |
| dc.subject.decs | prueba de Papanicolau | * |
| dc.publication.year | 2026 | |