Mostra i principali dati dell'item

dc.contributor.authorCanal-Alonso, Ángel
dc.contributor.authorEgido, Noelia
dc.contributor.authorJiménez, Pedro
dc.contributor.authorPrieto Tejedor, Javier 
dc.contributor.authorCorchado Rodríguez, Juan Manuel 
dc.date.accessioned2023-10-03T07:53:26Z
dc.date.available2023-10-03T07:53:26Z
dc.date.issued2022
dc.identifier.urihttp://hdl.handle.net/10366/153115
dc.description.abstract[EN]The Variant Detection process (Variant Calling) is fundamental in bioinformatics, demanding maximum precision and reliability. This study examines an innovative integration strategy between a traditional pipeline developed in-house and an advanced Intelligent System (IS). Although the original pipeline already had tools based on traditional algorithms, it had limitations, particularly in the detection of rare or unknown variants. Therefore, SI was introduced with the aim of providing an additional layer of analysis, capitalizing on deep and symbolic learning techniques to improve and enhance previous detections. The main technical challenge lay in interoperability. To overcome this, NextFlow, a scripting language designed to manage complex bioinformatics workflows, was employed. Through NextFlow, communication and efficient data transfer between the original pipeline and the SI were facilitated, thus guaranteeing compatibility and reproducibility. After the Variant Calling process of the original system, the results were transmitted to the SI, where a meticulous sequence of analysis was implemented, from preprocessing to data fusion. As a result, an optimized set of variants was generated that was integrated with previous results. Variants corroborated by both tools were considered to be of high reliability, while discrepancies indicated areas for detailed investigations. The product of this integration advanced to subsequent stages of the pipeline, usually annotation or interpretation, contextualizing the variants from biological and clinical perspectives. This adaptation not only maintained the original functionalities of the pipeline, but was also enhanced with the SI, establishing a new standard in the Variant Calling process. This research offers a robust and efficient model for the detection and analysis of genomic variants, highlighting the promise and applicability of blended learning in bioinformaticses_ES
dc.description.sponsorshipThis study has been funded by the AIR Genomics project (with file number CCTT3/20/SA/0003), through the call 2020 R&D PROJECTS ORIENTED TO THE EXCELLENCE AND COMPETITIVE IMPROVEMENT OF THE CCTT by the Institute of Business Competitiveness of Castilla y León and FEDER fundses_ES
dc.language.isoenges_ES
dc.subjectNext-Generation sequencinges_ES
dc.subjectExplainable Artificial Intelligencees_ES
dc.subjectDeep Symbolic Learninges_ES
dc.titleDeep Symbolic Learning Architecture for Variant Calling in NGSes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.subject.unesco1203.04 Inteligencia Artificiales_ES
dc.subject.unesco2410.07 Genética Humanaes_ES
dc.relation.projectIDCCTT3/20/SA/0003es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersiones_ES


Files in questo item

Thumbnail

Questo item appare nelle seguenti collezioni

Mostra i principali dati dell'item