Show simple item record

dc.contributor.authorCaño Pascual, Pablo
dc.contributor.authorFran Abadía, Pablo
dc.contributor.authorValdes-Ramirez, Danilo
dc.contributor.authorGonzález Briones, Alfonso 
dc.contributor.authorBarrio Val, Pablo
dc.date.accessioned2026-06-10T07:59:53Z
dc.date.available2026-06-10T07:59:53Z
dc.date.issued2026-03-02
dc.identifier.urihttp://hdl.handle.net/10366/171782
dc.description.abstract[EN]Efficient environmental perception is a cornerstone of Advanced Driver Assistance Systems (ADAS) and autonomous driving. A persistent architectural dilemma in this domain is whether to employ unified Multi-Task Learning (MTL) frameworks, which optimize computation through shared backbones, or modular multi-model pipelines, which prioritize task-specific accuracy. This paper presents a comparative analysis of these two paradigms for joint object detection and drivable area estimation. Specifically, we evaluate YOLOPX, a representative anchor-free MTL architecture, against a decoupled multi-model system that integrates RT-DETRv2 for vehicle detection and the lightweight YOLO11n-seg for drivable area segmentation on the BDD100K benchmark under identical hardware conditions. The results show that, although the MTL YOLOPX model achieves higher throughput, the decoupled system delivers substantially better detection performance, particularly in the stricter 𝑚𝐴𝑃 50:95 metric, while preserving competitive segmentation quality and maintaining real-time latency suitable for edge deployment. These findings suggest that modular designs, rather than monolithic MTL models, can offer a more favorable balance between safety-critical detection accuracy and computational efficiency for next-generation intelligent vehicles.es_ES
dc.description.sponsorshipProject “A catalyst for EuropeaN ClOUd Services in the era of data spaces, high-performance and edge computing (NOUS)”, Grant Agreement Number 101135927. Funded by the European Union.es_ES
dc.format.mimetypeapplication/pdf
dc.language.isoenges_ES
dc.relation.ispartofseriesACM conference proceedings;
dc.rightsAttribution 4.0 Internationales_ES
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/es_ES
dc.subjectAutonomous driving perceptiones_ES
dc.subjectMulti-task learninges_ES
dc.subjectReal-time object detectiones_ES
dc.subjectVision Transformerses_ES
dc.subjectDrivable area segmentationes_ES
dc.titleUnified Multi-Task Learning vs. Decoupled Transformer-based Perception: A Comparative Analysises_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.subject.unesco1203 Ciencia de los ordenadoreses_ES
dc.relation.projectIDEuropean Union 101135927es_ES
dc.rights.accessRightsinfo:eu-repo/semantics/openAccesses_ES
dc.journal.titleInternational conference on Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) solutions for Europe’s Next-Gen Cloud Infrastructurees_ES
dc.type.hasVersioninfo:eu-repo/semantics/draftes_ES


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's license is described as Attribution 4.0 International