Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Domain-Specialized Object Detection via Model-Level Mixtures of Experts

About

Mixture-of-Experts (MoE) models provide a structured approach to combining specialized neural networks and offer greater interpretability than conventional ensembles. While MoEs have been successfully applied to image classification and semantic segmentation, their use in object detection remains limited due to challenges in merging dense and structured predictions. In this work, we investigate model-level mixtures of object detectors and analyze their suitability for improving performance and interpretability in object detection. We propose an MoE architecture that combines YOLO-based detectors trained on semantically disjoint data subsets, with a learned gating network that dynamically weights expert contributions. We study different strategies for fusing detection outputs and for training the gating mechanism, including balancing losses to prevent expert collapse. Experiments on the BDD100K dataset demonstrate that the proposed MoE consistently outperforms standard ensemble approaches and provides insights into expert specialization across domains, highlighting model-level MoEs as a viable alternative to traditional ensembling for object detection. Our code is available at https://github.com/KASTEL-MobilityLab/mixtures-of-experts/.

Svetlana Pavlitska, Malte St\"uven, Beyza Keskin, J. Marius Z\"ollner• 2026

Related benchmarks

TaskDatasetResultRank
Object DetectionBDD100K
mAP77.24
88
Object DetectionBDD100K (Nighttime)
AP55.36
66
Object DetectionBDD100k daytime
mAP (Overall)60.82
47
Object DetectionBDD100K Dawn Dusk
mAP61.41
40
Object DetectionBDD100K Day+Night
mAP58.75
40
Object DetectionBDD100K (all)
mAP59.19
40
Showing 6 of 6 rows

Other info

Follow for update