Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Nov 08, 2023

Yash Jain, Harkirat Behl, Zsolt Kira, Vibhav Vineet

Figure 1 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Figure 2 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Figure 3 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Figure 4 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Share this with someone who'll enjoy it:

Abstract:Construction of a universal detector poses a crucial question: How can we most effectively train a model on a large mixture of datasets? The answer lies in learning dataset-specific features and ensembling their knowledge but do all this in a single model. Previous methods achieve this by having separate detection heads on a common backbone but that results in a significant increase in parameters. In this work, we present Mixture-of-Experts as a solution, highlighting that MoEs are much more than a scalability tool. We propose Dataset-Aware Mixture-of-Experts, DAMEX where we train the experts to become an `expert' of a dataset by learning to route each dataset tokens to its mapped expert. Experiments on Universal Object-Detection Benchmark show that we outperform the existing state-of-the-art by average +10.2 AP score and improve over our non-MoE baseline by average +2.0 AP score. We also observe consistent gains while mixing datasets with (1) limited availability, (2) disparate domains and (3) divergent label sets. Further, we qualitatively show that DAMEX is robust against expert representation collapse.

* https://github.com/jinga-lala/DAMEX

View paper on

Share this with someone who'll enjoy it:

Title:DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Paper and Code