ECMEE: Expert Constrained Multi-Expert Ensembles with Category Entropy Minimization for Long-tailed Visual Recognition

Yu Fu, Changjing Shang, Jungong Han*, Qiang Shen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

When the training dataset follows a long-tail distribution, models tend to prioritize the majority of the data, thus resulting in lower predictive accuracy for the minority data. Among existing methods, integrating multiple experts with different logit distributions has yielded promising results. However, the current state-of-the-art (SOTA) ensemble method, i.e., Self-supervised Aggregation of Diverse Experts, trains three expert models separately to favor the head, middle, and tail data, respectively, without imposing mutual constraints. Failure to constrain the magnitude of logits among experts may result in higher category entropy, making it difficult to achieve an optimal ensemble solution. To address this issue, we propose the Expert Constrained Multi-Expert Ensembles with Category Entropy Minimization method, which consists of two new strategies: (1) Confidence Enhancement Loss to constrain the expert models based on maximizing target and non-target logit margins, thereby minimizing category entropy; (2) Shot-aware Weights associated with expert models to accommodate the shot-headed characteristic of the experts. Experiments demonstrate that our method effectively reduces expert category entropy, improves integration effectiveness, and achieves SOTA results on three datasets in diverse test distributions.

Original languageEnglish
Article number127357
JournalNeurocomputing
Volume576
Early online date02 Feb 2024
DOIs
Publication statusPublished - 01 Apr 2024

Keywords

  • Long-tail
  • Expert Ensemble
  • Visual Recognition
  • Entropy Minimization
  • Self-supervised Learning

Fingerprint

Dive into the research topics of 'ECMEE: Expert Constrained Multi-Expert Ensembles with Category Entropy Minimization for Long-tailed Visual Recognition'. Together they form a unique fingerprint.

Cite this