Skip to main navigation Skip to search Skip to main content

AssocFormer: Association Transformer for Multi-label Classification

  • Xin Xing
  • , Chong Peng
  • , Yu Zhang
  • , Ai Ling Lin
  • , Nathan Jacobs

Research output: Contribution to conferencePaperpeer-review

Abstract

The goal of multi-label image classification is to predict a set of labels for a single image. Recent work has shown that explicitly modeling the co-occurrence relationship between classes is critical for achieving good performance on this task. State-of-the-art approaches model this using graph convolutional networks, which are complex and computationally expensive. We propose a novel, efficient association module as an alternative. This is coupled with a transformer-based feature-extraction backbone. The proposed model was evaluated using two standard datasets: MS-COCO and PASCAL VOC. The results show that the proposed model outperforms several strong baseline models.

Original languageEnglish
StatePublished - 2022
Event33rd British Machine Vision Conference Proceedings, BMVC 2022 - London, United Kingdom
Duration: Nov 21 2022Nov 24 2022

Conference

Conference33rd British Machine Vision Conference Proceedings, BMVC 2022
Country/TerritoryUnited Kingdom
CityLondon
Period11/21/2211/24/22

Fingerprint

Dive into the research topics of 'AssocFormer: Association Transformer for Multi-label Classification'. Together they form a unique fingerprint.

Cite this