TY - GEN
T1 - Task Agnostic Cost Prediction Module for Semantic Labeling in Active Learning
AU - Sastry, Srikumar
AU - Jacobs, Nathan
AU - Belgiu, Mariana
AU - Maretto, Raian Vargas
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - We consider the problem of cost effective active learning for semantic segmentation, which aims at reducing the efforts of semantically annotating images. Current studies have ignored the inclusion of cost of labeling into their active learning frameworks. To this end, we first present a novel cost prediction module based on what we call the M-Net. M-Net combines the power of unsupervised W-Net and supervised U-Net to compute a refined segmentation map. The refined segmentation map is used to estimate the cost of annotations. The cost of annotation is estimated by the number of clicks required to annotate an image. To solve this task, we make use of the harris corner detector algorithm to estimate the location of the clicks required to annotate an image. Finally, we employ a multi armed bandit setting to minimize the cost of annotations while maximizing the performance of the semantic segmentation task. The M-Net outperforms fully supervised U-Net with +4.37 Acc and +3.75 mIoU. The proposed active learning framework also outperforms the existing baselines to prove the relevance of the approach in the current paradigm.
AB - We consider the problem of cost effective active learning for semantic segmentation, which aims at reducing the efforts of semantically annotating images. Current studies have ignored the inclusion of cost of labeling into their active learning frameworks. To this end, we first present a novel cost prediction module based on what we call the M-Net. M-Net combines the power of unsupervised W-Net and supervised U-Net to compute a refined segmentation map. The refined segmentation map is used to estimate the cost of annotations. The cost of annotation is estimated by the number of clicks required to annotate an image. To solve this task, we make use of the harris corner detector algorithm to estimate the location of the clicks required to annotate an image. Finally, we employ a multi armed bandit setting to minimize the cost of annotations while maximizing the performance of the semantic segmentation task. The M-Net outperforms fully supervised U-Net with +4.37 Acc and +3.75 mIoU. The proposed active learning framework also outperforms the existing baselines to prove the relevance of the approach in the current paradigm.
KW - Active Learning
KW - Semantic Segmentation
KW - Uncertainty
UR - https://www.scopus.com/pages/publications/85178350064
U2 - 10.1109/IGARSS52108.2023.10281434
DO - 10.1109/IGARSS52108.2023.10281434
M3 - Conference contribution
AN - SCOPUS:85178350064
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 4990
EP - 4993
BT - IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2023
Y2 - 16 July 2023 through 21 July 2023
ER -