Aligning Geo-Tagged Clip Representations and Satellite Imagery for Few-Shot Land Use Classification

  • Pallavi Jain
  • , Diego Marcos
  • , Dino Ienco
  • , Roberto Interdonato
  • , Aayush Dhakal
  • , Nathan Jacobs
  • , Tristan Berchoux

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A major difference between ground-level and satellite imagery of landscapes lies in their semantic granularity: ground-level images tend to offer details on objects and human activities, while satellite images provide broader geographic context but, typically, with coarser semantics. This study aims to leverage this complementary information by integrating fine-grained insights from a ground-level view into the analysis of satellite image data. To achieve this integration, we propose to align a satellite image representation with co-located geo-tagged ground-level image CLIP representations. This method focuses on enriching satellite image visual features by leveraging the inherent visual characteristics found in ground-level images as a reference in a contrastive manner, without relying on additional textual information to guide the learning process. We evaluate the quality of the learned representations on the EuroSAT benchmark in various few-shot settings.

Original languageEnglish
Title of host publicationIGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages319-323
Number of pages5
ISBN (Electronic)9798350360325
DOIs
StatePublished - 2024
Event2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024 - Athens, Greece
Duration: Jul 7 2024Jul 12 2024

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)

Conference

Conference2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024
Country/TerritoryGreece
CityAthens
Period07/7/2407/12/24

Keywords

  • computer vision
  • contrastive learning
  • land use
  • satellite images

Fingerprint

Dive into the research topics of 'Aligning Geo-Tagged Clip Representations and Satellite Imagery for Few-Shot Land Use Classification'. Together they form a unique fingerprint.

Cite this