Image geolocalization is receiving increasing attention due to its importance in several applications, such as image retrieval, criminal investigations and fact-checking. Previous works focused on several instances of image geolocalization including place recognition, GPS coordinates estimation and country recognition. In this paper, we tackle an even more challenging problem, which is recognizing the city where an image has been taken. Due to the vast number of cities in the world, we cast the problem as a verification problem, whereby the system has to decide whether a certain image has been taken in a given city or not. In particular, we present a system that given a query image and a small set of images taken in a target city, decides if the query image has been shot in the target city or not. To allow the system to handle the case of images, taken in cities that have not been used during training, we use a Siamese network based on Vision Transformer as a backbone. The experiments we run prove the validity of the proposed system which outperforms solutions based on state-of-the-art techniques, even in the challenging case of images shot in different cities of the same country.
A Siamese Based System for City Verification / O. Alamayreh, J. Wang, G.M. Dimitri, B. Tondi, M. Barni (FRONTIERS IN ARTIFICIAL INTELLIGENCE AND APPLICATIONS). - In: ECAI 2023 / [a cura di] K. Gal, A. Nowé, G.J. Nalepa, R. Fairstein, R. Rădulescu. - Amsterdam : IOS press, 2023. - ISBN 9781643684369. - pp. 69-76 (( Intervento presentato al 26. convegno European Conference on Artificial Intelligence tenutosi a Krakow nel 2023 [10.3233/FAIA230255].
A Siamese Based System for City Verification
G.M. Dimitri;
2023
Abstract
Image geolocalization is receiving increasing attention due to its importance in several applications, such as image retrieval, criminal investigations and fact-checking. Previous works focused on several instances of image geolocalization including place recognition, GPS coordinates estimation and country recognition. In this paper, we tackle an even more challenging problem, which is recognizing the city where an image has been taken. Due to the vast number of cities in the world, we cast the problem as a verification problem, whereby the system has to decide whether a certain image has been taken in a given city or not. In particular, we present a system that given a query image and a small set of images taken in a target city, decides if the query image has been shot in the target city or not. To allow the system to handle the case of images, taken in cities that have not been used during training, we use a Siamese network based on Vision Transformer as a backbone. The experiments we run prove the validity of the proposed system which outperforms solutions based on state-of-the-art techniques, even in the challenging case of images shot in different cities of the same country.| File | Dimensione | Formato | |
|---|---|---|---|
|
FAIA-372-FAIA230255.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
502.65 kB
Formato
Adobe PDF
|
502.65 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




