Cross-View Image Geolocalization
description
Transcript of Cross-View Image Geolocalization
![Page 1: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/1.jpg)
Cross-View Image Geolocalization
CVPR2013 POSTER
![Page 2: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/2.jpg)
Outline
IntroductionMethodExperimentsConclusion
![Page 3: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/3.jpg)
How can we determine where the photos were taken?
Introduction
![Page 4: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/4.jpg)
In im2gps [9], we can sometimes get a coarse geolocation based on the distribution of similar scenes.
We take a small step in this direction by exploiting two previously unused geographic data sets – overhead appearance and land cover survey data.
Introduction
![Page 5: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/5.jpg)
These data sets have the benefits of being
(1) densely available for nearly all of the Earth (2) rich enough such that the mapping from ground level appearance
Introduction
![Page 6: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/6.jpg)
Dataset1.Ground-level Imagery2.Aerial Imagery3.Land Cover Attribute Imagery
Image Features and Kernels Matching
Method
![Page 7: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/7.jpg)
Dataset
We examine a 40km40km region around Charleston, SC.
Our training data consists of triplets of ground-level images, aerial images, and attribute maps.
![Page 8: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/8.jpg)
Dataset
Ground-level Imagery:We downloaded 6756 ground-level images from Panoramio.
Aerial Imagery :The satellite imagery is downloaded from Bing Maps.
Each 256256 image covers a 180m180m region.
The image resolution is 0.6 m/pixel.
We collect a total of 182,988 images for the map database.
![Page 9: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/9.jpg)
Dataset
Land Cover Attribute Imagery:
The National Gap Analysis Program (GAP) conducted a national wide geology survey and produced the USGS GAP Land Cover Data Set.
The dataset contains two hierarchical levels of classes including 8 general classes and 590 land use classes as the subclasses.
The attribute image resolution is 30 m/pixel.
We crop 55 images for the training data and the map database.
![Page 10: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/10.jpg)
Dataset
![Page 11: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/11.jpg)
Image Features and Kernels
Ground and Aerial Image:
We represent each ground image and aerial tile using four features: 1. HoG 2. self-similarity3. gist4. color histograms.
When computing image similarity we use a histogram intersection kernel for HoG pyramids, kernel for self-similarity pyramids and color histograms, and RBF kernel for gist descriptors.
![Page 12: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/12.jpg)
Image Features and Kernels
Land Cover Attribute Image:
Each ground and aerial image has a corresponding 55 attribute image.
We build histograms for the general classes and subclasses and then concatenate them to form a multi-nomial attribute feature.
We compare attribute features with a kernel.
![Page 13: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/13.jpg)
Matching
We introduce three baseline algorithms for comparison.
1. im2gps2. Direct Match3. Kernelized Canonical Correlation Analysis
We introduce two noval data-driven approaches
1. Data-driven Feature Averaging2. Discriminative Translation
![Page 14: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/14.jpg)
Matching
im2gps:It geolocalizes a query image by guessing the locations of the top k scene matches.
We compute to measure the similarity of the query image and the training images.
denotes the query image. The bold-faced denotes ground-level images and denotes
aerial/attribute images of corresponding training triplets. denotes the aerial/attributes images in the map database.denotes the kernel function.
![Page 15: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/15.jpg)
Matching
Direct Match (DM):The simplest method to match ground level images to aerial images is just to match the same features with no translation.
![Page 16: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/16.jpg)
Matching
Kernelized Canonical Correlation Analysis (KCCA):
we use the “kernel trick” to represent a basis as a linear combination of training examples by and
We compute the cross-view correlation score of query and training images by summing over the correlation scores on the top d bases:
![Page 17: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/17.jpg)
Matching
Data-driven Feature Averaging (AVG):
we propose a simple method to translate ground-level to aerial and attribute features by averaging the features of good scene matches.
we use the averaged features to match features in the map database:
![Page 18: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/18.jpg)
Matching
Discriminative Translation (DT):The positive set of DT is the same as AVG and we add to that a relatively large set of negative training samples from the bottom 50% ranked scene matches.
We apply the trained classifier on the map database:
![Page 19: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/19.jpg)
Matching
![Page 20: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/20.jpg)
Experiments
Test Sets :
1. Random: we randomly draw 1000 images from the ground-level image database.
2. Isolated: We use 737 isolated images with no training images in the same 180m x 180m block.
![Page 21: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/21.jpg)
Matching Performance
![Page 22: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/22.jpg)
Matching Performance
![Page 23: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/23.jpg)
Experiments
![Page 24: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/24.jpg)
Experiments
![Page 25: Cross-View Image Geolocalization](https://reader035.fdocuments.in/reader035/viewer/2022062803/56814830550346895db551f4/html5/thumbnails/25.jpg)
We propose a cross-domain matching framework that greatly extends the domain of image geolocation.
Using our new dataset of ground-level, aerial, and attribute images we quantified the performance of several baseline and novel approaches for “cross-view” geolocation.
Our approach can roughly geolocate 17% of isolated query images, compared to 0% for existing methods.
Conclusion