Robust Camera Location Estimation by Convex Programming · In J. G. Oxley, J. E. Bonin, and B....

1
Robust Camera Location Estimation by Convex Programming Onur Özye¸ sil 1 and Amit Singer 1,2 1 Program in Applied and Computational Mathematics, Princeton University. 2 Department of Mathematics, Princeton University. 3D structure recovery from a collection of 2D images requires the estima- tion of the camera locations and orientations, i.e. the camera motion. For large, irregular collections of images, existing methods for the location es- timation part, which can be formulated as the inverse problem of estimating n locations t 1 , t 2 ,..., t n in R 3 from noisy measurements of a subset of the pairwise directions γ ij . .= t i t j t i t j (cf. Figure 1), are sensitive to outliers in direction measurements. t 1 t 2 t 3 t 4 t 5 t 6 γ 12 γ 13 γ 24 γ 15 γ 25 γ 36 γ 45 γ 46 γ12 γ13 γ 24 γ 15 γ 25 γ 45 γ 36 γ 46 Pairwise Directions Locations Figure 1: A (noiseless) instance of the location estimation problem in R 3 , with n = 6 locations and m = 8 pairwise directions. We represent the pairwise directions using a measurement graph G t = ( V t , E t ), where the i’th node in V t = {1, 2,..., n} corresponds to the location t i and each edge (i, j) E t is endowed with the direction γ ij . Provided with the set {γ ij } (i, j)E t of (noiseless) directions on G t =( V t , E t ), we first con- sider the following fundamental questions: Can we uniquely realize {t i } iV t , of course, up to a global translation and scale? Is unique realizability a generic property of G t and can it be decided efficiently? The unique realiz- ability of locations (in R d , d 2) was previously studied under the general title of parallel rigidity theory (see, e.g., [1, 4, 6]), and was shown to be a generic property of G t , which admits a complete combinatorial characteri- zation: Theorem 1. For a graph G =( V, E ), let (d 1)E denote the set consisting of (d 1) copies of each edge in E. Then, G is generically parallel rigid in R d if and only if there exists a nonempty set D (d 1)E, with |D| = d| V |− (d + 1), such that for all subsets D of D, we have |D |≤ d| V (D )|− (d + 1) , (1) where V (D ) denotes the vertex set of the edges in D . There exist efficient algorithms for testing parallel rigidity (see, e.g., [4] for a randomized spectral test having a time complexity of O(m)). More- over, for a graph that is not parallel rigid, maximal uniquely realizable subgraphs can be efficiently extracted (see, e.g., [2]). For noisy direction measurements, we consider problem instances on parallel rigid graphs to be well-posed instances. Now, suppose that we are given a set of noisy pairwise direction mea- surements {γ ij } (i, j)E t , i.e., for each (i, j) E t , γ ij satisfies γ ij = t i t j t i t j + ε γ ij (2) where, ε γ ij denotes the direction error. Our objective is to estimate the lo- cations {t i } iV t by maintaining robustness to outliers (i.e., γ ij ’s with large ε γ ij ’s) in a computationally efficient manner. In this respect, we first rewrite (2) as t i t j = t i t j γ ij + ε t ij (3) ⇐⇒ ε t ij = t i t j d ij γ ij (4) where, ε t ij denotes the displacement error, and we define d ij . .= t i t j to rewrite ε t ij linearly in t i , t j and d ij . Observe that, large direction errors This is an extended abstract. The full paper is available at the Computer Vision Foundation webpage. Figure 2: Snapshots of selected 3D structures computed using the initial location estimates of the LUD solver (5) (without bundle adjustment). ε γ ij ’s induce large displacement errors ε t ij ’s. Hence, we employ displacement error minimization as a substitute for direction error minimization. Also, to maintain robustness to large ε t ij ’s, we minimize the sum of unsquared norms of ε t ij ’s, and for computational efficiency, we drop the intrinsic non-convex constraints d ij = t i t j to obtain the convex “least unsquared deviations” (LUD) formulation minimize {t i },{d ij } (i, j)E t t i t j d ij γ ij subject to iV t t i = 0 ; d ij 1, (i, j) E t (5) where the constraints are used to remove the global ambiguities, and to pre- vent trivial solutions, as well as solutions clustered around a few locations. In our paper, we also provide a new method for estimating γ ij that is robust to outliers in point correspondences, and an efficient iteratively- reweighted least squares (IRLS) algorithm to solve the LUD problem (5). Additionally, we compare the performance of our formulations to various methods in the literature through experiments on synthetic data sets, and In- ternet photo collections (cf. Figure 2 for selected 3D structures computed using the LUD solver (5)), which demonstrate the relatively high accuracy and efficiency of our approach. [1] T. Eren, W. Whiteley, and P. N. Belhumeur. Using angle of arrival (bearing) information in network localization. In IEEE Conference on Decision & Control, San Diego, CA, USA, December 2006. [2] R. Kennedy, K. Daniilidis, O. Naroditsky, and C. J. Taylor. Identifying maximal rigid components in bearing-based localization. In IEEE/RSJ International Conference on Intelligent Robots and Systems, Vilam- oura, Algarve, Portugal, October 2012. [3] P. Moulon, P. Monasse, and R. Marlet. Global fusion of relative motions for robust, accurate and scalable structure from motion. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 3248– 3255, Dec 2013. [4] O. Özye¸ sil, A. Singer, and R. Basri. Stable camera mo- tion estimation using convex programming, 2015. SIAM Journal on Imaging Sciences, accepted. Also available at http://arxiv.org/abs/1312.5047. [5] R. Tron and R. Vidal. Distributed 3-D localization of camera sensor net- works from 2-D image measurements. Automatic Control, IEEE Trans- actions on, 59(12):3325–3340, Dec 2014. [6] W. Whiteley. Matroids from discrete geometry. In J. G. Oxley, J. E. Bonin, and B. Servatius, editors, Matroid Theory, volume 197 of Con- temporary Mathematics, pages 171–313. American Mathematical Soci- ety, 1996. We note that the least squares version of (5) (i.e., the program with the cost function (i, j)Et t i t j d ij γ ij 2 , and the same constraints as in (5)), and the version of (5), were previously studied (resp.) in [5] and in [3].

Transcript of Robust Camera Location Estimation by Convex Programming · In J. G. Oxley, J. E. Bonin, and B....

Page 1: Robust Camera Location Estimation by Convex Programming · In J. G. Oxley, J. E. Bonin, and B. Servatius, editors, Matroid Theory, volume 197 of Con-temporary Mathematics, pages 171–313.

Robust Camera Location Estimation by Convex Programming

Onur Özyesil1 and Amit Singer1,2

1Program in Applied and Computational Mathematics, Princeton University. 2Department of Mathematics, Princeton University.

3D structure recovery from a collection of 2D images requires the estima-tion of the camera locations and orientations, i.e. the camera motion. Forlarge, irregular collections of images, existing methods for the location es-timation part, which can be formulated as the inverse problem of estimatingn locations t1, t2, . . . , tn in R

3 from noisy measurements of a subset of the

pairwise directions γi j..=

ti−t j

‖ti−t j‖(cf. Figure 1), are sensitive to outliers in

direction measurements.

t1

t2

t3

t4

t5

t6

γ12

γ13 γ24

γ15

γ25

γ36

γ45γ46

γ12

γ13

γ24

γ15

γ25γ45

γ36

γ46

Pairwise Directions Locations

Figure 1: A (noiseless) instance of the location estimation problem in R3,

with n = 6 locations and m = 8 pairwise directions.

We represent the pairwise directions using a measurement graph Gt =(Vt ,Et), where the i’th node in Vt = {1,2, . . . ,n} corresponds to the locationti and each edge (i, j) ∈ Et is endowed with the direction γi j . Provided withthe set {γi j}(i, j)∈Et

of (noiseless) directions on Gt = (Vt ,Et), we first con-sider the following fundamental questions: Can we uniquely realize {ti}i∈Vt

,of course, up to a global translation and scale? Is unique realizability ageneric property of Gt and can it be decided efficiently? The unique realiz-ability of locations (in R

d , d ≥ 2) was previously studied under the generaltitle of parallel rigidity theory (see, e.g., [1, 4, 6]), and was shown to be ageneric property of Gt , which admits a complete combinatorial characteri-zation:

Theorem 1. For a graph G = (V,E), let (d−1)E denote the set consisting

of (d − 1) copies of each edge in E. Then, G is generically parallel rigid

in Rd if and only if there exists a nonempty set D ⊆ (d − 1)E, with |D| =

d|V |− (d +1), such that for all subsets D′ of D, we have

|D′| ≤ d|V (D′)|− (d +1) , (1)

where V (D′) denotes the vertex set of the edges in D′.

There exist efficient algorithms for testing parallel rigidity (see, e.g., [4]for a randomized spectral test having a time complexity of O(m)). More-over, for a graph that is not parallel rigid, maximal uniquely realizablesubgraphs can be efficiently extracted (see, e.g., [2]). For noisy directionmeasurements, we consider problem instances on parallel rigid graphs to bewell-posed instances.

Now, suppose that we are given a set of noisy pairwise direction mea-surements {γi j}(i, j)∈Et

, i.e., for each (i, j) ∈ Et , γi j satisfies

γi j =ti − t j

‖ti − t j‖+ ε

γi j (2)

where, εγi j denotes the direction error. Our objective is to estimate the lo-

cations {ti}i∈Vtby maintaining robustness to outliers (i.e., γi j’s with large

εγi j’s) in a computationally efficient manner. In this respect, we first rewrite

(2) as

ti − t j = ‖ti − t j‖γi j + ε ti j (3)

⇐⇒ ε t

i j = ti − t j −di jγi j (4)

where, ε ti j denotes the displacement error, and we define di j

..= ‖ti − t j‖

to rewrite ε t

i j linearly in ti, t j and di j . Observe that, large direction errors

This is an extended abstract. The full paper is available at theComputer Vision Foundation webpage.

Figure 2: Snapshots of selected 3D structures computed using the initiallocation estimates of the LUD solver (5) (without bundle adjustment).

εγi j’s induce large displacement errors ε t

i j’s. Hence, we employ displacementerror minimization as a substitute for direction error minimization. Also, tomaintain robustness to large ε t

i j’s, we minimize the sum of unsquared norms

of ε ti j’s, and for computational efficiency, we drop the intrinsic non-convex

constraints di j = ‖ti − t j‖ to obtain the convex “least unsquared deviations”(LUD) formulation

minimize{ti},{di j}

∑(i, j)∈Et

∥ti − t j −di jγi j

subject to ∑i∈Vt

ti = 0 ; di j ≥ 1, ∀(i, j) ∈ Et

(5)

where the constraints are used to remove the global ambiguities, and to pre-vent trivial solutions, as well as solutions clustered around a few locations.

In our paper, we also provide a new method for estimating γi j thatis robust to outliers in point correspondences, and an efficient iteratively-reweighted least squares (IRLS) algorithm to solve the LUD problem (5).Additionally, we compare the performance of our formulations to variousmethods in the literature through experiments on synthetic data sets, and In-ternet photo collections (cf. Figure 2 for selected 3D structures computedusing the LUD solver (5)), which demonstrate the relatively high accuracyand efficiency of our approach.

[1] T. Eren, W. Whiteley, and P. N. Belhumeur. Using angle of arrival(bearing) information in network localization. In IEEE Conference on

Decision & Control, San Diego, CA, USA, December 2006.

[2] R. Kennedy, K. Daniilidis, O. Naroditsky, and C. J. Taylor. Identifyingmaximal rigid components in bearing-based localization. In IEEE/RSJ

International Conference on Intelligent Robots and Systems, Vilam-oura, Algarve, Portugal, October 2012.

[3] P. Moulon, P. Monasse, and R. Marlet. Global fusion of relative motionsfor robust, accurate and scalable structure from motion. In Computer

Vision (ICCV), 2013 IEEE International Conference on, pages 3248–3255, Dec 2013.

[4] O. Özyesil, A. Singer, and R. Basri. Stable camera mo-tion estimation using convex programming, 2015. SIAM

Journal on Imaging Sciences, accepted. Also available athttp://arxiv.org/abs/1312.5047.

[5] R. Tron and R. Vidal. Distributed 3-D localization of camera sensor net-works from 2-D image measurements. Automatic Control, IEEE Trans-

actions on, 59(12):3325–3340, Dec 2014.

[6] W. Whiteley. Matroids from discrete geometry. In J. G. Oxley, J. E.Bonin, and B. Servatius, editors, Matroid Theory, volume 197 of Con-

temporary Mathematics, pages 171–313. American Mathematical Soci-ety, 1996.

We note that the least squares version of (5) (i.e., the program with the cost function

∑(i, j)∈Et‖ti − t j −di jγi j‖

2, and the same constraints as in (5)), and the ℓ∞ version of (5), werepreviously studied (resp.) in [5] and in [3].