RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments


Transcript of RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments

1

RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments

Peter Henry1, Michael Krainin1, Evan Herbst1, Xiaofeng Ren2, and Dieter Fox1,2

1 University of Washington Computer Science and Engineering
2 Intel Labs Seattle

2

RGB-D Data

3

4

Related Work

SLAM:
- [Davison et al., PAMI 2007] (monocular)
- [Konolige et al., IJR 2010] (stereo)
- [Pollefeys et al., IJCV 2007] (multi-view stereo)
- [Borrmann et al., RAS 2008] (3D laser)
- [May et al., JFR 2009] (ToF sensor)

Loop closure detection:
- [Nister et al., CVPR 2006]
- [Paul et al., ICRA 2010]

Photo collections:
- [Snavely et al., SIGGRAPH 2006]
- [Furukawa et al., ICCV 2009]

5

System Overview

1. Frame-to-frame alignment
2. Loop closure detection and global consistency
3. Map representation

6

Alignment (3D SIFT RANSAC)

SIFT features (from image) in 3D (from depth)
RANSAC alignment requires no initial estimate
Requires sufficient matched features
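The RANSAC stage can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, inlier threshold, and iteration count are invented for the example, and the matched 3D SIFT keypoints are assumed to be given as two corresponding point arrays.

```python
import numpy as np

def rigid_transform(src, dst):
    """Least-squares rigid transform (R, t) mapping src -> dst (Kabsch method)."""
    cs, cd = src.mean(axis=0), dst.mean(axis=0)
    H = (src - cs).T @ (dst - cd)
    U, _, Vt = np.linalg.svd(H)
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:        # avoid a reflection
        Vt[-1] *= -1
        R = Vt.T @ U.T
    return R, cd - R @ cs

def ransac_align(src, dst, iters=500, thresh=0.03, rng=None):
    """Align matched 3D feature points src[i] <-> dst[i] with RANSAC.

    No initial estimate is needed: each iteration fits a rigid transform
    to a minimal sample of 3 matches and counts inliers.
    """
    rng = rng if rng is not None else np.random.default_rng(0)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(iters):
        idx = rng.choice(len(src), size=3, replace=False)
        R, t = rigid_transform(src[idx], dst[idx])
        err = np.linalg.norm(src @ R.T + t - dst, axis=1)
        inliers = err < thresh
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # refit on all inliers of the best model for the final estimate
    return rigid_transform(src[best_inliers], dst[best_inliers]), best_inliers
```

Note the failure mode this makes concrete: with too few correct matches, no minimal sample yields a consensus set, which is exactly the RANSAC failure case shown next.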

7

RANSAC Failure Case

8

Alignment (ICP)

Does not need visual features or color
Requires sufficient geometry and initial estimate
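A minimal point-to-point ICP loop makes the requirement for an initial estimate concrete: matching is by nearest neighbor, so a poor starting pose gives wrong correspondences. (The paper's dense term is point-to-plane; point-to-point is used here only for brevity, and all names and parameters are illustrative.)

```python
import numpy as np
from scipy.spatial import cKDTree

def icp(src, dst, R0=np.eye(3), t0=np.zeros(3), iters=30):
    """Basic point-to-point ICP: refine an initial guess (R0, t0) mapping src
    toward dst by alternating nearest-neighbor matching and Kabsch fits."""
    tree = cKDTree(dst)
    R, t = R0.copy(), t0.copy()
    for _ in range(iters):
        moved = src @ R.T + t
        _, nn = tree.query(moved)          # closest dst point for each src point
        matched = dst[nn]
        # rigid fit between the current correspondences
        cs, cd = src.mean(axis=0), matched.mean(axis=0)
        H = (src - cs).T @ (matched - cd)
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:           # avoid a reflection
            Vt[-1] *= -1
            R = Vt.T @ U.T
        t = cd - R @ cs
    return R, t
```

When the initial pose is close, the nearest-neighbor matches are correct and the loop converges quickly; when it is far off (or the geometry is degenerate, e.g. a flat wall), it locks onto wrong correspondences, which is the ICP failure case shown next.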

9

ICP Failure Case

10

Joint Optimization (RGBD-ICP)

Fixed feature matches (from the RANSAC stage)
Dense point-to-plane term (ICP)
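The joint objective combines the two terms as a weighted sum. The form below is a sketch in notation of our own choosing (A_f the fixed feature associations, A_d the dense associations, n the target surface normal, alpha the mixing weight that appears in the experiments); consult the paper for the exact formulation.

```latex
T^{*} = \operatorname*{argmin}_{T}\;
  \alpha \, \frac{1}{|A_f|} \sum_{i \in A_f}
      \bigl\| T(f_i^{s}) - f_i^{t} \bigr\|^{2}
  \;+\; (1-\alpha) \, \frac{1}{|A_d|} \sum_{j \in A_d}
      \bigl| \bigl( T(p_j^{s}) - p_j^{t} \bigr) \cdot n_j^{t} \bigr|^{2}
```

With alpha = 1 this reduces to the feature-only (RANSAC) alignment, with alpha = 0 to pure point-to-plane ICP; intermediate values blend the two, matching the alpha sweep in Experiment 1.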

11

Loop Closure

Sequential alignments accumulate error
Revisiting a previous location results in an inconsistent map

12

Loop Closure Detection

Detect by running RANSAC against previous frames
Pre-filter options (for efficiency):
- Only a subset of frames (keyframes)
  - Every nth frame
  - New keyframe when RANSAC fails directly against previous keyframe
- Only keyframes with similar estimated 3D pose
- Place recognition using vocabulary tree
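The pose-similarity pre-filter can be sketched as below: before running the expensive RANSAC check, discard keyframes whose estimated pose is far from the current one. The function name and both thresholds are invented for this example.

```python
import numpy as np

def candidate_keyframes(keyframes, pose, max_dist=3.0, max_angle=np.pi / 3):
    """Keep only keyframes whose estimated 3D pose is near the current one.

    keyframes: list of (R, t) world-from-camera pose estimates
    pose:      current (R, t) estimate
    Returns indices of keyframes worth testing with RANSAC.
    """
    R, t = pose
    out = []
    for k, (Rk, tk) in enumerate(keyframes):
        if np.linalg.norm(t - tk) > max_dist:
            continue
        # relative rotation angle recovered from the trace of R^T Rk
        cos_a = (np.trace(R.T @ Rk) - 1.0) / 2.0
        if np.arccos(np.clip(cos_a, -1.0, 1.0)) > max_angle:
            continue
        out.append(k)
    return out
```

This filter only works once pose estimates are roughly right, which is why it is listed as one option among several (every nth frame, vocabulary-tree place recognition) rather than the sole strategy.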

13

Loop Closure Correction

TORO [Grisetti 2007]:
- Constraints between camera locations in pose graph
- Maximum likelihood global camera poses
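TORO itself optimizes the pose graph with stochastic gradient descent on a tree parameterization; the sketch below instead uses a plain least-squares solve on a translation-only 2D graph, purely to illustrate the idea of constraints between camera locations being reconciled into globally consistent poses.

```python
import numpy as np
from scipy.optimize import least_squares

def optimize_pose_graph(n, edges):
    """Toy translation-only 2D pose graph.

    n:     number of poses x_0 .. x_{n-1} (x_0 is anchored at the origin)
    edges: list of (i, j, z) constraints meaning x_j - x_i should equal z
           (sequential odometry edges plus loop-closure edges)
    """
    def residuals(flat):
        x = np.vstack([np.zeros(2), flat.reshape(-1, 2)])   # prepend anchor
        return np.concatenate([x[j] - x[i] - z for i, j, z in edges])

    sol = least_squares(residuals, np.zeros(2 * (n - 1)))
    return np.vstack([np.zeros(2), sol.x.reshape(-1, 2)])
```

When odometry and loop-closure constraints disagree, the solver spreads the accumulated error over the whole trajectory instead of leaving it all at the revisited location, removing the map inconsistency described above.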

14

16

Map Representation: Surfels

Surface Elements [Pfister 2000, Weise 2009, Krainin 2010]

Circular surface patches
Accumulate color / orientation / size information
Incremental, independent updates
Incorporate occlusion reasoning
750 million points reduced to 9 million surfels
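A minimal surfel record shows how measurements accumulate incrementally and independently. This is a simplified sketch, not the cited implementations: the running-mean update and the keep-the-finest-radius rule are illustrative choices, and occlusion reasoning is omitted.

```python
import numpy as np

class Surfel:
    """Circular surface patch: position, normal, color, radius, and a
    confidence weight; each new measurement is folded into a running mean."""

    def __init__(self, pos, normal, color, radius):
        self.pos = np.asarray(pos, dtype=float)
        self.normal = np.asarray(normal, dtype=float)
        self.color = np.asarray(color, dtype=float)
        self.radius = float(radius)
        self.weight = 1.0

    def update(self, pos, normal, color, radius):
        """Fold one new observation into this surfel (independent of others)."""
        w = self.weight
        self.pos = (w * self.pos + pos) / (w + 1.0)
        n = w * self.normal + normal
        self.normal = n / np.linalg.norm(n)               # keep unit length
        self.color = (w * self.color + color) / (w + 1.0)
        self.radius = min(self.radius, radius)            # keep finest observed size
        self.weight = w + 1.0
```

Because each surfel summarizes many raw measurements in a handful of numbers, the map shrinks dramatically (750 million points to 9 million surfels in the slide above).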

17

18

Experiment 1: RGBD-ICP

16 locations marked in environment (~7 m apart)
Camera on tripod carried between and placed at markers
Error between ground-truth distance and the accumulated distance from frame-to-frame alignment

alpha weight   | 0.0 (ICP) | 0.2  | 0.5  | 0.8  | 1.0 (RANSAC)
Mean error (m) | 0.22      | 0.19 | 0.16 | 0.18 | 1.29

19

Experiment 2: Robot Arm

Camera held in WAM arm for ground truth pose

Method               | Mean Translational Error (m) | Mean Angular Error (deg)
RANSAC               | 0.168                        | 4.5
ICP                  | 0.144                        | 3.9
RGBD-ICP (alpha=0.5) | 0.123                        | 3.3
RGBD-ICP + TORO      | 0.069                        | 2.8

20

Map Accuracy: Architectural Drawing

21

Map Accuracy: 2D Laser Map

22

Conclusion

Kinect-style depth cameras have recently become available as consumer products
RGB-D Mapping can generate rich 3D maps using these cameras
RGBD-ICP combines visual and shape information for robust frame-to-frame alignment
Global consistency achieved via loop closure detection and optimization (RANSAC, TORO, SBA)
Surfels provide a compact map representation

23

Open Questions

Which are the best features to use?
How to find more loop closure constraints between frames?
What is the right representation (point clouds, surfels, meshes, volumetric, geometric primitives, objects)?
How to generate increasingly photorealistic maps?
Autonomous exploration for map completeness?
Can we use these rich 3D maps for semantic mapping?

24

Thank You!