Direkt zum Inhalt springen
Computer Vision Group
TUM School of Computation, Information and Technology
Technical University of Munich

Technical University of Munich

Menu

Links

Informatik IX
Computer Vision Group

Boltzmannstrasse 3
85748 Garching info@vision.in.tum.de

Follow us on:

News

04.03.2024

We have twelve papers accepted to CVPR 2024. Check our publication page for more details.

18.07.2023

We have four papers accepted to ICCV 2023. Check out our publication page for more details.

02.03.2023

CVPR 2023

We have six papers accepted to CVPR 2023. Check out our publication page for more details.

15.10.2022

NeurIPS 2022

We have two papers accepted to NeurIPS 2022. Check out our publication page for more details.

15.10.2022

WACV 2023

We have two papers accepted at WACV 2023. Check out our publication page for more details.

More


Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
research:vslam:dso [2018/01/06 23:27]
Rui Wang
research:vslam:dso [2018/08/24 13:41]
Nan Yang
Line 40: Line 40:
  
 Note that as for LSD-SLAM, we use a dual-licensing model; Please contact [[members:engelj]] or [[members:cremers|Prof. Daniel Cremers]] for details on commercial licensing. Note that as for LSD-SLAM, we use a dual-licensing model; Please contact [[members:engelj]] or [[members:cremers|Prof. Daniel Cremers]] for details on commercial licensing.
- 
 <html><br><br></h1></html> <html><br><br></h1></html>
  
 +===== Extensions =====
 +<html><div style="color: #333; font-size: 1.666em; font-weight: bold; line-height: 1em">Stereo DSO: <a href="https://vision.in.tum.de/research/vslam/stereo-dso">Link</a></div></html>
 +<html><div style="color: #333; font-size: 1.666em; font-weight: bold; line-height: 1em">Visual-Inertial DSO: <a href="https://vision.in.tum.de/research/vslam/vi-dso">Link</a></div></html>
 +<html><div style="color: #333; font-size: 1.666em; font-weight: bold; line-height: 1em">DVSO: <a href="https://vision.in.tum.de/research/vslam/dvso">Link</a></div></html>
  
-====== Stereo DSO: Large-Scale Direct Sparse Visual Odometry with Stereo Cameras ====== +<html><br></h1></html>
-**Contact:** [[members:wangr]], [[members:cremers|Prof. Daniel Cremers]] +
- +
-<html><center><iframe width="640" height="360" src="//www.youtube.com/embed/A53vJO8eygw" frameborder="0" allowfullscreen></iframe></center></html> +
- +
-<html><center><iframe width="640" height="360" src="https://www.youtube.com/embed/BxTLhubqEKg" frameborder="0" allowfullscreen></iframe></center></html> +
- +
-===== Abstract ===== +
-** Stereo DSO ** is a novel method for highly accurate real-time visual odometry estimation of large-scale environments from stereo cameras. It jointly optimizes for all the model parameters within the active window, including the intrinsic/extrinsic camera parameters of all keyframes and the depth values of all selected pixels. In particular, it integrates constraints from static stereo into the bundle adjustment pipeline of temporal multi-view stereo. Real-time optimization is realized by sampling pixels uniformly from image regions with sufficient intensity gradient. Fixed-baseline stereo resolves scale drift. It also reduces the sensitivities to large optical flow and to rolling shutter effect which are known shortcomings of direct image alignment methods. Quantitative evaluation demonstrates that the proposed Stereo DSO outperforms existing state-of-the-art visual odometry methods both in terms of tracking accuracy and robustness. Moreover, our method delivers a more precise metric 3D reconstruction than previous dense/semi-dense direct approaches while providing a higher reconstruction density than feature-based methods. +
- +
-===== Results ===== +
-For this work we use the [[http://www.cvlibs.net/datasets/kitti/eval_odometry.php | KITTI Visual Odometry Benchmark]] and the Frankfurt sequence of the [[https://www.cityscapes-dataset.com/ | Cityscapes Dataset]] for evaluations. The full evaluation results can be found in the supplementary material of our ICCV 2017 paper. Below we show some representative results. +
- +
-** KITTI Visual Odometry Benchmark ** +
- +
-The following 4 figures show the average translational and rotational errors with respect to driving intervals (first row) and driving speed (second row) on the KITTI VO testing set. We compare our method with the current state-of-the-art direct and feature-based methods, namely the Stereo LSD-SLAM and ORB-SLAM2. Note that both of the compared methods are SLAM systems with loop closure based on pose graph optimization (ORB-SLAM2 also with global bundle adjustment), while ours is pure visual odometry.  +
- +
-{{:research:vslam:dso:tl.png?350&nolink|}} +
-{{:research:vslam:dso:rl.png?350&nolink|}} +
- +
-{{:research:vslam:dso:ts.png?350&nolink|}} +
-{{:research:vslam:dso:rs.png?350&nolink|}} +
- +
-As qualitative results we run our method on all the sequences from the training set and compare the estimated camera trajectories to the provided ground truth. Following are the results on some example sequences. +
- +
-{{:research:vslam:dso:00.png?350&nolink|}} +
-{{:research:vslam:dso:00_traj_stereo.png?350&nolink|}}  +
- +
-{{:research:vslam:dso:02.png?350&nolink|}} +
-{{:research:vslam:dso:02_traj_stereo.png?350&nolink|}}  +
- +
- +
-**Update July 2017: ** After the ICCV 2017 deadline, we extended our method to a SLAM system with additional components for map maintenance, loop detection and loop closure. Our performance on KITTI is further boosted a little, as shown with black plot below. A demonstration video is shown above. +
-{{:research:vslam:dso:slam-trl.png?700&nolink|}}  +
- +
- +
- +
- +
-** Frankfurt Sequence of Cityscapes** +
- +
-To verify that our method can work with industrial level cameras (high dynamic range, rolling shutter with high pixel read-out speed), we evaluate our method on the Frankfurt sequence from the Cityscapes dataset. We split the sequence to several smaller segments, each with a comparable scale to those sequences from KITTI. The estimated camera trajectories with their alignments to the GPS trajectory are shown below (blue: estimates, red: GPS). Note that the provide GPS coordinates are not accurate. +
- +
-{{:research:vslam:dso:frankfurt_large.png?700&nolink|}}  +
- +
-Some qualitative results on the 3D reconstruction are shown below. +
- +
-{{:research:vslam:dso:cs03.png?350&nolink|}} +
-{{:research:vslam:dso:cs06.png?350&nolink|}}  +
- +
-{{:research:vslam:dso:cs09.png?350&nolink|}} +
-{{:research:vslam:dso:cs11.png?350&nolink|}}  +
- +
-<html><h2 class="sectionedit1">Open-Source Code</h2></html> +
-Under discussion. +
- +
-<html><br><br></h1></html>+
  
 ==== Publications ==== ==== Publications ====

Rechte Seite

Informatik IX
Computer Vision Group

Boltzmannstrasse 3
85748 Garching info@vision.in.tum.de

Follow us on:

News

04.03.2024

We have twelve papers accepted to CVPR 2024. Check our publication page for more details.

18.07.2023

We have four papers accepted to ICCV 2023. Check out our publication page for more details.

02.03.2023

CVPR 2023

We have six papers accepted to CVPR 2023. Check out our publication page for more details.

15.10.2022

NeurIPS 2022

We have two papers accepted to NeurIPS 2022. Check out our publication page for more details.

15.10.2022

WACV 2023

We have two papers accepted at WACV 2023. Check out our publication page for more details.

More