Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
research:vslam:dso [2018/01/06 23:24] Rui Wang |
research:vslam:dso [2018/08/24 13:41] Nan Yang |
||
---|---|---|---|
Line 10: | Line 10: | ||
< | < | ||
- | < | + | ===== Abstract |
**DSO** is a novel //direct// and //sparse// formulation for Visual Odometry. | **DSO** is a novel //direct// and //sparse// formulation for Visual Odometry. | ||
It combines a fully direct probabilistic model (minimizing a photometric error) with | It combines a fully direct probabilistic model (minimizing a photometric error) with | ||
Line 20: | Line 20: | ||
We thoroughly evaluate our method on three different datasets comprising several hours of video. The experiments show that the presented approach significantly outperforms state-of-the-art direct and indirect methods in a variety of real-world settings, both in terms of tracking accuracy and robustness. | We thoroughly evaluate our method on three different datasets comprising several hours of video. The experiments show that the presented approach significantly outperforms state-of-the-art direct and indirect methods in a variety of real-world settings, both in terms of tracking accuracy and robustness. | ||
- | < | + | ===== Datatset |
Please see [[: | Please see [[: | ||
It contains over 2h of video and respective evaluation / benchmarking metrics / tools. | It contains over 2h of video and respective evaluation / benchmarking metrics / tools. | ||
- | < | + | ===== Supplementary Material |
Supplementary material with all ORB-SLAM and DSO results presented in the paper can be downloaded from here: [[http:// | Supplementary material with all ORB-SLAM and DSO results presented in the paper can be downloaded from here: [[http:// | ||
in the paper from the above archive, which can be downloaded here: [[http:// | in the paper from the above archive, which can be downloaded here: [[http:// | ||
Line 30: | Line 30: | ||
14.10.2016.: | 14.10.2016.: | ||
- | < | + | ===== Open-Source Code ===== |
The full source code is available on Github under GPLv3: | The full source code is available on Github under GPLv3: | ||
[[https:// | [[https:// | ||
Line 40: | Line 40: | ||
Note that as for LSD-SLAM, we use a dual-licensing model; Please contact [[members: | Note that as for LSD-SLAM, we use a dual-licensing model; Please contact [[members: | ||
- | |||
< | < | ||
- | < | + | ====== Extensions ====== |
+ | < | ||
+ | < | ||
+ | < | ||
- | **Contact: | + | < |
- | + | ||
- | <html>< | + | |
- | + | ||
- | < | + | |
- | + | ||
- | < | + | |
- | ** Stereo DSO ** is a novel method for highly accurate real-time visual odometry estimation of large-scale environments from stereo cameras. It jointly optimizes for all the model parameters within the active window, including the intrinsic/ | + | |
- | + | ||
- | < | + | |
- | For this work we use the [[http:// | + | |
- | + | ||
- | ** KITTI Visual Odometry Benchmark ** | + | |
- | + | ||
- | The following 4 figures show the average translational and rotational errors with respect to driving intervals (first row) and driving speed (second row) on the KITTI VO testing set. We compare our method with the current state-of-the-art direct and feature-based methods, namely the Stereo LSD-SLAM and ORB-SLAM2. Note that both of the compared methods are SLAM systems with loop closure based on pose graph optimization (ORB-SLAM2 also with global bundle adjustment), | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | As qualitative results we run our method on all the sequences from the training set and compare the estimated camera trajectories to the provided ground truth. Following are the results on some example sequences. | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | + | ||
- | **Update July 2017: ** After the ICCV 2017 deadline, we extended our method to a SLAM system with additional components for map maintenance, | + | |
- | {{: | + | |
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | ** Frankfurt Sequence of Cityscapes** | + | |
- | + | ||
- | To verify that our method can work with industrial level cameras (high dynamic range, rolling shutter with high pixel read-out speed), we evaluate our method on the Frankfurt sequence from the Cityscapes dataset. We split the sequence to several smaller segments, each with a comparable scale to those sequences from KITTI. The estimated camera trajectories with their alignments to the GPS trajectory are shown below (blue: estimates, red: GPS). Note that the provide GPS coordinates are not accurate. | + | |
- | + | ||
- | {{: | + | |
- | + | ||
- | Some qualitative results on the 3D reconstruction are shown below. | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | {{: | + | |
- | {{: | + | |
- | + | ||
- | < | + | |
- | Under discussion. | + | |
- | + | ||
- | < | + | |
==== Publications ==== | ==== Publications ==== |