logo
banner

Journals & Publications

Publications Papers

Papers

Learning Stratified 3D Reconstruction
Mar 19, 2018Author:
PrintText Size A A

Title: Learning Stratified 3D Reconstruction

 Authors: Dong, QL; Shu, M; Cui, HN; Xu, HR; Hu, ZY

 Author Full Names: Dong, Qiulei; Shu, Mao; Cui, Hainan; Xu, Huarong; Hu, Zhanyi

 Source: SCIENCE CHINA-INFORMATION SCIENCES, 61 (2):10.1007/s11432-017-9234-7 FEB 2018

 Language: English

 Abstract: Stratified 3D reconstruction, or a layer-by-layer 3D reconstruction upgraded from projective to affine, then to the final metric reconstruction, is a well-known 3D reconstruction method in computer vision. It is also a key supporting technology for various well-known applications, such as streetview, smart3D, oblique photogrammetry. Generally speaking, the existing computer vision methods in the literature can be roughly classified into either the geometry-based approaches for spatial vision or the learning-based approaches for object vision. Although deep learning has demonstrated tremendous success in object vision in recent years, learning 3D scene reconstruction from multiple images is still rare, even not existent, except for those on depth learning from single images. This study is to explore the feasibility of learning the stratified 3D reconstruction from putative point correspondences across images, and to assess whether it could also be as robust to matching outliers as the traditional geometry-based methods do. In this study, a special parsimonious neural network is designed for the learning. Our results show that it is indeed possible to learn a stratified 3D reconstruction from noisy image point correspondences, and the learnt reconstruction results appear satisfactory although they are still not on a par with the state-of-the-arts in the structure-from-motion community due to largely its lack of an explicit robust outlier detector such as random sample consensus (RANSAC). To the best of our knowledge, our study is the first attempt in the literature to learn 3D scene reconstruction from multiple images. Our results also show that how to implicitly or explicitly integrate an outlier detector in learning methods is a key problem to solve in order to learn comparable 3D scene structures to those by the current geometry-based state-of-the-arts. Otherwise any significant advancement of learning 3D structures from multiple images seems difficult, if not impossible. Besides, we even speculate that deep learning might be, in nature, not suitable for learning 3D structure from multiple images, or more generally, for solving spatial vision problems.

 ISSN: 1674-733X

 eISSN: 1869-1919

 Article Number: 023101

 IDS Number: FU1UO

 Unique ID: WOS:000423635200003

*Click Here to View Full Record