3D Data Processing
Structure from Motion
Alberto Pretto
SfM: Problem Statement
SfM is the process of reconstructing the 3D
structure of a scene from its projections into a
series of images taken from different viewpoints.
Picture from Sameer Agarwal et al.
Structure from Motion
● Input: a set of images that frame the same scene
from different points of view. Each image should
have some field of view overlap with other
images
– Images can be totally unordered and taken with different cameras
– Camera calibration parameters may not be available
● Output: camera positions, camera calibrations
(optional) and sparse 3D scene structure (i.e., a
set of 3D points)
Incremental SfM Typical Pipeline
● Incremental SfM is a sequential processing pipeline
with an iterative reconstruction component
● In this discussion, we report some implementation
details from:
J. L. Schönberger and J.-M. Frahm, "Structure-from-Motion Revisited,"
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
Correspondence Search
● For each image, extract invariant features with associated descriptors (e.g., SIFT, SURF, ORB, ...)
● Match features between every image pair (see the sketch below)
– This is just a naive approach, since exhaustive matching has computational complexity O(N_I² · N_F²), i.e., quadratic in both the number of images N_I and the number of features per image N_F
– Many efficient approaches have been introduced to improve matching
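A minimal correspondence-search sketch in Python, assuming OpenCV is available and that two example files img0.jpg and img1.jpg exist; it extracts SIFT features and matches a single image pair with a ratio test (a full pipeline would repeat this for every pair, or for a smarter subset).

import cv2

img0 = cv2.imread("img0.jpg", cv2.IMREAD_GRAYSCALE)
img1 = cv2.imread("img1.jpg", cv2.IMREAD_GRAYSCALE)

# Extract invariant features with associated descriptors (SIFT here).
sift = cv2.SIFT_create()
kp0, des0 = sift.detectAndCompute(img0, None)
kp1, des1 = sift.detectAndCompute(img1, None)

# Brute-force matching with Lowe's ratio test to discard ambiguous matches.
matcher = cv2.BFMatcher(cv2.NORM_L2)
knn = matcher.knnMatch(des0, des1, k=2)
matches = [m for m, n in knn if m.distance < 0.8 * n.distance]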
Geometric Verification
● Matching is based solely on appearance: there is no guarantee that matched features refer to the same 3D point in the scene
● Use RANSAC + projective/epipolar geometry to verify the matches
Geometric Verification
● Try all models:
– Essential matrix E (use an initial guess of the calibration)
– Fundamental matrix F (uncalibrated settings)
– Homography matrix H (planar scene or pure rotation)
● For each model, use RANSAC to find the best estimate:
– Sample minimal sets of matches (e.g., 5 for E, 8 for F, or 4 for H) and keep the hypothesis supported by the maximum number of inliers
● Select the best model among E, F, and H based on certain criteria: its inliers define the inlier matches between the image pair (see the sketch below)
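A sketch of geometric verification for one image pair, assuming NumPy arrays pts0, pts1 of matched pixel coordinates (shape N×2) and, when available, an approximate intrinsic matrix K; the model-selection rule used here (most inliers wins) is a simplification of the criteria used by real pipelines such as COLMAP.

import cv2
import numpy as np

def verify_pair(pts0, pts1, K=None):
    """Estimate E/F/H with RANSAC and return the best-supported model."""
    candidates = {}
    if K is not None:
        E, mask = cv2.findEssentialMat(pts0, pts1, K, cv2.RANSAC, 0.999, 1.0)
        candidates["E"] = (E, mask)
    F, mask = cv2.findFundamentalMat(pts0, pts1, cv2.FM_RANSAC, 1.0, 0.999)
    candidates["F"] = (F, mask)
    H, mask = cv2.findHomography(pts0, pts1, cv2.RANSAC, 1.0)
    candidates["H"] = (H, mask)
    # Toy criterion: keep the model with the largest inlier count.
    name, (model, mask) = max(candidates.items(),
                              key=lambda kv: np.count_nonzero(kv[1][1]))
    return name, model, mask  # the mask defines the inlier matches of the pair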
Correspondences
[Figure: verified feature correspondences between example image pairs]
Observations
[Figure: feature observations marked in the example images]
Scene Graph
After geometric verification, we can build a scene graph with images as nodes and verified pairs of images as edges (a minimal data structure sketch follows below). It provides:
– Correspondences between a pair of images
– Number of correspondences per image
– Number of observations in an image
– The correspondences of an image observation with all other images
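A minimal scene-graph sketch; the class name and the (feature-index pair) match format are assumptions for illustration, not the structure used by any particular framework.

from collections import defaultdict

class SceneGraph:
    """Images as nodes, geometrically verified image pairs as edges."""
    def __init__(self):
        self.edges = {}                        # (i, j) -> list of inlier matches
        self.obs_per_image = defaultdict(set)  # image id -> set of observed feature ids

    def add_verified_pair(self, i, j, inlier_matches):
        # inlier_matches: list of (feature idx in image i, feature idx in image j)
        self.edges[(i, j)] = inlier_matches
        for fi, fj in inlier_matches:
            self.obs_per_image[i].add(fi)
            self.obs_per_image[j].add(fj)

    def num_correspondences(self, i):
        return sum(len(m) for (a, b), m in self.edges.items() if i in (a, b))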
Incremental Mapping
● Iterative, "trial and error" process
● Reconstruction may never recover from a bad initialization or a bad image registration
Initialization
● Find a good initial pair ("seed"):
– Sort images such that images with more correspondences are preferred, i.e., they appear at the front of the list
– Collect images that are connected to the first seed image and have not been registered before
– If not done before, perform geometric verification
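A seed-selection sketch built on the SceneGraph sketch above, assuming a set of already registered image ids; it simply prefers the image with the most verified correspondences.

def select_seed_image(graph, registered):
    """Return the unregistered image with the most verified correspondences."""
    candidates = [i for i in graph.obs_per_image if i not in registered]
    if not candidates:
        return None
    return max(candidates, key=graph.num_correspondences)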
Initialization
● Try to recover a rigid body transformation from the selected model (E, F, or H)
– In the uncalibrated case, this most likely leads to an ill-defined reconstruction
● If this fails, restart the initialization process
● If the motion estimated from H is a pure rotation, restart the initialization process
– No triangulation can be obtained from a pure rotation
Pose Recovery
When recovering the pose from the essential or fundamental matrix we get 4 possible solutions: use the cheirality constraint (triangulated points must lie in front of both cameras) to select one.
[Picture from Marc Pollefeys, Viktor Larsson]
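A short sketch of pose recovery with OpenCV, assuming the essential matrix E, the inlier pixel coordinates pts0, pts1, and the intrinsics K; cv2.recoverPose enumerates the four (R, t) decompositions of E and keeps the one with the most points passing the cheirality check.

import cv2

def pose_from_essential(E, pts0, pts1, K):
    # Returns R, t of the second camera w.r.t. the first (t is up to scale).
    n_good, R, t, mask = cv2.recoverPose(E, pts0, pts1, K)
    return R, t, mask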
Initial Triangulation
Using point correspondences and the initial R,t, triangulate points to obtain their 3D coordinates.
Triangulation
For a non-rectilinear stereo rig there is no single solution due to noise: the back-projected rays through x and x′ do not intersect exactly at a 3D point X.
[Some slides from 16-385 Computer Vision, www.cs.cmu.edu]
Triangulation
We have R,t and (an estimate of) the calibration parameters: given a 3D point X, we know how to project it into both views!
Triangulation
To use our point coordinates x, we need to normalize ("h-normalize") the homogeneous points: x = (u, v, 1)ᵀ.
The observed point x and the projection P X have the same direction but differ by a scale factor: λ x = P X, with P = K [R | t].
The cross product of two vectors with the same direction is zero:
x × (P X) = 0
Triangulation
Writing the cross product explicitly, with p1, p2, p3 the rows of P and x = (u, v, 1)ᵀ:
x × (P X) = ( v·(p3·X) − p2·X,  p1·X − u·(p3·X),  u·(p2·X) − v·(p1·X) )ᵀ = 0
Triangulation
The third line is a linear combination of the first and second lines, so each view contributes only two independent equations.
Triangulation
Use both points:
A X = 0,  with  A = [ u·p3 − p1 ;  v·p3 − p2 ;  u′·p3′ − p1′ ;  v′·p3′ − p2′ ]
where p′ are the rows of the second camera's projection matrix P′.
This is a homogeneous linear system! Use SVD and take the last (h-normalized) column of V (see the sketch below).
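A NumPy sketch of the two-view DLT triangulation described above, assuming 3×4 projection matrices P0 and P1 and pixel coordinates x0 = (u, v), x1 = (u′, v′).

import numpy as np

def triangulate_point(P0, P1, x0, x1):
    # Each view contributes the two independent rows of x × (P X) = 0.
    A = np.stack([
        x0[0] * P0[2] - P0[0],
        x0[1] * P0[2] - P0[1],
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]               # last column of V (= last row of V^T)
    return X[:3] / X[3]      # h-normalize to get the 3D coordinates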
PnP-based Image Registration
A newly registered image must observe existing scene points.
● Use Perspective-n-Point (PnP) on feature correspondences to points already triangulated in registered images
– Estimate both R′,t′ and (optionally) the intrinsic parameters
– Use a modified version of PnP that exploits RANSAC and a minimal pose solver
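A registration sketch using OpenCV's RANSAC-based PnP, assuming float arrays pts3d (N×3, already triangulated points) and pts2d (N×2, their observations in the new image) plus intrinsics K; the 4-pixel reprojection threshold is an arbitrary example value.

import cv2

def register_image(pts3d, pts2d, K):
    ok, rvec, tvec, inliers = cv2.solvePnPRansac(
        pts3d, pts2d, K, None, reprojectionError=4.0)
    R, _ = cv2.Rodrigues(rvec)   # axis-angle -> rotation matrix
    return ok, R, tvec, inliers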
3D Points Registration
The newly registered image may also increase scene coverage by extending the set of points X through triangulation.
Next Best View Selection
Several criteria to select a not-yet-registered image, among others:
– It "sees" the maximum number of already registered points, or;
– Use a visibility score (e.g., sum #occupied cells × res over a number of resolutions, with res = 2, 4, 8, ...)
[J. L. Schönberger and J.-M. Frahm]
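A toy visibility-score sketch, assuming pts2d holds the pixel locations of already reconstructed points visible in a candidate image of size w × h; the grid resolutions and the weighting follow the idea above, not the exact scheme of the paper.

import numpy as np

def visibility_score(pts2d, w, h, resolutions=(2, 4, 8)):
    score = 0
    for res in resolutions:
        # Count the occupied cells of a res x res grid over the image.
        cells = {(min(int(x * res / w), res - 1), min(int(y * res / h), res - 1))
                 for x, y in pts2d}
        score += len(cells) * res   # finer resolutions get a larger weight
    return score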
Bundle Adjustment
● We may iterate the PnP and triangulation steps to register new images and new points but...
● ... they are very correlated procedures: uncertainties in the camera pose propagate to the triangulated points and vice versa
● Bundle Adjustment (BA): joint non-linear refinement of camera parameters and point parameters
Bundle Adjustment
[Figure: cameras with poses R1,t1, ..., R4,t4 observing the scene points; the difference between a measured image point and the reprojection of the corresponding 3D point is the reprojection error]
Bundle Adjustment
BA goal: find a suitable parameter set (transformations Ti = Ri,ti, calibration parameters θi, 3D point positions Pj, i = 0, ..., Nc − 1, j = 0, ..., Np − 1) that minimizes the sum of squares of reprojection errors (one for each observation of each camera):
E(T, θ, P) = Σi Σj ‖ π(Ti, θi, Pj) − xij ‖²
where π(·) projects point Pj into camera i and xij is the corresponding observation.
Solve with LM (Levenberg–Marquardt).
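A minimal BA sketch with SciPy, assuming each camera is packed as axis-angle + translation (6 values) followed by the 3D points (3 values each), and that cam_idx[k], pt_idx[k], obs[k] describe the k-th observation; a real implementation would also exploit the Jacobian sparsity (see the Schur complement trick below).

import cv2
import numpy as np
from scipy.optimize import least_squares

def reprojection_residuals(params, n_cams, n_pts, K, cam_idx, pt_idx, obs):
    cams = params[:6 * n_cams].reshape(n_cams, 6)   # [rvec | tvec] per camera
    pts = params[6 * n_cams:].reshape(n_pts, 3)
    residuals = []
    for c, p, uv in zip(cam_idx, pt_idx, obs):
        proj, _ = cv2.projectPoints(pts[p][None], cams[c, :3], cams[c, 3:], K, None)
        residuals.append(proj.ravel() - uv)
    return np.concatenate(residuals)

# res = least_squares(reprojection_residuals, x0, method="lm",
#                     args=(n_cams, n_pts, K, cam_idx, pt_idx, obs))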
BA Parametrization
● Camera poses (# = 6·(Nc − 1))
– Set the coordinate system of one of the two seed cameras as the reference world frame
– Use a suitable rotation representation (e.g., axis-angle; 6 DoF for each camera)
● Calibration parameters (if not given) (# = 3·Nc)
– Fix the principal point at the image center (principal point calibration is an ill-posed problem)
– Optimize for each camera the focal length in pixels (e.g., one shared focal length) and some distortion parameters (e.g., two radial distortion parameters)
● Scene points (# = 3·Np)
– Simply parametrized as 3D points
BA Complexity
LM update:
δa = −(JᵀJ + λI)⁻¹ Jᵀ e
with a = [a1, ..., am]ᵀ the problem parameter vector and J the Jacobian of the stacked reprojection errors e with respect to a, e.g. with dimension:
m = 6·(Nc − 1) + 3·Nc + 3·Np
Warning: the matrix inversion requires O(m³)
Schur Complement Trick
Idea to efficiently solve BA: exploit the sparsity of J and the fact that usually Np >> Nc.
[Figure: block sparsity pattern of J — rows grouped by the observations of cameras 1, 2, 3; columns grouped into camera parameters C1, C2, C3 and point parameters X1, ..., X4; each observation row involves only one camera block and one point block]
Schur Complement Trick
Rewrite the LM step as H δa = −Jᵀe and define H = JᵀJ + λI. H has the block structure:
H = [ C   E
      Eᵀ  P ]
C depends only on the cameras, P depends only on the points.
Schur Complement Trick
Partitioning the step update between cameras and structure (points):
[ C   E ] [ δa_c ]   [ b_c ]
[ Eᵀ  P ] [ δa_p ] = [ b_p ]
δa_c: camera parameters, δa_p: structure (3D points)
Schur Complement Trick
Multiply both sides by
[ I  −E P⁻¹ ]
[ 0     I   ]
to decouple the camera update:
(C − E P⁻¹ Eᵀ) δa_c = b_c − E P⁻¹ b_p
S = C − E P⁻¹ Eᵀ is the Schur complement of P.
Schur Complement Trick
● P is a block-diagonal matrix: computing its inverse by inverting each of its 3×3 blocks is cheap!
– Solve for δa_c, then back-substitute it to obtain δa_p = P⁻¹ (b_p − Eᵀ δa_c) (see the sketch below)
● Complexity: O(Nc³)
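A NumPy sketch of the reduced-camera-system solve, assuming the block partition H = [[C, E], [Eᵀ, P]] and right-hand side b = [b_c, b_p] derived above, with P given as a list of 3×3 diagonal blocks (one per point).

import numpy as np

def schur_solve(C, E, P_blocks, b_c, b_p):
    # Invert P block by block: each block is only 3x3, so this is cheap.
    n = 3 * len(P_blocks)
    P_inv = np.zeros((n, n))
    for k, B in enumerate(P_blocks):
        P_inv[3 * k:3 * k + 3, 3 * k:3 * k + 3] = np.linalg.inv(B)
    S = C - E @ P_inv @ E.T                              # Schur complement of P
    delta_c = np.linalg.solve(S, b_c - E @ P_inv @ b_p)  # camera update, O(Nc^3)
    delta_p = P_inv @ (b_p - E.T @ delta_c)              # back-substitution
    return delta_c, delta_p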
Robust Cost Function
● Standard least squares assumes Gaussian errors
– Large errors have a high influence on the solution!
● To account for potential outliers, it is better to choose a proper loss function ρ (e.g., Cauchy, Huber, Tukey, ...)
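A small robustification sketch, assuming a Huber-style reweighting of the residuals; note that scipy.optimize.least_squares also accepts loss="huber" or loss="cauchy" directly.

import numpy as np

def huber_weight(residuals, delta=1.0):
    # IRLS-style weights: quadratic (weight 1) for small errors,
    # down-weighted for large ones, limiting the influence of outliers.
    a = np.abs(residuals)
    return np.where(a <= delta, 1.0, delta / a)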
Sparse to Dense Reconstruction
Use Multi-View Stereo (MVS): see next
lectures...
Picture from Sameer Agarwal et al.
Open Source SfM Frameworks
● COLMAP
https://colmap.github.io/
● Bundler: Structure from Motion (SfM) for
Unordered Image Collections
https://www.cs.cornell.edu/~snavely/bundler/
● Multicore Bundle Adjustment
http://grail.cs.washington.edu/projects/mcba/