THE GEOMETRY OF MULTIPLE VIEWS
Introduction:
In computer vision, analyzing multiple views of the same scene provides crucial information about the
3D structure of the scene and the relative motion between the camera and the scene.
The geometric relationship between multiple images taken from different viewpoints is a key area of
study.
Two-View Geometry:
Epipolar Geometry:
Definition:
Epipolar geometry describes the intrinsic projective geometry between two views of a scene.
It is the basis for many stereo vision algorithms and involves concepts like the epipolar plane, epipolar
lines, and epipoles.
Epipolar Plane:
The plane that contains the two camera centers and a given 3D point in the scene.
Epipolar Line:
The intersection of the epipolar plane with the image plane.
For a point in one image, its corresponding point in the other image must lie on the corresponding
epipolar line.
Epipole:
The point where the line connecting the two camera centers (the baseline) intersects the image plane; all epipolar lines in an image pass through its epipole.
Fundamental Matrix (F):
The fundamental matrix relates corresponding points in stereo images.
It encodes the epipolar geometry between two views through the constraint x'^T F x = 0, where x and x' are corresponding points (in homogeneous pixel coordinates) in the two images.
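A minimal sketch of estimating F in OpenCV, assuming pts1 and pts2 already hold matched pixel coordinates (the values below are purely illustrative); RANSAC needs at least eight correspondences:

import numpy as np
import cv2

# Assumed matched pixel coordinates, one row per correspondence (illustrative values)
pts1 = np.float32([[100, 120], [250, 90], [310, 200], [80, 310],
                   [400, 250], [150, 400], [350, 60], [220, 330]])
pts2 = np.float32([[110, 118], [262, 92], [322, 205], [88, 305],
                   [415, 248], [158, 395], [360, 64], [230, 328]])

# Estimate the fundamental matrix with RANSAC to reject outlier matches
F, inlier_mask = cv2.findFundamentalMat(pts1, pts2, cv2.FM_RANSAC, 1.0, 0.99)

# Epipolar constraint check: x'^T F x should be close to 0 for inlier matches
x = np.append(pts1[0], 1.0)    # homogeneous point in image 1
xp = np.append(pts2[0], 1.0)   # corresponding homogeneous point in image 2
print("x'^T F x =", xp @ F @ x)

# Epipolar lines in image 2 associated with the points in image 1
lines2 = cv2.computeCorrespondEpilines(pts1.reshape(-1, 1, 2), 1, F)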
Essential Matrix (E):
Definition:
The essential matrix is a special case of the fundamental matrix when the cameras are calibrated.
Computation:
The essential matrix can be computed from the fundamental matrix using the camera intrinsic parameters: E = K'^T F K, where K and K' are the intrinsic parameter matrices of the two cameras.
Decomposition:
The essential matrix can be decomposed to obtain the relative rotation and translation between the two
camera views.
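A hedged sketch of the calibrated case: E can be formed from F via E = K'^T F K, or estimated directly from matched points, and then decomposed into a relative rotation and translation. The intrinsic matrix and point correspondences below are assumed example values; cv2.recoverPose resolves the fourfold decomposition ambiguity by checking which solution places points in front of both cameras.

import numpy as np
import cv2

# Assumed intrinsics shared by both cameras (example values)
K = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])

# Assumed matched pixel coordinates (illustrative values)
pts1 = np.float32([[100, 120], [250, 90], [310, 200], [80, 310],
                   [400, 250], [150, 400], [350, 60], [220, 330]])
pts2 = np.float32([[110, 118], [262, 92], [322, 205], [88, 305],
                   [415, 248], [158, 395], [360, 64], [230, 328]])

# Option 1: from a previously estimated fundamental matrix F:  E = K.T @ F @ K
# Option 2: estimate E directly from the calibrated correspondences
E, mask = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC, prob=0.999, threshold=1.0)

# Decompose E into the relative pose; recoverPose picks the physically valid (R, t)
_, R, t, pose_mask = cv2.recoverPose(E, pts1, pts2, K)
print("Relative rotation:\n", R)
print("Translation direction (up to scale):\n", t.ravel())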
Camera Calibration:
Intrinsic Parameters:
Intrinsic parameters define the internal characteristics of the camera, such as focal length, principal
point, and skew.
Extrinsic Parameters:
Extrinsic parameters define the camera's position and orientation in the world coordinate system.
Calibration Process:
The process of determining the intrinsic and extrinsic parameters of the camera.
Typically involves taking images of a known calibration object (e.g., a checkerboard) and applying
techniques like Zhang's method to estimate the parameters.
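A condensed sketch of checkerboard calibration with OpenCV's calibrateCamera (which implements a Zhang-style method); the image directory, board dimensions, and square size are assumptions.

import glob
import numpy as np
import cv2

board_cols, board_rows = 9, 6   # assumed number of inner corners on the checkerboard
square_size = 0.025             # assumed square size in metres

# 3D coordinates of the board corners in the board's own frame (Z = 0 plane)
objp = np.zeros((board_rows * board_cols, 3), np.float32)
objp[:, :2] = np.mgrid[0:board_cols, 0:board_rows].T.reshape(-1, 2) * square_size

objpoints, imgpoints = [], []
for fname in glob.glob("calib_images/*.png"):   # assumed image directory
    gray = cv2.imread(fname, cv2.IMREAD_GRAYSCALE)
    found, corners = cv2.findChessboardCorners(gray, (board_cols, board_rows))
    if found:
        # Refine corner locations to sub-pixel accuracy
        corners = cv2.cornerSubPix(
            gray, corners, (11, 11), (-1, -1),
            (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 1e-3))
        objpoints.append(objp)
        imgpoints.append(corners)

# Intrinsics K, distortion coefficients, and per-image extrinsics (rvecs, tvecs)
rms, K, dist, rvecs, tvecs = cv2.calibrateCamera(
    objpoints, imgpoints, gray.shape[::-1], None, None)
print("Reprojection RMS error:", rms, "\nIntrinsic matrix:\n", K)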
Rectification:
Definition:
Rectification is the process of transforming stereo images so that corresponding points are aligned
horizontally.
Importance:
Rectification simplifies the stereo matching problem by reducing the search for corresponding points
to a single dimension (horizontal line).
Techniques:
Several techniques are used to rectify images, often involving homography transformations or epipolar
line correction.
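A sketch of stereo rectification in OpenCV, assuming calibrated intrinsics and a known relative pose between the cameras (all placeholder values below are assumptions): the rectifying rotations and new projection matrices are computed once and then applied to each image with a pixel remap.

import numpy as np
import cv2

# Assumed calibration results (placeholder values for illustration)
K1 = np.array([[800.0, 0, 320.0], [0, 800.0, 240.0], [0, 0, 1.0]])
dist1 = np.zeros(5)
K2 = K1.copy()
dist2 = np.zeros(5)
R = np.eye(3)                          # relative rotation between the cameras
T = np.array([[-0.1], [0.0], [0.0]])   # relative translation (10 cm horizontal baseline)
w, h = 640, 480
left_raw = np.zeros((h, w, 3), np.uint8)    # stand-ins for the captured stereo pair
right_raw = np.zeros((h, w, 3), np.uint8)

# Rectifying rotations (R1, R2) and new projection matrices (P1, P2)
R1, R2, P1, P2, Q, roi1, roi2 = cv2.stereoRectify(K1, dist1, K2, dist2, (w, h), R, T)

# Pixel remapping tables that implement the rectifying transforms
map1x, map1y = cv2.initUndistortRectifyMap(K1, dist1, R1, P1, (w, h), cv2.CV_32FC1)
map2x, map2y = cv2.initUndistortRectifyMap(K2, dist2, R2, P2, (w, h), cv2.CV_32FC1)

# After remapping, corresponding points share the same image row
left_rect = cv2.remap(left_raw, map1x, map1y, cv2.INTER_LINEAR)
right_rect = cv2.remap(right_raw, map2x, map2y, cv2.INTER_LINEAR)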
Stereo Vision:
Definition:
Stereo vision involves extracting 3D information from two or more images taken from different
viewpoints.
By analyzing the disparity between corresponding points in the images, depth information can be
recovered.
Disparity Map:
The disparity map is a representation of the difference in pixel positions between corresponding points
in the stereo images.
The disparity is inversely proportional to the depth of the points in the scene.
Depth Estimation:
Using the disparity map, the depth Z can be estimated as Z = (f · B) / d, where f is the focal length, B is the baseline (the distance between the camera centers), and d is the disparity.
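A sketch of computing a disparity map with semi-global block matching and converting it to depth via Z = f · B / d; the image files, focal length, and baseline are assumptions, and OpenCV returns disparities as fixed-point values scaled by 16.

import numpy as np
import cv2

# Assumed rectified grayscale stereo pair
left = cv2.imread("left_rect.png", cv2.IMREAD_GRAYSCALE)
right = cv2.imread("right_rect.png", cv2.IMREAD_GRAYSCALE)

# Semi-global block matching; numDisparities must be a multiple of 16
stereo = cv2.StereoSGBM_create(minDisparity=0, numDisparities=128, blockSize=5)
disparity = stereo.compute(left, right).astype(np.float32) / 16.0  # undo the x16 scaling

# Depth from disparity: Z = f * B / d (assumed focal length in pixels, baseline in metres)
f_px = 800.0
baseline_m = 0.10
valid = disparity > 0
depth = np.zeros_like(disparity)
depth[valid] = f_px * baseline_m / disparity[valid]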
Stereopsis and Reconstruction:
3D Reconstruction:
Stereopsis refers to the process of reconstructing the 3D structure of a scene from two or more 2D
images.
The depth information is recovered by triangulating the corresponding points from the different views.
Triangulation:
Triangulation is the process of determining the 3D coordinates of a point by intersecting the lines of
sight from multiple camera viewpoints.
Reconstruction Pipeline:
The general pipeline for 3D reconstruction includes:
Feature detection and matching.
Estimation of camera parameters.
Computation of the 3D coordinates using triangulation.
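A small sketch of the triangulation step with OpenCV: given the two camera projection matrices and matched image points, cv2.triangulatePoints returns homogeneous 3D coordinates that are then dehomogenised. The projection matrices and point pairs below are illustrative assumptions (the second camera is assumed translated 0.1 m along the x axis).

import numpy as np
import cv2

K = np.array([[800.0, 0, 320.0], [0, 800.0, 240.0], [0, 0, 1.0]])

# Projection matrices P = K [R | t] for the two (assumed) camera poses
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = K @ np.hstack([np.eye(3), np.array([[-0.1], [0.0], [0.0]])])

# Matched image points as 2xN arrays (x coordinates in row 0, y coordinates in row 1)
pts1 = np.array([[320.0, 480.0], [440.0, 160.0]])
pts2 = np.array([[280.0, 400.0], [440.0, 160.0]])

# Linear triangulation returns homogeneous 4xN points
X_h = cv2.triangulatePoints(P1, P2, pts1, pts2)
X = (X_h[:3] / X_h[3]).T   # dehomogenise to Nx3 Euclidean coordinates
print("Triangulated 3D points:\n", X)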
Human Stereopsis:
Biological Inspiration:
Human stereopsis is the biological equivalent of computer stereo vision, where the brain combines the
images from the two eyes to perceive depth.
The disparity between the two retinal images is used by the brain to infer the relative depth of objects.
Binocular Fusion:
Definition:
Binocular fusion is the process by which the brain combines the two slightly different images from the
eyes into a single 3D perception.
Binocular Disparity:
The small difference between the images seen by the left and right eyes is known as binocular disparity.
The brain uses this disparity to compute the depth of objects.
Using More Cameras:
Multi-view Stereo:
In some cases, using more than two cameras provides additional views that can improve the accuracy
and robustness of 3D reconstruction.
Multi-view stereo techniques extend the principles of stereo vision to multiple cameras.
Applications:
Multi-view stereo is widely used in applications like 3D modeling, virtual reality, and autonomous
navigation.
SEGMENTATION BY CLUSTERING
Segmentation:
Definition:
Segmentation is the process of partitioning an image into distinct regions, typically corresponding to
different objects or surfaces.
Goal:
The goal of segmentation is to simplify the representation of an image, making it easier to analyze and
understand.
Human Vision: Grouping and Gestalt:
Gestalt Principles:
Gestalt psychology suggests that the human visual system tends to group elements based on certain
principles, such as proximity, similarity, and continuity.
These principles are often used in computer vision to design algorithms that mimic human perception
for image segmentation.
Segmentation Techniques:
Clustering-Based Segmentation:
Clustering is a common approach to segmentation, where pixels are grouped based on their similarity
in terms of color, intensity, texture, or spatial location.
K-means Clustering:
An unsupervised algorithm that partitions the image into k clusters, where each pixel is assigned to the cluster with the nearest centroid.
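A minimal k-means segmentation sketch using OpenCV: pixels are clustered by colour and each pixel is repainted with its cluster centroid. The image path and the value of k are assumptions.

import numpy as np
import cv2

img = cv2.imread("scene.png")                     # assumed input image (BGR)
pixels = img.reshape(-1, 3).astype(np.float32)    # one row per pixel, features = colour

k = 4                                             # assumed number of clusters
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 20, 1.0)
_, labels, centers = cv2.kmeans(pixels, k, None, criteria, 10, cv2.KMEANS_RANDOM_CENTERS)

# Replace each pixel with its cluster centroid to visualise the segmentation
segmented = centers[labels.flatten()].astype(np.uint8).reshape(img.shape)
cv2.imwrite("segmented_kmeans.png", segmented)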
Mean Shift Clustering:
A non-parametric clustering technique that iteratively shifts each point toward the mean of the points in its neighborhood until it converges on a local density maximum (mode); pixels that converge to the same mode form one cluster.
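OpenCV's pyrMeanShiftFiltering gives one sketch of mean-shift-style segmentation: each pixel is driven toward a mode in the joint spatial-colour space, so regions converge to flat colours. The image path and the window radii are assumed values.

import cv2

img = cv2.imread("scene.png")   # assumed input image (8-bit BGR)

# sp = spatial window radius, sr = colour window radius (assumed values)
shifted = cv2.pyrMeanShiftFiltering(img, sp=21, sr=30)
cv2.imwrite("segmented_meanshift.png", shifted)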
Graph-Based Segmentation:
Involves representing the image as a graph, where pixels are nodes, and edges represent the similarity
between pixels.
Segmentation is performed by finding cuts in the graph that minimize the similarity (total edge weight) between different regions while keeping the similarity within each region high, as in normalized cuts.
Spectral Clustering:
Uses the eigenvectors of a similarity (affinity) matrix, or of its graph Laplacian, to embed the data in a lower-dimensional space, then applies a standard clustering algorithm such as k-means in that space.
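A hedged sketch of graph-based spectral segmentation with scikit-learn: the image is converted into a pixel-affinity graph and partitioned using the eigenvectors of its Laplacian. The small synthetic image and the number of regions are assumptions, since the affinity matrix becomes very large for full-resolution images.

import numpy as np
from sklearn.feature_extraction.image import img_to_graph
from sklearn.cluster import spectral_clustering

# Small synthetic grayscale image: a bright square on a dark background
img = np.zeros((40, 40))
img[10:30, 10:30] = 1.0
img += 0.05 * np.random.default_rng(0).standard_normal(img.shape)

# Build a pixel-adjacency graph whose edge weights reflect intensity similarity
graph = img_to_graph(img)
graph.data = np.exp(-graph.data ** 2 / (2 * 0.1 ** 2))  # Gaussian affinity on intensity differences

# Partition the graph using the eigenvectors of its Laplacian (2 assumed regions)
labels = spectral_clustering(graph, n_clusters=2, eigen_solver="arpack", random_state=0)
segmentation = labels.reshape(img.shape)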
Applications:
Shot Boundary Detection:
Segmentation is used to detect transitions between different shots in a video by identifying significant
changes in the scene content.
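One simple, hedged sketch of shot-boundary detection: compare colour histograms of consecutive frames and flag a boundary when their similarity drops sharply. The video filename and the threshold are assumptions.

import cv2

cap = cv2.VideoCapture("movie.mp4")   # assumed input video
prev_hist = None
threshold = 0.5                       # assumed correlation-drop threshold
frame_idx = 0
boundaries = []

while True:
    ok, frame = cap.read()
    if not ok:
        break
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
    hist = cv2.calcHist([hsv], [0, 1], None, [32, 32], [0, 180, 0, 256])
    cv2.normalize(hist, hist)
    if prev_hist is not None:
        # Correlation near 1 means similar frames; a sharp drop suggests a cut
        similarity = cv2.compareHist(prev_hist, hist, cv2.HISTCMP_CORREL)
        if similarity < threshold:
            boundaries.append(frame_idx)
    prev_hist = hist
    frame_idx += 1

cap.release()
print("Detected shot boundaries at frames:", boundaries)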
Background Subtraction:
A common technique in video analysis, where the goal is to separate moving objects (foreground) from
the static background.
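A brief sketch of background subtraction on a video stream with OpenCV's MOG2 mixture-of-Gaussians model; the video filename and parameter values are assumptions.

import cv2

cap = cv2.VideoCapture("traffic.mp4")   # assumed input video
subtractor = cv2.createBackgroundSubtractorMOG2(history=500, varThreshold=16, detectShadows=True)

while True:
    ok, frame = cap.read()
    if not ok:
        break
    # Foreground mask: moving pixels are white, static background is black
    fg_mask = subtractor.apply(frame)
    cv2.imshow("foreground", fg_mask)
    if cv2.waitKey(30) & 0xFF == 27:    # Esc to quit
        break

cap.release()
cv2.destroyAllWindows()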
Image Segmentation by Clustering Pixels:
Pixels are grouped into clusters based on their attributes (color, texture) to segment the image into
meaningful regions.
Segmentation by Graph-Theoretic Clustering:
This method involves constructing a graph based on pixel similarity and partitioning the graph to
segment the image.