0% found this document useful (0 votes)

48 views58 pages

Week5 Computer Vision

Uploaded by

albertadi412

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views58 pages

Week5 Computer Vision

Uploaded by

albertadi412

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 58

CSCI218: Foundations

of Artificial Intelligence
Human Vision System

2
Robot Vision System

3
Image Formation

4
Image Formation

5
Simple Image Feature

Image Color Histogram 6

Simple Image Feature

Edge

7
Simple Image Feature

Edge

8
Simple Image Feature

Texture (e.g., Gray-Level Co-Occurrence Matrix (GLCM))

- Characterise how often pairs of pixel with specific values and in a specified spatial relationship occur in an image

9
Simple Image Feature

Optical Flow: Whenever there is relative movement between the camera and one or more
objects in the scene, the resulting apparent motion in the image is called optical flow.

10
Simple Image Feature

Optical Flow: Whenever there is relative movement between the camera and one or more
objects in the scene, the resulting apparent motion in the image is called optical flow.

11
Simple Image Feature

Segmentation of natural images

12
Classifying Images

Important sources of appearance variation

13
Classifying Images

Why convolutional neural networks classify images well

14
Detecting Objects

Faster RCNN for object detection

15
The 3D World

Binocular stereopsis

16
Using Computer Vision

Understanding what people are doing

17
Using Computer Vision

Understanding what people are doing

18
Using Computer Vision

Automated image captioning

19
Using Computer Vision

Visual question-answering
20
Using Computer Vision

Reconstruction from many views

21
Using Computer Vision

Geometry from a single view

22
Using Computer Vision

Making pictures

23
Using Computer Vision

Image Transformation (Paired)

24
Using Computer Vision

Image Transformation (Unpaired)

25
Using Computer Vision

Image Transformation (Style transfer)

26
Using Computer Vision

Image Generation (by GAN)

27
Using Computer Vision

Controlling movement with vision

28
Using Computer Vision

Navigation

29
Image Analysis
§ Overview of Image Analysis
§ Collecting and Representing Image
§ Image Recognition
§ Bag-of-Visual-Words model
§ Deep Convolutional Neural Networks
Overview of Image Analysis
§ Image analysis
§ Refers to the representation, processing, and modelling of visual data to
derive useful insights
§ Suffers from the semantic gap
§ Visual data (image, video, …) is unstructured
§ Semantic gap
§ The gap between high-level concepts used by human and the low-level
features used by computer
Overview of Image Analysis
§ Image recognition (in a narrow sense)
§ Image classification
§ Object detection, localisation, tracking
§ Scene segmentation and reconstruction
§ Image search and retrieval
Overview of Image Analysis
§ Image classification

Face OCR recognition

recognition

Scene recognition Object recognition

Overview of Image Analysis
§ Object detection, localisation, tracking

Object detection and localization

Object tracking (https://www.youtube.com/watch?v=dKpRsdYSCLQ)

Overview of Image Analysis
§ Scene segmentation and reconstruction

[Farabet et al. PAMI 2013]

http://twd20g.blogspot.com.au/2011/12/this-work-presents-novel-system-that.html https://www.3dflow.net/elementsCV/S4.xhtml
Image Analysis Steps
§ Collection and labelling
§ Collect representative images from a given task and label the ground
truth
§ Image representation
§ Select and/or design appropriate image representations (invariant and
discriminative)
§ Image analysis techniques
§ Apply and/or design appropriate analysis techniques for the given tasks
(classification, detection, tracking, segmentation, etc.)
Representing Image
§ Why representing images is difficult?
§ Scale, rotation, illumination, occlusion, background clutter, deformation, …
§ Invariant and Discriminative representation

Cat:
Representing Image
§ Traditional representation (before year 2000)
§ Hand-crafted, global features
§ Intensity, colour, texture, shape, structure, etc.

Colour histogram in a RGB space Face recognition with raw pixel

intensities
Representing Image
§ Days of the BoVW model (2000 ~ 2012)
§ SIFT, HOG, SURF, CENTRIST, filter-based, …
§ Invariant to view angle, scale, illumination, ...

SIFT (Scale Invariant Feature

Transform)

http://www.robots.ox.ac.uk/~vgg/software
/ Image courtesy of David Lowe, IJCV04
Deep Learning Model
Convolutional Neural Networks (CNNs)
§ A special multi-stage architecture inspired by visual system
§ Higher stages compute more global, more invariant features
Deep Learning Model

https://www.datasciencecentral.com/lenet-5-a-classic-cnn-architecture/
Convolution

§ For standard 2D convolution:

Filter

§ The stride is 1.
§ The height and width are changed as:
&'( )&*'+,-.
!"#$ = + 1 = (5 − 3)⁄1 + 1 = 3.
/$0123
Convolution

We need Zero-Padding to keep image size:

The width/height will become:

!&' − !)&*$+, + 2×0122345
!"#$ = +1
678329
Convolution Layers
In convolution layers:
§ Filters are called Kernels and become 3D. The parameters of
kernels (i.e., weights) are to be learned.

Kernel 1
…
Kernel N

'( ×') ×*%&

!×#×$%& !×#×$+,-
Convolution Layers
In convolution layers:
§ Feature maps are the outputs of each layer. The number of
feature maps is the channel.

Feature map 1
…
Feature map N

!×#×$%& !×#×$'()
Convolutional Neural Networks

§ Multi-stage Architecture
Convolution
Non-linearity
Pooling
Convolutional Neural Networks
Convolution
- A set of filters convolve with the input
- Share weights across the input space (translation equivariance)

Input
Filters
Feature Map
Convolutional Neural Networks
Non-linearity

Sigmoid: f(x)=1/(1+e-x) Tanh: f(x)=(ex − e-x)/(ex +e-x) ReLu: f(x)=max(x, 0)

Convolutional Neural Networks

Spatial pooling
§ Non-overlapping / overlapping regions
§ Max or sum
§ Invariance to small transformations

Max pooling

Sum/Average
pooling
Deep Learning Model
CNNs: ImageNet Breakthrough

[Krizhevsky et al. NIPS 2012]

● Krizhevsky et al. win 2012 ImageNet classification with a much bigger ConvNet
○ deeper: 7 stages vs 3 before
○ larger: 60 million parameters vs 1 million before
○ 16.4% error (top-5) vs Next best 26.2% error

● This was made possible by:

○ fast hardware: GPU-optimized code
○ big dataset: 1.2 million images vs thousands before
○ better regularization: dropout et al. Image courtesy of Deng et al.
Deep Learning Model
Learned Features of CNNs

[Matthew D. Zeiler et al. ECCV 2014]

Deep Learning Model

Object detection (Source: Rich feature hierarchies for accurate object detection and semantic
segmentation, CVPR 2014)

Face Recognition (Source: DeepFace: Closing the Gap to Human-Level Performance in Face Verification,
CVPR 2014)
Deep Learning Model

§ Directly use pre-trained CNNs

§ Which layer to use?
§ How to pool the features in a convolutional layer?
Deep Learning Model

§ Directly use pre-trained CNNs

§ Which layer to use?
Convolutional layer
Fully connected
layer
Deep Learning Model
§ Fine-tune pre-trained CNNs
§ To incorporate extra information from the images of a
new recognition task
§ Make the pre-trained CNNs adapt to this new task
Pre-trained CNNs New recognition task
on

Fine-
tune

Image courtesy of Deng et al.

http://people.csail.mit.edu/bzhou/
Summary
§ Computer vision is a key component of AI
§ Image analysis is an important and broad area
§ Feature representation is key for image analysis
§ Deep Learning techniques are now widely used
Acknowledgement

The lecture slides are based on the materials from ai.Berkey.edu

Thank you. Questions?

Ch-3 Image AnalysisComputer Vision
No ratings yet
Ch-3 Image AnalysisComputer Vision
88 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Chapitre 8 2024
No ratings yet
Chapitre 8 2024
231 pages
Topic 5 Computer Vision
No ratings yet
Topic 5 Computer Vision
65 pages
CV SVD L01 P1 Intro
No ratings yet
CV SVD L01 P1 Intro
35 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
CNN Course: Build & Apply Networks
No ratings yet
CNN Course: Build & Apply Networks
95 pages
Deep Learning for Vision Experts
No ratings yet
Deep Learning for Vision Experts
91 pages
Lec00 Intro For Web Highlighted
No ratings yet
Lec00 Intro For Web Highlighted
72 pages
Computer Vision and Its Applications
No ratings yet
Computer Vision and Its Applications
3 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
Computer Vision
No ratings yet
Computer Vision
45 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
A Comprehensive Guide To Computer Vision
No ratings yet
A Comprehensive Guide To Computer Vision
6 pages
8394 Making Machines See
No ratings yet
8394 Making Machines See
50 pages
COMP3411 Week 7 - Computer Vision
No ratings yet
COMP3411 Week 7 - Computer Vision
58 pages
W11 Lecture ITS69204 Image Recognition
No ratings yet
W11 Lecture ITS69204 Image Recognition
44 pages
Syllabus
No ratings yet
Syllabus
15 pages
Ilovepdf Merged Compressed
No ratings yet
Ilovepdf Merged Compressed
1,100 pages
Unit 4 Deep Learning For Computer Vision
No ratings yet
Unit 4 Deep Learning For Computer Vision
6 pages
Computer Vision
100% (1)
Computer Vision
48 pages
Computer Vision Revision Notes - 250322 - 101703
No ratings yet
Computer Vision Revision Notes - 250322 - 101703
4 pages
Class 10th Computer Vision Revision Notes
No ratings yet
Class 10th Computer Vision Revision Notes
4 pages
Thesis Research Deep Learning
No ratings yet
Thesis Research Deep Learning
18 pages
L7 - Computer Vision
No ratings yet
L7 - Computer Vision
69 pages
CVI Week 2 1 Pre Note
No ratings yet
CVI Week 2 1 Pre Note
56 pages
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
No ratings yet
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
15 pages
Computer Vision Basics for Beginners
No ratings yet
Computer Vision Basics for Beginners
21 pages
UNIT-I - Introduction To Computer Vision
No ratings yet
UNIT-I - Introduction To Computer Vision
45 pages
Computer Vision With Deep Learning
No ratings yet
Computer Vision With Deep Learning
5 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
61 pages
Computer Vision
No ratings yet
Computer Vision
33 pages
Computer Vision Technology
No ratings yet
Computer Vision Technology
29 pages
CNN Basic
No ratings yet
CNN Basic
64 pages
Lecture2.2 UnimodalRepresentations Part1 PDF
No ratings yet
Lecture2.2 UnimodalRepresentations Part1 PDF
92 pages
Syllabus Udacity Default en Us
No ratings yet
Syllabus Udacity Default en Us
4 pages
CV Unit 1 Overview of Computer Vison and Application
No ratings yet
CV Unit 1 Overview of Computer Vison and Application
51 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Lecture AI 15 23052025 112103am
No ratings yet
Lecture AI 15 23052025 112103am
69 pages
Facial Recognition Using Deep Learning
No ratings yet
Facial Recognition Using Deep Learning
6 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
1 Intro Visión Artificial
No ratings yet
1 Intro Visión Artificial
50 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
Convolutional Nets
No ratings yet
Convolutional Nets
41 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Abhijith Vision
No ratings yet
Abhijith Vision
17 pages
Machine Learning: Machine Learning (ML) Applications in Computer Vision (CV)
No ratings yet
Machine Learning: Machine Learning (ML) Applications in Computer Vision (CV)
6 pages
Cs383 Lecture 20 PDF
No ratings yet
Cs383 Lecture 20 PDF
61 pages
CV Unit 1
No ratings yet
CV Unit 1
17 pages
Computer Visiondk
No ratings yet
Computer Visiondk
12 pages
CV Digital Notes
No ratings yet
CV Digital Notes
77 pages
Admin,+4554 Article+Text 17736 2 10 20210928
No ratings yet
Admin,+4554 Article+Text 17736 2 10 20210928
13 pages
Week1 Lecture1
No ratings yet
Week1 Lecture1
40 pages
AI & Neural Networks Basics
No ratings yet
AI & Neural Networks Basics
39 pages
Foundations of AI Course Outline
No ratings yet
Foundations of AI Course Outline
39 pages
Week3 LearningI
No ratings yet
Week3 LearningI
48 pages
Price Guideline Gladuate - 2024
No ratings yet
Price Guideline Gladuate - 2024
17 pages
Chinese CLIP for Vision-Language AI
No ratings yet
Chinese CLIP for Vision-Language AI
18 pages
Motivation Letter
100% (1)
Motivation Letter
4 pages
ImageFlow: Free AI Inpainting & Outpainting Tool
No ratings yet
ImageFlow: Free AI Inpainting & Outpainting Tool
9 pages
ConvNet For The 2020s
No ratings yet
ConvNet For The 2020s
12 pages
MSc Dissertation Help
67% (3)
MSc Dissertation Help
6 pages
Exercícios Complementares - Photoshop CC
No ratings yet
Exercícios Complementares - Photoshop CC
20 pages
Ptex: Per-Face Texture Mapping For Production Rendering: EGSR 2008
No ratings yet
Ptex: Per-Face Texture Mapping For Production Rendering: EGSR 2008
37 pages
Freeman Chain Code
No ratings yet
Freeman Chain Code
8 pages
Photoshop
No ratings yet
Photoshop
9 pages
Photoshop Clipping Mask Guide
No ratings yet
Photoshop Clipping Mask Guide
6 pages
Car Insurance Fraud Detection System
No ratings yet
Car Insurance Fraud Detection System
6 pages
Vizit Master Lock Case Study
No ratings yet
Vizit Master Lock Case Study
3 pages
Engineering Drawing: Credit Hours
No ratings yet
Engineering Drawing: Credit Hours
40 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Mulberry Leaf Disease Detection
No ratings yet
Mulberry Leaf Disease Detection
7 pages
Adobe Photoshop - Create Smart Objects PDF
No ratings yet
Adobe Photoshop - Create Smart Objects PDF
3 pages
Aksdcbfd12324 4354665tgdfhghdf
No ratings yet
Aksdcbfd12324 4354665tgdfhghdf
11 pages
Computer Vision - Session 1
No ratings yet
Computer Vision - Session 1
36 pages
IP Module2
No ratings yet
IP Module2
125 pages
Project Synopsis
No ratings yet
Project Synopsis
8 pages
Accident Detection with AlexNet
No ratings yet
Accident Detection with AlexNet
9 pages
A Photographers Guide To RAW in Photoshop
100% (1)
A Photographers Guide To RAW in Photoshop
228 pages
AI & ML Projects for Tech Enthusiasts
No ratings yet
AI & ML Projects for Tech Enthusiasts
5 pages
Ai Preboard Unlocked
No ratings yet
Ai Preboard Unlocked
7 pages
A Review of Visual Trackers and Analysis of Its Application To Mobile Robot
No ratings yet
A Review of Visual Trackers and Analysis of Its Application To Mobile Robot
25 pages
Digital Image Processing Lab
No ratings yet
Digital Image Processing Lab
11 pages
COM 423 AI & Expert System - 240903 - 104915
No ratings yet
COM 423 AI & Expert System - 240903 - 104915
39 pages
Deep Learning-Based Human Pose Estimation: A Survey
No ratings yet
Deep Learning-Based Human Pose Estimation: A Survey
25 pages
CCS349 - 2 Marks
No ratings yet
CCS349 - 2 Marks
13 pages

Week5 Computer Vision

Uploaded by

Week5 Computer Vision

Uploaded by

CSCI218: Foundations

Image Color Histogram 6

Texture (e.g., Gray-Level Co-Occurrence Matrix (GLCM))

Segmentation of natural images

Important sources of appearance variation

Why convolutional neural networks classify images well

Faster RCNN for object detection

Understanding what people are doing

Understanding what people are doing

Automated image captioning

Reconstruction from many views

Geometry from a single view

Image Transformation (Paired)

Image Transformation (Unpaired)

Image Transformation (Style transfer)

Image Generation (by GAN)

Controlling movement with vision

Face OCR recognition

Scene recognition Object recognition

Object detection and localization

Object tracking (https://www.youtube.com/watch?v=dKpRsdYSCLQ)

[Farabet et al. PAMI 2013]

Colour histogram in a RGB space Face recognition with raw pixel

SIFT (Scale Invariant Feature

§ For standard 2D convolution:

We need Zero-Padding to keep image size:

The width/height will become:

'( ×') ×*%&

Sigmoid: f(x)=1/(1+e-x) Tanh: f(x)=(ex − e-x)/(ex +e-x) ReLu: f(x)=max(x, 0)

[Krizhevsky et al. NIPS 2012]

● This was made possible by:

[Matthew D. Zeiler et al. ECCV 2014]

§ Directly use pre-trained CNNs

§ Directly use pre-trained CNNs

Image courtesy of Deng et al.

The lecture slides are based on the materials from ai.Berkey.edu

You might also like