Computer Vision
COMP3411/9814: Artificial Intelligence
Lecture Overview
• Introduction
• Image processing
• Scene analysis
• Cognitive vision
Introduction
• An agent interacts with the world through several sensory modalities (vision, acoustic, temperature, pressure, etc.).
• Computer vision enables machines to “see” the world.
• Applications include character recognition, image interpretation, face
recognition, fingerprint identification, and robot control.
Introduction
• Effortless for humans, but a difficult problem for machines:
• Variable and uncontrolled illumination
• Shadows
• Complex and hard-to-describe objects
• Objects from outdoor scenes
• Non-rigid objects
• Objects occluding other objects
Introduction
• State of computer vision: the general computer vision problem is unsolved.
• Developing a visual system as good as a human’s: no progress in 40 years.
• There has been a lot of progress on specific computer vision problems:
• e.g. face recognition used in digital cameras, surveillance, security.
• e.g. pick and place.
Introduction
Doggie cam
Introduction
Object recognition
Introduction
Computer vision in action
Introduction
Doggie cam in action
Introduction
What the robot sees
Introduction
• Computer vision starts from an image of the scene formed on an array.
• A lens camera produces a perspective projection of the scene within the
camera’s field of view.
• Perspective projection is a many-to-one transformation.
• The image can be noisy due to low ambient light levels.
Introduction
• Perspective projection is a many-to-one transformation:
• Several different scenes can produce identical images.
• The image cannot be directly “inverted” to reconstruct the scene.
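• As an illustrative sketch (not from the slides), the standard pinhole model maps a scene point (X, Y, Z) in camera coordinates to the image point (f·X/Z, f·Y/Z); scaling a scene uniformly away from the camera leaves the image unchanged, which is why the image cannot be inverted:

    # Minimal sketch of the standard pinhole (perspective) projection.
    # Two different scenes project to the same image point below,
    # illustrating the many-to-one mapping.

    def project(point, focal_length=1.0):
        """Project a 3D camera-frame point (X, Y, Z) onto the image plane."""
        X, Y, Z = point
        return (focal_length * X / Z, focal_length * Y / Z)

    print(project((1.0, 2.0, 4.0)))   # (0.25, 0.5)
    print(project((2.0, 4.0, 8.0)))   # (0.25, 0.5) -- larger but farther away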
Introduction
• The image is represented as a two-dimensional, time-varying matrix of
intensity values I(x, y, t).
• Colour vision uses three matrices (R, G, B); monochromatic vision uses only one.
• In static scenes, the time variable is not considered.
• An iconic model or a set of features can be obtained from this matrix.
• Information to be extracted depends on the task, e.g.,
• For safe navigation: object locations, boundaries, surface properties.
• For object manipulation: locations, sizes, shapes, compositions and
textures.
• Other tasks might also need colour or membership of certain classes.
Introduction
• Object features
• Illumination (incident light)
• Reflectance (reflected light)
• Depth (distance from camera)
• Orientation (angle of normal to surface)
• Other features: shading, colour, texture
Introduction
Image formation
• A light image is created by a camera.
• Each pixel is a number that is an index into a palette of colours or grey scales.
Introduction
• Binary vision
• The original image is “thresholded”, i.e.
new[x, y] = (old[x, y] > threshold)
• Every pixel brighter than a certain threshold is given a value of 1;
otherwise it is 0.
• Easy to process and powerful enough to use in some industrial
applications, e.g., picking parts from an assembly line.
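• A minimal Python/NumPy sketch of the thresholding step above (the threshold value is an arbitrary illustration):

    import numpy as np

    def binarise(image, threshold):
        """Return 1 where the pixel is brighter than the threshold, else 0."""
        return (image > threshold).astype(np.uint8)

    img = np.array([[200,  40],
                    [120, 250]])
    print(binarise(img, threshold=128))
    # [[1 0]
    #  [0 1]]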
Example: Steering an Automobile
• Neural networks can be used to convert
the image intensity matrix directly into
actions.
• ALVINN steers an automobile:
• Input: a low-resolution 30x32 image from a camera mounted looking
straight ahead.
• Hidden layer: 5 sigmoid units.
• Output: 30 units encoding the steering angle, chosen by winner-take-all.
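• A rough sketch of the network shape described above (not ALVINN’s actual code; the weights below are random placeholders, whereas ALVINN learned them with backpropagation):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # Layer sizes from the slide: 30x32 input, 5 hidden, 30 output units.
    rng = np.random.default_rng(0)
    W1 = rng.normal(scale=0.1, size=(5, 30 * 32))
    W2 = rng.normal(scale=0.1, size=(30, 5))

    def steer(image):
        """Map a 30x32 intensity image to a steering unit (winner-take-all)."""
        hidden = sigmoid(W1 @ image.reshape(-1))
        output = sigmoid(W2 @ hidden)
        return int(np.argmax(output))   # index of the most active output unit

    print(steer(rng.random((30, 32))))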
Example: Steering an Automobile
• Training: 5 minutes of human driving, using the actual steering angles
as labels.
• Trained incrementally with backpropagation.
• Problems:
• The driver usually drives well, so the data contain few corrective manoeuvres.
• After a long, straight stretch, the network is trained to produce only
straight-ahead angles.
Example: Steering an Automobile
• Unofficial video:
• https://youtu.be/oHEH2VDDGss
Robot vision: Two stages
• Image processing involves filtering operations to reduce noise,
accentuate edges, and find regions.
• Scene analysis creates an iconic model or a feature-based
description including only relevant details.
Robot vision: Two stages
• Man-made environments: doorways,
furniture, other agents, humans, walls,
floors, etc.
• In exterior environments: animals,
plants, man-made structures,
automobiles, roads, etc.
• Two techniques: look for edges (where intensity changes abruptly, i.e.,
discontinuities) or find regions (within which intensity changes only
gradually).
Robot vision: Two stages
• From robot view:
• Three toy blocks (A, B, C).
• A doorway.
• A corner of the room.
• Dealing only with the disposition of the blocks, the iconic model is:
• ((C B A FLOOR))
• If C is moved to the floor: ((C FLOOR) (B A FLOOR)) or
((B A FLOOR) (C FLOOR))
Lecture Overview
• Introduction
• Image processing
• Scene analysis
• Cognitive vision
Image processing: Averaging
• The image is represented as an n x m image-intensity array I(x, y).
• Cells are called pixels; each number represents a light intensity.
• Real images always contain noise.
• Smoothing tries to remove isolated bright and dark regions.
• Averaging is applied with a sliding convolution window.
• It has the side effect of blurring the image.
Image processing: Averaging
• It can use a threshold.
• Larger rectangles achieve more
smoothing.
• Broad lines are thickened and thin lines are eliminated.
• In the example, ε = 3, i.e., 0 if sum ≤ 3, 1
otherwise.
Image processing: Averaging
• Image smoothing with a
Gaussian filter.
• Images increasingly
blurred.
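• A small sketch using OpenCV (listed under Resources); the kernel sizes and sigmas are illustrative choices only:

    import cv2
    import numpy as np

    # A synthetic noisy grey-scale image stands in for a real camera frame.
    img = (np.random.rand(128, 128) * 255).astype(np.uint8)

    # Larger kernels / sigmas smooth (and blur) the image more strongly.
    mild   = cv2.GaussianBlur(img, (5, 5), 1.0)
    strong = cv2.GaussianBlur(img, (15, 15), 4.0)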
Image processing: Averaging
• Given a simple 4 x 4 picture matrix:
9 9 9 3
9 9 3 3
9 3 3 3
3 3 3 3
• Smooth this matrix using an averaging technique and a 3 x 3
pixel window.
Image processing: Averaging
• There are four 3 x 3 pixel windows in the
matrix.
• Replace middle value in each window by
average of all the values in the window.
Original:    Smoothed:
9 9 9 3      9 9 9 3
9 9 3 3      9 7 5 3
9 3 3 3      9 5 4 3
3 3 3 3      3 3 3 3
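• A short sketch reproducing the worked example above; border pixels are left unchanged because no full 3x3 window fits there, and window means are rounded:

    import numpy as np

    img = np.array([[9, 9, 9, 3],
                    [9, 9, 3, 3],
                    [9, 3, 3, 3],
                    [3, 3, 3, 3]], dtype=float)

    smoothed = img.copy()
    for i in range(1, img.shape[0] - 1):
        for j in range(1, img.shape[1] - 1):
            # Replace the centre value by the rounded mean of its 3x3 window.
            smoothed[i, j] = round(img[i-1:i+2, j-1:j+2].mean())

    print(smoothed.astype(int))
    # [[9 9 9 3]
    #  [9 7 5 3]
    #  [9 5 4 3]
    #  [3 3 3 3]]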
Image processing: Edge enhancement
• Edges are used to build a line drawing.
• Outlines can be compared with object models.
• Edges are parts of the image where property values (e.g., intensity)
change markedly.
Image processing: Edge enhancement
• Averaging and edge enhancement can be combined.
• For instance, using a Laplacian filter.
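• An illustrative sketch of edge enhancement with a common 3x3 Laplacian kernel, applied here with OpenCV’s filter2D (the kernel choice is an assumption, as the slides do not give one):

    import cv2
    import numpy as np

    # The Laplacian responds strongly where intensity changes abruptly
    # (edges) and is close to zero in uniform regions.
    kernel = np.array([[ 0, -1,  0],
                       [-1,  4, -1],
                       [ 0, -1,  0]], dtype=np.float32)

    img = np.array([[9, 9, 9, 3],
                    [9, 9, 3, 3],
                    [9, 3, 3, 3],
                    [3, 3, 3, 3]], dtype=np.float32)

    edges = cv2.filter2D(img, -1, kernel)   # cv2.Laplacian gives a similar result
    print(edges)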
Image processing: Region finding
• The goal is to find regions in which a property does not change abruptly.
• A region is homogeneous if the intensity differences within it are no more
than some threshold ε.
• Split-and-merge method, applied to a 2^n x 2^n array of pixels:
• Each non-homogeneous region is split into four.
• Splitting continues until no more splits need to be made.
• Adjacent regions are merged if homogeneous.
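• A rough sketch of the splitting half of split-and-merge, assuming a 2^n x 2^n image and the homogeneity test max − min ≤ ε; merging of adjacent homogeneous regions is omitted for brevity:

    import numpy as np

    def split(img, x, y, size, eps, regions):
        """Recursively split a square block until it is homogeneous."""
        block = img[y:y + size, x:x + size]
        if size == 1 or block.max() - block.min() <= eps:
            regions.append((x, y, size))      # homogeneous: keep as one region
            return
        half = size // 2
        for dx, dy in [(0, 0), (half, 0), (0, half), (half, half)]:
            split(img, x + dx, y + dy, half, eps, regions)

    img = np.array([[9, 9, 9, 3],
                    [9, 9, 3, 3],
                    [9, 3, 3, 3],
                    [3, 3, 3, 3]])
    regions = []
    split(img, 0, 0, 4, eps=1, regions=regions)
    print(regions)   # list of (x, y, size) homogeneous blocks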
Image processing: Region finding
• Splitting and merging candidate regions.
• In this example, intensities within a region may not vary by more than
1 unit; therefore, ε ≤ 1.
Lecture Overview
• Introduction
• Image processing
• Scene analysis
• Cognitive vision
Scene Analysis
• Extract information about the scene.
• Because the scene-to-image mapping is many-to-one, additional images or
information are needed.
• Information can be very general or specific, e.g., camera location,
illumination sources, indoors/outdoors, particular objects.
• Iconic model or features:
• Iconic model builds a model of the scene or part of it.
• Feature-based analysis is task-oriented.
Interpreting lines and curves in the image
• For scenes with rectilinear objects, lines should be postulated.
Methods fit segments of straight lines to edges.
• For scenes with curved objects, methods fit conic sections (ellipses,
parabolas, hyperbolas).
• Interpreting the line drawing associates properties with
components of a line drawing.
• For instance, in scenes with only planar surfaces, no more than three
surfaces intersect at a point.
Interpreting lines and curves in the image
• Scene with bounding walls, floor,
ceiling, a cube on the floor.
• Only three possible intersections:
• Occlude: two planes, one occluding the other (marked with an arrow).
• Blade: both visible forming a convex edge
(+).
• Fold: both visible forming a concave edge
(-).
Interpreting lines and curves in the image
• Labelling types of junctions (V, W, Y, T) by assigning +, −, or occlusion labels to their edges.
Model-based vision
• Use increasing knowledge about the scene.
• For instance, an industrial scene could use geometric models of
components to interpret images – still not semantics though.
• Or if we know a cube is in the scene, a projection can be fitted
specifying size, position, and orientation (using Euler angles).
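• A hedged sketch of the forward step of such a fit: a cube parameterised by size, position, and Euler-angle orientation is projected with the pinhole model, and a fitting procedure would compare the projected corners with the image (the conventions and values here are illustrative assumptions):

    import numpy as np

    def rotation_from_euler(roll, pitch, yaw):
        """Rotation matrix from Z-Y-X Euler angles (one common convention)."""
        cr, sr = np.cos(roll), np.sin(roll)
        cp, sp = np.cos(pitch), np.sin(pitch)
        cy, sy = np.cos(yaw), np.sin(yaw)
        Rz = np.array([[cy, -sy, 0], [sy, cy, 0], [0, 0, 1]])
        Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])
        Rx = np.array([[1, 0, 0], [0, cr, -sr], [0, sr, cr]])
        return Rz @ Ry @ Rx

    # Unit cube corners, then scaled, rotated, translated, and projected.
    corners = np.array([[x, y, z] for x in (0, 1) for y in (0, 1) for z in (0, 1)],
                       dtype=float)
    size, position = 0.5, np.array([0.0, 0.0, 4.0])
    R = rotation_from_euler(0.1, 0.2, 0.3)
    world = (R @ (size * corners).T).T + position
    image_points = world[:, :2] / world[:, 2:3]   # pinhole projection, f = 1
    print(image_points)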
Model-based vision
• Generalized cylinders for model
construction.
• Each cylinder uses 9 parameters: a, b,
c, 6 location parameters.
• Hierarchical representation.
Stereo vision
• Under perspective projection, large, distant objects might produce the
same image as similar but smaller, closer ones.
• Distance estimation from single
images is problematic, but sometimes
possible.
• e.g., if we know that an object is on the floor and we know the camera height.
Stereo vision
• Depth information from stereo vision.
• Two-dimensional setup.
• Two lenses separated by a baseline distance b.
• The correspondence problem: matching pairs of points between the two
images.
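• As a hedged sketch (the slides do not give the formula): with parallel cameras separated by baseline b and focal length f, a point whose image positions differ by disparity d lies at depth Z = f·b / d, so solving the correspondence problem yields d and hence depth:

    def depth_from_disparity(f_pixels, baseline_m, disparity_pixels):
        """Standard parallel-camera stereo relation: Z = f * b / d."""
        return f_pixels * baseline_m / disparity_pixels

    # Illustrative numbers only: 700-pixel focal length, 10 cm baseline,
    # 35-pixel disparity between the left and right image of a point.
    print(depth_from_disparity(700, 0.10, 35))   # 2.0 metres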
Lecture Overview
• Introduction
• Image processing
• Scene analysis
• Cognitive vision
Principles of cognitive vision
• Is perception only a recovery process?
• Computer vision produces 3D descriptions of the scene, assigning labels to
objects and/or actions.
• Labels provided to symbolic reasoning systems.
• Visual perception is seen as a black box delivering labels through
recognition, using (mostly static) data.
• Going from pixels to symbols is difficult, with no causal link between
present and past; it is therefore not well suited to anticipating the future.
Principles of cognitive vision
• Human behaviour is active!
• Humans (and animals) continuously shift their gaze.
• Humans have intentions and goals linking past with present with
the aim of anticipating the future.
• Human actions are goal driven, guided by motor and perceptual
expectations.
Principles of cognitive vision
• Is perception only an inference process?
• Signal analysis is not enough to understand a scene.
• Additional knowledge through inference – as we look at the world, we think
about it.
• Cognitive vision continuously exchanges information between
perception and reasoning. A form of predictive vision.
• Actions driven by perceptual expectations – how should I act to
see my hand close to the object vs. how should I act to reach the
object?
Vision and reasoning interaction
• Cognitive vision extends the processing of visual data beyond extracting
visual features for real-time control.
• Reasoning and perception talk about objects, actions, events, and
alternative possibilities.
• Loop between prediction (what the system expects perceptually)
and exploration (how the system acts to verify if predictions are
met).
Vision and reasoning interaction
• Five interaction paths for Vision (V) and Reasoning (R):
• V → R. The traditional perspective for computer vision.
• R → V. For example, “search for the scissors” invokes a visual search.
• V → R → V. For example, “someone is cutting the tomato with the spoon” is
implausible for R, so it asks V again.
• R → V → R. For example, R needs to know the number of cars.
• R → V → V → … → V. Imagining and envisioning a situation, action, or event.
Vision and reasoning interaction
• Interactions between V and R can happen at early, middle, or late stages.
V + R interaction examples
• Cognitive vision to support human-robot interaction.
• iCub’s behaviour is driven only by the direction of the subject’s gaze,
which makes the intention to reach with the left or right hand explicit.
V + R interaction examples
• Cognitive vision for signature of
biological motion.
• Angular velocity and curvature of the
trajectory.
• Hand during drawing or writing.
• Knee or ankle during walking.
• These signatures can be measured visually, independently of shape and colour.
V + R interaction examples
• Cognitive vision involves language as an attention mechanism.
• Synonymy (same meaning) and hypernymy (“is a” relation).
• Objects that are likely to co-occur, e.g., table, cups, spoons.
• A knife put in a drawer is not gone but hidden from sight; in this case
language acts as part of the reasoning process.
V + R interaction examples
• Cognitive vision for objects’ affordances.
• Segmentation to infer adjectives.
• Pixels are coloured according to their associated affordances.
V + R interaction
• Cognitive vision does not exist in isolation, merely to detect what is where.
• This is in direct contrast to how vision is predominantly studied today.
• A unified representation across vision and other sensory modalities through
action: V + R → Action.
• Questions beyond what and where: why, how, who.
• Also, how to synthesize visual information to anticipate the effects of actions.
Resources
• OpenCV: real-time optimized
Computer Vision library.
• https://opencv.org/
• YOLO: state-of-the-art, real-time
object detection.
• https://pjreddie.com/darknet/yolo/
References
• Nilsson, N. J. (1998). Artificial
intelligence: a new synthesis. Morgan
Kaufmann. Chapter 6.
• Aloimonos, Y., & Sandini, G. Principles
of Cognitive Vision. In Cangelosi, A., &
Asada, M. (Eds.). (2022). Cognitive
robotics. MIT Press. Chapter 14.
Feedback
• In case you want to provide anonymous
feedback on these lectures, please visit:
• https://forms.gle/KBkN744QuffuAZLF8
Thank you very much!