0% found this document useful (0 votes)

155 views12 pages

12.2 Computer Vision

The document discusses applications of deep learning in computer vision. It describes how computer vision is well-suited for deep learning research due to vision being easy for humans but difficult for computers. Common computer vision tasks aimed at replicating human abilities include object recognition, detection, image synthesis. Preprocessing techniques for computer vision with deep learning include contrast normalization, dataset augmentation which increases training data through transformations.

Uploaded by

nikhilsinha789

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

155 views12 pages

12.2 Computer Vision

Uploaded by

nikhilsinha789

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Deep Learning Srihari

Applications: Computer Vision

Sargur N. Srihari
srihari@cedar.buffalo.edu

1
Deep Learning Srihari

Topics in Applications
1. Large-Scale Deep Learning
2. Computer Vision
3. Speech Recognition
4. Natural Language Processing
5. Other Applications

2
Deep Learning Srihari

Topics in Computer Vision

• Overview
• Preprocessing
– Contrast Normalization
– Dataset Augmentation

3
Deep Learning Srihari

Computer Vision and Deep Learning

• Computer Vision is one of the most active
areas for deep learning research, since
– Vision is a task effortless for humans but difficult for
computers
• Standard benchmarks for deep learning
algorithms are:
– object recognition
– OCR

4
Deep Learning Srihari

Common tasks
• Small core of AI goals aimed at replicating
human abilities
– Object recognition
– Detection of some form
• Which object is present?
• Annotating an image with bounding boxes around each
object
• Transcribing a sequence of symbols from image
• Labeling each pixel with identity of object it belongs
– Image synthesis
• Because generative models are a guiding principle
behind deep learning, large body of work on synthesis5
Preprocessing
Deep Learning Srihari

• Some deep learning needs much

preprocessing
• Computer vision requires little preprocessing
– Pixel range
• Images should be standardized, so pixels lie in same
range [0,1], [-1,1], or [0,255] etc
– Picture size
• Some architectures need a standard size. So images may
need to be scaled
• May not be needed with convolutional models which
dynamically adjust size of pooling regions
– Data set augmentation 6
• Can be seen as a preprocessing step for training set
Deep Learning Srihari

Training with large data sets

• Large data sets (Imagenet) & models (Alexnet)
– No preprocessing
– Learns invariances

• Alexnet for Imagenet has one preprocessor

– Subtract mean across training examples of pixels
– Dataset: ILSVRC subset of ImageNet: 1000 images in each
of 1000 categories: 1.2m training, 50k validation, 150k testing
– Architecture: CNN with 5 conv layers, max-pool layers,
dropout layers, 3 fully connected layers.
– Performance: top 5 error rate= 15.4% next was 26.2% 7
Deep Learning Srihari

Contrast Normalization
• Image contrast can be safely removed
• Contrast refers to the magnitude of the
difference between bright and dark pixels
• In deep learning different definition
– Contrast = standard deviation of pixels
– For image with r rows and c columns, and RGB
image, contrast of entire image is
where

8
• When std dev is high, values differ more from mean
Deep Learning Srihari

Global Contrast Normalization

• Aims to prevent images from having varying
amounts of contrast
• Subtract mean from each image, then rescale it
so that std dev across pixels equals constant s
• Given an input image X, GCN produces an X’

• 𝜆 is a positive regularization term to bias the std

deviation, the denominator is constrained to be
at least 𝜀 9
Deep Learning Srihari

GCN maps examples onto sphere

• Raw input data may have any norm
• 𝜆=0 maps all nonzero examples onto sphere
• 𝜆>0 draws examples towards sphere but does
not discard variations in norm

10
Deep Learning Srihari

Local Contrast Normalization

• Contrast is normalized across each small
window rather than entire image

11
Deep Learning Srihari

Dataset Augmentation
• Increasing training set by adding modified
training examples
– with transformations that do not change the class
• Object recognition is helped because input may
be transformed with many geometric operations
– Classifiers benefit from random translations,
rotations, flips of the input
• In specialized vision applications:
– Perturbations of colors
– Nonlinear geometric transformations of input
12

Module 5
No ratings yet
Module 5
72 pages
DL U-III Computer Vision
100% (1)
DL U-III Computer Vision
30 pages
Computer Vision
No ratings yet
Computer Vision
20 pages
9.5 CNN-Variants
No ratings yet
9.5 CNN-Variants
21 pages
Sagar Paper
No ratings yet
Sagar Paper
4 pages
8 Modern Convolutional Neural Networks: Et Al. Et Al. Et Al
No ratings yet
8 Modern Convolutional Neural Networks: Et Al. Et Al. Et Al
57 pages
21.3 VAE Apps
No ratings yet
21.3 VAE Apps
29 pages
5.11 MLBasics-Challenges
No ratings yet
5.11 MLBasics-Challenges
20 pages
Production - Derieux - Cedric - Advances in Automatic Image Restoration and Upscaling
No ratings yet
Production - Derieux - Cedric - Advances in Automatic Image Restoration and Upscaling
4 pages
ImageNet Classification With Deep
No ratings yet
ImageNet Classification With Deep
7 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
Deep Learning Object Detection in MATLAB
No ratings yet
Deep Learning Object Detection in MATLAB
13 pages
Unit 1
No ratings yet
Unit 1
17 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
22.1 GAN Motivation
No ratings yet
22.1 GAN Motivation
20 pages
A Comprehensive Review of Knowledge Distillation in Computer Vision
No ratings yet
A Comprehensive Review of Knowledge Distillation in Computer Vision
38 pages
Recent Advances in Deep Learning For Object Detection
No ratings yet
Recent Advances in Deep Learning For Object Detection
26 pages
RESNET
No ratings yet
RESNET
5 pages
Imagenet Classification With Deep Convolutional Neural Networks
No ratings yet
Imagenet Classification With Deep Convolutional Neural Networks
7 pages
L10-DL Intro
No ratings yet
L10-DL Intro
25 pages
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
No ratings yet
DL U3 Applications of Deep Learning To Computer Vision: Image Classification Object Detection
15 pages
Image Colour Prediction Using Deep Learning
No ratings yet
Image Colour Prediction Using Deep Learning
4 pages
Image Restoration Using Residual Generative Adversarial Networks-FINAL
No ratings yet
Image Restoration Using Residual Generative Adversarial Networks-FINAL
21 pages
9.2 CNN-Motivation
No ratings yet
9.2 CNN-Motivation
17 pages
Going Deeper With Convolutions
No ratings yet
Going Deeper With Convolutions
9 pages
Convolutional Neural PDF
No ratings yet
Convolutional Neural PDF
187 pages
Module-1 DL
No ratings yet
Module-1 DL
53 pages
CVAE-GAN Fine-Grained Image Generation Through Asymmetric Training
No ratings yet
CVAE-GAN Fine-Grained Image Generation Through Asymmetric Training
10 pages
Sampath Et Al. - 2021 - A Survey On Generative Adversarial Networks For Im
No ratings yet
Sampath Et Al. - 2021 - A Survey On Generative Adversarial Networks For Im
60 pages
Builders' Guide
No ratings yet
Builders' Guide
21 pages
Deep Learning in Computer Vision: Principles and Applications First Edition. Edition Mahmoud Hassaballah Instant Download
100% (1)
Deep Learning in Computer Vision: Principles and Applications First Edition. Edition Mahmoud Hassaballah Instant Download
115 pages
Computer Vision Algorithms and Hardware Implementations A Survey
No ratings yet
Computer Vision Algorithms and Hardware Implementations A Survey
12 pages
A Gentle Introduction To Deep Learning in Medical Image Processing
No ratings yet
A Gentle Introduction To Deep Learning in Medical Image Processing
31 pages
Video Analytics On Iot Devices: A Whitepaper: Abstract
No ratings yet
Video Analytics On Iot Devices: A Whitepaper: Abstract
5 pages
4 100593163merged
No ratings yet
4 100593163merged
11 pages
Deep Learning Based Computer Vision
No ratings yet
Deep Learning Based Computer Vision
98 pages
Crowd Counting
No ratings yet
Crowd Counting
11 pages
Deep Learning in Computer Vision
No ratings yet
Deep Learning in Computer Vision
15 pages
Deep Learning for Image Recognition
No ratings yet
Deep Learning for Image Recognition
12 pages
A Review On Deep Learning Applications
No ratings yet
A Review On Deep Learning Applications
11 pages
Real Time Object Detection Using Deep Learning Andmachine Learning Project
No ratings yet
Real Time Object Detection Using Deep Learning Andmachine Learning Project
56 pages
Deep Generative Image Models Using A Laplacian Pyramid of Adversarial Networks
No ratings yet
Deep Generative Image Models Using A Laplacian Pyramid of Adversarial Networks
10 pages
Advanced CNNs for Image Recognition
No ratings yet
Advanced CNNs for Image Recognition
9 pages
Deep Learning Vision Tools Guide
No ratings yet
Deep Learning Vision Tools Guide
4 pages
Transfer Learning For Object Detection Using State-of-the-Art Deep Neural Networks
No ratings yet
Transfer Learning For Object Detection Using State-of-the-Art Deep Neural Networks
7 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
A Givision 2011
No ratings yet
A Givision 2011
5 pages
249 254Tesma601IJEAST
No ratings yet
249 254Tesma601IJEAST
7 pages
ObjectDetectionPhase2 Demo
No ratings yet
ObjectDetectionPhase2 Demo
16 pages
Deep Learning & Vision for AI Students
No ratings yet
Deep Learning & Vision for AI Students
36 pages
Image Super Resolution Project
No ratings yet
Image Super Resolution Project
8 pages
Computer Visiondk
No ratings yet
Computer Visiondk
12 pages
Unit - 3 - DL
No ratings yet
Unit - 3 - DL
15 pages
Facial Recognition Using Deep Learning
No ratings yet
Facial Recognition Using Deep Learning
6 pages
Image Recognition in Self-Driving Cars Using CNN
No ratings yet
Image Recognition in Self-Driving Cars Using CNN
7 pages
UNIT 2 IRS Up
No ratings yet
UNIT 2 IRS Up
42 pages
Online Voting Ballot Functions Analysis
No ratings yet
Online Voting Ballot Functions Analysis
8 pages
Unit 2
No ratings yet
Unit 2
10 pages
UNIT-III Lecture Notes
No ratings yet
UNIT-III Lecture Notes
18 pages
AI Lab Course for CSE Students
No ratings yet
AI Lab Course for CSE Students
18 pages
Control Systems Study Guide
No ratings yet
Control Systems Study Guide
98 pages
Process Control Instrumentation Glossary
No ratings yet
Process Control Instrumentation Glossary
9 pages
Requirement Management Plan
No ratings yet
Requirement Management Plan
7 pages
Chem2 Q3 Week 5 6
No ratings yet
Chem2 Q3 Week 5 6
6 pages
Class 11 Physics Notes Chapter 8 Studyguide360
No ratings yet
Class 11 Physics Notes Chapter 8 Studyguide360
36 pages
Digital Twins: Theory & Challenges
No ratings yet
Digital Twins: Theory & Challenges
15 pages
Langston Model 380SS Automatic Slitter Scorer Retrofit - CISA - Panama - Rack and Pinion Mechanism PDF
No ratings yet
Langston Model 380SS Automatic Slitter Scorer Retrofit - CISA - Panama - Rack and Pinion Mechanism PDF
9 pages
Chemistry, Class XI, Ch-6
No ratings yet
Chemistry, Class XI, Ch-6
2 pages
Alfred Sds Project
No ratings yet
Alfred Sds Project
10 pages
Operations Research Optimization Guide
No ratings yet
Operations Research Optimization Guide
1 page
Quasi Courtship Behavior in Psychotherapy
No ratings yet
Quasi Courtship Behavior in Psychotherapy
14 pages
NN3 PDF
No ratings yet
NN3 PDF
7 pages
Centiel CumulusPower WEB
No ratings yet
Centiel CumulusPower WEB
6 pages
Planos de Fases
No ratings yet
Planos de Fases
3 pages
3 - Software Development in Practice
No ratings yet
3 - Software Development in Practice
15 pages
Feedback Control Types Explained
No ratings yet
Feedback Control Types Explained
3 pages
Control System Design & Block Diagrams
No ratings yet
Control System Design & Block Diagrams
14 pages
6 Stability of Discrete-Time Systems - Complete
No ratings yet
6 Stability of Discrete-Time Systems - Complete
40 pages
Module 1 Merged
No ratings yet
Module 1 Merged
209 pages
Iec 61131-3 and Plcopen: What Do We Bring
No ratings yet
Iec 61131-3 and Plcopen: What Do We Bring
37 pages
Control Systems Tutorial Solutions
No ratings yet
Control Systems Tutorial Solutions
8 pages
Reinforcement Learning: Karan Kathpalia
No ratings yet
Reinforcement Learning: Karan Kathpalia
80 pages
LIMS Solutions for Labs & Industry
No ratings yet
LIMS Solutions for Labs & Industry
2 pages
Unit 3:group B Test 9-Klasse
No ratings yet
Unit 3:group B Test 9-Klasse
3 pages
Capstone Project 1
100% (3)
Capstone Project 1
47 pages
Artificial Intelligence Report
No ratings yet
Artificial Intelligence Report
20 pages
BTD Lession Plane
No ratings yet
BTD Lession Plane
6 pages
没有稳定约束的连续时间模型预测控制的递归可行性
No ratings yet
没有稳定约束的连续时间模型预测控制的递归可行性
6 pages
3 - PROCESS CONTROL - 2010 - Chemical Process Equipment
No ratings yet
3 - PROCESS CONTROL - 2010 - Chemical Process Equipment
21 pages
Chem Eng Thermodynamics Guide
No ratings yet
Chem Eng Thermodynamics Guide
61 pages

12.2 Computer Vision

Uploaded by

12.2 Computer Vision

Uploaded by

Deep Learning Srihari

Applications: Computer Vision

Topics in Computer Vision

Computer Vision and Deep Learning

• Some deep learning needs much

Training with large data sets

• Alexnet for Imagenet has one preprocessor

Global Contrast Normalization

• 𝜆 is a positive regularization term to bias the std

GCN maps examples onto sphere

Local Contrast Normalization

You might also like