31.07.
2023
Digital Image Processing (CS7.404)
Lecture-1: Overview
Anoop M Namboodiri
Many slides borrowed from Vineet Gandhi and various sources
Before there were images ….
http://www.cs.toronto.edu/~guerzhoy/320/lec/Introduction.pdf
Before there were images ….
http://www.cs.toronto.edu/~guerzhoy/320/lec/Introduction.pdf
Before there were images ….
Girl with a pearl earring, J. Vermeer, 1665
Camera Obscura
http://www.cs.toronto.edu/~guerzhoy/320/lec/Introduction.pdf
And then there were images ….
http://www.cs.toronto.edu/~guerzhoy/320/lec/Introduction.pdf
What is a digital image ?
• An array of numbers representing color intensities
100 50 0 150
90 255 70 70
200 150 255 50
0 100 80 0
http://www.cs.ucc.ie/~gavin/cs1001/Notes/chap01/ch01_4.html
What is a digital image ?
• An array of numbers representing color intensities
100 50 0 150
90 255 70 70
200 150 255 50
0 100 80 0
http://www.cs.ucc.ie/~gavin/cs1001/Notes/chap01/ch01_4.html
What is a digital image ?
• An array of numbers representing color intensities
Width
100 50 0 150
90 255 70 70
Height
200 150 255 50
0 100 80 0
1 2-D channel pixel
8 bits/pixel
Image Representation (Grayscale)
https://openframeworks.cc/ofBook/images/image_processing_computer_vision/
Image Representation (RGB)
3 2-D channels
24 bits per pixel
https://www.researchgate.net/profile/Bhupendra_Pratap_Singh/publication/282798184/figure/fig15/AS:283989639221249@1444719814399/Color-image-
representation-and-RGB-matrix.png
Image Representation
fMRI image slices
56 2-D channels
Types of Images (classification on source)
• Radiation from EM spectrum
• Acoustic/ultrasonic/spectrogram
• Electronic
• Computer generated
EM spectrum
BLOOD FLOW SCAN BONE SCAN PET SCAN
courtesy: artheristisresearch.us courtesy: artheristisresearch.us courtesy: research.ucla.edu
EM spectrum
Wilhelm Röntgen
HAND MIT RINGEN CHEST RADIOGRAPH CT SCAN AIRPORT SCAN
courtesy: wikipedia courtesy: wikipedia courtesy: wikipedia courtesy: dpl-surveillance-equipment
EM spectrum
Lithography, industrial inspection, microscopy,
lasers, astronomical observations, florescence
microscopy etc. Eric Betzig, William Moerner and Stefan Hell
SUN (2 years apart) 100 EURO BILL Cell Phalloidin
courtesy: NASA courtesy: lifepixel.com courtesy: wikipedia
EM spectrum
Source:
Lifepixel.com
EM spectrum
Chips (optical microscopy) High Speed Photograph Satellite Image (Hurricane Katrina)
courtesy: EPFL microelectornics systems laboratory courtesy:Alan Sailer courtesy: britannica.com
EM spectrum
courtesy: LANDSAT (NASA) courtesy: imaging1.com
EM spectrum
Main application: RADAR
SOUTHEAST TIBET MOUNTAINS
courtesy: NASA
EM spectrum
MRI Brain MRI Knee
courtesy: mritnt.com courtesy: mri-tip.com
Types of Images (classification on source)
• Radiation from EM spectrum
• Acoustic/ultrasonic/spectrogram
• Electronic
• Computer generated
Ultrasound
ULTRASOUND ULTRASOUND TWINS ULTRASOUND 3D
courtesy: wikipedia courtesy: pinterest courtesy: peek3D.com
Spectrogram
Frequency (Hz)
Time (seconds)
Violin Recording
courtesy: wikipedia
Spectrogram
https://elvinouyang.github.io/project/training-cnn-for-voice-command/
Types of Images (classification on source)
• Radiation from EM spectrum
• Acoustic/ultrasonic/spectrogram
• Electronic
• Computer generated
Scanning Electron Microscopy
Normal Circulating Human Blood Adhesive on Post-it note
courtesy: National Cancer Institute courtesy: wikipedia
Types of Images (classification on source)
• Radiation from EM spectrum
• Acoustic/ultrasonic/spectrogram
• Electronic
• Computer generated
Computer generated
Happy Birthday Nana
courtesy: Dan Roarty
Scale
Microscopes Telescopes
10-9m 220 kly ≈ 1021m
courtesy: nanolab technologies.com courtesy: wikipedia
Types of Images (classification on optics)
1. Reflection Images 2. Emission Images 3. Absorption Images
Information primarily about Information primarily about Information primarily about
objects surface internal properties internal structure
Types on images (classification on arrangement)
• Grayscale
• RGB
• Multispectral images
• Stereo images
• Multi-view images
Multi spectral images
Courtesy: LANDSAT
Multi spectral images
Courtesy: LANDSAT
Multi spectral images
Courtesy: LANDSAT
Multi spectral images
Courtesy: LANDSAT
Stereo Images
courtesy: wikimedia.com
Stereo Images
courtesy: vision.middlebury.edu
Multi-view images
courtesy: Yasutaka Furukawa courtesy: Sameer Agarwal
RGB-D images
courtesy: kinect and prime sense
Digital Image processing
• Study and Development of algorithms that operate on an image
– To create new image(s)
– To retrieve its attributes
• Consumer-based view
– For consumption by human eyes
– For consumption by machine-based processes
Outputs of these steps are generally images
Outputs of these steps are generally image attributes
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image
Restoration Segmentation
Problem Domain
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Processing
Multi-Resolution Compression Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Wavelets &
Color Image Morphological
Multi-Resolution Compression
Processing Processing
Processing
Image Segmentation
Restoration
Image Image Processing Representation
Enhancement Tasks & Description
Image Object
Acquisition Recognition
Image Processing Tasks
Tasks of interest: Contrast adjustment
Image courtesy: mathworks
Tasks of interest: Edge Detection
Image courtesy: mathworks
Tasks of interest: Feature detection + stitching
Image courtesy: opencv
Image courtesy: autostitch
Tasks of interest: Noise Removal
Total variation denoising [Chambolle JMIV 2004]
Tasks of interest: Haze Removal
Single Image Haze Removal [He et al. CVPR 2009]
Retouch Personal Photos!
©Images taken from the web.
Tasks of interest: Artistic enhancement
Image courtesy: webneel.com Image courtesy: Jon Morse
BW to Color
Mayabazar (1957), Vijaya Vauhini Studios Colorized (2010)
Tasks of interest: Cinematic Grading
Mission Impossible - Ghost Protocol, Paramount Pictures
Image courtesy: juanmelara.com
Tasks of interest: Segmentation
Tumour Segmentation [Yu et al. MICCAI 2010] Cup Segmentation [Joshi and Sivaswamy 2011]
Tasks of interest: Segmentation
Courtesy: Roman Shapovalov
Tasks of interest: Compression
Original Image (1.2 mb) Compressed JPEG Image (100 kb)
Tasks of interest: Inpainting
DAMAGED RESTORED
Bertalmio et al. SIGGRAPH 2010
Tasks of interest: Special effects
courtesy: wachowski brothers (matrix) courtesy: Miller et al. (sin city)
Tasks of interest: Satellite imaging
Terrain classification, weather predictions etc.
courtesy: NASA
Tasks of interest: Astronomy
courtesy: ISRO
Tasks of interest: Biometrics
courtesy: dqindia.com courtesy: heyce.com
Tasks of interest: Medicine
Courtesy: medINRIA Courtesy: Naoki Suzuki
Tasks of interest: Driverless Vehicle Systems
Tasks of interest: Document Image Analysis
Tasks of interest: Many more
• Biology
• HCI
• Number Plate recognition
• Gesture recognition Courtesy: Olympus
Courtesy: Perviverzov et al. 2012 Courtesy: researchdesignlab.com
http://www.cs.toronto.edu/~guerzhoy/320/lec/Introduction.pdf
http://karpathy.github.io/2012/10/22/state-of-computer-vision/
Companies and Software
Research
• Journals
Research
• Conferences
Research
• Conferences
Research
• Conferences
About the course
• Timings: Mon, Thu (11:40am – 1:05pm)
• Pre-requisites
– (CS): Programming, Data Structures, Algorithms
– (MA): Basics of Linear Algebra, Calculus
– (DSP): Minimal sense of signal processing.
• Textbook: Digital Image Processing, Gonzalez and Woods (4th ed)
• Other resources from the internet
About the course
• Teaching Assistants:
– Sanjay P + 3 (will be announced soon)
• Office Hours :
– Mon, Thu: 1:45pm-2:30pm [after class]
About the course: Grading Policy
• Assessment
– Assignments (4) : 24 %
– Quizzes (2) : 16 %
– Mid Sem : 15 %
– End Sem : 20 %
– Group Project : 25 %
About the course: Assignments
• Python (mandatory)
• Github Classroom
• Image Processing / Data Processing Libraries
• scikit-image , opencv-python, kornia
• pandas
• scikit-learn
– Do not use libraries for assignments unless explicitly
allowed.
About the course: Collaboration policy
• OK to discuss questions, approaches
• But work must be your own
– No copying – partially or fully
• If you worked with someone, mention their name(s)
• We will be checking for copying/plagiarism.
– Better to own up than be caught !
About the course – final projects
• Teams of 4
About the course – Project
• Creating resources (datasets, tutorials)
• Replicate an interesting research paper
• Original research
• Comparing different existing algorithms
• Deliverables
– Release the final code (github – code check-in analytics will form basis of marks)
– Give a group presentation
About the course – Grading Policy
• Homework/Project Late Policy: 20% penalty if one day late; 40% if
two days late; 100% if more than two days late
• A one-time late submission bonus:
– only applicable to assignments (with maximum of 3 days delay).
– must adhere to standard late submission policy after using your late submission
bonus.
– No exceptions will be made.
Additionally …
• The course load is fairly substantial
• Elective à Conscious choice
• Starting early on assignments helps
• Spending time everyday on material covered in class helps
How to Learn
• Understand, don’t just memorize
• Understand the theory behind library calls !
• Capture the broad ideas and insights (useful years down the line)
• Implement ! No substitute for experience.
Learning Resources
• PGSSP/MS/PhD
– Take some time understanding the Moodle portal and
other IIIT e-services
– Consider forming e-study / discussion groups
• Reach out to TAs, me in case you have difficulty
with material.
• Feedback need not wait until end of course.
Be there the whole way
• Put away phone and other distractions …
• Do NOT make end of semester travel plans until
project evaluations are completed.
• Book your tickets after
– 30th November 2023
– 5th December if you want to see the final exam paper
Taking Notes
• Take notes actively !
– Improves ability to summarize key ideas (core career skill !)
– Do well in quizzes
– Antidote to sleepiness J
– Tip: Note slide headings – useful for revision/recap
Welcome and Have Fun
• DIP is extremely interesting
– Visual
See you on
– Mathematical
Thursday
– Practical
– Challenging
– Lots of interesting applications