IMAGE PROCESSING AND COMPUTER VISION
Introduction
Computer vision (CV) and image processing are
two closely related fields that utilize techniques
from artificial intelligence (AI) and pattern
recognition to derive meaningful information
from images, videos, and other visual inputs.
Image processing is at the helm of the technical
revolution that is being called Industry 4.0, laying
the framework for new technologies in image data
processing and object recognition.
What is an Image?
• An image is represented by its dimensions (height and width)
based on the number of pixels. For example, if the dimensions
of an image are 500 x 400 (width x height), the total number of
pixels in the image is 200000.
• A pixel is a point on the image that takes on a specific shade,
opacity or color. It is usually represented in one of the
following:
• Grayscale - A pixel is an integer with a value between 0 and 255
(0 is completely black and 255 is completely white).
• RGB - A pixel is made up of 3 integers between 0 and 255 (the
integers represent the intensity of red, green, and blue).
• RGBA - It is an extension of RGB with an added alpha field,
which represents the opacity of the pixel.
• Image processing applies fixed sequences of operations to each pixel
of an image. The image processor performs the first operation on the
image, pixel by pixel. Once this is complete, it performs the second
operation, and so on.
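The pixel representations and the pixel-by-pixel processing order described above can be sketched in a few lines of Python. This is a minimal illustration, not a real imaging library: the image is modeled as a plain list of rows, the helper names (`pixel_count`, `apply_pixelwise`, `invert`, `threshold`) are hypothetical, and treating an alpha value of 255 as fully opaque is an assumed 8-bit convention.

```python
# Hypothetical helpers; an image is modeled as a list of rows of pixels.

def pixel_count(width, height):
    """Total number of pixels in a width x height image."""
    return width * height

def apply_pixelwise(image, operation):
    """Apply one operation to every pixel, row by row, returning a new image."""
    return [[operation(pixel) for pixel in row] for row in image]

def invert(pixel):
    """Grayscale negative: 0 (black) becomes 255 (white) and vice versa."""
    return 255 - pixel

def threshold(pixel, cutoff=128):
    """Map a grayscale pixel to pure black (0) or pure white (255)."""
    return 255 if pixel >= cutoff else 0

# A 500 x 400 (width x height) image holds 200000 pixels.
total = pixel_count(500, 400)

# One pixel in each representation.
gray_pixel = 200                  # grayscale: one integer in 0..255
rgb_pixel = (255, 128, 0)         # RGB: red, green, blue intensities
rgba_pixel = (255, 128, 0, 255)   # RGBA: RGB plus alpha (assumed 255 = opaque)

# A tiny 2 x 2 grayscale image processed by two operations in sequence:
# the first finishes on every pixel before the second begins.
image = [[0, 100],
         [200, 255]]
inverted = apply_pixelwise(image, invert)
binary = apply_pixelwise(inverted, threshold)
```

Each call to `apply_pixelwise` completes a full pass over the image before the next operation starts, which mirrors the "first operation, then second operation" order described in the last bullet.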
What is Image Processing?
• Image processing involves two methods, namely analog image processing
and digital image processing. Analog image processing covers techniques
for processing photographs, printouts, and other hard copies of
images. In contrast, digital image processing manipulates a
digital image to generate information with the help of complex
algorithms.
• The input for an image processing task is an image. Analog image
processing always requires an image as input, whereas digital image
processing may take images or information associated with an image,
such as features or bounding boxes.
• Image visualization is the representation of the processed data in the form
of visual output for better understanding. This task is mainly done for
objects that are not easy to detect in an image.
• Image sharpening and restoration improve the quality of the image.
• Image search retrieves the source of an image through a query
conducted by an image search engine.
Types of Image Processing
There are five main types of image processing:
• Visualization - Find objects that are not visible in the image
• Recognition - Distinguish or detect objects in the image
• Sharpening and restoration - Create an enhanced image from the original
image
• Pattern recognition - Measure the various patterns around the objects in
the image
• Retrieval - Browse and search images from a large database of digital
images that are similar to the original image.
Components of Image Processing
Computer
• A general-purpose computer, which may be anything from a PC to a
supercomputer, is used in an image processing system. Sometimes,
specifically built computers are utilized in specialized applications to
reach a specified degree of performance.
Hardware for Specialized Image Processing
• It comprises the digitizer and hardware that can carry out basic
operations, including an Arithmetic Logic Unit (ALU), which can carry
out simultaneous arithmetic and logical operations on whole pictures.
Mass Storage
• The three main types of digital storage for image processing
applications are as follows: (1) short-term storage, (2) online storage
for quick recall, and (3) archival storage, which is characterized by rare
access.
Camera Sensors
• The image sensor's primary function is to collect incoming light,
transform it into an electrical signal, measure that signal, and then
output it to supporting electronics. It consists of a two-dimensional
array of light-sensitive components that convert photons into electrons.
Image Display
• The display shows the processed pictures, typically on a monitor.
Software
• The image processing software comprises specialized modules that
carry out particular functions.
Hardcopy Equipment
• Laser printers, film cameras, heat-sensitive equipment, inkjet printers,
and digital media such as optical and CD-ROM discs are just a few
examples of the instruments used to record pictures.
Networking
• Networking is a necessary component for sending visual data between
networked computers. The most important factor in picture transmission is
bandwidth, since image processing applications involve vast amounts of
data.
Fundamental Image Processing Steps
Image Acquisition
• Image acquisition is the first step in image processing. This step is also
known as preprocessing in image processing. It involves retrieving the
image from a source, usually a hardware-based source.
Image Enhancement
• Image enhancement is the process of bringing out and highlighting
certain features of interest in an image that has been obscured. This can
involve changing the brightness, contrast, etc.
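As a rough illustration of changing brightness and contrast, the sketch below applies a linear transform to every grayscale pixel and clips the result to the 0..255 range. The function name `adjust` and its parameters are hypothetical, and a linear transform is only one of many possible enhancement methods.

```python
def adjust(image, contrast=1.0, brightness=0):
    """Return a new grayscale image with contrast * pixel + brightness,
    clipped to the valid 0..255 range."""
    def clip(value):
        return max(0, min(255, int(round(value))))
    return [[clip(contrast * pixel + brightness) for pixel in row]
            for row in image]

# A dim 2 x 2 image: raise the contrast by 50% and brighten by 10 levels.
dim = [[100, 120],
       [140, 250]]
enhanced = adjust(dim, contrast=1.5, brightness=10)
```

A contrast factor above 1 spreads pixel values apart, and the clipping step keeps very bright pixels from exceeding 255.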
Image Restoration
• Image restoration is the process of improving the appearance of an image.
However, unlike image enhancement, image restoration is done using
certain mathematical or probabilistic models.
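One simple example of a model-based approach is a median filter, which assumes the degradation is impulse ("salt-and-pepper") noise: isolated pixels that are far brighter or darker than their neighbors. The sketch below is an illustration under that assumption, with a hypothetical function name and a 3 x 3 window that is simply cut off at the image border.

```python
from statistics import median

def median_filter(image):
    """Replace each pixel with the median of its 3 x 3 neighborhood.
    Neighborhoods are truncated at the image border."""
    height, width = len(image), len(image[0])
    restored = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            neighbors = [image[ny][nx]
                         for ny in range(max(0, y - 1), min(height, y + 2))
                         for nx in range(max(0, x - 1), min(width, x + 2))]
            restored[y][x] = int(median(neighbors))
    return restored

# A flat gray 5 x 5 image corrupted by one white and one black pixel.
noisy = [[100] * 5 for _ in range(5)]
noisy[2][2] = 255   # "salt"
noisy[0][4] = 0     # "pepper"
restored = median_filter(noisy)
```

Because each outlier is outnumbered by its neighbors, the median discards it, while a plain average would smear it into the surrounding pixels.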
Color Image Processing
• Color image processing includes a number of color modeling techniques
in a digital domain. This step has gained prominence due to the
significant use of digital images over the internet.
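A small example of working with a color model is converting an RGB pixel to a single grayscale intensity. The sketch below uses the widely used luma weights 0.299, 0.587 and 0.114 (from the ITU-R BT.601 standard); the function name is hypothetical, and other weightings exist.

```python
def rgb_to_gray(pixel):
    """Convert one (red, green, blue) pixel to a grayscale value in 0..255
    using the BT.601 luma weights."""
    red, green, blue = pixel
    return int(round(0.299 * red + 0.587 * green + 0.114 * blue))

white = rgb_to_gray((255, 255, 255))
black = rgb_to_gray((0, 0, 0))
pure_red = rgb_to_gray((255, 0, 0))
pure_green = rgb_to_gray((0, 255, 0))
pure_blue = rgb_to_gray((0, 0, 255))
```

Green receives the largest weight because the eye is most sensitive to it, so a pure green pixel maps to a lighter gray than a pure red or pure blue one.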
Wavelets and Multiresolution Processing
• Wavelets are used to represent images in various degrees of resolution.
The images are subdivided into wavelets or smaller regions for data
compression and for pyramidal representation.
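The idea can be illustrated with the simplest wavelet, the Haar wavelet, applied to a single row of pixels. This is a minimal sketch with hypothetical function names: it uses unnormalized averages and differences and assumes the row length is a power of two.

```python
def haar_step(signal):
    """One Haar level: pairwise averages (coarse version) and pairwise
    half-differences (detail needed to undo the averaging)."""
    averages = [(signal[i] + signal[i + 1]) / 2
                for i in range(0, len(signal), 2)]
    details = [(signal[i] - signal[i + 1]) / 2
               for i in range(0, len(signal), 2)]
    return averages, details

def haar_inverse(averages, details):
    """Rebuild the original samples from averages and details."""
    signal = []
    for average, detail in zip(averages, details):
        signal.extend([average + detail, average - detail])
    return signal

def pyramid(signal):
    """Successively coarser versions of the row, down to one value."""
    levels = [signal]
    while len(levels[-1]) > 1:
        averages, _ = haar_step(levels[-1])
        levels.append(averages)
    return levels

row = [10, 12, 200, 202, 50, 50, 0, 4]
averages, details = haar_step(row)
levels = pyramid(row)
```

The detail values are small wherever neighboring pixels are similar, which is what makes them cheap to store, and the list returned by `pyramid` is the same row at resolutions 8, 4, 2 and 1.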
Compression
• Compression is a process used to reduce the storage required to save an
image or the bandwidth required to transmit it. This is done particularly
when the image is for use on the Internet.
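As a toy illustration of how storage can be reduced, the sketch below applies run-length encoding to one row of pixels. This is only one simple lossless scheme chosen for clarity; it is not claimed to be what common web image formats use, and the function names are hypothetical.

```python
def rle_encode(pixels):
    """Collapse consecutive equal pixels into (value, run_length) pairs."""
    runs = []
    for pixel in pixels:
        if runs and runs[-1][0] == pixel:
            runs[-1][1] += 1
        else:
            runs.append([pixel, 1])
    return [tuple(run) for run in runs]

def rle_decode(runs):
    """Expand (value, run_length) pairs back into the original pixels."""
    pixels = []
    for value, count in runs:
        pixels.extend([value] * count)
    return pixels

# A row with long runs of white and black compresses well.
row = [255] * 6 + [0] * 3 + [255] * 3
encoded = rle_encode(row)
decoded = rle_decode(encoded)
```

Twelve pixel values shrink to three pairs here, and decoding recovers the row exactly. Rows with little repetition would not shrink at all under this scheme.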
Morphological Processing
• Morphological processing is a set of processing operations for morphing
images based on their shapes.
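Two basic morphological operations are erosion, which shrinks shapes, and dilation, which grows them. The sketch below works on a binary image (1 = object, 0 = background) with a 3 x 3 square neighborhood. The function names are hypothetical, and treating pixels outside the image as background is an assumption.

```python
def neighborhood(image, y, x):
    """The 3 x 3 neighborhood of (y, x); outside the image counts as 0."""
    height, width = len(image), len(image[0])
    return [image[ny][nx] if 0 <= ny < height and 0 <= nx < width else 0
            for ny in (y - 1, y, y + 1)
            for nx in (x - 1, x, x + 1)]

def erode(image):
    """Keep a pixel only if its whole neighborhood belongs to the object."""
    return [[1 if all(neighborhood(image, y, x)) else 0
             for x in range(len(image[0]))]
            for y in range(len(image))]

def dilate(image):
    """Set a pixel if any pixel in its neighborhood belongs to the object."""
    return [[1 if any(neighborhood(image, y, x)) else 0
             for x in range(len(image[0]))]
            for y in range(len(image))]

# A 3 x 3 square of object pixels centered in a 5 x 5 image.
square = [[0, 0, 0, 0, 0],
          [0, 1, 1, 1, 0],
          [0, 1, 1, 1, 0],
          [0, 1, 1, 1, 0],
          [0, 0, 0, 0, 0]]
eroded = erode(square)     # only the center pixel survives
opened = dilate(eroded)    # dilation grows it back to the 3 x 3 square
```

Erosion followed by dilation is called opening. It removes specks smaller than the neighborhood while leaving larger shapes such as this square intact.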
Segmentation
• Segmentation is one of the most difficult steps of image processing. It
involves partitioning an image into its constituent parts or objects.
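A very simple way to partition an image into objects is to threshold it and then group touching bright pixels into connected regions. The sketch below is an illustration only: the function name, the cutoff of 128 and the use of 4-connectivity are assumptions, and real scenes usually need more robust methods.

```python
from collections import deque

def segment(image, cutoff=128):
    """Label each 4-connected group of pixels >= cutoff with 1, 2, 3, ...
    Background pixels keep the label 0. Returns (labels, region_count)."""
    height, width = len(image), len(image[0])
    labels = [[0] * width for _ in range(height)]
    region_count = 0
    for y in range(height):
        for x in range(width):
            if image[y][x] >= cutoff and labels[y][x] == 0:
                # Found an unlabeled object pixel: flood-fill its region.
                region_count += 1
                labels[y][x] = region_count
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < height and 0 <= nx < width
                                and image[ny][nx] >= cutoff
                                and labels[ny][nx] == 0):
                            labels[ny][nx] = region_count
                            queue.append((ny, nx))
    return labels, region_count

image = [[200, 200, 0, 0],
         [200, 0, 0, 255],
         [0, 0, 255, 255]]
labels, region_count = segment(image)
```

The result is a label map with the same shape as the image, in which every pixel of the first object carries the value 1 and every pixel of the second carries the value 2.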
Representation and Description
• After an image is segmented into regions in the segmentation process,
each region is represented and described in a form suitable for further
computer processing. Representation deals with the image’s
characteristics and regional properties. Description deals with extracting
quantitative information that helps differentiate one class of objects from
the other.
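As a sketch of what quantitative information can look like, the code below computes three common descriptors of a region from a label map: its area, bounding box and centroid. The function name and the dictionary layout are hypothetical, the small label map is written by hand for illustration, and the region is assumed to contain at least one pixel.

```python
def describe_region(labels, label):
    """Area, bounding box (x_min, y_min, x_max, y_max) and centroid (x, y)
    of all pixels in the label map that carry the given label."""
    coords = [(y, x)
              for y, row in enumerate(labels)
              for x, value in enumerate(row)
              if value == label]
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    area = len(coords)
    return {
        "area": area,
        "bounding_box": (min(xs), min(ys), max(xs), max(ys)),
        "centroid": (sum(xs) / area, sum(ys) / area),
    }

# Hand-written label map: 0 is background, 1 and 2 are two regions.
label_map = [[1, 1, 0, 0],
             [1, 0, 0, 2],
             [0, 0, 2, 2]]
first = describe_region(label_map, 1)
second = describe_region(label_map, 2)
```

Descriptors like these turn each region into a handful of numbers that can be compared between objects.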
Recognition
• Recognition assigns a label to an object based on its description.
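One minimal way to assign a label from a description is to compare the object's descriptor with a stored prototype for each class and pick the closest one. Everything in the sketch below is hypothetical: the class names, the two-number descriptor (aspect ratio, fill ratio) and the prototype values are made up for illustration.

```python
import math

# Hypothetical prototypes: (aspect ratio, fill ratio of the bounding box).
PROTOTYPES = {
    "coin": (1.0, 0.79),
    "pencil": (8.0, 0.90),
}

def recognize(descriptor, prototypes):
    """Return the label whose prototype is nearest to the descriptor
    (Euclidean distance)."""
    return min(prototypes,
               key=lambda name: math.dist(descriptor, prototypes[name]))

round_object = recognize((1.1, 0.75), PROTOTYPES)
long_object = recognize((7.0, 0.95), PROTOTYPES)
```

In practice the descriptors and the decision rule are usually learned from labeled examples rather than written by hand.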
Applications of Image Processing
Medical Image Retrieval
Image processing has been extensively used in medical research and has
enabled more efficient and accurate treatment plans. For example, it can
be used for the early detection of breast cancer using a sophisticated
nodule detection algorithm in breast scans. Since medical usage calls for
highly trained image processors, these applications require significant
implementation and evaluation before they can be accepted for use.
Traffic Sensing Technologies
Traffic sensors use a video image processing system (VIPS).
This consists of a) an image capturing system, b) a telecommunication
system, and c) an image processing system. When capturing video, a VIPS
has several detection zones, which output an “on” signal whenever a
vehicle enters the zone and an “off” signal whenever the
vehicle exits the zone. These detection zones can be set up for
multiple lanes and can be used to sense the traffic at a particular station.
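The on/off behavior of a detection zone can be sketched as follows. This is a simplified illustration, not how any particular VIPS works: it assumes grayscale frames, a fixed background image of the empty road, and a hypothetical threshold on the mean pixel difference inside the zone.

```python
def zone_occupied(frame, background, zone, threshold=30):
    """True if the mean absolute difference from the background inside
    the zone (x0, y0, x1, y1; end-exclusive) exceeds the threshold."""
    x0, y0, x1, y1 = zone
    diffs = [abs(frame[y][x] - background[y][x])
             for y in range(y0, y1)
             for x in range(x0, x1)]
    return sum(diffs) / len(diffs) > threshold

def zone_signals(frames, background, zone, threshold=30):
    """Emit "on" when a vehicle enters the zone and "off" when it leaves."""
    signals = []
    occupied = False
    for frame in frames:
        now_occupied = zone_occupied(frame, background, zone, threshold)
        if now_occupied and not occupied:
            signals.append("on")
        elif occupied and not now_occupied:
            signals.append("off")
        occupied = now_occupied
    return signals

# Empty road (all pixels 50) and a frame with a bright vehicle in the zone.
background = [[50] * 4 for _ in range(4)]
empty = [row[:] for row in background]
vehicle = [row[:] for row in background]
for y in range(1, 3):
    for x in range(1, 3):
        vehicle[y][x] = 200

zone = (1, 1, 3, 3)
signals = zone_signals([empty, vehicle, vehicle, empty], background, zone)
```

One such zone per lane gives a per-lane stream of on/off events, from which vehicle counts can be derived.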
Image Reconstruction
• Image processing can be used to recover and fill in the missing or
corrupt parts of an image. This involves using image processing
systems that have been trained extensively with existing photo datasets
to create newer versions of old and damaged photos.
Face Detection
• One of the most common applications of image processing that we use
today is face detection. It uses deep learning algorithms in which the
machine is first trained on the specific features of human faces, such
as the shape of the face, the distance between the eyes, etc. After
learning these features, the machine accepts all objects in an image
that resemble a human face. Face detection is a vital tool used in
security, biometrics and even the filters available on most social
media apps these days.
Computer vision
Computer vision generally refers to the technologies that allow
computers to make sense of images. The most common application of this
is image recognition, a process that enables the identification of
objects and image features.
Image recognition is used in numerous applications today, such as
medical imaging, security surveillance, facial recognition, and the
identification of logos and buildings, to name a few. However, for these
models to work, the images first need to be labeled, segmented, or put
through the other processing steps described earlier.
Computer vision has been around since the 1960s. Recent
developments:
• Increasing availability of cheap, powerful cameras (e.g. digital cameras,
webcams) and other sensors.
• Increasing availability of massive amounts of image and multimedia
content on the web (e.g. face databases, streaming video or image-based
communication).
• Increasing availability of cheap, powerful computers (processor speed
and memory capacity).
• Introduction of techniques from machine learning and statistics
(complex, data-driven models and algorithms).
Three related areas:
1. Computer graphics: representation of a 3D scene in 2D image(s).
2. Computer vision: recovery of information about the 3D world from 2D
image(s); the inverse problem of computer graphics.
3. Image processing: operating on one image to produce another image (e.g.
denoising, deblurring, enhancement), in particular in medical imaging.
Getting Started with Image Recognition Models
Some of the most valuable packages to utilize for Computer Vision and
Image Processing include:
• imutils
• OpenCV
• Dlib
• Scikit-learn
• Scikit-image
• TensorFlow
• Keras
• MXNet
• Fastai
• Pytesseract
• PyTorchCV
Uses of computer vision
Defect Inspection
Image recognition has contributed positively to manufacturing. Its
primary task has been to identify defective items during
the manufacturing process. The ability to quickly examine thousands of
items on the assembly line for defects speeds up the overall process and
makes operations more efficient.
Image Classification
Perhaps the most crucial part of image recognition, and the subject of
much research, is image classification. Helping
doctors find a region of interest for detecting and predicting a
particular disease has been the focus of several studies in recent years.
Image classification has also been a critical contributor in e-commerce,
enhancing the user experience with quick search. Image
classification categorizes images according to their content. It
is part of most of the recommendation systems and image retrieval engines
that we use today.
Autonomous Driving
Autonomous driving technology has yet to reach its full
potential before being allowed commercially. However, capabilities such
as detecting pedestrians and stopping at a stop sign have become possible
by incorporating image recognition into computer vision techniques.
Robotics
Image recognition has been part of many robotics projects, where robots
are trained to identify objects for better navigation and to detect
objects that may lie in their path.
Text Detection
Text detection is yet another promising contribution of
image recognition. It makes it possible to detect text and characters
in an image, such as a photograph that includes a street sign or a
traffic sign. Cloud Vision by Google is one of
the prominent services in the field of text detection.
Facial Recognition
With the emergence of AI, facial recognition has become possible. From
securing devices to surveillance, facial recognition is in strong
demand due to its potential. However, several experts are
questioning the privacy aspects of the technology. Every technology
has some limitations, so facial recognition needs to be implemented
properly if it is to support essential services such as traffic and
city surveillance.
e-Commerce
In e-commerce, an uploaded image is transformed into a visual embedding,
and the recommendations are then products similar to the one
uploaded.
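The embedding-based recommendation step can be sketched with cosine similarity. In a real system the embeddings would come from a trained model; the three-number vectors, product names and function names below are all hypothetical and only illustrate the ranking step.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def recommend(query, catalog, top_k=2):
    """Names of the top_k catalog items most similar to the query."""
    ranked = sorted(catalog,
                    key=lambda name: cosine_similarity(query, catalog[name]),
                    reverse=True)
    return ranked[:top_k]

# Hypothetical embeddings for three catalog products.
catalog = {
    "red sneaker": [0.9, 0.1, 0.0],
    "blue sneaker": [0.7, 0.3, 0.1],
    "leather belt": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of the image a shopper uploaded.
uploaded = [0.8, 0.2, 0.0]
recommendations = recommend(uploaded, catalog)
```

Products whose embeddings point in nearly the same direction as the uploaded image rank first, so the two sneakers are recommended ahead of the belt.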