ITECH WORLD AKTU BCS057
ITECH WORLD AKTU
Image Processing (BCS057)
Unit 1: Digital Image Fundamentals
• Steps in Digital Image Processing
• Components
• Elements of Visual Perception
• Image Sensing and Acquisition
• Image Sampling and Quantization
• Relationships between Pixels
• Color Image Fundamentals: RGB, HSI Models
• Two-dimensional Mathematical Preliminaries
• 2D Transforms: DFT, DCT
What is an Image?
An image is a visual representation of an object or a scene. It is formed by capturing
light reflected or emitted from the object onto a two-dimensional surface, such as a
camera sensor or photographic film. In the context of digital image processing, an image
is represented as a matrix of pixel values, where each pixel value corresponds to the
intensity or color information at a particular point.
Example: A digital photograph taken by a camera is an image, where each pixel
value indicates the brightness and color at that point in the photograph.
Types of Images
Images can be categorized into several types based on their characteristics:
1. Binary Images
Binary images contain only two pixel values, typically 0 (black) and 1 (white). They are
used for representing simple shapes and structures.
Example: A scanned document where text is represented in black on a white back-
ground.
1
ITECH WORLD AKTU BCS057
2. Grayscale Images
Grayscale images represent various shades of gray, ranging from black (0) to white (255
in 8-bit images). They contain only intensity information without color.
Example: A black and white photograph.
3. Color Images
Color images use multiple color channels, such as RGB (Red, Green, Blue), to represent
colors at each pixel. Each channel has its own intensity value, and the combination of
these values determines the final color.
Example: A digital photograph taken in color.
4. Indexed Images
Indexed images use a colormap to map pixel values to specific colors. Each pixel value is
an index into a table of colors.
Example: A GIF image with a limited palette of 256 colors.
5. Multispectral Images
Multispectral images capture data across multiple wavelengths of light, such as infrared,
visible, and ultraviolet. They are used in remote sensing and satellite imagery.
Example: Satellite images used for land cover classification.
What is Digital Image Processing?
Digital Image Processing refers to the manipulation of an image using digital techniques to
extract meaningful information or to enhance its visual appearance. It involves applying
mathematical and algorithmic operations on an image to achieve the desired outcomes
such as noise reduction, image enhancement, and feature extraction.
Steps in Digital Image Processing
Digital Image Processing involves several steps to enhance and extract useful information
from an image. The primary steps are:
1. Image Acquisition: Capturing the image using a sensor, such as a camera.
2. Preprocessing: Enhancing the quality of the image by removing noise and ad-
justing contrast.
3. Segmentation: Dividing the image into meaningful parts or regions.
4. Representation and Description: Representing the segmented image in a form
suitable for further analysis, such as boundary descriptors.
5. Recognition and Interpretation: Identifying and interpreting objects within
the image.
2
ITECH WORLD AKTU BCS057
6. Knowledge Base: Utilizing prior information about the problem domain to assist
in processing.
Example: A digital camera captures an image, which is then preprocessed to reduce
noise. The regions of interest are segmented, and features such as edges and textures are
described. Finally, the image is analyzed to recognize objects such as faces or text.
Advantages and Disadvantages of Digital Images
Advantages
• Ease of Storage and Retrieval: Digital images can be easily stored in various
formats and retrieved for further processing or analysis.
• Image Manipulation: Digital images can be enhanced, modified, and analyzed
using various image processing algorithms.
• Transmission: Digital images can be transmitted over networks with minimal loss
of quality.
• Integration with Other Systems: Digital images can be easily integrated with
other data types, such as text and audio, for multimedia applications.
Disadvantages
• Storage Requirements: High-resolution digital images require significant storage
space.
• Loss of Information: Compression techniques can lead to loss of image informa-
tion, affecting the quality of the image.
• Processing Time: Large digital images may require significant processing time
and computational resources for analysis.
3
ITECH WORLD AKTU BCS057
Components
Digital Image Processing involves various components that work together to achieve the
desired image analysis.
• Image Sensors: Devices that capture the image, such as CCD or CMOS sensors
in cameras.
• Image Processing Algorithms: Techniques to process and analyze the image
data, such as filtering and enhancement algorithms.
• Hardware: Computing devices that execute image processing algorithms effi-
ciently.
• Software: Programs and tools that provide an interface for implementing image
processing techniques.
Example: A smartphone camera includes an image sensor and uses software to
enhance the image quality, such as applying filters to improve contrast and sharpness.
Elements of Visual Perception
Human visual perception plays a crucial role in how images are processed and understood.
Key elements include:
4
ITECH WORLD AKTU BCS057
• 1. Light and Color Perception: The way humans perceive colors and brightness,
depending on the wavelength of light.
• 2. Spatial Resolution: The ability to distinguish fine details in an image, influ-
enced by the density of photoreceptors in the retina.
• 3. Contrast Sensitivity: The ability to detect differences in brightness, which
helps in distinguishing objects from the background.
• 4. Depth Perception: The ability to perceive the world in three dimensions and
judge the distance of objects.
• 5. Motion Perception: The ability to detect and interpret movement in the
visual field.
• 6. Visual Acuity: The clarity or sharpness of vision, allowing the recognition of
small or detailed objects.
• 7. Adaptation: The ability of the human visual system to adjust to varying levels
of light, ensuring clear vision in different lighting conditions.
Example: The human eye is more sensitive to changes in brightness than to changes
in color, which is why grayscale images often reveal more detail than colored images.
Image Sensing and Acquisition
Image sensing involves capturing visual information using sensors, while acquisition refers
to converting this information into a digital form that can be processed and stored.
• Image Sensors:
– CCD (Charge-Coupled Device) Sensors: CCD sensors are highly sen-
sitive to light and provide high-quality images with low noise levels. They
are commonly used in scientific and medical imaging applications due to their
superior performance in low-light conditions.
– CMOS (Complementary Metal-Oxide-Semiconductor) Sensors: CMOS
sensors are more power-efficient and faster than CCD sensors. They allow for
on-chip processing and are widely used in consumer electronics, such as smart-
phones and digital cameras.
• Image Formation: The process begins with light from the scene entering through
the lens and focusing onto the sensor array. The lens plays a crucial role in deter-
mining the field of view and the focus of the captured image.
• Conversion to Electrical Signals: The image sensor, which consists of an array
of photosensitive elements (pixels), converts the incident light into electrical signals.
Each pixel generates a signal proportional to the intensity of light falling on it.
5
ITECH WORLD AKTU BCS057
• Digitization: The analog electrical signals from the image sensor are converted
into digital values using an Analog-to-Digital Converter (ADC). This process in-
volves sampling the analog signal at discrete intervals and quantizing the sampled
values into digital numbers, typically represented as a binary code.
• Image Acquisition System: In addition to the sensor and ADC, an image ac-
quisition system may include components like amplifiers, filters, and timing circuits
that ensure accurate signal processing and conversion.
• Image Storage: The digitized image data is stored in memory or transmitted to a
processing unit for further analysis. The format and resolution of the stored image
depend on the application requirements and sensor capabilities.
• Calibration and Correction: Calibration processes like white balance, gamma
correction, and lens distortion correction are applied to the raw image data to ensure
accurate color reproduction and image quality.
Example: In a digital camera, light enters through the lens and strikes the image
sensor, which could be either a CCD or CMOS sensor. The sensor converts the light
into electrical signals, which are then digitized by an ADC. The resulting digital image
is stored in the camera’s memory card, ready for viewing or editing.
6
ITECH WORLD AKTU BCS057
Image Sampling and Quantization
Image sampling and quantization are fundamental steps in converting an analog image
into its digital form. These processes determine the resolution and quality of the digital
image.
• Sampling: Sampling refers to measuring the intensity of the image at discrete
points, both horizontally and vertically. It defines the spatial resolution of the
image, which is the number of pixels used to represent the image. Higher sampling
rates result in better resolution, as more details of the image are captured. However,
it also increases the file size and computational requirements.
• Quantization: Quantization involves mapping the continuous range of intensity
values of the sampled image to a finite set of discrete levels. Each pixel value
is assigned to the nearest quantization level. The number of quantization levels
determines the bit depth of the image. Higher quantization levels provide a more
accurate representation of the image with smoother transitions between shades, but
they also require more storage space.
Example: Consider a 10-megapixel camera that samples the image at 10 million
discrete points. Each sample (pixel) is then quantized into 256 levels of brightness,
corresponding to an 8-bit image depth.
7
ITECH WORLD AKTU BCS057
Differences between Sampling and Quantization
S.No. Sampling Quantization
1 Refers to measuring the image at Refers to assigning a discrete
discrete intervals. value to each sampled intensity.
2 Determines the spatial resolution Determines the bit depth or num-
of the image. ber of intensity levels of the im-
age.
3 Affects the number of pixels in Affects the range of grayscale or
the digital image. color values each pixel can repre-
sent.
4 Higher sampling rate captures Higher quantization levels result
more details and results in a in smoother image representation
larger image size. and larger file sizes.
5 Related to the x and y dimensions Related to the z dimension, rep-
of the image matrix. resenting intensity or color levels.
6 Dependent on the sampling fre- Dependent on the number of
quency or rate. quantization levels (e.g., 256 lev-
els for 8-bit depth).
7 Aliasing occurs if the sampling Quantization error occurs if the
rate is too low. number of levels is insufficient to
represent the image accurately.
Table 1: Differences between Sampling and Quantization
8
ITECH WORLD AKTU BCS057
Relationships between Pixels
Understanding the relationships between pixels is crucial for various im-
age processing tasks such as edge detection, region segmentation, and
image analysis. These relationships help define how pixels interact with
each other and contribute to the overall image structure. Some common
relationships include:Neighbors:
– ∗ Neighbors are the pixels that are directly adjacent to a given pixel. There
are three main types of neighborhoods:
· 4-neighborhood (N4): Consists of the four pixels that share a com-
mon edge with the given pixel. For a pixel at coordinates (x, y), the
4-neighbors are located at (x−1, y), (x+1, y), (x, y −1), and (x, y +1).
· 8-neighborhood (N8): Includes the 4-neighbors as well as the four
diagonal neighbors. The 8-neighbors are located at (x−1, y), (x+1, y),
(x, y − 1), (x, y + 1), (x − 1, y − 1), (x − 1, y + 1), (x + 1, y − 1), and
(x + 1, y + 1).
· m-neighborhood: This is a mixture of the 4-neighbors and diagonal
neighbors, considering specific conditions for connectivity.
– Connectivity:
∗ Connectivity defines how pixels are connected based on their intensity
values and spatial relationships. It is crucial for identifying distinct regions
in an image.
· 4-connectivity: Two pixels are 4-connected if they share an edge
and have the same intensity value.
· 8-connectivity: Two pixels are 8-connected if they share either an
edge or a corner and have the same intensity value.
· Mixed-connectivity (m-connectivity): Two pixels are connected
using a combination of 4-connectivity and 8-connectivity rules, avoid-
ing the ambiguity of diagonal connectivity.
– Adjacency:
∗ Adjacency describes the relationship between pixels that share a common
side or corner. There are different types of adjacency:
· 4-adjacency: Two pixels are 4-adjacent if they share a common side.
· 8-adjacency: Two pixels are 8-adjacent if they share a common side
or a common corner.
9
ITECH WORLD AKTU BCS057
· m-adjacency: A combination of 4-adjacency and 8-adjacency used
to avoid multiple path connections in a binary image.
– Distance Measures:
∗ Distance measures quantify the closeness between pixels. Common dis-
tance measures include:
· Euclidean Distance: The straight-line p distance between two pixels
(x1 , y1 ) and (x2 , y2 ), calculated as d = (x2 − x1 )2 + (y2 − y1 )2 .
· City Block Distance (Manhattan Distance): The distance be-
tween two pixels measured along the grid lines, calculated as d =
|x2 − x1 | + |y2 − y1 |.
· Chessboard Distance: The maximum of the horizontal and vertical
distances, calculated as d = max(|x2 − x1 |, |y2 − y1 |).
– Path:
∗ A path is a sequence of pixels where each consecutive pixel is adjacent
to the next. Paths are used to trace connectivity and boundaries in an
image.
· 4-path: A path that connects 4-adjacent pixels.
· 8-path: A path that connects 8-adjacent pixels.
· m-path: A path that uses mixed-adjacency rules to avoid diagonal
ambiguity.
– Region:
∗ A region is a group of connected pixels with similar properties. Regions
are used in segmentation to separate different parts of an image.
· Connected Components: Pixels that are connected and share the
same intensity value form a connected component or region.
· Region Adjacency Graph (RAG): A graph representation where
nodes represent regions and edges represent the adjacency between
regions.
– Boundary:
∗ The boundary is the set of pixels that separate different regions in an
image. Identifying boundaries is important for edge detection and shape
analysis.
· External Boundary: The boundary between a region and the back-
ground.
· Internal Boundary: The boundary between two adjacent regions.
Example: In a binary image, two adjacent pixels with the same value are consid-
ered connected. For instance, if both pixels have a value of 1 and share a common
edge, they are 4-connected. This concept is used in connected component labeling
to identify distinct objects in an image.
10
ITECH WORLD AKTU BCS057
Color Image Fundamentals
Color image processing involves understanding and manipulating the color informa-
tion of images. Several color models are used to represent colors in different ways
for various applications. Key models include:
– RGB Model:
∗ The RGB model represents colors as combinations of the primary colors
Red, Green, and Blue. Each color is defined by its intensity values of R,
G, and B, ranging from 0 to 255 in an 8-bit representation.
∗ It is widely used in digital displays and imaging devices such as cameras,
monitors, and scanners.
∗ Colors are additive in nature, meaning they are formed by adding the
values of R, G, and B.
– HSI Model:
∗ The HSI model represents colors using three components: Hue (color
type), Saturation (color purity), and Intensity (brightness).
∗ It is more intuitive for human interpretation because it separates color
information (Hue) from brightness (Intensity).
∗ HSI is commonly used in image analysis, computer vision, and color-based
object recognition.
Example: The RGB model is widely used in digital displays and imaging
devices due to its straightforward representation of colors. In contrast, the
HSI model is preferred for image analysis and object recognition because it
separates color information from intensity, making it easier to analyze the
color features of objects independently from their brightness.
11
ITECH WORLD AKTU BCS057
Two-dimensional Mathematical Preliminaries
Understanding and manipulating digital images require fundamental mathe-
matical concepts. Key concepts include:
∗ Matrices:
· An image can be represented as a matrix, where each element of the
matrix corresponds to the intensity or color value of a pixel.
· For a grayscale image, each element contains a single value represent-
ing brightness, whereas, for a color image, each element might contain
multiple values (e.g., RGB components).
∗ Linear Algebra:
· Operations such as matrix multiplication, addition, and scalar multi-
plication are essential in image transformations like rotation, scaling,
and translation.
· Eigenvalues and eigenvectors are used in Principal Component Anal-
ysis (PCA) for image compression and recognition.
1 2D Transforms
2D transforms are crucial in image processing, facilitating various appli-
cations such as compression, filtering, and feature extraction. Key trans-
forms include:
· Discrete Fourier Transform (DFT):
· The DFT converts an image from the spatial domain to the frequency
domain, allowing for the analysis of frequency components.
· It helps identify periodic patterns and frequencies in images, which is
essential for tasks like image filtering and noise reduction.
· The transformation reveals how different frequency components con-
tribute to the overall image, aiding in various processing techniques.
· Discrete Cosine Transform (DCT):
· The DCT decomposes an image into a sum of cosine functions, empha-
sizing lower frequencies while minimizing high-frequency components.
· It is widely used in JPEG compression, where images are divided into
blocks, and the DCT is applied to each block to reduce data storage
requirements.
· By concentrating on significant low-frequency information, the DCT
allows for effective compression while preserving visual quality.
Example: In JPEG compression, the DCT is applied to each 8x8 block of
pixels. High-frequency components, which typically carry less perceptible
detail, are quantized more coarsely. This enables substantial data reduc-
tion while maintaining acceptable image quality during reconstruction.
12
ITECH WORLD AKTU BCS057
13