KEMBAR78
Chapter One | PDF | Data Structure | Queue (Abstract Data Type)
0% found this document useful (0 votes)
22 views64 pages

Chapter One

The document introduces data structures and algorithms, outlining their importance in developing efficient programming solutions. It covers various data structures such as arrays, linked lists, stacks, queues, trees, and graphs, along with their operations and classifications. Additionally, it discusses algorithm design techniques and the analysis of algorithm complexity, including time and space considerations.

Uploaded by

ridam adhikari
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views64 pages

Chapter One

The document introduces data structures and algorithms, outlining their importance in developing efficient programming solutions. It covers various data structures such as arrays, linked lists, stacks, queues, trees, and graphs, along with their operations and classifications. Additionally, it discusses algorithm design techniques and the analysis of algorithm complexity, including time and space considerations.

Uploaded by

ridam adhikari
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 64

CHAPTER 1:

INTRODUCTION TO DATA
STRUCTURE AND
ALGORITHM
BIBHA STHAPIT
ASST. PROFESSOR
IoE, PULCHOWK CAMPUS
SYLLABUS OUTLINE
• 1. Introduction
• 2. Stack and Recursion
• 3. Queues
• 4. Linked List
• 5. Trees

Chapter 1
• 6. Graphs
• 7. Sorting
• 8. Searching

• Text Book: Data Structures using C and C++ -> Langsam,


Augenstein and Tanenbaum 2
Introduction to Data Structure
• NEED??
• To apply the best practices for developing efficient solutions
using computer programming.

• To understand the relevant concepts and decisions to apply

Chapter 1
the best fit data structure for the particular requirements.

• It deals with architecting, designing, developing and


optimizing the applications using computer programming.

• Data structure helps the programmers to develop effective 3


and reliable solutions.
Data Structure
• To know about the data structure, we’ll have to
know about the computer programming

Some mysterious
Input Output
processing

Chapter 1
Example :
• Data structure for storing data of students:-
• Arrays , Linked Lists
• Issues
• Space needed
• Operations efficiency (Time required to complete 4
operations)
• Retrieval, Insertion, Deletion
Data Structure
• Data structures let the input and output be
represented in a way that can be handled efficiently
and effectively.
array

Chapter 1
Linked list

queue
5

tree stack
Data Structure
• Data :
• Taken as input, processed within a computer, and provided as
output as information
• Can be “atomic” or “composite”
• Atomic data takes single value like integer 123

Chapter 1
• Composite data which can be further broken down to
subfields like student record consists of roll_no, name, faculty.

• Data type:
• Kind of data that a variable can store.
• Built-in data types are system defined data types like int, float.
6
• User-defined data type are created according to the need of
the program like structure, union, class.
Data Structure
• Data Structure:
• Particular way of storing and organizing data in a computer so
that it can be used efficiently
• Representation of the logical relationship existing between
individual elements of data.
• In other words, a data structure is a way of organizing all data

Chapter 1
items that considers not only the elements stored but also their
relationship to each other.
• Hence in brief, a data structure :
• Depicts the logical representation of data in computer memory
• Represents the logical relationship between various data
elements 7
• Helps in efficient manipulation of stored data elements
• Allows programs to process data in efficient manner
Types of Data Structure
• Static and dynamic data structure
• Static can store data upto fix number/size. E.g.
Array
• Dynamic allows to change its size during program

Chapter 1
execution. E.g. Linked-list

• Linear and non-linear data structure


• In linear, data is stored in consecutive memory i.e.
every element has unique predecessor and unique
successor. E.g. Array, linked list, stack, queue 8
• In non-linear, data is stored in non-consecutive
memory. Eg. Tree, graph
Types of Data Structure
• Sequential and direct/random access data structure
• In sequential, we must access preceding (n-1) data to
access nth data. E.g. Linked list
• In direct access, we can directly access nth element.

Chapter 1
E.g. array

• Homogeneous and non-homogeneous data structure


• Homogeneous consist same type of data. E.g. Array
• Heterogeneous consists data of different types. E.g..
Structures, class 9
Types of Data Structure
• Primitive and non-primitive data structure
• Primitive data structures are generally primary or
built-in types and are directly operated upon by
machine instructions. E.g. integers, floating points,

Chapter 1
characters
• Non-primitive emphasizes on structuring of
homogeneous or heterogeneous data. E.g. Array,
lists, linked list, tree

10
Types of Data Structure
Data Structure

Primitive/ Non Primitive /


Physical logical

Chapter 1
Non
Linear
Linear

Tree
Stack

Graph
Queue
11
Linked
lists
Overview of different Data Structures
• Arrays:
• Collection of similar data elements
• Elements are stored in consecutive memory.
• Have random access to data

Chapter 1
• Application:
• Used in program that require storing
large collection of similar data

12
Overview of different Data Structures
• Linked list
• It provides a flexible storage system by dynamically
assigning the required memory.
• Linked lists are special list of some data elements linked
to one another.

Chapter 1
• An element of list must contain at least two fields, one
for storing data or information and other for storing
address of next element.
• Application:
• Used wherever dynamic memory allocation is needed

13
Overview of different Data Structures
• Stack:
• List of elements with insertions and deletions
permitted at one end only – called stack top
• A stack data structure exhibits the LIFO (last in first out)
property

Chapter 1
• Application:
• System processes such as compilation and program control

14
Overview of different Data Structures
• Queue
• A list of elements with insertions permitted at one
end—called the rear, and deletions permitted from the
other end—called the front
• a queue data structure exhibits the FIFO (first in first out)

Chapter 1
property.
• Application:
• Used in CPU scheduling, resource sharing

15
Overview of different Data Structures
• Trees:
• Non- linear data structure
• Arranges its nodes in the form of hierarchal tree
structure
• Application:

Chapter 1
• Used for storing hierarchical data, implementing search trees,
maintaining sorted data.

16
Overview of different Data Structures
• Graphs:
• A graph is a structure made of two components: a set of
vertices V, and a set of edges E
• Application:
• Modelling of networking systems, costing of network paths

Chapter 1
17
Operations on Data Structures
• Creating
• Make/Create new data structure
• Inserting
• Adding new record
• Traversing
• Accessing each record of data structure exactly once

Chapter 1
• Searching
• Identifying location of record that consists specific key value
• Deleting
• Removing existing record
• Sorting
• Arranging data in specific order
• Merging
• Combining two different sorted records to produce single sorted data
set. 18
Abstract Data Type (ADT)
• Data abstraction
• Asks you to think what you can do to a collection of
data independently of how you do it
• Allows you to develop each data structure in relative
isolation from the rest of the solution

Chapter 1
• An abstract data type (ADT) is composed of
• A collection of data
• A set of operations on that data

• Specifications of an ADT indicate what the ADT operations


do (but not how to implement them) 19
• Implementation of an ADT includes choosing a particular
data structure
Abstract Data Type (ADT)
• An ADT consists of
• Declaration of data
• Declaration of operations
• Encapsulation of data and operations : data is hidden
from user and can be manipulated only by means of
operations`

Chapter 1
20
Abstract Data Type (ADT)
• To implement an ADT, you need to choose:
• A data representation (VALUE DEFINITION)
• must be able to represent all necessary values of the ADT
• should be private

Chapter 1
• An algorithm for each of the necessary operation
(OPERATOR DEFINITION):
• must be consistent with the chosen representation
• all auxiliary (helper) operations that are not in the contract
should be private

21
Abstract Data Type (ADT)
Array as ADT:
//value definition
abstract typedef <element_type, index_type> Array
//definition clause

Chapter 1
element_type A [max_size]
index_type i
//condition clause
index_type = = integer

22
Abstract Data Type (ADT)
//operator definitions
abstract <element_type> Extract (A, i) //written as A[i]
integer i
Array A
//precondition
0 <= i <max_size

Chapter 1
//postcondition
Extract = = A[i]

abstract <element_type> Store(A, i, value)


Array A
index_type i
element_type value
//precondition
0 <= i <max_size 23
//postcondition
A[i] = = value
Abstract Data Type (ADT)

Chapter 1
24
Algorithm
• Once the data structure of particular application is chosen,
an algorithm must be developed to manipulate related
data items stored in it.
• It is a precise plan for performing sequence of actions
• Definition:

Chapter 1
• It is step-by-step finite set of instructions to solve a
well-defined computational problem.
• Sum of Two Numbers:
• step 1 − START ADD
• step 2 − get values of a & b
• step 3 − c ← a + b
• step 4 − display c 25
• step 5 − STOP
Algorithm
• Every algorithm must satisfy following criteria –

• input: there are zero or more quantities which are externally supplied;

• output: at least one quantity is produced;

Chapter 1
• definiteness: each instruction must be clear and unambiguous;

• finiteness: if we trace out the instructions of an algorithm, then for all


cases the algorithm will terminate after a finite number of steps;

• effectiveness: every instruction must be sufficiently basic that it can in


principle be carried out by a person using only pencil and paper. It is
26
not enough that each operation be definite as in (iii), but it must also
be feasible.
Algorithm design techniques
• Top down approach
• dividing the complex algorithm into one or more modules.
• These modules can further be decomposed into one or more
sub-modules, and this process of decomposition is iterated
until the desired level of module complexity is achieved.

Chapter 1
• Bottom up approach
• designing the most basic or concrete modules and then
proceed towards designing higher level modules.
• The higher level modules are implemented by using the
operations performed by lower level modules.
• Thus, in this approach sub-modules are grouped together to 27
form a higher level module.This process is repeated until the
design of the complete algorithm is obtained
Algorithm design techniques
• Greedy
• works by taking a decision that appears the best at the
moment, without thinking about the future.
• A greedy algorithm solution isn't necessarily an overall
optimal solution since it only goes from one best solution
to the next. Additionally, there is no backtracking involved
if it chooses the wrong option or step.

Chapter 1
• Djikstra’s algorithm, Prim’s algorithm

• Divide and conquer


• This algorithm solves a problem by dividing the original
problem into smaller chunks which are similar sub-problems.
• The solutions of these smaller sub problems are 28
later combined to get the solution of the given original
problem. E.g.Quick sort and merge sort
Algorithm design techniques
• Brute Force
• goes through all the possible solutions one after
another until you find the optimum solution.
• Linear Search algorithm

Chapter 1
• Backtracking
• finds all the possible combinations of a solution and
evaluates if it isn't optimal. If it isn't, the algorithm
backtracks and starts evaluating other solutions.
• Backtracking algorithms share a common approach
with the brute force algorithm design technique but
are much faster than brute-force algorithms. 29
• Depth-first recursive search in a tree
Algorithm design techniques
• Branch and bound
• The first solution is remembered by the algorithm and
set as a benchmark. It returns the original or the first
solution if no solution is found after the algorithm
completes the whole search space.

Chapter 1
• Travelling salesman problem

• Simple Recursive
• Process in which a problem is defined in terms of itself.
• The problem is solved by repeatedly breaking it into
smaller problems, which are similar in nature to the
30
original problems.
• Fibonacci series, Tower of Hanoi
Algorithm design techniques
• Dynamic programming
• solves problems that have overlapping sub-problems.
Therefore, they are well-suited for problems where certain
sub-problems get solved repeatedly.
• optimizes the solution by storing the answers to sub-

Chapter 1
problems in an optimal structure and retrieving them when
needed.
• for example, generating a Fibonacci sequence. If we found
the 4th number, we must have found all the ones before.
Therefore, they are handy for finding the 5th number.

31
Algorithm design techniques
• Randomized
• Here, random numbers are used to make some decisions.
These randomized algorithms are approximated using
a pseudo – random number generator.
• Quick sort using random number for pivot element, Monte-

Chapter 1
Carlo algorithm

32
Analysis of algorithm
• W h y we should analyze algorithms?
• Predict the resources that the algorithm requires
• Computational time (CPU consumption)
• Memory space (RAM consumption)
Communication bandwidth consumption

Chapter 9

• T h e running time of an algorithm is:


• The total number of primitive operations executed
(machine independent steps)
• Also known as algorithm complexity
33
Analysis of algorithm
• The two factors of Algorithm Complexity are:
• Time Factor: Time is measured by counting the
number of key operations such as comparisons
in the sorting algorithm.

Chapter 1
• Space Factor: Space is measured by counting
the maximum memory space required by the
algorithm.

34
Analysis of algorithm
• Therefore the complexity of an algorithm can be divided
into two types:
• Space Complexity:
• Space complexity of an algorithm refers to the
amount of memory that this algorithm requires to

Chapter 1
execute and get the result. This can be for inputs,
temporary operations, or outputs.
• Time Complexity:
• Time complexity of an algorithm refers to the amount
of time that this algorithm requires to execute and get
the result. This can be for normal operations,
35
conditional if-else statements, loop statements, etc.
Time complexity
Worst case running time
• It assures that the algorithm will never go beyond this limit
• Average case running time
• It assumes that all inputs of given size are equally likely
• Best case running time

Chapter 1
• It is used to analyse an algorithm under optimal condition

• Fo r exa m p l e : Sequential search in a list of size n


• Worst-case: ncomparisons
• Best-case: 1comparison
36
• Average-case: n/2comparisons
Time-space trade-off
• A time space tradeoff is a situation where the memory use
can be reduced at the cost of slower program execution
(and, conversely, the computation time can be reduced at
the cost of increased memory use).

Chapter 1
• A s the relative costs of CPU cycles, RAM space, and hard
drive space change—hard drive space has for some time
been getting cheaper at a much faster rate than other
components of computers[citation needed]—the
appropriate choices for time space tradeoff have changed
radically.
37
• Often, by exploiting a time space tradeoff, a program can be
made to run much faster.
How do we compare algorithms?
• We need to define a number of objective measures.
(1) Compare execution times?
Not good: times are specific to a particular computer !!

Chapter 1
(2) Count the number of statements executed?
Not good: number of statements vary with the
programming language as well as the style of the individual
programmer.

38
Ideal Solution
• Express running time as a function of the input size n
(i.e., f(n)).
• Compare different functions corresponding to
running times.

Chapter 1
• Such an analysis is independent of machine time,
programming style, etc.

39
Asymptotic Analysis
• To compare two algorithms with running
times f(n) and g(n), we need a rough measure
that characterizes how fast each function
grows.

Chapter 1
• Hint: use rate of growth
• Compare functions in the limit, that is,
asymptotically!
(i.e., for large values of n)
40
Growth Rate
• Growth rate: How fast f(x) increases as x increases
• f(x)=2x: constant growth rate (slope is 2)
• : growth rate increases as x increases (it
grows exponentially as well!)
• : growth rate decreases as x increases

Chapter 1
41
Growth Rate
• Consider the example of buying server and keyboard:
Cost: cost_of_server+ cost_of_keyboard
Cost ~ cost_of_server(approximation)
• The low order terms in a function are relatively insignificant
for large n

Chapter 1
n4 + 100n2 + 10n + 50 ~ n4

i.e., we say that n4 + 100n2 + 10n + 50 and n4 have the same


rate of growth

42
Asymptotic notation
• Algorithm complexity is rough estimation of the
number of steps performed by given computation
depending on the size of the input data

Chapter 1
• Asymptotic means the line that tends to converge to a
curve which may/may not eventually touch the curve
but stays within bounds.

43
Asymptotic notation
• Asymptotic notation analyses the time complexity of an
algorithm

• It is simply a function f(n) where n is the input size.

Chapter 1
• Using the order of growth, we examine how fast the running
time of algorithm increases when input size increases.

• Examples:
Linear complexity O(n) – all elements are processed once (or
constant number of times)
Quadratic complexity O(n2) – each of the 44
elements is processed ntimes
Big –O notation

• Definition: f(n) = O(g(n)) iff there are two positive constants c


and n₀ such that

Chapter 1
|f(n)| ≤ c |g(n)| for all n ≥ n₀

• If f(n) is non-negative, we can simplify the last condition to


0 ≤ f(n) ≤ cg(n) for all n ≥ n₀

• We say that “f(n) is big-O of g(n).”

• As n increases, f(n) grows no faster than g(n). 45

• In other words, g(n) is an asymptotic upper bound on f(n).


Big –O notation
• T h e running time is O(n2) means there is a function f(n) that is O(n2)
such that for any value of n, no matter what particular input of size
n is chosen, the running time of that input is bounded from above
by the value f(n).
• 3 *n2+n/2+12 ∈O(n2)

Chapter 1
• 4*n*log 2 (3*n+1) +2*n-1 ∈O(n*logn)

46
Show that 4n2 = O(n3).
• By definition, we have 0 ≤ f(n) ≤ cg(n)
• Substituting 4n2 as f(n) and n3 as g(n),
• we get , 0 ≤ 4n2 ≤ cn3
• Dividing by n3 , 0/n3 ≤ 4n2/n3 ≤ cn3/n3
• 0 ≤ 4/n ≤ c

Chapter 1
• Now to determine the value of c,
• As n-> ∞ , 4/n=0
• we see that 4/n is maximum when n=1.
• Therefore, c=4.
• To determine the value of n0,
• 0 ≤ 4/n0 ≤ 4
• 0 ≤ 4/4 ≤ n0
• 0 ≤ 1 ≤ n0 This means n0=1. 47
• Therefore, 0 ≤ 4n2 ≤ n3 ∀ n ≥ n0=1.
Ω notation

• Definition: f(n) = Ω(g(n)) iff there are two positive constants c and n0

Chapter 1
such that
|f(n)| ≥ c |g(n)| for all n ≥ n0

• If f(n) is nonnegative, we can simplify the last condition to


0 ≤ c g(n) ≤ f(n) for all n ≥ n0

• We say that “f(n) is omega of g(n).”


48
• As n increases, f(n) grows no slower than g(n).In other words, g(n) is
an asymptotic lower bound on f(n).
Ω notation
• W h e n we say that the running time of an algorithm
is Ω (g(n), w e mean that no matter what particular
input of size n is chosen for each value of n, the
running time on that input is at least a constant
times g(n), for sufficiently large n.

Chapter 1
• n 3 + 20n ∈ Ω(n2)

49
Prove that : n³ + 4n² = Ω(n²)
• Proof:
• Here, we have f(n) = n³ + 4n² , and g(n) = n².
• It is not too hard to see that if n ≥ 0, n³ ≤ n³ + 4n².
• We have already seen that if n ≥ 1, n² ≤ n³.

Chapter 1
• Thus when n ≥ 1, n² ≤ n³ ≤ n³ + 4n².
• Therefore, 1n² ≤ n³ + 4n² for all n ≥ 1
• Thus, we have shown that n³ + 4n² = Ω(n²) (by
definition of Big-Ω, with n₀ = 1, and c = 1).

50
Ө notation

• Definition: f(n) = Θ(g(n)) iff there are three positive constants c1, c2

Chapter 1
and n0 such that
c1g(n) ≤ f(n) ≤ c2g(n) for all n ≥ n0

• If f(n) is nonnegative, we can simplify the last condition to


0 ≤ c1g(n) ≤ f(n) ≤ c2g(n), ∀ n ≥ n0

• We say that “f(n) is theta of g(n).”


51

• As n increases, f(n) grows at the same rate as g(n). In other words, g(n) is an
asymptotically tight bound on f(n).
Arithmetic of Big-O, Ω , and  notations
• Transitivity:
– f(n)  O(g(n)) and g(n)  O(h(n)) ) => f(n)  O(h(n))
– f(n)  (g(n)) and g(n)  (h(n)) ) => f(n)  (h(n))

• Scaling: if f(n)  O(g(n)) then for any k > 0, f(n)  O(kg(n))

Chapter 1
• Sums: if f1(n)  O(g1(n)) and f2(n)  O(g2(n)) then
(f1 + f2)(n)  O(max(g1(n), g2(n)))

52
Basic rules
1. Nested loops are multiplied together.
2. Sequential loops are added.
3.Only the largest term is kept, all others are dropped.
4. Constants are dropped.

Chapter 1
5. Conditional checks are constant (i.e. 1).

53
Example 1
//linear
fo r ( i = 1 to n)
{

Chapter 1
display i;
}

• A n s : O(n)
54
Example 2
//quadratic
for(int i = 0; i < n; i++) {
for(int j = 0; j < n; j++){

Chapter 1
//do swap stuff, constant time
}
}

• A n s O(n^2)
55
Example 3
for(int i = 0; i < 2*n; i++) {
cout << i << endl;
}

Chapter 1
• A t first you might say that the upper bound is O(2n);
however, we drop constants so it becomes O(n)

56
Example 4
//linear
for(int i = 0; i < n; i++) {
cout << i << endl;
}

Chapter 1
//quadratic
for(int i = 0; i < n; i++) {
for(int j = 0; j < n; j++){
//do constant time stuff
} }
• A n s : In this case we add each loop's Big O, in this case n+n2.
O(n2+n) is not an acceptable answer since we must drop the 57
lowest term. The upper bound is O(n2). Why? Because it has
the largest growth rate
Example 5
for(int i = 0; i < n; i++) {
for(int j = 0; j < 2; j++){
//do stuff

Chapter 1
}
}

• A n s : Outer loop is 'n', inner loop is 2, then we have 2n,


dropped constant gives up O(n)
58
Example 6
for(int i = 1; i < n; i *= 2) {
cout << i << endl;
}

Chapter 1
• T h e re are n iterations, however, instead of simply
incrementing, 'i' is increased by 2*itself each run. Thus
the loop is O(log(n)).

59
Example 7
• for(int j = 1; j < n; j *= 2){ // log (n)

• for(int i = 0; i < n; i++) { //linear

Chapter 1
• //do constant time stuff
• }
• }

• A n s : n*log(n)
60
Typical Running Time Functions
• 1 (constant running time):
• Instructions are executed once or a few times

• log(n) (logarithmic), e.g., binary search


• A big problem is solved by cutting original problem in smaller sizes, by a

Chapter 1
constant fraction at each step

• n (linear): linear search


• A small amount of processing is done on each input element

• n log(n): merge sort


• A problem is solved by dividing it into smaller problems, solving them
independently and combining the solution 61
Typical Running Time Functions
• n2 (quadratic): bubble sort
• Typical for algorithms that process all pairs of data items (double nested
loops)

• n3 (cubic)

Chapter 1
• Processing of triples of data (triple nested loops)

• nK (polynomial)
• 20.694n (exponential): Fib1
• 2n (exponential):
• Few exponential algorithms are appropriate for practical use
62
– 3n (exponential), …
Relation between Ө, O and Ω

Chapter 1
63
THANK YOU!

Chapter 1
64

You might also like