DATA

© All Rights Reserved

55 vues

DATA

© All Rights Reserved

- Autonomous Robotics by Nelson
- Matrix Row Major
- csc
- mathcad
- DSA 7.pdf
- sab80c166
- Interview questions
- Curriculum eBook FREE
- 19640287
- Euro Class 12 Comp Practical
- 04 Lists Stacks
- B3.2-R3 syllabus
- Conversion
- Lenguaje Ensamblador Irvine 5a Edición
- 1 Determinants
- K01149_20181002125220_Updated-PRO FORMA MTS3023 A181 (1)
- C++ PROGRAMS
- Wiles1994 - In Situ Stress Determination Using the Under-excavation Technique--I. Theory
- Expressions
- 2013.fall.prelim1.pdf

Vous êtes sur la page 1sur 55

Structure

1.0

1.1

1.2

1.3

1.4

Introduction

Objectives

Basic Terminology; Elementary Data Organization

Data Structures An Overview

Arrays and Records

1.4.1

1.4.2

1.4.3

1.4.4

1.4.5

1.4.6

Linear Analysis

Representation of Linear Arrays in Memory

Traversing Linear Arrays

Inserting and Deleting

Multidimensional Arrays

1.4.6.1 Two-Dimensional Arrays

1.4.6.2 Representation of Two-Dimensional Arrays in Memory

1.4.6.3 General Multidimensional Arrays

1.4.7 Records : Records Structure

1.4.8 Representation of Records in Memory; Parallel Arrays

1.5

Stacks

1.5.1 Stack Operations

1.5.2 Array Representation of Stacks

1.5.3 Applications of Stacks

1.5.3.1 Arithmetic Expressions; Polish Notation

1.5.3.2 Quick Sort, An application of Stacks

1.5.4 Recursion

1.5.5 Implementation of Recursive Procedures by Stacks

1.6

Queues

1.6.1 Representation of Queues

1.6.2 Deques

1.6.3 Priority Queues

1.7

Linked Lists

1.7.1

1.7.2

1.7.3

1.7.4

1.7.5

1.7.6

1.7.7

1.7.8

Basic Terminology

Representation of Linked Lists in Memory

Traversing a Linked List

Searching a Linked List

Insertion into a Linked List

Deletion from a Linked List

Header Linked Lists

Two-way Lists

1.8

1.9

1.10

1.0

Summary

Solutions/Answers

Further Readings

INTRODUCTION

This unit is an introductory unit and gives you an understanding of what a data structure

is. It is about structuring and organizing data as a fundamental aspect of developing a

computer application. This unit guided you from the definition of data structure to the

linear data structures: arrays and records, stacks, queues and lists. Finally, the last

chapter leads you to the advanced data structures like binary tree. In this chapter, you

could understand the issues related to trees that is how to make a traversal in binary tree,

how to delete a node, how to search for a node, how to construct a tree, etc.

Knowledge of data structures is required for people who design and develop computer

programs of any kind: Systems software or application software. As you know already,

the data means a collection of facts, concepts or instructions in a formalized manner

suitable for communication or processing. Processed data is called as Information. A

data structure is an arrangement of data in a computer's memory or even disk storage.

An example of several common data structures are arrays, linked lists, queues, stacks,

binary trees, and hash tables. Algorithms, on the other hand, are used to manipulate the

data contained in these data structures as in searching and sorting.

Software engineering is the study of ways in which to create large and complex

computer applications by programmers and designers. Software engineering involves the

full life cycle of a software project which includes Analysis, Design, Coding, Testing and

Maintenance. However, the subject of data structures and algorithms is concerned with

the Coding phase. The use of data structures and algorithms is to store and manipulate

data by the programmers.

One of the basic data structures of a program is Array. Array is a data structure which can

represent a collection of elements of same data type. An array can be of any dimensions.

It can be two dimensional, three dimensional etc. Collections of data are frequently

organized into a hierarchy of field, records and files. Specifically, a record is a collection

of related data items, each of which is called a field or attribute, a file is a collection of

similar records.

Stacks and Queues are very useful in Computer Science. Stack is a data structure which

allows elements to be inserted as well as deleted from only one end. Stack is also known

as LIFO data structure. Queue is another most common data structure which allows

elements to be inserted at one end called Front and deleted at another end called Rear.

Queue is also known as FIFO data structure.

The other data structure is Lists. A List is an ordered set consisting of a variable number

of elements to which addition and deletions can be made. The first element of a List is

called head of List and the last element is called the tail of List. List can be represented

using arrays or pointers. If pointers are used, then, the operations on the list such as

Insertion, Deletion etc. will become easy.

The final data structure is Tree. A Tree is a connected, acyclic graph. A Tree contains no

loops or cycles. The concept of trees is one of the most fundamental and useful concepts

in Computer Science. A Tree structure is one in which items of data are related by edges.

As a first section, we will discuss about the basic terminology and concepts of Data

Organization. Then, the discussion on different operations which are applied to these data

structures.

1.1

OBJECTIVES

1.2

define the term data structure;

know the classifications of data structures, i.e., linear and non-linear ;

understand the basic operations of linear and non-linear data structures;

explain the memory representation of all types of data structures and

tell that how to implement the all kinds of data structures.

BASIC TERMINOLOGY; ELEMENTARY DATA ORGANIZATION

Data Structures are building blocks of a program. If a program is built using improper

data structures, then the program may not work as expected always. It is very much

important to use right data structures for a program.

As pointed out earlier, Data are simply values or set of values. A data item is either the

value of a variable or a constant. For example, a data item is a row in a database table,

which is described by a data type. A data item that does not have subordinate data items

is called an elementary item. A data item that is composed of one or more subordinate

data items is called a group item. A record can be either an elementary item or a group

item. For example, an employees name may be divided into three sub items first name,

middle name and last name but the social_security_number would normally be treated

as a single item.

The data are frequently organized into a hierarchy of fields, records and files. In order to

understand these terms, let us see the following example (1.1),

Attributes: Name

Values:

Vignesh

Age

30

Sex

m

Social_Security_Number

123-34-23

(Example 1.1)

An entity is something that has certain attributes or properties which may be assigned

values. The values themselves may be either numeric or nonnumeric. In the above

example, an employee of a given organization is entity. Entities with similar attributes

(E.g. all the employees in an organization) collectively form an entity set. Each attributes

of an entity set has a range of values. The set of all possible values could be assigned to

the particular attributes.

The way that data are organized into the hierarchy of fields, records and files reflects the

relationship between attributes, entities and entity sets. That is, a field is a single

elementary unit of information representing an attributes of an entity, a record is the

collection of field values of a given entity and a file is the collection of records of the

entities in given entity set.

Each record in a file may contain many field items, but the value in a certain field may

uniquely determine the record in the file. Such a field K is called a Primary Key, and the

values K1, K2.in such field are called keys or key values. Let we see the example of

Key and Primary Key.

For example 1.2(a) and 1.2(b)

(a) An automobile dealership maintains an inventory file where each record contains the

following data:

Serial_ Number Type

Year

Price

Accessories

The Serial_Number field can serve as a primary key for the file, since each automobile

has a unique serial number.

Example 1.2(a)

(b) An organization maintains a membership file where each record contains the following

file data:

Name

Address Telephone_Number

Dues_Owed

Example 1.2(b)

Although there are four data items, Name and Address may be group items. Here the

Name field is a primary key. Note that the address and Telephone_Number fields may not

serve as primary keys, since some members may belong to the same family and have the

same address and telephone number.

There are two categories of records according to length of the record. A file can have

fixed length records or variable length records. In fixed-length records, all the records

contain the same data items with the same amount of space assigned to each data items

with the same amount of space assigned to each data item. In variable-length records,

file records may contain different lengths. For Example, student records usually have

variable lengths, since different students take different number of courses. Usually,

variable length records have a maximum and a minimum length.

As we know already, the organization of data into fields, records and files may not be

complex enough to maintain and efficiently process certain collections of data. For this

reason, data are also organized into more complex types of structures. To learn about

these data structures, we have to follow these three steps:

1. Logical or mathematical description of the structure.

2. Implementation of the structure on a computer.

3. Quantitative analysis of the structure, which includes determining the amount of

memory needed to store the structure and the time required to process the

structure.

We may discuss some of these data structures (study of first step) as an introduction part

in next section. The study of second and third steps depend on either the data are stored in

the main (or) primary memory of the computer or, in a secondary (or) external storage

unit.

Check Your Progress 1

1.

2.

3.

4.

Define Data

What do you mean by Data Item, Elementary Data Item and Group Item?

The data is organized into Fields, Records and Files.

True/False

If the records contain the same data items with same amount of space assigned to

each data item is called ----------5. In variable-length records, file records may contain different lengths. True/False

1.3

Data may be organized in many different ways: the logical or mathematical model of a

particular organization of data is called a data structure.

Arrays

The simplest type of data structure is a linear (or one dimensional) array. By a linear

array, we mean a list of a finite number n of similar data elements referenced respectively

by a set of n consecutive numbers, usually 1, 2, 3, n. If we choose the name A for the

array, then the elements of A are denoted by subscript notation

or, by the parenthesis notation

A(1), A(2), A(3),., A(N)

or, by the bracket notation

A[1], a[2], A[3],.., A[N]

Regardless of the notation, the number K in A[K] is called a subscript and A[K] is called

a subscripted variable.

Linear arrays are called one-dimensional arrays because each element in such an array is

referenced by one subscript. A two-dimensional array is a collection of similar data

elements where each element is referenced by two subscripts. Such arrays are called

matrices in mathematics, and tables in business applications.

Stack

A stack, also called a last-in-first-out (LIFO) system, is a linear list in which items may

be inserted or removed only at one end called the top of the stack. A stack may be in our

daily life, for example a stack of dishes on a spring system or a stake of dishes. We can

observe that any dish may be added or removed only from the top of the stack. We also

call these lists as piles or push-down list.

Queue:

A queue, also called a first-in-first-out (FIFO) system, is a linear list in which deletions

can take place only at one end of the list, the front of the list, and insertions can take

place only at the other end of the list, the rear of the list. The features of a Queue are

similar to the features of any queue of customers at a counter, at a bus stop, at railway

reservation counter etc. A queue can be implemented using arrays or linked lists. A queue

can be represented as a circular queue. This representation saves space when compared

to the linear queue. Finally, there are special cases of queues called Dequeues which

allow insertion and deletion of elements at both the end.

Tree

A tree is an acyclic, connected graph. A tree contains no loops or cycles. The concept of

tree is one of the most fundamental and useful concepts in computer science. Trees have

many variations, implementations and applications. Trees find their use in applications

such as compiler construction, database design, windows, operating system programs,

etc. A tree structures is one in which items of data are related by edges. A very common

example is the ancestor tree as given in Figure 1.1. This tree shows the ancestors of

KUMAR. His parents are RAMESH and RADHA. RAMESHs parents are PADMA and

SURESH who are also grand parents of KUMAR (on fathers side); RADHAs parents

are JAYASHRI and RAMAN who are also grand parents of KUMAR (on mothers

side).

KUMAR

RADHA

JAYASHRI

RAMESH

RAMAN

PADMA

SURESH

Graph

All the data structures (Arrays, Lists, Stacks, and Queues) except Graphs are linear data

structures. Graphs are classified in the non-linear category of data structures.

A graph G may be defined as a finite set V of vertices and a set E of edges (pair of

connected vertices). The notation used is as follows:

Graph G = (V,E)

Let we consider graph of figure1.2.

2

4

(Figure 1.2: A graph)

From the above graph, we may observe that the set of vertices for the graph is

V = {1,2,3,4,5}, and the set of edges for the graph is E = {(1,2), (1,5), (1,3), (5,4), (4,3),

(2,3)}. The elements of E are always a pair of elements. The relationship between pairs

of these elements is not necessarily hierarchical in nature.

Check Your Progress 2

1.

2.

3.

4.

The classification of the data structures are ----------------The number K in A[K] is called a .. and A[K] is called a

Dequeues allow insertion and deletion of elements at both the end.

True/False

Graphs are classified into ----------- category of data structures.

6. What do you mean by LIFO?

7. List out the applications which are using the data structure Tree.

1.4

As we studied earlier, the data structures are classified as either linear or nonlinear. A

data structure is said to be linear if its elements form a sequence, or, in other words, a

linear list. There are two basic ways of representing such linear structures in memory.

One way is to have the linear relationship between the elements represented by means of

sequential memory locations. These linear structures are called arrays. The other ways

is to have the linear relationship between the elements represented by means of pointers

or links. These linear structures are called linked lists. We will discuss it in section 1.7.

Examples to non-linear structures are trees and graphs. We may perform the following

operations on any linear structure, whether it is an array or a linked list.

(a)

Traversal.

(b)

Search.

the record with a given key.

(c)

Insertion.

(d)

Deletion.

(e)

Sorting.

(f)

Merging.

Depends upon the relative frequency, we may perform these operations with a particular

any one of these Linear structure. Let us now discuss about a very common linear

structure called an array.

1.4.2 LINEAR ARRAYS

A linear array is a list of a finite number n of inhomogeneous data elements (i.e., data

elements of the same type) such that:

(a) The elements of the array are referenced respectively by an index set consisting of

n consecutive numbers.

(b) The elements of the array are stored respectively in successive memory locations.

We will analyze this for Linear Arrays.

The number n of elements is called the length or size of the array. If not explicitly stated,

we may assume the index set consists of the integers 1, 2, 3, .., n.

The general equation to find the length or the number of data elements of the array is,

Length = UB LB + 1

(1.4a)

where, UB is the largest index, called the upper bound, and LB is the smallest index,

called the lower bound of the array. Remember that length=UB when LB=1.

Also, remember that the elements of an array A may be denoted by the subscript notation,

A1, A2, A3..An

DATA[2]=56, DATA[3]=429, DATA[4]=135, DATA[5]=87, DATA[6]=156.

Example 1.3(a)

The following figures 1.3(a) and (b) depict the array DATA.

247

56

429

135

87

156

247

56

Figure1. 3 (a)

429 135

87

156

Figure 1.3(b)

sold each year from 1932 through 1984.

Example 1.3(b)

Rather than beginning the index set with 1, it is more useful to begin the index set with

1932, so we may learn that,

AUTO[K] = number of automobiles sold in the year K

Then, LB = 1932 is the lower bound and UB=1984 is the upper bound of AUTO. When

we use the equation (1.4a), we can find out the length of this array,

Length = UB LB + 1 = 1984 + 1 = 55.

From this equation, we may conclude that AUTO contains 55 elements and its index set

consists of all integers from 1932 through 1984.

1.4.3 REPRESENTATION OF LINEAR ARRAYS IN MEMORY

Consider LA be a linear array in the memory of the computer. As we know that the

memory of the computer is simply a sequence of addressed location as pictured in

figure 1.4 as given below.

1000

1001

1002

1003

1004

1005

.

.

.

.

Let we use the following notation when calculate the address of any element in linear

arrays in memory,

LOC(LA[K]) = address of the element LA[K] of the array LA.

As previously noted, the elements LA are stored in successive memory cells.

Accordingly, the computer does not need to keep track of the address of ever element of

LA, but needs to keep track only of the address of the first element of LA, which is

denoted by

Base(LA)

and called the base address of LA. Using this address Base(LA), the computer calculates

the address of any element of LA by the following formula:

LOC(LA[K]) = Base(LA) + w (K- lower bound)

(1.4b)

where w is the number of words per memory cell for the array LA.

Let we observe that the time to calculate LOC(LA[K]) is essentially the same for any

value of K. Furthermore, given any subscript K, one can locate and access the content of

LA[K] without scanning any other element of LA.

Example 1.4,

Consider the previous example 1.3(b), array AUTO, which records the number of

automobiles sold each year from 1932 through 1984. Array AUTO appears in memory is

pictured in Figure 1.5. Assume, Base(AUTO) = 200 and w = 4 words per memory cell for

AUTO.

LOC(AUTO[1932]) = 200, LOC(AUTO[1933]) = 204, LOC(AUTO[1934]) = 208, .

Let we find the address of the array element for the year K = 1965. It can be obtained by

using Equation (1.4b):

LOC(AUTO[1965]) = Base(AUTO) + w(1965 lower bound) = 200 + 4(1965-1932) = 332

Again, we emphasize that the contents of this element can be obtained without scanning

any other element in array AUTO.

200

201

202

203

204

205

206

207

208

209

210

211

.

.

.

AUTO[1932]

AUTO[1933]

AUTO[1934]

.

.

.

Consider let A be a collection of data elements stored in the memory of the computer.

Suppose we want to either print the contents of each element of A or to count the number

is, by accessing and processing (frequently called visiting) each element of A exactly

once.

The following algorithm is used to traversing a linear array LA.

As we know already, here, LA is a linear array with lower bound LB and upper bound

UB. This algorithm traverses LA applying an operation PROCESS to each element of

LA.

1. [Initialize counter] Set K:= LB.

2. Repeat Steps 3 and 4 while K< UB

3.

[Visit element] Apply PROCESS to LA[K]

4.

[Increase counter] Set K:= K + 1

[End of Step 2 loop]

5. Exit.

We also state an alternative form of the algorithm which uses a repeat-for loop instead

of the repeat-while loop. This algorithm traverses a linear array LA with lower bound LB

and upper bound UB.

1. Repeat for K = LB to UB:

Apply PROCESS to LA[K].

[End of loop]

2. Exit.

Example 1.5,

Again consider the previous one [example 1.3(b)], array AUTO, which records the

number of automobiles sold each year from 1932 through 1984. Each of the following

algorithms carry out the given operation involves traversing AUTO.

(a) Find the number NUM of years during which more than 300 automobiles

were sold.

1. [Initialization Step] Set NUM := 0.

2. Repeat for K = 1932 to 1984:

If AUTO[K] > 300, then: Set NUM := NUM + 1

[End of loop]

3. Return

(b) Print each year and the number of automobiles sold in that year.

1. Repeat for K = 1932 to 1984:

Write: K, AUTO[K].

[End loop]

2. Return.

Let A be a collection of data elements in the memory of the computer. Inserting refers

to the operation of adding another element to the collection A, and deleting refers to the

operation of removing one of the elements from A. Let we discuss the inserting and

deleting an element when A is a linear array.

Inserting an element at the end of a linear array can be easily done provided the

memory space allocated for the array is large enough to accommodate the additional

element. On the other hand, suppose we need to insert an element in the middle of the

array. Then, on the average, half of the elements must be moved downward to new

locations to accommodate the new element and keep the order of the other elements.

Similarly, deleing an element at the end of an array presents no difficulties, but deleting

an element somewhere in the middle of the array would require that each subsequent

element be moved one location upward in order to fill up the array.

Since leaner arrays are usually pictured extending downward, as given below in figure

1.6, the term downward refers to locations with larger subscripts, and the term

upward refers to location with smaller subscripts.

1

Red

Green

Blue

Black

White

6

(Figure1.6: Linear Array in downward)

Consider TEST has been declared as a 5-element array but data have been recorded only

for TEST [1], TEST [2], and TEST [3].

TEST [4] := X

to add X to the Linear Array. Similarly, if Y is the value of the subsequent element, then

we may assign,

TEST [5] := Y

to add Y to the Linear Array. Now, we may conclude that we can not add any new

element to this Linear Array due to the reach of upper bound.

Consider the other example 1.7,

Suppose NAME is an 8-element linear array, and suppose five names are in the array, as

in Figure 1.7(a). Observe that the names are listed alphabetically, and suppose we want

to keep the array names alphabetical at all times. If, Ford is adding to the array, then,

Johnson, Smith and Wagoner must each be move downward one location, as given in

Figure 1.7(b). Next if we add Taylor to this array; then Wagner must be move, as in

Figure 1.7(c). Last, when we remove Davis from the array, then, the five names Ford,

Johnson, Smith, Taylor and Wagner must each be move upward one location, as in

Figure 1.7(d). We may observe, clearly that such movement of data would be very

expensive if thousands of names are in the array.

1

2

3

4

5

6

7

8

NAME

NAME

NAME

NAME

Brown

Davis

Johnson

Smith

Wapner

Brown

Davis

Ford

Johnson

Smith

Wapner

Brown

Davis

Ford

Johnson

Smith

Taylor

Wapner

Brown

Ford

Johnson

Smith

Taylor

Wapner

Figure 1.7(a)

Figure 1.7(b)

Figure 1.7(c)

Figure 1.7(d)

The following algorithm inserts a data element ITEM into the Kth position in a linear

array LA with N elements i.e. INSERT(LA,N,KM,ITEM).

Here LA is a linear array with N elements and K is a positive integer such that K<N.

This algorithm inserts an element ITEM into the Kth position in LA.

1.

2.

3.

4.

5.

Repeat Steps 3 and 4 while J > K.

[Move Jth element downward] Set LA[J+1] := LA[J]

[Decrease counter] Set J : = J-1

[End of Step 2 loop]

[Insert element] Set LA[K] := ITEM

6.

7.

Exit

The first four steps create space in LA by moving downward one location each element

from the Kth position on. We emphasize that these elements are moved in reverse order

i.e., first LA [N], then LA [N-1], and last LA [K]; otherwise data might be erased.

In more detail, we first set J:=N and then, using J as a counter, decrease J each time the

loop is executed until J reaches K. The next step, Step 5, inserts ITEM into the array in

the space just created. Before the exit from the algorithm, the number N of elements in

LA is increased by 1 to account or the new element.

The following algorithm deletes the Kth element from a linear array LA and assigns it to

a variable ITEM i.e. DELETE{LA.N,K,ITEM).

Here LA is a linear array with N elements and K is a positive integer such that K<N.

This algorithm deletes the Kth element from LA.

1.

2.

3.

4.

Repeat for J = K to N-1

[Move J + 1st element upward] Set LA[J] := LA[J+1]

[End of loop]

[Reset the number N of elements in LA] Set N := N 1.

Exit.

We may conclude that if many deletions and insertions are to be made in a collection of

data elements, then a linear array may not be the most efficient way of storing the data.

1.4.6 MULTIDIMENSIONAL ARRAYS

The linear arrays discussed so far are also called one dimensional arrays, since each

element in the array is referenced by a single subscript. Most programming languages

allow two- dimensional and three- dimensional arrays, ie., arrays where elements are

referenced, respectively, by two and three subscripts. In fact, some programming

languages allow the number of dimensions for an array to be high as 7. this section

discusses these multidimensional arrays.

A two-dimensional m x n array A is a collection of m . n data elements such that each

element is specified by a pair of integers (such as J, K), called subscripts, with the

following property that,

I<=J<=m

and

I <= K <= n

The element of A with first subscripts j and second subscript k will be denoted by

A J,K

or

A[J, K]

tables in business applications; hence two-dimensional arrays are called matrix arrays.

Let we understand that there is a standard way of drawing a twodimensional m x n array

A where the elements of A form a rectangular array with m rows and n columns where

the element A[J, K] appears in row J and column K. Remind that a row is a horizontal

list of elements and a column is a vertical list of elements. In the following figure 1.8, we

may observe that two-dimensional array A has 3 rows and 4 columns. Let we emphasize

that each row contains those elements with the same first subscript, and each column

contains those elements with the same second subscript.

Rows

1

A[1, 1]

A[2, 1]

A[3, 1]

1

2

3

2

A[1, 2]

A[2, 2]

A[3, 2]

3

A[1, 3]

A[2, 3]

A[3, 3]

4

A[1, 4]

A[2, 4]

A[3, 4]

Let each student in a class of 25 students is given 4 tests. Assume the students are

numbered from 1 to 25, the test scores can be assigned to a 25 x 4 matrix array SCORE

as pictured in figure 1.9.

Thus, SCORE[K, L] contains the Kth students score on the Lth test. In particular, the

second row of an array,

SCORE [2, 1],

SCORE[2, 2]

SCORE[2, 3]

Student

Test 1

Test 2

Test 3

Test 4

SCORE[2, 4]

1

2

3

.

.

.

25

84

95

72

.

.

.

78

73

100

66

.

.

.

82

88

88

77

.

.

.

70

81

96

72

.

.

.

85

Let A is a twodimensional m x n array. The first dimension of A contains the index set

1,., m. with lower bound 1 and upper bound m; and the second dimension of A

contains the index set 1,2,.. n, with lower bound 1 and upper bound n. The length of a

dimension is the number of integers in its index set. The pair of lengths m x n (read m

by n) is called the size of the array. Let we find the length of a given dimension (i.e., the

number of integers in its index set) by obtain from the formula,

Length = upper bound lower bound + 1

Let A be a two-dimensional m x n array. Although A is pictured as a rectangular array of

elements with m rows and n columns, the array will be represented in memory by a block

of m.n sequential memory locations. If they are being stored in sequence, then how are

they sequenced? Is it that the elements are stored row wise or column wise? Again, it

depends on the operating system. Specifically, the programming languages will store the

array A in either,

Row by row, called row-major order.

The following figures 1.10 (a) & (b) shows these two ways when A is a two-dimensional

3 x 4 array.

A

Subscript

A

Subscript

(1, 1)

(1,1)

(2, 1) Column 1

(1, 2) ROW 1

(3, 1)

(1, 3)

(1, 2)

(1,4)

(2, 2) Column 2

(2, 1)

(3, 2)

(2, 2) ROW 2

(1, 3)

(2,3)

(2, 3) Column 3

(2, 4)

(3, 3)

(3, 1)

(1, 4)

(3,2) ROW 3

(2, 4) Column 4

(3, 3)

(3, 4)

(3, 4)

Fig.1.10 (a) Column-major order.

Recall that, for a linear array LA, the computer does not keep track of the address

LOC(LA[K]) of every element LA[K] of LA, but does keep track of Base(LA), the

address of the first element of LA. The computer uses the formula

LOC(LA[K]) = Base(LA) + w (K-1)

to find the address of LA[K]. Here w is the number of words per memory cell for the

array LA, and 1 is the lower bound of the index set of LA.

A similar situation also holds for any two-dimensional m x n array A. That is, the

computer keeps track of Base(A) the address of the first element A[1,1] of A and

computes the address LOC(A[J, K]) of A[J, K].

The formula for column and row major order is,

LOC(A[J, K]) = Base(A) + w[M(K-1) + (J-1)]

(1.4.6.2a)

LOC(A[J, K]) = Base(A) + w[N(J-1) + (K-1)]

(1.4.6.2b)

Again, w denotes the number of words per memory location for array A. Note that the

formulas are linear in J and K, and that we may find the address LOC(A[J, K]) in time

independent of J and K.

By using these formulas, let we see the following example 1.9.

Example 1.9

Consider the previous example 1.8, of 25 x 4 matrix array SCORE. Suppose

Base(SCORE) = 200 and there are w = 4 words per memory cell. Further more let the

programming language stores two-dimensional arrays using row-major order. Then the

address of SCORE[12,3], the third test of the twelfth student, follows:

Could you understand that how we get the address of the twelfth student? By simple

using the Equation (1.4.5.2b), we derived it.

Multidimensional arrays clearly illustrate the difference between the logical and the

physical views of data. Figure 1.8 shows that how we could logically views a 3 x 4

matrix array A, that is, as a rectangular array of data where A[J, K] appears in row J and

column K. On the other hand, the data will be physically stored in memory by a linear

collection of memory cells.

General multidimensional arrays are defined analogously. More specifically, an ndimensional m1 x m2 x x mn., array B is a collection of m1, m2.. mn data

elements in which each element is specified by a list of n integers such as K1, K2,. Kn

called subscripts, with the property that

1 <= K1 <= m1,

..

B K1,

K2,. Kn

or

B[K1, K2,. Kn ]

The array will be stored in memory in a sequence of memory locations. Specifically, the

programming language will store the array B either in row-major order or column-major

order.

Check Your Progress 3

1.

2.

3.

4.

5.

6.

What are the operations could perform with linear data structure?

At Maximum, an array can be a two-dimensional Array.

In linear array, the downward refers to .

In linear array, the term upward refers to .

In ., the elements of array are stored row wise.

In ., the elements of array are stored column wise.

True/False

As we studied already that the collections of data are frequently organized into a

hierarchy of field, records and files. Specifically a record is a collection of related

data items, each of which is called a field or attribute, and a file is a collection of

similar records. Each data item itself may be a group item composed of sub items;

those items which are indecomposable are called elementary items or atoms or

scalars. The names given to the various data items are called identifiers.

As we know that a record is a collection of data items, it differs from a linear array in

the following way,

(a) A record may be a collection of non-homogeneous data; i.e., the data items in

a record may have different data types.

(b) The data items in a record are indexed by attribute names, so there may not

be a natural ordering of its elements.

Under the relationship of group item to sub item, the data items in a record form a

hierarchical structure which can be described by means of "level" numbers. Let we

understand through the following examples.

Example 1.10

Consider a hospital keeps a record on each newborn baby who contains the following

data items: Name, Sex, Birthday, Father, and Mother. Assume that Birthday is a

group item with sub items Month, Day and Year, and Father and Mother are group

items, each with sub items Name and Age.

Let we describe the structure of the above record. (Note that Name appears three

times and age appears twice in the structure.)

1

Newborn

2

2

2

Name

Sex

Birthday

3

3

3

Father

3

3

Mother

3

3

Month

Day

Year

Name

Age

Name

Age

The number to the left of each identifier is called a level number. Observe that each

group item is followed by its sub items, and the level of the sub items is 1 more than

the level of the group item. Furthermore, an item is a group if and only if it is

immediately followed by an item with a greater level number.

Some of the identifiers in a record structure may also refer to arrays of elements. In

fact, the first line of the above structure is replaced by

1 Newborn(20)

This will indicate a file of 20 records, and the usual subscript notation will be used to

distinguish between different records in the file. That is, we will write to

Newborn1, Newborn2 , Newborn3 , . . .

or

Newborn[1], Newborn[2], Newborn[3],. . .

to denote different records in the file.

Example 1.11

A class of student records may be organized as follows:

1 Student(20)

2 Name

3 Last

3 First

3 MI (Middle Initial)

2 Test(3)

2 Final

2 Grade

The identifier Student(20) indicates that there are 20 students. The identifier Test (3)

indicates that there are three tests per student. Observe that there are 8 elementary

items per Student, since test is counted three times. Altogether, there are 160

elementary items in the entire Student structure.

Indexing Items in a Record

Suppose we want to access some data item in a record. In some cases, we cannot simply

write the data name of the item since the same name may appear in different places in the

record. For example, Age appears in two places in the record in Example 1.10.

Accordingly, in order to specify a particular item, we may have to qualify the

name by using appropriate group item names in the structure. This qualification

is indicated by using decimal points to separate group items from sub items.

Example 1.12

(a) Consider the record structure Newborn in Example 1.10. Sex and year need no

qualification, since each refers to a unique item in the structure. On the other

hand, suppose we want to refer to the age of the father. This can be done by

writing

Newborn.Father.Age

or simply

Father.Age

The first reference is said to be fully qualified. Sometimes one adds qualifying

identifiers for clarity.

(b) Suppose the first line in the record structure in Example 1.10 is replaced by

1 Newborn(20)

That is, Newborn is defined to be a file with 20 records. Then every item

automatically becomes a 20-element array. Some languages allow the sex of

the sixth newborn to be referenced by writing

Newborn.Sex[6]

or simple

Sex[6]

Analogously, the age of the father of the sixth newborn may be referenced by

writing

Newborn. Father. Age[6]

or simply

Father.Age[6]

(c) Consider the record structure Student in Example 1.11. Since Student is

declared to be a file with 20 students, all items automatically become 20element arrays. Furthermore, Test becomes a two-dimensional array. In

particular, the second test of the sixth student may be referenced by writing

Student. Test[6, 2]

or simply

Test[6, 2]

identifiers. For example,

Test[3,1]

does not refer to the third test of the first student, but to the first test of the

third student.

As we learned already that if the records may contain non-homogeneous data,

then the elements of a record cannot be stored in an array. Some programming

languages, such as PL/1, Pascal and COBOL, do have record structures built into

the language.

Example 1.13

Consider the record structure Newborn in Example 1.10. One can store

such a record in PL/1 following declaration, which defines a data

aggregate called a structure:

DECLARE 1

NEWBORN,

2 NAME CHAR(20),

2 SEX CHAR(1),

2 BIRTHDAY,

3 MONTH FIXED,

3 DAY FIXED,

3 YEA R FIXED,

2 FATHER,

3 NAME CHAR(20),

3 AGE FIXED,

2 MOTHER

3 NAME CHAR(20),

3 AGE FIXED;

Let we observe that the variables SEX and YEAR are unique; hence references to

them need not be qualified. On the other hand, AGE is not unique. Accordingly,

we should use like that

FATHER.AGE

or

MOTHER.AGE

If a programming language does not have available the hierarchical structures that

are label in PL/1, Pascal and COBOL. Assuming the record contains nonhomogeneous data, the record may have to be stored in individual variables, one

for each of its elementary data items. On the other hand, if we want to store an

entire file of records, note that all data elements belonging to the same identifier

do have the same type. Such a file may be stored in memory as a collection of

parallel arrays; that is, where elements in the different arrays with the same

subscript belong to the same record. To understand this parallel arrays, let we go

through the following example 1.14.

Example 1.14

Consider a membership list contains the name, age, sex and telephone number of

each member. We can store the file in four parallel arrays, NAME, AGE, SEX and

PHONE, as pictured in Figure 1.11; that is, for a given subscript K, the elements

NAME[K], AGE[K], SEX[K] and PHONE[K] belong to the same record.

NAME

1

2

3

4

5

.

.

.

John Brown

Paul Cohen

Mary Davis

Linda Evans

AGE

SEX

PHONE

234-5186

456-7272

777-12I2

876-4478

Mark Green

33

24

27

31

Male

Male

Female

Female

Male

.

.

.

.

.

.

.

.

.

.

.

.

28

255-7654

Figure 1.11

Example 1.15

Consider again the Newborn record in previous example 1.10. One can store a file

of such records in nine linear arrays, such as

NAME, SEX, MONTH, DAY, YEAR, FATHERNAME,

FATHERAGE, MOTHERNAME, MOTHERAGE

one array is for each elementary data item. Here we must use different variable

names for the name and age of the father and mother, which was not necessary in

the previous example 1.14. Again, we assume that the arrays are parallel, i.e., that

for a fixed subscript K, the elements

NAME[K], SEX[K], MONTH[K],.., MOTHERAGE[K]

belong to the same record.

Consider an elementary school keeps a record for each student which contains the

following data: Name, Telephone Number, Father, Mother, Siblings. Here Father,

Mother, Siblings contain, respectively, the names of the students father, mother,

and brothers or sisters attending the same school. Three such records may be as

follows.

Adams, John;

Bailey, Susan;

Sami, Mohammed;

345-6677;

222-1234;

567-3344;

Richard;

XXXX;

Abdul;

Mary;

Sheela;

Fathima;

Jame,William,Donald

XXXX

Aliya

Here, XXXX means that the parent has died or is not living with the student, or that

the student has no sibling at the school. From the above records, we could learn that

they are in variable-lengths, since the data element Siblings can contain zero or

more names.

Check your progress 4

1. Define record.

2. Record differs from a linear array

True/False

3. Records may contain . data.

4. List the programming languages which have record structures built-in.

1.5

STACKS

Stacks and Queues are two data structures that allow insertions and deletions operations

only at the beginning or the end of the list, not in the middle. A stack is a linear structure

in which items may be added or removed only at one end. Figure 1.12 pictures three

everyday examples of such a structure: a stack of dishes, a stack of pennies and a stack of

folded towels.

Stacks are also called last-in first-out (LIFO) lists. Other names used for stacks are

"piles" and "push-down lists. Stack has many important applications in computer science.

The notion of recursion is fundamental in computer science. One way of simulating

recursion is by means of stack structure. Let we learn the operations which are performed

by the Stacks.

Figure 1.12

A stack is a list of elements in which an element may be inserted or deleted only at one

end, called the top of the stack. This means, that elements which are inserted last will be

removed first. Special terminology is used for two basic operation associated with

stacks:

(a) "Push" is the term used to insert an element into a stack.

(b) "Pop" is the term used to delete an element from a stack.

Apart from these operations, we could perform these operations on stack: (i) Create a

stack (ii) Check whether a stack is empty, (iii) Check whether a stack is full (iv) Initialize

a stack (v) Read a stack top (vi) Print the entire stack.

Example 1.16

Suppose that following 6 elements are pushed, in order, onto an empty stack:

AAA, BBB,

CCC,

DDD, EEE,

FFF

Figure 1.13(a), (b) & (c) shows three ways of picturing such a stack

(a)

(b)

TOP

1

2

3

4

5

6

.

.

N-1

N

FFF

EEE

DDD

CCC

BBB

AAA

AAA

BBB

CCC

DDD

EEE

FFF

TOP

(c)

1

AAA BBB

3

CCC

DDD EEE

N-1

FFF

TOP

Stacks will be maintained by a linear array STACK; a pointer variable TOP, which

contains the location of the top element of the stack; and a variable MAXSTK which

gives the maximum number of elements that can be held by the stack. The condition

TOP=0 or TOP=NULL will indicate that the stack is empty.

Figure 1.14 pictures such an array representation of a stack. Since TOP=3, the stacks has

three elements, XXX, YYY and ZZZ; and since MAXSTK=8, there is room for 5 more

items in the stack.

MAXSTK 8

7

6

5

4

ZZZ

TOP 3

YYY

2

1

XXX

Figure 1.14

The operation of adding (pushing) an item onto a stack and the operation of removing

(popping) and item from a stack are implemented by the following procedures 1.1 and

1.2, called PUSH and POP respectively. When we adding a new element, first, we must

test whether there is a free space in the stack for the new item; if not, then we have the

condition known as overflow. Analogously, in executing the procedure POP, we must

first test whether there is an element in the stack to be deleted; if not; then we have the

condition known as underflow.

Procedure 1.1: PUSH(STACK, TOP, MAXSTK, ITEM)

This procedure pushes an ITEM onto a stack.

1. [Stack already filled?]

If TOP=MAXSTK, then: Print: OVERFLOW, and Return.

2. Set TOP:=TOP + 1 [Increases TOP by 1.]

3. Set STACK[TOP] := ITEM. [Inserts ITEM in new TOP position.]

4. Return.

Procedure 1.2: POP(STACK, TOP, ITEM)

This procedure deletes the top element of STACK and assigns it

to the variable ITEM.

1. [Stack has an item to be removed?]

If TOP = 0), then: Print: UNDERFLOW, and Return.

2. Set ITEM := STACK[TOP]. [Assigns TOP element to ITEM.]

3. Set TOP := TOP-1. [Decreases TOP by 1]

4. Return.

Frequently, TOP and MAXSTK are global variables; hence the procedures may be called

using only

PUSH(STACK, ITEM) and POP(STACK, ITEM)

respectively. We note that the value of TOP is changed before the insertion in PUSH but

the value of TOP is changed after the deletion in POP.

Example 1.17 (a) & (b)

a. Consider the stack in Fig.1.14. We perform the operation PUSH(STACK, WWW):

2. TOP=3+1 = 4.

3. STACK[TOP] = STACK[4] = WWW.

Note that WWW is now the top element in the stack.

b. Consider again the stack in Fig 1.14. This time we simulate the operation

POP(STACK; ITEM):

1.

2.

3.

4.

ITEM = CCC

TOP = 3 - 1 = 2.

Return.

Observe that STACK[TOP] = STACK[2] = BBB is now the top element in the

stack

1.5.3.1 Arithmetic Expression; Polish Notation

Let Q be an arithmetic expression involving constants and operations. This section gives

an algorithm which finds the value of Q by using reverse Polish (postfix) notation. We

will see that the stack is an essential tool in this algorithm.

Recall that the binary operations in Q may have different levels of precedence.

Exponentiation ( )

Multiplication ( * ) and division ( / )

Addition ( + ) and subtraction ( - )

Highest :

Next highest :

Lowest :

Example 1.18

2 3 + 5 * 2 2 - 12 / 6

First we evaluate the exponentiations to obtain

8 + 5 * 4 - 12 / 6

Then we evaluate the multiplication and division to obtain 8 + 20 - 2. Last, we evaluate

the addition and subtraction to obtain the final result, 26. Observe that the expression is

traversed three times, each time corresponding to a level of precedence of the operations.

Polish Notation

In polish notation, the operator symbol is placed between its two operands. For example,

A+B

C-D

E*F

G/H

This is called infix notation. With this notation, we must distinguish between

(A + B) * C

and

A + (B * C)

precedence levels discussed above. Accordingly, the order of the operators and operands

in an arithmetic expression does not uniquely determine the order in which the operations

are to be performed.

Let we translate, step by step, the following infix expressions into Polish notation using

brackets [ ] to indicate a partial translation:

(A + B) * C = [+AB] * C = * +ABC

A + (B * C) = A + [*BC] = +A*BC

(A + B) / (C - D) = [+AB] / [-CD] = / +AB CD

The fundamental property of Polish notation is that the order in which the operations are

to be performed is completely determined by the positions of the operators and operands

in the expression. There is no need of parentheses when writing expressions in Polish

notation. The computer usually evaluates an arithmetic expression written in infix

notation in two steps. First, it converts the expression to postfix notation, and then it

evaluates the postfix expression.

Suppose P is an arithmetic expression written in postfix notation. The following

algorithm 1.3 uses a STACK to hold operands, evaluates P

Algorithm 1.3: This algorithm finds the VALUE of an arithmetic expression P written in

postfix notation.

2. Scan P from left to right and repeat Steps 3 and 4 for each element of

until the sentinel ")" is encountered.

If an operand is encountered, put it on STACK.

If an operator (x) is encountered, then:

a) Remove the two top elements of STACK, where A is the top

element and B is the next-to-top element.

b) Evaluate B (x) A.

c) Place the result of (b) back on STACK

[End of If structure.]

[End of Step 2 loop.]

5. Set VALUE equal to the top element on STACK.

6. Exit.

3.

4.

Example 1.19

Consider the following arithmetic expression P written in postfix notation:

P:

5, 6, 2, +, *, 12, 4, /, -

(Commas are used to separate the elements of P so that 5, 6, 2 is not interpreted as the

number 562.) The equivalent infix expression Q follows:

Q:

5 * (6 + 2) - 12 / 4

We evaluate P using algorithm 1.3. First we add a sentinel right parenthesis at the end of

P to obtain

P:

5,

(1)

6,

(2)

2,

(3)

+,

(4)

*,

(5)

12,

(6)

4,

(7)

/,

(8)

-,

(9)

)

(10)

Figure 1.15 shows the contents of STACK as each element of P is scanned. The final

number in STACK, 37, which is assigned to VALUE when the sentinel ) is scanned, is

the value of P.

Symbol Scanned

STACK

(1) 5

5

(2) 6

5, 6

(3) 2

5, 6, 2

(4) +

5, 8

(5) *

40

(6) 12

40, 12

(7) 4

40, 12, 4

(8)

/

40, 3

(9) 37

(10) )

Figure 1.15

Let Q be an arithmetic expression written in infix notation. The following algorithm 1.4

transforms the infix expression Q into its equivalent postfix expression P. The algorithm

uses a stack to temporarily hold operators and left parentheses. The postfix expression P

will be constructed from left to right using the operands from Q and the operators which

are removed from STACK. We begin by pushing a left parenthesis onto STACK and

adding a right parenthesis at the end of Q. The algorithm is completed when STACK is

empty.

Algorithm 1.4: POLISH(Q,P)

Suppose Q is an arithmetic expression written in infix notation. This algorithm finds the

equivalent postfix expression P

1. Push "(" onto STACK, and add ")" to the end of Q.

2. Scan Q from left to right and repeat Steps 3 to 6 for each element of Q until the

STACK is empty

3.

If an operand is encountered, add it to P.

4.

If a left parenthesis is encountered, push it onto STACK.

5.

If an operator (x) is encountered, then:

a) Repeatedly pop from STACK and add to P each operator (on the

top of STACK) which has the same precedence as or higher

precedence than (x)

b) Add (x) to STACK.

[End of If structure.]

6.

If a right parenthesis is encountered, then:

a) Repeatedly pop from STACK and add to P each operator (on top

of STACK) until a left parenthesis is encountered.

b) Remove the left parenthesis. [Do not add the left parenthesis to

P.]

[End of If structure.]

[End of Step 2 loop.]

7. Exit.

Example 1.20

Let we see the example. Consider the following arithmetic infix expression

Q:

A + (B * C - (D / E F) * G) * H

We transform Q using algorithm 1.4 into its equivalent postfix expression P. First we

push "(" onto STACK, and then we add ")" to the end of Q to obtain:

Q:

A + (

B *

C -

( D / E F ) *

G )

* H )

We may observe that the Figure 1.16 shows the status of STACK and of the string P as

each element of Q is scanned.

Symbol Scanned

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

(11)

(12)

(13)

(14)

(15)

(16)

(17)

(18)

(19)

(20)

A

+

(

B

*

C

(

D

/

E

F

)

*

G

)

*

H

)

STACK

(

(+

(+(

(+(

(+(*

(+(*

(+((+(-(

(+(-(

(+(-(/

(+(-(/

(+(-(/

(+(-(/

(+((+(-*

(+(-*

(+

(+*

(+*

Expression P

A

A

A

AB

AB

ABC

ABC*

ABC*

ABC*D

ABC*D

ABC*DE

ABC*DE

ABC*DEF

ABC*DEF/

ABC*DEF/

ABC*DEF/G

ABC*DEF/G*ABC*DEF/G*ABC*DEF/G*-H

ABC*DEF/G*-H*+

Figure 1.16

1.5.3.2

Let A be a list of n data items. Sorting A refers to the operation of rearranging the

elements of A so that they are in some logical order, such as numerically ordered when A

contains numerical data, or alphabetically ordered when A contains character data. Let

we discuss the subject of sorting, including various sorting algorithms, is in Unit II.

Thus, Stacks are frequently used in evaluation of arithmetic expressions written in infix

notation. Polish notations are evaluated by stacks. Conversions of different notations

(Prefix, Postfix, Infix) into one another are performed using stacks. Stacks are widely

used inside computer when recursive functions are called. Let we discuss it in next

section.

1.5.4 RECURSION

Recursion is an important concept in computer science. Many algorithms can be best

described in terms of recursion. Let we discuss, how recursion may be useful tool in

developing algorithms for specific problems. Consider P is a procedure containing either

a Call statement to itself or a Call statement to a second procedure that may eventually

result in a Call statement back to the original procedure P. Then P is called a recursive

procedure. A recursive procedure must have the following two properties:

1) There must be certain criteria, called base criteria, for which the procedure

does not call itself.

2) Each time the procedure does call itself (directly or indirectly); it must be

closer to the base criteria.

A recursive procedure with these two properties is said to be well-defined. Similarly, a

function is said to be recursively defined if the function definition refers to itself.

The following examples should help us to clarify these ideas.

Factorial Function

The product of the positive integers from 1 to n, inclusive, is called "n factorial" and is

usually denoted by n!:

n! = 1 . 2 . 3 . . . (n - 2) (n - 1) n

It is also defined that 0! = l. Thus we have,

5! = 1 . 2 . 3 . 4 . 5 = 120

and

6! = 1 . 2 . 3 . 4 . 5 . 6 = 720

This is true for every positive integer n; that is, n! = n . (n - 1)! Accordingly, the factorial

function may also be defined as follows:

Definition: (Factorial Function)

(a)

(b)

If n = 0, then n!= 1.

If n > 0, then n! = n . (n-1)!

Example 1.21

Let us calculate 4! using the recursive definition. This calculation requires the following

nine steps:

(1) 4! = 4 . 3!

(2)

3! = 3 . 2!

(3)

2! = 2 . 1!

(4)

1! = 1 . 0!

(5)

0! = 1

(6)

1! = 1 . 1 = 1

(7)

2! = 2 . 1 = 2

(8)

3! = 3 . 2 = 6

(9) 4! = 4 . 6 = 24

Let we learn the procedure 1.3 that calculate n! This procedure calculates n! and returns

the value in the variable FACT.

Procedure 1.3: FACTORIAL (FACT, N)

1.

2.

3.

4.

Call FACTORIAL (FACT, N-1).

Set FACT := N * FACT.

Return.

The above procedure is a recursive procedure, since it contains a call statement to itself.

Let we discuss, how do stacks may be used to implement recursive procedures? Recall

that a subprogram can contain both parameters and local variables.

The parameters are the variables which receive values from objects in the calling

program, called arguments, and which transmit values back to the calling program.

Besides the parameters and local variables, the subprogram must also keep track of the

return address in the calling program. This return address is essential, since control. must

be transferred back to its proper place in the calling program At the time that the

subprogram is finished executing and control is transferred back to the calling program,

the values of the local variables and the return address are no longer needed.

If our subprogram is a recursive program then each level of execution of the subprogram

may contain different values for the parameters and local variables and for the return

address. Furthermore, if the recursive program does call itself then these current values

must be saved, since they will be used again when the program is reactivated.

Translation of a Recursive Procedure into a Non-recursive Procedure

Consider P is a recursive procedure. We assume that a recursive call to P comes only

from the procedure P. The translation of the recursive procedure P into a non-recursive

procedure works as follows. First of all, let we define:

1. A stack STPAR for each parameter PAR

2. A stack STVAR for each local variable VAR

3. A local variable ADD and a stack STADD to hold return addresses

Each time there is a recursive call to P, the current values of parameters and local

variables are pushed onto the corresponding stacks for future processing, and each time

there is a recursive return to stacks. The handling of the return address is done as follows.

Suppose the procedure P contains a recursive Call P in Step K. Then there are two return

address associated with the execution of this Step K:

1. There is the current return address of the procedure P, which will be used

when the current level of execution of P is finished executing.

2. There is the new return address K+1, which is the address of the step

following the Call P and which will be used to return to the current level of

execution of procedure P

Some texts push the first of these two addresses, the current return address, onto the

return address stack STADD, whereas some texts push the second address, the new return

address K+1, onto STADD. We will choose the latter method, since the translation of P

into a non-recursive procedure will then be simpler. This also means, in particular, that an

empty stack STADD will indicate a return to the main program that initially called the

recursive procedure P.

The algorithm 1.5 which translates the recursive procedure P into a non-recursive

procedure follows. It consists of three parts: (1) preparation, (2) translating each recursive

Call P in procedure P and (3) translating each Return in procedure P.

Algorithm 1.5:

(1)

Preparation.

a) Define a stack. STPAR for each parameter PAR, a stack STVAR for each

local variable VAR, and a local variable ADD and a stack STADD to

hold return addresses.

b) Set TOP := NULL.

a) Push the current values of the parameters and local variables onto the

appropriate stacks, and push the new return address [Step] K + 1 onto

STADD.

b) Rest the parameters using the new argument values.

c) Go to Step 1. [The beginning of the procedure P.]

(3) Translation of "Step J. Return."

a) If STADD is empty, then: Return. [Control is returned to the main

program.]

b) Restore the top values of the stacks. That is, set the parameters and local

variables equal to the top values on the stacks, and set ADD equal to the

top value on the stack STADD.

c) Go to Step ADD.

Observe that the translation of "Step K. Call P" does depend on the value of K, but that

the translation of "Step J. Return" does not depend on the value of J. According to that,

we need to translate only one Return statement, for example, by using

Step L. Return.

Go to Step L.

Check your Progress 5

1. Stacks are sometimes called FIFO lists

True/False

2. Stack allows Push and Pop from both ends

True/False

3. TOS (Top of the Stack) gives the bottom most element in the stack.

4. Multiple stacks can be implemented using

5. are evaluated by stacks.

6. Stack is used whenever a . function is called.

1.6

QUEUES

A queue is a linear structure in which element may be inserted at one end called the rear,

and the deleted at the other end called the front. Figure 1.17 pictures a queue of people

waiting at a bus stop. Queues are also called first-in first-out (FIFO) lists. An important

example of a queue in computer science occurs in a timesharing system, in which

programs with the same priority form a queue while waiting to be executed. Similar to

stack operations, operations that are define a queue.

(Figure 1.17)

Queues may be represented in the computer in various ways, usually by means of oneway lists or linear arrays. Queues will be maintained by a linear array QUEUE and two

pointer variables: FRONT, containing the location of the front element of the queue; and

REAR, containing the location of the rear element of the queue. The condition FRONT =

NULL will indicate that the queue is empty.

Figure 1.18, indicates the way elements will be deleted from the queue and the way new

elements will be added to the queue.

Figure 1.18: Array Representation of a Queue

Observe that whenever an element is deleted from the queue, the value of FRONT is

increased by 1; this can be implemented by the assignment

FRONT := FRONT + 1

Similarly, whenever an element is added to the queue, the value of REAR is increased by

1; this can be implemented by the assignment

REAR := REAR + 1

Assume QUEUE is circular, that is, that QUEUE[1] comes after QUEUE[N] in the array.

With this assumption, we insert ITEM into the queue by assigning ITEM to QUEUE[1].

Specifically, instead of increasing REAR to N+1, we reset REAR=1 and then assign,

QUEUE[REAR]:= ITEM

instead of increasing FRONT to N+1. Suppose that our queue contains only one element,

i.e., suppose that

FRONT = REAR = NULL

FRONT: = NULL

and

REAR:= NULL

The operation of adding an item onto a stack and the operation of removing (popping)

and item from a stack are implemented by the following procedures, called QINSERT

and QDELET respectively.

Procedure 1.4:

This procedure inserts an element ITEM into a queue.

1.

If FRONT = 1 and REAR = N, or if FRONT = REAR+1, then:

Write: OVERFLOW, and Return.

2. [Find new value of REAR.]

If FRONT = NULL, then: [Queue initially empty.]

Set FRONT:=1 and REAR:=1.

Else if REAR = N, then:

Set REAR:=1.

Else:

Set REAR := REAR + 1.

[End of If structure.]

3. Set QUEUE[REAR] := ITEM. [This inserts new element.]

4. Return.

Procedure 1.5:

This procedure deletes an element from a queue and assigns it to the

variable item.

1.

2.

3.

4.

If FRONT := NULL, then: Write: UNDERFLOW, and Return.

Set ITEM := QUEUE[FRONT].

[Find new value of FRONT.]

If FRONT = REAR, then: [Queue has only one element to start.]

Set FRONT := NULL and REAR := NULL.

Else if FRONT = N, then:

Set FRONT := 1.

Else:

Set FRONT := FRONT+1.

[End of If structure.]

Return

1.6.2 DEQUES

A deque (pronounced either "deck" or "dequeue") is a linear list in which elements can be

added or removed at either end but not in the middle. The term deque refers to the name

double-ended queue.

There are two variations of a deque - namely, an input-restricted deque and an outputrestricted deque - which are intermediate between a deque and a queue. An input-

restricted deque is a deque which allows insertions at only one end of the list but allows

deletions at both ends of the list; and an output-restricted deque is a deque, which allows

deletions at only one end of the list buy allows insertions at both ends of the list.

Figure 1.19 pictures two deques, each with 4 elements maintained in an array with N = 8

memory locations. The condition LEFT = NULL will be used to indicate that a deque is

empty.

DEQUE (a)

LEFT: 4

RIGHT : 7

1

AAA

BBB

CCC

DDD

WWW

XXX

DEQUE (b)

LEFT: 4

RIGHT : 7

YYY

ZZZ

4

5

(Figure 1.19)

A priority queue is a collection of elements such that each element has been assigned a

priority and such that the order in which elements are deleted and processed comes from

the following rules:

1) An element of higher priority is processed before any element of lower

priority.

2) Two elements with the same priority are processed according to the order in

which they were added to the queue.

Many application involving queues require priority queues rather than the simple FIFO

strategy. For elements of same priority, the FIFO order is used. For example, in a multiuser system, there will be several programs competing for use of the central processor at

one time. The programs have a priority value associated to them and are held in a

priority queue. The program with the highest priority is given first use of the central

processor.

Scheduling of jobs within a time-sharing system is another application of queues. In such

system many users may equest processing at a time and computer time divided among

these requests. The simlest approach sets up one queue that store all requests for

processing. Computer processes the request at the front of the queue and finished it

before starting on the next. Same approach is also used when several users want to use

the same output device, say a printer.

In a time sharing system, another common approach used is to process a job only for a

specified maximum length of time. If the program is fully processed within that time,

then the computer goes on to the next process. If the program is not completely

processed within the specified time, the intermediate values are stored and the remaining

part of the program is put back on the queue. This approach is useful in handling a mix

of long and short jobs.

1.7

LINKED LIST

A linked list, or one-way list, is a linear collection of data elements, called nodes, where

the linear order is given by means of pointers. That is, each node is divided into two

parts: the first part contains the information of the element, and the second part, called the

link field or next pointer field, contains the address of the next node in the list.

Figure 1.20 is a schematic diagram of a linked list with 6 nodes, Each node is pictured

with two parts. The left part represents the information part of the node, which may

contain an entire record of data items (e.g., NAME, ADDRESS,...). The right part

represents the Next pointer field of the node, and there is an arrow drawn from it to the

next node in the list. This follows the usual practice of drawing an arrow from a field to a

node when the address of the node appears in the given field. The pointer of the last node

contains special value, called the null pointer, which is any invalid address.

(Figure 1.20)

Let LIST be a linked list. Then LIST will be maintained in memory as follows. First of

all, LIST requires two linear arrays-we will call them here INFO and LINK - such that

INFO[K] and LINK[K] contain the information part and the next pointer field of a node

of LIST respectively. START contains the location of the beginning of the list, and a next

pointer sentinel-denoted by NULL-which indicates the end of the list.

The following examples of linked lists indicate that more than one list may be maintained

in the same linear arrays INFO and LINK. However, each list must have its own pointer

variable giving the location of its first node.

Example 1.22

Figure 1.21 pictures a linked list in memory where each node of the list contains a

single character. We can obtain the actual list of characters, or, in other words, the

string, as follows:

INFO

LINK

START

(Figure 1.21)

LINK[9] = 3, so INFO[3] = O is the second character.

LINK[3] = 6, so 1NFO[6] = (blank) is the third character.

LINK[6] = 11, so INFO[11] = E is the fourth character.

LINK[11] = 7, so INFO[7] = X is the fifth character.

LINK[7] = 10, so INFO[10] = I is the sixth character.

LINK[10] = 4, so INFO[4] = T is the seventh character.

LINK[4] = 0, so the NULL value, so the list has ended.

Let LIST be a linked list in memory stored in linear arrays INFO and LINK with START

pointing to the first element and NULL indicating the end of LIST. Suppose we want to

traverse LIST in order to process each node exactly once. Our traversing algorithm uses a

pointer variable PTR which points to the node that is currently being processed.

Accordingly, LINK[PTR] points to the next node to be processed. The assignment

PTR := LINK[PTR]

.

The details of the algorithm1.6 are as follows. Initialize PTR. Then process INFO[PTR],

the information at the first node. Update PTR by the assignment PTR:=LINK[PTR], and

then process INFO[PTR], the information at the second node and so on until

PTR=NULL, which signals the end of the list.

Algorithm 1.6: (Traversing a Linked List) Let LIST be a linked list in memory. This

algorithm traverses LIST, applying an operation PROCESS to each

element of LIST. The variable PTR points to the node currently being

processed.

1. Set PTR := START. [Initializes pointer PTR].

2. Repeat Steps 3 and 4 while PTR # NULL.

3.

Apply PROCESS to INFO[PTR].

4.

Set PTR:=LINK[PTR]. [PTR now points to the next node.]

[End of Step 2 loop.]

5. Exit.

The following procedure 1.6 presents about how traversing a linked list. They are similar

to algorithm 1.6.

Procedure 1.6:

This procedure prints the information at each node of the list.

1. Set PTR := START.

2. Repeat Steps 3 and 4 while PTR # NULL:

3.

Write: INFO[PTR].

4.

Set PTR := LINK[PTR]. [Updates pointer.]

[End of Step 2 loop.]

5. Return.

Procedure 1.7:

1. Set NUM := 0.

2. Set PTR := START.

3. Repeat Steps 4 and 5 while PTR # NULL.

4.

Set NUM := NUM+1.

5.

Set PTR:LINK[PTR].

[End of Step 3 loop.]

6. Return.

[Initializes counter.]

[Initializes pointer.]

[Increases NUM by 1]

[Updates pointer.]

Let LIST be a linked list in memory. We are given an ITEM of information. In this

section we are going to discuss the two searching algorithms for finding the location LOC

of the node where ITEM first appears in LIST.

The first algorithm 1.7 does not assume that the data in LIST are sorted. The second

algorithm 1.8 does assume that LIST is sorted. If ITEM is actually a key value and we

are searching through a file for the record containing ITEM, then ITEM can appear only

once in LIST.

LIST is Unsorted

The data in LIST are not necessarily sorted. Then one searches for ITEM in LIST by

traversing through the list using a pointer variable PTR and comparing ITEM with the

contents INFOR[PTR] of each node, one by one, of LIST. Before we update the pointer

PTR by

PTR := LINK[PTR]

we require two tests. First we have to check whether we reached the end of the list; i.e.,

PTR = NULL

INFO[PTR] = ITEM

Algorithm 1.7: SEARCH(INFO, LINK, START, ITEM, LOC)

LIST is a linked list in memory. This algorithm finds the location LOC

of the node where ITEM first appears in LIST, or sets LOC=NULL.

1. Set PTR := START.

2. Repeat Step 3 while PTR # NULL:

3.

If ITEM = INFO[PTR] then:

Set LOC := PTR and Exit.

Else:

Set PTR := LINK[PTR].[PTR now points to the next node.]

[End of If structure.]

[End of Step 2 loop.]

4. Set LOC:=NULL. [Search is unsuccessful.]

5. Exit.

LIST is Sorted

The data in the LIST are sorted. Again we search for ITEM in LIST by traversing the list

using a point variable PTR and comparing ITEM with the contents INFO[PTR] of each

node, one by one, of LIST. Here we can stop once ITEM exceeds INFO[PTR].

Algorithm 1.8: SRCHSL(INFO, LINK, START, ITEM, LOC)

LIST is a sorted list in memory. This algorithm finds the location LOC of

the node where ITEM first appears in LIST, or sets LOC=NULL.

1. Set PTR := START.

2. Repeat Step 3 while PTR#NULL:

3.

If ITEM < INFO[PTR], then:

Set PTR := LINK[PTR]. [PTR now points to next node]

Else if ITEM = INFO[PTR], then:

Set LOC := PTR, and Exit. [Search is successful.]

Else:

Set LOC := NULL, and Exit.[ITEM now exceeds INFO[PTR]].

[End of If structure.]

[End of Step 2 loop.]

4. Set LOC := NULL.

5. Exit.

Let LIST be a linked list with successive nodes A and B, as pictured in fig. 1.23(a).

Suppose a node N is to be inserted into the list between nodes A and B. The schematic

diagram of such an insertion appears in fig. 1.23(b). That is, node A now points to the

new node N, and node N points to node B, to which A previously pointed.

Insertion Algorithms

Let we discuss three insertion algorithms.

(a) The first one inserts a node at the beginning of the list.

(b) The second one inserts a node into after the node with a given location.

(c) The third one inserts a node into a sorted list.

In all the following algorithms assume that the linked list is in memory in the form

LIST(INFO, LINK, START; AVAIL) and that the variable ITEM contains the new

information to be added to the list. Since our insertion algorithms will use a node in the

AVAIL list, all of the algorithms will include the following steps:

(a)

Checking to see if space is available in the AVAIL list, If not, that is, if

AVAIL=NULL, then the algorithm will print the message OVERFLOW.

(b)

Removing the first node from the AVAIL list. Using the variable NEW to keep

track of the location of the new node, this step can be implemented by the pair

of assignments

NEW := AVAIL, AVAIL := LINK[AVAIL]

(c)

INFO[NEW] := ITEM

The linked list is not sorted. The algorithm 1.9 inserts the node at the beginning of the

list.

This algorithm inserts ITEM as the first node in the list.

1. [OVERFLOW?]

If AVAIL = NULL, then: Write: OVERFLOW and Exit.

2. [Remove first node from AVAIL list].

Set NEW := AVAIL and AVAIL := LINK [AVAIL].

3. [Copy new data into new node.]

Set INFO[NEW] := ITEM.

4. [Make new node now to point the original first node.]

Set LINK[NEW] := START.

5. [Change START so it points to the new node.]

Set START := NEW

6. Exit.

Inserting after a Given Node

Suppose we are given the value of LOC where either LOC is the location of a node A in a

linked LIST or LOC=NULL. The following is an algorithm which inserts ITEM into

LIST so that ITEM follows node A or, when LOC = NULL, so that ITEM is the first

node:

Let N denote the new node (whose location is NEW). If LOC = NULL, then N is inserted

as the first node in LIST as in algorithm 1.9. We let node N point to node B (which

originally followed node A) by the assignment

LINK[NEW] := LINK[LOC]

and we let node A point to the new node N by the assignment

LINK[LOC] := NEW

The algorithm is as follows.

Algorithm 1.10: INSLOC(INFO, LINK, START, AVAIL, LOC, ITEM)

This algorithm inserts ITEM so that ITEM follows the node with

location LOC or inserts ITEM as the first node when LOC or inserts

ITEM as the first node when LOC=NULL.

1. [OVERFLOW?] If AVAIL=NULL, then: Write: OVERFLOW, and Exit.

2. [Remove first node from AVAIL list}

Set NEW := AVAIL and AVAIL := LINK[AVAIL].

3. [Copy new data into new node.] Set INFO[NEW] := ITEM.

4. If LOC = NULL, then: [Insert as first node.]

Set LINK[NEW] := START and START = NEW.

Else: [Insert after node with location LOC.]

Set LINK[NEW] := LINK [LOC] and LINK[LOC]:=NEW.

[End of If structure.]

5. Exit.

Suppose ITEM is to be inserted into a sorted linked LIST. The linked list is not sorted.

The algorithm inserts the node into a Sorted Linked List. The ITEM must be inserted

between nodes A and B so that

INFO(A) < ITEM < INFO(B)

The following procedure finds the location LOC of node A, that is, which finds the

location LOC of the last node in LIST whose value is less than ITEM.

Traverse the list, using a pointer variable PTR and comparing ITEM with INFO[PTR] at

each node. While traversing, keep track of the location of the preceding node by using a

pointer variable PREV, as pictured in figure 1.25. Thus SAVE and PTR are updated by

the assignments

SAVE := PTR

and

PTR := LINK[PTR]

(Figure 1.25)

The traversing stops as soon as ITEM < INFO[PTR]. Then PTR points to node B. so

SAVE will contain the location of the node A.

Procedure 1.8: FINDA(INFO, LINK, START, ITEM, LOC)

This procedure finds the location LOC of the last node in a sorted list

such that INFO[LOC] < ITEM, or sets LOC=NULL.

1. [List empty?]

If START = NULL, then: Set LOC := NULL, and Return.

2. [Special case?]

If ITEM <INFO[START], then: Set LOC := NULL, and Return.

3. [Initializes pointers.]

Set PREV := START and PTR := LINK[START].

4. Repeat Steps 5 and 6 while PTR # NULL.

5.

If ITEM < INFO[PTR],then:

Set LOC := PREV and Return.

[End of If structure.]

6.

[Updates pointers] Set PREV := PTR and PTR := LINIC[PTR].

[End of Step 4 loop.]

7. Set LOC := SAVE

8. Return.

Algorithm 1.11: INSSRT(INFOR, LINK, START, AVAIL, ITEM)

This algorithm inserts ITEM into a sorted linked list.

1. [Use Procedure 1.8 to find the location of the node preceding ITEM]

Call FINDA(INFO, LINK, START, ITEM, LOC).

2. [Use Algorithm 1.10 to insert ITEM after the node with location LOC]

Call INSLOC(INFO, LINK, START, AVAIL, LOC, ITEM).

3. Exit.

Let LIST be a linked list with a node N between nodes A and B, as pictured in

Fig.1.26(a). Suppose node N is to be deleted from the linked list. The schematic diagram

of such a deletion appears in Fig.1.26(b). The deletion occurs as soon as the next pointer

field of node A points to node B. (Accordingly, when performing deletions, one must

keep track of the address of the node which immediately precedes the node that is to be

deleted.)

Deletion Algorithms

We discuss two deletion algorithms.

(a) The first one deletes a node following a given node.

(b) The second one deletes the node with a given ITEM of information.

All our algorithms assume that the linked list is in memory in the form LIST(INFO,

LINK, START; AVAIL) and that the variable ITEM contains the new information to be

added to the list.

All of our deletion algorithms will return the memory space of the deleted node N to the

beginning of the AVAIL list. Accordingly, all of our algorithms will include the

following pair of assignments, where LOC is the deleted node N:

LINK[LOC] := AVAIL and then AVAIL := LOC

Deleting the Node Following a Given Node

Let LIST be a linked list in memory. Suppose we are given the location LOC of a node N

in LIST. Furthermore, suppose we are given the location LOCP of the node proceeding N

or, when N is the first node, we are given LOCP=NULL. The following algorithm 1.12

deletes N from the list.

This algorithm deletes the node N with location LOC. LOCP is the location of

the node which precedes N or, when N is the first node, LOCP=NULL.

1. [Deletes first node.]

If LOCP = NULL, then:

Set START:=LINK[START].

Else: [Deletes node N.]

Set LINK[LOCP] := LINK[LOC].

[End of If structure.]

2. [Return deleted node to the AVAIL list.]

Set LINK[LOC] := AVAIL and AVAIL := LOC.

3. Exit.

Let LIST be a linked list in memory. Suppose we are given an ITEM of information and

we want to delete from the LIST the first node N which contains ITEM. (If ITEM is a

key value, then only one node can contain ITEM.). First we give a procedure 1.9 which

finds the location LOC of the node N containing ITEM and the location LOCP of the

node preceding node N. If N is the first node, we set LOCP = NULL, and if ITEM does

not appear in LIST, we set LOC = NULL. (This procedure is similar to Procedure 1.8)

Traverse the list, using a pointer variable PTR and comparing ITEM with INFO[PTR] at

each node. While traversing, keep track of the location of the preceding node by using a

pointer variable SAVE as in fig.1.25. Thus SAVE and PTR are updated by the

assignments

SAVE := PTR

and

PTR := LINK[PTR]

The traversing stops as soon as ITEM = INFO[PTR]. Then PTR contains the location

LOC of node N and SAVE contains the location LOCP of the node preceding N.

The formal statement of our procedure follows. The cases where the list is empty or

where INFO[START]=ITEM(i.e., where node N is the first node) are treated separately,

since they do not involve the variable SAVE.

Procedure 1.9: FINDB(INFO, LINK, START, ITEM, LOC, LOCP)

This procedure finds the location LOC of the first node N which contains ITEM

and the location LOCP of the node preceding N. If ITEM does not appear in the

list, then the procedure sets LOC=NULL; and if ITEM appears in the first node,

then it sets LOCP=NULL.

1. [List empty?]

If START = NULL, then:

Set LOC:=NULL and LOCP:=NULL, and Return.

[End of If structure.]

2. [ITEM in first node?]

If INFO[START] = ITEM, then:

Set LOC:=START and LOCP=NULL, and Return.

[End of If structure.]

3. [Initializes pointers.]

Set SAVE := START and PTR := LINK[START].

4. Repeat Steps 5 and 6 while PTR # NULL.

5. If INFO[PTR] = ITEM, then:

Set LOC := PTR and LOCP := SAVE, and Return.

[End of If structure.]

6. [Updates pointers.]

Set SAVE := PTR and PTR := LINK[PTR].

[End of Step 4 loop.]

7. [Search unsuccessful.]

Set LOC := NULL.

8. Return.

This algorithm deletes from a linked list the first node N which contains the

given ITEM of information.

1. [Use Procedure 1.9 to find the location of N and its preceding node.]

Call FINDB(INFO, LINK, START, ITEM, LOC, LOCP)

2. [Item not found?]

If LOC = NULL, then:

Write: ITEM not in list, and Exit.

3. [Delete node.]

If LOCP = NULL, then:

Set START := LINK[START]. [Deletes first node.]

Else:

Set LINK[LOCP] := LINK[LOC].

[End of If structure.]

4. [Return deleted node to the AVAIL list.]

Set LINK[LOC] := AVAIL and AVAIL = LOC.

5. Exit.

A header linked list is a linked list which always contains a special node, called the

header node, at the beginning of the list. The following are two kinds of widely used

header lists:

1. A grounded header list is a header list where the last node contains the null

pointer.

2. A circular header list is a header list where the last node points back to the header

node.

Figure 1.27 contains schematic diagrams of these header lists. Unless otherwise stated or

implied, our header lists will always be circular. Accordingly, in such a case, the header

node also acts as a sentinel indicating the end of the list.

(Figure 1.27)

Observe that the list pointer START always points to the header node. Hence,

LINK[START] = NULL indicates that a grounded header list is empty, and

LINK[START] = START indicates that a circular header list is empty.

The following are the various operations performed on a circular header list.

1. Traversing a circular header list

2. Searching in a circular header list

3. Deleting from a circular header list

Let we discuss the respective algorithms below.

Algorithm 1.14: (Traversing a Circular Header List) Let LIST is a circular header

list in memory. This algorithm traverses LIST, applying an

operation PROCESS to each node of LIST

1. [Initializes the pointer PTR.]

Set PTR := LINK[ START].

2. Repeat Steps 3 and 4 while PTR # START:

3. Apply PROCESS to INFO[PTR].

4. [PTR now point to the next node.]

Set PTR := LINK[PTR].[PTR now points to the next node]

[End of Step 2 loop.]

5. Exit.

LIST is circular header list in memory. This algorithm finds the

location LOC of the node where ITEM first appears in LIST or

sets LOC = NULL.

1. Set PTR := LINK[START].

2. Repeat while INFO[PTR] # ITEM and PTR # START:

Set PTR := LINK[PTR].[PTR now points to the next node]

[End of loop]

3. If INFO[PTR] = ITEM, then:

Set LOC := PTR.

Else:

Set LOC := NULL.

[End of If structure]

4. Exit.

The following procedure 1.10 finds the location LOC of the first node N which contains

ITEM and also the location LOCP of the node preceding N.

1. [Initializes pointers.]

Set PREV := START and PTR := LINK[START].

2. [Updates pointers.]

Repeat while INFO[PTR] # ITEM and PTR # START.

Set PREV := PTR and PTR := LINK[PTR].

[End of loop.]

3. If INFO[PTR] = ITEM, then:

Set LOC := PTR and LOCP := PREV.

Else:

Set LOC :=NULL and LOCP := PREV.

[End of If structure.]

4. Exit.

The following algorithm deletes the first node N which contains ITEM in a circular header list.

Algorithm 1.16: DELLOCHL(INFO, LINK, START, AVAIL, ITEM)

1. [Use Procedure 1.10 to find the location of N and its preceding node]

Call FINDBHL(INFO, LINK, START, ITEM; LOC, LOCP).

2. [Item not found?]

If LOC = NULL, then:

Write: ITEM not in list, and Exit.

3. [Delete the node.]

Set LINK[LOCP] := LINK[LOC].

4. [Return deleted node to the AVAIL list.]

Set LINK[LOC] := AVAIL and AVAIL := LOC.

5. Exit.

Remark:There are two other variations of linked lists which sometimes appears in the

literature:

1. A linked list whose last node points back to the first node instead of

containing the null pointer, called a circular list

2. A linked list which contains both a special header node at the beginning of

the list and a special trailer node at the end of the list.

Figure 1.28 contains schematic diagrams of these lists.

(Figure 1.28)

Let we discuss a two-way list, which can be traversed in two directions, either

1. in the usual forward direction from the beginning of the list to the end, or

2. in the backward direction from the end of the list to the beginning.

Furthermore, given the location LOC of a node N in the list, one now has immediate

access to both the next node and the preceding node in the list. This means, in particular,

we may able to delete N from the list without traversing any part of the list.

A two-way list is a linear collection of data elements, called nodes, where each node N is

divided into three parts:

1. An information field INFO which contains the data of N

2. A pointer field FORW which contains the location of the next node in the list

3. A pointer field BACK which contains the location of the preceding node in

the list

The list also requires two list pointer variables: FIRST, which points to the first node in

the list, and LAST, which points to the last node in the list. Figure 1.29 contains a

schematic diagram of such a list. Observe that the null pointer appears in the FORW field

of the last node in the list and also in the BACK field of the first node in the list.

Observe that, using the variable FIRST and the pointer field FORW, we can traverse a

two-way list in the forward direction. On the other hand, using the variable LAST and the

pointer field BACK, we can also traverse the list in the backward direction.

Suppose LOCA and LOCB are the locations, of nodes A and B in a two-way list

respectively. Then the way that the pointers FORW and BACK are defined gives us the

following:

Pointer property:

FORW[LOCA] = LOCB

if and only if

BACK[LOCB] = LOCA

In other words, the statement that node B follows node A is equivalent to the statement

that node A precedes rode B.

Two-Way Header Lists

The advantages of a two-way list and a circular header list may be combined into a twoway circular header list as pictured in Figure.1.30. The list is circular because the two

end nodes point back to the header node. Observe that such a two-way list requires only

one list pointer variable START, which points to the header node. This is because the two

pointer in the header node point to the two ends of the list.

Traversing: Suppose we want to traverse LIST in order to process each node exactly.

once. Then we can use algorithm 1.6, if LIST is an ordinary two-way list, or we can use

algorithm 1.11, if LIST contains a header node. Here it is of no advantage that the data

are organized as a two-way list rather than as a one-way list.

Searching: Suppose we are given an ITEM of information - a key value - and we want to

find location LOC of ITEM in LIST. Then we can use algorithm 1.7, if LIST is an

ordinary two-way list, or we can use algorithm 1.15 if LIST has a header node. Here the

main advantage is that we can search for ITEM in the backward direction if we have

reason to suspect that ITEM appears near the end of the list. For example, LIST is list of

names sorted alphabetically. If ITEM = Smith, then we would search LIST in the

backward direction, but if ITEM = Davis, then we would search LIST in the forward

direction.

Deleting: Suppose we are given the location LOC of a node N in LIST, and suppose we

want to, delete N from the list. We assume that LIST is a two-way circular header list.

Note that BACK[LOC] and FORW[LOC] are the locations, of the nodes which precede

and follow node N respectively. Accordingly, as pictured in Fig. 1.31, N is deleted from

the list by changing the following pair of pointers:

FORW[BACK[LOC]] := FORW[LOC]

and

The deleted node N is then returned to the AVAIL list by the assignments:

FORW[LOC] := AVAIL

and

AVAIL := LOC.

1. [Delete Node.]

Set FORW[BACK[LOC]] := FORW[LOC]and

BACK[FORW[LOC]] := BACK[LOC].

2. [Return node to AVAIL list.]

Set FOR2[LOC] := AVAIL and AVAIL := LOC.

3. Exit.

Here we see one main advantage of a two-way list: If the data were organized as a oneway list, then, in order to delete N, we would have to traverse the one-way list to find the

location of the node preceding N.

Inserting. Suppose we are given the locations LOCA and LOCB of adjacent nodes A and

B in LIST, and suppose we want to insert a given ITEM of information between nodes A

and B. As with a one-way list, first we remove the first node N from the AVAIL list,

using the variable NEW to keep track of its location, and then we copy the data ITEM

into the node N; that is, we set:

NEW := AVAIL,

AVAlL := FORW[AVAIL],

INFO[NEW] := ITEM

Algorithm 1.18: INSTWL(INFO,FORW,BACK,START, AVAIL,LOCA,LOCB,ITFM)

1. [OVERFLOW?]

If AVAIL = NULL, then,:

Write: OVERFLOW, and Exit.

2. [Remove node from AVAIL list and copy new data into node.]

Set NEW := AVAlL, AVAIL := FORW[AVAIL],

INFO[NEW]:=ITEM.

3. [Insert node into list.]

Set FORW[LOCA] := NEW, FORW[NEW] := LOCB,

BACK[LOCB] := NEW,

BACK[NEW] := LOCA.

4. Exit.

Algorithm 1.18 assumes that LIST contains a header node. Hence LOC or LOCB may

point to the header node, in which case N will be inserted as the first node or the last

node. If LIST does not contains a header node, then we must consider the case that

LOCA = NULL and N is inserted as the first node in the list, and the case that

LOCB = NULL and N is inserted as the last node in the list.

Check your progress 6

1. Define the following terms Deque, Priority Queue

2. What are the elements available in List?

3. What do you mean by Linked List?

1.8

SUMMARY

Arrays help to solve problems that require keeping track of many pieces of data. We must

also learn about a special kind of arrays called sparse arrays. In a sparse array, a large

proportion of the elements are zero, but those which are non-zero randomly distributed.

Arrays are used in programming languages for holding group of elements all of the same

kind. Vectors, matrices, chessboards, networks, polynomials, etc. can be represented as

arrays.

A stack is a list in which retrievals, insertion, and deletion can take place at the same

position. It follows the last in first out (LIFO) mechanism. Compiler implements

recursion by generating code for creating and maintaining an activation stack, i.e. a run

time stack that holds the state of each active sub program. A queue is a list in which

retrievals and deletion can take place at one end and insertions occur at another end. It

follows first in first out (FIFO) order. Queues are employed in many situations. The

items on a queue may be vehicle waiting at a crossing, car waiting at the service station,

customers in a check-out line at a departmental store, etc.

List structures allow us to store individual data elements. In Linked Lists these elements

are interconnected by pointers. Storage is available for allocation on peripheral devices

and in the main memory. It is managed either by means of a bit-table, table of contents

file or by Linked Lists. Space is allocated either using the best fit or first fit algorithm.

Free space management is done by garbage collection which relocates fragmented free

space and compact it to get a contiguous chunk of free space.

Tree is one of the most widely used data structure employed for representing various

problems. Binary tree are the special case of trees which have atmost two children.

Binary tree mostly implemented using link lists. Various tree traversal mechanisms

include inorder, preorder and postorder. These tree traversals can be implemented using

recursive procedures and non-recursive procedures. Binary trees have wider applications

in two way decision making problems which use Y/N, T/F etc.

1.9

SOLUTIONS/ANSWERS

1. Data are simply values or set of values.

2. A data item is either the value of a variable or a constant. A data item that does

not have subordinate data items is called an elementary item. A data item that is

composed of one or more subordinate data items is called a group item.

3. True

4. Fixed-length records.

5. True

1.

2.

3.

4.

5.

Subscript and subscript variable

True

Non-linear Category

A stack, last-in-first-out (LIFO), is a linear list in which items may be inserted or

removed only at one end called the top of the stack. A queue, first-in-first-out

(FIFO), is a linear list in which deletions can take place only at the front of the

list, and insertions can take place only at the rear of the list.

6. Compiler construction, database design, windows, operating system programs,

etc.

Check Your Progress 3

1.

2.

3.

4.

5.

6.

False.

downward refers to locations with larger subscripts.

upward refers to location with smaller subscripts.

Row-major order

Column-major order

1.

2.

3.

4.

True

Non-homogeneous data

PL/1, Pascal and COBOL

1.

2.

3.

4.

5.

6.

False

False

False

Arrays or Pointers

Postfix Expressions

Recursive

1. A deque (pronounced either "deck" or "dequeue") is a linear list in which

elements can be added or removed at either end but not in the middle. The term

deque refers to the name double-ended queue.

2. A priority queue is a collection of elements such that each element has been

assigned a priority

3. List contains two parts. The first part contains the information of the element, and

the second part, called the link field or next pointer field, contains the address of

the next node in the list.

4. Linked List is a linear collection of data elements, called nodes, where the linear

order is given by means of pointers.

1.10

FURTHER READINGS

Aaron M Tenenbau, Second Edition, PHI Publications.

2. Algorithms + Data Structures = Programs by Niklaus Wirth, PHI publications.

3. Fundamentals of Data Structures in C++ by E.Horowitz, Sahni and D.Mehta;

Galgotia Publications.

4. Data Structures and Program Design in C by Kruse, C.L.Tonodo and B.Leung;

Pearson Education.

5. Fundamentals of Data Structures in C by R.B. Patel, PHI Publications.

6. Data Structures and Algorithms, V.Aho, Hopcropft, Ullman, LPE.

7. Data Structures, Seymour Lipschutz, Schaums Outline Series.

- Autonomous Robotics by NelsonTransféré parNelson Khuraijam
- Matrix Row MajorTransféré parapi-26849388
- cscTransféré parmrrajubai123
- mathcadTransféré parKeri Mason
- DSA 7.pdfTransféré parNishu Rave
- sab80c166Transféré parwtn2013
- Interview questionsTransféré parValakonda Nikitha
- Curriculum eBook FREETransféré parAnonymous tAS3TIS
- 19640287Transféré parAshiqueHussain
- Euro Class 12 Comp PracticalTransféré pardheeraj
- 04 Lists StacksTransféré parapi-3799621
- B3.2-R3 syllabusTransféré parapi-3782519
- ConversionTransféré parUmer Iftikhar Ahmed
- Lenguaje Ensamblador Irvine 5a EdiciónTransféré parClaudio Guaita
- 1 DeterminantsTransféré parManish Mishra
- K01149_20181002125220_Updated-PRO FORMA MTS3023 A181 (1)Transféré parIrfan Fazail
- C++ PROGRAMSTransféré parNadeemAkther
- Wiles1994 - In Situ Stress Determination Using the Under-excavation Technique--I. TheoryTransféré parRisanto
- ExpressionsTransféré parshemse
- 2013.fall.prelim1.pdfTransféré parSam Kumagai
- data1Transféré parthebhas1954
- cs inTransféré pariftiniazi
- GATE2018 CS ScheduleTransféré pargollapudi vamsikrishna
- MISCELLANEOUS RESULTS AND THEOREMS ON SMARANDACHE TERMS AND FACTOR PARTITIONSTransféré parMia Amalia
- A Short Note on Matlab UsageTransféré parhlvijaykumar
- mat_defiTransféré parJutt Gee
- 02E-ProbSetExam2-2Transféré parAria Carino
- CHE654 2012 Homework1 SolutionsTransféré parseralvaga
- List of C Programming LoopingTransféré parSreenu Royal's
- Generalized PID Observer Design for Descriptor Linear Systems.pdfTransféré pargazzamnounou

- 64 Interview QuestionsTransféré parshivakumar N
- Class 10 MathsTransféré parAndrea Bates
- HistoryTransféré parAndrea Bates
- i x Questions PcmTransféré parAndrea Bates
- Racism PrejudiceTransféré parAndrea Bates
- The Interview - Christopher Silvester, Umberto EcoTransféré parRubina Fatima
- 002044Transféré parAndrea Bates
- 25FormulaFactory NewTransféré parAndrea Bates
- MachTransféré parAndrea Bates
- English Grammar Analysis Understanding the BasicsTransféré parMohammad Alzein
- Poem 05 FinalTransféré parAndrea Bates
- Indesign CS3 TutorialTransféré parDanang Istiaji
- IDCS3 ShortcutsTransféré parAndrea Bates
- Fever in ChildrenTransféré parAndrea Bates
- Data StructuresTransféré parAndrea Bates
- american lit timeline.pdfTransféré parAndrea Bates
- 06-Crisp-Ch-06Transféré parAndrea Bates
- Introduction Indian PhilosophyTransféré pardvdmega
- Trial Balance Errors & Rectification 1Transféré parManju Nath
- Financial Management KangraTransféré parluckylibra
- Boldor ThesisTransféré parNoha Labani
- Civil Engineering (Geotechnical).pdfTransféré parAndrea Bates
- 3.4 HVAC - Revised (Table Format)Transféré parrmtech
- XI & XII - MathematicsTransféré parAndrea Bates
- ENG PHY II- REG 2013Transféré parAndrea Bates
- The MLL DocumentTransféré parAndrea Bates

- Cambridge Maths 7 Chapter 3Transféré parRughfidd Jfdhbigfhg
- Exponential NotationTransféré parJacqueline Acera Balingit
- GR-6 mathTransféré parGolden Sunrise
- MATLABTransféré parpure14
- Algebra Success in 20 Minutes a DayTransféré pargingerbd69
- CSC301 - 2) Variables Constants & CalculationTransféré parshaz uan
- Excel 2003 - Tutorial 3Transféré parGlenn
- Oracle Operators, Functions, Expressions, ConditionsTransféré parLn Amitav Biswas
- IntroMSExcel2003Project2Transféré parSantosh Vm
- Evaluating Expressions With Exponents PDFTransféré parJill
- Maths Program s3 Yr 6 t1Transféré parS TANCRED
- AlgebraTransféré parBlack Ink Tutorials & Services
- Unparsing Expressions With Prefix and Postfix OperatorsTransféré parfoobarxxx111
- 01 Manual C++Transféré parNooraFukuzawa Nor Fauziah
- FOM Chapter 1Transféré parNantha Kumaran
- RPN TutorialTransféré pareungkyu
- An R Companion to Applied Regression Chapter1Transféré parcharlsandroid01
- Improve Your Math Problem Solving Made Easy.pdfTransféré parfaraazxbox1
- Functions and GraphingTransféré parYuyin Kawaski
- alg1-bookTransféré parr2d2sd
- Dataflow OperatorTransféré parSahil Sharma
- Swan SoftTransféré parandreeaoana45
- DLL_MATHEMATICS 6_Q2_W9.docxTransféré parAstro
- Excel 2007 Formulas FunctionsTransféré paradnan akhter
- ch 5 - EMVIC YR7 3pp-5-web (1)Transféré parHishba
- Sigmaplot 8 Programming GuideTransféré parsharkkingking
- HP 42STransféré parGustavo Amaral
- StacksTransféré parwaqas08
- DISCRETE MATHEMATICSTransféré parKumar shantanu Basak
- Hands-On Python Tutorial - Harrington - Loyola (2011)Transféré parIvaldo Paulino