Lecture 8 CS203. Implementation of Data Structures 2 In the last couple of weeks, we have covered...

46
Lecture 8 CS203

Transcript of Lecture 8 CS203. Implementation of Data Structures 2 In the last couple of weeks, we have covered...

Lecture 8CS203

2Implementation of Data Structures

In the last couple of weeks, we have covered various data structures that are implemented in the Java Collections Framework. As a working programmer, you may often use these, and most of the rest of the time, you will use various libraries that supply alternatives. You will not often have to implement the data structures yourself.

However, in order to understand how these structures work at a lower level, we will cover implementation this week.

3

A list stores data in sequential order. For example, a list of students, a list of available rooms, a list of cities, and a list of books, etc. can be stored using lists. The common operations on a list are usually the following: ·         Retrieve an element from this list.·         Insert a new element to this list.·         Delete an element from this list.·         Find how many elements are in this list.·         Find if an element is in this list.·         Find if this list is empty.

Lists

4

There are two common ways to implement a list in Java.

Using arrays. One is to use an array to store the elements. The array is dynamically created. If the capacity of the array is exceeded, create a new larger array and copy all the elements from the current array to the new array.

Using a linked list. The other approach is to use a linked structure. A linked structure consists of nodes. Each node is dynamically created to hold an element. All the nodes are linked together to form a list.

Two Ways to Implement Lists

5

Sample Code is linked from course page

Two Ways to Implement Lists

6

For convenience, let’s name these two classes: MyArrayList and MyLinkedList. These two classes have common operations, but different data fields. The common operations can be generalized in an interface or an abstract class. A good strategy is to combine the virtues of interfaces and abstract classes by providing both interface and abstract class in the design so the user can use either the interface or the abstract class whichever is convenient. Such an abstract class is known as a convenience class.

ArrayList and LinkedList

MyList MyAbstractList

MyArrayList

MyLinkedList

7MyList Interface and MyAbstractList Class

MyList

MyAbstractList

«interface» MyList<E>

+add(e: E) : void

+add(index: int, e: E) : void

+clear(): void

+contains(e: E): boolean

+get(index: int) : E

+indexOf(e: E) : int

+isEmpty(): boolean

+lastIndexOf(e: E) : int

+remove(e: E): boolean

+size(): int

+remove(index: int) : E

+set(index: int, e: E) : E

Appends a new element at the end of this list.

Adds a new element at the specified index in this list.

Removes all the elements from this list.

Returns true if this list contains the element.

Returns the element from this list at the specified index.

Returns the index of the first matching element in this list.

Returns true if this list contains no elements.

Returns the index of the last matching element in this list.

Removes the element from this list.

Returns the number of elements in this list.

Removes the element at the specified index and returns the removed element.

Sets the element at the specified index and returns the element you are replacing.

MyAbstractList<E>

#size: int

#MyAbstractList()

#MyAbstractList(objects: E[])

+add(e: E) : void

+isEmpty(): boolean

+size(): int

+remove(e: E): boolean

The size of the list.

Creates a default list.

Creates a list from an array of objects.

Implements the add method.

Implements the isEmpty method.

Implements the size method.

Implements the remove method.

8

Once an array is created, its size cannot be changed. Nevertheless, you can still use array to implement dynamic data structures. The trick is to create a new larger array to replace the current array if the current array cannot hold new elements in the list.

Initially, an array, say data of Object[] type, is created with a default size. When inserting a new element into the array, first ensure there is enough room in the array. If not, create a new array twice the size of the current one. Copy the elements from the current array to the new array. The new array now becomes the current array.

Array List

9

Before inserting a new element at a specified index, shift all the elements after the index to the right and increase the list size by 1.

Insertion

e0

0 1 … i i+1 k-1 Before inserting e at insertion point i

e1 … ei ei+1

… ek-1

data.length -1 Insertion point e

e0

0 1 … i i+1 After inserting e at insertion point i, list size is incremented by 1

e1 … e ei

… ek-1

data.length -1 e inserted here

ek

ek

k

ei-1

ei-1

k+1 k

ei+1

i+2

…shift…

10

To remove an element at a specified index, shift all the elements after the index to the left by one position and decrease the list size by 1.

Deletion

e0

0 1 … i i+1 k-1 Before deleting the element at index i

e1 … ei ei+1

… ek-1

data.length -1 Delete this element

e0

0 1 … i After deleting the element, list size is decremented by 1

e1 …

… ek

data.length -1

ek

k

ei-1

ei-1

k-1

ei+1

k-2

ek-1

…shift…

11Implementing MyArrayList

MyArrayList<E>

-data: E[]

+MyArrayList()

+MyArrayList(objects: E[])

-ensureCapacity(): void

MyAbstractList<E>

Creates a default array list.

Creates an array list from an array of objects.

Doubles the current array size if needed.

12

Since MyArrayList is implemented using an array, the methods get(int index) and set(int index, Object o) for accessing and modifying an element through an index and the add(Object o) for adding an element at the end of the list are efficient.

However, the methods add(int index, Object o) and remove(int index) are inefficient because it requires shifting potentially a large number of elements. You can use a linked structure to implement a list to improve efficiency for adding and removing an element anywhere in a list.

Linked Lists

13

A linked list consists of nodes. Each node contains an element, and each node is linked to its next neighbor. Thus a node can be defined as a class, as follows:

Nodes in Linked Lists

class Node<E> { E element; Node next;  public Node(E o) { element = o; }}

element

head

next

Node 1

element

next

Node 2 …

element

null

Node n

tail

14

The variable head refers to the first node in the list, and the variable tail refers to the last node in the list. If the list is empty, both are null. For example, you can create three nodes to store three strings in a list, as follows: 

Adding Three Nodes

Step 1: Declare head and tail:

The list is empty now Node<String> head = null;

Node<String> tail = null;

15Adding Three Nodes, cont.

Step 2: Create the first node and insert it to the list:

head "Chicago" next: null

tail

After the first node is inserted inserted

head = new Node<String>("Chicago"); tail = head;

16Adding Three Nodes, cont.

Step 3: Create the second node and insert it to the list:

tail.next = new Node<String>("Denver");

head "Chicago" next

"Denver" next: null

tail

tail = tail.next;

head "Chicago" next

"Denver" next: null

tail

17Adding Three Nodes, cont.

Step 4: Create the third node and insert it to the list:

head "Chicago" next

"Dallas" next: null

tail

"Denver" next

tail.next = new Node<String>("Dallas");

head "Chicago" next

"Dallas" next: null

tail

"Denver" next

tail = tail.next;

18Traversing All Elements in the List

Each node contains the element and a data field named next that points to the next element. If the node is the last in the list, its pointer data field next contains the value null. You can use this property to detect the last node. For example, you may write the following loop to traverse all the nodes in the list.

Node<E> current = head;

while (current != null) {

System.out.println(current.element);

current = current.next;

}

19MyLinkedList

MyLinkedList

MyLinkedList<E>

-head: Node<E>

-tail: Node<E>

+MyLinkedList()

+MyLinkedList(objects: E[])

+addFirst(e: E): void

+addLast(e: E): void

+getFirst(): E

+getLast(): E

+removeFirst(): E

+removeLast(): E

1

m Node<E>

element: E next: Node<E>

Link

1

MyAbstractList<E>

Creates a default linked list.

Creates a linked list from an array of objects.

Adds the object to the head of the list.

Adds the object to the tail of the list.

Returns the first object in the list.

Returns the last object in the list.

Removes the first object from the list.

Removes the last object from the list.

RunTestMyLinkedList

20

public void addFirst(E o) { Node<E> newNode = new Node<E>(o); newNode.next = head; head = newNode; size++; if (tail == null) tail = head;}

Implementing addFirst(E o)

head

e0

next

A new node to be inserted here

ei

next

ei+1

next

tail

… ek

null

element

next

New node inserted here

(a) Before a new node is inserted.

(b) After a new node is inserted.

e0

next

… ei

next

ei+1

next

tail

… ek

null

element

next

head

21

public void addLast(E o) { if (tail == null) { head = tail = new Node<E>(element); } else { tail.next = new Node(element); tail = tail.next; } size++;}

Implementing addLast(E o)

head

e0

next

… ei

next

ei+1

next

tail

… ek

null

o

null

New node inserted here

(a) Before a new node is inserted.

(b) After a new node is inserted.

head

e0

next

… ei

next

ei+1

next

tail

… ek

next

A new node to be inserted here

o

null

22

public void add(int index, E o) { if (index == 0) addFirst(o); else if (index >= size) addLast(o); else { Node<E> current = head; for (int i = 1; i < index; i++) current = current.next; Node<E> temp = current.next; current.next = new Node<E>(o); (current.next).next = temp; size++; }}

Implementing add(int index, E o)

current head

e0

next

A new node to be inserted here

ei

next

temp

ei+1

next

tail

… ek

null

e

null (a) Before a new node is inserted.

current head

e0

next

A new node is inserted in the list

ei

next

temp

ei+1

next

tail

… ek

null

e

next (b) After a new node is inserted.

23

public E removeFirst() { if (size == 0) return null; else { Node<E> temp = head; head = head.next; size--; if (head == null) tail = null; return temp.element; }}

Implementing removeFirst()

head

e0

next

Delete this node

ei

next

ei+1

next

tail

… ek

null

(a) Before the node is deleted.

(b) After the first node is deleted

e1

next

… ei

next

ei+1

next

tail

… ek

null

e1

next

head

24

public E removeLast() { if (size == 0) return null; else if (size == 1) { Node<E> temp = head; head = tail = null; size = 0; return temp.element; } else { Node<E> current = head; for (int i = 0; i < size - 2; i++) current = current.next; Node temp = tail; tail = current; tail.next = null; size--; return temp.element; }}

Implementing removeLast()

head

e0

next

Delete this node

ek-2

next

ek-1

next

tail

ek

null

(a) Before the node is deleted.

(b) After the last node is deleted

e1

next

current

head

e0

next

… ek-2

next

ek-1

null

e1

next

tail

25

public E remove(int index) { if (index < 0 || index >= size) return null; else if (index == 0) return removeFirst(); else if (index == size - 1) return removeLast(); else { Node<E> previous = head; for (int i = 1; i < index; i++) { previous = previous.next; } Node<E> current = previous.next; previous.next = current.next; size--; return current.element; }}

Implementing remove(int index)

previous head

element

next

Node to be deleted

element

next

element

next

tail

… element

null

element

next

(a) Before the node is deleted.

current

previous head

element

next

… element

next

element

next

tail

… element

null

(b) After the node is deleted.

current.next

current.next

26Time Complexity for ArrayList and LinkedList

Methods MyArrayList/ArrayList MyLinkedList/LinkedList

add(e: E) )1(O )1(O

add(index: int, e: E) )(nO )(nO

clear() )1(O )1(O

contains(e: E) )(nO )(nO

get(index: int) )1(O )(nO

indexOf(e: E) )(nO )(nO

isEmpty() )1(O )1(O

lastIndexOf(e: E) )(nO )(nO

remove(e: E) )(nO )(nO

size() )1(O )1(O

remove(index: int) )(nO )(nO

set(index: int, e: E) )(nO )(nO

addFirst(e: E) )(nO )1(O

removeFirst() )(nO )1(O

27

A circular, singly linked list is like a singly linked list, except that the pointer of the last node points back to the first node.

Example uses: playing video/audio files repeatedly ALT + TAB in Windows.

Circular Linked Lists

element

head

next

Node 1

element

next

Node 2 …

element

next

Node n

tail

28

A doubly linked list contains the nodes with two pointers. One points to the next node and the other points to the previous node. These two pointers are conveniently called a forward pointer and a backward pointer. So, a doubly linked list can be traversed forward and backward.

Doubly Linked Lists

element

head

next

Node 1

element

next

Node 2 …

element

null

Node n

tail

null previous previous

29

A circular, doubly linked list is doubly linked list, except that the forward pointer of the last node points to the first node and the backward pointer of the first pointer points to the last node.

Circular Doubly Linked Lists

element

head

next

Node 1

element

next

Node 2 …

element

next

Node n

tail

previous previous previous

30

A stack can be viewed as a special type of list, where the elements are accessed, inserted, and deleted only from the end, called the top, of the stack.

Stacks

Data1 Data2

Data1 Data1 Data2 Data3

Data1 Data2 Data3

Data1 Data2

Data3

Data1

Data2 Data1

31

A queue represents a waiting list. A queue can be viewed as a special type of list, where the elements are inserted into the end (tail) of the queue, and are accessed and deleted from the beginning (head) of the queue.

Queues

Data1 Data2

Data1 Data1 Data2 Data3

Data1 Data2 Data3

Data2 Data3

Data1

Data3

Data2 Data3

32

Using an array list to implement StackUse a linked list to implement Queue

Since the insertion and deletion operations on a stack are made only at the end of the stack, using an array list to implement a stack is more efficient than a linked list. Since deletions are made at the beginning of the list, it is more efficient to implement a queue using a linked list than an array list. This section implements a stack class using an array list and a queue using a linked list.

Implementing Stacks and Queues

33

There are two ways to design the stack and queue classes: Using inheritance: You can define the stack class by extending the

array list class, and the queue class by extending the linked list class.

Design of the Stack and Queue Classes

– Using composition: You can define an array list as a data field in the stack class, and a linked list as a data field in the queue class.

ArrayList GenericStack LinkedList GenericQueue

GenericStack

ArrayList GenericQueue LinkedList

34

Both designs are fine, but using composition is better because it enables you to define a complete new stack class and queue class without inheriting the unnecessary and inappropriate methods from the array list and linked list.

Composition is Better

35MyStack and MyQueue

GenericStack<E> -list: java.util.ArrayList<E>

+GenericStack()

+getSize(): int

+peek(): E

+pop(): E

+push(o: E): void

+isEmpty(): boolean

Creates an empty stack.

Returns the number of elements in this stack.

Returns the top element in this stack.

Returns and removes the top element in this stack.

Adds a new element to the top of this stack.

Returns true if the stack is empty.

An array list to store elements.

GenericQueue<E>

-list: MyLinkedList<E>

+enqueue(e: E): void

+dequeue(): E

+getSize(): int

Adds an element to this queue.

Removes an element from this queue.

Returns the number of elements from this queue.

GenericStack

GenericQueue

36

A regular queue is a first-in and first-out data structure. Elements are appended to the end of the queue and are removed from the beginning of the queue. In a priority queue, elements are assigned with priorities. When accessing elements, the element with the highest priority is removed first. A priority queue has a largest-in, first-out behavior. For example, the emergency room in a hospital assigns patients with priority numbers; the patient with the highest priority is treated first.

Priority Queue

MyPriorityQueue<E>

-heap: Heap<E>

+enqueue(element: E): void

+dequeue(): E

+getSize(): int

Adds an element to this queue.

Removes an element from this queue.

Returns the number of elements from this queue.

37

The term "Heap" is used in several different ways. The Priority Queue example code uses a heap that adheres to this definition:

A heap is a binary tree (one in which each node has at most two children) that with these two properties:Order property: If A is a parent node of B then the key of node A is ordered with respect to the key of node B with the same ordering applying across the heap. Either the keys of parent nodes are always greater than or equal to those of the children and the highest key is in the root node (max heap) or the keys of parent nodes are less than or equal to those of the children and the lowest key is in the root node (min heap).Shape property:1- All leaves are either at depth d or d-1 (for some value d).

2- All of the leaves at depth d-1 are to the right of the leaves at depth d.

3- (a) There is at most 1 node with just 1 child. (b) Any such child is the left child of its parent, and (c) it is the rightmost leaf at depth d.

Heap

38Heap

39Representing a Heap

The tree is represented by values in a list or array.

For a node at position i, its left child is at position 2i+1 and its right child is at position 2i+2, and its parent is (i-1)/2. For example, the node for element 39 is at position 4, so its left child (element 14) is at 9 (2*4+1), its right child (element 33) is at 10 (2*4+2), and its parent (element 42) is at 1 ((4-1)/2).

22 29 14 33 30 17 9

32 39 44 13

42 59

62 62 42 59 32 39 44 13 22 29 14 33 30 17 9

[0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10][11][12][13] [10][11]

parent left

right

40Adding Elements to the Heap

Adding 3, 5, 1, 19, 11, and 22 to a heap, initially empty

3

(a) After adding 3 (b) After adding 5 (c) After adding 1

(d) After adding 19

3

19

5 1

(e) After adding 11

3 5

19

11 1

(f) After adding 22

3 5 1

22

11 19

5

3

5

3 1

41Rebuild the heap after adding a new node

Adding 88 to the heap

(a) Add 88 to a heap

3 5 1 88

22

11 19

(b) After swapping 88 with 19

3 5 1 19

22

11 88

(b) After swapping 88 with 22

3 5 1 19

88

11 22

42Removing the Root and Rebuild the Tree

22 29 14 33 30 17 9

32 39 44 13

42 59

62

Removing root 62 from the heap

43Removing the Root and Rebuild the Tree

22 29 14 33 30 17

32 39 44 13

42 59

9

Move 9 to root

44Removing the Root and Rebuild the Tree

22 29 14 33 30 17

32 39 44 13

42 9

59

Swap 9 with 59

45Removing the Root and Rebuild the Tree

22 29 14 33 30 17

32 39 9 13

42 44

59

Swap 9 with 44

46Removing the Root and Rebuild the Tree

22 29 14 33 9 17

32 39 30 13

42 44

59

Swap 9 with 30