程序代写代做代考 algorithm AVL COMP90038 Algorithms and Complexity

COMP90038 Algorithms and Complexity

COMP90038
Algorithms and Complexity

Lecture 16: Time/Space Tradeoffs – String Search Revisited

(with thanks to Harald Søndergaard & Michael Kirley)

Andres Munoz-Acosta

munoz.m@unimelb.edu.au

Peter Hall Building G.83

mailto:munoz.m@unimelb.edu.au

Recap

• BST have optimal performance when they are balanced.

• AVL Trees:
• Self-balancing trees for which the balance factor is -1, 0, or 1, for every sub-tree.
• Rebalancing is achieved through rotations.
• It guarantees depth of a tree with n nodes to be Θ(log n)

• 2–3 trees:
• Trees that allow more than one item to be stored in a tree node.
• This allows for a simple way of keeping search trees perfectly balanced.
• Insertions, splits and promotions are used to grow and balance the tree.

AVL Trees: R-Rotation

AVL Trees: L-Rotation

AVL Trees: LR-Rotation

AVL Trees: RL-Rotation

Example

• On the tree below, insert the elements {4, 13, 12}

• https://www.cs.usfca.edu/~galles/visualization/AVLtree.html

-1

0 0

1 0

2 0

3 2

0 0

0 1

Example: Build a 2–3 Tree from
{9, 5, 8, 3, 2, 4, 7}

2–3 Tree Analysis

• Worst case search time results when all nodes are 2-nodes. The relation between the
number n of nodes and the height h is:

n = 1 + 2 + 4 + … + 2h = 2h+1 -1

• That is, log2(n+1) = h+1.

• In the best case, all nodes are 3-nodes:

n = 2 + 23 + 232 + … + 23h = 3h+1 -1

• That is, log3(n+1) = h+1.

• Hence we have log3(n+1) – 1 ≤ h ≤ log2(n+1) – 1.

• Useful formula:

Spending Space to Save Time

• Often we can find ways of decreasing the time required to solve a problem, by using
additional memory in a clever way.

• For example, in Lecture 6 (Recursion) we considered the simple recursive way of finding
the n-th Fibonacci number and discovered that the algorithm uses exponential time.

Spending Space to Save Time

• However, suppose the same algorithm uses a table to tabulate the
function FIB() as we go: As soon as an intermediate result FIB(i) has
been found, it is not simply returned to the caller; the value is first
placed in slot i of a table (an array). Each call to FIB() first looks in this
table to see if the required value is there, and only if it is not, the
usual recursive process kicks in.

Fibonacci Numbers with Tabulation

• We assume that, from the outset, all entries of the table F are 0.

• (I show this code just so that you can see the principle; in Lecture 6 we
already discovered a different linear-time algorithm, so here we don’t
really need tabulation.)

Sorting by Counting

• Suppose we need to sort large arrays, but we know that they will hold keys taken from a small, fixed set (so
lots of duplicate keys).

• For example, suppose all keys are single digits:

• Then we can, in a single linear scan, count the occurrences of each key in array A and store the result in a
small table:

• Now use a second linear scan to make the counts cumulative:

•

Sorting by Counting

• We can now create a sorted array S[1]…S[n] of the items by simply slotting items
into pre-determined slots in S (a third linear scan).

• Place the last record (with key 3) in S[12] and decrement Occ[3] (so that the next
`3′ will go into slot 11), and so on.

•

Sorting by Counting

• Note that this gives us a linear-time sorting algorithm (for the cost of some
extra space).

• However, it only works in situations where we have a small range of keys,
known in advance.

• The method never performs a key-to-key comparison.

• The time complexity of key-comparison based sorting has been proven to
be in Ω(n log n).

String Matching Revisited

• In Lecture 5 (Brute Force Methods) we studied an approach to string search.

String Matching Revisited

• “Strings” are usually built from a small, pre-determined alphabet.

• Most of the better algorithms rely on some pre-processing of strings
before the actual matching process starts.

• The pre-processing involves the construction of a small table (of
predictable size).

• Levitin refers to this as “input enhancement”.