Problem 1

Part a

For $0 ⩽ i ⩽ m$ and $0 ⩽ j ⩽ n$, define

$$DP[i][j] = \text{length of the shortest common supersequence (SCS) of } A[1..i] \text{ and } B[1..j],$$

where substrings defined by invalid ranges are considered to be the empty string.

The natural recurrence is

$$DP[i][j] = \begin{cases} i & j = 0 \\ j & i = 0 \\ DP[i-1][j-1] + 1 & A[i] = B[j],\ i \neq 0,\ j \neq 0 \\ 1 + \min\{DP[i][j-1],\ DP[i-1][j]\} & \text{otherwise.} \end{cases}$$

Proof.

The cases for $i = 0$ and $j = 0$ are clear. Suppose $S$ is an SCS of $A[1..i]$ and $B[1..j]$, and suppose $S$ has length $l$. If $A[i] = B[j] =: c$, then $S[l]$ must be $c$. $S[1..l-1]$ must be an SCS of $A[1..i-1]$ and $B[1..j-1]$; if it weren't, we would be able to construct a shorter common supersequence of $A[1..i]$ and $B[1..j]$ by the usual cut-and-paste argument. Thus, $DP[i][j] = DP[i-1][j-1] + 1$. If $A[i] \neq B[j]$, then $S[l]$ must be either $A[i]$ or $B[j]$, and the same argument shows that $S[1..l-1]$ is an SCS of $A[1..i-1]$ and $B[1..j]$, or of $A[1..i]$ and $B[1..j-1]$, respectively. □

    SCS(A[1..m], B[1..n]):
        DP ← newArray(0..m, 0..n)
        for i ∈ [0..m]:
            for j ∈ [0..n]:
                if i = 0:
                    DP[i][j] ← j
                else if j = 0:
                    DP[i][j] ← i
                else if A[i] = B[j]:
                    DP[i][j] ← DP[i - 1][j - 1] + 1
                else:
                    DP[i][j] ← 1 + min{DP[i][j - 1], DP[i - 1][j]}
        return DP[m][n]

The algorithm runs in $O (mn)$ time.
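As an illustration, the recurrence above can be sketched in Python. The function name `scs_length` and the use of 0-indexed strings are assumptions made for this example; the table itself follows the $DP$ definition.

```python
def scs_length(a: str, b: str) -> int:
    """Length of a shortest common supersequence of a and b."""
    m, n = len(a), len(b)
    # dp[i][j] = SCS length of a[:i] and b[:j]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        for j in range(n + 1):
            if i == 0:
                dp[i][j] = j
            elif j == 0:
                dp[i][j] = i
            elif a[i - 1] == b[j - 1]:
                # Matching last characters are shared by the supersequence.
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                dp[i][j] = 1 + min(dp[i][j - 1], dp[i - 1][j])
    return dp[m][n]
```

For instance, `scs_length("abac", "cab")` evaluates to 5, matching the supersequence `cabac`.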

Part b

The problem of finding the longest bitonic subsequence $(LBS)$ can be reduced to the problem of finding the longest increasing subsequence $(LIS)$ :

$$LBS(X[1..n]) = \max_{2 \leq i \leq n-2} \{ LIS(X[1..i]) + LIS(reverse(X[i+1..n])) \}.$$

Below, we implement an algorithm for $LIS$ which runs in $O(n \log n)$ time.

$pos(B, a)$ returns the largest index $i$ of $B$ such that $B[i] < a$, or $0$ if no such index exists. Since $B$ is kept sorted, $pos$ can be computed by binary search.

    LIS(A[1..m]):
        B ← []
        for a ∈ A[1..m]:
            p ← pos(B, a)
            if p = B.length:
                B.append(a)
            else:
                B[p + 1] ← a
        return B.length

Claim.

In the course of its execution, if the algorithm places $A [i]$ in position $b$ of $B$ , there exists an increasing subsequence of $A [1.. i]$ of length $b$ ending in $A [i]$ .

Proof.

This is clear for $b = 1$. Suppose the claim holds for some $b \geq 1$, and suppose the algorithm places $A[i]$ in position $b + 1$. There exists $k < i$ such that $A[k] < A[i]$ and the algorithm placed $A[k]$ in position $b$. By the induction hypothesis, there exists an increasing subsequence of length $b$ ending in $A[k]$; appending $A[i]$ gives one of length $b + 1$. □

It is also clear that if $A[i_{1}], A[i_{2}], \dots, A[i_{k}]$ is any increasing subsequence of $A$, the length of $B$ at the end of execution will be greater than or equal to $k$. Thus, it follows that $B.length$ is the length of the longest increasing subsequence.
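A minimal Python sketch of the $LIS$ routine, assuming strictly increasing subsequences. Here `bisect_left` from the standard library plays the role of $pos$: it returns the number of entries of the sorted list that are strictly smaller than the query. The names `lis_length` and `tails` are illustrative.

```python
from bisect import bisect_left


def lis_length(a):
    """Length of a longest strictly increasing subsequence of a."""
    tails = []  # tails[p] = smallest last element of an increasing run of length p + 1
    for x in a:
        p = bisect_left(tails, x)  # count of entries strictly less than x
        if p == len(tails):
            tails.append(x)        # x extends the longest run found so far
        else:
            tails[p] = x           # x is a smaller tail for runs of length p + 1
    return len(tails)
```

For example, `lis_length([3, 1, 4, 1, 5, 9, 2, 6])` returns 4.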

Calling this $LIS$ routine on every prefix and every reversed suffix, $LBS$ runs in $O(n^{2} \log n)$ time.

    LBS(X[1..n]):
        A, B ← []
        for i ∈ [2..n - 2]:
            A[i] ← LIS(X[1..i])
            B[i] ← LIS(reverse(X[n - i + 1..n]))
        ans ← 0
        for i ∈ [2..n - 2]:
            ans ← max(ans, A[i] + B[n - i])
        return ans

Part c

    LOS(X[1..n]):
        up, down ← 1
        for i ∈ [2..n]:
            if X[i] > X[i - 1]:
                up ← down + 1
            if X[i] < X[i - 1]:
                down ← up + 1
        return max(up, down)

It is clear that there exists an alternating subsequence of $X [1.. n]$ of length $LOS (X [1.. n])$ . Since $LOS$ counts the number of oscillations, a longer oscillating subsequence cannot exist. The algorithm runs in $O (n)$ time.
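The single pass above can be sketched in Python as follows. This assumes that "oscillating" means consecutive differences alternate in sign; the name `los_length` and the handling of the empty input are choices made for this example.

```python
def los_length(x):
    """Length of a longest subsequence whose consecutive differences alternate in sign."""
    if not x:
        return 0
    up = down = 1  # best lengths ending with a rise / with a fall
    for prev, cur in zip(x, x[1:]):
        if cur > prev:
            up = down + 1
        elif cur < prev:
            down = up + 1
    return max(up, down)
```

For example, `los_length([1, 7, 4, 9, 2, 5])` returns 6, since the whole sequence already alternates.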

Part d

A sequence $(X_{i_{1}}, X_{i_{2}}, \dots, X_{i_{k}})$ is convex iff its consecutive differences are strictly increasing:

$$Δ_{t} := X_{i_{t+1}} - X_{i_{t}} \quad \text{and} \quad Δ_{1} < Δ_{2} < \dots < Δ_{k-1}.$$

So we must find the longest subsequence of indices whose consecutive differences are strictly increasing.

Define $D [a] [b] = X [b] - X [a]$ for $(a < b)$ and $D [a] [a] = - \infty$ . Let

$$DP[a][b] = \text{length of the longest convex subsequence whose last two indices are } a < b.$$

Then the natural recurrence is

$$DP[a][a] = 1 \quad \text{(base)}, \qquad DP[a][b] = 1 + \max_{k \leq a,\ D[k][a] < D[a][b]} DP[k][a] \quad (a < b).$$

The algorithm

$zip$ accepts two arrays $A$ and $B$ of size $n$ and returns an array $C$ of size $n$ such that $C [i] = (A [i], B [i])$ . $SortBySlot2$ accepts the output of $zip$ and sorts it such that the values in the second slot are increasing. $pos (F, c)$ returns the maximum index $i$ in $F$ such that $F [i] [1] < c$ using binary search.

    ConvSubseq(X[1..n]):
        D ← newArray(n, n)
        for i ∈ [1..n - 1]:
            for j ∈ [i + 1..n]:
                D[i, j] ← X[j] - X[i]
        for i ∈ [1..n]:
            D[i, i] ← -∞
        DP ← newArray(n, n)
        for i ∈ [1..n]:
            DP[i, i] ← 1
        for i ∈ [1..n - 1]:
            F ← SortBySlot2(zip(DP[1..i, i], D[1..i, i]))
            for j ∈ [2..i]:
                F[j][0] ← max(F[j][0], F[j - 1][0])
            for j ∈ [i + 1..n]:
                DP[i][j] ← F[pos(F, D[i][j])][0] + 1
        return max(DP)

Correctness

Claim.

After the outer loop has processed all indices $a^{'} \leq a$, for every $k < a$ the table entry $DP[k, a]$ equals the length of the longest convex subsequence whose last two indices are $k < a$ (also, $DP[a, a] = 1$).

Base case. We initialize $DP[1, 1] = 1$. The inner loop for $a = 1$ constructs $DP[1, j]$ for $j > 1$. The list $F$ contains only the sentinel pair $(1, -\infty)$, so pos(F, D[1,j]) picks that sentinel and sets $DP[1, j] = 1 + 1 = 2$, which is correct.

Inductive step. Suppose the invariant holds for all $a^{'} < a$. Consider fixed $a$. We form the list $F$ of pairs $(DP[k, a], D[k, a])$ for $k = 1, \dots, a$ (including the sentinel $k = a$ with $DP[a, a] = 1$ and $D[a, a] = -\infty$). We sort $F$ in increasing order of the second slot, and take prefix maxima over the first slot, that is, we set $F[i][0]$ to be $\max_{1 \leq j \leq i} F[j][0]$. Thus pos(F, D[a,b]) finds the maximal $DP[k, a]$ among indices $k \leq a$ with $D[k, a] < D[a, b]$. The assignment

$$DP[a][b] = F[pos(F, D[a][b])][0] + 1$$

for each $b > a$ exactly implements the recurrence stated above. Therefore after processing index $a$, the invariant holds for all pairs whose first index is at most $a$.

Complexity

Precomputing differences: $O(n^{2})$.
For each $a \in [1..n]$ we sort $a$ pairs in $O(a \log a)$, totaling $O(n^{2} \log n)$.
For each $a$ we then answer $n - a$ queries pos by binary search on $F$, with an overall complexity of $O(n^{2} \log n)$.

Thus, the algorithm runs in $O(n^{2} \log n)$.
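The sort, prefix-maximum, and binary-search steps can be sketched in Python as below. This is an illustrative sketch, not a transcription of the pseudocode: it uses 0-indexed arrays, stores each pair as (difference, length) so that the default tuple ordering sorts by difference, and uses `bisect_left` in place of $pos$. The function name is an assumption.

```python
from bisect import bisect_left


def longest_convex_subsequence(x):
    """Length of a longest subsequence with strictly increasing consecutive differences."""
    n = len(x)
    if n <= 2:
        return n
    neg_inf = float("-inf")
    # dp[k][a] = best length of a convex subsequence whose last two indices are k < a
    dp = [[0] * n for _ in range(n)]
    best = 1
    for a in range(n - 1):
        # Pairs (D[k][a], DP[k][a]) for k < a, plus the sentinel (-inf, 1) for k = a.
        pairs = sorted([(x[a] - x[k], dp[k][a]) for k in range(a)] + [(neg_inf, 1)])
        diffs = [d for d, _ in pairs]
        prefix_max, running = [], 0
        for _, length in pairs:
            running = max(running, length)
            prefix_max.append(running)
        for b in range(a + 1, n):
            # p = number of stored differences strictly below D[a][b]; p >= 1 by the sentinel.
            p = bisect_left(diffs, x[b] - x[a])
            dp[a][b] = prefix_max[p - 1] + 1
            best = max(best, dp[a][b])
    return best
```

For example, `longest_convex_subsequence([1, 2, 4, 7])` returns 4 (differences 1, 2, 3), while `[1, 2, 3, 4]` gives 3.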

Problem 2

Let P be a set of n points evenly distributed on the unit circle, and let S be a set of m line segments with endpoints in P. The endpoints of the m segments are not necessarily distinct; n could be significantly smaller than 2m.

(a) Describe an algorithm to find the size of the largest subset of segments in S such that every pair is disjoint. Two segments are disjoint if they do not intersect even at their endpoints.

(b) Describe an algorithm to find the size of the largest subset of segments in S such that every pair is interior-disjoint. Two segments are interior-disjoint if their intersection is either empty or an endpoint of both segments.

(c) Describe an algorithm to find the size of the largest subset of segments in S such that every pair intersects.

(d) Describe an algorithm to find the size of the largest subset of segments in S such that every pair crosses. Two segments cross if they intersect but not at their endpoints.

Let $P = [1.. n]$ be the list of points and $S [1.. m]$ be the list of edges. Represent edges by objects with properties $left$ , $right$ , and $id$ , where $e . left, e . right \in P$ , $e . left < e . right$ , and $e . id \in [1.. m]$ for all edges $e \in S$ .

Part a

Define

$$DP[i][j] = \text{size of the largest set of pairwise disjoint edges in } S \text{ with both endpoints in } [i..j]$$

for $1 ⩽ i < j ⩽ n$ , with $D P [i] [i] = 0$ .

The recurrence is (any out of bounds access returns 0)

$$DP[i][j] = 0 \quad (j \leq i),$$

$$DP[i][j] = \max\Big\{ DP[i][j-1],\ 1 + \max_{e \in S,\ e.right = j,\ i \leq e.left \leq j-1} \big( DP[i, e.left - 1] + DP[e.left + 1, j - 1] \big) \Big\}.$$

The edges are preprocessed to obtain a list $E$ , where $E [j]$ is a list of all edges with $e . right = j$ . This can be done in $O (m)$ time.

    MaxIndSet(P, S, E):
        DP ← newArray(n, n)
        DP[i][j] ← 0 for j ⩽ i
        for l ∈ [1..n - 1]:
            for i ∈ [1..n - l]:
                j ← i + l
                ans ← DP[i][j - 1]
                for e ∈ E[j]:                                  (*)
                    if i ⩽ e.left ⩽ j - 1:
                        ans ← max(ans, 1 + DP[i, e.left - 1] + DP[e.left + 1, j - 1])
                DP[i][j] ← ans
        return DP[1][n]

Edge $e$ is accessed only during the computation of entries $DP[i][j]$ with $j = e.right$, that is, at most $n$ times. Thus, the for loop $(*)$ takes $O(mn)$ time overall. Filling in the rest of the DP array takes $O(n^{2})$ time, so the total running time is $O(n^{2} + mn)$, which is $O(mn)$ when $m ⩾ n$.
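A Python sketch of this interval DP, under the assumption that the points are labelled $1, \dots, n$ in circular order and that each segment is given as a pair of distinct labels. The name `max_disjoint_segments` and the padded table (so that out-of-range reads return 0) are choices made for this example.

```python
def max_disjoint_segments(n, segments):
    """Size of a largest set of pairwise disjoint segments on points 1..n."""
    # by_right[j] lists the left endpoints of segments whose right endpoint is j.
    by_right = [[] for _ in range(n + 1)]
    for u, v in segments:
        by_right[max(u, v)].append(min(u, v))
    # dp[i][j] for 1 <= i <= j <= n; entries with j < i stay 0 (empty range).
    dp = [[0] * (n + 2) for _ in range(n + 2)]
    for length in range(1, n):
        for i in range(1, n - length + 1):
            j = i + length
            best = dp[i][j - 1]  # point j is not used
            for left in by_right[j]:
                if i <= left <= j - 1:
                    # Use segment (left, j); the rest splits into two independent ranges.
                    best = max(best, 1 + dp[i][left - 1] + dp[left + 1][j - 1])
            dp[i][j] = best
    return dp[1][n]
```

For example, with `n = 6` the nested segments `[(1, 6), (2, 5), (3, 4)]` give 3, while the crossing pair `[(1, 3), (2, 4)]` with `n = 4` gives 1.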

Part b

We only need to slightly modify the definitions from part a. Let $E [i] [j]$ be $1$ if there exists an edge with $e . left = i$ and $e . right = j$ , and $0$ otherwise.

$$DP[i][j] = 0 \quad (j \leq i),$$

$$DP[i][j] = E[i][j] + \max\Big\{ DP[i][j-1],\ \max_{e \in S,\ e.right = j,\ i+1 \leq e.left \leq j-1} \big( DP[i, e.left] + DP[e.left, j] \big) \Big\}.$$

The algorithm is essentially the same as in part a, and runs in $O (mn)$ time.

Part d

Let $E [1.. n - 1]$ be a 2d array such that $E [i]$ is an array containing all edges $e$ such that $e . left = i$ sorted in decreasing order of the property $right$ . Assign $e . id$ to be the index of $e$ in $flatten (E)$ for all $e \in S$ .

Let $N [1.. n - 1]$ be an array such that $N [i]$ is the number of edges $e$ satisfying $e . left ⩽ i$ .

Let $F [2.. n]$ be a 2d array such that $F [i]$ is an array containing all edges $e$ such that $e . right = i$ sorted in decreasing order of the property $left$ .

    MaximumClique(N, F):
        ans ← 0
        for i ∈ [1..n - 1]:
            ans ← max(ans, LIS(map(e → e.id, filter(e → e.id ⩽ N[i], flatten(F[i + 1..n])))))
        return ans

The preprocessing requires $O(m \log m)$ time. $MaximumClique(N, F)$ runs in $O(mn \log m)$ time.
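The following Python sketch mirrors $MaximumClique$ under some simplifying assumptions: points are labelled $1, \dots, n$, segments are pairs of distinct labels, and duplicates are removed first since $S$ is a set. Instead of materialising $E$, $N$, and $F$, it sorts the segments twice and tests $e.left ⩽ i < e.right$ directly, which selects the same edges as the filter $e.id ⩽ N[i]$ on $flatten(F[i+1..n])$. All names are illustrative.

```python
from bisect import bisect_left


def _lis_length(seq):
    """Length of a longest strictly increasing subsequence."""
    tails = []
    for x in seq:
        p = bisect_left(tails, x)
        if p == len(tails):
            tails.append(x)
        else:
            tails[p] = x
    return len(tails)


def max_crossing_segments(n, segments):
    """Size of a largest set of pairwise crossing segments on points 1..n."""
    edges = sorted({(min(u, v), max(u, v)) for u, v in segments})
    # ids: increasing left endpoint, ties broken by decreasing right endpoint.
    order = sorted(range(len(edges)), key=lambda k: (edges[k][0], -edges[k][1]))
    ident = {k: rank for rank, k in enumerate(order)}
    # Scan order: increasing right endpoint, ties broken by decreasing left endpoint.
    by_right = sorted(range(len(edges)), key=lambda k: (edges[k][1], -edges[k][0]))
    best = 0
    for i in range(1, n):
        # Edges that straddle the gap between points i and i + 1.
        ids = [ident[k] for k in by_right if edges[k][0] <= i < edges[k][1]]
        best = max(best, _lis_length(ids))
    return best
```

For example, `max_crossing_segments(6, [(1, 4), (2, 5), (3, 6)])` returns 3, and two segments sharing an endpoint, such as `[(1, 3), (3, 5)]` with `n = 5`, give 1.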

Correctness

We wish to encode the information provided about the circle as an enumeration $e_{1}, \dots, e_{m}$ of the edges and a string $s$ over the alphabet $\{⊢_{i}, ⊣_{i} : i \in [1..m]\}$ such that

Each symbol in the alphabet appears exactly once, and $⊢_{i}$ appears before $⊣_{i}$.
$e_{i}$ and $e_{j}$ with $e_{i}.left ⩽ e_{j}.left$ cross (their interiors intersect and they share no endpoint) iff $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ is a subsequence of $s$.
$e_{i}.left < e_{j}.left$ implies that $⊢_{i}$ appears before $⊢_{j}$ in $s$.
$e_{i}.right < e_{j}.right$ implies that $⊣_{i}$ appears before $⊣_{j}$ in $s$.

Let the enumeration be given by the $id$ property of the edges assigned above. Construct $s$ like so:

    T ← []
    for i ∈ [1..n]:
        temp ← ""
        for e ∈ F[i]:
            temp.append('⊣_{e.id}')
        for e ∈ E[i]:
            temp.append('⊢_{e.id}')
        T.append(temp)
    s ← flatten(T[1..n])

Suppose $e_{i}, e_{j} \in S$, $e_{i}.id = i$, $e_{j}.id = j$, and $e_{i}.left ⩽ e_{j}.left$.

Suppose $e_{i}$ and $e_{j}$ intersect in the interior of the circle. We must have $e_{i} . left < e_{j} . left$ . By the manner in which we assigned the $id$ property, $i < j$ . Also, $e_{i} . right < e_{j} . right$ . Therefore, $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ is a subsequence of $s$ .
Suppose $e_{i}$ and $e_{j}$ do not intersect, even at their endpoints, and that $e_{j}$ starts before $e_{i}$ ends (otherwise $⊣_{i}$ precedes $⊢_{j}$ and the claim is immediate). Then, we have $e_{i}.left < e_{j}.left$ and $e_{i}.right > e_{j}.right$. It follows that $⊢_{i} ⊢_{j} ⊣_{j} ⊣_{i}$ is a subsequence of $s$, and since each symbol appears only once, $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ cannot be a subsequence of $s$.
Suppose $e_{i} . left = e_{j} . left$ and $e_{i} . right > e_{j} . right$ . Since every slot of $E$ is sorted in decreasing order of the property $right$ , $j > i$ . It follows that $⊢_{i} ⊢_{j} ⊣_{j} ⊣_{i}$ is a subsequence of $s$ , and thus $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ cannot be a subsequence of $s$ .
Suppose $e_{i} . right = e_{j} . right$ and $e_{i} . left < e_{j} . left$ . Again, $i < j$ . Since each slot of $F$ is sorted in decreasing order of the property $left$ , it follows that $⊢_{i} ⊢_{j} ⊣_{j} ⊣_{i}$ is a subsequence of $s$ , and thus $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ cannot be a subsequence of $s$ .

We have shown that $⊢_{i} ⊢_{j} ⊣_{i} ⊣_{j}$ is a subsequence of $s$ $⟺$ $e_{i}$ and $e_{j}$ intersect in the interior of the circle. Since $s$ is constructed in such a way that the $⊢_{k}$'s appear in increasing order of the edge $id$s, it suffices to find, for each $i$, the longest increasing subsequence of the $⊣_{k}$'s in the tail $flatten(F[i+1..n])$ restricted to edges with $e_{k}.left ⩽ i$. This is exactly what $MaximumClique$ does.

Part c

We need only make a few changes to the solution of part d.

Let $E [1.. n - 1]$ be a 2d array such that $E [i]$ is an array containing all edges $e$ such that $e . left = i$ sorted in increasing order of the property $right$ . Assign $e . id$ to be the index of $e$ in $flatten (E)$ for all $e \in S$ .

Let $N [1.. n - 1]$ be an array such that $N [i]$ is the number of edges $e$ satisfying $e . left ⩽ i$ .

Let $F [2.. n]$ be a 2d array such that $F [i]$ is an array containing all edges $e$ such that $e . right = i$ sorted in increasing order of the property $left$ .

This makes $MaximumClique$ treat edges which share an endpoint as intersecting.

Now, $MaximumClique(N, F)$ returns the required result in $O(mn \log m)$ time.

Problem 3

We can assume all sequences start with $DBL$ . Note that no shortest sequence will have two consecutive increments, since $DBL \to INC \to INC$ can be replaced by $INC \to DBL$ . Thus, if a shortest sequence has $d$ doublings, it can have at most $d$ increments. Such sequences of $DBL$ and $INC$ can be interpreted as binary numbers, which the following algorithm exploits:

    OptSeq(n):
        B ← binary representation of n, with B[1] being the MSB
        steps ← 0, num ← 1
        for i ∈ [2..B.length]:
            if B[i] = 1:
                num ← 2 * num + 1; steps ← steps + 2
            if B[i] = 0:
                num ← 2 * num; steps ← steps + 1
        return steps

Let $λ(n)$ denote the length of the shortest sequence of increments and doublings which achieves $n$ starting from $1$. It is evident that the algorithm follows a sequence of increments and doublings, and that $num = n$ after the conclusion of the for loop. Thus, $OptSeq(n)$ is an upper bound for $λ(n)$. Suppose the sequence $S_{1}$ discovered by $OptSeq$ uses $d_{1}$ doublings and $i_{1}$ increments. Then, $n ⩾ 2^{d_{1}} + i_{1}$. Suppose $S_{2}$ is the shortest possible sequence, achieving $n$ with $d_{2}$ doublings and $i_{2}$ increments. Suppose $d_{2} < d_{1}$. Since $i_{2} \leq d_{2}$ and no two increments can appear consecutively, we have $n ⩽ 2^{d_{2} + 1} - 1 < 2^{d_{1}}$, a contradiction. If $i_{2} < i_{1}$, we would have an equality of the form

$$2^{α_{1}} + 2^{α_{2}} + \dots + 2^{α_{i_{2}}} = 2^{β_{1}} + 2^{β_{2}} + \dots + 2^{β_{i_{1}}},$$

with $α_{1} < α_{2} < \dots < α_{i_{2}}$, $β_{1} < β_{2} < \dots < β_{i_{1}}$, and $i_{2} < i_{1}$, which is impossible by the uniqueness of binary representations. Thus, $OptSeq(n)$ is optimal.
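A small Python sketch of $OptSeq$, assuming $n ⩾ 1$ and the starting value $1$ used above. The function name `opt_seq` is illustrative. Each bit after the leading one costs one doubling, plus one increment when the bit is 1.

```python
def opt_seq(n: int) -> int:
    """Number of DBL/INC steps used to reach n from 1 by reading n's binary digits."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    bits = bin(n)[2:]  # most significant bit first; bits[0] is always '1'
    steps, num = 0, 1
    for bit in bits[1:]:
        if bit == "1":
            num = 2 * num + 1  # DBL followed by INC
            steps += 2
        else:
            num = 2 * num      # DBL only
            steps += 1
    assert num == n
    return steps
```

For example, `opt_seq(10)` returns 4, corresponding to 1 → 2 → 4 → 5 → 10.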

Problem 4

Part a

Let $S = \{v_{1}, \dots, v_{n}\}$ be the columns of $T$. Let $M$ denote the tuple $(S, I)$. Clearly $\emptyset \in I$. If $A \in I$ and $B \subset A$ then any linear combination of vectors in $B$ may be viewed as if it were in $A$, hence $B$ is linearly independent, i.e. $B \in I$. Further, if $A, B \in I$ with $|A| > |B|$ then $A \not\subseteq span B$, since if that were the case, $span A$ would be spanned by at most $|B|$ vectors, contradicting the independence of $A$. Let $x \in A ∖ span B$. Indeed $B \cup \{x\} \in I$ since $x \notin span B$ and $B$ is linearly independent. Hence $M$ is a matroid.

Part b

Let $M = (S, I)$ and let $M^{'} = (S, I^{'})$ . Clearly $\emptyset \in I^{'}$ since $\emptyset^{c} = S$ must contain some maximal independent set of $M$ . If $A \in I^{'}$ and $B \subset A$ then $A^{c} \subset B^{c}$ and since $A^{c}$ contains a maximal independent set of $M$ , so does $B^{c}$ and hence $B \in I^{'}$ .

Suppose $A, B \in I^{'}$, and $|A| > |B|$. Suppose, for contradiction, that $B \cup \{a\} \notin I^{'}$ for all $a \in A$, that is, $(B \cup \{a\})^{c}$ does not contain a maximal independent set of $M$. It follows that every maximal independent set of $M$ in $B^{c}$ contains $A$. Since $A \in I^{'}$, $A^{c}$ must contain a maximal independent set $C$ of $M$. Since $C$ does not contain $A$, $C$ cannot lie in $B^{c}$, so $C$ must intersect $B$. However, by using the exchange property of $M$, every element $b \in B \cap C$ can be replaced with an element $a \in D$, where $D$ is an independent set in $B^{c}$. Let $E$ be the set obtained from $C$ after all elements of $C \cap B$ have been replaced with elements of $D$. Since all maximal independent sets have the same cardinality, $E$ is a maximal independent set of $M$. Since we made at most $|B|$ exchanges, $|B| < |A|$, and $C \cap A = \emptyset$, $E$ does not contain $A$. This is a contradiction, since we have previously determined that all maximal independent sets of $M$ in $B^{c}$ must contain $A$.

Part c

Let $M = (S, I)$ and let $P = \{S_{1}, \dots, S_{k}\}$ be the given partition of $S$. Clearly $\emptyset \in I$ since $|\emptyset \cap S_{i}| = 0$ for all $i$. If $A \in I$ and $B \subset A$ then $|B \cap S_{i}| ⩽ |A \cap S_{i}| ⩽ 1$ for all $i$, hence $B \in I$. If $A, B \in I$ with $|A| > |B|$ then there is an $i$ for which $|S_{i} \cap A| = 1$ and $|S_{i} \cap B| = 0$. Let $x \in A \cap S_{i}$; then $B \cup \{x\} \in I$. Hence $M$ is a matroid.
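To make the exchange step concrete, here is a small Python sketch, assuming the partition is given as a list of sets and that a set is independent when it meets each block in at most one element. The function names are hypothetical and serve only this example.

```python
def is_independent(subset, partition):
    """True if subset contains at most one element of every block."""
    return all(len(subset & block) <= 1 for block in partition)


def exchange_element(a, b, partition):
    """For independent a, b with |a| > |b|, return some x in a with b | {x} independent."""
    for block in partition:
        # A block hit by a but missed by b must exist by the counting argument.
        if len(a & block) == 1 and len(b & block) == 0:
            (x,) = a & block
            return x
    return None  # only reachable if the preconditions are violated
```

For instance, with blocks `{1, 2}`, `{3, 4}`, `{5}`, the sets `a = {1, 3, 5}` and `b = {2, 4}` are independent, and `exchange_element(a, b, partition)` returns 5.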

Problem 5

    BoundedDijkstra(G(V, E), W, s ∈ V):
        Initialize list B[0..W(V - 1)] with ∅
        Initialize list dist[1..V] with ∞
        B[0] ← {s}
        dist[s] ← 0
        for i ∈ [0..W(V - 1)]:
            for v ∈ B[i]:
                for each neighbor u of v:
                    dist' ← i + w(v, u)          # i = dist[v]
                    if dist[u] > dist':
                        if dist[u] ≠ ∞: remove u from B[dist[u]]
                        append u to B[dist']
                        dist[u] ← dist'
        return dist

The mechanism to pick the vertex with the shortest distance has been modified to use buckets in place of the priority queue used in the vanilla algorithm. Bounding the weights by $W$ allows us to bound the maximum weight of a shortest path in the graph by $W(V - 1)$. Every reachable vertex is processed, since a vertex can only be inserted into a later bucket or at the end of the current bucket.

Each edge is processed once, and we pass through $W(V - 1) + 1$ buckets. Thus, the algorithm runs in $O(W|V| + |E|)$ time.
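A Python sketch of the bucket idea, assuming vertices are numbered $0, \dots, n-1$, weights are integers in $[0, W]$, and the graph is given as adjacency lists of `(neighbor, weight)` pairs. Note one deliberate swap: instead of removing a vertex from its old bucket, this sketch leaves the stale entry in place and skips it later (lazy deletion), which keeps the same $O(W|V| + |E|)$ bound. The function name and parameters are assumptions.

```python
def bounded_dijkstra(n, adj, max_weight, source):
    """Single-source shortest paths with integer weights in [0, max_weight]."""
    inf = float("inf")
    dist = [inf] * n
    dist[source] = 0
    # One bucket per possible distance 0 .. max_weight * (n - 1).
    buckets = [[] for _ in range(max_weight * (n - 1) + 1)]
    buckets[0].append(source)
    for d, bucket in enumerate(buckets):
        # The bucket may grow during iteration when zero-weight edges exist.
        for v in bucket:
            if dist[v] != d:
                continue  # stale entry: v was later moved to an earlier bucket
            for u, w in adj[v]:
                nd = d + w
                if nd < dist[u]:
                    dist[u] = nd
                    buckets[nd].append(u)
    return dist
```

For example, with `adj = [[(1, 2), (2, 5)], [(2, 1), (3, 4)], [(3, 1)], []]`, calling `bounded_dijkstra(4, adj, 5, 0)` returns `[0, 2, 3, 4]`.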

Problem 6

Suppose $T$ is a binary tree that is not full. Then there is a node $v$ which has only one child $u$ . We split the solution into two cases:

If $v$ is the root node then delete $v$ and use $u$ as the root node instead.
If $v$ is not the root node then it has a parent, say $w$ . Here, delete $v$ and attach $u$ to $w$ .

In both cases, we have shown that there is a new tree which uses fewer bits to encode the data (since we are deleting an edge, every codeword below $v$ becomes one bit shorter). Hence $T$ cannot correspond to an optimal code.
