Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by...

8
ECE 534: Elements of Information Theory, Fall 2010 Homework 11 Solutions – all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two independent sources. Can one compress two inde- pendent sources simultaneously better than by compressing the sources individually? The following problem addresses this question. Let {X i } be i.i.d. p(x) with distortion d(x, ˆ x) and rate distortion function R X (D). Similarly, let {Y i } be i.i.d. p(y) with distortion d(y, ˆ y) and rate distortion function R Y (D). Suppose we now wish to describe the process {(X i ,Y i )} subject to distortions Ed(X, ˆ X ) D1 and Ed(Y, ˆ Y ) D2. Thus, a rate R X,Y (D1,D2) is sufficient, where R X,Y (D 1 ,D 2 )= min px, ˆ y|x,y):Ed(X, ˆ X)D 1 ,Ed(Y, ˆ Y )D 2 I (X, Y ; ˆ X, ˆ Y ) Now suppose that the {X i } process and the {Y i } process are independent of each other. (a) Show that: R X,Y (D 1 ,D 2 ) R X (D 1 )+ R Y (D 2 ) Solution: Given: R X,Y (D 1 ,D 2 )= min px, ˆ y|x,y):Ed(X, ˆ X)D 1 ,Ed(Y, ˆ Y )D 2 I (X, Y ; ˆ X, ˆ Y ) (1) Considering that {X i } and the {Y i } are independent of each other we have: I (X, Y ; ˆ X, ˆ Y )= H (X, Y ) - H (X, Y | ˆ X, ˆ Y ) (2) = H (X )+ H (Y ) - H (X | ˆ X, ˆ Y ) - H (Y |X, ˆ X, ˆ Y ) (3) = H (X )+ H (Y ) - H (X | ˆ X ) - H (Y | ˆ Y ) (4) H (X ) - H (X | ˆ X )+ H (Y ) - H (Y | ˆ Y ) (5) I (X ; ˆ X )+ I (Y ; ˆ Y ) (6) Now we obtain: R X,Y (D 1 ,D 2 )= min px, ˆ y|x,y):Ed(X, ˆ X)D 1 ,Ed(Y, ˆ Y )D 2 I (X, Y ; ˆ X, ˆ Y ) (7) min px, ˆ y|x,y):Ed(X, ˆ X)D 1 ,Ed(Y, ˆ Y )D 2 h I (X ; ˆ X )+ I (Y ; ˆ Y ) i (8) min px|x):Ed(X, ˆ X)D 1 I (X ; ˆ X )+ min py|y):Ed(Y, ˆ Y )D 2 I (Y ; ˆ Y ) (9) R X (D 1 )+ R Y (D 2 ) (10) 1

Transcript of Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by...

Page 1: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

ECE 534: Elements of Information Theory, Fall 2010Homework 11Solutions – all by Kenneth S. Palacio BausNovember 17, 2010

1. Problem 10.14. Rate distortion for two independent sources. Can one compress two inde-pendent sources simultaneously better than by compressing the sources individually? Thefollowing problem addresses this question. Let {Xi} be i.i.d. ∼ p(x) with distortion d(x, x)and rate distortion function RX(D). Similarly, let {Yi} be i.i.d. ∼ p(y) with distortion d(y, y)and rate distortion function RY (D). Suppose we now wish to describe the process {(Xi, Yi)}subject to distortions Ed(X, X) ≤ D1 and Ed(Y, Y ) ≤ D2. Thus, a rate RX,Y (D1, D2) issufficient, where

RX,Y (D1, D2) = minp(x,y|x,y):Ed(X,X)≤D1,Ed(Y,Y )≤D2

I(X,Y ; X, Y )

Now suppose that the {Xi} process and the {Yi} process are independent of each other.

(a) Show that:RX,Y (D1, D2) ≥ RX(D1) + RY (D2)

Solution:

Given:

RX,Y (D1, D2) = minp(x,y|x,y):Ed(X,X)≤D1,Ed(Y,Y )≤D2

I(X, Y ; X, Y ) (1)

Considering that {Xi} and the {Yi} are independent of each other we have:

I(X, Y ; X, Y ) = H(X, Y )−H(X,Y |X, Y ) (2)

= H(X) + H(Y )−H(X|X, Y )−H(Y |X, X, Y ) (3)

= H(X) + H(Y )−H(X|X)−H(Y |Y ) (4)

≥ H(X)−H(X|X) + H(Y )−H(Y |Y ) (5)

≥ I(X; X) + I(Y ; Y ) (6)

Now we obtain:

RX,Y (D1, D2) = minp(x,y|x,y):Ed(X,X)≤D1,Ed(Y,Y )≤D2

I(X, Y ; X, Y ) (7)

≥ minp(x,y|x,y):Ed(X,X)≤D1,Ed(Y,Y )≤D2

[I(X; X) + I(Y ; Y )

](8)

≥ minp(x|x):Ed(X,X)≤D1

I(X; X) + minp(y|y):Ed(Y,Y )≤D2

I(Y ; Y ) (9)

≥ RX(D1) + RY (D2) (10)

1

Page 2: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

(b) Does equality hold?

From part (a), we have that: RX,Y (D1, D2) ≥ RX(D1) + RY (D2).

Because of the independence assumed between processes {Xi} and {Yi} we have that:

p(x, y, x, y) = p(x, y|x, y)p(x, y) (11)= p(x|x)p(y|y)p(x)p(y) (12)= p(x, x)p(y, y) (13)

Then, for distributions p(x, x) and p(y, y) achieving rate distortions RX(D1) and RY (D2)we have that the mutual information of the product of the two distributions such thatp(x, y, x, y) = p(x, x)p(y, y) gives:

RX,Y (D1, D2) = minp(x,y|x,y):Ed(X,X)≤D1,Ed(Y,Y )≤D2

I(X, Y ; X, Y ) (14)

= RX(D1) + RY (D2) (15)

Hence, the equality holds.

Now answer the question: Can one compress two independent sources simultane-ously better than by compressing the sources individually?

From the previous result we can see that encoding two independent sources together resultsthe same as encoding each of them independently.

2

Page 3: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

2. Problem 15.1. Cooperative capacity of a multiple-access channel

(a) Suppose that X1 and X2 have access to both indices W1 ∈ {1, 2nR}, W2 ∈ {1, 2nR2}.Thus, the codewords X1(W1, W2), X2(W1, W2) depend on both indices. Find the capacityregion.

Solution:

Since we know that X1 and X2 have access to both indices W1 ∈ {1, 2nR}, W2 ∈ {1, 2nR2},and that the codewords depend on both indices, X1(W1, W2), X2(W1, W2), the pair (X1, X2)can be seen as a single codeword X we have that this is equivalent to have a single userchannel with alphabet given by X1×X2 and indice W1×W2. Then, we have a combined ratefor both senders as the only bound for the achievable region given by:

R1 + R2 ≤ C = maxp(x1,x2)

I(X1, X2; Y ) (16)

We can achieve this setting X2 = 0 so we have a rate pair (C, 0) and also by setting X1 = 0achieving a rate pair (0, C).

(b) Evaluate this region for the binary erasure multiple access channel Y = X1 + X2, Xi ∈{0, 1}. Compare to the noncooperative region.

The operation of this channel is shown in table 1. Then, the alphabet for Y is Y ∈ {0, 1, 2}.

X1 : X2 0 10 0 11 1 2

Table 1: Y = X1 + X2

To evaluate this region for the binary erasure multiple access channel Y = X1 + X2 for the

3

Page 4: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

cooperative capacity region we have:

R1 + R2 ≤ C = maxp(x1,x2)

I(X1, X2; Y ) (17)

= H(Y )−H(Y |X1, X2) (18)= H(Y ) (19)≤ log2 |Y| (20)

R1 + R2 ≤ log2 (3) (21)

To achieve this capacity we need to set the distribution of the possible inputs to be Uniform(13)

for example by setting: p(0, 0) = p(1, 1) = 13 and p(1, 0) + p(0, 1) = 1

3 .

When the senders work in non-cooperative mode, we have the region capacity of the binaryerasure multiple-access channel. The capacity region is shown in the following figure:

Figure 1: Capacity Region for problem 15.1b

4

Page 5: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

3. Problem 15.2. Capacity of multiple-access channels. Find the capacity region for each ofthe following multiple-access channels:

(a) Additive modulo 2 multiple-access channel. X1 ∈ {0, 1}, X2 ∈ {0, 1}, Y = X1⊕X2.

The following table shows the operation Y = X1 ⊕X2.

X1 : X2 0 10 0 11 1 0

Table 2: Y = X1 ⊕X2

Here we see that by setting one of the inputs to zero, it is possible achieve a rate of 1 bit pertransmission for the other sender, for example setting X2 = 0, the output is given by Y = X1.Doing the same for X1 = 0 gives Y = X2, then we have a rate of 1 bit. We can see from thetable 2 that the alphabet of the output is the same as the alphabet of the input, therefore toobtain the verify the capacity:

C = maxdist

I(X1, X2; Y ) ≤ H(Y ) = log2 |Y| = 1

Then, the combined rates R1 + R2 cannot be more than 1 bit, so we obtain a triangularcapacity region as shown in figure 2, defined by R1 + R2 ≤ 1.

Figure 2: Capacity region for problem 15.2

(b) Multiplicative multiple-access channel. X1 ∈ {−1, 1}, X2 ∈ {−1, 1}, Y = X1 ·X2.

The operation of this channel is shown in the following table:

X1 : X2 -1 1-1 +1 -11 -1 +1

Table 3: Y = X1 ·X2

5

Page 6: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

We can see that the result is exactly the same as part (a) as well as the capacity region(R1 + R2 ≤ 1.). Notice that the notation has changed, however, we still have a binarynotation +1 and −1.

6

Page 7: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

4. Problem 15.6. Unusual multiple-access channel. Consider the following multiple-accesschannel: X1 = X2 = Y = {0, 1}. If (X1, X2) = (0, 0), then Y = 0. If (X1, X2) = (0, 1), thenY = 1. If (X1, X2) = (1, 0), then Y = 1. If (X1, X2) = (1, 1), then Y = 0 with probability 1

2and Y = 1 with probability 1

2 .

The following table illustrate the operation of this channel:

X1 X2 Y

0 0 00 1 11 0 11 1 0, (Pr=1/2) —— 1, (Pr=1/2)

Table 4: Unusual multiple-access channel

(a) Show that the rate pairs (1, 0) and (0, 1) are achievable.

Solution:

To achieve the rate pairs (1, 0) and (0, 1) we can set one of the inputs to zero so the othersender will have a rate of 1 bit per transmission. For example setting X1 = 0 will yield intoY = X2 then, the rate pair (0,1) is achievable. Thus, we can also do the counter case andset X2 = 0 to obtain Y = X1 achieving the rate pair (1,0). This can be achieved by timesharing, like it shown in the Binary Multiplier channel example of the textbook.

To prove this, we know the capacity region bound for R1 is given by:

R1 ≤ I(X1; Y |X2)

For example for fixed X2 = 0 we have:

R1 ≤ I(X1; Y |X2 = 0) (22)≤ H(Y |X2 = 0)−H(Y |X1, X2 = 0) (23)

≤ H(Y |X2 = 0) ≤ H(1/2) ≤ 1 (24)

The result for setting X1 = 0 will follow from symmetry.

(b) Show that for any non-degenerate distribution p(x1)p(x2), we have I(X1, X2; Y ) <1.

A non-degenerate distribution is obtained by p(x1) 6= {0, 1} and p(x2) 6= {0, 1}. So, we canset an arbitrary distribution like Pr(X1 = 1) = p and Pr(X2 = 1) = q then know that theregion capacity for the combined rate is given by:

7

Page 8: Kenneth S. Palacio Bausdevroye/courses/ECE534/HW/HW11s.pdf · 17-11-2010  · Solutions { all by Kenneth S. Palacio Baus November 17, 2010 1. Problem 10.14. Rate distortion for two

R1 + R2 ≤ I(X1, X2; Y ) (25)≤ H(Y )−H(Y |X1, X2) (26)

We have the distribution of Y is given by:

Y ={

0, with probability (1− p)(1− q) + 12pq

1 with probability (1− p)q + (1− q)p + 12pq

Then, we can compute the combined rate:

R1 + R2 ≤ H(Y )−H(Y |X1, X2) (27)

≤ H

((1− p)(1− q) +

12pq

)− pqH(

12

) (28)

≤ H

((1− p)(1− q) +

12pq

)− pq (29)

(30)

Here, as p, q ∈ (0, 1), the product pq > 0 and since, the entropy of binary random variable isbounded by 1, we obtain the result:

R1 + R2 < 1 (31)

(c) Argue that there are points in the capacity region of this multiple-accesschannel that can only be achieved by timesharing; that is, there exist achievablerate pairs (R1, R2) that lie in the capacity region for the channel but not in theregion defined by:

R1 ≤ I(X1; Y |X2), (32)R2 ≤ I(X2; Y |X1), (33)

R1 + R2 ≤ I(X1, X2; Y ) (34)

for any product distribution p(x1)p(x2). Hence the operation of convexificationstrictly enlarges the capacity region. This channel was introduced independentlyby Csiszar and Korner [149] and Bierbaum and Wallmeier [59].

From part (b) we know that for a non-degenerate distribution the combined rate is: R1+R2 <1. We also know that the rates R1 and R2 are bounded by 1: R1 ≤ 1. R2 ≤ 1. For degeneratedistributions we have that I(X1; Y |X2) = 0 and I(X2; Y |X1) = 0 yielding rates R1 = 0 orR2 = 0. The achievable pairs (R1, R2) should lie in the region R1 + R2 < 1 (Convex hull).

Hence, the points in the capacity region that can be achieved only by time sharing are thoselying on the line R1 + R2 = 1 which defines the triangular capacity region for the rate pairs(1,0) and (0,1), as in part (a).

8