概述
- 枯燥预警,证明很繁琐,实际上尽力记住公式内涵即可,重点掌握詹森不等式。
- 选自《信息论(第四版)——基础理论与应用》傅祖芸编著 P 68 , 2.15 , 2.16 P68,2.15,2.16 P68,2.15,2.16,解答抄答案
- 写在前面:条件熵与无条件熵的关系为
H ( X 2 / X 1 ) ⩽ H ( X 2 ) H(X_2/X_1)leqslant H(X_2) H(X2/X1)⩽H(X2) - 证明:
在区域 [ 0 , 1 ] [0,1] [0,1]中,设 f ( x ) = − x l o g x f(x)=-xlogx f(x)=−xlogx,它是区域内的 ⋂ bigcap ⋂ 型凸函数。并设 x i = P ( a j / a i ) = p i j x_i=P(a_j/a_i)=p_{ij} xi=P(aj/ai)=pij,而 P ( a i ) = p i P(a_i)=p_i P(ai)=pi,有
∑ i = 1 q p i = 1 sum_{i=1}^qp_i=1 i=1∑qpi=1
所以根据詹森不等式
∑ i = 1 q p i f ( x i ) ⩽ f ( ∑ i = 1 q p i x i ) sum_{i=1}^qp_if(x_i)leqslant f(sum_{i=1}^qp_ix_i) i=1∑qpif(xi)⩽f(i=1∑qpixi)
得
− ∑ i = 1 q p i p i j l o g p i j ⩽ − ∑ i = 1 q p i p i j l o g ( ∑ i = 1 q p i p i j ) = − p j l o g p j (1) -sum_{i=1}^qp_ip_{ij}logp_{ij}leqslant-sum_{i=1}^qp_ip_{ij}log(sum_{i=1}^qp_ip_{ij})=-p_jlogp_jtag{1} −i=1∑qpipijlogpij⩽−i=1∑qpipijlog(i=1∑qpipij)=−pjlogpj(1)
∑ i = 1 q p i p i j = ∑ i = 1 q P ( a i ) P ( a j / a i ) = ∑ i = 1 q P ( a i a j ) = P ( a j ) = p j sum_{i=1}^q p_ip_{ij} = sum_{i=1}^qP(a_i)P(a_j/a_i)=sum_{i=1}^qP(a_ia_j)=P(a_j)=p_j i=1∑qpipij=i=1∑qP(ai)P(aj/ai)=i=1∑qP(aiaj)=P(aj)=pj
然后将式子
(
1
)
(1)
(1)两边对所有
j
j
j 求和,得
−
∑
i
=
1
q
∑
j
=
1
q
p
i
p
i
j
l
o
g
p
i
j
⩽
−
∑
j
=
1
q
p
j
l
o
g
p
j
-sum_{i=1}^qsum_{j=1}^qp_ip_{ij}logp_{ij}leqslant-sum_{j=1}^qp_jlogp_j
−i=1∑qj=1∑qpipijlogpij⩽−j=1∑qpjlogpj
即
只有当 P ( a j / a i ) = P ( a j ) P(a_j/a_i)=P(a_j) P(aj/ai)=P(aj),即前后两个符号出现统计独立时等式成立。
-
试证明离散平稳信源有 H ( X 3 / X 1 X 2 ) ⩽ H ( X 2 / X 1 ) H(X_3/X_1X_2)leqslant H(X_2/X_1) H(X3/X1X2)⩽H(X2/X1),并说明等式成立条件。
- 证明:
- 假设
离散平稳信源输出的随机序列符号为 ⋯ X 1 , X 2 , X 3 , ⋯ cdots X_1,X_2,X_3,cdots ⋯X1,X2,X3,⋯。又设 x 1 ∈ X 1 , x 2 ∈ X 2 , x 3 ∈ X 3 x_1in X_1,x_2in X_2,x_3in X_3 x1∈X1,x2∈X2,x3∈X3,而且 x 1 , x 2 , x 3 x_1,x_2,x_3 x1,x2,x3都取自于同一符号集 A = { a 1 , a 2 , ⋯ , a 3 } A= {a_1,a_2,cdots ,a_3} A={a1,a2,⋯,a3},并满足有
∑ X 2 P ( x 2 / x 1 ) = 1 , ∑ X 3 P ( x 3 / x 2 ) = 1 , ∑ X 3 P ( x 3 / x 1 x 2 ) = 1 ∑ X 1 P ( x 1 ) = ∑ X 2 P ( x 2 ) = ∑ X 3 P ( x 3 ) = 1 sum_{X_2}P(x_2/x_1)=1,sum_{X_3}P(x_3/x_2)=1,sum_{X_3}P(x_3/x_1x_2)=1\ sum_{X_1}P(x_1)=sum_{X_2}P(x_2)=sum_{X_3}P(x_3)=1 X2∑P(x2/x1)=1,X3∑P(x3/x2)=1,X3∑P(x3/x1x2)=1X1∑P(x1)=X2∑P(x2)=X3∑P(x3)=1
得到
∑ X 1 X 2 P ( x 1 x 2 ) = ∑ X 2 X 3 P ( x 2 x 3 ) = ∑ X 1 X 3 P ( x 1 x 3 ) = 1 ∑ X 1 X 2 X 3 P ( x 1 x 2 x 3 ) = 1 ∑ X 1 P ( x 1 x 2 x 3 ) = P ( x 2 x 3 ) ∑ X 2 P ( x 1 x 2 x 3 ) = P ( x 1 x 3 ) ∑ X 3 P ( x 1 x 2 x 3 ) = P ( x 1 x 2 ) sum_{X_1X_2}P(x_1x_2)=sum_{X_2X_3}P(x_2x_3)=sum_{X_1X_3}P(x_1x_3)=1\ sum_{X_1X_2X_3}P(x_1x_2x_3)=1\ sum_{X_1}P(x_1x_2x_3)=P(x_2x_3)\ sum_{X_2}P(x_1x_2x_3)=P(x_1x_3)\ sum_{X_3}P(x_1x_2x_3)=P(x_1x_2)\ X1X2∑P(x1x2)=X2X3∑P(x2x3)=X1X3∑P(x1x3)=1X1X2X3∑P(x1x2x3)=1X1∑P(x1x2x3)=P(x2x3)X2∑P(x1x2x3)=P(x1x3)X3∑P(x1x2x3)=P(x1x2)
- 方法一
在区域 [ 0 , 1 ] [0,1] [0,1]内设 f ( x ) = − x l o g x , f ( x ) f(x)=-xlogx,f(x) f(x)=−xlogx,f(x)在 [ 0 , 1 ] [0,1] [0,1]内是 ⋂ bigcap ⋂型凸函数,所以满足詹森不等式
∑ i = 1 q P i f ( x i ) ⩽ f ( ∑ i = 1 q P i x i ) 其 中 ∑ i = 1 q P i = 1 现 今 x i = P ( x 3 / x 1 x 2 ) , 设 其 概 率 空 间 为 P ( x 1 / x 2 ) , 并 满 足 ∑ X 1 P ( x 1 / x 2 ) = 1 sum_{i=1}^qP_if(x_i)leqslant f(sum_{i=1}^qP_ix_i)quad其中sum_{i=1}^qP_i=1\ 现今x_i=P(x_3/x_1x_2),设其概率空间为P(x_1/x_2),并满足\ sum_{X_1}P(x_1/x_2)=1 i=1∑qPif(xi)⩽f(i=1∑qPixi)其中i=1∑qPi=1现今xi=P(x3/x1x2),设其概率空间为P(x1/x2),并满足X1∑P(x1/x2)=1
所以根据詹森不等式得
∑ X 1 P ( x 1 / x 2 ) [ − x i l o g x i ] ⩽ − [ ∑ X 1 P ( x 1 / x 2 ) x i ] l o g [ ∑ X 1 P ( x 1 / x 2 ) x i ] sum_{X_1}P(x_1/x_2)[-x_ilogx_i]leqslant-left[ sum_{X_1}P(x_1/x_2)x_iright]logleft[ sum_{X_1}P(x_1/x_2)x_iright] X1∑P(x1/x2)[−xilogxi]⩽−[X1∑P(x1/x2)xi]log[X1∑P(x1/x2)xi]
x i = P ( x 3 / x 1 x 2 ) x_i=P(x_3/x_1x_2) xi=P(x3/x1x2)
− ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) l o g P ( x 3 / x 1 x 2 ) ⩽ − ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) l o g ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) -sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)logP(x_3/x_1x_2)\ leqslant -sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)logsum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)\ −X1∑P(x1/x2)P(x3/x1x2)logP(x3/x1x2)⩽−X1∑P(x1/x2)P(x3/x1x2)logX1∑P(x1/x2)P(x3/x1x2)
这里的所以非常的莫名奇妙,不等号直接跨到等号,我觉得可以直接得出
所以
∑
X
1
P
(
x
1
x
2
x
3
)
=
P
(
x
2
x
3
)
∑
X
1
P
(
x
1
x
3
/
x
2
)
P
(
x
2
)
=
P
(
x
3
/
x
2
)
P
(
x
2
)
sum_{X_1}P(x_1x_2x_3)=P(x_2x_3)\ sum_{X_1}P(x_1x_3/x_2)P(x_2)=P(x_3/x_2)P(x_2)
X1∑P(x1x2x3)=P(x2x3)X1∑P(x1x3/x2)P(x2)=P(x3/x2)P(x2)
上式对所有
x
1
,
x
2
,
x
3
x_1,x_2,x_3
x1,x2,x3的取值都成立,所以
直观感受这里把 P ( X 2 ) P(X_2) P(X2)约去了
∑ X 1 P ( x 1 x 3 / x 2 ) = P ( x 3 / x 2 ) sum_{X_1}P(x_1x_3/x_2)=P(x_3/x_2)\ X1∑P(x1x3/x2)=P(x3/x2)
为什么不能直接写
∑ X 1 P ( x 1 x 2 / x 3 ) = P ( x 2 / x 3 ) sum_{X_1}P(x_1x_2/x_3)=P(x_2/x_3)\ X1∑P(x1x2/x3)=P(x2/x3)
即
∑
X
1
P
(
x
1
/
x
2
)
P
(
x
3
/
x
1
x
2
)
=
P
(
x
3
/
x
2
)
sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)=P(x_3/x_2)
X1∑P(x1/x2)P(x3/x1x2)=P(x3/x2)
∑ X 1 P ( ( x 1 x 3 ) / x 2 ) = ∑ X 1 P ( ( ( x 3 / x 1 ) ( x 1 ) ) x 2 ) = ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) sum_{X_1}PBig((x_1x_3)/x_2Big)=sum_{X_1}Pbigg(frac{Big((x_3/x_1)(x_1)Big)}{x_2}bigg)=sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2) X1∑P((x1x3)/x2)=X1∑P(x2((x3/x1)(x1)))=X1∑P(x1/x2)P(x3/x1x2)
所以 − ∑ X 1 P ( x 1 x 3 / x 2 ) l o g P ( x 3 / x 1 x 2 ) ⩽ − P ( x 3 / x 2 ) l o g P ( x 3 / x 2 ) -sum_{X_1}P(x_1x_3/x_2)logP(x_3/x_1x_2)leqslant -P(x_3/x_2)logP(x_3/x_2) −X1∑P(x1x3/x2)logP(x3/x1x2)⩽−P(x3/x2)logP(x3/x2)
− ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) l o g P ( x 3 / x 1 x 2 ) ⩽ − ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) l o g ∑ X 1 P ( x 1 / x 2 ) P ( x 3 / x 1 x 2 ) -sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)logP(x_3/x_1x_2)\ leqslant -sum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)logsum_{X_1}P(x_1/x_2)P(x_3/x_1x_2)\ −X1∑P(x1/x2)P(x3/x1x2)logP(x3/x1x2)⩽−X1∑P(x1/x2)P(x3/x1x2)logX1∑P(x1/x2)P(x3/x1x2)
因为
0
⩽
P
(
x
2
)
⩽
1
0leqslant P(x_2) leqslant1
0⩽P(x2)⩽1,
x
2
∈
X
2
x_2in X_2
x2∈X2,所以上式两边相乘,等号不变,有
−
∑
X
1
P
(
x
2
)
P
(
x
1
x
3
/
x
2
)
l
o
g
P
(
x
3
/
x
1
x
2
)
⩽
−
P
(
x
2
)
P
(
x
3
/
x
2
)
l
o
g
P
(
x
3
/
x
2
)
-sum_{X_1}P(x_2)P(x_1x_3/x_2)logP(x_3/x_1x_2)leqslant -P(x_2)P(x_3/x_2)logP(x_3/x_2)
−X1∑P(x2)P(x1x3/x2)logP(x3/x1x2)⩽−P(x2)P(x3/x2)logP(x3/x2)
上式对所有
x
2
x_2
x2、
x
3
x_3
x3都成立,所以对所有
x
2
x_2
x2,
x
3
x_3
x3求和下式也成立
−
∑
X
1
∑
X
2
∑
X
3
P
(
x
1
x
2
x
3
)
l
o
g
P
(
x
3
/
x
1
x
2
)
⩽
−
∑
X
2
∑
X
3
P
(
x
2
x
3
)
l
o
g
P
(
x
3
/
x
2
)
-sum_{X_1}sum_{X_2}sum_{X_3}P(x_1x_2x_3)logP(x_3/x_1x_2)leqslant-sum_{X_2}sum_{X_3}P(x_2x_3)logP(x_3/x_2)
−X1∑X2∑X3∑P(x1x2x3)logP(x3/x1x2)⩽−X2∑X3∑P(x2x3)logP(x3/x2)
因为
H
(
X
3
/
X
1
X
2
)
⩽
H
(
X
3
/
X
2
)
quad H(X_3/X_1X_2)leqslant H(X_3/X_2)
H(X3/X1X2)⩽H(X3/X2)
所以是平稳信源
H
(
X
3
/
X
2
)
=
H
(
X
2
/
X
1
)
quad H(X_3/X_2)=H(X_2/X_1)
H(X3/X2)=H(X2/X1)
所以
H
(
X
3
/
X
1
X
2
)
⩽
H
(
X
2
/
X
1
)
quad H(X_3/X_1X_2) leqslant H(X_2/X_1)
H(X3/X1X2)⩽H(X2/X1)
只有当
P
(
x
3
/
x
1
x
2
)
=
p
(
x
3
/
x
2
)
P(x_3/x_1x_2)=p(x_3/x_2)
P(x3/x1x2)=p(x3/x2)(对所有
x
1
,
x
2
,
x
3
x_1,x_2,x_3
x1,x2,x3)时等式成立。
-
方法二更变态,但好像更通用,太长写不下,放个链接
-
试证明离散平稳信源有 H ( X 1 X 2 ⋯ X N ) ⩽ H ( X 1 ) + H ( X 2 ) + ⋯ + H ( X N ) H(X_1X_2cdots X_N)leqslant H(X_1)+H(X_2)+cdots+H(X_N) H(X1X2⋯XN)⩽H(X1)+H(X2)+⋯+H(XN),并说明等式成立条件。
- 解:
设离散平稳信源输出的随机符号序列为 ⋯ X 1 X 2 ⋯ X N − 1 X N X N + 1 ⋯ cdots X_1X_2cdots X_{N-1}X_NX_{N+1}cdots ⋯X1X2⋯XN−1XNXN+1⋯,又设 x 1 ∈ X 1 , x 2 ∈ X 2 , ⋯ , x N − 1 ∈ X N − 1 , x N ∈ X N x_1 in X_1,x_2in X_2,cdots,x_{N-1}in X_{N-1},x_Nin X_N x1∈X1,x2∈X2,⋯,xN−1∈XN−1,xN∈XN,而且 x 1 , x 2 , ⋯ x N − 1 , x N x_1,x_2,cdots x_{N-1},x_N x1,x2,⋯xN−1,xN都取自于同一个符号集 A : ∣ a 1 , a 2 , ⋯ , a q ∣ A:|a_1,a_2,cdots,a_q| A:∣a1,a2,⋯,aq∣,并满足有
∑ X i ∑ X i + 1 P ( x i x i + 1 ) = 1 i = 1 , 2 , ⋯ , N ∑ X i ∑ X i + 1 ∑ X i + 2 P ( x i x i + 1 x i + 2 ) = 1 i = 1 , 2 , ⋯ , N ⋮ ∑ X 1 ∑ X 2 ⋯ ∑ X N P ( x 1 x 2 ⋯ x N − 1 x N ) = 1 sum_{X_i}sum_{X_{i+1}}P(x_ix_{i+1})=1quad i=1,2,cdots,N\ sum_{X_i}sum_{X_{i+1}}sum_{X_{i+2}}P(x_ix_{i+1}x_{i+2})=1quad i=1,2,cdots,N\ vdots \ sum_{X_1}sum_{X_2}cdotssum_{X_N}P(x_1x_2cdots x_{N-1}x_N)=1quad \ Xi∑Xi+1∑P(xixi+1)=1i=1,2,⋯,NXi∑Xi+1∑Xi+2∑P(xixi+1xi+2)=1i=1,2,⋯,N⋮X1∑X2∑⋯XN∑P(x1x2⋯xN−1xN)=1
这其实是两个系列的式子
∑
X
i
P
(
x
i
x
i
+
1
)
=
P
(
x
i
+
1
)
i
=
1
,
2
,
⋯
,
N
∑
X
i
∑
X
i
+
1
P
(
x
i
x
i
+
1
x
i
+
2
)
=
P
(
x
i
+
2
)
i
=
1
,
2
,
⋯
,
N
⋮
∑
X
1
∑
X
2
⋯
∑
X
N
−
1
P
(
x
1
x
2
⋯
x
N
−
1
x
N
)
=
P
(
x
N
)
sum_{X_i}P(x_ix_{i+1})=P(x_{i+1})quad i=1,2,cdots,N\ sum_{X_i}sum_{X_{i+1}}P(x_ix_{i+1}x_{i+2})=P(x_{i+2})quad i=1,2,cdots,N\ vdots\ sum_{X_1}sum_{X_2}cdotssum_{X_{N-1}}P(x_1x_2cdots x_{N-1}x_N)=P(x_N)\
Xi∑P(xixi+1)=P(xi+1)i=1,2,⋯,NXi∑Xi+1∑P(xixi+1xi+2)=P(xi+2)i=1,2,⋯,N⋮X1∑X2∑⋯XN−1∑P(x1x2⋯xN−1xN)=P(xN)
根据离平稳信源的信息熵的表达式有
H
(
X
1
X
2
⋯
X
N
−
1
X
N
)
=
H
(
X
1
)
+
H
(
X
2
/
X
1
)
+
H
(
X
3
/
X
1
X
2
)
+
H
(
X
N
/
X
1
X
2
⋯
X
N
−
1
)
H(X_1X_2cdots X_{N-1}X_N)=H(X_1)+H(X_2/X_1)+\ H(X_3/X_1X_2)+H(X_N/X_1X_2cdots X_{N-1})
H(X1X2⋯XN−1XN)=H(X1)+H(X2/X1)+H(X3/X1X2)+H(XN/X1X2⋯XN−1)
根据上题
(
2
)
(2)
(2)式,现设
???喵喵喵,还带这样玩?
− ∑ i = 1 q P i l o g P i ⩽ − ∑ i = 1 q P i l o g P i ′ (2) -sum_{i=1}^qP_ilogP_ileqslant -sum_{i=1}^qP_ilog{P_i}^{'}tag{2} −i=1∑qPilogPi⩽−i=1∑qPilogPi′(2)
P
i
=
P
(
x
2
/
x
1
)
其
满
足
0
⩽
P
i
⩽
1
,
∑
X
2
P
(
x
2
/
x
1
)
=
1
P
i
′
=
P
(
x
2
)
也
满
足
0
⩽
P
i
′
⩽
1
,
∑
X
2
P
(
x
2
)
=
1
P_i=P(x_2/x_1)quad其满足quad0leqslant P_ileqslant 1,sum_{X_2}P(x_2/x_1)=1\ {P_i}^{'}=P(x_2)quad也满足quad0leqslant P_i^{'}leqslant 1,sum_{X_2}P(x_2)=1\
Pi=P(x2/x1)其满足0⩽Pi⩽1,X2∑P(x2/x1)=1Pi′=P(x2)也满足0⩽Pi′⩽1,X2∑P(x2)=1
则有
−
∑
X
2
P
(
x
2
/
x
1
)
l
o
g
P
(
x
2
/
x
1
)
⩽
−
∑
X
2
P
(
x
2
/
x
1
)
l
o
g
P
(
x
2
)
(2)
-sum_{X_2}P(x_2/x_1)logP(x_2/x_1)leqslant -sum_{X_2}P(x_2/x_1)logP(x_2)tag{2}
−X2∑P(x2/x1)logP(x2/x1)⩽−X2∑P(x2/x1)logP(x2)(2)
因为
P
(
x
1
)
>
0
P(x_1)>0
P(x1)>0,所以上式两边相乘,等号不变。
−
∑
X
2
P
(
x
1
)
P
(
x
2
/
x
1
)
l
o
g
P
(
x
2
/
x
1
)
⩽
−
∑
X
2
P
(
x
1
)
P
(
x
2
/
x
1
)
l
o
g
P
(
x
2
)
-sum_{X_2}P(x_1)P(x_2/x_1)logP(x_2/x_1)leqslant -sum_{X_2}P(x_1)P(x_2/x_1)logP(x_2)
−X2∑P(x1)P(x2/x1)logP(x2/x1)⩽−X2∑P(x1)P(x2/x1)logP(x2)
P ( x 1 ) P ( x 2 / x 1 ) = P ( x 2 x 1 ) P(x_1)P(x_2/x_1)=P(x_2x_1) P(x1)P(x2/x1)=P(x2x1)
上式对所有
x
1
x_1
x1 的取值成立,所以对所有
x
1
x_1
x1 求和,下式成立
−
∑
X
1
∑
X
2
P
(
x
2
x
1
)
l
o
g
P
(
x
2
/
x
1
)
⩽
−
∑
X
1
∑
X
2
P
(
x
1
x
2
)
l
o
g
P
(
x
2
)
⩽
−
∑
X
2
P
(
x
2
)
l
o
g
P
(
x
2
)
=
H
(
X
2
)
begin{aligned} -sum_{X_1}sum_{X_2}P(x_2x_1)logP(x_2/x_1)leqslant& -sum_{X_1}sum_{X_2}P(x_1x_2)logP(x_2)\ leqslant&-sum_{X_2}P(x_2)logP(x_2)\ =&H(X_2) end{aligned}
−X1∑X2∑P(x2x1)logP(x2/x1)⩽⩽=−X1∑X2∑P(x1x2)logP(x2)−X2∑P(x2)logP(x2)H(X2)
证得
H
(
X
2
/
X
1
)
⩽
H
(
X
2
)
quadquad H(X_2/X_1)leqslant H(X_2)
H(X2/X1)⩽H(X2)
只有当
P
(
x
2
/
x
1
)
=
P
(
x
2
)
P(x_2/x_1)=P(x_2)
P(x2/x1)=P(x2),(对所有
x
1
,
x
2
x_1,x_2
x1,x2)时等式成立。
同理,设
P
(
x
i
)
=
P
(
x
3
/
x
1
x
2
)
quad P(x_i)=P(x_3/x_1x_2)quad
P(xi)=P(x3/x1x2)其满足
0
⩽
P
i
⩽
1
,
∑
X
3
P
(
x
3
/
x
1
x
2
)
=
1
quad0leqslant P_ileqslant 1,displaystylesum_{X_3}P(x_3/x_1x_2)=1
0⩽Pi⩽1,X3∑P(x3/x1x2)=1
P
i
′
=
P
(
x
3
)
也
满
足
0
⩽
P
i
′
⩽
1
,
∑
X
3
P
(
x
3
)
=
1
{P_i}^{'}=P(x_3)quad也满足quad0leqslant P_i^{'}leqslant 1,sum_{X_3}P(x_3)=1
Pi′=P(x3)也满足0⩽Pi′⩽1,X3∑P(x3)=1
继续补充 ( 2 ) (2) (2)式
− ∑ i = 1 q P i l o g P i ⩽ − ∑ i = 1 q P i l o g P i ′ (2) -sum_{i=1}^qP_ilogP_ileqslant -sum_{i=1}^qP_ilog{P_i}^{'}tag{2} −i=1∑qPilogPi⩽−i=1∑qPilogPi′(2)
则有
−
∑
X
3
P
(
x
3
/
x
1
x
2
)
l
o
g
P
(
x
3
/
x
1
x
2
)
⩽
∑
X
3
P
(
x
3
/
x
1
x
2
)
l
o
g
P
(
x
3
)
quad-displaystylesum_{X_3}P(x_3/x_1x_2)logP(x_3/x_1x_2) leqslantsum_{X_3}P(x_3/x_1x_2)logP(x_3)
−X3∑P(x3/x1x2)logP(x3/x1x2)⩽X3∑P(x3/x1x2)logP(x3)
因为
P
(
x
1
x
2
)
>
0
P(x_1x_2)>0
P(x1x2)>0,所以上式两边相乘,符号不变,得
−
∑
X
3
P
(
x
1
x
2
)
P
(
x
3
/
x
1
x
2
)
l
o
g
P
(
x
3
/
x
1
x
2
)
⩽
−
∑
X
3
P
(
x
1
x
2
)
P
(
x
3
/
x
1
x
2
)
l
o
g
P
(
x
3
)
quad-displaystylesum_{X_3}P(x_1x_2)P(x_3/x_1x_2)logP(x_3/x_1x_2)\ leqslant-sum_{X_3}P(x_1x_2)P(x_3/x_1x_2)logP(x_3)
−X3∑P(x1x2)P(x3/x1x2)logP(x3/x1x2)⩽−X3∑P(x1x2)P(x3/x1x2)logP(x3)
上式对所有
x
1
,
x
2
x_1,x_2
x1,x2 的取值都成立,所以有
−
∑
X
1
∑
X
2
∑
X
3
P
(
x
1
x
2
x
3
)
l
o
g
P
(
x
3
/
x
1
x
2
)
⩽
−
∑
X
1
∑
X
2
∑
X
3
P
(
x
1
x
2
x
3
)
l
o
g
P
(
x
3
)
=
−
∑
X
3
P
(
x
3
)
l
o
g
P
(
x
3
)
begin{aligned} &-sum_{X_1}sum_{X_2}sum_{X_3}P(x_1x_2x_3)logP(x_3/x_1x_2)\ &leqslant-sum_{X_1}sum_{X_2}sum_{X_3}P(x_1x_2x_3)logP(x_3)\ &=-sum_{X_3}P(x_3)logP(x_3)\ end{aligned}\
−X1∑X2∑X3∑P(x1x2x3)logP(x3/x1x2)⩽−X1∑X2∑X3∑P(x1x2x3)logP(x3)=−X3∑P(x3)logP(x3)
⇒
H
(
X
3
/
X
1
X
2
)
⩽
H
(
X
3
)
Rightarrow H(X_3/X_1X_2)leqslant H(X_3)
⇒H(X3/X1X2)⩽H(X3)
只有当
P
(
x
3
/
x
1
x
2
)
=
P
(
x
3
)
P(x_3/x_1x_2)=P(x_3)
P(x3/x1x2)=P(x3) (对所有
x
1
,
x
2
,
x
3
x_1,x_2,x_3
x1,x2,x3)时等式成立
依此类推,对于所有
H
(
X
N
/
X
1
X
2
⋯
X
N
−
1
)
H(X_N/X_1X_2cdots X_{N-1})
H(XN/X1X2⋯XN−1) 同理可设
P
i
=
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
P_i=P(x_N/x_1x_2cdots x_{N-1})
Pi=P(xN/x1x2⋯xN−1)
其满足
0
⩽
P
i
⩽
1
0leqslant P_ileqslant1
0⩽Pi⩽1,则有
∑
X
N
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
=
1
sum_{X_N}P(x_N/x_1x_2cdots x_{N-1})=1
XN∑P(xN/x1x2⋯xN−1)=1
又设
P
i
′
=
P
(
x
N
)
P_i^{'}=P(x_N)
Pi′=P(xN),也满足
0
⩽
P
i
′
⩽
1
∑
X
N
P
(
x
N
)
=
1
0leqslant P_i^{'}leqslant1quad displaystylesum_{X_N}P(x_N)=1
0⩽Pi′⩽1XN∑P(xN)=1,则有
−
∑
X
N
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
l
o
g
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
⩽
−
∑
X
N
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
l
o
g
P
(
x
N
)
-sum_{X_N}P(x_N/x_1x_2cdots x_{N-1})logP(x_N/x_1x_2cdots x_{N-1})\ leqslant-sum_{X_N}P(x_N/x_1x_2cdots x_{N-1})logP(x_N)
−XN∑P(xN/x1x2⋯xN−1)logP(xN/x1x2⋯xN−1)⩽−XN∑P(xN/x1x2⋯xN−1)logP(xN)
再来一遍 ( 2 ) (2) (2)式
− ∑ i = 1 q P i l o g P i ⩽ − ∑ i = 1 q P i l o g P i ′ (2) -sum_{i=1}^qP_ilogP_ileqslant -sum_{i=1}^qP_ilog{P_i}^{'}tag{2} −i=1∑qPilogPi⩽−i=1∑qPilogPi′(2)
因为
P
(
x
1
x
2
⋯
x
N
−
1
)
>
0
P(x_1x_2cdots x_{N-1})>0
P(x1x2⋯xN−1)>0,所以上式两边相乘,符号不变,得
−
∑
X
N
P
(
x
1
x
2
⋯
x
N
−
1
)
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
l
o
g
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
⩽
−
∑
X
N
P
(
x
1
x
2
⋯
x
N
−
1
)
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
l
o
g
P
(
x
N
)
-sum_{X_N}P(x_1x_2cdots x_{N-1})P(x_N/x_1x_2cdots x_{N-1})logP(x_N/x_1x_2cdots x_{N-1})\ leqslant-sum_{X_N}P(x_1x_2cdots x_{N-1})P(x_N/x_1x_2cdots x_{N-1})logP(x_N)
−XN∑P(x1x2⋯xN−1)P(xN/x1x2⋯xN−1)logP(xN/x1x2⋯xN−1)⩽−XN∑P(x1x2⋯xN−1)P(xN/x1x2⋯xN−1)logP(xN)
上式对所有
x
1
,
x
2
,
⋯
,
x
N
−
1
x_1,x_2,cdots,x_{N-1}
x1,x2,⋯,xN−1 的取值都成立,所以对
x
1
,
x
2
,
⋯
,
x
N
−
1
x_1,x_2,cdots,x_{N-1}
x1,x2,⋯,xN−1 求和,下式成立
−
∑
X
1
∑
X
2
⋯
∑
X
N
−
1
∑
X
N
P
(
x
1
x
2
⋯
x
N
−
1
x
N
)
l
o
g
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
⩽
−
∑
X
1
∑
X
2
⋯
∑
X
N
−
1
∑
X
N
P
(
x
1
x
2
⋯
x
N
−
1
x
N
)
l
o
g
P
(
x
N
)
=
−
∑
X
N
P
(
x
N
)
l
o
g
P
(
x
N
)
=
H
(
x
N
)
begin{aligned} &-sum_{X_1}sum_{X_2}cdotssum_{X_{N-1}}sum_{X_N}P(x_1x_2cdots x_{N-1}x_N)logP(x_N/x_1x_2cdots x_{N-1})\ &leqslant-sum_{X_1}sum_{X_2}cdotssum_{X_{N-1}}sum_{X_N}P(x_1x_2cdots x_{N-1}x_N)logP(x_N)\ &=-sum_{X_N}P(x_N)logP(x_N)\ &=H(x_N) end{aligned}
−X1∑X2∑⋯XN−1∑XN∑P(x1x2⋯xN−1xN)logP(xN/x1x2⋯xN−1)⩽−X1∑X2∑⋯XN−1∑XN∑P(x1x2⋯xN−1xN)logP(xN)=−XN∑P(xN)logP(xN)=H(xN)
⇒
H
(
X
N
/
X
1
X
2
⋯
X
N
−
1
)
⩽
H
(
X
N
)
Rightarrow H(X_N/X_1X_2cdots X_{N-1})leqslant H(X_N)
⇒H(XN/X1X2⋯XN−1)⩽H(XN)
只有当
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
=
P
(
x
N
)
P(x_N/x_1x_2cdots x_{N-1})=P(x_N)
P(xN/x1x2⋯xN−1)=P(xN) (对所有
x
1
,
x
2
,
⋯
,
x
N
−
1
,
x
N
x_1,x_2,cdots,x_{N-1},x_N
x1,x2,⋯,xN−1,xN)时等式成立。
由此可证得
H
(
X
1
X
2
⋯
X
N
−
1
X
N
)
=
H
(
X
1
)
+
H
(
X
2
/
X
1
)
+
H
(
X
3
/
X
2
X
1
)
+
⋯
+
H
(
X
N
/
X
1
X
2
⋯
X
N
−
1
)
⩽
H
(
X
1
)
+
H
(
X
2
)
+
H
(
X
3
)
+
⋯
+
H
(
X
N
)
begin{aligned} &H(X_1X_2cdots X_{N-1}X_N)\ & = H(X_1)+H(X_2/X_1)+H(X_3/X_2X_1)+cdots+H(X_N/X_1X_2cdots X_{N-1})\ & leqslant H(X_1)+H(X_2)+H(X_3)+cdots+H(X_N) end{aligned}
H(X1X2⋯XN−1XN)=H(X1)+H(X2/X1)+H(X3/X2X1)+⋯+H(XN/X1X2⋯XN−1)⩽H(X1)+H(X2)+H(X3)+⋯+H(XN)
只有当下列等式对所有
x
1
,
x
2
,
⋯
,
x
N
−
1
,
x
N
x_1,x_2,cdots,x_{N-1},x_N
x1,x2,⋯,xN−1,xN
P
(
x
2
/
x
1
)
=
P
(
x
2
)
P
(
x
3
/
x
1
x
2
)
=
P
(
x
3
)
⋮
P
(
x
N
/
x
1
x
2
⋯
x
N
−
1
)
=
P
(
x
N
)
}
left. begin{matrix} P(x_2/x_1)=P(x_2)&\ P(x_3/x_1x_2)=P(x_3)&\ vdots \ P(x_N/x_1x_2cdots x_{N-1})=P(x_N)&\ end{matrix} right}
P(x2/x1)=P(x2)P(x3/x1x2)=P(x3)⋮P(xN/x1x2⋯xN−1)=P(xN)⎭⎪⎪⎪⎬⎪⎪⎪⎫
同时成立时,等式成立。
即只有当
P
(
x
1
x
2
⋯
x
N
−
1
x
N
)
=
P
(
x
1
)
P
(
x
2
)
⋯
P
(
x
N
−
1
)
P
(
x
N
)
P(x_1x_2cdots x_{N-1}x_N)=P(x_1)P(x_2)cdots P(x_{N-1})P(x_N)
P(x1x2⋯xN−1xN)=P(x1)P(x2)⋯P(xN−1)P(xN) 满足时,等式成立。
这就表明只有当离散信源输出的
N
N
N 长的随机序列之间彼此统计无依赖时,等式成立。
最后
以上就是兴奋蜻蜓为你收集整理的由条件熵与无条件熵的关系引出的不等式证明题(不会抄答案系列)的全部内容,希望文章能够帮你解决由条件熵与无条件熵的关系引出的不等式证明题(不会抄答案系列)所遇到的程序开发问题。
如果觉得靠谱客网站的内容还不错,欢迎将靠谱客网站推荐给程序员好友。
发表评论 取消回复