A Refinement of Expurgation

Giuseppe Cocco, Albert Guillén i Fàbregas and Josep Font-Segura Giuseppe Cocco is with the Department of Signal Theory and Communications, Universitat Politècnica de Catalunya, 08034, Barcelona, Spain (e-mail: [email protected]). Albert Guillén i Fàbregas is with the Department of Engineering, University of Cambridge, CB2 1PZ Cambridge, U.K., and also with the Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain (e-mail: [email protected]). Josep Font-Segura is with the Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018, Barcelona, Spain (e-mail: [email protected]). This work was supported in part by the Ramon y Cajal fellowship program (grant RYC2021-033908-I) funded by the Spanish Government through MCIN/AEI/10.13039/501100011033 and the European Union “NextGenerationEU” Recovery Plan, the European Research Council under ERC Agreement 725411 and by the Spanish Ministry of Economy and Competitiveness under Grant PID2020-116683GB-C22.
Abstract

We show that for a wide range of channels and code ensembles with pairwise-independent codewords, with probability tending to 1111 with the code length, expurgating an arbitrarily small fraction of codewords from a randomly selected code results in a code attaining the expurgated exponent.

I Preliminaries

We consider the problem of reliable communication of Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT equiprobable messages over noisy channels described by a random transformation Wn(𝒚|𝒙)superscript𝑊𝑛conditional𝒚𝒙W^{n}\mkern-1.5mu(\boldsymbol{y}|\boldsymbol{x})italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_y | bold_italic_x ), where 𝒙𝒳n𝒙superscript𝒳𝑛\boldsymbol{x}\in\mathcal{X}^{n}bold_italic_x ∈ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and 𝒚𝒴n𝒚superscript𝒴𝑛\boldsymbol{y}\in\mathcal{Y}^{n}bold_italic_y ∈ caligraphic_Y start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are the channel input and output sequences, and 𝒳𝒳\mathcal{X}caligraphic_X and 𝒴𝒴\mathcal{Y}caligraphic_Y are the input and output alphabets, respectively. Each message m{1,,Mn}𝑚1subscript𝑀𝑛m\in\{1,\dotsc,M_{n}\}italic_m ∈ { 1 , … , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }, where Mn=2nRsubscript𝑀𝑛superscript2𝑛𝑅M_{n}=\lceil 2^{nR}\rceilitalic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ⌈ 2 start_POSTSUPERSCRIPT italic_n italic_R end_POSTSUPERSCRIPT ⌉, R𝑅Ritalic_R being the code rate, is mapped onto an n𝑛nitalic_n-length codeword 𝒙msubscript𝒙𝑚\boldsymbol{x}_{m}bold_italic_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT sent over the channel. The code is defined as 𝒞(Mn,n)={𝒙1,,𝒙Mn}𝒞subscript𝑀𝑛𝑛subscript𝒙1subscript𝒙subscript𝑀𝑛\mathcal{C}(M_{n},n)=\{\boldsymbol{x}_{1},\dotsc,\boldsymbol{x}_{M_{n}}\}caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) = { bold_italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_italic_x start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT }. We denote with Pe,m(𝒞(Mn,n))subscript𝑃e𝑚𝒞subscript𝑀𝑛𝑛P_{{\rm e},m}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) the error probability when codeword m{1,,Mn}𝑚1subscript𝑀𝑛m\in\{1,\dotsc,M_{n}\}italic_m ∈ { 1 , … , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } from code 𝒞(Mn,n)𝒞subscript𝑀𝑛𝑛\mathcal{C}(M_{n},n)caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) is transmitted; similarly Pe(𝒞(Mn,n))=1Mnm=1MnPe,m(𝒞(Mn,n))subscript𝑃e𝒞subscript𝑀𝑛𝑛1subscript𝑀𝑛superscriptsubscript𝑚1subscript𝑀𝑛subscript𝑃e𝑚𝒞subscript𝑀𝑛𝑛P_{\rm e}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}=\frac{1}{M_{n}}\sum_{m=1}^{M_{n}% }P_{{\rm e},m}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) = divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) denotes the average error probability of the code. Let 𝖢(Mn,n)={𝑿1,,𝑿Mn}𝖢subscript𝑀𝑛𝑛subscript𝑿1subscript𝑿subscript𝑀𝑛\mathsf{C}(M_{n},n)=\{\boldsymbol{X}_{1},\dotsc,\boldsymbol{X}_{M_{n}}\}sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) = { bold_italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_italic_X start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT } be a random code, i.e., a set of Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT random codewords generated with probability [𝖢(Mn,n)=𝒞(Mn,n)]=[𝑿1=𝒙1,,𝑿Mn=𝒙Mn]delimited-[]𝖢subscript𝑀𝑛𝑛𝒞subscript𝑀𝑛𝑛delimited-[]formulae-sequencesubscript𝑿1subscript𝒙1subscript𝑿subscript𝑀𝑛subscript𝒙subscript𝑀𝑛\mathbb{P}[\mathsf{C}(M_{n},n)=\mathcal{C}(M_{n},n)]=\mathbb{P}[\boldsymbol{X}% _{1}=\boldsymbol{x}_{1},\dotsc,\boldsymbol{X}_{M_{n}}=\boldsymbol{x}_{M_{n}}]blackboard_P [ sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) = caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ] = blackboard_P [ bold_italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = bold_italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_italic_X start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT = bold_italic_x start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ]. We assume that codewords are generated in a pairwise independent manner, that is, for any two indices m,k{1,,Mn},mkformulae-sequence𝑚𝑘1subscript𝑀𝑛𝑚𝑘m,k\in\{1,\ldots,M_{n}\},m\neq kitalic_m , italic_k ∈ { 1 , … , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } , italic_m ≠ italic_k, it holds that [𝑿m=𝒙m,𝑿k=𝒙k]=Qn(𝒙m)Qn(𝒙k)delimited-[]formulae-sequencesubscript𝑿𝑚subscript𝒙𝑚subscript𝑿𝑘subscript𝒙𝑘superscript𝑄𝑛subscript𝒙𝑚superscript𝑄𝑛subscript𝒙𝑘\mathbb{P}[\boldsymbol{X}_{m}=\boldsymbol{x}_{m},\boldsymbol{X}_{k}=% \boldsymbol{x}_{k}]=Q^{n}(\boldsymbol{x}_{m})Q^{n}(\boldsymbol{x}_{k})blackboard_P [ bold_italic_X start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = bold_italic_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] = italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ), where Qn(𝒙m)=[𝑿m=𝒙m]superscript𝑄𝑛subscript𝒙𝑚delimited-[]subscript𝑿𝑚subscript𝒙𝑚Q^{n}(\boldsymbol{x}_{m})=\mathbb{P}[\boldsymbol{X}_{m}=\boldsymbol{x}_{m}]italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = blackboard_P [ bold_italic_X start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = bold_italic_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ] is a probability distribution defined over 𝒳nsuperscript𝒳𝑛\mathcal{X}^{n}caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

Let Pe,m(𝖢(Mn,n))subscript𝑃e𝑚𝖢subscript𝑀𝑛𝑛P_{{\rm e},m}\big{(}\mathsf{C}(M_{n},n)\big{)}italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) and Pe(𝖢(Mn,n))subscript𝑃e𝖢subscript𝑀𝑛𝑛P_{\rm e}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) be the random variables denoting the error probability of the m𝑚mitalic_m-th codeword for random code 𝖢(Mn,n)𝖢subscript𝑀𝑛𝑛\mathsf{C}(M_{n},n)sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) and the average error probability of the code, respectively. We denote the n𝑛nitalic_n-length error exponents of such random variables by Em(𝖢(Mn,n))=1nlogPe,m(𝖢(Mn,n))subscript𝐸𝑚𝖢subscript𝑀𝑛𝑛1𝑛subscript𝑃e𝑚𝖢subscript𝑀𝑛𝑛E_{m}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}=-\frac{1}{n}\log P_{{\rm e},m}\big{(}% \mathsf{C}(M_{n},n)\big{)}italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) = - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) and E(𝖢(Mn,n))=1nlogPe(𝖢(Mn,n))𝐸𝖢subscript𝑀𝑛𝑛1𝑛subscript𝑃e𝖢subscript𝑀𝑛𝑛E\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}=-\frac{1}{n}\log P_{\rm e}\bigl{(}\mathsf% {C}(M_{n},n)\bigr{)}italic_E ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) = - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ), respectively. For some ensembles and channels the ensemble-average of the code error probability 𝔼[Pe(𝖢(Mn,n))]𝔼delimited-[]subscript𝑃e𝖢subscript𝑀𝑛𝑛\mathbb{E}\bigl{[}P_{\rm e}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}\bigr{]}blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) ] is known to decay exponentially in n𝑛nitalic_n [1]. A lower bound on the error exponent 1nlog𝔼[Pe(𝖢(Mn,n))]1𝑛𝔼delimited-[]subscript𝑃e𝖢subscript𝑀𝑛𝑛-\frac{1}{n}\log\mathbb{E}\bigl{[}P_{\rm e}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}% \bigr{]}- divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) ] is given by Gallager’s multi-letter random coding exponent Ern(R,Qn)superscriptsubscript𝐸r𝑛𝑅superscript𝑄𝑛E_{{\rm r}}^{n}(R,Q^{n})italic_E start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) in [2, Eq. (5.6.16)]. For the DMC (DMC), this bound is known to coincide with the sphere-packing upper bound on the reliability function [3, 4] in the high rate region.

In [5, Sec. 5.7] Gallager showed that, for some channels and ensembles, there exists a code with strictly higher error exponent than Ern(R,Qn)superscriptsubscript𝐸r𝑛𝑅superscript𝑄𝑛E_{{\rm r}}^{n}(R,Q^{n})italic_E start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) at low rates. In order to show this, Gallager considered a pairwise-independent ensemble with Mn=2Mn1superscriptsubscript𝑀𝑛2subscript𝑀𝑛1M_{n}^{\prime}=2M_{n}-1italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 2 italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 codewords. Using Markov’s inequality he showed that

[Pe,m(𝖢(Mn,n))21s𝔼[Pe,m(𝖢(Mn,n))s]1s]12delimited-[]subscript𝑃e𝑚𝖢superscriptsubscript𝑀𝑛𝑛superscript21𝑠𝔼superscriptdelimited-[]subscript𝑃e𝑚superscript𝖢superscriptsubscript𝑀𝑛𝑛𝑠1𝑠12\displaystyle\mathbb{P}\Bigl{[}P_{{\rm e},m}(\mathsf{C}(M_{n}^{\prime},n))\geq 2% ^{\frac{1}{s}}\mathbb{E}[P_{{\rm e},m}(\mathsf{C}(M_{n}^{\prime},n))^{s}]^{% \frac{1}{s}}\Bigr{]}\leq\frac{1}{2}blackboard_P [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ≥ 2 start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT ] ≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG (1)

for any s>0𝑠0s>0italic_s > 0. He then introduced the indicator function

φm(𝒞(Mn,n))subscript𝜑𝑚𝒞subscript𝑀𝑛𝑛\displaystyle\varphi_{m}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}italic_φ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) )
={1 if Pe,m(𝒞(Mn,n))<21s𝔼[Pe,m(𝖢(Mn,n))s]1s0 otherwiseabsentcases1 if subscript𝑃e𝑚𝒞subscript𝑀𝑛𝑛superscript21𝑠𝔼superscriptdelimited-[]subscript𝑃e𝑚superscript𝖢subscript𝑀𝑛𝑛𝑠1𝑠otherwise0 otherwiseotherwise\displaystyle\ =\begin{cases}1\ \text{ if }P_{{\rm e},m}\bigl{(}\mathcal{C}(M_% {n},n)\bigr{)}<2^{\frac{1}{s}}\mathbb{E}\bigl{[}P_{{\rm e},m}\big{(}\mathsf{C}% (M_{n},n)\big{)}^{s}\bigr{]}^{\frac{1}{s}}\\ 0\ \text{ otherwise}\end{cases}= { start_ROW start_CELL 1 if italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) < 2 start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 otherwise end_CELL start_CELL end_CELL end_ROW (2)

and showed that, using (1) and (I), the following inequality holds

𝔼[m=1Mnφm(𝖢(Mn,n))]Mn.𝔼delimited-[]superscriptsubscript𝑚1superscriptsubscript𝑀𝑛subscript𝜑𝑚𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛\displaystyle\mathbb{E}\left[\sum_{m=1}^{M_{n}^{\prime}}\varphi_{m}(\mathsf{C}% (M_{n}^{\prime},n))\right]\geq M_{n}.blackboard_E [ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_φ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ] ≥ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT . (3)

From (3) it follows that, since the average number of codewords that have a probability of error smaller than 21s𝔼[Pe,m(𝒞(Mn,n))s]1ssuperscript21𝑠𝔼superscriptdelimited-[]subscript𝑃e𝑚superscript𝒞superscriptsubscript𝑀𝑛𝑛𝑠1𝑠2^{\frac{1}{s}}\mathbb{E}\bigl{[}P_{{\rm e},m}\bigl{(}\mathcal{C}(M_{n}^{% \prime},n)\bigr{)}^{s}\bigr{]}^{\frac{1}{s}}2 start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT in a randomly generated code with Mn=2Mn1superscriptsubscript𝑀𝑛2subscript𝑀𝑛1M_{n}^{\prime}=2M_{n}-1italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 2 italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 codewords is at least Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, there must exist a code having at least Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords, out of the Mnsuperscriptsubscript𝑀𝑛M_{n}^{\prime}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, fulfilling this property. Thus, by removing (expurgating) the worst half of the codewords from the code with Mnsuperscriptsubscript𝑀𝑛M_{n}^{\prime}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT codewords we obtain a new code with Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords, each of which satisfies the condition in the first line of the right-hand side in (I). Finally, restricting s𝑠sitalic_s to 0<s10𝑠10<s\leq 10 < italic_s ≤ 1, Gallager derives a lower bound on the exponent of 21s𝔼[Pe,m(𝒞(Mn,n))s]1ssuperscript21𝑠𝔼superscriptdelimited-[]subscript𝑃e𝑚superscript𝒞superscriptsubscript𝑀𝑛𝑛𝑠1𝑠2^{\frac{1}{s}}\mathbb{E}[P_{{\rm e},m}\bigl{(}\mathcal{C}(M_{n}^{\prime},n)% \bigr{)}^{s}]^{\frac{1}{s}}2 start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT, given by

Eexn(R,Qn)=Exn(ρ^n,Qn)ρ^nR,superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛superscriptsubscript𝐸x𝑛subscript^𝜌𝑛superscript𝑄𝑛subscript^𝜌𝑛𝑅\displaystyle E_{\rm ex}^{n}(R,Q^{n})=E_{\rm x}^{n}(\hat{\rho}_{n},Q^{n})-\hat% {\rho}_{n}R,italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) = italic_E start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_R , (4)

where

Exn(ρ,Qn)superscriptsubscript𝐸x𝑛𝜌superscript𝑄𝑛\displaystyle E_{\rm x}^{n}(\rho,Q^{n})italic_E start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_ρ , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) =1nlog(𝒙𝒙Qn(𝒙)Qn(𝒙)Zn(𝒙,𝒙)1ρ)ρ,\displaystyle=-\frac{1}{n}\log\biggl{(}\sum_{\boldsymbol{x}}\sum_{\boldsymbol{% x}^{\prime}}Q^{n}(\boldsymbol{x})Q^{n}(\boldsymbol{x}^{\prime})Z_{n}(% \boldsymbol{x},\boldsymbol{x}^{\prime})^{\frac{1}{\rho}}\biggr{)}^{\rho},= - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log ( ∑ start_POSTSUBSCRIPT bold_italic_x end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_x ) italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_Z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( bold_italic_x , bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_ρ end_POSTSUPERSCRIPT , (5)

Zn(𝒙,𝒙)=𝒚Wn(𝒚|𝒙)Wn(𝒚|𝒙)subscript𝑍𝑛𝒙superscript𝒙subscript𝒚superscript𝑊𝑛conditional𝒚𝒙superscript𝑊𝑛conditional𝒚superscript𝒙Z_{n}(\boldsymbol{x},\boldsymbol{x}^{\prime})=\sum_{\boldsymbol{y}}\sqrt{W^{n}% \mkern-1.5mu(\boldsymbol{y}|\boldsymbol{x})W^{n}\mkern-1.5mu(\boldsymbol{y}|% \boldsymbol{x}^{\prime})}italic_Z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( bold_italic_x , bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT bold_italic_y end_POSTSUBSCRIPT square-root start_ARG italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_y | bold_italic_x ) italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_y | bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG is the Bhattacharyya coefficient between codewords 𝒙,𝒙𝒳n𝒙superscript𝒙superscript𝒳𝑛\boldsymbol{x},\boldsymbol{x}^{\prime}\in\mathcal{X}^{n}bold_italic_x , bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT while

ρ^n=argmaxρ1{Exn(ρ,Qn)ρR}subscript^𝜌𝑛subscriptargmax𝜌1superscriptsubscript𝐸x𝑛𝜌superscript𝑄𝑛𝜌𝑅\displaystyle\hat{\rho}_{n}=\operatorname*{arg\,max}_{\rho\geq 1}\bigl{\{}E_{% \rm x}^{n}(\rho,Q^{n})-\rho R\bigr{\}}over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_ρ ≥ 1 end_POSTSUBSCRIPT { italic_E start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_ρ , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_ρ italic_R } (6)

is the parameter that yields the highest exponent. The preceding argument is valid for the maximal probability of error, since every codeword in the expurgated code attains the same exponent. In addition, observe that since (3) uses the standard ensemble-average argument (i.e. by taking the average over the ensemble) we show the existence of a code with the desired property. The exponent in (4) is the expurgated exponent. We refer to the code with Mnsuperscriptsubscript𝑀𝑛M_{n}^{\prime}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT codewords before expurgation as a mother code. We say that a mother code is good if, once expurgated, we obtain a code with asymptotically the same rate, the codewords of which each have an exponent at least as large as the expurgated.

A refinement of the above follows from (1). Specifically, for ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0 it can be shown that there exists a code with Mn=Mn(1+ϵ)superscriptsubscript𝑀𝑛subscript𝑀𝑛1italic-ϵM_{n}^{\prime}=M_{n}(1+\epsilon)italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) codewords such that removing ϵMnitalic-ϵsubscript𝑀𝑛\epsilon M_{n}italic_ϵ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords yields a code that attains the expurgated exponent [6, Lemma 1]. Although [6, Lemma 1] generalizes Gallager’s method, it still only shows the existence of a code that attains the expurgated exponent.

II Main Result

This paper strengthens existing results on expurgation by showing that the probability of finding a code with Mn=(1+ϵ)Mnsuperscriptsubscript𝑀𝑛1italic-ϵsubscript𝑀𝑛M_{n}^{\prime}=(1+\epsilon)M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( 1 + italic_ϵ ) italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords that contains a code with at least Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords each of which achieving the expurgated exponent tends to 1111 with the code length. We define the sequence δn=ρ^nnlogγnsubscript𝛿𝑛subscript^𝜌𝑛𝑛subscript𝛾𝑛\delta_{n}=\frac{\hat{\rho}_{n}}{n}\log\gamma_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, where γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is such that limnγn=subscript𝑛subscript𝛾𝑛\lim_{n\rightarrow\infty}\gamma_{n}=\inftyroman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∞ while limnlogγnn=0subscript𝑛subscript𝛾𝑛𝑛0\lim_{n\rightarrow\infty}\frac{\log\gamma_{n}}{n}=0roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG = 0, ρ^nsubscript^𝜌𝑛\hat{\rho}_{n}over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT being a positive sequence defined in (6) that depends on the channel, the ensemble and the rate. From the definition of δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT it can be seen that if ρ^nsubscript^𝜌𝑛\hat{\rho}_{n}over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT either converges to a constant or grows sufficiently slowly, there exists a γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT such that δn0subscript𝛿𝑛0\delta_{n}\rightarrow 0italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → 0. Similarly to Gallager, for a given δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we define the indicator function

ϕm(𝒞(Mn,n))={1 if Em(𝒞(Mn,n))>Eexn(R,Qn)δn0 otherwise,subscriptitalic-ϕ𝑚𝒞subscript𝑀𝑛𝑛cases1 if subscript𝐸𝑚𝒞subscript𝑀𝑛𝑛superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛otherwise0 otherwiseotherwise\displaystyle\phi_{m}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}=\begin{cases}1\ % \text{ if }E_{m}\bigl{(}\mathcal{C}(M_{n},n)\bigr{)}>E_{\rm ex}^{n}(R,Q^{n})-% \delta_{n}\\ 0\ \text{ otherwise},\end{cases}italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) = { start_ROW start_CELL 1 if italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) > italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 otherwise , end_CELL start_CELL end_CELL end_ROW (7)

and the number of codewords attaining an exponent higher than Eexn(R,Qn)δnsuperscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛E_{\rm ex}^{n}(R,Q^{n})-\delta_{n}italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT as

Φ(𝒞(Mn,n))=m=1Mnϕm(𝒞(Mn,n)).Φ𝒞superscriptsubscript𝑀𝑛𝑛superscriptsubscript𝑚1superscriptsubscript𝑀𝑛subscriptitalic-ϕ𝑚𝒞superscriptsubscript𝑀𝑛𝑛\displaystyle\Phi\bigl{(}\mathcal{C}(M_{n}^{\prime},n)\bigr{)}~{}{=}\sum_{m=1}% ^{M_{n}^{\prime}}\phi_{m}\bigl{(}\mathcal{C}(M_{n}^{\prime},n)\bigr{)}.roman_Φ ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) . (8)
Theorem 1

Consider a pairwise-independent code ensemble with Mn=Mn(1+ϵ)superscriptsubscript𝑀𝑛subscript𝑀𝑛1italic-ϵM_{n}^{\prime}=M_{n}(1+\epsilon)italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) codewords and any ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0. If the sequence {δn}n=1superscriptsubscriptsubscript𝛿𝑛𝑛1\{\delta_{n}\}_{n=1}^{\infty}{ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, which depends on the channel and the ensemble, satisfies limnδn=0subscript𝑛subscript𝛿𝑛0\lim_{n\rightarrow\infty}\delta_{n}=0roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0, then for any 0<ϵ1<ϵ0subscriptitalic-ϵ1italic-ϵ0<\epsilon_{1}<\epsilon0 < italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_ϵ, it holds that

limn[Φ(𝖢(Mn,n))Mn(1+ϵ1)]=1.subscript𝑛delimited-[]Φ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1subscriptitalic-ϵ11\displaystyle\lim_{n\rightarrow\infty}\mathbb{P}\bigl{[}\Phi\bigl{(}\mathsf{C}% (M_{n}^{\prime},n)\bigr{)}\geq M_{n}(1+\epsilon_{1})\bigr{]}=1.roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT blackboard_P [ roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ≥ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ] = 1 . (9)
Proof:

See Section III. ∎

In words, with high probability we find a mother code with Mn=(1+ϵ)Mnsuperscriptsubscript𝑀𝑛1italic-ϵsubscript𝑀𝑛M_{n}^{\prime}=(1+\epsilon)M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( 1 + italic_ϵ ) italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT codewords, Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of which attain the expurgated exponent. That is, good mother codes are found easily and only contain an arbitrarily small fraction ϵ/(1+ϵ)italic-ϵ1italic-ϵ\epsilon/(1+\epsilon)italic_ϵ / ( 1 + italic_ϵ ) of codewords that need to be expurgated. Theorem 1 extends Gallager’s method, and applies, among others, to i.i.d. (i.i.d.) and constant composition codes over DMCs, as well as channels with memory such as the finite-state channel in [2, Sec. 4.6], for which the expurgated exponent is derived in [7].

As a final remark, recent works [8, 9, 7, 10] show that for many ensembles, most low-rate codes have an error exponent E(𝖢(Mn,n))𝐸𝖢subscript𝑀𝑛𝑛E\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}italic_E ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) that is strictly larger than the exponent of the ensemble average error probability, i.e., the random coding exponent. Similarly, Theorem 1 implies that for most codes, almost any codeword has an associated error exponent Em(𝖢(Mn,n))subscript𝐸𝑚𝖢subscript𝑀𝑛𝑛E_{m}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) that is strictly larger than the ensemble average of the exponent of the error probability of the codebook 𝔼[E(𝖢(Mn,n))]𝔼delimited-[]𝐸𝖢subscript𝑀𝑛𝑛\mathbb{E}\big{[}E\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}\big{]}blackboard_E [ italic_E ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) ]. In both cases the smaller error exponent of the average probability of error is due to a relatively small number of elements (codes in the first case, codewords in the second) that perform poorly. Furthermore, as shown in [9, 10] for i.i.d. and constant composition codes over DMC, the error exponents of the codes in the ensemble concentrate around the TRC (TRC) exponent [11, 8]. Similarly to such works, it can be shown that the error exponent Em(𝖢(Mn,n))subscript𝐸𝑚𝖢subscript𝑀𝑛𝑛E_{m}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ), for any m𝑚mitalic_m, concentrates around its mean, the expurgated exponent. The proof makes use of Lemma 1 in Section III, and follows almost identical steps as in [10, Theorem 1], [7, Theorem 1] and [7, Theorem 2] once Pe(𝒞)subscript𝑃e𝒞P_{\rm e}(\mathcal{C})italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( caligraphic_C ) is replaced by Pe,m(𝒞)subscript𝑃e𝑚𝒞P_{{\rm e},m}\bigl{(}\mathcal{C}\bigr{)}italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( caligraphic_C ) and it is omitted here.

III Proof of Theorem 1

We start with the following lemma, whose proof is almost identical to that of [7, Lemma 1].

Lemma 1

For a channel Wnsuperscript𝑊𝑛W^{n}italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and a pairwise-independent Mnsuperscriptsubscript𝑀𝑛M_{n}^{\prime}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT-codewords code ensemble with codeword distribution Qnsuperscript𝑄𝑛Q^{n}italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, for any m{1,,Mn}𝑚1superscriptsubscript𝑀𝑛{m}\in\{1,\ldots,M_{n}^{\prime}\}italic_m ∈ { 1 , … , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } it holds that

[Em(𝖢(Mn,n))>Eexn(R,Qn)δn]11γn,delimited-[]subscript𝐸𝑚𝖢superscriptsubscript𝑀𝑛𝑛superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛11subscript𝛾𝑛\mathbb{P}\bigl{[}E_{m}\bigl{(}\mathsf{C}(M_{n}^{\prime},n)\bigr{)}>E_{\rm ex}% ^{n}(R,Q^{n})-\delta_{n}\bigr{]}\geq 1-\frac{1}{\gamma_{n}},blackboard_P [ italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) > italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] ≥ 1 - divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG , (10)

where γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT are positive real-valued sequences.

The proof of Lemma 1 follows from Markov’s inequality

[Pe,m(𝖢n)γn1s𝔼[Pe,m(𝖢n)s]1s]1γndelimited-[]subscript𝑃e𝑚subscript𝖢𝑛superscriptsubscript𝛾𝑛1𝑠𝔼superscriptdelimited-[]subscript𝑃e𝑚superscriptsubscript𝖢𝑛𝑠1𝑠1subscript𝛾𝑛\displaystyle\mathbb{P}\Bigl{[}P_{{\rm e},m}(\mathsf{C}_{n})\geq\gamma_{n}^{% \frac{1}{s}}\mathbb{E}[P_{{\rm e},m}(\mathsf{C}_{n})^{s}]^{\frac{1}{s}}\Bigr{]% }\leq\frac{1}{\gamma_{n}}blackboard_P [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT blackboard_E [ italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_s end_ARG end_POSTSUPERSCRIPT ] ≤ divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG (11)

and applying the same steps as in [7, Theorem 1] once Pe(𝖢n)subscript𝑃esubscript𝖢𝑛P_{\rm e}(\mathsf{C}_{n})italic_P start_POSTSUBSCRIPT roman_e end_POSTSUBSCRIPT ( sansserif_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) is replaced with Pe,m(𝖢n)subscript𝑃e𝑚subscript𝖢𝑛P_{{\rm e},m}(\mathsf{C}_{n})italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). The sequences γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT are the same as those introduced in Section II. Observe that using inequality (11) and following similar steps as in [7] it can be shown that limnEexn(R,Qn)subscript𝑛superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛\lim_{n\rightarrow\infty}E_{\rm ex}^{n}(R,Q^{n})roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) is a lower bound on limn𝔼[Em(𝖢(Mn,n))]subscript𝑛𝔼delimited-[]subscript𝐸𝑚𝖢subscript𝑀𝑛𝑛\lim_{n\rightarrow\infty}\mathbb{E}[E_{m}\bigl{(}\mathsf{C}(M_{n},n)\bigr{)}]roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT blackboard_E [ italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) ]. Furthermore, using similar arguments as in [10] it can be shown that such bound is tight at least for i.i.d. and constant composition codes over DMC. That is, for such ensembles and channels limn𝔼[1nlogPe,m(𝖢(Mn,n))]=limnEexn(R,Qn)subscript𝑛𝔼delimited-[]1𝑛subscript𝑃e𝑚𝖢subscript𝑀𝑛𝑛subscript𝑛superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛\lim_{n\rightarrow\infty}\mathbb{E}\big{[}-\frac{1}{n}\log P_{{\rm e},m}\big{(% }\mathsf{C}(M_{n},n)\big{)}\big{]}=\lim_{n\rightarrow\infty}E_{\rm ex}^{n}(R,Q% ^{n})roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT blackboard_E [ - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_P start_POSTSUBSCRIPT roman_e , italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ) ) ] = roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ), i.e., the expurgated is the typical codeword exponent.

If the positive sequence ρ^nsubscript^𝜌𝑛\hat{\rho}_{n}over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, defined in (6), converges or grows sufficiently slowly, then there exists a sequence γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT such that limnγn=subscript𝑛subscript𝛾𝑛\lim_{n\rightarrow\infty}\gamma_{n}=\inftyroman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∞, limnlogγnn=0subscript𝑛subscript𝛾𝑛𝑛0\lim_{n\rightarrow\infty}\frac{\log\gamma_{n}}{n}=0roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG = 0, for which δn=ρ^nnlogγn0subscript𝛿𝑛subscript^𝜌𝑛𝑛subscript𝛾𝑛0\delta_{n}=\frac{\hat{\rho}_{n}}{n}\log\gamma_{n}\rightarrow 0italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → 0. For rate zero, that is when limn1nlogMn=0subscript𝑛1𝑛subscript𝑀𝑛0\lim_{n\to\infty}\frac{1}{n}\log{M_{n}}=0roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG roman_log italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0, the n𝑛nitalic_n-length error exponent in (4) depends on the particular subexponential growth of Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, while ρ^nsubscript^𝜌𝑛\hat{\rho}_{n}over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT tends to infinity with a growth that depends on the channel and the ensemble. In this case, as discussed in the paragraph succeeding [7, Eq. (89)], the assumption that ρ^nnlogγn0subscript^𝜌𝑛𝑛subscript𝛾𝑛0\frac{\hat{\rho}_{n}}{n}\log\gamma_{n}\rightarrow 0divide start_ARG over^ start_ARG italic_ρ end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → 0 holds if the normalized variance of the Bhattacharyya coefficient Zn(𝒙,𝒙)subscript𝑍𝑛𝒙superscript𝒙Z_{n}(\boldsymbol{x},\boldsymbol{x}^{\prime})italic_Z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( bold_italic_x , bold_italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) grows slower than nlogγn𝑛subscript𝛾𝑛\smash{\sqrt{\frac{n}{\log\gamma_{n}}}}square-root start_ARG divide start_ARG italic_n end_ARG start_ARG roman_log italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG. In any case, choosing such γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and applying Lemma 1 we have that

[Em(𝖢(Mn,n))>Eexn(R,Qn)δn]11γn.delimited-[]subscript𝐸𝑚𝖢superscriptsubscript𝑀𝑛𝑛superscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛11subscript𝛾𝑛\mathbb{P}\bigl{[}E_{m}\bigl{(}\mathsf{C}(M_{n}^{\prime},n)\bigr{)}>E_{\rm ex}% ^{n}(R,Q^{n})-\delta_{n}\bigr{]}\geq 1-\frac{1}{\gamma_{n}}.blackboard_P [ italic_E start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) > italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] ≥ 1 - divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG . (12)

The random variable Φ(𝖢(Mn,n))Φ𝖢superscriptsubscript𝑀𝑛𝑛\Phi(\mathsf{C}(M_{n}^{\prime},n))roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ), averaged across the ensemble, satisfies

𝔼[Φ(𝖢(Mn,n))]𝔼delimited-[]Φ𝖢superscriptsubscript𝑀𝑛𝑛\displaystyle\mathbb{E}[\Phi(\mathsf{C}(M_{n}^{\prime},n))]blackboard_E [ roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ] =m=1Mn(1+ϵ)𝔼[ϕm(𝖢(Mn,n))]absentsuperscriptsubscript𝑚1subscript𝑀𝑛1italic-ϵ𝔼delimited-[]subscriptitalic-ϕ𝑚𝖢superscriptsubscript𝑀𝑛𝑛\displaystyle=\sum_{m=1}^{M_{n}(1+\epsilon)}\mathbb{E}[\phi_{m}(\mathsf{C}(M_{% n}^{\prime},n))]= ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) end_POSTSUPERSCRIPT blackboard_E [ italic_ϕ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ] (13)
m=1Mn(1+ϵ)(11γn)absentsuperscriptsubscript𝑚1subscript𝑀𝑛1italic-ϵ11subscript𝛾𝑛\displaystyle\geq\sum_{m=1}^{M_{n}(1+\epsilon)}\left(1-\frac{1}{\gamma_{n}}\right)≥ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) end_POSTSUPERSCRIPT ( 1 - divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) (14)
=Mn(1+ϵ)(11γn),absentsubscript𝑀𝑛1italic-ϵ11subscript𝛾𝑛\displaystyle=M_{n}(1+\epsilon)\left(1-\frac{1}{\gamma_{n}}\right),= italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) ( 1 - divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) , (15)

where (14) follows from the definition of the indicator function (7) and (12).

We define Ψ(𝖢(Mn,n))=MnΦ(𝖢(Mn,n))Ψ𝖢superscriptsubscript𝑀𝑛𝑛superscriptsubscript𝑀𝑛Φ𝖢superscriptsubscript𝑀𝑛𝑛\Psi(\mathsf{C}(M_{n}^{\prime},n))=M_{n}^{\prime}-\Phi(\mathsf{C}(M_{n}^{% \prime},n))roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) = italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ), which is the number of codewords with exponent smaller than Eexn(R,Qn)δnsuperscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛E_{\rm ex}^{n}(R,Q^{n})-\delta_{n}italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. From (15) it follows that

𝔼[Ψ(𝖢(Mn,n))]Mn(1+ϵ)γn.𝔼delimited-[]Ψ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1italic-ϵsubscript𝛾𝑛\displaystyle\mathbb{E}[\Psi(\mathsf{C}(M_{n}^{\prime},n))]\leq\frac{M_{n}(1+% \epsilon)}{\gamma_{n}}.blackboard_E [ roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ] ≤ divide start_ARG italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) end_ARG start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG . (16)

Then, for sufficiently large n𝑛nitalic_n we have that

[Ψ(𝖢(Mn,n))>Mn(1+ϵ)γn]delimited-[]Ψ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1italic-ϵsubscript𝛾𝑛\displaystyle\mathbb{P}\Big{[}\Psi(\mathsf{C}(M_{n}^{\prime},n))>\frac{M_{n}(1% +\epsilon)}{\sqrt{\gamma_{n}}}\Big{]}blackboard_P [ roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) > divide start_ARG italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) end_ARG start_ARG square-root start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG ] 1γn,absent1subscript𝛾𝑛\displaystyle~{}{\leq}~{}\frac{1}{\sqrt{\gamma_{n}}},≤ divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG , (17)

where (17) follows from Markov’s inequality and (16). This shows that the probability of finding a code with many codewords with exponent strictly smaller than Eexn(R,Qn)δnsuperscriptsubscript𝐸ex𝑛𝑅superscript𝑄𝑛subscript𝛿𝑛E_{\rm ex}^{n}(R,Q^{n})-\delta_{n}italic_E start_POSTSUBSCRIPT roman_ex end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_R , italic_Q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) - italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT vanishes with n𝑛nitalic_n. To prove our main result, we write the tail probability in (9) as

[Φ(𝖢(Mn,n))Mn(1+ϵ1)]delimited-[]Φ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1subscriptitalic-ϵ1\displaystyle\mathbb{P}\Big{[}\Phi(\mathsf{C}(M_{n}^{\prime},n))\geq M_{n}(1+% \epsilon_{1})\Big{]}blackboard_P [ roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ≥ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ]
=1[Φ(𝖢(Mn,n))<Mn(1+ϵ1)]absent1delimited-[]Φ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1subscriptitalic-ϵ1\displaystyle\ =1-\mathbb{P}\Big{[}\Phi(\mathsf{C}(M_{n}^{\prime},n))<M_{n}(1+% \epsilon_{1})\Big{]}= 1 - blackboard_P [ roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) < italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ] (18)
=1[Ψ(𝖢(Mn,n))>Mn(ϵϵ1)],absent1delimited-[]Ψ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛italic-ϵsubscriptitalic-ϵ1\displaystyle\ =1-\mathbb{P}\Big{[}\Psi(\mathsf{C}(M_{n}^{\prime},n))>M_{n}(% \epsilon-\epsilon_{1})\Big{]},= 1 - blackboard_P [ roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) > italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ϵ - italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ] , (19)

where we used the definitions of Ψ(𝖢(Mn,n))Ψ𝖢superscriptsubscript𝑀𝑛𝑛\Psi(\mathsf{C}(M_{n}^{\prime},n))roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) and Mnsuperscriptsubscript𝑀𝑛M_{n}^{\prime}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Since γnsubscript𝛾𝑛\gamma_{n}italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT tends to infinity, there must exist an n0subscript𝑛0n_{0}\in\mathbb{N}italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_N such that ϵϵ1>(1+ϵ)γnitalic-ϵsubscriptitalic-ϵ11italic-ϵsubscript𝛾𝑛\epsilon-\epsilon_{1}>\frac{(1+\epsilon)}{\sqrt{\gamma_{n}}}italic_ϵ - italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > divide start_ARG ( 1 + italic_ϵ ) end_ARG start_ARG square-root start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG for n>n0𝑛subscript𝑛0n>n_{0}italic_n > italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and therefore

limn[Φ(𝖢(Mn,n))Mn(1+ϵ1)]subscript𝑛delimited-[]Φ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1subscriptitalic-ϵ1\displaystyle\lim_{n\to\infty}\mathbb{P}\Big{[}\Phi(\mathsf{C}(M_{n}^{\prime},% n))\geq M_{n}(1+\epsilon_{1})\Big{]}roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT blackboard_P [ roman_Φ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) ≥ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ]
limn1[Ψ(𝖢(Mn,n))>Mn(1+ϵ)γn]absentsubscript𝑛1delimited-[]Ψ𝖢superscriptsubscript𝑀𝑛𝑛subscript𝑀𝑛1italic-ϵsubscript𝛾𝑛\displaystyle\ \geq\lim_{n\to\infty}1-\mathbb{P}\Big{[}\Psi(\mathsf{C}(M_{n}^{% \prime},n))>\frac{M_{n}(1+\epsilon)}{\sqrt{\gamma_{n}}}\Big{]}≥ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT 1 - blackboard_P [ roman_Ψ ( sansserif_C ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_n ) ) > divide start_ARG italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 + italic_ϵ ) end_ARG start_ARG square-root start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG ] (20)
limn11γn,absentsubscript𝑛11subscript𝛾𝑛\displaystyle\ \geq\lim_{n\to\infty}1-\frac{1}{\sqrt{\gamma_{n}}},≥ roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT 1 - divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG , (21)

where (21) follows from (17). Finally, solving the limit yields the desired result.

References

  • [1] A. Feinstein, “Error bounds in noisy channels without memory,” IRE Trans. on Information Theory, vol. 1, no. 2, pp. 13–14, 1955.
  • [2] R. Gallager, Information Theory and Reliable Communication.   USA: John Wiley & Sons, Inc., 1968.
  • [3] C. Shannon, R. Gallager, and E. Berlekamp, “Lower bounds to error probability for coding on discrete memoryless channels. I,” Information and Control, vol. 10, no. 1, pp. 65–103, 1967. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0019995867900526
  • [4] R. Fano, Transmission of Information: A Statistical Theory of Communication.   Massachusetts Institute of Technology Press, 1961.
  • [5] R. Gallager, “A simple derivation of the coding theorem and some applications,” IEEE Trans. Inf. Theory, vol. 11, no. 1, pp. 3–18, 1965.
  • [6] J. Scarlett, L. Peng, N. Merhav, A. Martinez, and A. Guillén i Fàbregas, “Expurgated random-coding ensembles: Exponents, refinements, and connections,” IEEE Trans. Inf. Theory, vol. 60, no. 8, pp. 4449–4462, 2014.
  • [7] G. Cocco, A. Guillén i Fàbregas, and J. Font-Segura, “Typical error exponents: A dual domain derivation,” IEEE Trans. Inf. Theory, vol. 69, no. 2, pp. 776–793, Feb. 2023.
  • [8] N. Merhav, “Error exponents of typical random codes,” IEEE Trans. Inf. Theory, vol. 64, no. 9, pp. 6223–6235, Sep. 2018.
  • [9] R. Tamir, N. Merhav, N. Weinberger, and A. Guillén i Fàbregas, “Large deviations behavior of the logarithmic error probability of random codes,” IEEE Trans. Inf. Theory, vol. 66, no. 11, pp. 6635–6659, 2020.
  • [10] L. V. Truong, G. Cocco, J. Font-Segura, and A. Guillén i Fàbregas, “Concentration properties of random codes,” IEEE Trans. Inf. Theory, vol. 69, no. 12, pp. 7499–7537, Dec. 2023.
  • [11] A. Barg and G. Forney, “Random codes: Minimum distances and error exponents,” IEEE Trans. Inf. Theory, vol. 48, no. 9, pp. 2568–2573, Sep. 2002.