The H. 264-AVC Video Coding Standard For The Next Generation Multimedia Communication
The H. 264-AVC Video Coding Standard For The Next Generation Multimedia Communication
The H. 264-AVC Video Coding Standard For The Next Generation Multimedia Communication
the Video Coding Layer (VCL) from the Network Abstraction Input Res. data Entropy
+ DCT Quant
Layer (NAL). VCL provides the core compressed video Video - & Headers Coding
(P)
contents, while NAL supports delivery over various types of Output
Mode Predicted blocks (P)
network. This network friendliness feature of the standard Inverse Bitstream
Select;
Quant
facilitates easier packetization and better information priority Intra Pred Reconstructed
control. In addition, to adapt H.264 to applications involving blocks
bit errors and packet losses, a number of error resilience Frame Loop +
+
IDCT
techniques are provided in the standard. Buffer Filter
that make the standard more efficient than the previous ones
are particularly emphasized. Error resilient features of the 1) Intra Prediction Modes: There are nine advanced
codec are discussed in Section III as they are the most prediction modes for luminance (luma) samples when the MB
demanding features of the codec for error prone channels. is partitioned into 4x4 blocks (i.e. intra-4x4 modes).
Section IV provides a description on the non-normative R-D Additionally, four other modes are used for predicting the
optimization method utilized in the JVT test model software whole 16x16 intra MB (intra-16x16). The chrominance
which is a key feature of the success of the method. Selected (chroma) components can also be predicted in 4 different
simulation results verifying the R-D efficiency of the modes (i.e. intra-chroma modes). Various intra-4x4 modes are
algorithm and error resilient techniques are demonstrated in demonstrated in Fig. 3, where the prediction values for pixels
Section V. are calculated from the neighboring boundary pixel values.
Each mode is suitable to predict directional structures in the
picture at different angles (e.g. horizontal, vertical, diagonal,
II. AN OVERVIEW OF THE H.264/AVC VIDEO CODING etc.). Intra-16x16 and intra-chroma modes consist of
METHOD horizontal, vertical, DC and plane modes. In these modes
similar to the intra-4x4 modes, an arrangement of neighboring
Standard video coding methods including H.264 are based
boundary pixels produce prediction pixels. For more details of
on three fundamental redundancy reduction principles, namely
each intra prediction mode, the H.264 recommendation should
spatial and temporal redundancy reduction and entropy coding
be consulted [7].
[16]. To carry out spatial redundancy reduction in intraframes
(Fig. 2), transform coding is applied using Discrete Cosine Mode 0: Vertical Mode 1: Horizontal Mode 2: DC
produce the predicted blocks. A block diagram of interframe inter pictures, the MVs are in quarter sample precision. To
coding in the H.264 coder is depicted in Fig. 6. Similar to generate the values of half-pixel positions a 6-tap Finite
intraframe coding, the residual data of original and predicted Impulse Response (FIR) filter is applied to integer position
blocks, are transform coded. The inter prediction method in samples. The quarter-pixel samples are then generated using
H.264 has some interesting features which are explained in the simple interpolation between neighboring (integer or half-
following parts. pixel position) samples [21].
4) B-pictures consideration: B-pictures are encoded using
Input Res. data, Entropy both past and future pictures as references in contrast to P-
+ DCT Quant
Video - Headers, Coding pictures which are only predicted form the past references. In
(P) & MVs
(P) Output other words, each block of a B-picture can be predicted from
Mode Bitstream
Select;
Inverse either of two reference blocks or a linear combination of them.
Quant
MC/Pred (R) In H.264 these references could be both in the past or one in
MVs the past and one in the future. New concept of direct
ME
Frame Loop +
+
IDCT
prediction mode is specified in H.264, where no data (such as
Buffer Filter MVs and reference indices) is present in the bitstream for the
prediction process and they are derived from the available
Fig. 6. A simplified block diagram of interframe coding in the H.264 encoder data of co-located MBs of the subsequent pictures. To support
B-picture coding as well as P-picture, proper mode
1) Inter Prediction Modes: Interframe predictive coding is
referencing tables and entropy coding methods are specified
where H.264 makes most of its gain in compression
[7][22].
efficiency. Motion compensation on each 16x16 MB can be
performed with various block sizes and shapes illustrated in
Fig. 7. The partitioning choice of a MB into 16x16, 8x16,
III. H.264 NAL LAYER AND ERROR RESILIENCE FEATURES
16x8 or 8x8 blocks is determined by mb-type. In 8x8 mode
(i.e. mb-type 3) each of the blocks can be further divided In a communication channel the quality of service is
independently into 8x8, 8x4, 4x8 or 4x4 sub-partitions affected by the two parameters of bandwidth and the
determined by sub-mb-type. Note that each of these blocks probability of error. Therefore, as well as video compression
contains its own MV and hence more precise motion efficiency which is provided through the VCL layer, the
compensation can be performed when the MB is divided into adaptation to communication channels should be carefully
smaller blocks. considered. The concept of NAL layer and the error resilience
features in H.264 are to provide an appropriate VCL
mb-type: 0 mb-type: 1 mb-type: 2 mb-type: 3
16 16 8 8 representation for conveyance on a variety of channels to cope
ref-idx0, MV0
8
ref-idx0, MV0 ref-idx0,
MV0
ref-idx1,
MV1
sub-type0, sub-type1,
8 ref-idx0 ref-idx1 with their erroneous situations.
16 16
ref-idx1, MV1 sub-type2, sub-type3,
ref-idx2 ref-idx3
A. NAL
The Network Abstraction Layer facilitates the delivery of
the H.264 VCL data to the underlying transport layers such as
sub-mb-type: 0 sub-mb-type: 1 sub-mb-type: 2 sub-mb-type: 3
8 8 4 4 RTP/IP, H.32X and MPEG-2 systems [23]. Each NAL unit
MV0
4
MV0 MV0 MV1
4
MV0 MV1
could be considered as a packet that contains an integer
8
MV1
8
MV2 MV3
number of bytes including a header and a payload (see Fig. 8).
The header specifies the NAL unit type and the payload
contains the related data.
Fig. 7. Top: various 16x16 MB partitioning modes for MC. Bottom: sub
partitioning modes of 8x8 blocks when mb-type 3. TABLE II
NAL UNIT TYPES
2) Multiple Reference Prediction: The H.264 standard NAL
Class Content of NAL unit
offers the option of using many previous pictures for unit type
prediction. Every MB partition (but not sub-partition) shown 0 - Unspecified
1 VCL Coded slice
on the top part of Fig. 7, could have a different reference 2 VCL Coded slice data partition A
picture that is more appropriate for that particular block. This 3 VCL Coded slice data partition B
will increase the coding efficiency and produce better 4 VCL Coded slice data partition C
5 VCL Coded slice of an IDR picture
subjective quality. Moreover, using this mode improves the 6-12 Non-VCL Supplemental Information,
robustness of the bitstream to channel errors [12]. Parameter Sets, etc.
3) Quarter Sample MVs: The objects movements in the 12-23 - Reserved
24-31 - Unspecified
consecutive pictures of a sequence particularly for reduced
size pictures such as QCIF, are not necessarily in integer pixel
units. Therefore, to improve the motion modeling scheme of Table II gives a summarized list of different NAL unit
types. NAL units 1 to 5 contain different VCL data that will
IAEEE Journal 5
be described later. NAL units 6 to 12 are non-VCL units unequal error protection to various important parts of video
containing additional information such as parameter sets and data becomes straightforward. For example a high error
supplemental information. Parameter sets are header data that protection can be applied to NAL units of type 2, 3 and 5
are unchanged in a number of NAL units, and then are sent to (headers, intra residuals and IDR pictures respectively) and a
prevent repeating them. Supplemental information is timing or lower protection for type 4 units (inter residual data), since
other addressing data that enhances the decoder usability but they are less important. It should be noted that there is no DP
are not essential in decoding the pictures. NAL units 12 to 23 for IDR pictures. However, since they have a different NAL
are reserved for future use of H.264 extensions and the types unit type (type 5), special error protection could be applied to
24 to 31 are available for use by different applications. them.
3) Flexible MB Ordering (FMO): In FMO mode, MBs are
B. Error Resilience Features
allowed to be assigned to any slice in a frame, so they are
H.264 provides features that make the generated bitstream flexible to be transmitted in a non-scanning order. Since each
robust to the bit errors and packet losses. Following is a brief slice is independently decodable, one can mange to spatially
description of each of these features: interleave MBs into different slices. Therefore, if one of the
1) Resynchronization (Slice) Headers: Each frame can be slices is missed, its missing MBs are surrounded by correctly
divided into several slices; each contains a flexible number of received MBs. Hence, by applying an appropriate error
MBs. In each slice the arithmetic coder is aligned and the concealment method, the visual artifact of the losses could
predictions are reset. Hence, every slice in the frame is significantly be reduced.
independently decodable. Therefore, they can be considered 4) Redundant Slices: H.264 has a new feature that allows
as resynchronization points that prevent propagation of a the encoder to send Redundant Slices (RSs) containing
probable error to entire frame. If the resynchronization information of the same primarily transmitted MBs. If the
markers used too often, they limit the damaged area more original slices are lost, the RSs represent an alternative data
tightly, so improving the error resilience of the codec. It is (but somewhat in lower quality) that can be used to recover
obvious that the slicing introduces some overhead and reduces the corrupted MBs.
the compression efficiency, but in the erroneous situations, 5) Intra Refresh and Multiple References: Features such as
due to the better error resilience, the received video quality Intra modes and Multiple Reference Selection (MRS),
could be much better. improve the error robustness of a bitstream as well as the
In H.264, each slice is placed in a separate NAL unit (see encoding efficiency. The intra MBs do not use temporal
table II). The slices of an IDR picture (i.e. a picture with all prediction, so they prevent error propagation through the
intra slices) are located in type 5 NAL units, while those bitstream. Similarly, using multiple references (especially in
belonged to a non-IDR picture are placed in NAL units of the presence of a back-channel) can prevent corrupted pictures
type 1 to 4 depending on the Data Partitioning (DP) mode to be used as references and hence stop the error propagation
which is explained in the following part. [24],[25].
2) Data Partitioning: DP is another important and efficient Intra MBs can be inserted very often in the bitstream and in
way to make a bitstream more robust. It is proposed due to the this case the coding efficiency is reduced, but the error
fact that the symbols appeared earlier in the bitstream suffer robustness is improved. This is the case for MRS mode which
less from the errors than those which come later. Therefore, the encoder can be purely optimized for coding efficiency or
by bringing the more important parts of the video data (such to generate more robust bitstreams.
as headers and MVs) ahead of the non-important data, the
channel error side effect can be significantly reduced. IV. RATE-DISTORTION OPTIMIZATION
In H.264, when DP is enabled, every slice is divided into
In H.264 similar to the other standards, only the bitstream
three separate partitions (Fig. 8) and every partition is located
syntax and the decoding procedure are specified, and the
in a particular NAL unit (see table II as well). Therefore, data
encoder is flexible to have different implementations. It
partitioning can be used as an efficient layering method that
should select the parameters such as MB modes, MVs, QP and
separates the data with different importance.
the residual data for each MB among several choices. This
TYPE 1
Slice
Slice Data: MB Headers, MVs, etc. and Intra and Inter Residual Data
huge amount of choices makes optimization of the encoder
Header
very crucial to achieve a good R-D efficiency. In other words,
a poor design of encoder can generate even lower quality
TYPE 2 Slice MB Headers, TYPE 3 Intra Residual TYPE 4 Inter Residual
(part A) Header MVs, etc. (part B) Data (part C) Data bitstreams than a simple traditional video coder. In the JVT
reference software a Lagrangian optimization technique is
NAL-Unit Header NAL-Unit Payload applied to achieve R-D efficiency [9].
each, and coded them in Intra mode and their averages are compared in CABAC and CAVLC modes. In these tests, the
calculated. Each test was done in three scenarios; firstly all CABAC entropy coding mode has outperformed the CAVLC
intra prediction modes were allowed, secondly they were method by up to 8%. Obviously the achieved saving is
limited to 16x16 intra modes only, and finally the encoder was dependant on the test sequence and the bit rate as well.
forced to select 4x4 DC mode only. In the latter scenario, we
have amended the bitstream structure in a way that no header 42
PSNR Y (dB)
44
36
42
40 34
38
32 CABAC
PSNR Y (dB)
36 CAVLC
34 30
25 75 125 175 225
32 Bit Rate (Kb/s)
Akio
30 News 45
Foreman
28
Mobile 43
26
0 50 100 150 200 250 300 41
Bit Rate (Kb/s)
PSNR Y (dB)
Fig. 10. R-D curves of the selected tests in default modes. 39
37
42 35
CABAC
40
33 CAVLC
38
31
5 15 25 35 45 55
PSNR Y
36
Bit Rate (Kb/s)
34 Fig. 12. CABAC and CAVLC modes, Top: Foreman, Bottom: Akio
44 35
42 33
40 31 Foreman, E
Foreman, D
38 Mobile, E
29
Mobile, D
PSNR Y
36
27
0 50 100 150 200
34
Bit Rate (Kb/s)
All Intra Modes
32 Fig. 13. Foreman and Mobile tests, deblocking filter enabled and disabled
Only 4x4 DC Mode
30 Only 16x16 Modes
28
25 45 65 85 105 125 145
Bit Rate (Kb/s)
Fig. 11. All Intra pictures, 3 Hz, Top: Foreman, Bottom: News
how applying the deblocking filter has improved the R-D coding efficiency. In fact this feature of H.264 is one of the
efficiency for the Foreman test, especially in lower bit rates. key reasons of it success. Note that in lower bit rates the
On Mobile test due to the complex texture and high bit rate, improvement caused by the sub-MB-modes is less than the
in-loop filter has not improved the efficiency. Fig. 14 higher rates. The reason is that when the bit budget is very
demonstrates the subjective quality of the Foreman picture limited, the R-D optimization process selects 16x16 modes
when the filter is enabled and disabled. It is clear that the mostly rather than smaller division modes. Fig. 16 shows the
deblocking filter has subjectively improved the picture quality overlay of MB sizes for the high and low bit rate of the second
by smoothing the blocking artifacts. frame of the Foreman test sequence.
5) Multiple Reference Prediction: Fig. 17 shows the R-D
43
(quality) curves of the Foreman and News tests when the
41
number of reference frames for inter prediction is varied from
1 to 9. We have removed some middle values to make the
39 graphs more readable. From the figure is can be observed that
PSNR Y (dB)
33
40
31
38
PSNR Y (dB)
29
All Inter Modes 36
27 No Sub Mode
Only 16x16 Mode 34
25
50 100 150 200 250 300 350 400 450 32 9 Ref
Bit Rate (Kb/s) 5 Ref
30 3 Ref
Fig. 15. Top: Akio test, Bottom: Mobile test. (i) all inter prediction modes are
1 Ref
allowed, (ii) only 16x16 to 8x8 modes are allowed and (iii) only 16x16 mode
28
is allowed. 10 60 110 160 210
Bit Rate (Kb/s)
43
41
39
37
PSNR Y (dB)
35
33
31 9 Ref
5 Ref
Fig. 16. MB inter partitioning modes, Foreman frame 2, high bit rate QP: 20 29
1 Ref
(left) and low bit rate QP: 40 (right)
27
0 20 40 60 80 100
4) Inter Prediction Modes: We have compared the Bit Rate (Kb/s)
performance of prediction modes under various scenarios of: Fig. 17. Top: Foreman, Bottom: News. Different number of prediction
(i) all inter modes are allowed (normal H.264 case), (ii) only reference frames.
16x16 to 8x8 modes are allowed and (iii) only 16x16 inter
mode is allowed. For the second and the third scenarios, we B. Evaluation of Error Resilience Features
have amended the bitstream semantics in a way that no sub- To be able to simulate the impact of channel errors, we
mb-mode is sent and for the 16x16 scenario, there is no mb- have modified the JVT test model decoder software to cope
mode either (since there is only one mode). Fig. 15 shows the with errors. To introduce the channel noise on a bitstream, a
results for Akio and Mobile. It can be seen that the use of all discrete two-stage Elliot-Gilbert model is used [37]. Based on
H.264 inter prediction modes significantly improves the the channel bit error rate and the mean burst length the model
IAEEE Journal 9
randomly alters the polarity of the bits (0 or 1). During the sliced bitstreams have significantly better qualities. It should
decoding procedure, when the first error in a slice is detected, be noted that when the number of slices is more than a
the decoder skips the bits to the next slice and marks un- specific amount (in this test 15), no further improvement is
decoded MBs as corrupted. achieved.
When a picture is completely decoded, the corrupted MBs
are concealed using their correctly received neighboring MBs. 3) Unequal Error Protection: Fig. 20 shows the quality of
In the concealment method [38], the MVs are recovered using the received video when the DPA part of the data has error
a boundary matching technique. Finally, the quality of the protection and supposed to be error free. This is possible by
decoded and concealed pictures is assessed based on their applying advanced channel coding techniques [39]. In this
PSNR. To make the statistics more reliable, every simulation particular test, DPA is 40% of the bit rate. Additionally, the
is run 30 times and the resulting distortions are averaged. applied channel coding technique introduces a 25% overhead.
1) Data Partitioning: In Fig. 18 the average Luma PSNR of Hence, to make the comparison fair, we have adjusted the
the foreman coded bitstream with different bit error rates is non-protected bitstream bit rate by 10% more than the
illustrated. Bit rates of the bitstreams were set to 100 kb/s, protected one. From the figure it is clear that by applying error
using our Lagrangian rate controller [28]. From the figure it protection to the DPA, the average output quality in high error
can be observed that enabling the DP which is simple with rates has been dramatically improved.
negligible overhead (and hence, small quality degradation) has
significantly improved the resilience of the codec to channel 37
errors.
34
39
31
Y PSNR (dB)
36
33 28
30 25
Y PSNR (dB)
DP: ON 19
21
0.01 0.001 0.0001 0.00001 0.000001
DP: OFF bit error rate
18
Fig. 20. PSNR vs. bit error rate, DP enabled, 9 slices per frame, with and
15 without DPA error protection. Foreman 100 and 110 kb/s respectively.
0.01 0.001 0.0001 0.00001 0.000001
bit error rate
Fig. 18. PSNR vs. bit error rate when DP is enabled and disabled. Foreman,
100 KBits/Sec. VI. CONCLUSION
The H.264/AVC video coding standard has achieved a
significant improvement compared to its predecessors. As
37
well as new features that improves the compression efficiency
34 such as advanced inter and intra prediction, H.264 supports a
number of error resilience techniques that facilitate the codec
31
to cope with different channel situations. These characteristics
Y PSNR (dB)
28 no slice
of AVC make it an ideal codec for applications with very
9 slice/frame
limited channel capacity and extremely error prone channels
25
such as mobile systems and video telephony. Due to its high
15 slice/frame
22
compression efficiency, the codec can be used for coding of
20 slice/frame
high quality video at lower rates. Therefore, this standard will
19
be a serious contender for a variety of next generation
0.01 0.001 0.0001 0.00001 0.000001
bit error rate multimedia applications. For instance, the DVD Forum
Fig. 19. PSNR vs. bit error rate, DP enabled, with and without slices. Steering Committee has recently selected H.264 decoding as a
Foreman, 100 KBits/Sec. mandatory capability for players of its upcoming new HD-
DVD format. This is the first selection of the new standard by
2) Slice Structure: Fig. 19 shows the quality of the decoded
a major industry consortium for a consumer end-user product
erroneous video bitstream of the same test sequence when the
technology with clear potential for extremely widespread use.
slice structure is enabled. Due to the insertion of more
resynchronization headers (i.e. more slices per frame) higher
ACKNOWLEDGMENT
overhead is used and hence, the video quality in the error free
situation is degraded. However, in higher bit error rates, the This work is supported by the Engineering and Physical
Sciences Research Council (EPSRC) of the UK.
IAEEE Journal 10
REFERENCES [26] Antonio Ortega and Kannan Ramchandran, “Rate-distortion methods for
image and video compression,” IEEE signal processing magazine, vol.
[1] Mohammed Ghanbari, Standard Codecs: Image Compression to 15, Nov. 1998, pp. 23-50.
Advanced Video Coding. London, UK: IEE, 2003, ch. 1. [27] Gary J. Sullivan and Thomas Wiegand, “Rate-distortion optimization for
[2] “Video Codec for Audio Visual Services at px64 kbits/s,” ITU-T video compression,” IEEE signal processing magazine, vol. 15, Nov.
Recommendation H.261, 1990 1998, pp. 74-90.
[3] “Coding of Moving Pictures and Associated Audio for Digital storage [28] M. Mahdi Ghandi and Mohammed Ghanbari, “A Lagrangian optimized
Media at up to About 1.5 Mbits/s,” ISO/IEC 1117-2: Video (MPEG-1), rate control algorithm for the H.264/AVC encoder,” IEEE International
November, 1991 Conference on Image Processing (ICIP), October 2004.
[4] “Generic Coding of Moving Pictures and Associated Audio [29] Siwei Ma, Wen Gao, Feng Wu and Yan Lu, “Rate control for JVT video
Information,” ISO/IEC 13818-2: Video (MPEG-2), May 1996 coding scheme with HRD consideration,” in Proc. Int. Conf. Image
[5] “Video Coding for Low Bit Rate Communication, Version 1,” ITU-T Processing, Spain, Sep. 2003, pp. 793-796.
Recommendation H.263, 1995 [30] Simone Milani, Luca Celetto and Gian Antonio Mian, “A rate control
[6] “Coding of Audio-Visual Objects, Part 2: Visual,” ISO/IEC 14496-2 algorithm for the H.264 encoder,” in Proc. 6th baiona Workshop Signal
(MPEG-4 visual version 1), 1999 Processing Comm., Spain, Sep. 2003.
[7] “Draft ITU-T Recommendation and Final Draft International Standard [31] Zhengguo Li, Feng Pan and Keng Pang, “Adaptive Basic Unit Layer
of Joint Video Specification (ITU-T Rec. H.264 | ISO/IEC 144496-10 Rate Control for JVT,” JVT of ISO/IEC MPEG & ITU-T VCEG,
AVC),” Joint Video Team of ISO/IEC and ITU-T, March 2003 Pattaya, Thailand, 7-14 March, 2003
[8] Thomas Wiegand, Gary J. Sullivan, Gisle Bjontegaard and Ajay Luthra, [32] Chuan-Yu Cho, Shiang-Yang Huang and Jia-Shung Wang, “An
“Overview of the H.264/AVC video coding standard,” IEEE Trans. embedded merging scheme for H.264/AVC motion estimation,” in Proc.
Circuits Syst. Video Technol., vol. 13, July 2003, pp. 560-576. Int. Conf. Image Processing, Spain, Sep. 2003, pp. 909-912.
[9] Thomas Wiegand, Heiko Schwarz, Anthony Joch, Faouzi Kossentini and [33] Peng Yin, Hye-Yeon Cheong Tourapis, Alexis Michael Tourapis and Jill
Gary J. Sullivan, “Rate-constrained coder control and comparison of Boyce, “Fast mode decision and motion estimation for JVT/H.264,” in
video coding standards,” IEEE Trans. Circuits Syst. Video Technol., vol. Proc. Int. Conf. Image Processing, Spain, Sep. 2003, pp. 853-856.
13, July 2003, pp. 688-703. [34] Feng Pan, Xiao Lin and Rahardja, “Fast mode decision for intra
[10] Daniel Alfonso, Daniele Bagni, Danilo Pau and Antonio Chimienti, “A prediction,” JVT of ISO/IEC MPEG & ITU-T VCEG, Pattaya, Thailand,
Performance analysis of H.264 video coding standard,” in Proc. 23rd 7-14 March, 2003.
Picture Coding Symposium, Saint-Malo, France, April 2003, pp. 23-28. [35] K. P. Lim, S. Wu, D. J. Wu, S. Rahardja, X. Lin, F. Pan and Z. G. Li,
[11] Anthony Joch, Faouzi Kossentini and Panos Nasiopoulos, “A “Fast inter mode selection,” JVT of ISO/IEC MPEG & ITU-T VCEG,
Performance analysis of the ITU-T draft H.26L video coding standard,” San Diego, USA, 2-5 Sep., 2003.
Proc. 12th Int. Packet Video Workshop, Pittsburgh, PA, April 2002 [36] JVT, “Test model software,” http://bs.hhi.de/~suehring/tml/download
[12] Mohammed Ghanbari, Standard Codecs: Image Compression to [37] Mohammed Ghanbari, Standard Codecs: Image Compression to
Advanced Video Coding. London, UK: IEE, 2003, ch. 9. Advanced Video Coding. London, UK: IEE, 2003, Appendix E.
[13] Henrique S. Malvar, Antti Hallapuro, Marta Karczewicz and Louis [38] Ye-Kui Wang, M.M. Hannuksela, V. Varsa, A. Hourunranta and M.
Kerofsky, “Low-complexity transform and quantization in H.264/AVC,” Gabbouj, “The Error Concealment Feature in the H.26L Test Model,” in
IEEE Trans. Circuits Syst. Video Technol., vol. 13, July 2003, pp. 598- Proc. Int. Conf. Image Processing, Sep. 2002, pp. 729-732.
603. [39] Peter Sweeney, Error Control Coding, Chichester, UK: John Wiley &
[14] Ville Lappalainen, Antti Hallapuro, and Timo D. Hamalainen, Sons LTD, 2002
“Complexity of optimized H.26L video decoder implementation,” IEEE
Trans. Circuits Syst. Video Technol., vol. 13, July 2003, pp. 717-725.
[15] Michael Horowitz, Anthony Joch, Faouzi Kossentini and Antti
Hallapuro, “H.264/AVC baseline profile decoder complexity analysis,”
IEEE Trans. Circuits Syst. Video Technol., vol. 13, July 2003, pp. 704-
716.
[16] Mohammed Ghanbari, Standard Codecs: Image Compression to
Advanced Video Coding. London, UK: IEE, 2003, ch. 3.
[17] Iain E.G. Richardson, H.264 and MPEG-4 Video Compression, Video
Coding for Next Generation Multimedia. Chichester, UK: John Wiley &
Sons Ltd, 2003, ch. 6.
[18] Mathias Wien, “Variable block-size transforms for H.264/AVC,” IEEE
Trans. Circuits Syst. Video Technol., vol. 13, July 2003, pp. 604-613.
[19] Detlev Marpe, Heiko Schwarz and Thomas Wiegand, “Context-based
adaptive binary arithmetic coding in the H.264/AVC video compression
standard,” IEEE Trans. Circuits Syst. Video Technol., vol. 13, July
2003, pp. 620-636.
[20] Peter List, Anthony Joch, Jani Lainema, Gisle Bjontegaard and Marta
Karczewicz, “Adaptive deblocking filter,” IEEE Trans. Circuits Syst.
Video Technol., vol. 13, July 2003, pp. 614-619.
[21] Thomas Wedi and Georg Musmann, “Motion- and aliasing-compensated
prediction for hybrid video coding,” IEEE Trans. Circuits Syst. Video
Technol., vol. 13, July 2003, pp. 577-586.
[22] Markus Flierl and Bernd Girod, “Generalized B pictures and the draft
H.264/AVC video-compression standard,” IEEE Trans. Circuits Syst.
Video Technol., vol. 13, July 2003, pp. 587-597.
[23] Stephan Wenger, “H.264/AVC over IP,” IEEE Trans. Circuits Syst.
Video Technol., vol. 13, July 2003, pp. 645-656.
[24] Thomas Stockhammer, Miska M. Hannuksela and Thomas Wiegand,
“H.264/AVC in wireless environment,” IEEE Trans. Circuits Syst.
Video Technol., vol. 13, July 2003, pp. 657-673.
[25] Bernd Girod and Niko Farber, “Feedback-based error control for mobile
video transmission,” Proceedings of the IEEE, vol. 87, No. 10, October
1999