IPR2021-00102, No. 1008-6 Exhibit - Zhang (P.T.A.B. Nov. 20, 2020)

PROCEEDINGS OF SPIE
`
`SPIEDigitalLibrary.org/conference-proceedings-of-spie
`
`(cid:40)(cid:85)(cid:85)(cid:82)(cid:85)(cid:3)(cid:85)(cid:72)(cid:86)(cid:76)(cid:79)(cid:76)(cid:72)(cid:81)(cid:87)(cid:3)(cid:89)(cid:76)(cid:71)(cid:72)(cid:82)(cid:3)(cid:70)(cid:82)(cid:71)(cid:76)(cid:81)(cid:74)(cid:3)(cid:88)(cid:86)(cid:76)(cid:81)(cid:74)
`(cid:89)(cid:76)(cid:85)(cid:87)(cid:88)(cid:68)(cid:79)(cid:3)(cid:85)(cid:72)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:3)(cid:83)(cid:76)(cid:70)(cid:87)(cid:88)(cid:85)(cid:72)
`
`(cid:61)(cid:75)(cid:68)(cid:81)(cid:74)(cid:15)(cid:3)(cid:42)(cid:88)(cid:68)(cid:81)(cid:77)(cid:88)(cid:81)(cid:15)(cid:3)(cid:54)(cid:87)(cid:72)(cid:89)(cid:72)(cid:81)(cid:86)(cid:82)(cid:81)(cid:15)(cid:3)(cid:53)(cid:82)(cid:69)(cid:72)(cid:85)(cid:87)
`
`(cid:42)(cid:88)(cid:68)(cid:81)(cid:77)(cid:88)(cid:81)(cid:3)(cid:61)(cid:75)(cid:68)(cid:81)(cid:74)(cid:15)(cid:3)(cid:53)(cid:82)(cid:69)(cid:72)(cid:85)(cid:87)(cid:3)(cid:47)(cid:17)(cid:3)(cid:54)(cid:87)(cid:72)(cid:89)(cid:72)(cid:81)(cid:86)(cid:82)(cid:81)(cid:15)(cid:3)(cid:5)(cid:40)(cid:85)(cid:85)(cid:82)(cid:85)(cid:3)(cid:85)(cid:72)(cid:86)(cid:76)(cid:79)(cid:76)(cid:72)(cid:81)(cid:87)(cid:3)(cid:89)(cid:76)(cid:71)(cid:72)(cid:82)(cid:3)(cid:70)(cid:82)(cid:71)(cid:76)(cid:81)(cid:74)(cid:3)(cid:88)(cid:86)(cid:76)(cid:81)(cid:74)
`(cid:89)(cid:76)(cid:85)(cid:87)(cid:88)(cid:68)(cid:79)(cid:3)(cid:85)(cid:72)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:3)(cid:83)(cid:76)(cid:70)(cid:87)(cid:88)(cid:85)(cid:72)(cid:15)(cid:5)(cid:3)(cid:51)(cid:85)(cid:82)(cid:70)(cid:17)(cid:3)(cid:54)(cid:51)(cid:44)(cid:40)(cid:3)(cid:24)(cid:25)(cid:27)(cid:24)(cid:15)(cid:3)(cid:44)(cid:80)(cid:68)(cid:74)(cid:72)(cid:3)(cid:68)(cid:81)(cid:71)(cid:3)(cid:57)(cid:76)(cid:71)(cid:72)(cid:82)(cid:3)(cid:38)(cid:82)(cid:80)(cid:80)(cid:88)(cid:81)(cid:76)(cid:70)(cid:68)(cid:87)(cid:76)(cid:82)(cid:81)(cid:86)
`(cid:68)(cid:81)(cid:71)(cid:3)(cid:51)(cid:85)(cid:82)(cid:70)(cid:72)(cid:86)(cid:86)(cid:76)(cid:81)(cid:74)(cid:3)(cid:21)(cid:19)(cid:19)(cid:24)(cid:15)(cid:3)(cid:3)(cid:11)(cid:20)(cid:23)(cid:3)(cid:48)(cid:68)(cid:85)(cid:70)(cid:75)(cid:3)(cid:21)(cid:19)(cid:19)(cid:24)(cid:12)(cid:30)(cid:3)(cid:71)(cid:82)(cid:76)(cid:29)(cid:3)(cid:20)(cid:19)(cid:17)(cid:20)(cid:20)(cid:20)(cid:26)(cid:18)(cid:20)(cid:21)(cid:17)(cid:24)(cid:27)(cid:26)(cid:22)(cid:19)(cid:25)
`(cid:40)(cid:89)(cid:72)(cid:81)(cid:87)(cid:29)(cid:3)(cid:40)(cid:79)(cid:72)(cid:70)(cid:87)(cid:85)(cid:82)(cid:81)(cid:76)(cid:70)(cid:3)(cid:44)(cid:80)(cid:68)(cid:74)(cid:76)(cid:81)(cid:74)(cid:3)(cid:21)(cid:19)(cid:19)(cid:24)(cid:15)(cid:3)(cid:21)(cid:19)(cid:19)(cid:24)(cid:15)(cid:3)(cid:54)(cid:68)(cid:81)(cid:3)(cid:45)(cid:82)(cid:86)(cid:72)(cid:15)(cid:3)(cid:38)(cid:68)(cid:79)(cid:76)(cid:73)(cid:82)(cid:85)(cid:81)(cid:76)(cid:68)(cid:15)(cid:3)(cid:56)(cid:81)(cid:76)(cid:87)(cid:72)(cid:71)(cid:3)(cid:54)(cid:87)(cid:68)(cid:87)(cid:72)(cid:86)
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)(cid:3)(cid:3)(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`UNIFIED 1008
`
`

`Error Resilient Video Coding Using Virtual Reference Picture
`
`Guanjun Zhang and Robert L. Stevenson
`Department of Electrical Engineering
`University of Notre Dame
`Notre Dame, IN 46556, USA
`
`ABSTRACT
`Due to widely used motion-compensated prediction coding, errors propagate along decoded video sequence and may result
`in severe quality degradation. Various methods have been reported to address this problem based on the common idea of
`diversifying prediction references. In this paper, we present an alternative way of concealing the references pictures errors.
`A generated virtual picture is used as a reference instead of an actual sequence picture in the temporal prediction. The
`virtual reference picture is generated in a way to ﬁlter damaged parts of previously decoded pictures so that the decoder
`can still get a clean reference picture in case of errors. Coding efﬁciency is effected due to the fact that the virtual reference
`is less correlated to the currently encoded picture. The simulations on H.264 codec have shown quality improvement of
`the proposed method over intra-coded macroblock refreshment. It can be used on any motion-compensated video codec to
`combat channel errors.
`
`1. INTRODUCTION
`Error resilience become an important feature of video transmission over error-prone networks. Generating error resilient
`video bit streams has become an intensive research area and many solutions has been reported to address this problem.
`In conventional motion-compensated prediction video coding, errors in a single picture may cause mismatch between
`encoder and decoder so that the effect of errors may propagate along decoded video sequence and thus result in severe
`quality degradation. A common way is using independently decodable video coding structures in different levels from
`macroblock to sequence. The picture level refreshment is commonly found as I frame in all coding standards. In sequence
`level, there is multiple description coding, such as Video Redundancy Coding(VRC) in H.263+. 1 The refreshment using
`intra coded macroblocks has been proven to be an effective way to combat errors due to packet loss or bit errors. The side
`effect of using intra coded macroblocks is loss in coding efﬁciency.
`A another basic idea of concealing error effect in inter coded video stream is to use robust prediction reference so
`that the visual quality variation is alleviated. Girod et. al. diversify the references of motion-compensated prediction to
`improve the coding efﬁciency 2,3 while Wang et. al. found out it can also function against errors, 4 as long as a balance
`between efﬁciency and robustness is achieved. Multiple references of motion-compensated prediction has been adopted
`into H.263 and H.26L as an important tool for robustness as well as higher coding efﬁciency 5.6 Reference of current
`pictures are distributed in several previously coded pictures. If each currently coded block use one reference block, the
`situation is not different from conventional prediction since a block is damaged when its reference has errors. If more than
`one reference blocks are searched, it may cost a large amount of additional time and power. In 7 the authors ﬁnd out that,
`although multiple reference are used, the actual references are unevenly located in the temporally previous picture and a
`fast motion estimation algorithm is then developed.
`In error concealment capable video codecs, the decoders always ﬁrst try to conceal errors in current decoded pictures
`before using them as references. If back channels are not available or timely inapplicable, the decoders usually directly
`copy the same area from another picture. If more time and complexity is affordable, a space interpolation or motion-
`compensated interpolation can be applied to yield more accurate reconstructions.
`To further improve error concealment, methods of ﬂexible slice structures are standardized in MPEG-4 AVC/H.264.
`It basically distributes macroblocks from different areas of the pictures into a slice structure. When a slice is crushed,
`the beneﬁts are two-fold: visual quality is less effected since the macroblocks are scattered in the picture; and space
`Tel: (574)631-8308 Fax: (574)631-4393 Email: {gzhang, rls}@nd.edu
`
`896
`
`Image and Video Communications and Processing 2005, edited by Amir Said,
`John G. Apostolopoulos, Proc. of SPIE-IS&T Electronic Imaging,
`SPIE Vol. 5685 © 2005 SPIE and IS&T · 0277-786X/05/$15
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`interpolation is more efﬁcient because damaged macroblocks have more opportunities to be surrounded by error-free
`macroblocks from other slice. Although these concealment steps utilize clean content from earlier decoded pictures, the
`damaged areas can only be partially recovered in most cases. The mismatch problem still exists, if the concealed areas do
`not happen to completely match the original ones.
`In this paper, we present a video coding system which has the capability to combat the error propagation in a damaged
`video sequence. A virtual reference picture(VRP) is generated by using a non-linear ﬁlter on several real pictures in both
`encoder and decoder, and then used to predict the incoming picture. It become an novel solution to the mismatch problem
`in terms of more chances to completely recover the damaged areas than the above systems.
`The rest of this paper is organized as follows. In section 2, we will introduce the video coding system using virtual
`reference pictures and discuss its error resilience capability. In section 3, we will describe the simulation environment.
`Simulation results will be presented and discussed in section 4, which is followed by the conclusions.
`
`2. VIRTUAL REFERENCE PICTURE
`In today’s popular video codecs, motion-compensated prediction uses earlier decoded pictures as references, which are
`actually displayed after decoding. If a back channel or retransmission is not available to the system, a damaged picture
`becomes a damaged reference at the decoder. We argue that it is not necessary to use a real video frame as a prediction
`reference. A virtual picture generated from real pictures can also be use as the reference, as long as it has appropriate error
`resilience properties and acceptable price in the sense of efﬁciency loss.
`We propose a video coding system, as illustrated in ﬁgure 1. The encoder generates a virtual picture by applying a
`nonlinear ﬁlter on several previously coded pictures. It then uses the virtual picture to predict the incoming picture. A
`same ﬁlter is used in the decoder to generate matched reference picture as in the encoder. We assume that there is no back
`channel available.
`
`. . .
`
`DCT
`
`Q
`
`IQ
`
`IDCT
`
`Frame
`Memory
`
`Motion Vectors
`
`Motion
`Compensation
`
`Virtual
`Reference
`Picture
`Generation
`
`Motion
`Estimation
`
`Entropy
`Encoding
`
`1010011...
`
`Entropy
`Decoding
`
`IQ
`
`IDCT
`
`1010011 . . .
`
`Motion Vectors
`
`Motion
`Compensation
`
`. .
`
`Frame
`Memory
`
`Virtual
`Reference
`Picture
`Generation
`
`(a) Encoder
`
`(b) Decoder
`
`Figure 1. A video coding system using virtual reference pictures.
`
`In an error free environment, both encoder and decoder use identical virtual reference pictures. In case of errors, the
`decoder ﬁrst conceals the errors in a picture by spatial interpolation or simple temporal interpolation, such as a copy from
`the same areas in a previously decoded picture. Then the virtual reference picture is yielded by using the concealed picture
`and several other decoded pictures.
`If we consider the concealed picture as a copy of the encoded picture which is damaged by additive noise, the ﬁlter
`shall have the capability to conceal the noise from this picture and generate a virtual picture mostly using information from
`
`SPIE-IS&T/ Vol. 5685 897
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`other clean pictures. When this virtual picture is used to predict the next incoming picture, the effect of errors in the next
`picture is further constrained since its reference has more correct data compared to the damaged picture.
`In this work, we assume the difference between the concealed picture and the error-free picture is equivalent to additive
`white noise for simplicity. We will only use and test the nonlinear median ﬁlter to generate the virtual reference pictures
`due to its capability of conceal white noise. The virtual reference picture is generated from three previously decoded
`pictures. Every pixel value in the virtual reference picture is obtained by median ﬁltering the three pixels, which are from
`the same position of the three decoded pictures, respectively. If we only use forward prediction, the dependence of the
`real sequence pictures and the virtual reference pictures are illustrated in ﬁgure 2. the solid arrows represent the prediction
`directions and the dash arrows represent contributions of real pictures to the virtual reference pictures. Note that the ﬁrst
`two pictures of a video sequence still use the previous real picture as their reference, respectively.
`
`VRP
`
`0
`
`VRP
`
`2
`
`I
`
`0
`
`P
`
`1
`
`P
`
`2
`
`P
`
`3
`
`P
`4
`
`5P
`
`P
`
`6
`
`VRP
`
`1
`
`VRP3
`
`Figure 2. Dependence of pictures in the virtual reference video coding system.
`
`Due to the lower correlation between the virtual reference picture and the incoming picture, coding efﬁciency in the
`proposed video coding system will be lower than the conventional coding system. In section 4, we will show that the bit
`rate increment is acceptable and comparable to that of the macroblock refreshment in most tests.
`
`3. SIMULATION SETUP
`The proposed video coding system is built based on MPEG-4 AVC/H.264 reference software JM9.0. 8 Simulations are
`run on both Virtual Reference Picture(VRP) codec and the original H.264 codec, and their results are compared. The error
`concealment of a damaged picture involves spatial and temporal interpolations as developed in. 9 The concealed picture
`along with two more previously decoded pictures are used to generate the virtual reference picture.
`All pictures of a test sequence are coded in forward prediction frame mode except the ﬁrst I picture. All valid types of
`motion estimations and predictions are allowed as in H.264 baseline. At this stage, only intra macroblock refreshment is
`used in both codecs as the error control method in order to compare their performance in error-prone environments. Test
`video sequences are encoded by both VRP and H.264 encoder using the same H.264 baseline parameters except settings
`for how many intra macroblock per picture. The VRP encoder use the same or half number of intra macroblocks than the
`H.264 encoder. In H.264, only one reference picture is use.
`The noisy channel is assumed to be the binary symmetric channel as in many wireless applications. Random bit errors
`are added to the coded bit streams to simulate the situations when lower level protection, i.e., channel coding, fails. These
`errors may damage header or data information and cause termination of decoding a slice. Information in the slice after the
`
`898 SPIE-IS&T/ Vol. 5685
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`errors are considered lost. For simplicity, we set each slice to have the same number of macroblocks in order to synchronize
`displaying of next slice.
`The ﬁrst test is to demonstrate that the VRP system is able to conceal errors of a decoded picture and constrain the
`propagation of errors. Uniformly distributed random bit errors are generated and added to both bit streams. PSNR values
`of each decoded or concealed picture in the sequences are recorded and compared. Rate distortion optimization is set to
`work so that compared sequences have the same PSNR. Coding efﬁciency is compared in terms of bit stream length.
`The second test is to statistically show that the VRP system has more protection to the video streams than the intra
`macroblock refreshment. This test collects PSNR values of decoded sequences at different bit error rates(BER) and error
`patterns. At each interested BER point, the ﬁrst test is repeated one hundred times under different errors and yield an
`averaged PSNR value. Then a trend of video quality drop along with increased BER can be drawn. Comparing curves of
`two system will bring the conclusion of error resilience capability.
`
`4. RESULTS AND DISCUSSION
`Error resilient capability of the VRP video coding system is ﬁrst illustrated. The test sequence mobile contains 30 pictures
`and is in 4:2:0 SIF format(352 × 240). Quantization parameters are set to 28 for all pictures in both encoders. In both
`sequences, there are 10 intra macroblocks in each H.264 inter picture and 5 in each VRP inter picture. Figure 3 shows
`a typical example of the PSNR values of the damaged sequences. Between picture No.7 and No.9, The proposed VRP
`sequence only experiences a short period of quality drop and recover quickly, while the H.264 sequence needs a very long
`time to recover from errors in the single picture No.8. The same thing happens to pictures between No.12 and No.16.
`This results in a higher average PSNR of VRP sequence(27.77dB) than H.264 sequence(25.05dB). Pictures from No.7 to
`No.9 of VRP sequence are displayed in ﬁgure 4, which shows that the part in the middle of the calendar is damaged and
`concealed. The PSNR drops that happen earlier in both sequences are due to some errors that can not be concealed by the
`current algorithm implementations in the reference software JM9.0.
`
`PSNR of decoded Frame of mobile when the BER is 1E−5
`
`VRP
`Error Free
`H264
`
`0
`
`5
`
`10
`
`15
`Frame number
`
`20
`
`25
`
`30
`
`Figure 3. PSNR value comparison of sequence mobile.
`
`38
`
`36
`
`34
`
`32
`
`30
`
`28
`
`26
`
`24
`
`22
`
`20
`
`18
`
`16
`
`PSNR (dB)
`
`At a frame rate of 30 frame per second, the average bit rate of each P frame in mobile is 10.9 KByte/s from the H.264
`encoder and 16.2 KByte/s from the VRP encoder. The overhead of 48.7% largely dues to the fact that complicated context
`and details of a real picture is ﬁltered by the median ﬁlter. Because the VRP bit stream is longer than the H.264 bit stream,
`it also encounters more errors. This makes the VRP curve have more pits than the H.264 curve.
`
`SPIE-IS&T/ Vol. 5685 899
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`(a)
`
`(d)
`
`(b)
`
`(e)
`
`(c)
`
`(f)
`
`Figure 4. Pictures from mobile: a) - c) picture No.7 to No.9 of the error free sequence; d) - f) picture No.7 to No.9 of the damaged and
`concealed VRP sequence.
`
`Sequence
`
`foreman
`container
`silent
`
`Table 1. Overhead of VRP pictures to the H.264 pictures.
`VRP
`H.264
`Intra MB/picture
`Intra MB/picture
`bitrate(Byte/f)
`5
`10
`854
`5
`5
`366
`5
`5
`557
`
`bitrate(Byte/f)
`996
`433
`665
`
`Overhead
`
`16.6%
`18.3%
`19.4%
`
`Following results demonstrate that the VRP codec outperforms the H.264 codec in sense of error resilience. Sequence
`foreman, container and silent are all in 4:2:0 QCIF format(176× 144). With quantization parameter set to 28, one hundred
`pictures of each sequence are coded in IPPP··· order with only one intra picture at the begin of the sequence.
`Table 1 shows that the overhead of the VRP pictures to the H.264 pictures is lower than 20%. These numbers are
`much smaller than the one from mobile because their context is homogeneous and correlation between pictures are higher.
`Considering that the VRP pictures use only half of intra macroblocks as H.264 pictures, this number is acceptable given the
`PSNR vs. BER performance shown in ﬁgure 5, 6, and 7. Basically, VRP sequences lead H.264 at least 1.5dB at the BER
`1.0E-5. The difference of two curves get larger as BER goes up. In ﬁgure 5, the simulations run over a large range of BER
`so that a complete picture of the quality difference is shown. Others only run in smaller BER ranges since they have clearly
`illustrated their performance. And in fact, the channel coding usually does not behave that bad to have source decoders
`face a very high BER. From ﬁgure 6 and ﬁgure 7, we can see that the VRP sequences are almost ﬂat, which implies that
`the VRP has very good error resilience capability in this area. In ﬁgure 8, the PSNR vs. BER curves of sequence mobile
`mentioned above is also given. The difference of VRP and H.264 at 1.0E-5 is 2.28dB.
`Note that all these results are achieved when we only use a very basic median ﬁlter and assume a simple error model.
`In the future, the work will focus on developing more practical error models and ﬁnding more effective ﬁlters.
`
`900 SPIE-IS&T/ Vol. 5685
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`PSNR vs. BER for sequence foreman
`
`VRP
`H264
`
`2e−6
`
`4e−6
`
`6e−6
`
`8e−6
`
`1e−5
`BER
`
`3e−5
`
`5e−5
`
`7e−5
`
`9e−5
`
`2e−4
`
`Figure 5. PSNR vs. bit error rate curves of sequence foreman, QCIF.
`
`40
`
`35
`
`30
`
`25
`
`20
`
`15
`
`PSNR (dB)
`
`5. CONCLUSIONS
`We propose a novel solution to alleviate the error propagation in the motion-compensated video coding. The video coding
`system uses a virtual reference picture instead of a real sequence picture for inter prediction. An identical nonlinear ﬁlter
`is used to generate the VRP at both encoder and decoder from three previously decoded pictures. Errors from the damaged
`picture are ﬁltered and clean information from other pictures are used.
`Simulations demonstrate that the proposed video coding system can achieve smaller period of video quality degradation
`and higher PSNR than standard H.264 video in error environments when only macroblock refreshment is applied. The
`proposed method need slightly higher bitrate to maintain the same video quality in error free environment due to less
`correlation between VRP and incoming pictures. The proposed approach is suitable for error resilient hybrid motion-
`compensated video coding such as MPEG-4 AVC/H.264.
`
`REFERENCES
`1. S. Wenger, G.D. Knorr, J. Ott, and F. Kossentini. Error resilience support in h.263+. IEEE Trans. Circuits Syst. Video
`Technol., 8(7):867–877, Nov. 1998.
`2. Bernd Girod. Efﬁciency analysis of multihypothesis motion-compensated prediction for video coding. IEEE Tran. on
`Image Processing., 9(12):173–183, Feb. 2000.
`3. Markus Flierl, Thomas Wiegand, and Bernd Girod. Rate-constrained multihypothesis prediction for motion-
`compensated video compression. IEEE Trans. on Circuit and Systems for Video Tech., 12(11), November 2002.
`4. Huisheng Wang and Antonio Ortega. Robust video communication by combining scalability and multiple description
`coding techniques. In Conf. Proc. IS&T/SPIE 15th Annual Symp. on Electronic Imaging 2003(EI03), Jan. 2003.
`5. Study group 16 ITU-T. Annexes u, v, and w to recommendation h.263. Draft for ”H.263++”, November 2000.
`6. ITU-T. Recommendation h.264 / iso/iec 11496-10 ﬁnal committee draft. JVT-E022, September 2002.
`7. Yu-Wen Huang et. al. Analysis and reduction of reference frames for motion estimation in mpeg-4 avc/jvt/h.264. In
`IEEE Int. Conf. on Acoustics Speech and Signal Processing(ICASSP03), volume 3, pages 145–148, 2003.
`8. Joint Video Team(JVT). Reference software jm9.0. http://iphome.hhi.de/suehring/tml/download/, Nov. 2004.
`
`SPIE-IS&T/ Vol. 5685 901
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`PSNR vs. BER for sequence container
`
`VRP
`H264
`
`40
`
`39
`
`38
`
`37
`
`36
`
`35
`
`34
`
`33
`
`32
`
`31
`
`PSNR (dB)
`
`30
`1e−6
`
`2e−6
`
`3e−6
`
`4e−6
`
`5e−6
`
`6e−6
`
`7e−6
`
`8e−6
`
`9e−6
`
`1e−5
`
`BER
`
`Figure 6. PSNR vs. bit error rate curves of sequence container, QCIF.
`
`9. Ye-Kui Wang et. al. The error concealment feature in the h.26l test model. In IEEE Inter. Conf. on Image Process-
`ing(ICIP), 2002.
`
`902 SPIE-IS&T/ Vol. 5685
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:70)(cid:82)(cid:81)(cid:73)(cid:72)(cid:85)(cid:72)(cid:81)(cid:70)(cid:72)(cid:16)(cid:83)(cid:85)(cid:82)(cid:70)(cid:72)(cid:72)(cid:71)(cid:76)(cid:81)(cid:74)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:86)(cid:83)(cid:76)(cid:72)(cid:3)(cid:82)(cid:81)(cid:3)(cid:20)(cid:24)(cid:3)(cid:45)(cid:88)(cid:81)(cid:3)(cid:21)(cid:19)(cid:21)(cid:19)
`(cid:55)(cid:72)(cid:85)(cid:80)(cid:86)(cid:3)(cid:82)(cid:73)(cid:3)(cid:56)(cid:86)(cid:72)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(cid:18)(cid:90)(cid:90)(cid:90)(cid:17)(cid:86)(cid:83)(cid:76)(cid:72)(cid:71)(cid:76)(cid:74)(cid:76)(cid:87)(cid:68)(cid:79)(cid:79)(cid:76)(cid:69)(cid:85)(cid:68)(cid:85)(cid:92)(cid:17)(cid:82)(cid:85)(cid:74)(cid:18)(cid:87)(cid:72)(cid:85)(cid:80)(cid:86)(cid:16)(cid:82)(cid:73)(cid:16)(cid:88)(cid:86)(cid:72)
`
`

`PSNR vs. BER for sequence silent
`
`40
`
`35
`
`30
`
`PSNR (dB)
`
`VRP
`H264
`
`25
`1e−6
`
`2e−6
`
`3e−6
`
`4e−6
`
`5e−6
`
`6e−6
`BER
`
`7e−6
`
`8e−6
`
`9e−6
`
`1e−5
`
`2e−5
`
`Figure 7. PSNR vs. bit error rate curves of sequence silent, QCIF.
`
`PSNR vs. BER for sequence mobile
`
`VRP
`H264
`
`40
`
`35
`
`30
`
`25
`
`20
`
`PSNR (dB)
`
`15
`1e−6
`
`3e−6
`
`5e−6
`
`7e−6
`BER
`
`9e−6
`
`2e−5
`
`Figure 8. PSNR vs. bit error rate curves of sequence mobile, SIF.
`
`SPIE-IS&T/ Vol. 5685 903
`
`(cid:39)(cid:82)(cid:90)(cid:81)(cid:79)(cid:82)(cid:68)(cid:71)(cid:72)(cid:71)(cid:3)(cid:41)(cid:85)(cid:82)(cid:80)(cid:29)(cid:3)(cid:75)(cid:87)(cid:87)(cid:83)(cid:86)(cid:29)(cid:18)(

This document is available on Docket Alarm but you must sign up to view it.

Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

Up-to-date information for this case.
Email alerts whenever there is an update.
Full text search for other cases.
Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.

Access Government Site

We are redirecting you
to a mobile optimized page.

Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket

Supplemental Search

Search for PTAB Motions

PTAB Analytics

TTAB Analytics

Basic Search

Filters

Party Search

Advanced

Selected Courts

Recently Selected Courts

Find PTAB Decisions

PTAB Analytics

Special PTAB Alerts

Orange Book

Directly Search Federal Courts

Search Trademark ...

This document is available on Docket Alarm but you must sign up to view it.

Accessing this document will incur an additional charge of $.

Still Working On It

A few More Minutes ... Still Working

This document could not be displayed.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

One Moment Please

Your document is on its way!

Sealed Document

We are redirecting youto a mobile optimized page.

Document Unreadable or Corrupt

We are unable to display this document.

STEP 2 of 2

Choose your membership type

Flat-Fee

Pay-As-You-Go

Add your payment information

Login or Join

Enter your corporate Email

Thousands of your peers are saving time and gaining a competitive advantage with Docket Alarm.

Join Docket Alarm to perform smarter legal research.

Download this document and millions of others instantly with a Docket Alarm membership.

Join Docket Alarm and start performing smarter legal research.

Start tracking this docket instantly with a Docket Alarm membership.

Join thousands of your peers and start performing smarter legal research.

STEP 1 of 2

Millions of Documents | 15 Seconds to Signup

Hi !

Welcome to Docket Alarm

Welcome to Docket Alarm!

Explore Litigation Insights andManage Your Cases

Reset Password

What is PACER?

Why do I need it?

What will I be charged?

Do other courts have fees?

Basic Free Access

Welcome

Thank you

Check Firm Account

We are redirecting you
to a mobile optimized page.

Explore Litigation Insights and
Manage Your Cases