throbber
AUDIO / VIDEO CODING
`
`Windows Media 9Series
`
`— a platform to deliver compressed audio and video for
`Internet and broadcast applications
`
`Jordi Ribas-Corbera
`Microsoft Windows Digital Media Division
`
`Microsoft® Windows Media® 9 Series is a set of technologies that enables rich digital
`media experiences across all types of networks and devices. These technologies
`include an encoder to author the multimedia content, a server to distribute the
`content, a Digital Rights Management (DRM) system to let content owners set usage
`policies, and a variety of players to decode and render the content on personal
`computers and other consumer electronic devices. These components are built on top
`of a programmable and extensible platform that enables partners to build tailored
`applications and services.
`This article provides a high-level overview of the technologies in Windows Media 9
`Series, with a particular focus on the different audio and video codecs available.
`Applications and services for broadcast (e.g., IP datacasting via DVB) are also
`discussed.
`
`Introduction
`Windows Media 9 Series is the latest generation of digital media technologies developed by Microsoft [1].
`Although the origins of Windows Media focused on streaming compressed audio and video over the Internet
`to personal computers, the vision moving forward is to enable effective delivery of digital media through any
`network to any device.
`
`A wide range of
`applications
`Fig. 1 illustrates a variety of exam-
`ples of how Windows Media tech-
`nology is being used today. In
`addition to Internet-based applica-
`tions (subscription services, video-
`on-demand, Web broadcast, and so
`on), content compressed with Win-
`dows Media codecs is being con-
`sumed by a wide range of wired
`and wireless consumer electronic
`devices (such as mobile phones,
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`Content
`
`Web-based
`- streaming
`- local playback
`
`Devices
`- wired
`- wireless
`
`Future delivery
`- DVB-S,
`- DVB-T
`- digital cinema
`
`Physical format
`- SD memory
`- secure CDs
`- next-gen DVDs
`
`Figure 1
`Examples of current Windows Media technology applications
`
`1 / 12
`
`EX1053
`Roku V. Media Chain
`U.S. Patent No. 10,515,191
`
`

`

`AUDIO / VIDEO CODING
`DVD players, portable music players, and car stereos) [2]. Windows Media content can also be delivered to
`consumers in physical formats(cid:151)for instance, using the Secure Digital (SD) memory card [3], or on CD or
`DVD using the emerging HighMAT(cid:153) format [4].
`In the terrestrial and satellite broadcast space, a recent project at the International Broadcasting Convention
`(IBC) demonstrated how to deliver Windows Media 9 Series content via DBV-T and DVB-S [5]. As another
`example, Windows Media technology is also used to compress movies in high definition and multi-channel
`audio, and these films are being projected in public digital theatres in the USA.
`
`End-to-end delivery
`All of the applications mentioned so far require a set of building blocks or components that permit the deploy-
`ment of complete end-to-end solutions. The fundamental components of Windows Media 9 Series are illus-
`trated in Fig. 2 and can be classified in three steps: authoring, distribution and playback.
`
`Authoring
`Authoring is the process of creating and encoding digital media. The basic encoding software provided by
`Microsoft is called Windows Media Encoder 9 Series. It is a flexible encoder that can compress audio and
`video sources for live or on-demand streaming by using the Windows Media codecs.
`At the same time, there are alternative encoding solutions provided by third parties that are built on top of the
`Windows Media porting kits (e.g., hardware encoders from companies such as Optibase, Tandberg Television,
`Texas Instruments) or the Windows Media software development kits (SDKs) (e.g., software encoders from
`companies like Accom, Adobe, Avid, Discreet and Sonic Foundry).
`
`Distribution
`The distribution over the Internet of content, compressed with Windows Media codecs, is generally done by a Win-
`dows Media Services server. Windows Media Services version 4.1 is an optional component in Windows 2000
`Server, and Windows Media Services 9 Series is expected to be an optional component in Windows Server 2003.
`
`Licence
`server
`
`Download & Play
`streaming
`
`UNICAST
`MULTICAST
`
`Windows Media
`encoder
`
`Windows Media
`Services server
`(or Web Server)
`
`Windows Media Player
`(PC, hand-held device,
`set-top box ...)
`
`Live feed
`
`Stored
`content
`
`Authoring
`
`Distribution
`
`Playback
`
`Figure 2
`End-to-end delivery of Windows Media content: authoring, distribution and playback.
`The DRM system protects content, based on policies set by the owner.
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`2 / 12
`
`

`

`AUDIO / VIDEO CODING
`The new server supports more features for advertising and corporate scenarios and is twice as scalable (cid:150) it
`doubles the number of customers that can receive a media clip at the same time.
`A server can either stream the clip (transmit it with as little delay as possible) or download it (transmit and
`store it) into a user’s playback device. The transmission of the clip can be performed live (for news, sports,
`concerts or similar events) or on demand (for music videos, movies-on-demand and so on). When streaming a
`media clip, the server adapts its throughput and re-transmits lost packets intelligently, using feedback from the
`network quality metrics. For on-demand streaming, the latest server takes advantage of the additional band-
`width available (above the average bit-rate of the clip) to reduce the start-up delay. In addition, such a server
`reduces the likelihood of losing the connection (which manifests as playback (cid:147)glitches(cid:148) and re-buffering to the
`viewer) by sending more data to the playback device, so that when there is network congestion the device can
`continue playing.
`A robust and scalable server is essential for Internet delivery, but obviously having a solid network connection
`is also critical. The latter is addressed by content delivery networks (CDNs) such as Akamai, Digital Fountain
`and SMC. The combination of robust servers and networks result in TV-like experiences that are remarkably
`superior to those of Internet streaming in the recent past.
`As mentioned earlier, one can also deliver Windows Media content via physical formats such as CD or DVD,
`or through other networks such as DVB. We will discuss transmission of Windows Media over DVB in more
`detail later in this article.
`
`Playback
`The final step of the end-to-end delivery process is playback, which consists of decoding and rendering the
`compressed data in the user’s device. On a personal computer, Windows Media Player and a variety of other
`players built by third parties (such as MusicMatch Jukebox or RealOne Player) can decode and play Windows
`Media streams and files. As illustrated in Fig. 2, a wide variety of consumer electronic devices can also play
`Windows Media content [2]. Like in the encoder case, third parties can build such hardware playback devices
`on any platform by using Windows Media porting kits.
`
`Digital Rights Management
`A critical component of the end-to-end delivery is Digital Rights Management (DRM), which we represent by
`the licence server in Fig. 2. DRM works across the three steps of the media delivery system, so we will dis-
`cuss it separately in this subsection.
`The DRM technology used by Windows Media lets owners encrypt their media products and services and
`specify the usage rules and policies. For example, the owner may decide that the user can play the digital
`media until a certain date, or can play it a given number of times. Or the owner can let the user copy the digital
`media to a certain number (and type) of devices.
`In the typical Internet scenario, the content owner encrypts the (compressed) digital media stream using DRM.
`When a viewer selects the stream, the playback device connects to the licence server, which offers a licence for
`the content. The viewer then decides whether to accept the terms and price of the licence and, if so, the licence
`is downloaded into the viewer’s device. Then the viewer is able to decrypt and play the content according to
`the terms of the licence.
`Designing a complete DRM service is a challenging project. The system needs to be secure (with the capabil-
`ity of upgrading quickly), flexible (it must accommodate the desires of the content owner and device manufac-
`turers), and user-friendly. Windows Media DRM fulfils the requirements of many scenarios and is one of the
`leading solutions in the market.
`
`Platform Components
`
`Windows Media provides a development platform in addition to a set of components for authoring, distribut-
`ing and playing digital media. Windows Media Encoder, Windows Media Services and Windows Media
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`3 / 12
`
`

`

`Windows
`Media
`Server
`SDK
`
`Windows
`Media
`Player
`SDK
`
`Windows
`Media
`Encoder
`SDK
`
`Windows
`Media
`Rights
`Manager
`SDK
`
`Windows
`Media
`Format
`SDK
`(Audio &
`Video)
`
`AUDIO / VIDEO CODING
`Player meet the requirements of a fair number of applications, but they are essentially examples that showcase
`the capabilities of the platform.
`These components are built on top
`of SDKs that can also be used by
`third parties to develop their own
`encoders, servers and players, tai-
`lored to specific applications. By
`providing a platform with state-of-
`the-art compression and delivery
`technology, Windows Media ena-
`bles third parties to create their
`own leading-edge customized solu-
`tions [6].
`Fig. 3 shows the Windows Media 9
`Series SDKs. Software such as
`Windows Media Player and the
`MusicMatch Jukebox are built using the Windows Media Player SDK. A large number of companies have
`downloaded and licensed one or more of these SDKs to build their own digital media systems.
`In addition, companies can build Windows Media-based software and hardware solutions by using the porting
`kits described later in this article.
`
`Figure 3
`The Windows Media 9 Series SDKs
`
`Audio and video codecs
`Windows Media audio and video codecs are key components for the authoring and playback of digital media.
`The table below lists the audio and video codecs that ship in Windows Media 9 Series. Each codec uses differ-
`ent technology and bit-stream syntax and, consequently, is not compatible with the others; e.g., the bit-streams
`of Windows Media Audio 9 Professional cannot be decoded by a Windows Media Audio 9 decoder, and vice-
`versa. Older codecs (such as Windows Media Video 8, ISO MPEG-4 Video and others) are also supported for
`backward compatibility, but they are not listed here as the focus of this article is on the new 9 Series codecs.
`
`Windows Media Audio 9 Codecs
`
`Windows Media Video 9 Codecs
`
`Windows Media Audio 9
`Windows Media Audio 9 Professional
`Windows Media Audio 9 Lossless
`Windows Media Audio 9 Voice
`
`Windows Media Video 9
`Windows Media Video 9 Screen
`Windows Media Video 9 Image
`
`Windows Media Audio (WMA) 9
`The Windows Media Audio 9 codec is the most popular audio codec in Windows Media and is commonly
`known as WMA. The decoder (or bitstream syntax) was frozen more than four years ago, and only the
`encoder has been improved since then. WMA 9 is the third iteration of backward-compatible improvements.
`Maintaining backward compatibility has been critical to support the consumer electronics manufacturers who
`choose to build devices that play WMA.
`The new WMA encoder enhances the one-pass constant-bit-rate (CBR) encoding mode (the only mode sup-
`ported by WMA in previous versions), using improved rate control and masking algorithms. It adds new two-
`pass and variable-bit-rate (VBR) modes that further enhance quality over the one-pass mode.
`For any of the codecs, one-pass CBR encoding is required for live encoding and transmission, while two-pass
`CBR is appropriate when encoding off-line for on-demand streaming. VBR modes are recommended when
`the compressed clips will be downloaded into the user(cid:146)s device (for download-and-play applications). Even
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`4 / 12
`
`

`

`AUDIO / VIDEO CODING
`though VBR-encoded clips can also be streamed (with the new server), the bit-rate fluctuations within the
`clips are usually high and their transmission requires long buffering delays. There is also a peak-constrained
`VBR encoding mode to create bitstreams meant to play back on devices with a constrained reading speed. The
`WMA 9 codec supports all of these encoding modes.
`
`WMA 9 also supports a large list of encoding settings for mono and stereo audio, with bit-rates ranging from
`5 kbit/s to 320 kbit/s and sampling rates ranging from 8 kHz to 48 kHz. At the typical CD sampling rate
`(44.1 kHz), we found that most users select rates between 48 kbit/s to 128 kbit/s to achieve CD-like quality,
`depending on their sensitivity to compression artefacts and bandwidth availability. A small percentage of crit-
`ical listeners may require higher quality to be satisfied, which is why higher encoding rates are also provided.
`
`There has been a lot of discussion about which audio codec technology produces the best quality. There are
`quite a few good audio codecs available, and many opinions. We find that experts still differ on what is the
`best compressed sound. Some prefer codecs that preserve more bandwidth, and hence produce a richer sound,
`while tolerating some distortion at high frequency. Others prefer codecs that generate a more muffled sound
`with minimum high-frequency distortion.
`
`The audio quality of the latest WMA 9 has not been evaluated by independent studies yet. Prior versions have
`been extensively evaluated and have been ranked at the top of some studies. For example, WMA 8 was
`recently chosen over MP3 and RealAudio 8 in a study by the (cid:147)golden ears(cid:148) of Sound & Vision [7]. Other stud-
`ies have reached different conclusions for a variety of reasons 1, including differences in content and testing
`conditions, or sometimes simply because the listeners had other subjective preferences. As a result, audio
`experts in organizations usually base their decisions on their own subjective tests.
`
`A new objective for Windows Media 9 Series has been to develop compression technology that goes beyond
`CD quality. The first major step in that direction is the introduction of the Windows Media Audio 9 Profes-
`sional codec.
`
`Windows Media Audio (WMA) 9 Professional
`
`The WMA 9 Professional codec is the first Windows Media codec for audio that supports high-resolution (up to
`24 bits per audio sample, and sampling rates of up to 96 kHz), and multiple channels (up to eight discrete chan-
`nels) for typical 5.1 or 7.1 speaker configurations in high-end home systems or commercial digital theatres.
`
`An important application for this codec is the encoding of multi-channel music and movie sound tracks at
`Internet broadband rates, where there is currently no suitable, widely deployed codec in the industry. For
`example, the minimum bit-rate of Dolby(cid:146)s AC-3 codec is 384 kbit/s (for 5.1 channels), which provides very
`little bit-rate for video on DSL/cable connections. WMA 9 Professional can encode 5.1 channels at as low as
`128 kbit/s, and 192 kbit/s appears to be the (cid:147)sweet spot(cid:148) for the technology. As a result, there is enough band-
`width left for encoding video for broadband delivery.
`
`As with the WMA 9 codec, there are also five encoding modes for WMA 9 Professional: one-pass CBR, two-
`pass CBR, one-pass VBR, two-pass VBR, and peak-constrained VBR. In addition, WMA Professional 9
`allows near-lossless compression at its highest-quality VBR setting.
`
`Windows Media Audio (WMA) 9 Voice
`
`Another new codec in Windows Media 9 Series is WMA 9 Voice. This codec compresses mono-only audio at
`very low bit-rates, which is useful for transmitting digital media through low-rate modem or ISDN connec-
`tions. The supported bit-rates range from 4 to 20 kbit/s and the sampling rates range from 8 to 22 kHz. At this
`point in time, WMA 9 Voice only supports one-pass CBR encoding.
`
`1. One of the most comprehensive audio codec studies (which included WMA 8) was recently performed by several members of the EBU
`[8]. Unfortunately, WMA 8 was compared to other codecs at different settings – in some cases the bandwidth or sampling rate (in Hz)
`for WMA 8 was set to more than twice that of others. We hope to work more closely with these EBU members in the future to avoid
`these drawbacks. We believe that future tests will likely include the latest WMA 9 codec.
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`5 / 12
`
`

`

`AUDIO / VIDEO CODING
`When compressing audio at very low bit-rates, typical transform-based codecs will generally produce better
`quality on music, while CELP-based (or harmonic) codecs will produce better quality on speech. WMA 9
`Voice is a unique hybrid codec that uses an automatic classifier to detect voice and music, and applies the
`appropriate coding mode for each segment. When the content contains voice and music, the mode selected
`depends on the type of audio that dominates. The encoder also provides a manual mode so that the user can
`select the desired mode for any given segment.
`The voice encoding mode is based on a new advanced algorithm. The music mode uses essentially WMA
`transform technology. Consequently, this codec will provide state-of-the-art quality for both types of audio
`content, while previous codecs will generally only do well with one type.
`
`Windows Media Audio (WMA) 9 Lossless
`The final audio codec in Windows Media 9 Series is a mathematically lossless codec. This codec is competi-
`tive with other state-of-the-art lossless audio codecs such as Monkey Audio, and can compress a wide variety
`of audio sources, from CD resolution and sampling rates up to 24-bit, 96 kHz, 7.1 channel audio.
`WMA 9 Lossless is integrated into Windows Media Player 9 Series (in the CD copy function) and can achieve
`compression ratios of about 2:1 for stereo content. Multi-channel, high-resolution audio clips can often be
`compressed losslessly with higher ratios.
`
`Selecting appropriate audio codecs
`For typical stereo broadcast and Internet broadband applications, the standard WMA 9 codec is recommended.
`If the audio or film contains a high-resolution or multi-channel track, then one should consider WMA 9 Pro-
`fessional.
`The other audio codecs are useful for somewhat more limited scenarios. WMA 9 Voice targets very low rate
`audio applications (such as modem or ISDN delivery), and WMA 9 Lossless is useful for audio archival.
`
`Windows Media Video (WMV) 9
`The WMV 9 codec is the most popular video codec in Windows Media 9 Series and is based on technology
`that can achieve state-of-the-art compressed video quality from very low bit-rates (such as 160x120 at 10 kbit/
`s for modem applications) to very high bit-rates (1920x1080 at 6 to 20 Mbit/s for high-definition video). The
`codec supports all five CBR and VBR encoding modes.
`WMV 9 achieves 15 to 50 percent compression improvements over version 8, and the improvements tend to
`be greater at higher bit-rates. For instance, Fig. 4 shows a peak signal-to-noise ratio (PSNR) quality plot ver-
`sus bit-rate for WMV 9, WMV 8 and Microsoft(cid:146)s ISO MPEG-4 video codec (simple profile). The source con-
`sisted of 13 typical MPEG clips (i.e., (cid:147)Stefan(cid:148), (cid:147)Akiyo(cid:148), (cid:147)Coastguard(cid:148), (cid:147)News(cid:148), (cid:147)Mobile & Calendar(cid:148), etc).
`
`Abbreviations
`
`ANSI
`American National Standards Institute
`ASF
`(Microsoft) Advanced Streaming Format
`CBR
`Constant Bit-Rate
`CDN
`Content Delivery Network
`CPU
`Central Processing Unit
`DSL
`Digital Subscriber Line
`DRM Digital Rights Management
`DVB
`Digital Video Broadcasting
`DVB-S DVB - Satellite
`DVB-T DVB - Terrestrial
`IBC
`International Broadcasting Convention
`ISDN
`Integrated Services Digital Network
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`ISMA
`Internet Streaming Media Alliance
`ISO
`International Organization for Standardization
`ITU
`International Telecommunication Union
`ITU-T
`ITU - Telecommunication Standardization Sector
`MMX
`(Intel) Pentium-based chip technology
`PSNR
`Peak Signal-to-Noise Ratio
`QCIF
`Quarter Common Intermediate Format
`SD
`Secure Digital
`SDK
`Software Development Kit
`VBR
`Variable Bit-Rate
`WMA (Microsoft) Windows Media Audio
`WMV
`(Microsoft) Windows Media Video
`
`6 / 12
`
`

`

`AUDIO / VIDEO CODING
`
`40%
`
`100%
`
`46
`
`44
`
`42
`
`40
`
`38
`
`36
`
`PSNR (quality)
`
`We set a fixed quantization step
`size for all codecs and used the
`same mode selection strategy, as it
`is usually done in MPEG and ITU
`standard tests. Even though PSNR
`is by no means an exact measure of
`video quality, the plot conveys that
`the visual compression gains also
`translate to PSNR gains.
`
`With the compression efficiency of
`WMV 9, one can achieve broad-
`cast-quality BT.601 video at about
`2 Mbit/s, and high-quality, high-
`definition video (e.g., 720p) at
`high-end broadcast or DVD rates
`(e.g., 4 to 6 Mbit/s). All broadcast
`formats are supported, including
`the high-definition 720p and 1080i
`variants.
` The codec
`includes
`native interlace compression tools.
`In addition to 4:2:0, it also supports
`the 4:1:1 sampling structure to maintain the odd and even field chroma separately in interlaced video (4:2:0
`mixes the chroma values between both fields).
`
`Figure 4
`PSNR versus bit-rate for WMV 9, WMV 8 and Microsoft’s ISO
`MPEG-4 video codec, on MPEG clips
`
`Since there are different applications that require different levels of complexity for WMV, we have defined
`several profiles and levels for interoperability. For example, the (cid:147)simple profile and low level(cid:148) supports up to
`QCIF resolution, 96 kbit/s, and 15 frames/sec (fps), and targets low-end hand-set devices. The (cid:147)main profile
`and main level(cid:148) targets the standard-definition broadcast scenario (the functional equivalent of MPEG-2(cid:146)s
`MP@ML). The (cid:147)main profile and high level(cid:148) is appropriate for high-definition applications (equivalent to
`MPEG-2(cid:146)s MP@HL). WMV 9 bitstreams targeted to higher-end applications (e.g., standard-definition and
`high-definition broadcast) are referred to as WMV 9 Professional bitstreams.
`
`The video quality of the latest WMV 9 codec has not yet been evaluated by independent studies. Nevertheless,
`prior versions of the codec have been shown to provide competitive compression efficiency by such studies. For
`example, WMV 8 was selected as the video codec that produced the best quality in the latest video codec review
`of DV magazine [9]. In addition, the preliminary conclusions of the recent EBU tests [10] also gave an edge to
`WMV 8. Like in the audio case, there are plenty of opinions and studies on which codec produces the best video
`quality, so once again it is recommended that experts do their own tests and reach their own conclusions.
`
`WMV 9 and video compression standards
`A typical question that arises is whether the compression efficiency provided by WMV 9 is superior to that of
`prior standards like MPEG-2 and MPEG-4, or even of the emerging H.264. This question is difficult to
`answer because standards only define the bitstream syntax and decoder semantics. Therefore, different imple-
`mentations can produce different quality results. The same could be said about WMV 9, since we anticipate
`that future (backward-compatible) versions of the encoder built by hardware vendors will likely enhance the
`compression efficiency over our current version.
`Nevertheless, in order to provide some indication of quality comparisons, we conducted internal tests using the
`well-known Minerva C250 MPEG-2 hardware video codec and the recently-released QuickTime 6 MPEG-4
`codec (with the more advanced ISMA profile 1 compatibility level). In those tests, using the same encoding
`settings, WMV 9 achieved similar quality to MPEG-2 and MPEG-4 with only 1/3 and 1/2 of the bit-rate,
`respectively 2. Although there may be better implementations of MPEG-2 and MPEG-4, these large compres-
`
`2.
`
`To obtain a DVD data disk with the uncompressed source, the bitstream results, and instructions for duplicating these tests independ-
`ently, please send an e-mail to stevsk@microsoft.com.
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`7 / 12
`
`34
`
`32
`
`30
`
`0
`
`250
`
`550
`
`750
`
`1000
`
`WMV v9
`WMV v8
`MPEG-4
`
`1250
`1550
`(kbit/s)
`
`1750
`
`

`

`AUDIO / VIDEO CODING
`sion gains suggest that WMV 9 provides a significant quality advantage (or bandwidth savings) over codecs
`that are compliant with these standards. Indeed, recent independent studies have also concluded that the com-
`pression efficiency of an early (pre-beta) version of WMV 9 [11] and even of WMV 8 [10] is superior to that
`obtained by solutions based on MPEG-2 or MPEG-4.
`
`H.264 [12] is a joint ITU-ISO video compression standard that is expected to be completed by May 2003.
`Since the interoperability process usually continues for months after the standard is completed, it will take
`some time before competitive standard-compliant products appear on the market. Therefore, it is still too early
`to reach any solid conclusions on the quality differences between H.264 and WMV 9. Nevertheless, since the
`rate-distortion-optimized reference codec (cid:150) developed by the Joint Video Team of ITU-ISO [13] (cid:150) is believed
`to provide very high quality, some companies have already made their own initial tests. For example, a fairly
`comprehensive study, performed several months ago, concluded that the video quality achieved by H.264 and
`WMV 9 is similar [11], although both codecs have been undergoing enhancements since then.
`
`A concern with H.264 is the high computational complexity required for encoding and decoding. For exam-
`ple, some initial studies show that in order to achieve the compression benefits, the encoding computational
`complexity needs to increase by an order of magnitude over MPEG-4 simple profile [14]. In addition, they
`suggest that the decoder complexity of H.264 (main profile) will be about three times higher than MPEG-4
`(simple profile) [14].
`
`On the other hand, the decoding complexity of the main profile of WMV 9 is relatively close to that of our
`(MMX-optimized) MPEG-4 simple profile codec. To be more concrete, decoding with WMV 9 is only about
`1.4 times slower, which can be verified easily by using both codecs in the Windows Media Player (or using
`other MPEG-4 simple profile decoders). Even though one should not draw parallels on such complexity anal-
`yses, this information suggests that H.264 decoding complexity could be twice that of the WMV 9 codec, or at
`least that there is a significant decoding computational benefit of WMV 9 over H.264.
`
`Video smoothing
`
`A new feature of WMV
`is the ability to interpo-
`late missing frames after
`decoding. This feature
`is called (cid:147)video smooth-
`ing(cid:148) in Windows Media
`9 Series, and it is com-
`monly referred
`to as
`frame
`interpolation
`in
`the literature [15]. The
`video smoothing algo-
`rithm uses an advanced
`optical flow estimation
`technique (on a per-pixel
`basis), along with warp-
`ing, to synthesize new
`frames. Fig. 5 provides
`an
`illustration of
`the
`process. The feature is CPU-intensive and is only triggered when there are enough CPU cycles available. For
`example, end users typically need a CPU operating at 733 MHz or higher to interpolate a video clip at
`320x240 pixels resolution from 10 to 30 fps.
`
`Figure 5
`Video smoothing interpolates missing frames at the decoder side using
`optical flow and warping techniques
`
`This feature is particularly useful at very low bit-rates, where the full-frame rate is difficult to achieve during
`encoding, and where the low-rate compression artefacts will mask occasional interpolation errors. Video
`smoothing will remove the jerkiness associated with such low-rate video, and hence it will improve the video
`quality. Alternatively, a content provider can encode a video clip at a lower frame rate (e.g., 12.5 fps) and a
`lower bit-rate on purpose, and then let video smoothing up-sample at the decoder side (e.g., up to 25 fps).
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`8 / 12
`
`

`

`AUDIO / VIDEO CODING
`
`Windows Media Video (WMV) 9 Screen
`
`The WMV 9 Screen codec is the next version of a highly efficient engine for palletized computer-screen video
`compression. This codec is used for capturing applications from a computer desktop for the purposes of creat-
`ing a demonstration. The entire desktop can be coded and transmitted at rates as low as 28 kbit/s, although
`when there are natural images embedded in the desktop application, the required bit-rate is usually around
`100 kbit/s.
`This new version of the codec improves both picture quality and CPU usage when there is motion and natural
`images, with respect to the earlier version. It supports one-pass VBR and one-pass CBR encoding. A future
`version will incorporate motion compensation to handle embedded videos on the desktop.
`
`Windows Media Video (WMV) 9 Image
`The last new codec in Windows Media 9 Series is WMV 9 Image. This codec enables a user to combine a set
`of still images to create a video clip using fading, panning, zooming and other effects. One can understand this
`compression technique as a video codec where the I-frames in the bit stream are followed by a set of motion
`and transition instructions for each frame (rather than P-frame data).
`
`Selecting appropriate video codecs
`In the vast majority of cases, including broadcast applications, the WMV 9 codec is the correct choice.
`The WMV 9 Screen and WMV 9 Image codecs are used in more limited (albeit interesting) scenarios, such as
`when the user wants to compress the video of the computer screen or create a video out of a set of still images.
`
`Windows Media and consumer-electronic devices
`Windows Media made inroads in the consumer electronic space when the syntax for WMA was frozen more
`than four years ago. With a fixed bit stream, a large number of companies started implementing WMA in their
`products. There are currently more than 170 types of devices (DVD players, CD players, personal digital
`assistants such as PocketPC, portable music players, car stereos, and so on) that support WMA [2].
`Video codec technology has improved significantly during the last few years. Even though we envision new
`algorithms that can provide incremental benefits, their computational cost appears to be very high. Therefore,
`we believe that these advances will not be commercially viable for quite some time, perhaps five years or
`longer. Consequently, the WMV bit stream syntax will be frozen for the first time in version 9, and future ver-
`sions of WMV (e.g., versions 10, 11, etc.) will be backward compatible.
`The process for freezing the WMV 9 bit stream was done carefully with much feedback from chip manufactur-
`ers. The objective was to achieve the highest quality with the minimum computation requirement. Algorithms
`that provided a very minor quality improvement, but required much computation, were either re-designed or
`occasionally eliminated. As a result, the quality-to-computation balance of WMV 9 is quite compelling. Even
`in software, one can decode and render some 1080p content in real-time using WMV 9 on a high-end compu-
`ter (e.g., 2.8 GHz processor) without any hardware acceleration. Nevertheless, we are actively working with
`ATI and NVIDIA to incorporate WMV decoding in their graphic cards through Microsoft DirectXfi Video
`Acceleration, which should guarantee robust playback for all content at all bit-rates in even lower-end
`machines.
`A common misconception for customers not familiar with Windows Media-based devices is that Windows
`Media technologies can only be implemented on computers running Microsoft Windowsfi operating systems.
`In reality, Windows Media is a generic digital media format that is operating system (OS) and platform inde-
`pendent. The different Windows Media components are broadly licensed and can be ported using Windows
`Media Porting Kits (which provide specifications and ANSI C source code) to non-OS hardware devices,
`graphic cards, Linux, Mac, etc. Further, the specification and licence for the Windows Media file container,
`called Advanced System Format (ASF), is open and publicly available [16], and organizations such as the
`
`EBU TECHNICAL REVIEW – January 2003
`J. Ribas-Corbera
`
`9 / 12
`
`

`

`AUDIO / VIDEO CODING
`Secure Digital (SD) consortium have adopted Windows Media components (in this case, ASF and WMA) in
`their specifications [3].
`Consequently, many partners and licensees compete among themselves to provide better and cheaper chips
`and OEM solutions that support Windows Media on a variety of platfor

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket