`
`Softswitch
`Architecture for VoIP
`
`San Juan Seoul Singapore Sydney Toronto
`
`McGraw-Hill
`
`NewYork Chicago San Francisco Lisbon
`London Madrid Mexico City Milan New Delhi
`
`AT&T v. VoIP, IPR 2017-01383, Page 1
`
`AT&T Exhibit 1018
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 1
`
`
`
`
`
` The MCGFHW'Hm Companies
`
`Cataloging-in-Publication Data is on file with the Library of Congress.
`
`Copyright © 2003 by The McGraW-Hill Companies, Inc. All rights reserved.
`Printed in the United States ofAmerica. Except as permitted under the United
`States Copyright Act of 1976. no part of this publication may be reproduced or
`distributed in any form or by any means, or stored in a data base or retrieval
`system, Without the prior written permission of the publisher.
`234567890 DOCfDOC 0987654
`
`ISBN 0-07-140977-7
`
`The sponsoring editor for this book 35:13 Marjorie Spencer and the production supervisor
`was Pamela A. Pelton. It was set in . 13;; Century Schoolbook by MacAllister Publishing
`Services, LLC.
`
`Printed and bound by RR Donnelfqt
`
`MCGTaW‘iHfll books are available a: atrial quantity discounts to use as premiums and
`
`sales promotions, or for use in come-E}. mung programs. For more information,
`please write to the Director of Expert 5 Saies, Professional Publishing, McGraW-Hill,
`
`Two Penn Plaza, New York, NY 1'}21-3235. Or contact your local bookstore.
`
`
`
`Information contained in this 731-: :as been obtained by The McGravv-Hill
`Companies, inc. (“MoGraw-Hii‘ 2:: sources believed to be reliable. However,
`
`neither McGraerill nor its 2:511 marantee the accuracy or completeness of
`any information published here: iii neither McGraw-Hill nor its authors
`
`shall be responsible for any €737:
`:r—‘ssions, or damages arising out of use of
`this information. This work is gs' '
`' Pi With the understanding that McGraW—
`
`Hill and its authors are 5 p3;
`nation but are not attempting to render
` HI]:
`engineering or other pror
`res. If such services are required, the
`5:.
`assistance of an appropriate ztri-EEIEJ should be sought.
`
`
`
`
`I! This book is printed on ?h'jFit’Z; iii-fee paper containing a minimum of
`‘3 50 percent recycled de—irl's-i Exes:
`
`AT&T v. VoIP, IPR 2017-01383, Page 2
`
`
`AT&T Exhibit 1018
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 2
`
`
`
`
`
`The Public
`
`Switched
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 3
`
`
`
`
`
`.10
`
`Chapter 2
`
`An understanding of the workings of the Public Switched Telephone Net-
`work (PSTN) is best grasped by understanding its three major components:
`access, switching, and transport (see Figure 2-1). Each element has evolved
`over the IOU—plus year history of the PSTN. Access pertains to how a user
`accesses the network. Switching refers to how a call is “switched” or routed
`through the network, and transport describes how a call travels or is “trans—
`ported” over the network.
`
`
`
`Access
`
`Access refers to how the user accesses the telephone network. For most
`users, access is gained to the network via a telephone handset. Transmis—
`sion and reception is via diaphragms where the mouthpiece converts the air
`pressure of voice into an analog electromagnetic wave for transmission to
`the switch. The earpiece performs this process in reverse. The most
`sophisticated aspect ofthe handset is its Dual-Tone Multifi‘equency (DTMF)
`function, which signals the switch by tones. The handset is usually con-
`nected to the central office (where the switch is located) via copper wire
`known as twisted pair because, in most cases, it consists of a twisted pair of
`copper wire. The stretch of copper wire connects the telephone handset to
`the central office. Everything that runs between the subscriber and the cen—
`tral office is known as outside plant. Telephone equipment at the subscriber
`end is called customer premise equipment (OPE).
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`_ m
`Figure 2-1
`We three
`
`
`
`
`
`
`
`
`AT&T v. VoIP, IPR 2017-01383, Page 4
`
`AT&T Exhibit 101%
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 4
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`Switching
`
`m m ..
`
`Figure 2-2
`The traditional
`
`relationship of
`Class 4, Class 5, and
`data networks
`
`Legacy Networks
`
`Web Sites
`
`The PSTN is a star network; that is, every subscriber is connected to
`another via at least one if not many hubs known as offices. In those offices
`are switches.Very simply, local offices are used for local service connections
`and tandem offices for long—distance service. Local offices, better known
`as central offices, use Class 5 switches, and tandem offices use Class
`4 switches. Figure 2-2 details the relationship between Class 4 and 5
`switches. A large city might have several central offices. Denver (population
`2 million), for example, is estimated to have almost 40 central offices. Cen-
`tral offices in a large city often take up much of a city block and are recog-
`nizable as large brick buildings with no windows.
`The first telephone switches were human. Taking a telephone handset off
`hook alerted a telephone operator of the caller’s intention to place a call.
`The caller informed the operator of their intended called party and the
`operator set up the call by manually connecting the two parties.
`Mechanical switching is credited to Almon Stowger, an undertaker in
`Kansas City, Missouri, who realized he was losing business when families
`of the deceased picked up their telephone handset and simply asked the
`operator to connect them with “the undertaker.” The sole operator in this
`
`IP Circuits
`
`Class 4 Switch
`
`TDM Circuits
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 5
`
`
`
`
`
`town was engaged to an undertaker competing with Stowger. This compet-
`ing undertaker had promised to marry the operator once he had the finan-
`cial means to do so. The operator, in turn, was more than willing to help him
`achieve that goal.
`Stowger, realizing he was losing business to his competitor due to the
`intercession of the telephone operator, proceeded to invent an electro—
`mechanical telephone handset and switch that enabled the caller, by virtue
`of dialing the called party’s number, to complete the connection without
`human intervention. Telephone companies realized the enormous savings
`in manpower (or womanpower as the majority oftelephone operators at the
`time were women) by automating the call setup and takedown process.
`Stowger switches (also known as crossbar switches) can still be found in the
`central offices of rural America and lesser developed countries.
`Stowger’s design remained the predominant telephone switching tech—
`nology until the mid-1970s. Beginning in the ‘70s, switching technology
`evolved to mainframe computers; that is, no moving parts were used and
`the computer telephony applications made such features as conferencing
`and call forwarding possible. In 1976, AT&T installed its first #4 Electronic
`Switching System MESS) tandem switch. This was followed shortly there—
`after with the 5ESS as a central office switch. ESS central office switches
`did not require a physical connection between incoming and outgoing cir-
`cuits. Paths between the circuits consisted of temporary memory locations
`that enabled the temporary storage of traific. For an ESS system, a com-
`puter controls the assignment, storage, and retrieval of memory locations so
`that a portion of an incoming line (time slot) could be stored in temporary
`memory and retrieved for insertion to an outgoing line. This is called a time
`slot interchange (TSI) memory matrix. The switch control system maps spe-
`cific time slots on an incoming «communication line (such as a D83) to
`specific time slots on an outgoing communication line.1
`
`
`
`1Harte, Lawrence. Telecom Made Simple. Fuquay-Varina, NC: APDG Publishing, 2002.
`
`AT&T v. VoIP, IPR 2017-01383, Page 6
`
`AT&T Exhibit 1018
`
`Chapter 2
`
`' Class 4 and 5 Switching
`
`Class 4 and 5 switches are the “brains” of the PSTN. Figure 2-3 illustrates
`the flow of a call from a handset to a Class 5 switch, which in turn hands
`the call off to a Class 4 switch for routing over a long—distance network.
`That call may be routed through other Class 4 switches before terminating
`at the Class 5 switch at the destination end of the call before being passed
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 6
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`:
`
`1 3
`
`
`
`
`
`“em =ZS<Z1%*§§ W
`
`
`Figure 2-3
`Relationship of Class
`4 and 5 switching
`
`Class Network and Relationship to Class 5
`Switching
`
`Class 4 Switch Denver
`
`Ciass 4 Switch St. Louis
`
`Class 4 Switch Chicago
`
` Class 5 Switch Chicago
`
`on to the terminating handset. Class 5 switches handle local calling and
`Class 4 switches handle long-distance calls. The performance metrics for
`the Class 4 and 5 have been reliability, scalability, quality of service (QoS),
`
`signaling, and features.
`
`Class 4 and 5 Architecture One reason for the reputation of Class 4
`and 5 switches being reliable is that they have been tested by time in the
`legacy market. Incremental improvements to the 4ESS included new inter-
`faces, hardware, software, and databases to improve Operations, Adminis-
`tration, Maintenance, and Provisioning (OAM&P). The inclusion of the 1A
`processor improved memory in the 4 and 515188 mainframe, allowing for
`translation databases. Ultimately, those databases were interfaced with the
`Centralized Automatic Reporting on Trunks (CAROT). Later, integrated cir—
`cuit chips replaced the magnetic core stores and improved memory and
`boosted the Busy Hour Call Attempt (BHCA) capacity to 700,000 BHCAs.2
`
`Class 4 and 5 Components The architecture of the Class 4 and 5 switch
`is the product of 25-plus years of design evolution. For the purposes of this
`discussion, the Nortel DMS—250, one of the most prevalent products in the
`North American Class 4 market, is used as a real-world example. The other
`
`
`
`
`
`zChapuis, Robert, and Amos Joel. “In the United States,AT&T’s Digital Switch Entry No. 4 ESS,
`First Generation Time Division Digital Switch.” Electronics, Computers, and {Ilelepkone Systems.
`New York: North Holland Publishing, 1990, p. 337—338.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 7
`
`
`
`
`
`leading product in this market is the 4ESS from Lucent Technologies. For
`local offices or Class 5, the most prevalent product is the SESS from Lucent.
`DMS—250 hardware, for example, is redundant for reliability and decreased
`downtime during upgrades. It has a reliability rating of 99.999 percent (the
`five 95), which meets the industry metric for reliability. The modular design
`of the hardware enables the system to scale from 480 to over 100,000 DSOs
`(individual phone lines). The density, or number of phone lines the switch
`can handle, is one metric of scalability. The DMS—25O is rated at 800,000
`BHCAS. Tracking BHCAs on a switch is a measure of call-processing capa—
`bility and is another metric for scalability.
`Key hardware components of the DMS—250 system include the DMS
`core, switch matrix, and trunk interface. The DMS core is the central pro-
`cessing unit (CPU) and memory of the system, handling high-level call pro-
`cessing, system control functions, system maintenance, and the installation
`of new switch software.
`The EMS-250 switching matrix switches calls to their destinations. Its
`nonblocking architecture enables the switch to communicate with periph-
`erals through fiber optic connections. The trunk interfaces are peripheral
`modules that form a bridge between the DMS-250 switching matrix and the
`trunks it serves. They handle voice and data traffic to and from customers
`and other switching systems. UNIS—250 trunk interfaces terminate DS—l,
`Integrated Services Digital Network (ISDN) Primary Rate Interface (PR1),
`X.75/X.75 packet networking, and analog trunks. They also accommodate
`test and service circuits used in office and facility maintenance. It is impor—
`tant to note that the Class 4 switching matrix is a part of the centralized
`architecture of the Class 4. Unlike the media gateways in a softswitch solu—
`tion, it must be collocated with the other components of the Class 4.
`DMS—250 billing requires the maintenance of real-time, transaction-
`based billing records for many thousands of customers and scores of vari—
`ants in service pricing. The DMS—250 system automatically provides
`detailed data, formats the data into call detail records, and constructs bills.3
`
`
`3Nortel Networks, “Product Service Information-DMS300/250 System Advantage.” www.
`nortel.com, 2001.
`
`AT&T v. VoIP, IPR 2017-01383, Page 8
`
`AT&T Exhibit 1018
`
`Chapter 2
`
`Private Branch Exchange (PBX)
`
`As the name would imply, a private branch exchange (PBX) is a switch
`owned and maintained by a business with many (20 or more) users. A key
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 8
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`
`
`system is used by smaller offices. PBXs and key systems today are com—
`puter based and enable soft changes to be made through an administration
`terminal or PC. Unless the business has a need for technical telecommuni-
`
`cations personnel on staff for other reasons, the business will normally con-
`tract with their vendor for routine adds, moves, and changes of telephone
`equipment.
`PBX systems are often equipped with key assemblies and systems,
`including voice mail, call accounting, a local maintenance terminal, and a
`dial-in modem. The voice mail system is controlled by the PBX and only
`receives calls when the PBX software determines a message can be left or
`retrieved. The call accounting system receives system message details on
`
`all call activities that occur within the PBX. The local terminal provides
`onsite access to the PBX for maintenance activities. The dial-in capability
`also provides access to the PBX for maintenance activities.‘1
`
`Centrex
`
`After PBXs caught on in the industry, local exchange carriers began to lose
`some of their more lucrative business margins. The response to the PBX
`was Centrex. Centrex is a service offered by a local telephone service
`provider (primarily to businesses) that enables the customer to have fea-
`tures that are typically associated with a PBX. These features include
`three- or four-digit dialing, intercom features, distinctive line ringing for
`inside and outside lines, voice mail, call-waiting indication, and others. Cen—
`trex services flourished and still have a place for many large, dispersed enti—
`ties such as large universities and major medical centers.
`
`One of the major selling points for Centrex is the lack of capital expen-
`diture up front. That, coupled with the reliability associated with Centrex
`due to its location in the telephone company central office, has kept Centrex
`as the primary telephone system in many of the businesses referenced pre-
`viously. PBXs, however, have cut into What was once a lucrative market for
`the telephone companies and are now the rule rather than the exception for
`business telephone service. This has come about because of inventive ways
`of funding the initial capital outlay and the significantly lower operating
`cost of a PBX versus a comparable Centrex offering.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`‘Harte, Lawrence. Tblecom Made Simple. Fuquay-Varina, NC: APDG Publishing. 2002.
`
`
`
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 9
`
`
`
`
`
`Multiplexing
`
`Chapter 2
`
`
`
`The earliest approach to getting multiple conversations over one circuit was
`frequency division multiplexing (FDM). FDM was made possible by the vac—
`uum tube where the range of frequencies was divided into parcels that were
`distributed among subscribers. In the first FDM architectures, the overall
`system bandwidth was 96 kHz. This 96 kHz could be divided among a num-
`ber of subscribers into, for example, 5 kHz per subscriber, meaning almost
`20 subscribers could use this circuit.
`FDM is an analog technology and suffers from a number of shortcom-
`ings. It is susceptible to picking up noise along the transmission path. This
`FDM signal loses its power over the length of the transmission path. FDM
`requires amplifiers to strengthen the signal ever that path. However, the
`amplifiers cannot separate the noise from the signal and the end result is
`an amplified noisy signal.
`The improvement over FDM was time division multiplexing (TDM).
`TDM was made possible by the transistor that arrived in the market in the
`1950s and 1960s. As the name would imply, TDM divides the time rather
`than the frequency of a signal over a given circuit. Although FDM was typ—
`ified by “some of the frequency all of the time,” TDM is “all of the frequency
`some of the time.” TDM is a digital transmission scheme that uSes a small
`number of discrete signal states. Digital carrier systems have only three
`valid signal values: one positive, one negative, and zero. Everything else is
`registered as noise. A repeater, known as a regenerator, can receive a weak
`and noisy digital signal, remove the noise, reconstruct the original signal,
`and amplify it before transmitting the signal onto the next segment of the
`transmission facility. Digitization brings with it the advantages of better
`maintenance and troubleshooting capability, resulting in better reliability.
`Also, a digital system enables improved configuration flexibility
`TDM has made the multiplexer, also known as the channel bank, possi—
`ble. In the United States, the multiplexer or “mux” enables 24 channels per
`single four-wire facility. This is called a TI, DS 1, or T—Carrier. Outside
`North America and Japan, it is 32. channels per facility and known as E1.
`These systems came on the market in the early 19605 as a means to trans—
`port multiple channels ofvoice over expensive transmission facilities.
`
`Voice Digitization via Pulse Code Modulation
`
`One of the first processes in the transmission of a telephone call is the con—
`version of an analog signal into a digital one. This process is called pulse
`
`AT&T v. VoIP, IPR 2017-01383, Page 10
`
`AT&T Exhibit 1018
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 10
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`
`
`code modulation (PCM). This is a four-step process consisting of pulse
`amplitude modulation (PAM) sampling, companding, quantization, and
`encoding.
`
`Pulse Amplitude Modulation (PAM) The first stage in PCM is known
`as PAM. In order for an analog signal to be represented as a digitally
`encoded bitstream, the analog signal must be sampled at a rate that is
`equal to twice the bandwidth of the channel over which the signal is to be
`transmitted. As each analog voice channel is allocated 4 kHz of bandwidth,
`each voice signal is sampled at twice that rate, or 8,000 samples per sec-
`ond. In a T—Carrier, the standard in North America and Japan, each chan—
`nel is sampled every one eight—thousandth of a second in rotation, resulting
`in the generation of 8,000 pulse amplitude samples from each channel
`every second. If the sampling rate is too high, too much information is
`transmitted and bandwidth is wasted. If the sampling rate is too low, alias-
`ing may result. Aliasing is the interpretation of the sample points as a false
`waveform due to the lack of samples.
`
`Companding The second process of PCM is companding. Companding is
`the process of compressing the values of the PAM samples to fit the non-
`linear quantizing scale that results in bandwidth savings of more than 30
`percent. It is called companding as the sample is compressed for transmis-
`sion and expanded for reception.5
`
`Quantization The third stage in PCM is quantization. In quantization,
`values are assigned to each sample within a constrained range. In using a
`limited number of bits to represent each sample, the signal is quantized.
`The difference between the actual level of the input analog signal and the
`digitized representation is known as quantization noise. Noise is a detrac—
`tion to voice quality and it is necessary to minimize noise. The way to do
`this is to use more bits, thus providing better granularity In this case, an
`inevitable trade-ofi'takes place bewteen bandwidth and quality. More band—
`width usually improves signal quality, but bandwidth costs money. Service
`providers, whether using TDM or Voice over IP (VoIP) for voice transmis-
`sion will always have to choose between quality and bandwidth. A process
`known as nonuniform quantization involves the usage of smaller
`
`
`
`5Shepard, Steven. SONET/SDH Demystified. New York: McGraw-Hill, 2001. p. 15—21.
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 11
`
`
`
`Chapter 2
`
`quantization steps at smaller signal levels and larger quantization steps for
`larger signal levels. This gives the signal greater granularity or quality at
`low signal levels and less granularity (quality) at high signal levels. The
`result is to spread the signal-to-noise ratio more evenly across the range of
`different signals and to enable fewer bits to be used compared to uniform
`quantization. This process results in less bandwidth being consumed than
`for uniform quantization.6
`
`
`
`Encoding The fourth and final process in PCM is encoding the signal.
`This is performed by a codec (coder/decoder). Three types of codecs exist:
`waveform codecs, source codecs (also known as vocoders), and hybrid
`codecs. Waveform codecs sample and code an incoming analog signal
`without regard to how the signal was generated. Quantized values of the
`samples are then transmitted to the destination where the original signal
`is reconstructed, at least to a certain approximation of the original. Wave-
`form codecs are known for simplicity with high-quality output. The disad—
`vantage of waveform codecs is that they consume considerably more
`bandwidth than the other codecs. When waveform codecs are used at low
`bandwidth, speech quality degrades markedly.
`Source codecs match an incoming signal to a mathematical model of how
`speech is produced. They use the linear predictive filter model of the vocal
`tract, with a voiced/unvoiced flag to represent the excitation that is applied
`to the filter. The filter represents the vocal tract and the voice/unvoiced flag
`represents whether a voiced or unvoiced input is received from the vocal
`chords. The information transmitted is a set of model parameters as
`opposed to the signal itself. The receiver, using the same modeling tech—
`nique in reverse, reconstructs the values received into an analog signal.
`Source codecs also operate at low bit rates and reproduce a synthetically
`sounding voice. Using higher bit rates does not result in improved voice
`quality. Voooders (source codecs) are most widely used in private and mili-
`tary applications.
`Hybrid codecs are deployed in an attempt to derive the benefits from
`both technologies. They perform some degree of waveform matching while
`mimicking the architecture of human speech. Hybrid codecs provide better
`voice quality at low bandwidth than waveform codecs. Table 2—1 provides an
`outline of the different ITU codec standards and Table 2-2 lists the para-
`meters of the voice codecs.
`
`
`
`“Collins, Daniel. Carrier Grade Voice Over IP. New York: McGraw—Hill, 2001. p. 95—96.
`
`AT&T v. VoIP, IPR 2017-01383 Pace 12
`
`AT&T Exhibit 1018
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 12
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`Table 2-1
`
`Descriptions of
`voice codecs (ITU)
`
`RBOU
`
`G114
`
`G165
`
`G168
`
`G711
`
`G722
`
`G.723.1
`
`G729
`
`G.729A
`
`H.323
`
`P.861
`
`fihescfipfion -
`
`<
`
`A subjective rating system to determine the Mean Opinion Score
`(MOS) or the quality of telephone connections
`
`A maximum one-way delay end to end for 3 Vol? call (150 ms)
`
`Echo cancellers
`
`Digital network echo cancellers
`
`PCM of voice frequencies
`
`7 kHz audio coding within 64 Kbps
`
`A dual-rate speech coder for multimedia communications transmitting
`at 5.3 and 6.3 Kbps
`
`Coding for speech at 8 Kbps using conjugate-structure algebraic code-
`excited linear-prediction (CS-ACELP)
`
`Annex A reduced complexity 8 Kbps CS—ACELP speech codec
`
`A packet—based multimedia communications system
`
`Specifies a model to map actual audio signals to their representations
`inside the human head
`
`Q.931
`
`Digital subscriber signaling system number 1 ISBN user-network
`interface layer 3 specification for basic call control
`_______.____.__———-————-—
`
`Table 2—2
`
`Parameters of voice
`codecs
`
`(3.711
`
`64
`
`(1721, G723, G726
`(3.728
`
`16,24,32,40
`15
`
`G729
`
`G.723.1
`
`8
`
`0.125
`
`0.125
`2.5
`
`10
`30
`
`Waveform
`
`4.8
`
`4.2
`4.2
`
`4.2
`3.5, 3.98
`
`____________._1_———————
`
`nel. There is an inevitable trade off in compression for voice quality in the
`
`Popular Speech Codecs Codecs are best known for the sophisticated
`compression algorithms they introduce into a conversation. Bandwidth
`costs service providers money. The challenge for many service providers is
`to squeeze as much traffic as possible into one “pipe,” that is one channel.
`Most codecs allow multiple conversations to be carried on one 64 kbps chan—
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 13
`
`
`
`
`
`Chapter 2
`
`conversation. The challenge for service providers is to balance the econom—
`ics of compression with savings in bandwidth costs.
`
`6.71 I G.711 is the best-known coding technique in use today. It is a wave-
`form codec and is the coding technique used in circuit-switched telephone
`networks all over the world. G711 has a sampling rate of 8,000 Hz. If
`uniform quantization were to be used, the signal levels commonly found in
`speech would be such that at least 12 bits per sample would be needed, giv—
`ing it a bit rate of 96 Kbps. Nonuniform quantization is used with eight bits
`used to represent each sample. This quantization leads to the well—known
`64 Kbps DSO rate. G.711 is often referred to as PCM. G.711 has two vari-
`ants: A—law and mu-law. Mu—law is used in North America and Japan where
`T-Carrier systems prevail. A—law is used everywhere else in the world. The
`difference between the two is the way nonuniform quantization is per—
`formed. Both are symmetrical at approximately zero. Both A—law and mu-
`law offer good voice quality with a MOS of 4.3, with 5 being the best and 1
`being the worst. Despite being the predominant codec in the industry, G711
`suffers one significant drawback; it consumes 64 Kbps in bandwidth. Car-
`riers seek to deliver voice quality using little bandwidth, thus saving on
`
`operating costs.
`
`6.728 LD—CELP Code-Excited Linear Predictor (LD—CELP) codecs imple-
`ment a filter and contain a codebook of acoustic vectors. Each vector
`contains a set of elements in which the elements represent various charac-
`teristics of the excitation signal. CELP coders transmit to the receiving end
`a set of information determining filter coefficients, gain, and a pointer to the
`chosen excitation vector. The receiving end contains the same code book and
`filter capabilities so that it reconstructs the original signal.
`(1728 is a
`backward-adaptive coder as it uses previous speech samples to determine
`the applicable filter coefficients. G728 operates on five samples at one time.
`That is, 5 samples at 8,000 Hz are needed to determine a codebook vector
`and filter coefficients based upon previous and current samples. Given a
`coder operating on five samples at a time, a delay of less than 1 millisecond
`is the result. Low delay equals better voice quality.
`The G728 codebook contains 1,024 vectors, which requires a 10—bit index
`value for transmission. It also uses 5 samples at a time taken at a rate of
`8,000 per second. For each of those 5 samples, G728 results in a transmit-
`ted bit rate of 16 Kbps. Hence, G728 has a transmitted bit rate of 16 Kbps.
`Another advantage here is that this coder introduces a delay of 0.625 mil-
`liseconds with an MOS of 3.9. The difference from G.711’s MOS of 4.3 is
`imperceptible to the human ear. The bandwidth savings between G.728’s 16
`Kbps per conversation and G.711’s 64 Kbps per conversation make G728
`very attractive to carriers given the savings in bandwidth.
`
`AT&T v. VoIP, IPR 2017-01383, Page 1‘
`
`AT&T Exhibit 101
`
`
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 14
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`E
`
`G.723.1 ACELP G.723.1 ACELP can operate at either 6.3 Kbps or 5.3 Kbps
`with the 6.3 Kbps providing higher voice quality. Bit rates are contained in
`the coder and decoder, and the transition between the two can be made dur-
`
`ing a conversation. The coder takes a bank-limited input speech signal that
`is sampled a 8,000 Hz and undergoes uniform POM quantization, resulting
`in a 16-bit PCM signal. The encoder then operates on blocks or frames of
`
`240 samples at a time. Each frame corresponds to 30 milliseconds of speech,
`which means that the coder causes a delay of 30 milliseconds. With a look-
`ahead delay of 75 milliseconds, the total algorithmic delay is 37.5 millisec—
`
`onds. G.723.1 gives an MOS of 3.8, which is highly advantageous in regards
`to the bandwidth used. The delay of 37.5 milliseconds one way does present
`an impediment to good quality, but the round-trip delay over varying
`aspects of a network determines the final delay and not necessarily the
`codec used.
`
`6.729 G729 is a speech coder that operates at 8 Kbps. This coder uses
`input frames of 10 milliseconds, corresponding to 80 samples at a sampling
`rate of 8,000 Hz. This coder includes a 5-millisecond look—ahead, resulting
`in an algorithmic delay of 15 milliseconds (considerably better than
`G.723.1). G729 uses an 80-bit frame. The transmitted bit rate is 8 Kbps.
`Given that it turns in an MOS of 4.0, G729 is perhaps the best tradepfi‘ in
`
`bandwidth for voice quality The previous paragraphs previde an overview
`of the multiple means of maximizing the efficiency of transport via the
`PSTN. We find today that TDM is almost synonymous with circuit switch-
`ing. Telecommunications engineers use the term TDM to describe a circuit—
`switched solution. A 64 Kbps G711 codec is the standard in use on the
`
`PSTN. The codecs described in the previous pages apply to VoIP as well.
`VoIP engineers seeking to squeeze more conversations over valuable band—
`
`width have found these codecs very valuable in compressing VoIP conver—
`sations over an IP circuit.7
`
`Signaling
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`
`Signaling describes the process of how calls are set up and torn down. Gen-
`erally speaking, there are three main functions of signaling: supervision,
`alerting, and addressing. Supervision refers to monitoring the status of a
`line or circuit to determine if there is traffic on the line. Alerting deals with
`the ringing of a phone indicating the arrival of an incoming call. Address-
`
`ing is the routing of a call over a network. As telephone networks matured,
`
`
`
`
`leid.
`
`
`
`
`
`
`
`
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 15
`
`
`
`Signaling System 7 (SS7) For much of the history of circuit-switched
`networks, signaling followed the same path as the conversation. This is
`called Channel-Associated Signaling (CAS) and is still in wide use today.
`R1 Multifiequency (MF) used in North American markets and R2 Multi-
`Fhequency Compelled (RFC) used elsewhere in the world are the best exam—
`ples of this. Another name for this is in—channel signaling. The newer
`technology for signaling is called Common Channel Signaling (COS), also
`known as out-of-‘oand signaling. 008 uses a separate transmission path for
`call signaling and not the bearer path for the call. This separation enables
`the signaling to be handled in a different manner to the call. This enables
`signaling to be managed by a network independent of the transport net—
`work. Figure 2—4 details the difference between CAS and CCS.
`
`
`
`E‘Stallings, William. ISBN and Broadband ISDN with Flume Relay and ATM New York: Pren-
`tice Hall, 1995. p.292.
`
`AT&T v. VoIP, IPR 2017-01383, Page 1 .
`
`AT&T Exhibit 101 :<
`
`Chapter 2
`
`individual nations developed their proprietary signaling systems. Ulti-
`mately, there become a signaling protocol for every national phone service
`in the world. Frankly, it is a miracle that international calls are ever com-
`
`pleted given the complexity of interfacing national signaling protocols.
`
`Figure 2'4
`CAS and CCS
`
`Speech and Signaling
`
`Channel Associated Signaling
`
`Signaling
`
`Common Channel Signaling
`
`AT&T Exhibit 1018
`AT&T v. VoIP, IPR 2017-01383, Page 16
`
`
`
`The Public Switched Telephone Network (PSTN)
`
`23
`
`Signaling System 7 (SS7) is the standard for CCS with many national
`variants throughout the world (such as Mexico’s NOM—112). It routes con-
`trol messages through the network to perform call management (setup,
`maintenance, and termination) and network management functions.
`Although the network being controlled is circuit switched, the control sig—
`naling is implemented using packet-switching technology. In effe