`Ostrover et al.
`
`lllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllllll
`US005469370A
`5,469,370
`[11] Patent Number:
`[45] Date of Patent:
`Nov. 21, 1995
`
`[54] SYSTEM AND METHOD FOR
`CONTROLLING PLAY OF MULTIPLE
`AUDIO TRACKS OF A SOFTWARE
`CARRIER
`
`[75]
`
`ln'lentors: Lewis S. Ostrover; Christopher J.
`Cookson; Warren N. Lieberfarb, all
`of Los Angeles, Calif.
`
`[73] Assignee: Time Warner Entertainment Co.,
`L.P., Burbank, Calif.
`
`[21] Appl. No.: 145,326
`
`Oct. 29, 1993
`
`[22] Filed:
`Int. Cl.6
`.....•....•.....•................••••..........•... GllB 27/028
`[51]
`[52] U.S. Cl . ....................... 364/514 R; 369/60; 3951156;
`84/610; 84/634
`[58] Field of Search ............................. 364/514; 395/100,
`3951144, 153-156; 369/49, 60, 99, 106,
`124; 84/477 R, 478, 609, 610, 634, DIG. 29
`
`[56]
`
`References Cited
`
`U.S. PATENT DOCUMENTS
`
`4,445,151
`4,523,236
`4,858,033
`4,995,026
`5,023,707
`5,092,216
`
`4/1984 Kinoshita et al ....................... 358/341
`6/1985 Hayashi et al .......................... 358/341
`8/1989 Chippendale .......................... 360114.2
`2/1991 Makabe et al. ........................... 369nO
`611991 Briggs ..................................... 358/343
`3/1992 Wadhams .......................... 84/DIG. 29
`
`5,130,815
`5,130,816
`5,146,833
`5,262,964
`5,280,572
`5,289,288
`5,340,939
`
`711992 Silverman et al ...................... 358/335
`711992 Yoshio .................................... 358/341
`9/1992 Lui ............................................ 84/462
`11/1993 Bonsall et al ........................... 364/514
`1/1994 Case et al ............................... 3951144
`2/1994 Silverman et al. . .................... 358/335
`8/1994 Kumagai ................................... 84/609
`
`OTHER PUBLICATIONS
`
`Connie Guglielmo, ''Teamwork Binds Multimedia Kids'
`Titles", MacWeek, vol. 6, No.2, p. 42(2), Jan. 13, 1992
`
`Primary Examiner-Emanuel T. Voeltz
`Assistant Examiner-Edward Pipala
`Attorney, Agent, or Firm-Gottlieb, Rackman & Reisman
`
`[57]
`
`ABSTRACT
`
`A system for selectively combining numerous audio tracks
`recorded on an optical disk, including subtracting, by phase
`inversion, particular tracks from a full mix. Each track might
`contain a recording of an individual instrument, or a group
`of instruments. When the disk is played, the player provides
`the user with menu choices-which tracks should be played
`and mixed together. In this way, the user can create his/her
`own orchestral mix, perhaps omitting an instrument in a
`"play along" application, or playing one instrument alone to
`be mimicked for practice purposes. The disk further includes
`track definitions in multiple languages so that the same disk
`can be distributed throughout the world, with menu choices
`being given in a language selected by the user.
`
`34 Claims, 11 Drawing Sheets
`
`GTL 1008
`IPR of U.S. Pat. No. 8,135,115
`
`0001
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 1 of 11
`
`5,469,370
`
`FIG. 1
`PRIOR ART
`VHS PLAYER
`
`7
`
`5
`
`9
`
`AID
`CONVERTER
`
`11
`
`VIDEO
`FRAME
`STORE
`
`13
`
`SYNC
`GENERATOR,
`STANDARDS
`CONVERTER,
`AND D/A
`CONVERTER
`
`VIDEO
`OUTPUT
`
`FIG. 8
`PRIOR ART
`
`B
`
`B
`
`B
`
`p
`
`B
`
`B
`
`B
`
`p
`
`BIDIRECTIONALLY
`INDEPENDENT PREDICTED
`INTRAFRAME
`FRAMES
`REFERENCE
`I FRAME
`
`Ill
`
`Ill
`Ill
`
`/
`
`I
`
`Ill ~ Ill
`Ill -
`
`BIDIRECTIONALLY
`PREDICTED
`FRAMES
`
`Ill
`Ill
`Ill
`Ill
`
`'
`
`11LETTER BOX 11
`ORIGINAL ASPECT RATIO
`
`I
`I
`I
`
`~~
`ASPECT RATIO ~ .. ~ -------lo~--.~....- --
`
`:
`
`'
`
`•
`
`' .
`
`• •
`
`: •
`
`11WIDE SCREEN 11
`
`11PAN SCAN 11
`REDUCED
`ASPECT RATIO ,_ _____ __.
`
`FIG. 9
`
`0002
`
`
`
`--...l =
`
`~
`...
`\C
`="
`~
`...
`01
`
`~
`~
`
`sa,
`N
`m.
`~
`
`!.It
`\C
`\C
`~
`
`N r-
`~
`z
`
`•
`rJ'J.
`0 •
`
`~ = ;-a
`~ •
`
`I
`
`\
`I ••• {t
`1.
`
`••
`
`I AMPLIFIER
`
`_ .,.
`a
`
`-1-
`
`67
`
`_
`
`DECODER
`
`VIDEO
`
`71
`
`-
`
`.. ,_ ___ __.
`
`DECODER
`PROCESSOR
`
`AUDIO
`
`93--VIDEO
`DIGITAL
`691C,__....,. __ _.
`_
`
`Eo
`
`95
`
`_.,
`
`(71
`
`BUFFER
`
`r--
`
`PAN SCAN
`
`.. "\
`57
`
`BUFFER
`VIDEO
`
`55-.
`
`r--
`_.,
`
`89
`
`r-
`
`87
`
`r-
`
`.--f...___----L-----,.
`
`v:~lli~L
`PROCESSOR/
`PAN SCAN
`
`_ f
`
`-
`
`P~~~8~N
`
`FRAME STORE,
`
`VIDEO
`
`~ INTERLACE
`
`73'G~~~~~R
`
`~
`
`'
`
`_r
`
`..
`
`BUFFERS
`AUDIO
`
`L.,.
`
`/53
`
`VI
`
`CONVERTER
`
`AND D/A
`
`~ •. ·:··-·<. --~
`
`----T
`~-._, .. , ·· .. ~
`
`.. ::::::::.--'
`
`BUFFER
`"-. BU?ER FULL ~ SUBTITLE
`97
`
`,
`
`_.X
`
`"-.
`85
`
`f
`54-~ 59/
`
`'1
`
`VIDEO 99
`
`CLOCK
`MASTER
`
`.
`
`79
`
`USER }-
`
`INTERFACE
`
`/51
`
`I
`
`DISPL~X_v~; ['"]~6/KEYER -GE~~~~:~9~:2
`
`.. COMMAND/DATA
`
`65
`
`_
`
`I
`
`(
`
`81
`
`'
`
`~ MICRO-
`
`I'
`CO~~~gtLER
`PROCESSOR
`
`'---l~ 41
`77 ~ CONTROL
`LOCK
`PARENTAL
`
`GATE -+ DEMULTIPLEXER
`
`.T.
`
`..
`
`_( 25
`
`.
`"'-,~~ DATA OUT
`
`..
`
`~
`
`ANALYZER
`POINTER ~ ~3
`
`----
`75 v
`
`~
`
`76---
`
`49 I
`
`I
`
`BLOCK NUMBER/ I ~
`.--4:...:...7""'-+--__I_ y __ --.. ~.--~
`
`29/ CONTROL
`3 1/ CONTROL POSITION
`I ~RATE
`2
`~);'}I . .
`Ul/\/:j.. . . ..
`f),
`
`/45
`
`'-27
`
`_
`
`43
`
`/
`
`~
`CONTROLLER
`
`I
`
`DISK DRIVE
`PROCESSOR
`
`MICRO-
`
`,..---JL..-----<---,.
`
`' HEAD
`
`0003
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 3 of 11
`
`5,469,370
`
`FIG. 3
`
`BITS
`MAX
`
`MIN
`
`COMMENTS
`
`1
`2
`40
`3
`1
`4
`0
`5
`12
`6 w
`7
`0
`8
`0
`9
`6
`10
`0
`11
`1CO
`12
`1
`13
`0
`14
`0
`15 w
`16
`4
`17
`18
`19
`20
`21
`22
`23
`24
`25
`26
`27
`
`20
`0
`0
`4
`10
`0
`
`0
`
`28
`0
`29 w
`
`30
`31
`32
`
`1
`0
`
`N "1"s, MAX 16
`Nx3
`Nx4
`M = OT063
`Mx4
`P "1"s, MAX 99
`
`R "1"s, MAX 99
`
`LEAD-IN SYNC BITS
`40 AUTHORIZED TERRITORIES
`1 SPECIAL SOFTWARE FLAG
`UNDET SPECIAL SOF1WARE W/ENDING SYNC WORD
`12 AUTHORIZED STANDARDS
`100 AVAILABLE AUDIO LANGUAGES, M&E PLUS 99
`48 TRACK TYPES
`64 TRACK CODING
`6 NUMBER OF "OTHER" AUDIO TRACKS
`252 CODING FOR "OTHER" AUDIO TRACKS
`100 AVAILABLE DISPLAY LANGUAGES
`1 SPECIAL MIXING/DELETION SOFTWARE FLAG
`UNDET SPECIAL MIXING/DELETION SOFTWARE W/ENDING SYNC WORD
`UNDET P x M STRINGS EACH ENDING WITH ESC CHARACTER
`100 AVAILABLE SUBTITLE LANGUAGES
`4 MULTIPLE VERSION CODE
`1 SPECIAL VERSION SOF1WARE FLAG
`UNDET SPECIAL VERSION SOF1WARE W/ENDING SYNC WORD
`VIDEO AVAILABILITY FLAG
`BASE ASPECT RATIO
`PAN SCAN AVAILABILITY
`20 TOTAL NUMBER OF DATA BLOCKS
`20 NUMBER OF DATA BLOCKS IN VERSION A
`20 NUMBER OF DATA BLOCKS IN VERSION B
`4 ORIGINAL FRAME RATE
`10 BLOCK TIME FACTOR
`UNDET TABLE OF CONTENTS FOR FIRST VERSION
`FOR EACH CHAPTER:
`8-BIT CHAPTER NUMBER
`20-BIT STARTING BLOCK SERIAL BLOCK NUMBER
`20-BIT BLOCK DURATION OF CHAPTER
`AVAILABLE CHAPTER DISPLAY LANGUAGES (100 BITS)
`LANGUAGE STRINGS IDENTIFYING CHAPTERS,
`EACH ENDING WITH ESC CHARACTER
`UNDET TABLE OF CONTENTS FOR SECOND VERSION
`1200 ENCRYPTED AUTHORIZATION CODE FOR EACH STANDARD
`W/ENDING SYNC WORD
`1 DATA BLOCK COMMAND/DATA FLAG
`1 SUPPLEMENTAL SOFTWARE FLAG
`UNDET SUPPLEMENTAL SOFTWARE W/ENDING SYNC WORD
`
`0004
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 4 of 11
`
`5,469,370
`
`FIG. 4
`
`BITS
`MIN MAX
`32
`32 SYNC WORD
`20
`20 SERIAL BLOCK NUMBER
`2
`2 VERSION (A,B OR COMMON)
`0
`2 2-BIT POINTER FLAG
`0
`20 POINTER
`0
`1 VIDEO PRESENT FLAG
`0
`UNDET VIDEO BLOCK W/ENDING SYNC WORD
`0
`16 AUDIO TRACKS PRESENT
`0 UNDET X AUDIO LANGUAGE BLOCKS,
`EACH ENDING W/ESC CHARACTER
`63 "OTHER" AUDIO TRACKS PRESENT
`UNDET Y "OTHER" AUDIO TRACK BLOCKS,
`EACH ENDING W/ESC CHARACTER
`99 SUBTITLE UPDATES PRESENT
`UNDET Z SUBTITLE UPDATE BLOCKS,
`EACH ENDING W/ESC CHARACTER
`1 PAN SCAN UPDATE FLAG
`9 PAN SCAN UPDATE
`COMMAND/DATA PRESENT FLAG
`UNDET COMMAND/DATA BLOCK ENDING W/ESCAPE CHARACTER
`
`2
`3
`4
`5
`6
`7
`8
`9
`
`10
`11
`
`12
`13
`
`14
`15
`16
`17
`
`0
`0
`
`0
`0
`
`0
`0
`0
`0
`
`COMMENTS
`
`X "1"s, MAX=16
`
`Y "1"s, MAX=63
`
`Z '1"s, MAX=99
`
`0005
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 5 of 11
`
`5,469,370
`
`FIG. SA
`
`READ DEFAULT
`SETIINGS:
`1. STANDARD
`2. LANGUAGE
`A. AUDIO
`B. SUBTITLES
`C. MENUS
`3. ASPECT RATIO
`4. PARENTAL LOCK
`STATUS
`
`NO
`
`DISPLAY MENU:
`• CHANGE DEFAULTS
`• TABLE OF CONTENTS
`• RETURN
`
`RETURN
`
`CHANGE
`DEFAULTS
`.----''----.
`CHANGE
`DEFAULTS
`
`TABLE
`OF
`CONTENTS
`
`SETTOC
`FLAG=1
`
`READ
`AUTHORIZED
`TERRITORIES
`
`READ SPECIAL
`SOFTWARE FLAG
`
`NO
`
`IS
`SPECIAL
`SOFTWARE
`PRESENT
`?
`
`READ SPECIAL
`SOFTWARE
`
`EXECUTE SPECIAL
`SOFTWARE
`
`READ DEFAULT
`STANDARD
`
`READ AUTHORIZED
`STANDARDS
`
`YES
`
`IS
`DEFAULT
`STANDARD
`AUTHORIZED
`?
`
`DISPLAY AUTHORIZED
`STANDARDS
`
`SELECT AN
`AUTHORIZED
`STANDARD
`
`READ AVAILABLE
`AUDIO LANGUAGES
`& TRACK TYPES
`
`IS
`DEFAULT
`AUDIO LANGUAGE
`AVAILABLE
`?
`
`0006
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 6 of 11
`
`5,469,370
`
`FIG. 58
`
`DISPLAY AVAILABLE
`AUDIO LANGUAGES
`
`SELECT AN AVAILABLE
`AUDIO LANGUAGE
`c l-----!11111
`
`READ CODING FOR
`SELECTED AUDIO
`LANGUAGE
`
`READ NUMBER
`OF"OTHER"
`AUDIO TRACKS
`
`IS
`SPECIAL
`MIXING/DELETION
`SOFTWARE
`AVAILABLE
`?
`
`NO
`
`READ SPECIAL MIXING/
`DELETION SOFTWARE
`
`READ "OTHER"
`AUDIO TRACKS MENU
`
`DISPLAY "OTHER"
`AUDIO TRACKS MENU
`
`0
`
`SELECT "OTHER"
`AUDIO TRACKS
`PLAY MODE
`
`READ CODING FOR
`AVAILABLE "OTHER"
`AUDIO TRACKS
`
`READ AVAILABLE
`MENU LANGUAGES
`
`NO
`
`SUBTITLES
`DESIRED IN
`DEFAULT
`?
`
`YES
`
`READ AVAILABLE
`SUBTITLE LANGUAGES
`
`YES
`
`IS
`DEFAULT
`MENU LANGUAGE
`AVAILABLE
`?
`
`YES
`
`DISPLAY AVAILABLE
`MENU LANGUAGES
`
`DISPLAY AVAILABLE
`SUBTITLE LANGUAGES
`
`SELECT AN AVAILABLE
`MENU LANGUAGE
`
`SELECT AN AVAILABLE
`SUBTITLE LANGUAGE
`
`0007
`
`
`
`U.S. Patent
`
`FIG. 5C
`
`Nov. 21, 1995
`
`Sheet 7 of 11
`
`5,469,370
`
`READ 4-BIT MULTIPLE
`VERSION CODE
`
`READ SPECIAL
`VERSION SOFTWARE
`
`DETERMINE IF
`MULTIPLE VERSIONS
`ARE AVAILABLE
`
`EXECUTE SPECIAL
`VERSION SOFTWARE
`
`YES
`
`DISPLAY CHOICES:
`• AUTHORIZED
`VERSIONS
`• EXIT
`
`SELECT VERSION
`OR EXIT
`
`READ VIDEO
`AVAILABILITY
`BIT
`
`IS
`PARENTAL
`LOCK ON
`?
`
`IS
`VERSION A
`RATED "R"
`?
`
`ABORT
`
`NO
`
`NO
`
`READ BASE
`ASPECT RATIO
`
`NO
`
`AUTHORIZE
`VERSION A
`
`IS
`VERSION B
`RATED "R"
`?
`
`YES
`
`F
`
`0008
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 8 of 11
`
`5,469,370
`
`IS
`DEFAULT
`PAN SCAN
`4:3
`?
`
`READ PAN SCAN
`AVAILABILITY BIT
`
`IS
`PAN SCAN
`AVAILABLE
`?
`
`DISPLAY CHOICES:
`• CENTER CUT
`• LETTER BOX
`
`SELECT DISPLAY
`MODE
`
`READ NUMBER
`OF BLOCKS
`(20 OR 60 BITS)
`
`READ ORIGINAL
`FRAME RATE
`
`READ BLOCK
`TIME FACTOR
`
`READ TABLE OF
`CONTENTS FOR
`SELECTED VERSION
`
`IS
`DEFAULT
`MENU LANGUAGE
`AVAILABLE
`?
`
`YES
`
`DISPLAY AVAILABLE
`CHAPTER DISPLAY
`LANGUAGES
`
`SELECT CHAPTER
`DISPLAY LANGUAGE
`
`CALCULATE TABLE
`OF CONTENTS TIME
`DURATIONS
`
`0
`
`SET START AT
`BEGINNING
`
`DISPLAY TABLE
`OF CONTENTS
`
`SELECT
`START POINT
`
`READ ENCRYPTED
`AUTHORIZATION
`CODE FOR
`SELECTED STANDARD
`
`READ
`PREDETERMINED
`DISK DATA
`FOR SELECTED
`STANDARD
`
`0009
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 9 of 11
`
`5,469,370
`
`FIG. 5E
`
`COMPUTE
`AUTHORIZATION
`CODE FOR SELECTED
`STANDARD FROM
`PREDETERMINED
`DISK DATA
`
`DECRYPT
`AUTHORIZATION
`CODE FOR
`SELECTED STANDARD
`
`READ DATA BLOCK
`COMMAND/DATA FLAG
`
`READ SUPPLEMENTAL
`SOFlWARE BIT
`
`IS
`SUPPLEMENTAL
`SOFlWARE
`PRESENT
`?
`
`NO
`
`READ SUPPLEMENTAL
`SOFlWARE AND
`INTEGRATE WITH
`OPERATING SYSTEM
`
`YES
`
`YES
`
`PROCESS
`INTERRUPT
`
`GOTO START
`POINT
`
`IS ANY
`BUFFER FULL
`?
`
`CONTROLLER
`INTERRUPT
`?
`
`NO
`
`READ BLOCK
`NUMBER
`
`IS
`THIS THE
`NEXT REQUIRED
`BLOCK
`?
`
`READ POl NTER
`DATA
`
`PROCESS POINTER
`DATA TO
`IDENTIFY NEXT
`REQUIRED BLOCK
`
`READ REMAINDER
`OF DATA BLOCK
`AND LOAD INTO
`BUFFERS
`
`0010
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 10 of 11
`
`5,469,370
`
`FIG. 6
`
`YES
`
`LOAD MARKER
`IN SUBTITLE
`BUFFER
`
`LOAD MARKER
`IN VIDEO
`BUFFER
`
`LOAD SUBTITLE
`BUFFER
`
`LOAD VIDEO
`BUFFER
`
`IS 1ST
`SELECTED
`AUDIO
`PRESENT
`?
`
`NO
`
`LOAD MARKER
`IN AUDIO
`BUFFER
`
`LOAD AUDIO
`BUFFER
`
`NO
`
`YES
`
`LOAD MARKER
`IN AUDIO
`BUFFER
`
`READ PAN SCAN
`UPDATE FLAG
`
`IS
`PAN SCAN
`UPDATE
`PRESENT
`?
`
`LOAD PAN
`SCAN BUFFER
`
`READ
`COMMAND/DATA
`FLAG
`
`IS
`COMMAND/DATA
`PRESENT
`?
`
`LOAD MARKER
`IN PAN SCAN
`BUFFER
`
`LOAD MARKER
`IN RAM
`
`LOAD AUDIO
`BUFFER
`
`LOAD COMMAND/
`DATA IN RAM
`
`0011
`
`
`
`U.S. Patent
`
`Nov. 21, 1995
`
`Sheet 11 of 11
`
`5,469,370
`
`JUMP OVER B
`(TO ANOTHER A)
`
`A
`
`NEXT
`00
`
`FIG. 7A
`
`jUMP OVER A
`10 (P)
`
`LEGEND
`
`11 (P)
`
`10 (P)}
`10 [P]}
`l
`11 [P
`
`FIG. 78
`
`.
`
`Use Pomter P
`
`Ignore Pointer P
`
`CODE
`00 = Continue to next block
`01 = jump to same version or common, one pointer
`1 0 = Branch from common:
`Next block is an A, pointer is to a 8 or a C
`(if version A is being played, continue to next block)
`(if version 8 is being played, jump to block identified by pointer)
`11 = Branch from common:
`Next block is a 8, pointer is to an A or a C
`(if version A is being played, jump to block identified by pointer)
`(if version 8 is being played, continue to next block)
`
`0012
`
`
`
`5,469,370
`
`1
`SYSTEM AND METHOD FOR
`CONTROLLING PLAY OF MULTIPLE
`AUDIO TRACKS OF A SOFTWARE
`CARRIER
`
`This invention relates to the generation of audio signals
`during play of a software carrier, and more particularly to a
`technique by which ·numerous audio signals may be
`recorded on separate audio tracks of the same carrier in a
`manner which permits the selective processing and combin- 10
`ing of such signals.
`
`5
`
`2
`the full mix recording so that track addition effects signal
`deletion.)
`It is an object of this invention to provide a system and
`method for a software (e.g., music) publisher to record on a
`software carrier, such as an optical disk, numerous audio
`tracks in a way which allows them, during play, to be
`processed and combined in accordance with menu choices
`made by the user. In a sense, the invention constitutes a new
`form of consumer entertainment.
`
`SUMMARY OF THE INVENTION
`
`The invention can be best appreciated by considering an
`BACKGROUND OF THE INVENTION
`application which it is believed will be among the most
`The most widespread medium for distributing motion 15
`important uses of the invention. An optical disk has recorded
`on it dozens of audio tracks, each of which represents play
`pictures is the videocassette. The conventional practice is to
`of an individual musical instrument, play of a group of
`provide only one language soundtrack on each videocas-
`musical instruments, or a vocal rendition. The user may
`sette. This means that different versions of the same motion
`select which of the audio signals should be combined to
`picture must be prepared for distribution in different coun-
`tries. Rather than to dedicate a different version of the same 20 generate a composite "result." The level (gain) of each track
`in the composite output signal, developed in an audio
`motion picture to each of several different languages, it
`would be far more advantageous to provide all desired sound
`processor decoder, can be adjusted by the user so that
`tracks, containing different dialog languages, on the same
`particular instruments can be emphasized or eliminated.
`carrier; this would require the production of far fewer
`Thus a user who wants to practice as a soloist with a full
`versions of the same motion picture. Because of the large 25 orchestra could, for example, eliminate the "piano" track
`storage requirements, however, this has not proven to be ·
`and play along with the rest of the orchestral mix. (An
`practical. In fact, the only practical consumer use of multiple
`alternative approach is to provide a track with a full arches-
`tral mix, and to use a negative gain for the piano track-in
`sound tracks on the same carrier is the provision of anno-
`tated and non-annotated soundtracks in some laserdisc
`effect, the piano play would be deleted from the full arches-
`releases. (It is possible, for example, to store different 30
`tral recording.) Or the user might single out the piano track
`soundtracks in the digital and analog audio channels of a
`by itself, and play along with it.
`laserdisc.)
`Certainly, the idea of providing "music minus one" pre-
`Despite the fact that it has occurred to others in the prior
`recorded media, in which one instrument is left out of the
`art to provide multiple soundtracks on the same software 35 mix, is not new. But providing a recording with one instru-
`ment left out does not approach the flexibility contemplated
`carrier, certainly the provision of perhaps a dozen different
`soundtracks, in different dialog languages, all on the same
`by the present invention, in which every one of dozens of
`consumer software carrier, is not to be found anywhere. Not
`audio tracks can be controlled (mixed or deleted) by the user.
`It is contemplated that use of the present invention could
`only are there no consumer players capable of selecting one
`from among so many different soundtracks, but software 40 extend the karaoke form of entertainment significantly. For
`publishers have just not found it practical to store so much
`example, a newly arrived player of an instrument could join
`a group of musicians already rehearsing simply by deleting
`audio information on a single carrier. The traditional
`approach is to publish different versions of the same motion
`another track from an overall orchestral recording.
`picture for distribution in different territories where different
`The invention is described in the context of an optical disk
`45 and the digital storage of audio signals. It is to be under-
`languages are spoken.
`The idea of providing numerous different soundtracks, in
`stood, however, that the invention is not limited to a par-
`ticular medium, and it is applicable to ta,pe carriers and all
`different dialog languages, all on the same consumer soft-
`ware carrier, would not only be attractive to motion picture
`digital storage media, not just the optical disks of the
`companies but could give rise to totally different applica-
`illustrative embodiment of the invention. Nor is the term
`tions of the same underlying technology. These applications 50 "audio signal" restricted to music. For example, the inven-
`tion is applicable to a carrier containing different types of
`would make use of audio tracks, representing audio.signals,
`in ways which may be totally unrelated to the play of motion
`sound effects recorded in respective tracks, with the user
`pictures. In fact, the present invention contemplates the
`thus being able to mix them selectively depending on the
`storage of dozens of different audio signals on a software
`particular application at hand. Another example would be
`carrier which does not even contain visual information or 55
`the provision of alternate PG-rated soundtrack segments to
`conventional dialog soundtracks.
`accompany a motion picture which is R-rated only because
`of explicit language in its soundtrack; by switching between
`The storage is ditigal. A particular advantage of the
`the soundtracks, a PG-rated version of the motion picture
`present invention is that it allows the "music minus one"
`may be played.
`concept to be expanded and perfected. For example, several
`digital audio tracks, each devoted to a particular instrument 60
`In the illustrative embodiment of the invention, there may
`or group of instruments, can be digitally subtracted from a
`be up to 63 tracks with different audio signals that can be
`track representing a full orchestral mix. Not only does this
`mixed or combined with one another. (The word "com(cid:173)
`allow unheard of flexibility, but the digital subtraction
`bined" is used in a broad sense-it includes subtracting one
`signal from another, in effect deleting one signal from
`(phase inversion) allows near perfect results when compared
`with analog techniques. (Instead of the phase inversion 65
`another more inclusive signal.) A design question that must
`being accomplished in subtraction hardware, specific tracks
`be confronted is how the user is told which signal is recorded
`could be recorded 180 degrees out of phase compared with
`in each track. Another issue is how the user informs the
`
`0013
`
`
`
`5,469,370
`
`5
`
`15
`
`4
`Provision of multiple-language audio tracks and multiple(cid:173)
`language subtitle tracks on a single disk, with the user
`specifying the language of choice.
`Provision of multiple "other" audio tracks, e.g., each
`containing some component of orchestral music, with
`the user choosing the desired mix.
`Variable rate encoding of data blocks, and efficient use of
`bit capacity with track switching and/or mixing, to
`allow all of the above capabilities on a single carrier.
`Further objects, features and advantages of the invention
`will become apparent upon consideration of the following
`detailed description in conjunction with the drawing, in
`which:
`FIG. 1 depicts a prior art system and typifies the lack of
`flexibility in, and the poor performance of, presently avail(cid:173)
`able media players;
`FIG. 2 depicts the illustrative embodiment of the inven(cid:173)
`tion;
`FIG. 3 is a chart which lists the fields in the lead-in portion
`20 of the digital data track of an optical disk that can be played
`in the system of FIG. 2;
`FIG. 4 is a similar chart which lists the fields in each of
`the data blocks which follow the lead-in track section of
`FIG. 3;
`FIGS. SA-SE comprise a flowchart that illustrates the
`processing by the system of FIG. 2 of the data contained in
`the lead-in track section of an optical disk being played;
`FIG. 6 is a flowchart that illustrates the processing of the
`data blocks, in the format depicted in FIG. 4, that follow the
`lead-in section of the track;
`FIG. 7 A is a state diagram and legend that characterize the
`manner in which the player of the invention reads only those
`data blocks on a disk track that are required for the play of
`a selected version of a motion picture or other video pre(cid:173)
`sentation, and FIG. 7B depicts the way in which one of two
`alternate versions can be played by following the rules
`illustrated by the state diagram of FIG. 7A;
`FIG. 8 depicts symbolically a prior art technique used in
`compressing the digital representation of a video signal; and
`FIG. 9 illustrates the relationships among three different
`image aspect ratios.
`
`3
`player of which tracks should be combined. And however
`the user is informed of what is on the disk, since music is in
`a sense universal and not limited to a particular spoken
`language, it would be very convenient to communicate with
`the user in a language which he understands without requir-
`ing the production of different disks for different languages
`when language may be irrelevant to the music itself.
`In accordance with the invention, the lead-in section of
`the software carrier includes textual data which defines the
`audio contents of the individual tracks. The textual data is in 10
`multiple languages. There is a code recorded for each
`available language, and each player represents a default
`language- for players sold in France, for example, French
`would be the default language. The player software com(cid:173)
`pares the player default language with those available on the
`disk. If there is a match, then only the French textual data is
`used to form the menu displays. On the other hand, if the
`default language is not represented on the disk, then the user
`is provided with a list of the represented languages so that
`he may select a different default language.
`The player software includes provision for displaying the
`track information, and allowing the user to select which
`tracks should be played or not, and their respective levels
`(gains). The menu displays and the mechanisms for making
`selections, controlling levels, etc. can be of standard types. 25
`Menu displays and the inputting of selections are well
`known to users of personal computers and consumer elec(cid:173)
`tronic products. (For example, conventional videocassette
`recorders are controlled by menu displays and user selec(cid:173)
`tions.)
`The use of predetermined software for controlling menu
`displays and user inputs limits flexibility-the only variabil-
`ity is in the track definitions which, of course, depend on the
`audio content of the particular disk being played. But it is
`possible that in some cases non-standard menu choices will 35
`be desired. For example, the standard software which is
`contemplated processes the audio signals in a synchronized
`fashion. But suppose that a software publisher, for a par(cid:173)
`ticular disk, desires to allow the user to alter the phase
`relationships among the various tracks, e.g., by delaying 40
`tracks relative to each other in order to achieve non-standard
`reverberation effects. This would require not only different
`types of menu choices, but it would require changes in the
`basic processing software (e.g., delaying the processing of
`data in one track relative to another). Even this, however, is 45
`accommodated for in the invention. It is commonplace today
`to distribute software on optical disks. All that is necessary
`is to include on the optical disk of the invention software
`which is loaded into the player when the disk is first read,
`which software modifies or replaces the standard software 50
`for processing the audio track data. In this way, total
`flexibility can be achieved.
`The invention is disclosed in the context of an overall
`system which offers numerous advantageous features. The
`entire system is described although the appended claims are
`directed to specific features. The overall list of features
`which are of particular interest in the description below
`include:
`Video standard and territorial lock out.
`Play in multiple aspect ratios.
`Play of multiple versions, e.g., PG-rated and R-rated, of
`the same motion picture from the same disk, with
`selective automatic parental disablement of R-rated
`play.
`Encrypted authorization codes that prevent unauthorized
`publishers from producing playable disks.
`
`30
`
`THE PRIOR ART
`
`The limitations of the prior art are exemplified by the
`system of FIG. 1. Such a system is presently available for
`playing a single source of program material, usually a VHS
`videocassette, to generate a video signal conforming to a
`selected one of multiple standards. A system of this type is
`referred to as a multi-standard VCR, although stand-alone
`components are shown in the drawing. Typically, a VHS
`tape 7 has recorded on it an NTSC (analog) video signal, and
`the tape is played in a VHS player 5. The analog signal is
`converted to digital form in AID converter 9, and the digital
`55 representations of successive frames are written into video
`frame store 11. Circuit 13 then deletes excess frames, or
`estimates and adds additional frames, necessary to conform
`to the selected standard, e.g., PAL. To convert from one
`standard to another, it is generally necessary to change the
`60 number of horizontal lines in a field or frame (image
`scaling). This is usually accomplished by dropping some
`lines, and/or repeating some or averaging successive lines to
`derive a new line to be inserted between them. The main
`function of circuit 13, of course, is to convert a digital frame
`representation to analog form as the video output.
`Systems of the type shown in FIG.1 generally degrade the
`video output. Conventional videocassettes deliver reduced
`
`65
`
`0014
`
`
`
`5,469,370
`
`5
`quality video when they support more than one video
`standard. One reason is that there is a double conversion
`from analog to digital, and then back again. Another is that
`the image scaling is usually performed in a crude manner
`(deleting lines, repeating lines and averaging lines). There
`are known ways, however, to perform image scaling in the
`digital domain without degrading the picture. While not
`generally used, the technique is in the prior art and will
`therefore -be described briefly as it is also used in the
`illustrative embodiment of the invention.
`To give a concrete example, the PAL standard has 625
`lines per frame, while the NTSC standard has 525 lines per
`frame. Because no part of the image is formed during the
`vertical retrace, not all of the horizontal line scans in either
`system are usable for representing image information. In the
`PAL standard there are nominally 576 lines per frame with
`image information, and in an NTSC frame there are nomi(cid:173)
`nally 483 lines with image information.
`To convert from one standard to another, successive fields
`are first de-interlaced. Then 576lines are converted to 483,
`or vice versa, and re-interlaced. How this is done is easy to
`visualize conceptually. Consider, for example, a very thin
`vertical slice through a PAL frame. The slice is broken down
`into its three color components. Image scaling for convert(cid:173)
`ing from PAL to NTSC, from a conceptual standpoint, is
`nothing more than drawing a curve based on 576 PAL pieces
`of color data and then dividing the curve into 483 parts to
`derive a piece of data for each horizontal line of the desired
`NTSC signal. In actuality, this is accomplished by a process
`of interpolation, and it is done digitally. (Image scaling, in
`general, may also involve a change in the aspect ratio, for
`example, in going from HDTV to NTSC, and may require
`clipping off information at both ends of every horizontal
`line.)
`While prior art systems thus do provide for standards
`conversion, that is about the extent of their flexibility. The
`system of FIG. 2, on the other hand, offers unprecedented
`flexibility in ways not even contemplated in the prior art.
`The lllustrative System of the Invention
`The system of FIG. 2 includes a disk drive 21 for playing
`an optical disk 23. Digital data stored on the disk appears on
`the DATA OUT conductor 25. The disk drive operation is
`governed by microprocessor disk drive controller 27. The
`read head is positioned by commands issued over HEAD
`POSffiON CONTROL lead 29, and the speed of the disk
`rotation is governed by commands issued over RATE CON(cid:173)
`TROL conductor 31. Optical disks are usually driven at
`either constant linear velocity or constant angular velocity.
`(Another possibility involves the use of a discrete number of
`constant angular velocities.) Disks of the invention may be
`driven at constant linear velocity so that the linear length of
`track taken by each bit is the same whether a bit is recorded
`in an inner or outer portion of the track. This allows for the
`storage of the most data. A constant linear velocity requires
`that the rate of rotation of the disk decrease when outer
`tracks are being read. This type of optical disk control is
`conventional. For example, the CD audio standard also
`requires disks which are rotated at a constant linear rate.
`Microprocessor 41 is the master controller of the system.
`As such, it issues commands to the disk drive controller over
`conductor 43 and it determines the status of the disk drive
`controller over conductor 45. The disk drive controller is
`provided with two other inputs. Block number/pointer ana(cid:173)
`lyzer 47 issues commands to the disk drive controller over
`conductor 49, and BUFFER FULL conductor 51 extends a
`control signal from OR gate 54 to the disk drive controller.
`These two inputs will be described below. (In general,
`
`10
`
`55
`
`20
`
`6
`although reference is made to individual conductors, it is to
`be understood that in context some of these conductors are
`in reality cables for extending bits in parallel. For example,
`while the output of OR gate 54 can be extended to the disk
`5 drive controller over a single conductor 51, block number/
`pointer analyzer 47 could be connected to the disk drive
`controller over a cable 49 so that multi-bit data can be sent
`in parallel rather than serially.)
`An important feature of the system of FIG. 2 is that bit
`information is stored on the disk at a rate which varies
`according to the complexity of the encoded material. By this
`is meant not that the number of bits per second which
`actually appear on the DATA OUT conductor 25 varies, but
`rather that the number of bits which are used per second
`varies. Video information is stored in compressed digital
`15 form. FIG. 8 shows the marmer in which video frames are
`coded according to the MPEG 1 and MPEG2 standards. An
`independent 1-frame is coded in its entirety. Predicted or
`P-frames are frames which are predicted based upon pre(cid:173)
`ceding independent frames, and the digital information that
`is actually required for a P frame simply represents the
`difference between the actual frame and its prediction.
`Bidirectionally predicted B-frames are frames which are
`predicted from I and/or P frames, with the information
`required for such a frame once again representing the
`25 difference between the actual and predicted forms. (As can
`be appreciated, fast forward and fast reverse functions, if
`desired, are best implemented using !-frames.) The number
`of bits required to represent any frame depends not only on
`its type, but also on the actual visual information which is to
`30 be represented. Obviously, it requires far fewer bits to
`represent a blue sky than it does