throbber
('12)
`
`United States Patent
`Svensson et a1.
`
`US 7,356,680 B2
`(10) Patent No.:
`
` (45) Date of Patent: Apr. 8, 2008
`
`USOO73566SOB2'
`
`(54)
`
`METHOD OF LOADING INFORMATION
`INTO A SLAVE PROCESSOR IN A
`
`MULTI—PROCESSOR SYSTEM USING AN
`OPERATING—SYSTEM—FRIENDLY BOOT
`LOADER
`
`(75)
`
`Inventors: Mats Svensson, Lund (SE); Peter
`Aulin, Malmo (SE); Nlclas Bauer,
`Malmo (SE); Michael Rosenberg,
`Sodra Sandby (SE)
`
`(73)
`
`Assignee:
`
`'I‘elefonaktiebolaget L M Ericsson
`(publ), Stockholm (SB)
`
`(*1
`
`Notice:
`
`Subject to any disclaimer, the term of this
`patent is extended or adjusted under 35
`USC. 154(b) by 482 days.
`
`(21)
`
`App]. No,: 11/040,798
`
`(22)
`
`(65)
`
`(51)
`
`(52)
`
`(58)
`
`(55)
`
`Filed:
`
`Jan. 22, 2005
`
`Prior Publication Data
`
`US 2006/0168435 A1
`
`Jul. 27, 2006
`
`Int. Cl.
`(2006.01)
`G06F 9/00
`U.S. Cl.
`.............................. 713/1; 713/2; 713/100;
`709/208; 709/212; 709/213
`Field of Classification Search
`713/1,
`713/2,100; 709/208, 212, 213
`See application file for complete search history.
`References Cited
`
`U.S. PATENT DOCUMENTS
`
`7/1990 Kopp et 111.
`4,943,911 A
`11/1991 Bruckert et a1.
`5,068,780 A
`5,155,833 A "‘ 10/1992 Cullison et al.
`..... 713/2
`
`5,347,514 A *
`9/1994 Davis et a1.
`.......
`.. 370/429
`5,652,886 A
`7/1997 Tulpule et 211.
`5,754,863 A ”'
`5/1998 Router ........................ 717/173
`5,799,186 A
`8/1998 Compton
`5,835,784 A
`11/1998 Gillespie et :11.
`5,944,820 A
`8/1999 Beclitz
`
`l
`
`................... 713/2
`
`6,012,142 A *
`6,058,474 A
`6,438,683 131
`6,490,722 B1
`6,601,167 131
`6,684,397 131
`6,691,216 132
`
`1/2000 Dokic eta].
`5/2000 1321112 61 :11.
`8/2002 Endsley
`12/2002 Barton et :11.
`7/2003 Gibson et al.
`1/2004 Byeretal.
`2/2004 Kelly e1 'dl,
`
`(Continued)
`FOREIGN PATENT DOCUMENTS
`
`EP
`
`1460539 A2
`
`9/2004
`
`(Continued)
`OTHER PUBLICATIONS
`
`PCT International Search Report, mailed Aug. 8, 2006, in connec-
`tion with International Application No, PCT/EP2006/000351.
`
`(Continued)
`
`Primary Examiner——»~Thomas Lee
`Assistant Examiner~~~~~~~~~~~~Malcolm D Cribbs
`(74) Attorney, Agent, or Finn~~~~~~Potomac Patent Group
`PLLC
`
`(57)
`
`ABSTRACT
`
`A conventional bootloadcr can conflict with the operating
`system (OS) of a multi-processor system. An OS-friendly
`bootloader and methods are described that integrate an OS
`with a bootloader in any system in which a host processor
`and a client processor have a communication mechanism
`that requires the OS for the mechanism to work and the
`client has two memory systems: one visible to both host and
`client and one visible only to the client.
`
`23 Claims, 2 Drawing Sheets
`
`102
`
`Non-Volatile
`
`Memory
`
`
`
`ARM
`________________ J. _. .. _ .. .. ..
`
`DSP
`
`
`
`:
`int.
`DSP
`Store;
`SARAM
`
`Area '& DARAM
`
`
`
`
`
`
`AINTEL1O1O
`
`INTEL 1010
`
`

`

`US 7,356,680 .82
`Page 2
`
`U.S. PATENT DOCUMENTS
`
`OTHER PUBLJCATIONS
`
`6,760,785 Bl
`6,810,478 B1
`2002/0059560 Al"‘
`2002/0138156 A1
`2003/0126424 A1
`2004/0059906 A1
`2004/0088697 A1
`2004/0215952 A1
`
`7/2004
`10/2004
`5/2002
`9/2002
`47/2003
`3/2004
`5/2004
`10/2004
`
`Powderly et a1.
`Anand et a].
`Phillips ..................
`Wong et a1.
`Horan'ly at al.
`Park e1 :11.
`Schwartz et al.
`Oguma
`
`FOREIGN PATENT DOCUMENTS
`0 1/277 53 A2
`4/2001
`
`W0
`
`717/124
`
`in connection with
`PCT Written Opinion, mailed Aug. 8, 2006,
`Intemation‘al Application No. PCT/EP2006/00351.
`Hyde,
`J., “How to Make Pentium Pros Cooperate", BYTE,
`McGraw-Hill, Inc. St. Peterborough, US, vol. 21, No. 4, Apr. 1996,
`pp. 1770178, XP000586039.
`L
`Winderweedle, B. et al., ‘TMSSZOVC5470/5471 Bootloader Appli-
`cation Report”, Texas Instrunients, Dallas, TX, SPRA376, Jun.
`2002.
`“OMAP5910 Dual-Core Processor DSP Subsystems Reference
`Guide", Texas Instwments, Dallas, TX, SPRU672, Oct. 2003.
`
`"‘ cited by examiner
`
`

`

`U.S. Patent
`
`Apr. 8, 2008
`
`Sheet 1 of 2
`
`US 7,356,680 B2
`
`u‘
`
`
`
`03?
`
`.
`
`'
`Non—Volatile
`:
`}
`ARM
`________________ .1.--__......._._...__.....__.___._..._...
`
`103
`
`100
`

`
`
`
`
`DSP
`I
`int.
`SARAM
`Store {
`Area ' & DARAM
`
`
`
`
`103
`
`USP CPU
`
`1 O4
`
`DSP
`XRAM
`
`1 1
`
`FIG. 1
`
`
`
`Length
`Best. Addr.
`
`
`
`
`
`Header
`
`.
`
`HS. 3
`
`-mder ~~~~~~~~~~~~
`Data
`
`TransferBlock =_ntermediate
`
`Storage
`Area
`
`

`

`US. Patent
`
`Apr. 8, 2008
`
`Sheet 2 of 2
`
`US 7,356,680 B2
`
`Reset and Hold
`Slave Processor
`
`Push Info to
`Slave Processor
`
`202
`
`204
`
`
`
`Push
`Complete?
`
`
`Yes
`
`Boot Slave
`Processor
`
`Start 08 in
`Slave Processor
`
`20
`.
`
`208
`
`’
`
`21
`
`Reserve lntermed.
`Storage Area
`'
`
`212 _
`
`FIG. 2
`
`Send Message to
`r P
`33
`Host
`rose or
`
`Push lnfo to ISA
`
`Send Message to
`Slave Processor
`
`Copy ISA to
`"Invisible" Memory
`
`Send Message to
`_ Host Processor
`
`214
`
`216
`
`218
`
`220
`
`222
`
`
` .
`
`
`
`
`Release Blocks;
`Load Complete
`
`226
`
`Complete?
`
`224
`
`Yes
`
`

`

`US 7,356,680 BZ
`
`1
`METHOD OF LOADING INFORMATION
`INT0 A SLAVE PROCESSOR IN A
`MULTl-PRMIESSOR SYSTEM USING AN
`OPERATING-SYSTEM-FRIENDLY BOOT
`LOADER
`
`BACKGROUND
`
`This invention relates to initialization of electronic sys-
`tems having multiple programmable processors.
`The process of starting, or booting up, an electronic
`system having a programmable processor connected to one
`or more memory devices for storing program instructions, or
`code, and data is not as simple as it might seem at first
`glance. An important part of the reason for this is the need
`for the processor to begin operation in a well-defined state.
`The traditional ways of loading program code and data to
`a bare system are either by “pushing” the code and data into
`the system’s random-access memory (RAM) directly or by
`using a bootloader. The bootloader, which is sometimes
`called a boot
`loader or a bootstrap loader,
`is a set of
`instructions (i.e., program code, sometimes called “boot
`code”) that can be either “pushed” into the system's RAM
`or loaded into the RAM from a non-volatile memory, such
`as read-only memory (ROM). In its execution by the pro-
`cessor, the bootloader then “drags” in the rest of the code
`and data and starts the system.
`Examples of prior mechanisms for starting processor
`systems, including bootloaders, are LLS. Pat. No. 5,652,886
`to Tulpule et al. and US. Pat. No. 6,490,722 to Barton et al.
`and U.8. Patent Application Publication No. US 2002/
`0138156 Al
`to Wong et a1. Barton et al., for example,
`describes a two-stage bootloader in which the second stage
`finds, verifies, and loads the operating system. In Wong et
`al., a multiprocessor system uses a master processor coupled
`to a ROM to transfer boot code to slave processors, with
`memory controllers in the slave processors denying memory
`access requests until the boot code has been transferred to
`their RAMs.
`,
`As indicated by Barton et a1. and Wong et al., for example,
`starting up a multi-processor system, which can be generally
`considered as having a master or host processor, i.e., the
`system that orders the boot, and one or more slave or client
`processors,
`i.e.,
`the system to be booted,
`is even more
`complicated than starting up a single-processor system.
`Advantages of the “push” method are that it requires no
`code to execute in the slave during boot and that the only
`synchronization required is to hold the slave in a reset state
`and release it when loading is finished. Nevertheless, the
`“push” method works only when the memory or memories
`of the slave are visible to the host. This visibility can be
`implemented in several ways. For example, a memory may
`be visible on the address and data busses ofboth the host and
`the slave processors or direct memory access (DMA) trans-
`fers may be allowed from the host’s memory or memories to
`the slave’s memory or memories.
`'
`When the slave’s memory to be loaded is invisible to the
`host, the “push” method cannot be used. Inthat situation,
`some form of bootloading must be used. As noted above, the
`bootloader technique requires either that bo‘ot code can be
`pushed onto the slave (which in this case is not possible) or
`that the slave can load code from a non—volatile memory.
`The bootloader then initiates a transfer of code from the host
`to the slave and finishes loading the memory.
`Multi-processor systems'in which some or all of a slave‘s
`memory is not visible to a host are possible. In such systems,
`it can be advantageous to take advantage of well-established
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`60
`
`65
`
`2
`software frameworks for loading and inter-processor com-
`munication, which render traditional bootloaders undesir-
`able. Moreover, a bootloader can conflict with the operating
`system, which can be said to want to have control over the
`entire system and all of the memory.
`Among the problems faced when integrating a bootloader
`with an operating system (OS) are ensuring that code that is
`not yet loaded is not executed, efficiently loading code to a
`memory or memories invisible to the host, and synchroniz-
`ing with the host the loading and booting of the slavc(s).
`Moreover, it is necessary to determine which portions of the
`system must be loaded to memories visible to both host and
`slave processors and how the binary image to be loaded
`should be arranged for the bootloader to work together with
`the OS. Another issue that can be important is the integration
`of the bootloader and the OS, as an already established
`framework for communication between host and slave then
`can be used during loading. Such a framework typically
`would include one or more primitives for communication
`that rely on OS-features.
`'
`
`SUMMARY
`
`in one aspect, a method of
`This invention provides,
`loading program code into a slave processor in a multi-
`processor system that includes a master processor and the
`slave processor. The method includes the steps of resetting
`the slave processor and holding the slave processor in a reset
`state; pushing information into a first memory that is acces-
`sible by the master and slave processors; booting the slave
`processor; starting an operating system in the slave proces-
`sor,
`including blocking scheduling of processes having
`program code located in a second memory that is accessible
`by the slave processor and inaccessible by the master
`processor; reserving an intermediate storage area in the first
`memory; sending to the master processor information about
`a location and size of the intermediate storage area reserved;
`based on the sent
`information,
`loading the intermediate
`storage area with information to be loaded into the second
`memory; sending a first message to the slave processor that
`indicates the intermediate storage area has been loaded and
`whether loading is finished or more information is to be
`loaded; copying information in the intermediate storage area
`to the second memory; and sending a second message to the
`master processor that indicates that information in the inter-
`mediate storage area has been copied.
`In another aspect of the invention, a multi—processor
`system includes a host processor, at least one client proces—
`sor, a first random-access memory accessible by the host and
`client processors, a second random~aeoess memory acces-
`sible by the client processor and not accessible by the host
`processor, and a bootloader. The first memory includes an
`intermediate storage area, and the bootloader includes a host
`part and a client part. The host part is loadable into the first
`random-access memory and has a first stage and a second
`stage. The first stage resets and holds the client processor in
`a reset state and pushes information into the first random-
`access memory. The second stage is initiated by the client
`part, loads the intermediate storage area with information to
`be loaded to the second random—access memory, and sends
`to the client part a first message indicating the intermediate
`storage area is loaded. The client part is loadable into the
`first random-access memory, starts an operating system
`including an idle process and initially blocking scheduling
`of all processes having program code located in the second
`random-access memory, copies information loaded into the
`intermediate storage area to the second random-access
`
`

`

`US 7,356,680 82
`
`3
`memory, and sends to the host part a second message
`indicating information has been copied.
`In another aspect of the invention, a computer-readable
`medium contains a computer program for loading informa-
`tion into a slave processor in a multi-processor system that
`includes a master processor and the slave processor. The
`computer program performs the steps of resetting the slave
`processor and holding the slave processor in a reset state;
`pushing information into a first memory that is accessible by
`the master and slave processors; booting the slave processor;
`starting an operating system in the slave processor, including
`blocking scheduling of processes having program code
`located in a second memory that is accessible by the slave
`processor and inaccessible by the master processor; reserv—
`ing an intermediate storage area in the first memory; sending
`to the master processor information about a location and size
`of the intermediate storage area reserved; based on the sent
`information,
`loading the intermediate storage area with
`information to be loaded into the second memory; sending
`a first message to the slave processor that indicates the
`intermediate storage area has been loaded and whether
`loading is finished or more information is to be loaded;
`copying information in the intermediate storage area to the
`second memory; and sending a second message to the
`master processor that indicates that infonnation in the inter-
`mediate storage area has been copied.
`
`BRlHF DHSCRII’l'lON ()F 'l'HH DRAWINGS
`
`The various features, objects, and advantages of this
`invention will be understood by reading this description in
`conjtmction with the drawings, in which:
`FIG. 1 depicts a multi-processor system;
`FIG. 2 is a flowchart of an OS-friendly bootloader; and
`FIG. 3 depicts an example of an organization of an
`intermediate storage area.
`
`DETAILED DESCRIPTION
`
`As noted above, a conventional bootloader can conflict
`with the operating system of a multi—processor system. This
`application describes an OS-friendly bootloader and meth-
`ods that meet the challenge of integrating an OS With a
`bootloader in systems in which the host and a client have a
`communication mechanism that requires the OS for the
`mechanism to work and the client has two memory systems:
`one visible to both host and client and one visible only to the
`client.
`
`1 depicts such a multi—processor system 100 that
`FIG.
`includes a host processor 102 and a client processor 104. It
`will be appreciated that although FIG. 1 shows one client
`processor 104, more can be provided..]t will also be appre-
`ciated that
`the host and client processors may be any
`programmable
`electronic processors.
`In the
`example
`depicted in FIG. 1, the processor 102 is shown as the central
`processing unit
`(CPU) of an advanced RISC machine
`(ARM), and the processor 104 is shown as the CPU of a
`digital signal processor (DSP) device. The dashed line in
`FIG. 1 depicts the hardware boundary between the host and
`slave devices, in this example, the ARM and the DSP, and
`also a non-volatile memory 106. The memory 106 may be a
`ROM, a flash memory, or other type ofnon-volatile memory
`device.
`
`Most commercially available DSP devices include on-
`chip memories, and as indicated in FIG. 1, the DSP includes
`“internal” single—access RAM (SARAM) and dual-access
`RAM (DARAM) 108, as well as an “external” RAM
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`4o
`
`45
`
`50
`
`55
`
`60
`
`65
`
`4
`(XRAM) 110. An intermediate storage area, indicated by the
`dashed line, is defined within the memory 108 as described
`in more detail below. The arrows in FIG. 1 indicate access
`paths, e.g., busses and DMA paths, between the CPUs and
`the memories. The ARM host CPU 102 can access the
`non-volatile memory 106 and the SARAM and DARAM
`108 of the DSP, but not the DSP’S XRAM 110, and the DSP
`slave CPU 104 can access all of the RAMs 108, 110.
`The SARAM and DARAM 108 can be loaded from the
`non-volatile memory 106 by the trivial “push” method.
`When code needs to be loaded to the XRAM 110 during
`boot, however, a bootloader solution is required because the
`XRAM 110 is invisible to, i.e., not accessible by, the CPU
`102 and so boot code cannot be pushed to the XRAM 110.
`As described in more detail below and in connection with
`the flow chart oi" FIG. 2, an (')S-‘friendly hootloader advan-
`tageously has a host part and a client part that is loaded into -
`a memory or memories visible to both the master and the
`slave (e.g., SARAM and DARAM 108).
`The host part of the OS-friendly bootlo‘ader may be
`considered as including two stages or modes of operation.
`The first stage resets and holds the slave 104 in the reset state
`(Step 202) and pushes information (program instructions
`and/or data) (Step 204) in the usual way from the non-
`volatile memory 106 into the commonly visible memories
`108. The information pushed into these memories is mainly
`the bobtloader, the OS, and any necessary start~up code for
`the OS.
`It should be appreciated that an application or
`applications or parts thereof may also be pushed into these
`memories at start—up and may start executing during the
`loading of the “external" memory 110. When this “pus " is
`finished (Step 206), the slave 104 is allowed to boot (Step '
`208) and to start up the OS (e.g., it is released from the reset
`state) and its normal communication mechanisms (Step
`210). The host part then awaits a message from the slave,
`which initiates operation of its second stage as described in
`more detail below.
`The slave part of the OS—fri endly bootloader that is loaded
`(“pushed” by the host part‘s first stage) into the commonly
`visible memories 108 starts the operating system, carrying
`out the following operations (Step 210). First,
`interrupt
`handlers are created. The code for the interrupt handlers
`must be located in the memory that
`is already loaded
`because an interrupt may occur at any time. Second, data
`structures (e.g., process control blocks and stacks) of corn-
`mon processes, i.c., processes that run in both the host and
`the slave, are created. It should be understood that since
`these common processes have not yet executed, their code
`may be loaded at a later time and may very well be located
`in “external” memory visible only to the slave, e.g., XRAM
`110. Third, the system idle process is created. The code for
`the idle process must be located in the memory that
`is
`already loaded because the idle process is the process
`selected to run by the OS if there is nothing useful to do.
`Fourth, the scheduling of at least all processes residing in,
`i.e., having program code or data located in, the “externa ”
`memory 110 is blocked. Execution of processes residing in
`the “internal” memory can thus advantageously start or
`continue in parallel with the loading of the “external”
`memory as noted above. It is also possible to stop scheduling
`all processes except the idle process, but this is not neces~ .
`sary. Making this blocking the last thing done before the OS
`scheduler switches on ensures that the code in these pro-
`cesses will not run when the scheduler releases. Finally, the
`OS scheduler is released, which allows the OS to start
`executing code and scheduling processes. It will be under~
`stood that since at least all external-memory-process sched-
`
`

`

`US 7,356,680 B2
`
`5
`
`uling was blocked, all that the OS can now do is schedule
`interrupts and the idle process.
`At. this point, the slave 104 is partly up and running. The
`slave part of the OS-friendly bootloader has been loaded,
`and the slavc’s idle process is executing. The slavc’s OS can
`schedule and execute code in response to interrupts and can
`schedule the idle process and any unblocked processes
`having code residing in internal memory. OS mechanisms
`for which all code and data accesses are in memory that has
`already been loaded (SARAM and DARAM 108,
`in this
`example) are available, including the usual communication
`mechanisms. These OS communication mechanisms, being
`high-level abstractions ofDMA, shared memory, and struc-
`tured registers, are more capable than simple semaphores
`and enable the host processor to communicate efficiently
`with a processor (the slave) that has not completely started,
`which is to say a processor that is executing mainly only the
`OS, interrupt services, and processes residing in “intern ”
`RAM.
`
`10
`
`15
`
`20
`
`6
`slave then sends a message to the host (Step 222) that
`indicates that the slave has copied the contents of the
`intermediate storage area.
`If there is more code and/or data to load (Step 224), this
`cycle of copying and messaging (Steps 216-224) can be
`repeated as many times as required. When the loading is
`finished, i.e., when no more information needs to be copied
`to the slave, the slave releases the blocking of processes that
`were blocked earlier, thereby allowing scheduling of code in
`its slave-private memory (Step 226). Loading is now com—
`plete.
`As described above, the host fills the intermediate storage
`area in the memory 108 with code and data that the slave
`further copies to end destinations in the slave—private
`memory 110. Perhaps the simplest way of doing this is to
`precede all code and data in the intermediate storage area
`with a tag that contains the destination address and length of
`the block to be loaded. FIG. 3 depicts one example of such
`an organization of the intermediate storage area. A block of
`code and/or data to be transferred into the intermediate
`storage area includes a header that indicates the length of the
`block and where it is to be loaded in the slave memory, i.e.,
`the destination address. As indicated by the dashed lines in
`FIG. 3, several such blocks may be concatenated in the
`intermediate storage area.
`The information (code and data) to be loaded can be
`arranged in many ways in the intermediate storage area and
`memories. Often the information is arranged as blocks of
`consecutive information that are to be loaded to different
`addresses, and thus an arbitrarily chosen size of the inter-
`mediate storage area may not match the sizes of all such
`blocks. Still, it should be understood that
`the system will
`operate more efiiciently when the intermediate storage area
`is always filled. This means that if the blocks to be loaded
`are smaller than this area, a transfer of several (smaller)
`blocks should be done at the same time. This also means that
`a block should be split if it is larger than the remaining part
`of the intermediate storage area, and one part transferred to
`the intermediate storage area with the remaining part trans—
`fcrrcd in the next block, Moreover, if a block is several times
`larger than the intermediate storage area, it may have to be
`split more than once. All of this splitting and concatenation
`is done in the host part of the OS-fi’iendly bootloader in ways
`that are well known to computer scientists. From the point
`of View of data communications engineers, the host part of
`the OS—friendly bootloader is thus a kind of “transport
`layer”.
`The artisan will understand the benefit of this splitting and
`concatenation of information into transfer blocks. Some
`kind of communication mechanism is required to perform
`the actual transfers of information between memories, and
`whatever the mechanism used, fewer large transfers are
`typically preferable to more small
`transfers. A kephfull
`intermediate storage area can make the most efiicient use of
`the available bandwidth by advantageously minimizing
`overhead on the communications channel. Each message
`requires some amount of administration and administrative
`information, and so fewer messages means less overhead.
`A good example of the benefit of block splitting and
`concatenation eife‘ct is DMA as the communication mecha-
`nism. DMA typically requires some setup overhead (i .e., it
`takes some time to set up), but then DMA is very efficient
`once it has been started because transfers can be carried out
`in minimal CPU cycles. In order to gain the greatest benefit
`from the use of DMA, the largest DMA transfer pennitted by
`the hardware should be done every time. Thus, it is currently
`
`30
`
`35
`
`4O
`
`45
`
`The idle process reserves a block of memory in the slave’s
`heap of memory that is located in the memory visible to the
`host,
`such as “internal” memory 108 (Step 212). As
`described in more detail below,
`this reserved block of
`memory is used for intermediate storage of information 25
`(code and/or data) to be transferred to the slave-private
`memory, i.e., the memory that is invisible to the host, such
`as “external” XRAM 110. The slave’s idle process advan~
`tageously uses the established communication mechanisms
`to send to the host (Step 2314) information about the address
`and size or length of the intermediate storage area reserved
`in the previous step. After sending the information, which
`may be contained in one or more suitable messages, the
`slave blocks, awaiting a message from the host. While
`“blocked”, the slave does not conduct any further loading
`activities ulllll it receives the host’s response.
`.
`It will be understood that whether the slave’s OS acts on
`an interrupt at this stage depends on the nature of the
`interrupt. Since many OS mechanisms (like those used to
`communicate with the host, for example) rely on interrupts,
`and it cannot be known in advance when an interrupt will
`occur, all interrupt code must have been loaded into “inter-
`nal” memory. In that respect, interrupts are served during the
`second stage of the bootloading. Nevertheless, if an interrupt
`is to trigger a chain of events such as processes starting to
`do some data processing and the code or data for those
`processes are located or will be located in “external”
`memory, the interrupt is blocked and the interrupt service
`puts the request in the “in-queue" of that process so that the
`request will be served after booting has finished and that
`process can execute.
`On receipt of the slave’s information, the second stage of
`the host bootloader fills the intermediate storage area with
`information (code and/or data) to be loaded into the slave’s
`invisible memory (Step 216). Code and data is pushed to the
`intermediate storage area in the usual way because this area
`is memory that both processors can access, but the push is
`activated through the OS communication mechanisms.
`The host now sends a message to the slave (Step 218) that
`indicates the intermediate storage area has been loaded and
`whether loading is finished or more code and/or data is
`available. This is the message the slave is waiting fer. The
`host in turn new blocks, awaiting a message from the slave.
`'lhe slave copies the contents of the intermediate storage
`area to appropriate locations in its slave-private memory
`(Step 220), thereby implementing its actual loading. The
`
`50
`
`60
`
`65
`
`

`

`US 7,356,680 BZ
`
`7
`
`believed to be advantageous to set the size of the interme-
`diate storage area to the maximum DMA block size.
`The host part of the OS-friendly bootloader should
`“know” when to leave its first stage (loading information by
`pushing it
`into memory) and to enter its second stage
`(loading information through one or more communication
`- mechanisms). After all, the host cannot push information
`into memory that is invisible to it. Although the slave sends
`a message to the host part when it has reached the idle
`process, this may not be enough for the best part to tell the
`slave to start executing. This transition from pushing to
`bootloading will be seen as a change from the paradigm of
`passive loading (i.e., no code executing in the slave) to the
`paradigm of active loading (i.e., a partly alive, executing
`slave).
`One way for the host part to know when to change stages
`is to tag the code and data to be loaded with information on
`what memory it shall be loaded to. For example. information
`intended for the invisible memory could include a tag or tags
`that indicate the information is to be loaded to the invisible
`memory. The absence of such a tag could indicate that the
`information is to be loaded to the visible memory, although
`it will be appreciated that a tag explicitly indicating that the
`information is to be loaded to the visible memory could also
`be used. This enables the host to do two passes over the
`information and load only the information required in each
`pass. In the first pass, things that go into the internal memory
`would be found and loaded, and in the second pass, things
`that go into the external memory would be found and loaded.
`Another way, which currently appears to be simpler, is to
`arrange the slave-private memory such that all of it resides
`above (or below) a predetermined addreSS. The information
`to be transferred is then sorted accordingly, with all sections
`of code and data to be loaded to the slave-private memory
`put at the end (or the beginning) of the sorted image. Then,
`all the host part of the OS—friendly bootloader has to do is to
`enter its second stage when an address larger (or smaller)
`than the predetermined (boundary) address is encountered.
`In order to save memory or increase code integrity and
`platform security on the host side, information to be loaded
`to the slave can also be pre-processed in several different
`ways. For example, the information may be compressed
`according to a suitable algoritlmr, thereby reducing the size
`of the memory needed for it on the host side. For another
`example,
`the information may be encrypted,
`thereby
`strengthening platform security, as a potential hacker will
`not be able to disassemble the information easily.
`It
`is
`currently believed that encryption is valuable if the infor-
`mation to be loaded to the slave is stored in the internal file
`system of the host, where the information is available (at
`least in theory) to anyone.
`it will be understood that OS
`From this description,
`mechanisms are available to the slave part of the 'OS-
`friendly bootloader that is executed by the slave processor
`and that the slave can reuse existing OS-dependent code
`required for communication. Moreover,
`the OS-friendly
`bootloader uses loading resources (e.g., DMA) efiiciently,
`with the host part automatically deciding when to switch
`from a first stage, or push mode,
`to a second stage, or
`bootloader mode.
`It is expected that this invention can be implemented in a
`wide variety of environments, including for example mobile
`communication devices. Newer ones of such devices can
`employ the OS—friendly bootloader described here to boot
`their DSPs, which may be provided to handle multimedia
`tasks,
`in cooperation with their main-processor software
`systems.
`
`10
`
`15
`
`20
`
`25
`
`30
`
`35
`
`40
`
`45
`
`50
`
`55
`
`60
`
`65
`
`8
`The OS—friendiy bootloader described here takes the
`operating system into account and actually executes on an
`operating system. The host is fully running when the slave
`is booted or re—rebooted. This bootloader does not require
`the host processor to be in a certain state in order to start up
`the slave processor. Indeed, the startup of the slave processor
`can be carried out any time during the execution of the host
`processor software. The OS-friendly bootloader does not
`need a special executable file that
`is run in the slave
`processor while information is being loaded to it from the
`host processor and the host-inaccessible RAM. One execut-
`able is linked to all of the slave processor’s memories. The
`slave is booted before all code is loaded, but code that is
`linked to host-inaccessible memory is not run until it
`is
`loaded with the help of code that is linked to the slave
`processors host-accessible memory.
`It will therefore be understood that the OS—friendly boot~
`loader described here also makes it possible to change
`software executing in the slave processor and to start slave
`execution of an application software before it is completely
`loaded. One or more application processes can be chosen for
`“pushing” with the bootloader into the slave processor's
`host-accessible memory, and those processes will start
`executing at the same point in time as the slave processor‘s
`host—inaccessible memory begins to be loaded.
`This capability can be important in many devices and
`many use cases. in a mobile telephone, for example, such
`use cases include making a call, receiving a call, compress—
`ing/decompressing speech, playing music files, etc. With the
`OS-friendly bootloader described here, one can load and
`execute new software in the slave processor virtually any-
`time the host processor is running.
`'
`It will be appreciated that procedures described above are
`carried out repetitively as necessary. To facilitate under—
`standing, many aspects of the invention are described in
`terms of sequences of actions that can be performed by, for
`example, elements of a programmable computer system. It
`will be recognized that various actions could be performed
`by specialized circuits (e.g., discrete logic gates intercon-
`nected to perform a specialized function or application-
`specific
`integrated circuits), by program instructions
`executed by one or more processors, or by a combination of
`both.
`Moreover, the invention described he

This document is available on Docket Alarm but you must sign up to view it.


Or .

Accessing this document will incur an additional charge of $.

After purchase, you can access this document again without charge.

Accept $ Charge
throbber

Still Working On It

This document is taking longer than usual to download. This can happen if we need to contact the court directly to obtain the document and their servers are running slowly.

Give it another minute or two to complete, and then try the refresh button.

throbber

A few More Minutes ... Still Working

It can take up to 5 minutes for us to download a document if the court servers are running slowly.

Thank you for your continued patience.

This document could not be displayed.

We could not find this document within its docket. Please go back to the docket page and check the link. If that does not work, go back to the docket and refresh it to pull the newest information.

Your account does not support viewing this document.

You need a Paid Account to view this document. Click here to change your account type.

Your account does not support viewing this document.

Set your membership status to view this document.

With a Docket Alarm membership, you'll get a whole lot more, including:

  • Up-to-date information for this case.
  • Email alerts whenever there is an update.
  • Full text search for other cases.
  • Get email alerts whenever a new case matches your search.

Become a Member

One Moment Please

The filing “” is large (MB) and is being downloaded.

Please refresh this page in a few minutes to see if the filing has been downloaded. The filing will also be emailed to you when the download completes.

Your document is on its way!

If you do not receive the document in five minutes, contact support at support@docketalarm.com.

Sealed Document

We are unable to display this document, it may be under a court ordered seal.

If you have proper credentials to access the file, you may proceed directly to the court's system using your government issued username and password.


Access Government Site

We are redirecting you
to a mobile optimized page.





Document Unreadable or Corrupt

Refresh this Document
Go to the Docket

We are unable to display this document.

Refresh this Document
Go to the Docket