`Wilkinson et al.
`
`[54] PARALLEL PROCESSING SYSTEM HAVING
`ASYNCHRONOUS SIMD PROCESSING AND
`DATA PARALLEL CODING
`
`[75]
`
`Inventors: Paul Amba Wilkinson. Apalachin;
`James Warren Dieft'enderfer. Owego;
`Peter Michael Kogge, Endicott;
`Nicholas Jerome Schoonover. Tioga
`Center, all of N.Y.
`
`[73] Assignee: International Business Machines
`Corporation. Armonk. N.Y.
`
`[21] Appl. No.: 487,363
`Jun. 7, 1995
`
`[22] Filed:
`
`Related U.S. Application Data
`
`[62] Division of Ser. No. 233,210, Apr. 26, 1994, which is a
`continuation of Ser. No. 888,680, May 22, 1992, abandoned,
`which is a continuation-in-part of Ser. No. 611,594, Nov. 13,
`1990, abandoned, and Ser. No. 798,788, Nov. 27, 1991,
`abandoned.
`Int. CL 6
`[51]
`...................................................... G06F 15/80
`[52] U.S. Cl. ......................................... 395/800.2; 395/379
`[58] Field of Search ..................................... 395/375. 800.
`395/379. 800.2. 800.1. 800.11, 800.12.
`800.13. 800.14, 800.15. 800.16, 800.21,
`800.22
`
`[56]
`
`References Cited
`
`U.S. PATENT DOCUMENfS
`3,537,074 10/1970 Stokes et al ..
`3,544,973 12/1970 Borek et al ..
`3,970,993
`7/1976 Finnila.
`
`(List continued on next page.)
`
`FOREIGN PATENT DOCUMENTS
`0132926
`2/1985 European Pat. Off ..
`0208497
`6/1986 European Pat. Off ..
`208457A2
`6/1986 European Pat. Off ..
`
`(List continued on next page.)
`
`I IIII IIII II II IIII I llll llll I1I111111I11I111
`
`US005761523A
`[111 Patent Number:
`[45J Date of Patent:
`
`5,761,523
`Jun. 2, 1998
`
`OTHER PUBLICATIONS
`
`T.A. Kriz and M.J. Marple. "Multi-Port Bus Structure With
`Fast Shared Memory". IBM Technical Disclosure Bulletin.
`vol. 27. No. lOA. pp. 5579-5580. Mar. 1985.
`
`(List continued on next page.)
`
`Primary Examiner-Richard L. Ellis
`Attorney, Agent, or Finn-Lynn L. Augspurger; David V.
`Rossi
`
`[57]
`
`ABSTRACT
`
`A parallel array processor for massively parallel applications
`is formed with low power CMOS with DRAM processing
`while incorporating processing elements on a single chip.
`Eight processors on a single chip have their own associated
`processing element. significant memory. and I/0 and are
`interconnected with a hypercube based. but modified. topol(cid:173)
`ogy. These nodes are then interconnected. either by a
`hypercube, modified hypercube, or ring. or ring within ring
`network topology. Conventional microprocessor MMPs
`consume pins and time going to memory. The new archi(cid:173)
`tecture merges processor and memory with multiple PMEs
`( eight 16 bit processors with 32K and I/0) in DRAM and has
`no memory access delays and uses all the pins for network(cid:173)
`ing. Each chip will have eight 16 bit processors. each
`processor providing 5 MIPs performance. I/0 has three
`internal ports and one external port shared by the plural
`processors on the chip. The scalable chip PME has internal
`and external connections for broadcast and asynchronous
`SThfD. MIMD and SIMlMD (SIMD/MThfD) with dynamic
`switching of modes. The chip can be used in systems which
`employ 32. 64 or 128.000 processors. and can be used for
`lower, intermediate and higher ranges. Local and global
`memory functions can all be provided by the chips
`themselves, and the system can connect to and support other
`global memories and DASD. The chip can be used as a
`microprocessor accelerator. in personal computer
`applications. as a vision or avionics computer system. or as
`workstation or supercomputer.
`
`4 Claims, 24 Drawing Sheets
`
`INTEL - 1007
`
`
`
`5,761,523
`Page 2
`
`U.S. P.IITENT DOCUMENTS
`7/1978 Stokes et al ..
`8/1978 Gilbreath et al ..
`5/1981 Reddaway .
`2/1982 Batcher ................................... 364nl6
`7/1982 Palmer et al ........................... 364/748
`4/1983 Fung .
`7/1983 Kohl .
`10/1983 Barnes et al ..
`3/1984 Lorie et al. .
`8/1984 Hunt .
`8/1984 Carrison .
`2/1985 Bolton et al ..
`6/1985 Adams, ill et al ..
`7 /1986 Hillis ....... .................................. 370/60
`8/1986 Widen et al ..
`11/1986 Wagner et al ..
`11/1986 Kulisch ................................... 364/748
`11/1987 Hamstra et al ..
`1/1988 Dolecek .
`4/1988 Jennings et al ..
`4/1988 Holsztynski .
`4/1988 Fiduccia .
`5/1988 Chiarulli .
`8/1988 Calvignac et al. ........................ 370/94
`10/1988 Mattheyses ............................... 370/94
`11/1988 Li et al ..
`11/1988 Morton ...................................... 371/11
`2/1989 Thiel et al ..
`2/1989 Sowa .
`2/1989 Sfarti et al ..
`2/1989 Nash et al ................................. 382/49
`3/1989 Peterson et al ..
`4/1989 Ohkami et al ..
`5/1989 Morton.
`5/1989 Morton .
`6/1989 Mitchell et al ..
`7/1989 Morrison et al ..
`7/1989 Aoyama et al ..
`7/1989 Morton .
`8/1989 Carleton et al ..
`8/1989 Miyata .
`8/1989 Stolfo et al ..
`10/1989 Leeland ................................... 364/748
`10/1989 Gifford .
`1/1990 Gifford .
`1/1990 Fiduccia et al ..
`2/1990 Ewert .
`2/1990 Boettle et al. ............................ 370/60
`2/1990 Takahashi et al ..
`3/1990 Morton .
`3/1990 Mattheyses et al ..
`4/1990 Schwarz .................................. 364n48
`4/1990 Morton .
`4/1990 Ranade.
`5/1990 Davis et al ..
`5/1990 Neches et al ..
`6/1990 Humphrey et al ..
`6/1990 Grinberg et al ........................ 364n48
`7/1990 Hyatt.
`7/1990 Cok.
`7/1990 Aoyma et al ..
`9/1990 Neches.
`9/1990 Anderson et al ..
`10/1990 Smith .
`10/1990 Dawes.
`12/1990 Xu et al ..
`1/1991 Grondalski .
`2/1991 Janke et al .............................. 364/134
`2/1991 Taylor.
`4/1991 Ruetz .
`
`4,101,9(i()
`4,107,773
`4,270,170
`4,314,349
`4,338,675
`4,380,046
`4,394,726
`4,412,303
`4,435,758
`4,467,422
`4,468,727
`4,498,133
`4,523,273
`4,598,400
`4,604,695
`4,621,339
`4,622,650
`4.706,191
`4,720,780
`4,736,291
`4,739,474
`4,739,476
`4,748,585
`4,763,321
`4,780,873
`4,783,738
`4,783,782
`4,805,091
`4,809,159
`4,809,169
`4,809,347
`4,814,980
`4,825,359
`4,831,519
`4,835,729
`4,841,476
`4,847,755
`4,849,882
`4,852,048
`4,855,903
`4,858,110
`4,8Ci0,201
`4,872,133
`4,873,626
`4,891,787
`4,896,265
`4,901,224
`4,903,2Ci0
`4,905,143
`4,907,148
`4,910,665
`4,916,652
`4,916,657
`4,920,484
`4,922,408
`4,925,311
`4,933,846
`4,933,895
`4,942,516
`4,942,517
`4,943,912
`4,956,772
`4,958,273
`4,964,032
`4,967,340
`4,975,834
`4,985,832
`4,992,926
`4,992,933
`5,005,120
`
`5,006,978
`5,008,815
`5,008,882
`5,010,477
`5,016,163
`5,020,059
`5,021,945
`5,038,282
`5,038,386
`5,041,189
`5,041,971
`5,045,995
`5,047,917
`5,049,982
`5,056,000
`5,072,217
`5,113,523
`5,121,498
`5,136,582
`5,142,540
`5,146,608
`5,165,023
`5,170,482
`5,170,484
`5,173,947
`5,175,762
`5,175,865
`5,181,017
`5,187,801
`5,189,665
`5,197,130
`5,212,773
`5,212,777
`5,218,679
`5,218,709
`5,230,079
`5,239,629
`5,239,654
`5,251,097
`5,253,359
`5,265,124
`5,280,474
`5,297,260
`5,355,508
`5,367,636
`
`4/1991 Neches .
`4/1991 Hillis .
`4/1991 Peterson et al. ....................... 370/94.3
`4/1991 Omoda et al ..
`5/1991 Jesshope et al ..
`5/1991 Gorin et al ............................ 371/11.3
`6/1991 Morrison et al ..
`8/1991 Gilbert et al ..
`8/1991 Li ........... .......... ....................... 382/438
`8/1991 Tamitani .
`8/1991 Carvey et al ..
`9/1991 Levinthal et al ..
`9/1991 Athas et al ..
`9/1991 Lee et al ................................... 357/81
`10/1991 Chang.
`12/1991 Georgiou et al ................... 340/825.79
`5/1992 Colley et al. ........................... 395/800
`6/1992 Gilbert et al.
`.......................... 395noo
`8/1992 Firoozdmand ......................... 370/85.1
`8/1992 Glasser ................................... 371/40.l
`9/1992 Hillis ....................................... 395/800
`11/1992 Gifford .................................... 395/325
`12/1992 Shu et al. ................................ 395/800
`12/1992 Gorodalski .............................. 395/800
`12/1992 Chande et al ............................. 382/41
`12/1992 Phelps et al. ........................... 395/800
`12/1992 Hillis ....................................... 395/800
`1/1993 Frey, Jr., et al.
`.................. 340/825.02
`2/1993 Zenios et al. ........................... 395/800
`2/1993 Niehaus et al. ...................... 370/458.1
`3/1993 Chen et al. ............................. 395/325
`5/1993 Hillis ...... ... ....................... ....... 395/200
`5/1993 Gove et al .............................. 395/375
`6/1993 Ben-Ayed et al ....................... 395/200
`6/1993 Fijany et al. ............................ 395/800
`7/1993 Grondalski .............................. 395/800
`8/1993 Miller et al. ............................ 395/325
`8/1993 Ing-Simmons et al ................. 395/800
`10/1993 Simmons et al ........................ 361/687
`10/1993 Spix et al. .............................. 395/575
`11/1993 Staab et al. ................................. 375/3
`1/1994 Nickolls et al. .. ........................ 370/60
`3/1994 Kametani ................................ 395/325
`10/1994 Kan ......................................... 395/800
`11/1994 Colley et al ............................ 395/200
`FOREIGN PATENT DOCUMENTS
`340668A2
`4/1989
`European Pat. Off ..
`428327Al
`11/1990
`European Pat. Off ..
`429733A2
`6/1991
`European Pat Off ..
`4Ci0599A3
`12/1991
`European Pat Off ..
`485690A2
`5/1992
`European Pat Off ..
`493876A2
`7/1992
`European Pat. Off. .
`2223867
`4/1990
`United Kingdom .
`89/09967
`4/1988
`WIPO.
`92/06436
`4/1992
`WIPO.
`<ITHER PUBLICATIONS
`H.P. Bakoglu, "Second-Level Shared Cache Implementa(cid:173)
`tion For Multiprocessor Computers With A Common Inter(cid:173)
`face For The Second-Level Shared Cache And The Sec(cid:173)
`ond-Level Private Cache", IBM Technical Disclosure
`Bulletin, vol. 33, No. 11. pp. 362-365, Apr. 1991.
`Mansingh et al., "System Level Air Flow Analysis for a
`Computer Processing Unit", Hewlett-Packard Jouma~ vol.
`41 No. 5. Oct. 1990, pp. 82-87.
`Tewsbury et al .• "Communication Network Issues and High(cid:173)
`-Density Interconnects in Large-Scale Distributed Comput(cid:173)
`ing Systems". IEEE Journal on Selected Areas in Commu(cid:173)
`nication, vol. 6 No. 3. Apr. 1988. pp. 587-o07.
`
`INTEL - 1007
`
`
`
`5,761,523
`Page 3
`
`Boubekeur et al., "Configuring A Wafer-Scale Two-Dimen(cid:173)
`sional Array of Single-Bit Processors". Computer. vol. 2.
`Issue 4. Apr. 1992. pp. 29-39.
`Korpiharju et al., "flITCA Configurable Logic Cell Array
`Architecture" IEEE. Sep. 1991. pp. 3-3.1-3-3.4.
`C.K. Baru and S.Y.W. Su. 'The Architecture of SM3: A
`Dynamically Partitionable Multicomputer System", IEEE
`Transactions on Computers. vol. C-35, No. 9. pp. 790-802,
`Sep. 1986.
`S.P. Booth et al., "An Evaluation of the Meiko Computing
`Surface for HEP Fortran Farming*", Computer Physics
`Communications 57. pp. 48~91. 1989.
`S.P. Booth et al.. "Large Scale Applications of Transputers
`in HEP: the Edinburgh Concurrent Supercomputer Project",
`Computer Physics Communications 57. pp. 101-107, 1989.
`P. Christy, "Software to Support Massively Parallel Com(cid:173)
`puting on the MasPar MP-1", 1990 IEEE, pp. 29-33.
`S.R. Colley, ''Parallel Solutions to Parallel Problems",
`Research & Development, pp. 42-45, Nov. 21. 1989.
`J.R Nickolls, 'The Design of the MasPar MP-1: A Cost
`Effective Massively Parallel Computer", 1990 IEEE, pp.
`25-28.
`J.P. Prins and J.A. Smith. "Parallel Sorting of Large Arrays
`on the MasPar MP-1 *. The 3rd Symposium on the Frontiers
`of Massively Parallel Computation", pp. 59-64, Oct. 1990.
`J.B. Rosenberg and J.D. Becher, "Mapping Massive S™D
`Parralelism onto Vector Architectures for Simulation", Soft(cid:173)
`ware-Practice and Experience, vol. 19(8). pp. 739-756.
`Aug. 1989.
`J.C. Tilton. "Porting an Interative Parallel Region Growing
`Algorithm from the MPP to the MasPar MP-1". The 3rd
`Symposium on the Frontiers of Massively Parallel Compu(cid:173)
`tation, pp. 170-173. Oct. 1990.
`"Sequent Computer Systems Balance and Symmetry
`Series". Faulkner Technical Reports. Inc., pp. 1-6, Jan.,
`1988.
`"Symmetry 2000/400 and 2000/700 with the DYNIK/ptx
`Operation System", Sequent Computer Systems Inc. date
`unknown.
`"Symmetry 2000 Systems-Foundation for Information
`Advantage", Sequent Computer Systems Inc, date unknown.
`"Our Customers Have Something That Gives Them an
`Unfair Advantage", The nCUBE Parallel Software Environ(cid:173)
`ment. nCUBE Corporation date unknown.
`Y.M. Leung. "Parallel Technology Mapping With Identifi(cid:173)
`cation of Cells for Dynamic Cell Generation". Dissertation.
`Syracuse University. May 1992.
`'The Connection Machine CM-5 Technical Summary",
`Thinking Machines Corporation. Oct. 1991.
`Fineberg et al .. "Experimental Analysis of a Mixed-Mode
`Parallel Architecture Using Bitonic Sequence Sorting".
`Journal of Parallel And Distributed Computing, Mar. 1991.
`pp. 239-251.
`T. Bridges, 'The GPA Machine: A Generally Partitionable
`MSIMD Architecture". The 3rd Symposium on the Frontiers
`of Massively Parallel Computation. Oct. 1990, pp. 196-203.
`Abreu et al., 'The APx Accelerator", The 2nd Symposium
`on the Frontiers of Massively Parallel Computation, Oct.
`1988. pp. 413-417.
`D.A. Nicole, "Esprit Project 1085 Reconfigurable Trans(cid:173)
`puter Architecture". CONPAR 88 Additional Papers. Sep.
`1988. pp. 12-39.
`E. DeBenedictis and J.M. del Rosario. "nCUBE Parallel YO
`Software". IPCCC '92 IEEE. pp. 0117-0124.
`
`T.H. Dunigan. Hypercube Clock Synchronization: Concur(cid:173)
`rency: Practice and Experience. vol. 4(3 ). pp. 257-268. May
`1992.
`T.H. Dunigan. ''Petformance of the Intel iPSC/860 and
`Ncube 6400 hypercubes*". Parallel Computing 17. pp.
`1285-1302. 1991.
`D.D. Gajski and J.K. Peir, "Essential Issues in Multiproces(cid:173)
`sor Systems", 1985 IEEE, pp. 9-27. Jun. 1985.
`A. Holman, 'The Meiko Computing Surface: A Parallel &
`Scalable Open Systems Platform for Oracle", A Study of a
`Parallel Database Machine and its Petformance-The NCR/
`Teradata DBC/1012. pp. 96-114 date unknown.
`Baba et al .. "A Parallel Object-Oriented Total Architecture:
`A-NET". Proceedings Supercomputing. Nov. 1990. pp.
`276-285.
`Mitchell et al .. "Architectural Description of a New. Easily
`Expandable Self-Routing Computer Network Topology",
`IEEE INFOCOM. Apr. 1989. pp. 981-988.
`K. Padmanabhan, "Hierarchical Communication in Cube(cid:173)
`-Connected Multiprocessors", The 10th International Con(cid:173)
`ference on Distributed Computing Systems. May 1990, pp.
`270-277.
`Fineberg er al .• "Experimental Analysis of Communication/
`Data-Conditional Aspects of a Mixed-Mode Parallel Archi(cid:173)
`tecture via Synthetic Computations". Proceeding Supercom(cid:173)
`puting '90, Nov. 1990, pp. 647-646.
`Kan et al., "Parallel Processing on the CAP: Cellular Array
`Processor", COMPCON 84, 16 Sep. 1984. pp. 239-244.
`Ezzedine et al .• "A 16-bit Specialized Proccor Design".
`Integration The VLSI Journal, vol. 6 No. 1, May 1988. pp.
`101-110.
`A. Mudrow, "High Speed Scientific Arithemetic Using a
`High Petformance Sequencer", ELECTRO, vol. 6. No. 11.
`1986, pp. 1-5.
`Alleyne et al., "A Bit-Parallel, Word-Parallel. Massively
`Parallel Accociative Processor for Scientific Computing",
`Third Symposium on the Frontiers of Massive Parallel
`Computation, Oct. 8-10. 1990; pp. 176-185.
`Jesshoppe et al., "Design of S™D Microprocessor Array".
`IEEE Proceedings, vol. 136., May 1989, pp. 197-204.
`DeGroot et al .. "Image Processing Using the Sprint Multi(cid:173)
`processon", IEEE, 1989, pp. 173-176.
`Nudd et al., "An Heterogeneous M-SIMD Architecture for
`Kalman Ftlter Controlled Processing of Image Sequences".
`IEEE 1992, pp. 842-845.
`Li et al., "Polmorphic-Torus Network". IEEE Transactions
`on Computers. vol. 38, No. 9, Sep. 1989 pp. 1345-1351.
`Li et al .• "Sparse Matrix Vector Multiplication of Polymor(cid:173)
`phic-Torus", IBM Technical Disclosure Bulletin, vol. 32.
`No.3A, Aug. 1989. pp. 233-238.
`Li et al., "Parallel Local Operator Engine and Fast P300",
`IBM Tech. Disc. Bulletin, vol. 32. No. 8B, Jan. 1990, pp.
`295-300.
`R. Duncan, "A Survey of Parallel Computer Architectures".
`IEEE.Feb. 90' pp.5-16.
`C.R. Jesshope et al., "Design of SIMD Microprocessor
`Array", UMI Article Clearing house. No. 88'.
`Senger Ilgen & Issac Sebers. ''Parallel Processing on VLSI
`Associative Memory". NSF Award #ECS-8404627. pp.
`50-53 date unknown.
`H. Stone. "Introduction to Computer Architecture", Science
`Research Associates, 1975. Ch. 8. pp. 318-374.
`R.M. Lea, "WASP: A WSI Associative String Processor"
`Journal of VLSI Signal Processing. May 1991. No. 4. pp.
`271-285.
`
`INTEL - 1007
`
`
`
`5,761,523
`Page 4
`
`Lea, R.M .. "ASP Modules: Cost-Effective Building-Blocks
`for Real-Time DSP Systems". Journal of VLSI Signal
`Processing. vol. 1. No. 1. Aug. 1989, pp. 69-84.
`Isaac D. Scherson. et al.. "Bit Parallel Arithmetic in a
`Massively-Parallel Associative Processor", IEEE. vol. 41.
`No. 10. Oct. 1992.
`Supreet Singh and Jia-Yuan Han. "Systolic arrays", IEEE,
`Feb. 1991.
`H. Richter and G. Raupp. "Control of a Tokamak Fusion
`Experiment by a Set of Multi top Parallel Computers", IEEE
`vol. 39. 1992, pp. 192-197.
`Higuchi et al., "IXM2: A Parallel Associative Processor for
`Semantic Net Processing-Preliminary Evaluation-",
`IEEE, Jun. 1990, pp. 667-673.
`Frison et al., "Designing Specific Systolic Arrays with the
`AP115C Chip". IEEE 1990, xii+808pp .• pp. 505-517.
`Berg et al., "Instruction Execution Trade-Offs for SIMD vs.
`MIMD vs. mixed Parallelism", IEEE Feb. 1991, pp.
`301-308.
`Raghaven et al., "Fine Grain Parallel Processors and Real(cid:173)
`-Time Applications: MIMD Controller.SIMD Array", IEEE,
`May 1990, pp. 324-331.
`
`G.J. Lipovski. "SIMD and MIMD Processing in the Texas
`Reconfigurable Array Computer". Feb. 1988, pp. 268-271.
`R.M. Lea, "ASP: A Cost-effective Parallel Microcomputer".
`IEEE Oct. 1988, pp. 1~29.
`Mark A. Nichols. "Data Management and Control-Flow
`Constructs in a SIMD/SPMD Parallel Language/Compiler".
`IEEE. Feb. 1990, pp. 397-406.
`Will R. Moore, "VLSI For Artificial Intelligence", Kluwer
`Academic Publishers, Ch. 4.1 date unknown.
`Mosher et al .• "A Software Architecture for Image Process(cid:173)
`ing on a Medium-Grain Parallel Machine". SPIE vol. 1659
`Image Processing and Interchange. 1992/279.
`Patent Abstracts of Japan, vol. 8, No. 105. 17 May 1984. p.
`274. App. No. JP-820 125 341 (fokyo Shibaura Denki KK)
`27 Jan. 1984.
`W.D. Hillis. "The Connection Macmne", The MIT Press.
`Chapters 1, 3, and 4 date unknown.
`"Joho-syori", vol. 26(3), 1985-3, pp. 213-225 (Japanese).
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun. 2, 1998
`
`Sheet 1 of 24
`
`5,761,523
`
`FIG.1 A
`Prior Art
`
`FPU
`
`SHIFT
`
`NORMALISE
`
`Z BUS
`
`I
`
`I MANTISSA
`I
`I ALU
`\
`{Y
`{xt
`.
`NORMALISE
`I I
`I
`ROUNDING
`I I
`I
`A REG
`
`~
`,,
`
`i,
`
`B REG
`C REG
`
`DATA BUS
`INTERFACE
`
`...
`~DIN BUS
`.
`"f T
`DOUT eus)
`,,,
`Ir)
`
`IEXPONENTII I Z BUS
`I ALU
`\
`{x)-
`{Y(
`S REG
`I I
`I I
`CONSTANTS
`I I
`I I
`A REG
`
`DIN BUS /
`,,,
`(oouT BUS
`"f
`
`1.,A--
`"-
`-.
`
`B REG
`C REG
`
`{'). FPopcode
`
`I
`
`INSTRUCTION
`STREAMER
`
`INSTRUCTION ~
`
`PTR
`OPERAND REG
`I I
`I I
`SCHEDULER
`WORKSPACE ~
`f'--r
`PTR
`
`...
`/
`'-,,
`
`.
`)
`,,,
`
`4 KBYTE
`RAM
`
`vu-
`l'r A
`D
`D
`R
`E
`s
`s
`
`8
`u
`s
`
`D
`A
`T
`A
`
`B
`u
`s
`
`FIG.1A
`
`FlG.18
`
`FIG.1
`
`~"
`,,/1----"...,
`
`DBUS
`
`-"-._
`
`'-r-,..' INTERLACE - ✓
`/'\
`
`D
`I
`N
`
`B
`u Lr-...__
`s r:/
`
`z
`k:= 8
`u
`s
`
`vt-
`t'v
`
`DATA REG
`
`IXI
`IYI
`A REG
`B REG
`C REG
`I I
`I I
`11'.,
`~ DATA IN REG
`DATA OUT REG
`DOUT BUS
`~ CHANNEL
`! I I I
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 2 of 24
`
`5,761,523
`
`u
`
`8
`u
`s
`
`I
`TIMERS
`{x}
`\
`
`ALU
`I
`
`ZY>
`
`I
`
`~
`B
`
`CON FIGURA TlON
`REGISTER &
`
`/
`TIMING CONTROL ' ...
`
`...
`' ,,,,
`,, -
`
`EXTERNAL
`MEMORY
`INTERFACE
`
`/1---
`"r-
`
`,I
`
`/ ' - "
`
`PTR REG
`
`COUNT REG
`u V w z
`PTR REG
`
`COUNT REG
`
`INPUT
`
`LOGIC
`
`OUTPUT
`
`LOGIC
`
`DATA REG - LINK
`DATA REG - LINK
`I LINKS I
`~vnwnz°
`- UNK3
`- - --
`
`LINK 1
`LINK 2
`
`I
`
`I
`
`LINKS
`
`ADDRESS
`REGISTERS
`
`-
`,/v'v'
`V W Z
`INSTRUCTION FETCH ADDRESS
`CHANNEL ADDRESS
`DATA ADDRESS
`
`FIG.18
`Prior Art
`
`INTEL - 1007
`
`
`
`100
`
`APPLICATION
`PROCESSOR 0
`
`110
`
`APPLICA llON
`PROCESSOR 1
`
`120
`
`APPLICATION
`PROCESSOR 2
`
`•
`•
`•
`130---... •
`
`APPLICATION
`PROCESSOR N
`
`140
`
`ADVANCED
`PARALLEL
`ARRAY
`PROCESSOR
`
`150
`160
`170
`180
`
`FIG.4
`
`I 111 I 111 I 111 I 111 I,,, I,, .I,, 1 I 11 ii
`
`CMOS GA TE ARRAY
`
`SCALABLE
`PARALLEL
`PROCESSOR
`CHIP
`
`32K X
`DRAM
`MACRO
`
`~
`
`I
`
`Ill
`
`FIG.2
`
`~
`00 •
`
`~ ...... a
`
`~
`
`~
`
`J"' -!
`
`00000000 0000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`ooooooo<t" o'oooooo-<
`00000000 00000000
`00000000 00000000
`-- ----P
`0 0 0 000 00 000 0~0 0 Orr- - -~
`00000000 0000 ooo ooo
`00000000 00000000 nnnn~n nnhon
`00000000 00000000
`00000000 0000000
`
`950
`
`00000000 0000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`
`FIG.16
`
`975
`
`00001000 00000000
`00000000 00000000
`00000000 00000000
`00000000 00000000
`0000 000 00000000
`00000000 00000000
`00000000 00000000
`,ooo~ooo,,, 00000000
`900
`
`ga
`i
`w
`~
`~
`
`01
`,_.
`-..,l
`~
`~ ,_.
`01
`N
`
`""'
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2,1998
`
`Sheet 4 of 24
`
`5,761,523
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun. 2, 1998
`
`Sheet 5 of 24
`
`5,761,523
`
`200
`
`APPUCATION
`PROCESSOR 0
`
`210
`
`APPLICATION
`PROCESSOR 1
`
`220
`
`APPLICATION
`PROCESSOR 2
`
`•
`•
`•
`
`230
`
`APPLICATION
`PROCESSOR N
`
`.
`•
`
`ARRAY N
`
`ARRAY 02
`
`250~
`r -
`ARRAY01REcToR- - 7
`I
`I
`12so
`ARRAY 01
`I.---_,____ .---........_-I,.....__ __
`I
`270
`I
`I APPLICATION
`ARRA y
`ARRAY 00
`PROCESSOR
`CONTROLLER &
`SYNCHRONIZER I
`I
`INTERFACE
`1--......... ----------11----..J-
`
`280
`
`." 310
`
`.
`
`• 300
`•
`
`290
`
`I
`_ __ _ __J
`
`240
`
`TEST/DEBUG
`DEVICE
`
`FIG.5
`
`SER u
`ER-
`INT
`FA
`CE
`
`-
`
`-
`
`HOST APPLICATION PROCESSOR
`
`usER PROGRAMsl-H
`
`PERFORMANCE & DEMO
`PROGRAMS
`
`COMPILE I EXECUTE
`I
`APPLICATION
`DEV LIB
`
`--
`
`RISC/6000 TEST & DEBUG MONITOR
`DEBUGGER
`PERFORMANCE MONITOR
`& ANALYSIS
`DIAGNOSTICS
`SIMULATOR
`ASSEMBLER /LINKER
`& LOADER
`
`TEST &
`i--, INTERFACE
`MONITOR
`
`-
`
`FIG.19
`
`ARRAY CTRLR
`
`f4-
`I~ HOST !'FACE
`,.... M'TOR l'F
`
`ARRAY l'F
`
`i-.-....
`
`PME
`ARRAY
`
`-
`
`-
`
`- RS/6000
`MPP
`SIMULATOR
`
`I
`
`I
`
`INTEL - 1007
`
`
`
`~
`
`8/4
`
`AWACS
`AEW
`RTALS
`
`µ
`CHANNEL
`CARD
`
`CLUSTER
`••••••••
`••••••••
`••••••••
`••••••••
`••••••••
`••••••••
`••••••••
`••••••••
`
`512 PROCESSORS
`
`340
`SMALL FINE
`GRAIN PARALLEL
`PROCESSOR
`
`RS/6000
`
`313
`
`314
`
`' ' ' ' '
`8 PROCESSORS
`
`EIGHT - 1 BYTE BIDI PORTS i
`
`z
`0 v;
`z w
`:::::;
`0
`N
`
`WOXY
`
`W1XY
`
`W2XY
`
`W3XY
`
`W4XY
`
`W5XY
`
`W6XY
`
`W7XY
`
`/
`
`~
`
`LARGE
`FINE
`GRAIN
`PARALLEL
`PROCESSOR
`
`/
`
`/
`
`370
`
`SINGLE
`PROCESSOR UNIT
`f
`I
`
`32k
`HALFWORD
`MEMORY
`
`16 BIT
`PROCESSOR
`
`300
`
`301
`
`320
`
`30
`
`~ETWORK NODE /
`
`31 O
`
`1
`PROCESSOR
`
`~gglM 1 ... 1 --> 64 NODES
`
`-
`W DIMENSION
`OZXY
`2ZXY
`4ZXY
`6ZXY
`3ZXY
`SZXY
`1ZXY
`7ZXY
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• ••• ,.---36(
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• •• • ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`••• ••• ••• ••• ••• ••• ••• •••
`32,768 PROCESSING ELEMENT
`FIG.6
`
`Cj
`• rJJ. •
`~ = """" g
`
`""""
`
`~
`~ ....
`i
`
`00 =(cid:173)~
`
`~
`0"I
`~
`N
`.a:,.
`
`01
`"' ....;:a
`="',
`~
`"' 01
`N w
`
`INTEL - 1007
`
`
`
`400
`
`INTERNAL
`INTERFACE
`PORTS(3)
`
`425
`
`MULTIPLEXOR
`. - - - - - - 430
`PC
`REGISTER
`
`AR
`REGISTER
`
`420
`
`MEMORY
`
`32K X 16
`
`435
`
`440
`
`·450
`
`M
`REGISTER
`
`OP
`REGISTER
`
`410
`EXTERNAL
`INTERFACE
`PORT
`
`406
`
`405
`
`407
`MULTIPLEXOR
`
`408
`
`OUT
`REGISTER
`
`EXTERNAL
`INTERFACE
`PORTS
`490
`
`INTERNAL
`OUTPUT
`PORTS (3)
`480
`
`FIG.7
`
`465
`
`OP9-11
`
`OP5-7
`
`MULTIPLEXOR
`
`445
`
`.-----L~--L- 460
`
`ALU
`
`DELTA 13-15
`
`455
`
`CSTAT
`
`M
`u
`X
`
`SCALE
`
`470--.....i WR REGISTER
`
`~ • 00
`• ;p
`.....
`B
`.....
`
`~
`
`~
`N
`" .....
`\0
`~
`
`00 =-
`ii
`~
`~
`~
`
`...
`'-II
`~ "" ~ ...
`'-II
`~
`
`INTEL - 1007
`
`
`
`Ml
`
`DB
`
`~~~ I I
`~ I GR
`I
`RAL
`REGISTER
`16 X 16
`i
`I RA
`01,~ + I I I I
`
`LI
`
`_J I I
`Rb
`Ra
`ooox ..
`
`I
`/ 16 BCI
`
`~ 2WR
`
`COUT--1
`
`ALU
`
`16
`-f-
`1--CIN
`
`ch
`
`Ml
`
`MAIN STORE
`32K X 18
`
`~h-1-i
`I
`
`/ 18
`
`I
`
`p +
`
`~ • rJ1 •
`~ =
`~ a
`
`~
`
`~
`
`~
`
`~N
`"'""'
`\0
`\0
`00
`
`OUT
`
`rJ"J =-
`....
`
`('I)
`('I)
`
`00
`0
`~
`N
`~
`
`I
`
`XMIT
`
`L1 R2
`
`I
`
`L1 R21
`
`16
`I
`IN BUS
`
`DB
`
`I
`
`r-DEL
`
`Ml
`16
`
`16
`
`FIG.8
`
`FF
`(SOURCE)
`s
`(CIRCUIT SWITCH)
`TT
`(DESTINATION)
`
`L R V X
`
`L R V X
`
`~
`
`tit
`-...J
`0"-,
`io-
`~ tit
`N w
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 9 of 24
`
`5,761,523
`
`500
`
`510
`
`501
`
`515
`
`521,.
`•
`
`•
`
`520
`
`+W
`
`511~.
`.
`•
`
`535
`
`530
`
`531
`
`592
`
`591
`
`590
`
`593
`
`565
`
`-x
`
`550
`
`541
`
`-Y
`
`540
`
`561\ •
`•
`
`•
`
`571
`
`-w
`
`560
`
`551\.
`•
`
`•
`
`575
`
`-z
`
`570
`
`FIG.9
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 10 of 24
`
`5,761,523
`
`+X
`
`+w PE•------
`
`-W
`
`-x
`
`-Y
`
`+z
`
`FIG.10
`
`+y
`
`j
`
`'
`PE
`
`( +y)
`
`INTER
`PE
`BUS
`L_
`
`81ST
`BUS
`_J
`,
`
`-
`
`PE
`
`(-y)
`
`~
`
`-y
`
`+x
`
`PE
`
`( +x)
`
`'
`
`I
`
`PE
`
`{-x)
`
`-x
`FIG.11
`
`-z
`
`+z
`
`j
`
`'
`
`PE
`
`( +z)
`
`I-<•
`
`--
`
`1 1
`
`PE
`
`(-z)
`
`--
`
`-z
`
`EXTERNAL
`PORTS
`
`--
`
`+w
`
`1
`
`PE
`
`(+w)
`I
`
`I
`
`-
`CMD/INST
`
`::
`
`B'CAST
`I' I
`FACE
`I -
`tJ
`
`SERREQ
`.;;
`
`;;
`
`~
`
`SERIAL LOOPS
`
`-
`
`EXTERNAL
`PORTS
`
`• •
`.
`•
`
`I
`PE
`
`-
`
`(-w)
`
`I
`
`'
`-w
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 11 of 24
`
`5,761,523
`
`p-o~---------7
`I
`I
`ARRA y DIRECTOR
`I
`I sJo
`I
`APPUCA TION
`I
`
`640
`
`CLUSTER
`
`PROCESSOR
`
`FAST 1/0 (ZIPPMER)
`620
`
`620
`
`600
`
`APPLICATION
`
`PROCESSORS
`
`CLUSTER
`
`660
`
`SYNCHRONIZER
`
`CONTROLLER
`INTERFACE
`------ ____ .,....., I
`650
`1sc1
`I
`I
`I
`I
`I
`I
`_______ J
`
`MCA
`
`605
`
`ARRAY CLUSTERS
`
`605
`
`0 1
`
`0 2
`
`•
`•
`•
`•
`•
`•
`~90
`
`ARRAY CLUSTERS
`
`695
`
`MCA BACKPLANE
`
`FIG.1.2
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 12 of 24
`
`5,761,523
`
`CLKS · · · · · · · · · · · · · ·
`
`CONT'R
`
`Bl-DIRECTIONAL
`TRISTATE DRIVERS
`(CONTROLLER
`ENABLE)
`
`8 X 8 NODE
`CLUSTER
`
`WlTH x AND y
`PATHS DOTTED
`WlTH
`SYSTEM BUS
`AT EDGE
`
`. . .
`
`•
`
`•
`
`FIG.13A
`PE xO PE x1 PE x2 PE x3
`
`PE x12 PE x13 PE x14 PE x15
`
`SYSTEM
`BUS
`
`•
`
`•
`
`DRIVER
`
`FIG.138
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 13 of 24
`
`5,761,523
`
`700
`ZIPPER BUS
`
`710
`
`WORD
`
`770
`
`wd i
`
`word i+1
`
`word i+2
`
`word i+n
`
`720
`
`751
`
`730
`
`740
`
`753
`
`750
`
`754
`
`NODE
`1, 1,x,y
`
`NODE
`1,2,x,y
`
`NODE
`1,3,x,y
`
`....
`
`NODE
`1,n,x,y
`
`780
`
`755
`
`NODE
`2,2,x,y
`
`....
`
`790
`
`• •
`
`• .
`• •
`
`NODE
`n,1,x,y
`
`NODE
`n,3,x,y
`
`••••
`
`NODE
`n,n,x,y
`
`FIG.14
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun. 2, 1998
`
`Sheet 14 of 24
`
`5,761,523
`
`Q
`O'.)
`
`0
`@
`
`(/) z
`0
`WI=
`Ou
`~w z
`I- z -o
`CDc_,
`z a:::
`v~
`<Dz
`
`(/) z
`0
`(/)
`z w
`
`~
`0
`(/)
`
`(/) w
`(/) a::
`0:: Cl
`~ Cl
`(/) <
`:::,
`...J
`d ~
`w u
`Cl 0
`0
`N
`z
`v Cl
`
`>(cid:173)
`x
`
`tO a
`
`0
`N
`
`....J
`
`<(
`
`I ~
`v
`tO
`
`(I)
`
`bit('c('c(' -.-: ,,,t:('c('d
`(cid:143)(cid:143)(cid:143)(cid:143)
`B (cid:143)(cid:143)
`(cid:143)(cid:143)(cid:143)(cid:143)
`(cid:143)~
`(cid:143)
`(cid:143)
`0
`___ oo ODDO ODO~
`D D El El
`0001-
`El (cid:143) (cid:143) (cid:143)
`(cid:143) (cid:143) (cid:143)~
`(cid:143)
`(cid:143)
`(cid:143)
`DO (cid:143)
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun. 2, 1998
`
`Sheet 15 of 24
`
`5,761,523
`
`HOW WOULD A 16 ELEMENT SORT REPEAT THE PATTERN?
`STAG
`2 3
`E 1
`4 5 6
`7
`8 9 10
`0
`+
`•
`+
`+
`1
`2
`•
`3
`4
`+
`5
`6
`•
`7
`8
`9
`10
`11
`12
`13
`14
`15
`
`+
`
`•
`
`•
`
`+
`
`•
`
`' •
`
`•
`, t
`
`1 •
`I
`
`•
`
`I
`
`+
`
`+
`
`•
`
`•
`
`+
`
`I•
`
`•
`
`•
`
`'
`'
`
`I
`
`I
`
`,1
`+ +
`
`' I
`
`+ +
`
`1000
`
`+
`
`fl
`+ •
`I •
`H
`
`1 ' '
`
`1100
`
`do J == 0 to I
`
`-
`
`-
`
`J
`
`else TARGET = PE#-2'- J
`
`FOR SORTING n DATA ELEMENTS (n E {2':i EN,2's # OF PE'Sl)
`do I = 0 to (log
`n) - 1
`2
`if (PE# /2 1
`J ) %2 = 0
`then TARGET = PE#+2 1
`send DATA to TARGET
`receive data store in TEMP (If data is not available - wait)
`.
`PE#
`~
`rf (2(( 21 + 1) %2) + (( 2,+ 1) %2) +1) %2 = 0
`then if TEMP < DATA then DATA = TEMP else NOP
`then if TEMP> DATA then DATA= TEMP else NOP
`end bothe do's
`
`FIG.17
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun. 2, 1998
`
`Sheet 16 of 24
`
`5,761,523
`
`HOST
`PROCESSOR ..:_
`
`J
`
`HOST
`MEMORY
`
`DATA AND
`COMMANDS
`
`APPLICATION PROCESSOR
`INTERFACE
`API
`
`ARRAY
`CONTROLLER
`
`CLUSTER
`CONTROLLER 0
`cc
`{PORT TYPE)
`
`CLUSTER
`CONTROLLER 1
`cc
`(NON-PORTED)
`
`.....
`
`CLUSTER
`CONTROLLER N
`cc
`
`/64+P DATA
`PORT
`
`I
`NORMAL
`/16+
`BROADCAST B'CAST
`& STA rus
`
`'
`
`'
`STATUS MONITOR
`SERIAL LOOP
`
`CLUSTER 0
`
`64 NODES
`
`(512 PME's)
`
`CLUSTER 1
`
`CLUSTER N
`
`PME
`ARRAY
`
`FlG.18
`
`COMMAND
`DATA
`
`/n DATA
`(u-CHANNEL)
`
`CLUSTER
`SYNCHRONIZER
`cs
`........ -........ .-------------1,---------,
`,
`
`.
`.
`(OPTIONAL U
`CHANNEL
`DEVICES, DASO.
`DISPLAYS,
`GATEWAYS,
`ETC.)
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 17 of 24
`
`5,761,523
`
`VECTORIZED
`APPLICATION
`DEVELOPER ...... HIGH ORDER
`LANG SOURCE
`
`•
`
`....
`
`I
`HOST
`COMPILER
`
`APPLICATION
`!
`DEVELOPMENT
`LIBRARY
`I
`'
`HOST
`: EX. BLAS, ESSL, EXECUTION
`PARALLEL
`...
`!
`FUNCTlONS,
`APPLICATION ····AND
`OPTIMIZER
`ETC.
`
`APPLICATION
`PROCESSOR
`INTERFACE
`
`--------------7
`
`I
`I
`
`,.
`
`· ... VECTORIZED
`- EMULATOR
`
`FUNCTIONAL
`
`(APJ CODE)
`
`CUSTOMIZER
`OR
`PROD. DEV'R
`
`... ·A LL
`
`~
`
`I
`I
`
`,-
`
`I
`I
`I
`I
`I
`
`PARALLEL
`·• OPERATION
`CONTROL CODE
`
`,____
`
`(PME CODE)
`
`FIG. 20
`
`I
`1
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`
`I
`
`CLUSTER
`SYNCHRONIZER
`
`•
`• ******
`*
`*
`*
`*
`•
`*
`*
`•
`•
`CLUSTER
`CONTROLLERs *
`(n)
`* ,.
`•
`~
`PARALLEL l/0 l'F ACE
`
`,~
`
`B'CST
`INTER
`FACE
`
`PROCES~NG ELEMENT
`ARRAY
`
`(n CLUSTERS OF 512 PMEs)
`
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`
`I
`
`PARALLEL PROCESSOR
`L-------------------------~
`
`INTEL - 1007
`
`
`
`U.S. Patent
`
`Jun.2, 1998
`
`Sheet 18 of 24
`
`5,761,523
`
`APPLICATION
`DEVELOPER
`
`·OR
`
`SEQUENTIAL
`FORTRAN
`SOURCE
`
`VECTORIZED
`HIGH ORDER a------
`1
`LANG SOURCE
`•
`
`HOST
`COMPILER
`
`APPUCA TION
`DEVELOPMENT i - - - - -
`LIBRARY
`____..____,~____,
`HOST
`EXECUTION
`
`l
`
`PARALLEL
`FORTRAN
`COMPILER
`SYSTEM
`
`APPLICATION_ ... AND
`OPTIMIZER
`.
`
`',. -
`
`CUSTOMIZER
`OR
`PROD. DEVR
`
`... ·A
`LL
`
`...
`
`FIG.21
`
`CLUSTER
`SYNCHRONIZER
`
`(API CODE)
`
`I
`I
`I
`I
`I
`
`PARALLEL
`OPERATION
`CONTROL CODE
`
`,___
`
`(PE CODE)
`
`~
`: EX. BLAS, ESSL.
`l
`PARALLEL
`FUNCTIONS,
`ETC·......-------J"-----
`APPLICATION
`~-----7
`PROCESSOR
`I
`INTERFACE
`I
`I
`~------,
`i . -_ _.., ___ .,__ .... 1~
`I
`I
`I
`I
`I
`!***** I
`I
`VECTORIZED -
`*
`I
`* I
`FUNCTIONAL
`I
`EMULATOR
`*
`I
`*
`I
`I
`*
`I
`*
`I
`I
`*
`I
`,~
`*
`I
`- - ' - - - - - -1
`CLUSTER
`*
`I
`/ i
`CONTROLLERs *
`:
`(n)
`*
`I 1
`I
`'----------il.__ __ _,I I
`•
`L.: _____ ..J
`:
`*
`
`r7
`
`I
`I
`I
`I
`I
`I
`
`I
`I
`I
`I
`I
`I
`
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I
`I I
`I
`I
`
`'
`PARALLEL 1/0 !'FACE
`
`'
`
`B'CST
`INTER
`FACE
`
`PROCESSING ELEMENT
`ARRAY
`
`I
`I
`I
`I
`.___...i-_______ _.,;....Jf
`(n CLUSTERS OF 512 PEs)
`I
`
`:
`PARALLEL PROCESSOR
`:
`L-------------------------~
`
`INTEL - 1007
`
`
`
`SUBSYSTEM
`CONTROLLER
`
`WORKSTATION
`HOST
`
`ls=To-. s_us__Jj1NTERFACE I
`
`BUS
`
`ARRAY CF MIMD/SIMO CHIPS
`-,
`1/0
`INTERFACE. LOGIC
`(MCA. VME ___ __1
`. . . )
`.
`
`HISPEED
`BUS 1/F
`1/0 ,__ ______ __.
`
`• •
`
`POTENTIAL
`HISPEED
`BUS
`
`~ •
`rJ1
`•
`~
`"""'
`("D = """'
`
`~
`
`~N
`
`~
`....
`i 00
`
`g:
`~ a ....
`
`\0
`~
`t
`
`tll
`....
`.....:a
`~
`~ ....
`tit
`~
`
`CROSS FLOW, 115 VOLT/60 Mx
`BLOWER/GRILL ASSEMBLY
`ebm EQRG-38-2-I15
`DUAL 1/2 CLUSTER
`AIR FLOW
`ASSEMBLY
`EXHAUST
`LOUVERS
`
`CIRCUIT
`BREAKER
`AC POWER
`FDWER DIST.
`
`A COPIER POWER
`SUPPLY 80 AMPS/+ 5 VOLT
`
`FIG.22
`
`DUAL 1/2
`CLUSTER ASSEMB-.Y
`
`AC POWER CORD
`LOCATION
`
`~Jj]
`
`...._,____~"----
`
`INTEL - 1007
`
`
`
`OBSV
`SET
`1
`
`INITIAL
`08S'TION i---
`GATING
`
`~
`
`OBSER'N
`KALMAN
`TO ~ FILTER __.,
`TRACK
`MATCHING
`
`LOCAL
`TRACK -
`FILE
`
`t
`
`I
`
`OBSV
`SET
`2
`
`OBSV
`SET
`n
`
`-
`
`INITIAL
`
`GATING
`
`OBS'TION -
`t
`08S'TION -
`t
`
`INITIAL
`
`GATING
`
`OBSER'N
`KALMAN
`LOCAL
`TO
`TRACK ~ FILTER ~
`TRACK
`FILE
`MATCHING
`
`,..__., MERGE
`LOCAL
`-
`TRACK
`~ FILES
`
`-
`
`CENTER'ZD
`TRACK
`FILE
`
`-
`
`I
`
`TO - FILTER ~
`
`OBSER'N
`
`KALMAN
`
`TRACK
`MATCHING
`
`LOCAL
`TRACK -
`FILE
`
`[ OR 1
`
`FIG.23
`
`~ • rJ).
`•
`
`~ = I
`
`;...
`
`~
`~
`~
`
`~
`00
`
`00 r
`~