Tài liệu PARALLEL COMPUTER ARCHITECTURES-8 doc

43 294 0
Tài liệu PARALLEL COMPUTER ARCHITECTURES-8 doc

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

Thông tin tài liệu

8 PARALLEL COMPUTER ARCHITECTURES 1 CPU (a) P P P P PPPP P P P P P P P P (b) P P P P PPPP P P P P P P P P Shared memory Figure 8-1. (a) A multiprocessor with 16 CPUs sharing a com- mon memory. (b) An image partitioned into 16 sections, each being analyzed by a different CPU. CPU (a) P P P P M M M M PPPP MMMM P P P P M M M M P P P P M M M M Message- passing interconnection network CPU (b) P P P P PPPP P P P P P P P P Message- passing interconnection network Private memory Figure 8-2. (a) A multicomputer with 16 CPUs, each with each own private memory. (b) The bit-map image of Fig. 8-1 split up among the 16 memories. (a) Machine 1 Machine 2 Language run-time system Operating system Shared memory Application Hardware Language run-time system Operating system Application Hardware (b) Machine 1 Machine 2 Language run-time system Operating system Shared memory Application Hardware Language run-time system Operating system Application Hardware (c) Machine 1 Machine 2 Language run-time system Operating system Shared memory Application Hardware Language run-time system Operating system Application Hardware Figure 8-3. Various layers where shared memory can be im- plemented. (a) The hardware. (b) The operating system. (c) The language runtime system. (a) (c) (e) (g) (b) (d) (f) (h) Figure 8-4. Various topologies. The heavy dots represent switches. The CPUs and memories are not shown. (a) A star. (b) A complete interconnect. (c) A tree. (d) A ring. (e) A grid. (f) A double torus. (g) A cube. (h) A 4D hypercube. CPU 1 End of packet Middle of packet A Input port Output port Front of packet Four-port switch B CD CPU 2 Figure 8-5. An interconnection network in the form of a four- switch square grid. Only two of the CPUs are shown. CPU 1 Input port (a) Output port Entire packet Entire packet Four-port switch C A CPU 2 Entire packet D B (b) C A D B (c) C A D B Figure 8-6. Store-and-forward packet switching. CPU 1 CPU 2 CPU 3 A C B D Input port Output buffer Four-port switch CPU 4   ,  , Figure 8-7. Deadlock in a circuit-switched interconnection network. 6050403020100 60 50 40 30 20 10 0 Speedup Linear speedup N-body problem Awari Skyline matrix inversion Number of CPUs Figure 8-8. Real programs achieve less than the perfect speed- up indicated by the dotted line. (a) n CPUs active 1 CPU active 1 – ff T Inherently sequential part (b) 1 – ff Potentially parallelizable part … fT (1 – f)T/n Figure 8-9. (a) A program has a sequential part and a parallel- izable part. (b) Effect of running part of the program in paral- lel. [...]... Vector supercomputer, array processor Arguably none Multiprocessor, multicomputer Figure 8-13 Flynn’s taxonomy of parallel computers Parallel computer architectures SISD SIMD MISD (Von Neumann) MIMD ? Vector processor Array processor UMA Bus Multiprocessors COMA Switched Multicomputers NUMA CC-NUMA Shared memory NC-NUMA MPP Grid COW Hypercube Message passing Figure 8-14 A taxonomy of parallel computers... P9 (a) (b) (c) (d) Figure 8-11 Computational paradigms (a) Pipeline (b) Phased computation (c) Divide and conquer (d) Replicated worker P3 Physical (hardware) Multiprocessor Multiprocessor Multicomputer Multicomputer Logical (software) Shared variables Message passing Shared variables Message passing Examples Image processing as in Fig 8-1 Message passing simulated with buffers in memory DSM, Linda,... line is shown cached at three nodes CPU Node Memory … … Local interconnect Disk and I/O … Local interconnect Communication processor High-performance interconnection network Figure 8-34 A generic multicomputer Disk and I/O Network Disk Tape GigaRing Alpha Shell Node Mem Alpha Mem Control + E registers Control + E registers Commun processor Alpha … Commun processor Full-duplex 3D torus Figure 8-35 The . of parallel computers. SISD (Von Neumann) SIMD Parallel computer architectures MISD ? MIMD Vector processor Array processor Multi- processors Multi- computers. sequential part (b) 1 – ff Potentially parallelizable part … fT (1 – f)T/n Figure 8-9. (a) A program has a sequential part and a parallel- izable part. (b) Effect

Ngày đăng: 12/12/2013, 09:15

Từ khóa liên quan

Tài liệu cùng người dùng

Tài liệu liên quan