Appendix H: Hardware and Software for VLIW and EPIC
Authors: John Hennessy & David Patterson
Copyright 2011, Elsevier Inc. All rights reserved.
Figure H.1 A software-pipelined loop chooses instructions from different loop iterations, thus separating the dependent instructions within one iteration of the original loop. The start-up and finish-up code will correspond to the portions above and below the software-pipelined iteration.
Figure H.2 The execution pattern for (a) a software-pipelined loop and (b) an unrolled loop. The shaded areas are the times when the loop is not running with maximum overlap or parallelism among instructions. This occurs once at the beginning and once at the end for the software-pipelined loop. For the unrolled loop it occurs m/n times if the loop has a total of m iterations and is unrolled n times. Each block represents an unroll of n iterations. Increasing the number of unrollings will reduce the start-up and clean-up overhead. The overhead of one iteration overlaps with the overhead of the next, thereby reducing the impact. The total area under the polygonal region in each case will be the same, since the total number of operations is just the execution rate multiplied by the time.
Figure H.3 A code fragment and the common path shaded with gray. Moving the assignments to B or C requires a more complex analysis than for straight-line code. In this section we focus on scheduling this code segment efficiently without hardware assistance. Predication or conditional instructions, which we discuss in the next section, provide another way to schedule this code.
Figure H.4 This trace is obtained by assuming that the program fragment in Figure H.3 is the inner loop and unwinding it four times, treating the shaded portion in Figure H.3 as the likely path. The trace exits correspond to jumps off the frequent path, and the trace entrances correspond to returns to the trace.
Figure H.5 The superblock that results from unrolling the code in Figure H.3 four times. A superblock has a single entry point but may have multiple exits.
Figure H.11 The performance of four multiple-issue processors for five SPECfp and SPECint benchmarks. The clock rates of the four processors are 1.5 GHz for the Itanium 2, 3.8 GHz for the Pentium 4 Extreme Edition, 2.8 GHz for the AMD Athlon 64, and 1.9 GHz for the IBM Power5.