Processor Performance
Understanding Performance
Algorithm
Determines number of operations executed
1.4 Performance
Defining Performance
Which airplane has the best performance?
[Figure: bar charts comparing the Boeing 777, Boeing 747, BAC/Sud Concorde, and Douglas DC-8-50. Surviving panel labels: Passenger Capacity; Passengers x mph.]
Throughput
Total work done per unit time
e.g., tasks or transactions per hour
Relative Performance
Define Performance = 1 / Execution Time
"X is n times faster than Y" means
$$\frac{\text{Performance}_X}{\text{Performance}_Y} = \frac{\text{Execution time}_Y}{\text{Execution time}_X} = n$$
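A minimal sketch of the ratio above, assuming both machines run the same program. The function name `relative_performance` and the 10 s and 15 s timings are hypothetical values chosen for illustration, not figures from the slides.

```python
def relative_performance(exec_time_x: float, exec_time_y: float) -> float:
    """Return n such that X is n times faster than Y.

    Performance is defined as 1 / execution time, so
    Performance_X / Performance_Y = exec_time_y / exec_time_x.
    """
    if exec_time_x <= 0 or exec_time_y <= 0:
        raise ValueError("execution times must be positive")
    return exec_time_y / exec_time_x


# Hypothetical timings: X takes 10 s and Y takes 15 s for the same program.
n = relative_performance(10.0, 15.0)
print(f"X is {n:.1f} times faster than Y")  # prints "X is 1.5 times faster than Y"
```

Note that the faster machine's execution time goes in the denominator, so n is greater than 1 when X is the faster one.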
CPU time
Time spent processing a given job
Discounts I/O time and other jobs' shares
Comprises user CPU time and system CPU time
Different programs are affected differently by CPU and system performance
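One way to see the difference between elapsed time and CPU time is to measure both around a small workload. The sketch below uses Python's standard `os.times()` and `time.perf_counter()`; the helper names `busy_work` and `measure_times` and the workload sizes are assumptions made for illustration.

```python
import os
import time


def busy_work(n: int) -> int:
    """CPU-bound loop: this time is charged to the process as user CPU time."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def measure_times(sleep_s: float) -> tuple[float, float, float]:
    """Return (elapsed, user_cpu, system_cpu) in seconds for a small workload."""
    start_wall = time.perf_counter()
    start_cpu = os.times()

    busy_work(200_000)   # consumes CPU time
    time.sleep(sleep_s)  # waiting: adds elapsed time but almost no CPU time

    end_cpu = os.times()
    end_wall = time.perf_counter()

    elapsed = end_wall - start_wall
    user_cpu = end_cpu.user - start_cpu.user
    system_cpu = end_cpu.system - start_cpu.system
    return elapsed, user_cpu, system_cpu


elapsed, user_cpu, system_cpu = measure_times(0.05)
print(f"elapsed:    {elapsed:.3f} s")
print(f"user CPU:   {user_cpu:.3f} s")
print(f"system CPU: {system_cpu:.3f} s")
```

The sleep stands in for I/O or time given to other jobs: it lengthens elapsed time while the CPU time of this process barely changes. Exact numbers depend on the machine and on the operating system's timer resolution.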
Chapter 1 Computer Abstractions and Technology 6
CPU Clocking
Operation of digital hardware governed by a constant-rate clock
[Figure: clock timing diagram. Within each clock period (cycle), data transfer and computation take place, followed by a state update.]
CPU Time
$$\text{CPU Time} = \text{CPU Clock Cycles} \times \text{Clock Cycle Time} = \frac{\text{CPU Clock Cycles}}{\text{Clock Rate}}$$
Performance improved by
Reducing number of clock cycles
Increasing clock rate
Hardware designer must often trade off clock rate against cycle count
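The CPU time relation and the clock rate versus cycle count trade-off can be sketched as follows. The cycle counts and clock rates are hypothetical values picked to make the arithmetic easy to follow, and the function names are illustrative only.

```python
def cpu_time_from_rate(clock_cycles: float, clock_rate_hz: float) -> float:
    """CPU Time = CPU Clock Cycles / Clock Rate (seconds)."""
    return clock_cycles / clock_rate_hz


def cpu_time_from_period(clock_cycles: float, clock_cycle_time_s: float) -> float:
    """CPU Time = CPU Clock Cycles x Clock Cycle Time (seconds)."""
    return clock_cycles * clock_cycle_time_s


# Hypothetical baseline: 20e9 cycles on a 2 GHz clock (0.5 ns period).
baseline = cpu_time_from_rate(20e9, 2e9)
same_via_period = cpu_time_from_period(20e9, 0.5e-9)

# Hypothetical redesign: the clock rate doubles to 4 GHz, but the faster
# design needs 20% more cycles for the same program.
redesign = cpu_time_from_rate(1.2 * 20e9, 4e9)

print(f"baseline: {baseline:.1f} s")
print(f"baseline via cycle time: {same_via_period:.1f} s")
print(f"redesign: {redesign:.1f} s")
```

In this made-up case the redesign still wins, because the doubled clock rate outweighs the 20% increase in cycle count. With a larger cycle penalty the result could go the other way.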
CPI Example
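Computer A: Cycle Time = 250ps, CPI = 2.0
Computer B: Cycle Time = 500ps, CPI = 1.2
Same ISA
Which is faster, and by how much?
$$\text{CPU Time}_A = \text{Instruction Count} \times \text{CPI}_A \times \text{Cycle Time}_A = I \times 2.0 \times 250\,\text{ps} = I \times 500\,\text{ps}$$
$$\text{CPU Time}_B = \text{Instruction Count} \times \text{CPI}_B \times \text{Cycle Time}_B = I \times 1.2 \times 500\,\text{ps} = I \times 600\,\text{ps}$$
A is faster
$$\frac{\text{CPU Time}_B}{\text{CPU Time}_A} = \frac{I \times 600\,\text{ps}}{I \times 500\,\text{ps}} = 1.2$$
by this much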
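The comparison can be checked numerically. Because both computers implement the same ISA, the instruction count I is identical and cancels in the ratio, so the sketch below works in picoseconds per instruction. The function name is an assumption made for illustration.

```python
def time_per_instruction_ps(cpi: float, cycle_time_ps: float) -> float:
    """Average time per instruction = CPI x Cycle Time (picoseconds)."""
    return cpi * cycle_time_ps


# Values from the example: A has CPI 2.0 at 250 ps, B has CPI 1.2 at 500 ps.
time_a = time_per_instruction_ps(2.0, 250.0)
time_b = time_per_instruction_ps(1.2, 500.0)

faster = "A" if time_a < time_b else "B"
ratio = max(time_a, time_b) / min(time_a, time_b)

print(f"A: I x {time_a:.0f} ps, B: I x {time_b:.0f} ps")
print(f"{faster} is faster by a factor of {ratio:.1f}")
```

Computer B has the lower CPI, yet A finishes sooner because its cycle time is half as long. Neither CPI nor cycle time alone decides which machine is faster.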
CPI Example
Alternative compiled code sequences using instructions in classes A, B, C
Class                 A    B    C
CPI for class         1    2    3
IC in sequence 1      2    1    2
IC in sequence 2      4    1    1
Sequence 1: IC = 5
Clock Cycles = 2×1 + 1×2 + 2×3 = 10
Avg. CPI = 10/5 = 2.0
Sequence 2: IC = 6
Clock Cycles = 4×1 + 1×2 + 1×3 = 9
Avg. CPI = 9/6 = 1.5
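The two sequences can be tallied with a short script. The per-class CPI values and instruction counts come from the table in this example; the dictionary layout and the `summarize` helper are assumptions made for illustration.

```python
# CPI for each instruction class, from the example table.
CPI_BY_CLASS = {"A": 1, "B": 2, "C": 3}

# Instruction counts per class for each compiled code sequence.
SEQUENCE_1 = {"A": 2, "B": 1, "C": 2}
SEQUENCE_2 = {"A": 4, "B": 1, "C": 1}


def summarize(counts: dict[str, int]) -> tuple[int, int, float]:
    """Return (instruction count, clock cycles, average CPI) for a sequence."""
    instruction_count = sum(counts.values())
    clock_cycles = sum(n * CPI_BY_CLASS[cls] for cls, n in counts.items())
    return instruction_count, clock_cycles, clock_cycles / instruction_count


for name, counts in (("Sequence 1", SEQUENCE_1), ("Sequence 2", SEQUENCE_2)):
    ic, cycles, avg_cpi = summarize(counts)
    print(f"{name}: IC = {ic}, Clock Cycles = {cycles}, Avg. CPI = {avg_cpi}")
# Sequence 1: IC = 5, Clock Cycles = 10, Avg. CPI = 2.0
# Sequence 2: IC = 6, Clock Cycles = 9, Avg. CPI = 1.5
```

Sequence 2 executes more instructions but needs fewer clock cycles, so on the same clock it is the faster of the two. Instruction count alone is not a reliable measure of performance.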
Performance Summary
The BIG Picture
$$\text{CPU Time} = \frac{\text{Instructions}}{\text{Program}} \times \frac{\text{Clock cycles}}{\text{Instruction}} \times \frac{\text{Seconds}}{\text{Clock cycle}}$$
Performance depends on
Algorithm: affects IC, possibly CPI
Programming language: affects IC, CPI
Compiler: affects IC, CPI
Instruction set architecture: affects IC, CPI, Tc (clock cycle time)
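As a rough illustration of how the three factors combine, the sketch below compares two builds of the same program on one machine. Every number here is hypothetical, as is the `Workload` class: the second build stands in for a compiler that emits fewer instructions at a slightly higher CPI, with the clock cycle time unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    """The three factors of the CPU time equation (all values hypothetical)."""
    instruction_count: float  # instructions per program (IC)
    cpi: float                # average clock cycles per instruction
    cycle_time_s: float       # seconds per clock cycle (Tc)

    def cpu_time(self) -> float:
        """CPU Time = IC x CPI x Tc, in seconds."""
        return self.instruction_count * self.cpi * self.cycle_time_s


# Same machine (Tc = 250 ps), two hypothetical compiler outputs.
build_1 = Workload(instruction_count=1.0e9, cpi=2.0, cycle_time_s=250e-12)
build_2 = Workload(instruction_count=0.8e9, cpi=2.2, cycle_time_s=250e-12)

for name, build in (("build 1", build_1), ("build 2", build_2)):
    print(f"{name}: {build.cpu_time():.3f} s")
```

Here the 20% drop in IC outweighs the 10% rise in CPI, so the second build is faster. Since a compiler change moves both factors at once, only the product tells which build wins.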
Power Trends
In CMOS IC technology
$$\text{Power} = \text{Capacitive load} \times \text{Voltage}^2 \times \text{Frequency}$$
Annotations on the equation: Power ×30, Voltage 5V → 1V, Frequency ×1000
Reducing Power
Suppose a new CPU has
85% of the capacitive load of the old CPU
15% voltage and 15% frequency reduction
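$$\frac{P_\text{new}}{P_\text{old}} = \frac{C_\text{old} \times 0.85 \times (V_\text{old} \times 0.85)^2 \times F_\text{old} \times 0.85}{C_\text{old} \times V_\text{old}^2 \times F_\text{old}} = 0.85^4 = 0.52$$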
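The power ratio can be reproduced in a few lines. The sketch normalizes the old CPU's capacitive load, voltage, and frequency to 1.0, which is an assumption made for convenience: only the ratio matters, so the absolute values cancel. The function name is illustrative.

```python
def dynamic_power(capacitive_load: float, voltage: float, frequency: float) -> float:
    """Power = Capacitive load x Voltage^2 x Frequency (CMOS dynamic power)."""
    return capacitive_load * voltage ** 2 * frequency


# Old CPU normalized to 1.0 for every quantity (assumption: only the ratio matters).
p_old = dynamic_power(1.0, 1.0, 1.0)

# New CPU: 85% of the capacitive load, 15% lower voltage, 15% lower frequency.
p_new = dynamic_power(0.85, 0.85, 0.85)

print(f"P_new / P_old = {p_new / p_old:.2f}")  # prints "P_new / P_old = 0.52"
```

Voltage enters squared, so the 15% voltage reduction contributes two of the four factors of 0.85.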
Uniprocessor Performance
Multiprocessors
Multicore microprocessors
More than one processor per chip
Hard to do
Programming for performance
Load balancing
Optimizing communication and synchronization