| MAIK.NO    |  |
|------------|--|
| SURNAME    |  |
| FIRST NAME |  |

(2 instructions per cycle) processor using Tomasulo's algorithm to perform the dynamic scheduling of instructions on the pipeline shown in the following figure. This pipeline is executing the following program, which performs a search within a vector (initially, R1=0).

```
etic: LW R2, 0(R1) ; read Xi

MULI R2, R2, 3 ; multiplies Xi by 3

SW R2, 0(R1) ; write Xi

ADDI R1, R1, 4 ; update R1

BNE R2, R0, etic ; continue to loop if false
```



## Working hypothesis:

- the loop executes speculatively in terms of direction (always taken) and regarding the branch condition; high-performance fetch breaks after fetching a branch
- the issue stage (I) calculates the address of the actual read/write and push it into load/store queues; only 1 instruction is issued per cycle
- reads require 2 clock cycles; writes take 1 cycle (this means that stage M is lasting just 1 cycle)
- when accessing memory (M), writes have precedence over reads and must be executed in-order
- there is a single CDB
- dispatch stage (P) and complete stage (W) require 1 clock cycle
- ASSUME that the reservation stations could be freed right before the start of issue phase (therefore extending the duration of P stage)
- only 1 instruction is committed (C stage) per cycle
- there are separated integer units: one for the calculation of the actual address, one for arithmetic and logical operations, one of the integer multiplication and one for the evaluation of the branch condition, as illustrated in this table:

| Type of Functional Unit       | No. of Functional Units | Cycles for stage I+X | No. of reservation stations |
|-------------------------------|-------------------------|----------------------|-----------------------------|
| LS: Integer (effective addr.) | 1                       | 1                    | 2                           |
| A: Integer (op. A-L)          | 1                       | 1                    | 2                           |
| B: Integer (branch calc.)     | 1                       | 1                    | 2                           |
| M: Integer Multiplication     | 1                       | 2                    | 2                           |

- the functional units TAKE advantage of pipelining techniques internally
- the load queue has 3 slots; the store queue has 3 slots (writes wait for the operand in the store queue, i.e., in the execution stage)

Complete the following chart until the end of the FOURTH iteration of the above code fragment in the case of dynamic scheduling with speculation. Also add the instruction that occupies a certain reservation station (one of the 8) as indicated below:

| Instr.<br>No | Instruction name | ALU<br>RS1 | ALU<br>RS2 | LS<br>RS1  | LS<br>RS2 | BU<br>RS1 | BU<br>RS2 | MU MU<br>RS1 RS2 | P: disPatch<br>(clock) | I+X:Issue+Exec<br>(start-stop) | M: MEM.ACCESS<br>(start-stop) | W: CDB-write<br>(clock) | C: Commit<br>(clock) | Comments |
|--------------|------------------|------------|------------|------------|-----------|-----------|-----------|------------------|------------------------|--------------------------------|-------------------------------|-------------------------|----------------------|----------|
| 101          | LW R2,0(R1)      |            |            | I01<br>1-1 |           |           |           |                  | 1                      | 2-2                            | 3-4                           | 5                       | 6                    |          |
|              |                  |            |            |            |           |           |           |                  |                        |                                |                               |                         |                      |          |
|              | •••              |            |            |            |           |           |           |                  |                        |                                |                               |                         |                      |          |

- 2) (POINTS 3/30) On a Linux system, write the SINGLE command line to perform at the BASH shell prompt the following operation (please note that no intermediate files should be used):
  - Print on the standard output the name and size of the files (including "." and "..") of the directory "/home/mario".

## **EXERCIZE 1**

| Instr. Inst<br>No nam | ruction<br>ne | ALU<br>RS1<br>(start- | ALU<br>RS2<br>(start- | LS<br>RS1<br>(start- | LS<br>RS2<br>(start- | BU<br>RS1<br>(start- | BU<br>RS2<br>(start- | MU<br>RS<br>(start- | MU<br>RS2<br>(start- | P: Dispatch<br>(clock) | I+X: Issue<br>(start-stop) | MEM. ACC. (start-stop) | W: CDB-<br>write (clock) |    | it Comments                                                |
|-----------------------|---------------|-----------------------|-----------------------|----------------------|----------------------|----------------------|----------------------|---------------------|----------------------|------------------------|----------------------------|------------------------|--------------------------|----|------------------------------------------------------------|
|                       |               | stop)                 | stop)                 | stop)                | stop)                | stop)                | stop)                | stop)l              | stop)                |                        |                            |                        |                          |    |                                                            |
| 101 LW                | R2,0(R1)      |                       |                       | I01<br>1-1           |                      |                      |                      |                     |                      | 1                      | 2-2                        | 3-4                    | 5_\                      | 6  |                                                            |
| 102 MULI              | R2,R2,3       |                       |                       |                      |                      |                      |                      | I02<br>1-5          |                      | 1                      | 6-7                        | ر                      | 8                        | 9  | I waits R2 from 1/LW                                       |
| 103 SW                | R2,0(R1)      |                       |                       |                      | 103<br>2-2           |                      |                      |                     |                      | 2                      | 3-3                        | 10                     | 🗸                        | 11 | M waits R2 from 1/MULI; M waits mem                        |
| I04 ADDI              | R1,R1,4       | 104<br>2-4            |                       |                      | 1                    |                      |                      |                     |                      | 2                      | 4-4                        | \                      | 6                        | 12 | I waits issue logic; CDB collision                         |
| I05 BNE               | R2,R0,etic    |                       |                       |                      |                      | 105<br>3-8           |                      |                     |                      | 3                      | 9                          | -                      |                          | 13 | I waits R2 from 1/MULI                                     |
| 106 LW                | R2,0(R1)      |                       |                       | 106<br>4-6           |                      | ١                    |                      |                     |                      | 4                      | 7                          | 8,9/                   | 10 ,                     | 14 | I waits R1 from 1/ADDI                                     |
| 107 MULI              | R2,R2,3       | 1                     |                       |                      |                      |                      |                      |                     | 107<br>4-10          | 4                      | 11-12                      | 7- /                   | 13                       | 15 | I waits R2 from 2/LW                                       |
| 108 SW                | R2,0(R1)      |                       |                       |                      | I08<br>5-7           |                      |                      |                     | 1                    | 5                      | 8                          | 15                     |                          | 16 | I waits R1, M waits R2; I waits issue logic; M waits mem   |
| I09 ADDI              | R1,R1,4       | IO9<br>5-9            |                       |                      | ı                    |                      |                      | <b>+</b>            |                      | 5                      | 10                         | - /                    | 11                       | 17 | I waits R1 from 1/ADDI; I waits issue logic; CDB collision |
| I10 BNE               | R2,R0,etic    | 1                     |                       | 1                    |                      |                      | I10<br>6-13          |                     |                      | 6                      | 14                         | -\\/                   |                          | 18 | I waits R2 from 2/MULI;                                    |
| I11 LW                | R2,0(R1)      |                       |                       | I11<br>7-11          |                      |                      | ı                    |                     |                      | 7                      | 12                         | 13-14                  | 15                       | 19 | I waits R1 from 2/ADDI;                                    |
| I12 MULI              | R2,R2,3       |                       |                       | 1                    | 1                    |                      |                      | I12<br>7-15         |                      | 7 /(/                  | 16-17                      | 7                      | 18                       | 20 | I waits R2 from 3/LW                                       |
| 113 SW                | R2,0(R1)      |                       |                       |                      | I13<br>8-12          |                      |                      |                     |                      | 8                      | 13 🖊                       | 20                     |                          | 21 | I waits R; M waits R2; I waits issue logic; M waits mem    |
| I14 ADDI              | R1,R1,4       |                       | I14<br>8-14           |                      |                      | 1                    |                      |                     |                      | 8                      | 15                         | 1/                     | 16                       | 22 | I waits R1 from 2/ADDI; I waits issue logic                |
| I15 BNE               | R2,R0,etic    | +                     | 1                     | 1                    |                      | I15<br>9-17          |                      |                     | 1                    | 9                      | 19                         | _//                    |                          | 23 | I waits R2 from 3/MULI I waits issue logic;                |
| 116 LW                | R2,0(R1)      | •                     |                       | I16<br>12-16         |                      | 1                    |                      |                     |                      | 12 /                   | 17                         | 18-19                  | 20                       | 24 | P waits LS-RS; I waits R1; I waits issue logic;            |
| I17 MULI              | R2,R2,3       |                       |                       |                      | 1                    |                      |                      |                     | I17<br>12-20         | 12 (                   | 21-22/                     | 7                      | 23                       | 25 | I waits R2 from 4/LW                                       |
| 118 SW                | R2,0(R1)      |                       |                       |                      | 118<br>13-17         |                      |                      |                     | 1                    | 13                     | 18 /                       | 24                     |                          | 26 | I waits R1 from 3/ADDI; M waits R2; I waits issue logic;   |
| I19 ADDI              | R1,R1,4       | I19<br>13-19          |                       |                      | 1                    |                      | 1                    |                     |                      | 13                     | 20                         | /_                     | 21                       | 27 | I waits R1 from 3/ADDI; I waits issue logic;               |
| I20 BNE               | R2,R0,etic    | 1                     |                       |                      | 1                    |                      | 120<br>14-23         | T                   |                      | 14                     | 24                         |                        |                          | 28 | I waits R2 from 4/MULI                                     |

## **EXERCIZE 2**

The requested command line is:

ls -al /home/mario | awk '{print \$9 " " \$5} '