All quizzes for Spring 2017

Question 1: Suppose a program has three tasks to perform:

Task A, which takes 10 seconds;
Task B, which takes 5 seconds; and
Task C, which takes 3 seconds

Initially, my program performs each of these tasks one at a time. Which of the following changes will result in the fastest runtime? (Assume doing two tasks in parallel does not make either of the parallel tasks slower.)

perform tasks A, B, and C in parallel;
speed up B and C by a factor of 5;
speed up A and B by a factor of 4;
speedup A by a factor of 2 and run it in parallel with B and C;
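
One way to compare such options is to compute the total time of each schedule. The sketch below is only an illustration under the question's assumption that parallel tasks do not slow each other down; the function names serial_time and parallel_time are made up for this example.

```c
/* Time to run n tasks one after another: the sum of their times. */
double serial_time(const double *times, int n) {
    double total = 0.0;
    for (int i = 0; i < n; i++)
        total += times[i];
    return total;
}

/* Time to run n tasks fully in parallel, assuming no slowdown:
   the time of the longest task. */
double parallel_time(const double *times, int n) {
    double longest = 0.0;
    for (int i = 0; i < n; i++)
        if (times[i] > longest)
            longest = times[i];
    return longest;
}
```

For the original task times {10, 5, 3}, serial_time gives 18 seconds and parallel_time gives 10 seconds; each option can be evaluated by adjusting the array before calling the appropriate function.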

Question 2: If the bytes 0x12, followed by 0x34, followed by 0x56, followed by 0x78 are interpreted as a 4-byte little endian integer, what value will they have?
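
As a rough illustration of little-endian interpretation, the sketch below assembles a 32-bit value from four bytes given in memory order (lowest address first). The helper name le_bytes_to_u32 is hypothetical.

```c
#include <stdint.h>

/* Hypothetical helper: interpret 4 bytes, listed lowest address first,
   as a little-endian 32-bit unsigned integer. */
uint32_t le_bytes_to_u32(const unsigned char bytes[4]) {
    uint32_t value = 0;
    for (int i = 0; i < 4; i++) {
        /* The byte at offset i supplies bits 8*i through 8*i + 7. */
        value |= (uint32_t)bytes[i] << (8 * i);
    }
    return value;
}
```

The first byte in memory ends up as the least significant byte of the result.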

Question 3 (0 points): If a program compiles as C with a C compiler and is strictly standards-conformant C, it will ___ compile as C++ with a standards-conformant C++ compiler.

always
never
sometimes

Skim chapter 1, and review figures 3.2, 3.3, and 3.28 and answer the following questions.

Question 1: When we compile a C program with multiple source files, we can produce object (.o on Linux) files, which we combine into the final program. What is the process of combining these files into an executable called?

For these questions, look at figure 1.13 and consider the following program:

#include <stdio.h>

static int y = 42;
int main(void) {
    int x;
    scanf("%d", &x);
    printf("Input was %d\n", x);
    printf("Input + 42 is  %d\n", x + y);
}

Question 2: (see above) If the variable x is placed in memory, what region of memory would it most likely be in?

Question 3: (see above) If the variable y is placed in memory, what region of memory would it most likely be in?

Question 4: On 64-bit x86, if a function void foo(int w, int x, int y, int z) is called using foo(1, 2, 3, 4), then where is the value 4 placed?

Question 5: On 64-bit x86, if %rax contains 0x1000 and %rdx contains 0x1, then, in AT&T syntax, what does 0x10(%rax,%rdx,4) represent?

the value in memory at address 0x1014
the value in memory at address 0x1011
the value in memory at address 0x4011
the value in memory at address 0x4014
the value 0x1015
the value 0x4011
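
The address arithmetic behind an AT&T operand of the form D(base,index,scale) can be sketched as below. The function name effective_address is made up for illustration; for most instructions the operand then refers to the memory at that address rather than to the address itself.

```c
#include <stdint.h>

/* Effective address of the AT&T operand D(base, index, scale):
   D + base + index * scale. */
uint64_t effective_address(uint64_t disp, uint64_t base,
                           uint64_t index, uint64_t scale) {
    return disp + base + index * scale;
}
```

Note that only the index register is multiplied by the scale factor; the base register and displacement are added unchanged.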

Quiz on second week's material.

Consider the C program main.c

#include <stdio.h>
void sayHello(void) {
    printf("Hello, World!\n");
}
int main(void) {
    sayHello();
}

Suppose it is turned into an executable main.exe using the following steps:

gcc -O -S main.c -o main.s
gcc -c main.s -o main.o
gcc main.o -o main.exe

Question 1: (see above) Which of the following files could contain the memory address of the function sayHello?
Select all that apply

main.c
main.s
main.o
main.exe

Question 2: (see above) Which of the following files contains the string "Hello, World!"?
Select all that apply

main.c
main.s
main.o
main.exe

Question 3: Given the following code, which of the following are valid variable declarations?

typedef struct bar {
    int x;
} foo;

Select all that apply

Question 4: What does the following code output?

int x = 0;
int y = 1;

foo:
while (y < 5) {
    y += 2;
    x += 1;
    if (y == 6)
        goto bar;
}
y -= 5;
goto foo;

bar:
printf("%d\n", x);

Consider the following variable declarations and initializations:

int foo[32];
int* bar = foo;

Question 5: (see above) Which of the following pairs of statements are equivalent?
Select all that apply

Skim chapter 2 for 2150 topics you do not remember. Skim sections 3.6.7, 3.7, and figures 3.1, 3.2, and 3.3.

The textbook presents the following two implementations of factorial fact_do and fact_while in Figures 3.19 and 3.20:

fact_do:                    fact_while:
    movl $1, %eax               movl $1, %eax
L2:                             jmp L5
    imulq %rdi, %rax        L6:
    subq $1, %rdi               imulq %rdi, %rax
    cmpq $1, %rdi               subq $1, %rdi
    jg  L2                  L5:
    rep; ret                    cmpq $1, %rdi
                                jg L6
                                rep; ret

(rep; ret is an alternate form of ret, which is equivalent for the purposes of this course.)

Question 1: (see above) If the function fact_do above is invoked with the argument 4, how many times is the cmpq instruction executed?

Question 2: (see above) If the function fact_while above is invoked with the argument 4, how many times is the cmpq instruction executed?

Question 3: Figure 3.35 in the text presents a recursive implementation of factorial:

rfact:
    pushq %rbx
    movq %rdi, %rbx
    movl $1, %eax
    cmpq $1, %rdi
    jle .L35
    leaq -1(%rdi), %rdi
    call rfact
    imulq %rbx, %rax
.L35:
    popq %rbx
    ret

Calling this function with an argument of 0 requires 16 bytes of stack space, including the space for the return address used by the final ret. How much stack space does calling it with an argument of 3 require?

Question 4: Which bit sequence is the 6-bit 2's complement representation of the decimal number -16?
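
A small sketch of how a fixed-width two's complement pattern can be obtained in C: converting a negative value to an unsigned type is defined to wrap modulo 2^32, so masking off the low bits yields the pattern. The helper name twos_complement_field is hypothetical, and the sketch assumes the width is under 32 and the value fits in that many bits.

```c
#include <stdint.h>

/* Hypothetical helper: the low `width` bits of `value`, which form its
   width-bit two's complement representation when the value is in range.
   Assumes 0 < width < 32. */
uint32_t twos_complement_field(int32_t value, int width) {
    uint32_t mask = (1u << width) - 1u;   /* width low-order ones */
    return (uint32_t)value & mask;        /* unsigned conversion wraps mod 2^32 */
}
```

For example, twos_complement_field(-16, 6) is 0x30, which is 110000 in binary.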

Quiz on week three's material.

Question 1: Consider the following code:

unsigned int v; // input
int f;         // output
f = v && !(v & (v - 1));

What does the output f tell us about the input v?

f is true if v is a negative number
f is true if v is a power of two
f is the sign extended version of v
f is true if any bit in v is set
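
To experiment with the expression, it can be wrapped in a function and tried on a few inputs. This is only a sketch; the name is_power_of_two reflects what the trick is commonly used for. The key step is that v & (v - 1) clears the lowest set bit of v.

```c
/* v & (v - 1) clears the lowest set bit of v, so it is zero exactly
   when v has at most one bit set; the leading `v &&` rules out zero. */
int is_power_of_two(unsigned int v) {
    return v && !(v & (v - 1));
}
```

For instance, 8 (binary 1000) yields 1, while 6 (binary 110) and 0 both yield 0.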

Question 2: Consider the following code:

int x;  // input 1
int y;  // input 2
int r;  // output 
r = y ^ ((x ^ y) & -(x < y));

What does r represent here? Hint: -1 is represented in binary by all 1's, and a < b returns 0 or 1.

minimum of x and y
maximum of x and y
conditionally setting bits in x
summation of x and y
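
The expression can be traced by cases: when x < y the mask -(x < y) is all ones, so r = y ^ (x ^ y) = x; otherwise the mask is zero and r = y. A sketch wrapping it in a function (the name branchless_select is made up here):

```c
/* Returns x when x < y and y otherwise, without a branch.
   -(x < y) is -1 (all one bits) when x < y, and 0 otherwise. */
int branchless_select(int x, int y) {
    return y ^ ((x ^ y) & -(x < y));
}
```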

Question 3: What is the value of the address computed by (%rdx,%rcx,4), if %rdx is 0xf000 and %rcx is 0x0100?

0xf400
0xf100
0xf500
0x4f00

Question 4: Code segment 1:

push   %ebx            //store ebx value to stack
subq   $10, %ebx
pop    %ebx            //restore ebx value from stack

Code segment 2:

push   %ebx            //store ebx value to stack
cmpq   $10, %ebx
pop    %ebx            //restore ebx value from stack

What is the difference between code segment 1 and 2?

1 may change ZF, but 2 will not
1 may change SF, but 2 will not
2 may change ZF, but 1 will not
2 may change SF, but 1 will not
none of the above

Read sections 4.1 through 4.3

Consider the instruction pushq %rax on Y86-64.

Question 1: (see above) How many program registers (e.g. %rax, %rbx, etc.) are written by this instruction?

Question 2: (see above) How many program registers (e.g. %rax, %rbx, etc.) are read by this instruction?

Question 3: In order for a Y86-64 processor to determine the length of the currently executing instruction in memory, what information might it need?
Select all that apply

the first byte of the instruction
the immediate or displacement value in that instruction
the register numbers rA, rB in that instruction
the current value of the condition codes
the current value of the status code Stat

Question 4: Which of the following operations that can be done in one instruction on X86-64 can NOT be done in one instruction on Y86-64?
Select all that apply

accessing a memory location computed by adding a register value and a constant offset
accessing a memory location computed by adding a register value and another register value
adding a constant value to a register
multiplying a register value by another register value
storing an 8-byte value in memory

Question 5: Consider a register (of the kind our textbook describes) which takes as input value I and a clock signal, and which outputs a value O. Which of the following statements is true?

Each time the input I changes, the output O changes.
As long as the clock signal is high, each time the input I changes, the output O will change.
As long as the clock signal is low, each time the input I changes, the output O will change.
If I changes and, later, the clock signal changes from low to high, then the output O will change.
If the clock signal changes from low to high and, later, I changes, then the output O will change.

Quiz on week four's material.

Question 1: Each of the following statements about Y86-64 is true. Which of them is an attribute that makes Y86-64 more RISC-like?
Select all that apply

the OPq instructions operate only on registers
there are few ways to specify instruction operands in Y86-64
simpler instructions can take up less space than more complicated ones
Y86-64 has only 18 instructions (counting conditional moves and conditional jumps each as one instruction)
Y86-64 has instructions like pop which modify two registers, allowing for shorter programs

Question 2: What Y86-64 assembly is equivalent to cmovne %rax, %rbx?
Select all that apply

rrmovq %rax, %rbx

    je after
    rrmovq %rax, %rbx
after:

    jne after
    rrmovq %rax, %rbx
after:

cmove %rbx, %rax

    cmovl %rax, %rbx
    cmovg %rax, %rbx

Question 3: What determines the length of one cycle in the single cycle microarchitecture?
Select all that apply

the slowest stage (e.g., fetch, decode)
the slowest instruction (e.g., multiplication)
the number of registers
the number of available instructions in the ISA

Question 4: After the decode stage what do we not know about the instruction?
Select all that apply

Read sections 4.3.2 and 4.3.4. Review sections 4.2.2-4 and read the HCL2D document, sections 2 and 3.

Question 1: Consider the instruction subq rA, rB. This instruction performs R[rB] ← R[rB] sub R[rA]. The instruction passes through the five stages (Fetch, Decode, Execute, Memory, Writeback). Which (if any) stages are not involved in handling this instruction?

Question 2: Consider the instruction rmmovq rA, D(rB). This instruction performs EA ← D + R[rB]; MEM[EA] ← R[rA]. The instruction passes through the five stages (Fetch, Decode, Execute, Memory, Writeback). In which stage does the hardware calculate the effective address EA (ValE in the book)?

Question 3: Consider the instruction mrmovq D(rB), rA. This instruction performs EA ← D + R[rB]; R[rA] ← MEM[EA]. The instruction passes through the five stages (Fetch, Decode, Execute, Memory, Writeback). In which stage does the hardware retrieve the value stored in register rB?

Question 4 (0 points): What is the result of the following HCL (or HCL2D) expression if x is 10 and y is 5?

[
    y in {4,6} : x + y,
    y <= 5 : x - y,
    1 : 0,
]
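
An HCL case expression selects the value of the first case whose condition is true, much like a chain of if statements. A rough C rendering of the expression above (the function name hcl_case is hypothetical):

```c
/* C sketch of the HCL case expression: cases are tested in order and
   the first one whose condition holds supplies the result. */
long hcl_case(long x, long y) {
    if (y == 4 || y == 6)   /* y in {4,6} */
        return x + y;
    if (y <= 5)
        return x - y;
    return 0;               /* the default case "1 : 0" */
}
```

Because the cases are ordered, a value of y equal to 4 matches the first case even though it also satisfies y <= 5.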

Quiz on week five's material.

Question 1: In the single-cycle processor design we discussed in class, one option was for the memory address input to the data memory to come directly from the single ALU. For which of the following memory-accessing instructions was this not done because the ALU was used for something else?
Select all that apply

mrmovq
ret
pushq
rmmovq

Question 2: In the single-cycle processor design we discussed in class, some instructions do not use the memory stage. What is true about these instructions?
Select all that apply

these instructions change the PC more quickly than other instructions;
when these instructions execute, the address input (mem_addr) to the data memory is 0;
when these instructions execute, in an HCL2D implementation, the mem_readbit control signal will be 0;
these instructions all write a value to the register file;

Question 3: In the single-cycle processor design we discussed in class, from which sources could the input of the program counter register come?
Select all that apply

a register file output
the data memory output
the output of an adder which takes the current PC as input
the instruction memory output
the output of the ALU that is also used for data memory address computations

Question 4: Which of the following are illegal in HCL2D?
Select all that apply

No quizzes for week 6; good luck on the exam!

Read sections 4.4 through 4.5.4

Question 1: Adding pipelining to a previously unpipelined processor ______ the latency of instructions.

increases
decreases
can increase or decrease
does not change

Question 2: Adding pipelining to a previously unpipelined processor ______ the throughput of instructions

increases
decreases
can increase or decrease
does not change

Question 3: In the pipelined version of the Y86-64 processor, which values are passed in pipeline registers between the fetch and decode stage?
Select all that apply

the rA and rB fields from the instruction (if the instruction has them)
the immediate or displacement value from the instruction (if the instruction has one)
the result of the instruction's ALU operation (if the instruction has one)

Quiz on week seven's material.

Question 1: Consider the pipelined Y86-64 implementation we discussed in class. Suppose the register delay is 10 ps and the length of the critical path through memories and combinatorial logic in each stage is:

100 ps for fetch;
75 ps for decode;
80 ps for execute
100 ps for memory
60 ps for writeback

What is the minimum clock cycle time this processor could have and still operate correctly?
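
Under the textbook's timing model, every stage must finish within one cycle, so the cycle time has to cover the slowest stage plus the pipeline register delay. A minimal sketch of that calculation, with a made-up function name:

```c
/* Minimum cycle time, assuming each cycle must cover the slowest
   stage's logic delay plus one pipeline register delay. */
int min_cycle_time_ps(const int *stage_delay_ps, int n_stages,
                      int register_delay_ps) {
    int slowest = 0;
    for (int i = 0; i < n_stages; i++)
        if (stage_delay_ps[i] > slowest)
            slowest = stage_delay_ps[i];
    return slowest + register_delay_ps;
}
```

With the stage delays listed above and a 10 ps register delay, this gives 110 ps.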

Question 2: Consider the following Y86-64 assembly snippet:

mrmovq 4(%r11), %r10
addq %rax, %rbx
subq %rcx, %rdx
andq %rsi, %rdi 
xorq %rsp, %rbp
irmovq $10, %r9

In our pipelined implementation, when the subq instruction is in its decode stage, what stage is the mrmovq instruction in?

Question 3: For the pipelined Y86-64 design we discussed in class, for which of the following instruction sequences is there a data hazard?
Select all that apply

addq %rax, %rbx; addq %rbx, %rcx
addq %rax, %rbx; addq %rax, %rcx
addq %rax, %rbx; addq %rax, %rbx

Question 4: For the pipelined Y86-64 design we discussed in class, for which of the following instruction sequences is there a control hazard?
Select all that apply

irmovq $0, %rax; mrmovq 0(%r10), %r11; ret
irmovq $0, %rax; addq %rbx, %rbx; jle foo
irmovq $0, %rax; addq %rbx, %rbx; call foo

Read sections 4.5.5, 4.5.8, 4.5.10, 5.7-5.7.2

Question 1: Using our textbook's notion of "stall" and "bubble" signals, if a pipeline register's input is receiving a stall signal of 1 and a bubble signal of 0, then its output after the next rising edge of the clock will be:

the same as the current input of the register
the same as the current output of the register
the nop value

Question 2: Our textbook talks about both data dependencies and data hazards. Which are differences between these?
Select all that apply

what is a data hazard depends on the hardware, but what is a data dependency depends only on the ISA
two instructions can have a data dependency without creating a data hazard

Question 3: We can perform forwarding of %rax to implement

irmovq $15, %rax
addq %rbx, %rax

without any stalling. Suppose the addq instruction receives the forwarded value for %rax from irmovq in its decode stage. Where can it retrieve (i.e. forward) this value from?

the pipeline registers between fetch and decode
the pipeline registers between decode and execute
the pipeline registers between execute and memory
the pipeline registers between memory and writeback

Question 4: Figure 5.12 indicates that the integer multiplication functional units on the Intel "Haswell" microarchitecture have an issue time of 1 cycle and a latency of 3 cycles. How long does it take one of these functional units to perform two integer multiplications from when the first multiplication is started until the last finishes?
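
For a pipelined functional unit, a new operation can start every issue-time cycles, and each one completes latency cycles after it starts. The sketch below computes the total time for a run of operations, assuming they are independent of one another; the function name is made up.

```c
/* Cycles from the start of the first operation to the end of the last,
   assuming `count` independent operations on one pipelined unit.
   The last operation starts (count - 1) * issue_time cycles in. */
int pipelined_total_cycles(int latency, int issue_time, int count) {
    return (count - 1) * issue_time + latency;
}
```

If the second multiplication instead depended on the result of the first, it could not start until the first finished, and the total would be count * latency.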

For each of the following sequences of instructions, indicate how many cycles of pipeline stalls occur in the pipelined Y86-64 implementation we discussed in class.

Question 1: (see above) :

popq %rax
addq %rax, %rbx
popq %rax
rmmovq %rax, 8(%rbx)

Question 2: (see above) : (accepted both 0 and 3 --- depending on whether you counted stalls "during" ret)

     xorq %rax, %rax
     je foo // always taken
foo: subq %rbx, %rax
     ret

Question 3: Consider a pipelined Y86-64 processor with six stages, resulting from splitting the memory stage into two parts. That is, the processor has the following stages:

Fetch
Decode
Execute
Memory Part 1
Memory Part 2
Writeback

Suppose the result of any data memory load or store is not available until near the end of the second memory stage. On this processor, even after implementing all possible forwarding, which of the following instruction sequences would require a pipeline stall to resolve hazards:
Select all that apply

addq %rcx, %rax; rmmovq %rbx, 8(%rax); addq %rcx, %rax
mrmovq 8(%rax), %rbx; subq %rcx, %rax; addq %rbx, %rax
pushq %rax; addq %rsp, %rbx; popq %rbx

Question 4: The IBM 360/91 had out-of-order instruction execution and completion but no support for precise exceptions. Suppose you were writing a floating point program on this machine. Which statement is true?
Select all that apply

Each instruction in your program executed in one cycle
Debugging was hard
Instructions got executed in-order

Skim section 6.1.1 and read 6.2-6.3

Question 1: Which component in the memory hierarchy is the fastest?

Register
Cache
Memory
Storage

Question 2: Which component in the memory hierarchy is non-volatile (can retain data without power)?

SRAM
DRAM
Disk
None of the above

Question 3: Suppose the memory access pattern of your program looks like this: a[1], a[2], a[3], a[7], a[8], a[9], a[13], a[14], a[15]. What kind of locality correctly describes this access pattern?

Spatial locality
Temporal locality
No locality
None of the above

Question 4: Suppose the memory access pattern of your program looks like this: a[1], b[1], c[1], a[1], b[1], c[1]. What kind of locality correctly describes this access pattern?

Spatial locality
Temporal locality
No locality
None of the above

Question 1: Which of the following are true about programs in the data-flow model?
Select all that apply

Operations in the program appear to execute in the order in which they were written.
Independent operations can execute as soon as their inputs are available.
Multiple operations with the same input can execute in parallel.

Question 2: On an out-of-order processor, which of the following can happen while the processor is completing a very long load (mrmovq) instruction?
Select all that apply

Independent instructions from later in the program can be fetched.
Dependent instructions from later in the program can be fetched.
Some instructions from later in the program can be executed.
Dependent instructions from later in the program can be executed.
Some instructions from later in the program can be committed.

Question 3: Suppose a level-1 cache has an access time of 2 nanoseconds and a 95% hit rate, and the corresponding level-2 cache has a perceived access time of 10 nanoseconds. What is the perceived access time of the level 1 cache?
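
A common way to model this is the average memory access time formula: every access pays the cache's own access time, and misses additionally pay the perceived time of the next level. The sketch below assumes that model; the function name is hypothetical.

```c
/* Average (perceived) access time of a cache level, assuming every access
   pays hit_time_ns and each miss also pays next_level_time_ns. */
double perceived_access_time_ns(double hit_time_ns, double hit_rate,
                                double next_level_time_ns) {
    double miss_rate = 1.0 - hit_rate;
    return hit_time_ns + miss_rate * next_level_time_ns;
}
```

With a 2 ns access time, a 95% hit rate, and a 10 ns next level, this model gives 2 + 0.05 * 10 = 2.5 ns.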

Question 4: Assume that you are designing a cache for low-power sensor nodes. It measures the environment temperature every minute and sends the average temperature from the last 24 hours to the base station. This sensor requires the caches to be extremely power-efficient. Which design decision would you make?

Serial look up for tag store and data store in the cache
A huge L1 cache
No cache, directly access memory

Section 6.5

Suppose your program sums up all elements of an array. Each element of the array takes 4 bytes of space. Your cache block size is 64B. Initially your cache is empty and your first access v[0] misses the cache.

int sum = 0;
for(int i = 0; i < N; i += 1)
    sum += v[i];

Question 1: (see above) Which of the following statements are true?
Select all that apply

v[1] will hit the cache
v[1] to v[15] will hit the cache
v[16] will miss the cache
v[0], v[16], v[32] will miss the cache

Question 2: (see above) What kind of locality is this access pattern exploiting in the cache?

Spatial locality
Temporal locality
No locality
None of the above

Question 3: (see above) If N is set to be 100, how many misses would this code see while accessing the array v?

0
6
7
10
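
Miss counts for a scan like this can be estimated by counting how many distinct cache blocks the loop touches. The sketch below is a simplified model: it assumes v[0] sits at the start of a cache block, the cache starts empty, and nothing is evicted during the scan. The function name and the stride parameter are made up for illustration (a stride of 1 matches the loop above).

```c
#include <stddef.h>

/* Count cold misses for reading elements 0, stride, 2*stride, ... below n,
   assuming element 0 is block-aligned and no block is evicted. */
size_t count_block_misses(size_t n, size_t stride,
                          size_t elem_size, size_t block_size) {
    size_t misses = 0;
    size_t last_block = (size_t)-1;   /* sentinel: no block loaded yet */
    for (size_t i = 0; i < n; i += stride) {
        size_t block = (i * elem_size) / block_size;
        if (block != last_block) {    /* first touch of a new block */
            misses++;
            last_block = block;
        }
    }
    return misses;
}
```

With 4-byte elements and 64-byte blocks, each block holds 16 elements, so 100 elements span 7 blocks under these assumptions.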

Question 4: (see above) If we operate on every even element in the array, the code becomes:

int sum = 0;
for(int i = 0; i < N; i += 2)
    sum += v[i];

The cache parameters remain the same. Which of the following statements are true?
Select all that apply

v[0] will hit the cache
v[2] will hit the cache
v[16] will miss the cache
v[0], v[16], v[32] will miss the cache

Consider a 1.5MB 3-way set-associative cache with 64 byte cache blocks which uses a true LRU (least recently used) replacement policy.

Question 1: (see above) How many sets does this cache have?
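
The number of sets follows from dividing the cache capacity by the bytes held in one set (ways times block size). A minimal sketch, assuming 1 MB means 2^20 bytes and using a made-up function name:

```c
#include <stddef.h>

/* Number of sets = total capacity / (associativity * block size). */
size_t cache_sets(size_t cache_bytes, size_t ways, size_t block_bytes) {
    return cache_bytes / (ways * block_bytes);
}
```

For a 1.5 MB (1,572,864 byte) 3-way cache with 64-byte blocks, this gives 8192 sets.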

Question 2: (see above) The byte at address 0x654321 will be stored in the same cache block as the byte at
Select all that apply

0x654300
0x654350
0x600021
0x054320

Question 3: (see above) The byte at address 0x654321 will be stored in the same cache set as the byte at
Select all that apply

0x654300
0x654350
0x600021
0x054320
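
Two addresses share a cache block when they agree on everything above the block offset bits, and share a set when their block numbers agree modulo the number of sets. The sketch below assumes 64-byte blocks and 8192 sets (1.5 MB / (3 * 64 B)); the function names are hypothetical.

```c
#include <stdint.h>

/* Block number: the address with the block offset bits dropped. */
uint64_t block_number(uint64_t addr, uint64_t block_bytes) {
    return addr / block_bytes;
}

/* Set index: the low bits of the block number that select a set. */
uint64_t set_index(uint64_t addr, uint64_t block_bytes, uint64_t n_sets) {
    return block_number(addr, block_bytes) % n_sets;
}
```

For example, 0x654321 has block number 0x1950C and set index 0x150C; comparing these values for each candidate address shows which ones share its block or its set.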

Question 4: Which of the following techniques are likely to reduce the number of conflict cache misses programs experience, assuming the cache size remains fixed?
Select all that apply

increased cache associativity
better choices of cache replacement policy
increased cache block size

Question 5: Switching a cache from a write-through to write-back policy is likely to have which of the following effects on the cache?
Select all that apply

the cache will take better advantage of locality in writes
the cache will write to memory more often
cache replacement will be simpler

Section 5.1-2, 5.4-6, 5.8-11, skim 5.14

Question 1: Consider the following two functions:

void sumArray1(int *pSum, int *array, int n) {
    for (int i = 0; i < n; ++i)
        *pSum += array[i];
}

void sumArray2(int *pSum, int *array, int n) {
    int temp = 0;
    for (int i = 0; i < n; ++i)
        temp += array[i];
    *pSum += temp;
}

Suppose an optimizing compiler generates much slower code for sumArray1 than sumArray2. What is a likely cause of this?
Select all that apply

pSum might alias array
pSum might alias n
sumArray1 is too large to be inlined
sumArray1 has side-effects
the loop in sumArray1 can't be unrolled because of *pSum
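
To see why a compiler cannot simply treat the two functions as interchangeable, consider what happens when pSum points into the array itself. The sketch below repeats the two functions with comments; the aliased call described afterwards is an illustrative scenario, not something taken from the quiz.

```c
/* Updates *pSum on every iteration, so a store through pSum may change
   an element of array that a later iteration reads. */
void sumArray1(int *pSum, int *array, int n) {
    for (int i = 0; i < n; ++i)
        *pSum += array[i];
}

/* Accumulates in a local variable, which cannot alias array,
   and stores to *pSum only once at the end. */
void sumArray2(int *pSum, int *array, int n) {
    int temp = 0;
    for (int i = 0; i < n; ++i)
        temp += array[i];
    *pSum += temp;
}
```

Starting from an array {1, 2, 3} and passing a pointer to its middle element as pSum, sumArray1 leaves that element as 9 while sumArray2 leaves it as 8. Since the results differ, the compiler must keep the memory accesses in sumArray1 unless it can prove the pointers do not overlap.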

Question 2: Which of the following optimizations the textbook discusses are likely to substantially increase machine code size?
Select all that apply

loop unrolling
eliminating loop inefficiencies by moving computations outside of loops
function inlining
using multiple accumulators
eliminating unneeded memory references

Question 3 (0 points): (Question dropped.) The textbook discusses how a loop like:

for (int i = 0; i < N; i += 2) {
    acc = acc * (a[i] * b[i]);
    acc = acc * (a[i+1] * b[i+1]);
}

is usually faster than one like:

for (int i = 0; i < N; i += 2) {
    acc = (acc * a[i]) * b[i];
    acc = (acc * a[i+1]) * b[i+1];
}

(Assume the compiler obeys the explicitly specified order of operations.) Which of the following are different about how a modern out-of-order processor will execute the two loops?

The first answer was meant to say acc * (a[i] * b[i]) and acc * (a[i+1] + b[i+1]) to make the question unambiguous. This is why the question was dropped. However, it's still the case the processor can execute a[i+1] * b[i+1] from the next iteration of the loop in parallel with acc * (a[i+1] * b[i+1]) from the previous.
Select all that apply

the processor can execute a[i+1] * b[i+1] and acc * (a[i+1] * b[i+1]) in parallel in the first loop, but not the second
the processor can load a[i] earlier in the first loop than the second loop
the processor can more easily predict branches in the second loop
the processor can load a[i] and b[i] in parallel with the first loop, but not the second

Question 4: Which of the following transformations our textbook discusses is likely to decrease the number of instructions a program executes? (This should have been a select-all, but since it wasn't, either of the correct answers gets full credit.)

cache blocking
loop unrolling
function inlining
using multiple accumulators

Question 1: Compilers do not aggressively apply some of the optimizations we talked about, because they can make code much slower when done excessively. Which are examples of these?
Select all that apply

loop unrolling
removing redundant operations from loops
function inlining
replacing multiplication in a loop with addition

Question 2: Which of these is a way to eliminate aliasing so a compiler can perform more optimizations?

place values accessed through pointers temporarily in a local variable
iterate through arrays with pointer arithmetic instead of using array subscripts
place array indices in a local variable rather than computing it each time
transform your loops to use cache blocking

Question 3: Using multiple accumulators improves performance by

avoiding redundant computations or memory accesses
performing more operations in parallel
eliminating loop overheads
simplifying the leftover work that must be done after an unrolled loop
none of the above

Question 4: In the example in lecture, trying to use too many accumulators resulted in lower performance than using fewer (with the same amount of loop performance). This was most likely because of

register spilling
more instruction cache misses
poorer spatial locality
all of the above

Section 8.1-8.3

Question 1: Suppose that your program reads a file from the disk using the fread function. The library you are using implements this fread function using a system call referred to as read. How would you characterize the exception caused by read?

Interrupt
Trap
Fault
Abort

Question 2: Which of the following events will initiate an interrupt?
Select all that apply

A mouse click
Typing on the keyboard
A divide by zero operation
A completed data transfer from the disk to memory

Question 3 (0 points): Which of the following events will initiate an abort?
Select all that apply

A mouse click
Data transfer complete from the disk to memory
A divide by zero operation
A completed data transfer from the disk to memory

Question 4: Suppose that your program writes to a read-only page. Which of the following exceptions will occur in this case?

Interrupt
Trap
Fault
Abort

Question 1: Each of the following is either synchronous or asynchronous. Which are synchronous?
Select all that apply

traps
faults
interrupts
signals

Question 2: We have seen this code in our lecture.

void fork5()
{
    printf("L0\t");
    if (fork() == 0) {
        printf("L1\t");
        if (fork() == 0) {
            printf("L2\t");
        }
    }
    printf("Bye\t");
}

Which of the following are infeasible?
Select all that apply

L0 Bye L1 L2 Bye Bye
L0 Bye L1 Bye Bye L2
L0 L1 Bye Bye L2 Bye
L0 Bye Bye L1 L2 Bye

Question 3: Which of the following are true about signal handlers?
Select all that apply

they run in user mode
can access the global variables
they can be executed in response to an external event
they can be interrupted by other handlers

Question 4: Which of the following are true about signals?
Select all that apply

sent from kernel to a user process
it is possible to queue multiple pending signals of the same type
can be used to communicate between two user processes via kernel
received by the user process as soon as they are sent

Section 9.3-9.6, skim 9.7

Question 1: Which of the following statements are false?
Select all that apply

a process can always access data from the address space of some other process
a process can never access data from its own address space
all processes share the same address space
all processes can access kernel address space

Question 2: Virtual memory can be thought of as a mechanism that caches virtual pages in the physical memory. A virtual page can be stored at any physical page frame. Which of the following statements is true?

virtual memory is direct mapped
virtual memory is set associative
virtual memory is fully associative

Question 3: If we need 32 bits to represent a virtual address and our page size is 4KB, how many page table entries (PTE) do we need?

2^10
2^12
2^20
2^22
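
For a single-level page table that covers the whole virtual address space, the number of entries equals the number of virtual pages, which is the address space size divided by the page size. A minimal sketch under that assumption, with a made-up function name:

```c
#include <stdint.h>

/* Number of virtual pages (and so single-level page table entries)
   for a given address width and page size. Assumes va_bits < 64. */
uint64_t num_ptes(unsigned va_bits, uint64_t page_bytes) {
    uint64_t address_space_bytes = (uint64_t)1 << va_bits;
    return address_space_bytes / page_bytes;
}
```

Equivalently, a 4 KB page uses 12 bits of page offset, leaving 32 - 12 = 20 bits for the virtual page number.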

Question 4: A page table is a map of <vpn, ppn>, where vpn is a virtual page number and ppn is a physical page number. The process of generating the appropriate ppn for a vpn is referred to as:

address protection
address translation
address swapping
address counting

Question 1: The PTBR contains the base address of the page table. Which of the following statements are false?
Select all that apply

It contains a virtual address
It contains the same address for each process
It is updated on a context switch
It is the CR3 register for x86 machines

Question 2: A single-level page table has only one table, where a multi-level page table has multiple tables organized in a hierarchical manner. Which of the following statements are true?
Select all that apply

For a single-level page table, the amount of space it requires varies based on how many pages are in use
For a multi-level page table, the amount of space it requires varies based on how many pages are in use
If your program uses 100% of the virtual address space (even the parts normally not used), having a multi-level page table requires less space than a single-level page table
All of the above

Question 3: Which of the following can have a virtually-indexed-physically-tagged L1 cache?
Select all that apply

L1 32 KB, 8-way set associative, page size 4K
L1 64 KB, 8-way set associative, page size 4K
L1 64 KB, 2-way set associative, page size 4K
L1 64 KB, 2-way set associative, page size 8K
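
A commonly used condition for a virtually-indexed, physically-tagged cache is that the set index and block offset bits fit entirely within the page offset, which holds when the bytes per way (cache size divided by associativity) do not exceed the page size. The sketch below checks only that condition; real designs may use other techniques, and the function name is hypothetical.

```c
#include <stddef.h>

/* Returns 1 when index + offset bits fit inside the page offset,
   i.e. when one way of the cache is no larger than a page. */
int vipt_index_fits_in_page(size_t cache_bytes, size_t ways,
                            size_t page_bytes) {
    return cache_bytes / ways <= page_bytes;
}
```

For example, a 32 KB 8-way cache has 4 KB per way, which fits within a 4 KB page, while a 64 KB 2-way cache has 32 KB per way.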

Question 4: Which of the following statements are true about the translation lookaside buffer (TLB)?
Select all that apply

Small set-associative hardware cache in MMU
Maps virtual page numbers to physical page numbers
Contains complete page table entries for some pages
None of the above
