Changelogcr4bd/3330/F2017/notes/...2017/09/12 · constructinginstructions typedef unsigned char...

Changelog

Changes made in this version not seen in first lecture:12 September 2017: slide 28, 33: quote solution that uses z correctly

last time

Y86 — choices, encoding and decoding

shift operatorsshr assembly, >> in Cright shift = towards least significant bitright shift = dividing by power of two

on the quizzes in general

yes, I know quizzes are hard

intention: quiz questions from slides/etc. + some serious thought

(and sometimes I miss the mark)

main purpose: review material other than before exams, warningsign for me

why graded? because otherwise…

on the quiz (1)

RISC versus CISC: about simplifying hardware

variable-length encoding is less simple for HW

instructions chosen more based on what’s simple for HW(e.g. push/pop not simple for HW)

more registers — simpler than adding more instructionscompensates for seperate memory instructions

on the quiz (2)

instruction set — what the instructions do

size of memory address — operands to rmmovq, etc. mean what?

floating point support — do such instructions exist?

constructing instructions

typedef unsigned char byte;byte make_simple_opcode(byte icode) {

// function code is fixed as 0 for nowreturn opcode * 16;// 16 = 1 0000 in binary

constructing instructions in hardware

0 0 0 0

opcode

reversed shift right?

((((((((((((hhhhhhhhhhhhshr $-4, %reg

(((((((((((((hhhhhhhhhhhhhopcode >> (-4)

shift left

x86 instruction: shl — shift leftC: value << amount

shl $amount, %reg (or variable: shr %cl, %reg)%reg (initial value)

%reg (final value)

1 0 1 1 0

shift left

x86 instruction: shl — shift leftC: value << amount

shl $amount, %reg (or variable: shr %cl, %reg)%reg (initial value)

%reg (final value)

1 0 1 1 0

left shift in math1 << 0 == 1 0000 00011 << 1 == 2 0000 00101 << 2 == 4 0000 0100

10 << 0 == 10 0000 101010 << 1 == 20 0001 010010 << 2 == 40 0010 1000

x << y = x × 2y

left shift in math1 << 0 == 1 0000 00011 << 1 == 2 0000 00101 << 2 == 4 0000 0100

10 << 0 == 10 0000 101010 << 1 == 20 0001 010010 << 2 == 40 0010 1000

x << y = x × 2y

extracting icode from more

11111000 0 0 1 0 0 0 0 0icode ifunrBrA

// % -- remainderunsigned extract_opcode1(unsigned value) {

return (value / 16) % 16;}

unsigned extract_opcode2(unsigned value) {return (value % 256) / 16;

extracting icode from more

11111000 0 0 1 0 0 0 0 0icode ifunrBrA

// % -- remainderunsigned extract_opcode1(unsigned value) {

return (value / 16) % 16;}

unsigned extract_opcode2(unsigned value) {return (value % 256) / 16;

manipulating bits?

easy to manipulate individual bits in HW

how do we expose that to software?

interlude: a truth table

AND 0 10 0 01 0 1

AND with 1: keep a bit the same

AND with 0: clear a bit

method: construct “mask” of what to keep/remove

AND 0 10 0 01 0 1

bitwise AND — &

Treat value as array of bits

1 & 1 == 1

1 & 0 == 0

0 & 0 == 0

2 & 4 == 0

10 & 7 == 2

… 0 0 1 0& … 0 1 0 0

… 0 0 0 0

… 1 0 1 0& … 0 1 1 1

… 0 0 1 0

bitwise AND — &

1 & 1 == 1

1 & 0 == 0

0 & 0 == 0

2 & 4 == 0

10 & 7 == 2

… 0 0 1 0& … 0 1 0 0

… 0 0 0 0

… 1 0 1 0& … 0 1 1 1

… 0 0 1 0

bitwise AND — &

1 & 1 == 1

1 & 0 == 0

0 & 0 == 0

2 & 4 == 0

10 & 7 == 2

… 0 0 1 0& … 0 1 0 0

… 0 0 0 0

… 1 0 1 0& … 0 1 1 1

… 0 0 1 0

bitwise AND — C/assembly

x86: and %reg, %reg

C: foo & bar

bitwise hardware (10 & 7 == 2)

0101 1110

extract opcode from larger

unsigned extract_opcode1_bitwise(unsigned value) {return (value >> 4) & 0xF; // 0xF: 00001111// like (value / 16) % 16

unsigned extract_opcode2_bitwise(unsigned value) {return (value & 0xF0) >> 4; // 0xF0: 11110000// like (value % 256) / 16;

extract opcode from larger

extract_opcode1_bitwise:movl %edi, %eaxshrl $4, %eaxandl $0xF, %eaxret

extract_opcode2_bitwise:movl %edi, %eaxandl $0xF0, %eaxshrl $4, %eaxret

more truth tables

AND 0 10 0 01 0 1

OR 0 10 0 11 1 1

XOR 0 10 0 11 1 0

conditionally clear bitconditionally keep bit

conditionally set bit

conditionally flip bit

bitwise OR — |

1 | 1 == 1

1 | 0 == 1

0 | 0 == 0

2 | 4 == 6

10 | 7 == 15

… 1 0 1 0| … 0 1 1 1

… 1 1 1 1

bitwise xor — ̂

1 ^ 1 == 0

1 ^ 0 == 1

0 ^ 0 == 0

2 ^ 4 == 6

10 ^ 7 == 13

… 1 0 1 0^ … 0 1 1 1

… 1 1 0 1

negation / not — ~

~ (‘complement’) is bitwise version of !:

!0 == 1

!notZero == 0

~0 == (int) 0xFFFFFFFF (aka −1)

~2 == (int) 0xFFFFFFFD (aka −3)

~((unsigned) 2) == 0xFFFFFFFD

~ 0 0 … 0 0 0 01 1 … 1 1 1 1

32 bits

!0 == 1

!notZero == 0

~ 0 0 … 0 0 0 01 1 … 1 1 1 1

32 bits

!0 == 1

!notZero == 0

~ 0 0 … 0 0 0 01 1 … 1 1 1 1

32 bits

note: ternary operator

w = (x ? y : z)

if (x) { w = y; } else { w = z; }

one-bit ternary

(x ? y : z)

constraint: everything is 0 or 1

now: reimplement in C without if/else/||/etc.(assembly: no jumps probably)

divide-and-conquer:(x ? y : 0)(x ? 0 : z)

one-bit ternary

(x ? y : z)

now: reimplement in C without if/else/||/etc.(assembly: no jumps probably)

divide-and-conquer:(x ? y : 0)(x ? 0 : z)

one-bit ternary parts (1)

(x ? y : 0)

y=0 y=1x=0 0 0x=1 0 1

→ (x & y)

(x ? y : 0)

y=0 y=1x=0 0 0x=1 0 1

→ (x & y)

(x ? y : 0) = (x & y)

(x ? 0 : z)

opposite x: ~x

((~x) & z)

(x ? y : 0) = (x & y)

(x ? 0 : z)

opposite x: ~x

((~x) & z)

one-bit ternary

constraint: everything is 0 or 1 — but y, z is any integer

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

(x & y) | ((~x) & z)

multibit ternary

constraint: x is 0 or 1

old solution ((x & y) | (~x) & 1) only gets least sig. bit

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

((−x) & y) | ((−(x ^ 1)) & z)

multibit ternary

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

((−x) & y) | ((−(x ^ 1)) & z)

constructing masks

(x ? y : 0)

if x = 1: want 1111111111…1 (keep y)

if x = 0: want 0000000000…0 (want 0)

one idea: x | (x << 1) | (x << 2) | ...

a trick: −x

((-x) & y)

constructing masks

(x ? y : 0)

if x = 1: want 1111111111…1 (keep y)

if x = 0: want 0000000000…0 (want 0)

one idea: x | (x << 1) | (x << 2) | ...

a trick: −x

((-x) & y)

two’s complement refresher

1−231

… 1+22

−1 =

231 − 1−231

−231 + 10111 1111… 1111

1000 0000… 0000

1111 1111… 1111

1−231

… 1+22

−1 =

231 − 1−231

−231 + 1

0111 1111… 1111

1000 0000… 0000

1111 1111… 1111

1−231

… 1+22

−1 =

231 − 1−231

−231 + 10111 1111… 1111

1000 0000… 0000

1111 1111… 1111

constructing masks

(x ? y : 0)

if x = 1: want 1111111111…1 (keep y)

if x = 0: want 0000000000…0 (want 0)

one idea: x | (x << 1) | (x << 2) | ...

a trick: −x

((-x) & y)31

constructing other masks

(x ? 0 : z)

if x = ��SS1 0: want 1111111111…1

if x = ��AA0 1: want 0000000000…0

mask: ��HH-x

−(x^1)

constructing other masks

(x ? 0 : z)

if x = ��SS1 0: want 1111111111…1

if x = ��AA0 1: want 0000000000…0

mask: ��HH-x −(x^1)

multibit ternary

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

((−x) & y) | ((−(x ^ 1)) & z)

fully multibit

((((((((((((((hhhhhhhhhhhhhhconstraint: x is 0 or 1

(x ? y : z)

easy C way: !x = 0 or 1, !!x = 0 or 1x86 assembly: testq %rax, %rax then sete/setne(copy from ZF)

(x ? y : 0) | (x ? 0 : z)

((−!!x) & y) | ((−!x) & z)

fully multibit

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

((−!!x) & y) | ((−!x) & z)

fully multibit

(x ? y : z)

(x ? y : 0) | (x ? 0 : z)

((−!!x) & y) | ((−!x) & z)

simple operation performance

typical modern desktop processor:bitwise and/or/xor, shift, add, subtract, compare — ∼ 1 cycleinteger multiply — ∼ 1-3 cyclesinteger divide — ∼ 10-150 cycles

(smaller/simpler/lower-power processors are different)

add/subtract/compare are more complicated in hardware!

but much more important for typical applications

simple operation performance

typical modern desktop processor:bitwise and/or/xor, shift, add, subtract, compare — ∼ 1 cycleinteger multiply — ∼ 1-3 cyclesinteger divide — ∼ 10-150 cycles

(smaller/simpler/lower-power processors are different)

add/subtract/compare are more complicated in hardware!

but much more important for typical applications

problem: any-bit

is any bit of x set?

goal: turn 0 into 0, not zero into 1

easy C solution: !(!(x))another easy solution if you have − or + (lab exercise)

what if we don’t have ! or − or +

how do we solve is x is two bits? four bits?