Notes on Homework 1

Notes on Homework 1What is Peak ?

Summary of SSE intrinsics

#include <emmintrin.h>Vector data type:• __m128dLoad and store operations:• _mm_load_pd• _mm_store_pd• _mm_loadu_pd• _mm_storeu_pdLoad and broadcast across vector• _mm_load1_pdArithmetic:• _mm_add_pd• _mm_mul_pd

2x2 Matrix MultiplyC00 += A00B10 + A01B00

C10 += A10B10 + A11B00

C01 += A00B11 + A01B01

C11 += A10B11 + A11B01

Rewrite as SIMD algebra

C00_C10 += A00_A10 * B10_B10C01_C11 += A00_A10 * B11_B11C00_C10 += A01_A11 * B00_B00C01_C11 += A01_A11 * B01_B01

02/11/2009CS267 Lecture 7

Example: multiplying 2x2 matrices#include <emmintrin.h>c1 = _mm_loadu_pd( C+0*lda ); //load unaligned block in Cc2 = _mm_loadu_pd( C+1*lda );for( int i = 0; i < 2; i++ ){ a = _mm_load_pd( A+i*lda );//load aligned i-th column of A b1 = _mm_load1_pd( B+i+0*lda ); //load i-th row of B b2 = _mm_load1_pd( B+i+1*lda ); c1=_mm_add_pd( c1, _mm_mul_pd( a, b1 ) ); //rank-1 update c2=_mm_add_pd( c2, _mm_mul_pd( a, b2 ) );}_mm_storeu_pd( C+0*lda, c1 ); //store unaligned block in C_mm_storeu_pd( C+1*lda, c2 );

Other Issues

Checking efficiency of the compiler helps• Use -S option to see the generated assembly code• Inner loop should consist mostly of ADDPD and MULPD ops,

ADDSD and MULSD imply scalar computations

• Consider using another compiler• Options are PGI, PathScale and GNU• Cray C compiler doesn’t seem to have any way to use SSE

• It is capable of auto-vectorizing loops it recognizes

• Look through Goto and van de Geijn’s paper• Don’t ignore the other optimizations

• more compiler flags• loop unrolling• inlining• keywords (ask your classmates!)

Notes on Homework 1

Documents

Transcript of Notes on Homework 1

Structural Formulas AGENDA: Review Check homework Notes on Structural Formulas Homework: Drawing Formluas.

Combined Gas Law Agenda Demos & Notes on gas laws Homework: Gas Laws.

Daily Agenda Daily Trivia Agenda Check Homework Notes on Genetic Crosses Groupwork on HW worksheets Homework.

Class02 ChemistryG12 Notes and Homework

Mixtures Agenda Review Yesterday’s homework Notes on Mixtures Homework.

10.19.15 EDU physical and chemcial Agenda: notes Homework: due wednesday Agenda: notes Homework: due wednesday (5) Matter and energy. The student knows.

Agenda 11/16/12 Warm-up on American Revolution Review Homework on American Revolution Role Play – Estates General Notes on French Revolution Homework-

IS 325 Notes for Wednesday September 18, 2013. Homework Grades/Feedback I’m behind on homework.. I will catch-up this weekend.

Algebra 2 Schedule: Class Announcements Wednesday Warm-Up Homework Check/Questions 2.2 Notes Begin Homework.

Homework problem - LSU · Written Homework Each week there will be two problems assigned as an open book, open notes, work together homework assignment. They will be available on

Class05 ChemistryG12 Notes and Homework

Class01 ChemistryG12 Notes and Homework

First Year Engineering Class Notes and Homework Workbook.

Stat 112 Notes 15 Today: –Outliers and influential points. Homework 4 due on Thursday.

Renaissance Notes Unit 3 – Ms. Doyle. Page 2: Renaissance Notes Agenda: Notes The Prince – GIST Finish brochures, if time Homework: Quiz tomorrow on renaissance!

Class07 ChemistryG12 Notes and Homework

Notes Plus Homework

Chapter 1 Notes and Homework Packet - Weebly

Algebra 1(A/B) Fall 2012. Class Announcements Homework Check Warm-Up 4.3 Notes Begin Homework.

Today in Pre-Calculus Go over homework Notes: Symmetry –Need a calculator Homework.