Unit 4.1.2 Outline Week 4
ΒΆ-
4.1 Opening Remarks
4.1.1 Launch
4.1.2 Outline Week 4
4.1.3 What you will learn
-
4.2 OpenMP
4.2.1 Of cores and threads
4.2.2 Basics
4.2.3 Hello World!
-
4.3 Multithreading Matrix Multiplication
4.3.1 Lots of loops to parallelize
4.3.2 Parallelizing the first loop around the micro-kernel
4.3.3 Parallelizing the second loop around the micro-kernel
4.3.4 Parallelizing the third loop around the micro-kernel
4.3.5 Parallelizing the fourth loop around the micro-kernel
4.3.6 Parallelizing the fifth loop around the micro-kernel
4.3.7 Discussion
-
4.4 Parallelizing More
4.4.1 Speedup, efficiency, etc.
4.4.2 Ahmdahl's law
4.4.3 Parallelizing the packing
4.4.4 Parallelizing multiple loops
-
4.5 Enrichments
4.5.1 Casting computation in terms of matrix-matrix multiplication
4.5.2 Family values
4.5.3 Matrix-matrix multiplication for Machine Learning
4.5.4 Matrix-matrix multiplication on GPUs
4.5.5 Matrix-matrix multiplication on distributed memory architectures
-
4.6 Wrap Up
4.6.1 Additional exercises
4.6.2 Summary