Skip to main content
LAFF-On Programming for High Performance:
ulaff.net
Robert van de Geijn, Margaret Myers, Devangi Parikh
Contents
Index
Prev
Up
Next
Contents
Prev
Up
Next
Front Matter
Colophon
Dedication
Acknowledgements
Preface
0
Getting Started
Opening Remarks
Navigating the Course
Setting Up
Experimental Setup
Enrichments
Wrap Up
1
Loops and More Loops
Opening Remarks
Loop Orderings
Layering Matrix-Matrix Multiplication
Layering Matrix-Matrix Multiplication: Alternatives
Enrichments
Wrap Up
2
Start Your Engines
Opening Remarks
Blocked Matrix-Matrix Multiplication
Blocking for Registers
Optimizing the Micro-kernel
Enrichments
Wrap Up
3
Pushing the Limits
Opening Remarks
Leveraging the Caches
Packing
Further Tricks of the Trade
Enrichments
Wrap Up
4
Multithreaded Parallelism
Opening Remarks
OpenMP
Multithreading Matrix Multiplication
Parallelizing More
Enrichments
Wrap Up
Back Matter
A
B
GNU Free Documentation License
References
Index
Colophon
Authored in PreTeXt
permalink
LAFF-On Programming for High Performance:
ulaff.net
Robert van de Geijn
Department of Computer Science
and
Oden Institute for Computational Engineering and Sciences
The University of Texas at Austin
rvdg@cs.utexas.edu
Margaret Myers
Department of Statistics and Data Sciences
The University of Texas at Austin
myers@cs.utexas.edu
Devangi Parikh
Oden Institute for Computational Engineering and Sciences
The University of Texas at Austin
dnp@cs.utexas.edu
November 25, 2021
Colophon
Dedication
Acknowledgements
Preface
login