LAFF-On Programming for High Performance:
ulaff.net
Robert van de Geijn, Margaret Myers, Devangi Parikh
Colophon
Authored in PreTeXt
Section 1.2 Loop Orderings
1.2.1 Mapping matrices to memory
1.2.2 The leading dimension
1.2.3 A convention regarding the letters used for the loop index
1.2.4 Ordering the loops