BLIS Retreat 2024:
Sept. 26-27, UT Austin
Contents
1 About
1.1 Acknowledgements
2 Announcements
3 Format
4 Logistics
4.1 Zoom/Discord
4.1.1 Zoom
4.1.1.1 Before the meeting
4.1.1.2 To join the meetings
4.1.1.3 Zoom basics
4.1.2 Discord (discussion)
4.2 For speakers
4.3 For those who attend in person
4.3.1 Masks
4.3.2 Parking
4.3.3 Getting to the meeting room
4.3.4 Before the meeting (Thursday and Friday)
4.3.5 Lunch on Thursday
4.3.6 Dinner on Thursday
4.3.7 Friday afternoon
5 Program
5.1 Program at a glance
5.2 Talks
5.2.1 Nikoli Dryden, "A Distributed Multilinear Algebra Library for Deep Learning"
5.2.2 Carl Kwan, "The Cholesky Factorization Theorem in ACL2"
5.2.3 Joe Dobson, "Strategy Selection in the Arm Performance Libraries"
5.2.4 Elliott Binder, "FAST Attention for Small Tensors"
5.2.5 Upasana Sridhar, "Layer fusion with composable abstraction"
5.2.6 Cem Bassoy, "Fast and layout-oblivious tensor-matrix multiplication with BLAS"
5.2.7 Jim Demmel, "How to grade the accuracy of an implementation of the BLAS; Short update on Exception Handling"
5.2.8 Grace Dinh, "Cost Estimation and Bounds for Sparse Kernels"
5.2.9 Evarist Fomenko, "NVPL BLAS Architecture and Implementation Overview"
5.2.10 Thijs Steel, "Communication efficient application of sequences of rotations to a matrix"
5.2.11 Bhaskar Nallani, "LPGEMM Enhancements in AOCL BLAS"
5.2.12 Arnav Sharma, "BLAS Extension APIs"
5.2.13 Eleni Vlachopoulou, "CMake Build System in AOCL BLAS"
5.2.14 Sridhar Govindaswamy, "Close coupling of AOCL BLAS in AOCL LAPACK"
5.2.15 Stepan Nassyr, "Simulating Parameterized Kernels on Parameterized Architectures"
5.2.16 Devin Matthews, "The state of BLIS 1.0 and 2.0"
5.2.17 Devin Matthews and Robert van de Geijn, "Vertical integration of the linear and multilinear software stack"
5.2.18 Devangi Parikh and Greg Henry, "Accuracy study of Cascading GEMM"
5.2.19 Chao Yin, "LTLT Decomposition of a Skew-Symmetric Matrix - High Performance Implementation"
5.2.20 Ishna Satyarth, "LTLT Decomposition of a Skew-Symmetric Matrix - Derivation"
6 BLIS Retreat Participants
6.1 Academia
6.2 Industry
6.3 National Labs
7 Bulletin Board
7.1 Instructions
8 Contacts
5 Program
5.1 Program at a glance
5.2 Talks