Number of times this page has been accessed since July 21, 1995:
A High Performance Parallel Strassen Implementation
- Brian Grayson
- Department of Electrical and Computer Engineering
- University of Texas
- Austin, TX 78712
- bgrayson@pine.ece.utexas.edu
- Robert A. van de Geijn
- Department of Computer Sciences
- University of Texas
- Austin, TX 78712
- rvdg@cs.utexas.edu
Abstract
In this paper, we give what we believe to be the
first practical high performance parallel implementation of
Strassen's algorithm for matrix multiplication.
We show how under restricted conditions,
this algorithm can be implemented plug compatible
with standard parallel matrix multiplication algorithms.
Results obtained on a large Intel Paragon system
show a 10-20% reduction in execution time compared to
what we believe to be the fastest standard parallel
matrix multiplication implementation available at this
time.
Brian Grayson and Robert van de Geijn
"A High Performance Parallel Strassen Implementation,"
Parallel Processing Letters, Vol 6, No. 1 (1996) 3-12.