Performance comparison of TXBLAS vs. ATLAS 2.0
Operation: C = A B + C
C – m x n, A – m x k, B – k x n
Platform: Pentium II 233 MHz Dell Inspiron 3200 laptop
We report performance for m = 64:64:512, n = 64:64:512, k = 64:64:512
Figure 1: Mflop/sec attained for indicated matrix dimensions by current version of TXBLAS matrix-matrix multiply.
Figure 2: Mflop/sec attained for indicated matrix dimensions by ATLAS Release 2.0 matrix-matrix multiply.
Figure 3: Difference in Mflop/sec attained: TXBLAS – ATLAS reporting only those cases where TXBLAS outperforms ATLAS.
Figure 4: Difference in Mflop/sec attained: ATLAS - TXBLAS reporting only those cases where ATLAS outperforms TXBLAS.
Figure 5: Percent of cases measured where implementation outperforms the Mflop/sec rate indicated on the x-axis.
Figure 6: Percent of cases measured where implementation outperforms the indicated percent of peak, where peak is taken to equal 233 Mflop/sec.