Unit 2.6.2 Summary
¶Subsubsection 2.6.2.1 The week in pictures
¶Subsubsection 2.6.2.2 Useful intrinsic functions
¶From Intel's Intrinsics Reference Guide
-
__m256d _mm256_loadu_pd (double const * mem_addr)
Description
Load 256-bits (composed of 4 packed double-precision (64-bit) floating-point elements) from memory into
dst
(output).mem_addr
does not need to be aligned on any particular boundary. -
__m256d _mm256_broadcast_sd (double const * mem_addr)
Description
Broadcast a double-precision (64-bit) floating-point element from memory to all elements of
dst
(output). -
__m256d _mm256_fmadd_pd (__m256d a, __m256d b, __m256d c)
Description
Multiply packed double-precision (64-bit) floating-point elements in
a
andb
, add the intermediate result to packed elements inc
, and store the results indst
(output).