b4e04bbd8a
Add logic to handle different architectural capabilities
Detect the compiler flags necessary to build specialized
versions of the MPI_OP. Once the different flavors (AVX512,
AVX2, AVX) are built, detect at runtime which is the best
match with the current processor capabilities.
Add validation checks for loadu 256 and 512 bits.
Add validation tests for MPI_Op.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
Signed-off-by: dongzhong <zhongdong0321@hotmail.com>
Signed-off-by: George Bosilca <bosilca@icl.utk.edu>
(cherry picked from commit
|
||
---|---|---|
.. | ||
asm | ||
carto | ||
class | ||
datatype | ||
dss | ||
event | ||
memchecker | ||
monitoring | ||
mpi | ||
mpool | ||
runtime | ||
spc | ||
support | ||
threads | ||
util | ||
Makefile.am |