1
1
openmpi/test
Rainer Keller 4e6a6fc146 - Check, whether the compiler supports __builtin_clz (count leading
zeroes);
   if so, use it for bit-operations like opal_cube_dim and opal_hibit.
   Implement two versions of power-of-two.
   In case of opal_next_poweroftwo, this reduces the average execution
   time from 83 cycles to 4 cycles (Intel Nehalem, icc, -O2, inlining,
   measured rdtsc, with loop over 2^27 values).
   Numbers for other functions are similar (but of course heavily depend
   on the usage, e.g. opal_hibit() with a start of 4 does not save
   much).  The bsr instruction on AMD Opteron is also not as fast.

 - Replace various places where the next power-of-two is computed.
   
   Tested on Intel Nehalem Cluster with openib, compilers GNU-4.6.1 and
   Intel-12.0.4 using mpi_testsuite -t "Collective" with 128 processes.

This commit was SVN r25270.
2011-10-11 22:49:01 +00:00
..
asm s/ompi/opal/gi in all of these files because they're really OPAL 2010-07-28 14:20:58 +00:00
carto Fix visibility issues with opal_graph functions. 2009-06-09 15:02:57 +00:00
class Add opal_tree class. This will be used in the future by sysinfo to store hw maps to be used by rmaps for the new affinity code. 2011-03-30 08:05:28 +00:00
datatype Fix some fairly-important typos (!) 2011-02-01 13:18:01 +00:00
dss - Adapt for changed opal_init() arguments -- takes argc&argv 2010-02-10 18:29:01 +00:00
event Clean up a few initialization issues - don't think these are impacting the shared memory situation as it didn't fix the problem. 2010-10-26 02:41:42 +00:00
memchecker merging the memchecker into trunk. 2008-02-12 08:46:27 +00:00
mpi Convert the bad dos line endings to unix style for all windows related files. 2010-12-02 12:08:08 +00:00
runtime Remove stale code associated with OPAL_THREADS_HAVE_DIFFERENT_PIDS. In the past, we have supported the case of really, really old Linux kernels where threads have different pids. However, when we updated the event library, we didn't also update that support code. In addition, when we dropped progress thread support, we didn't remove areas of the code that could no longer be compiled (i.e., were protected by "if progress thread && if have different pids). 2011-03-15 21:05:03 +00:00
support s/ompi/opal/gi in all of these files because they're really OPAL 2010-07-28 14:20:58 +00:00
threads Continue clean up of thread options started in r22841, 22842, and 22849. 2011-03-18 21:36:35 +00:00
util - Check, whether the compiler supports __builtin_clz (count leading 2011-10-11 22:49:01 +00:00
CMakeLists.txt Convert the bad dos line endings to unix style for all windows related files. 2010-12-02 12:08:08 +00:00
Makefile.am Move the predefined gap test to ompi/debuggers (we already have the 2010-07-28 16:22:10 +00:00