1
1
Граф коммитов

17946 Коммитов

Автор SHA1 Сообщение Дата
Ralph Castain
cc29f8ff95 Attempt to fix the stupid Cray PMI problem
This commit was SVN r27742.
2013-01-04 02:53:42 +00:00
Nathan Hjelm
6a9ab9b221 Change orte_startup_timeout to be in seconds and remove the 10 second maximum
This commit was SVN r27741.
2013-01-03 23:56:34 +00:00
Nathan Hjelm
84e34ee0d7 Fix a bug in the uGNI btl that could cause certain descriptor callbacks to be called twice.
There was a race condition in the eager get protocol where the RDMA complete message could be received before the local completion of the SMSG message that started the eager get protocol.

cmr:v1.7

This commit was SVN r27740.
2013-01-03 23:11:13 +00:00
Samuel Gutierrez
a159bfaf25 sm BTL initialization via modex, as discussed at last year's meeting.
This commit was SVN r27739.
2013-01-03 21:52:20 +00:00
Ralph Castain
81a8e21939 Need to have the event thread running during init/finalize, but we still have a problem with cleanup - so comment out the event_base_free for now.
This commit was SVN r27738.
2013-01-03 02:16:57 +00:00
Ralph Castain
c65de32218 Cleanup the PMI subsystems to support Sam's "rml-less" shared memory wireup. Only retrieve keys that are specifically requested, and only when they are requested. Let string values be segmented across multiple keys, but don't do it for anything else.
This commit was SVN r27737.
2013-01-03 02:16:10 +00:00
Ralph Castain
ec2f6abb69 Cleanup minor memory leak
This commit was SVN r27736.
2013-01-03 02:04:54 +00:00
George Bosilca
a1754dfe31 Don't leave the cid registered in any case.
This commit was SVN r27735.
2013-01-02 18:32:46 +00:00
Matthias Jurenz
1de078c888 Changes to OTF:
- configure: fixed warnings from automake 1.12.x

Changes to VT:
	- general:
		- incremented version number to 5.14.2
	- configure:
		- fixed warnings from automake 1.12.x
	- VT libs / MPI wrappers:
		- do initialize VT and enter the dummy main function ("user") if MPI_Initialized is the very first event to be recorded (fixed assertion error)
		- leave the dummy main function on the same thread where it is entered (fixes potential stack underflow)

This commit was SVN r27734.
2013-01-02 13:42:17 +00:00
Mike Dubman
889d46e966 support for FCA v3.0 and up
This commit was SVN r27731.
2012-12-31 05:49:22 +00:00
Ralph Castain
c1690f403e Remove non-existent file
This commit was SVN r27730.
2012-12-29 02:21:50 +00:00
Ralph Castain
68329b516c Cleanup stale test codes
This commit was SVN r27729.
2012-12-28 16:52:51 +00:00
Ralph Castain
07194c6564 Add platform files
This commit was SVN r27728.
2012-12-28 16:52:21 +00:00
Ralph Castain
2353803a0f Bend cluster now has slurm
This commit was SVN r27727.
2012-12-28 03:05:51 +00:00
Ralph Castain
d482c6acc1 Improve the error message a bit
This commit was SVN r27726.
2012-12-28 03:02:07 +00:00
Ralph Castain
fb5079a310 Update the pmi detection to report a more meaningful error message when with-pmi is specified but the pmi.h file cannot be found
This commit was SVN r27725.
2012-12-27 04:57:30 +00:00
Mike Dubman
b6d50a5733 Performance optimizations by alexm:
* btl sendi(): if message can be send inline try to avoid signal
* signal is requested one per 64 or when
    there are no send wqes 
    when message can not be send inline 
    any other btl method then sendi()

This commit was SVN r27724.
2012-12-26 10:19:12 +00:00
Ralph Castain
d1163ebbf2 Ensure we cleanup DFS worker threads during finalize to avoid segfaulting in MCA param cleanup
This commit was SVN r27723.
2012-12-25 21:17:35 +00:00
Ralph Castain
64da742d5f Remove the orte_finalize_event variable - no longer needed
This commit was SVN r27722.
2012-12-25 19:33:20 +00:00
Ralph Castain
cada035f38 Fix the segfault problem in the orteds - turns out it only occurred with progress threads enabled. Ensure the thread gets started at the right time (at the end of init), although the event base gets created earlier. Remove the finalize event as we can instead use the loopbreak call to exit the event loop.
This commit was SVN r27721.
2012-12-25 19:30:18 +00:00
Ralph Castain
c8e34813b6 THIS IS A TEMPORARY FIX - do not finalize opal as the parameter system has been broken and will segfault when finalized.
THIS PATCH MUST BE REMOVED WHEN THE PARAMETER SYSTEM HAS BEEN FIXED.

This commit was SVN r27720.
2012-12-24 18:42:19 +00:00
Ralph Castain
b1925d35c7 Silence asprintf warnings in ompi_info, add libs to build under Ubuntu
This commit was SVN r27719.
2012-12-23 19:54:44 +00:00
George Bosilca
ed77868984 No need for event.h in the SM BTL.
This commit was SVN r27718.
2012-12-23 19:33:53 +00:00
Ralph Castain
72bea688f1 Fix typo
This commit was SVN r27717.
2012-12-23 18:13:39 +00:00
Ralph Castain
852a709c0e Add libopen-pal to the libraries as all these tools directly reference OPAL functions, and the list of OS's that don't support indirect linking grows (Mac and Ubuntu, for now).
This commit was SVN r27716.
2012-12-23 15:54:05 +00:00
Brian Barrett
b2efc9ffa9 On OpenBSD, both sys/mount.h and sys/sysctl.h require sys/param.h in order
to compile.  With recent Autoconf versions, AC_CHECK_HEADERS only succeeds
if the header compiles, so need to check these two with sys/param.h
explicitly included in the search list...

This commit was SVN r27715.
2012-12-22 16:32:26 +00:00
Brian Barrett
8231cb62a2 On OpenBSD (and probably others), expr operates on a 32 bit signed integer,
which is the same size as a Fortran or C integer.  This resulted in configure
coming up with Fortran's MAX_INT as -2^31, which obviously isn't a positive
number.  Since we found the MAX_INT using the same broken loop in a couple
places and doing it right is complicated, added a new macro that is much
more careful about sign roll-over.

During the Fortran rework between v1.6 and v1.7, the variable which
indicates whether or not Fortran is being compiled changed, so on platforms
without Fortran compilers, we were trying to determine the max value for
Fortran INTEGERS where we previously didn't.  I believe this is why
bug #3374 appeared as a regression.

Finally, since the OMPI code doesn't cope with OMPI_FORTRAN_HANDLE_MAX
being negative (which was the root cause of the segfault in $3374),
add a check at the end of the OMPI_FORTRAN_GET_HANDLE_MAX macro to
ensure that OMPI_FORTRAN_HANDLE_MAX is always non-negative.

This commit was SVN r27714.
2012-12-22 16:24:36 +00:00
Matthias Jurenz
295b04266b Changes to VT:
- configure: pass the MPI configure options (e.g. --with-mpi-lib, --with-mpi-inc-dir) to the OTF configure, even if MPI compiler wrappers were found

This commit was SVN r27710.
2012-12-19 14:46:16 +00:00
George Bosilca
4a581d276d Fix the issues with the MPI_Op and the Fortran90 types.
This commit was SVN r27707.
2012-12-19 11:08:18 +00:00
Ralph Castain
6046812952 Add float and struct timeval fields to the opal_value_t object, and provide dss support for those data types
This commit was SVN r27705.
2012-12-19 00:14:19 +00:00
Jeff Squyres
b29b852281 Consolidate all the opal/orte/ompi .m4 files back to the top-level
config/ directory.  We split them apart a while ago in the hopes that
it would simplify things, but it didn't really (e.g., because there
were still some ompi/opal .m4 files in the top-level config/
directory, resulting in developer confusion where any given m4 macro
was defined).

So this commit consolidates them back into the top-level directory for
simplicity.  

There's still (at least) two changes that would be nice to make:

 1. Split any generated .m4 file (e.g., autogen-generated .m4 files)
    into a separate directory somewhere so that a top-level -Iconfig/
    will only get our explicitly defined macros, not the autogen stuff
    (e.g., with libevent2019 needing to get the visibility macro, but
    NOT all the autogen-generated inclusion of component configure.m4
    files).
 1. Change configure to be of the form:
{{{
# ...a small amount of preamble/setup...
OPAL_SETUP
m4_ifdef([project_orte], [ORTE_SETUP])
m4_ifdef([project_ompi], [OMPI_SETUP])
# ...a small amount of finishing stuff...
}}}

I doubt we'll ever get anything as clean as that, but that would be
the goal to shoot for.

This commit was SVN r27704.
2012-12-19 00:00:36 +00:00
Jeff Squyres
cdddbadc52 Minor improvements to configure.ac:
* We always build OPAL, so always do the filesystem case-sensitivity
   check 
 * Protect the extra ORTE/OMPI macros with "if we're building this
   project..." tests

This commit was SVN r27699.
2012-12-18 23:51:38 +00:00
Brad Benton
b4a79f8983 Adding power6 & power7-specific platform files
This commit was SVN r27690.
2012-12-18 21:37:14 +00:00
Brad Benton
388f34193b Updates for ompi-trunk flag changes
Updates for generic ppc flags

This commit was SVN r27689.
2012-12-18 21:35:54 +00:00
Ralph Castain
ab73d11368 Oops - push missing definitions
This commit was SVN r27688.
2012-12-18 16:43:03 +00:00
Matthias Jurenz
3509deecc3 Changes to VT:
- compiler wrappers: removed invocation of pdbcomment; it removes essential information from the PDB file for instrumenting functions

This commit was SVN r27687.
2012-12-18 13:41:21 +00:00
Ralph Castain
c5ba59ba67 Remove stale component
This commit was SVN r27684.
2012-12-18 04:01:16 +00:00
Ralph Castain
0427a478b2 Remove stale component
This commit was SVN r27683.
2012-12-18 04:00:51 +00:00
Ralph Castain
82f1ba0ea8 Fix static port usage, ensure that both ipv4 and ipv6 are given if ipv6 was enabled
This commit was SVN r27682.
2012-12-18 03:59:49 +00:00
Ralph Castain
885fc8432d Fix the printing and handling of sample times in stats objects
This commit was SVN r27681.
2012-12-18 03:45:09 +00:00
Nathan Hjelm
ef49fcea25 Remove debug printfs.
cmr:v1.7

This commit was SVN r27680.
2012-12-17 16:34:07 +00:00
Matthias Jurenz
9f9fded4de Changes to OTF:
- otfprofile: fixed build error when using the IBM XL C++ compiler

Changes to VT:
	- configure:
		- use AC_CHECK_TYPES instead of AC_CHECK_DECLS to check for PAPI's long_long type
		- use AC_C_INLINE to check whether C 'inline' is present
	- VT libs:
		- set CUPTI tracing as default, if CUDA runtime wrapper has not been built
		- added a common interface for all NVIDIA CUPTI interfaces (events, activity, callbacks)
		- added support for concurrent kernel tracing (since CUDA 5.0)
		- removed almost unused VT_DEBUG env. variable - replaced calls to vt_debug_msg(DBG_LEVEL,...) by vt_cntl_msg (DBG_LEVEL+10,...)
		- added some more ifdefs to new CUDA 5 features
		- added several guards for internal malloc() and free() calls in CUDA related source files
		- revised memory allocation tracing:
			- intercept memory (de)alloaction functions by library wrapping (replaces deprecated hook technique from the GNU C library)
			- added support for multi-threaded applications
			- added wrapper functions for memalign, posix_memalign, and valloc
		- revised exec,system,fork tracing
			- retitled to "Child Process Execution Tracing"
			- introduced env. variable VT_EXECTRACE (marked VT_LIBCTRACE as deprecated)
			- added wrapper functions for execvpe, fexecve, waitid, wait3, and wait4
		- changes default function group name for
			- memory (de)allocation functions: "MEM" -> "LIBC-MALLOC"
			- I/O functions: "I/O" -> "LIBC-I/O"
			- child process execution: "LIBC" -> "LIBC-EXEC"
	- plugin counter interface:
		- added check for initialized vt_plugin_cntr_info.info
	- vtdyn:
		- added missing header includes for Dyninst 8
	- vtrun:
		- do not preload the Dyninst Runtime library; it is loaded by Dyninst itself

This commit was SVN r27679.
2012-12-17 14:40:07 +00:00
Jeff Squyres
8e304f0fb5 Remove the comment about OSCAR being long gone; it's apparently alive
and well.

This commit was SVN r27674.
2012-12-15 13:08:04 +00:00
Ralph Castain
2fdd367aa9 Refs trac:3429
Fix bug reported by FreyGuy19713: in cases where HNP node has multiple entries in a hostfile or other allocation, we need to track the total slots allocated to that node.

This commit was SVN r27673.

The following Trac tickets were found above:
  Ticket 3429 --> https://svn.open-mpi.org/trac/ompi/ticket/3429
2012-12-14 17:00:44 +00:00
Jeff Squyres
c5b0bcd9f7 Refs trac:3422
* Add some comments in the *-wrapper-data-txt.in files just so that
   someone doesn't forget in the future why we link in what we do in
   the MPI and ORTE wrapper compilers.
 * Update ompi_wrapper_script.in to match the new behavior.
 * Update orte_wrapper_script.in to support --openmpi:linkall (which
   is a no-op in this case)

This commit was SVN r27672.

The following Trac tickets were found above:
  Ticket 3422 --> https://svn.open-mpi.org/trac/ompi/ticket/3422
2012-12-14 16:34:20 +00:00
Nathan Hjelm
ba5b2b0540 btl/vader: fix bug in single copy code that could cause ob1 sends to not get marked complete.
cmr:v1.7

This commit was SVN r27671.
2012-12-13 23:18:53 +00:00
Jeff Squyres
f779b1ded9 Put back the static-library-detection stuff from r27668, with some
additional functionality.  Rationale (refs trac:3422):

 * Normal MPI applications only ever use the MPI API. Hence, -lmpi is
   sufficient (they'll never directly call ORTE or OPAL
   functions). This is arguably the most common case.
 * That being said, we do have some test programs (e.g., those in
   orte/test/mpi) that call MPI functions but also call ORTE/OPAL
   functions. I've also written the occasional MPI test program that
   calls opal_output, for example (there even might be a few tests in
   the IBM test suite that directly call ORTE/OPAL functions).
   * Even though this is not a common case, these applications should
     also compile/link with mpicc.
   * So we should add a --openmpi:linkall option that will also link
     in whatever is necessary to call ORTE/OPAL functions
   * Yes, we could hard-code "-lopen-rte -lopen-pal" in Makefiles, but
     we do reserve the right to change those library names and/or add
     others someday, so it's better to abstract out the names and let
     the wrapper supply whatever is necessary.
 * ORTE programs, however, are different. They almost always call OPAL
   functions (e.g., if they want to send a message, they must use the
   OPAL DSS). As such, it seems like the ORTE programs should always
   link in OPAL.

Therefore:

 * Add undocumented --openmpi:linkall flag to the wrapper compilers.
   See the comment in opal_wrapper.c for an explanation of what it
   does.  This flag is only intended for Open MPI developers -- not
   end users.  That's why it's undocumented.
 * Update orte/test/mpi/Makefile.am to add --openmpi:linkall
 * Make ortecc/ortec++'s wrapper data text files always explicitly
   link in libopen-pal

This commit was SVN r27670.

The following SVN revision numbers were found above:
  r27668 --> open-mpi/ompi@cf845897aa

The following Trac tickets were found above:
  Ticket 3422 --> https://svn.open-mpi.org/trac/ompi/ticket/3422
2012-12-13 22:31:37 +00:00
Jeff Squyres
18e0c75857 Ensure to set Fortran ierr properly at the end of
MPI_Get_processor_name.  Thanks to LANL for spotting the issue.

This commit was SVN r27669.
2012-12-13 15:03:48 +00:00
Jeff Squyres
cf845897aa Temporarily revert r27662 and r27667 because something wonky is
happening on OS X.  Grumble...

This commit was SVN r27668.

The following SVN revision numbers were found above:
  r27662 --> open-mpi/ompi@97cc916007
  r27667 --> open-mpi/ompi@529f6244ca
2012-12-11 23:08:14 +00:00
Jeff Squyres
529f6244ca If a user supplies both (some form of --static) and (some form of
--dynamic), use the one that was farthest to the right on the command
line.

This commit was SVN r27667.
2012-12-11 21:25:00 +00:00