Commit Graph

6207 Commits

Author SHA1 Message Date
Brian Barrett
14f4aa1198 Fix memory leak in nbc init
This commit was SVN r27884.
2013-01-21 22:45:59 +00:00
Brian Barrett
407714a85a Fix a memory leak in the RDMA one-sided component. Thanks to Victor Vysotskiy
for letting us know about this one.

This commit was SVN r27883.
2013-01-21 22:45:37 +00:00
George Bosilca
42753b4690 Make the TCP BTL really fail-safe. It now triggers the error callback on
all pending fragments when the destination goes down. This allows the PML
to recalibrate its behavior, either find an alternate route or just give up.

This commit was SVN r27881.
2013-01-21 11:41:08 +00:00
George Bosilca
6bdd2e063a My understanding is that an MPI_WAIT() on an inactive request should
return the empty status (MPI 3.0 page 52 line 46).

This commit was SVN r27880.
2013-01-21 11:35:42 +00:00
George Bosilca
d2281cc672 Remove the CMA related warnings.
This commit was SVN r27872.
2013-01-19 14:26:43 +00:00
Ralph Castain
e4d7d87ac7 Get make to respect V=0 - thanks Jeff
This commit was SVN r27871.
2013-01-19 14:26:37 +00:00
Ralph Castain
252ea297ff Silence warnings
This commit was SVN r27870.
2013-01-19 09:11:17 +00:00
Matthias Jurenz
fd00910616 Changes to VT:
- library wrapping: Prevent calling dlerror if the memory allocation wrappers are enabled. dlerror calls realloc, which would end up in an infinite recursion.

This commit was SVN r27869.
2013-01-18 16:29:06 +00:00
Rolf vandeVaart
f63c88701f Improve CUDA GPU transfers over openib BTL. Use asynchronous copies.
This is the RFC that was submitted in July and December of 2012.

This commit was SVN r27862.
2013-01-17 22:34:43 +00:00
Brian Barrett
fc3df11e08 Remove the (only two) fortran constants from OPAL. The only places that
actually care if opal_pointer_array is limited to handle_max already pass
that in as the max_size during init, so it isn't needed there.  The arch
constant was a bit more difficult, so pass that in during MPI init and
leave empty otherwise.

This is to help with the effort to allow building ompi against an external
opal or orte.

This commit was SVN r27817.
2013-01-15 01:27:36 +00:00
Jeff Squyres
8b5276901a Didn't mean to commit this change as part of r27806.
This commit was SVN r27807.

The following SVN revision numbers were found above:
  r27806 --> open-mpi/ompi@f877248d5c
2013-01-14 16:36:20 +00:00
Jeff Squyres
f877248d5c Remove some generated files
This commit was SVN r27806.
2013-01-14 16:32:57 +00:00
Rolf vandeVaart
a07a4bb3f7 Update smcuda to match recent changes in sm BTL.
This commit was SVN r27803.
2013-01-14 14:42:19 +00:00
Rolf vandeVaart
34d1f0a585 Add some comments to the #ifdefs for clarity. No functional changes.
This commit was SVN r27802.
2013-01-13 16:08:48 +00:00
Alex Mikheev
344d407ed4 fixed compilation warning
always send signalled when BTL_OPENIB_FAILOVER is defined

This commit was SVN r27801.
2013-01-13 10:11:03 +00:00
Jeff Squyres
b2d5d1e348 Along with the Automake 1.13.x changes in r27790, rename these third
party configure.in scripts to be configure.ac so that Automake stops
complaining about them.

This commit was SVN r27791.

The following SVN revision numbers were found above:
  r27790 --> open-mpi/ompi@675a2f5c48
2013-01-11 20:26:19 +00:00
Jeff Squyres
675a2f5c48 Updates for Automake 1.13.x. Without these changes, Automake 1.13.x
will error out, due to use of the
previously-deprecated-and-now-removed AM_CONFIG_HEADER macro.

This commit was SVN r27790.
2013-01-11 20:20:02 +00:00
Samuel Gutierrez
4c28c8cbd0 New sm BTL initialization take two. This approach is pretty simple. Instead of
using the modex or RML to share sm initialization information, have node rank 0
create a file containing initialization information in a well-known place. Then
during add_procs, the rest of the node processes requiring sm BTL initialization
will just read from that file to complete their initialization.

This commit was SVN r27789.
2013-01-11 16:24:56 +00:00
Jeff Squyres
344ac08d22 Fix from Jed Brown for fairly obvious typos.
This commit was SVN r27787.
2013-01-10 19:02:08 +00:00
Brian Barrett
b817166072 Use a process name instead of a name list in bcol_basesmuma
This commit was SVN r27779.
2013-01-09 16:43:49 +00:00
Joshua Ladd
77df51c516 Fixes the definition of the first fragment and does not assume that first frag has offset_into_user_buff equal to zero. This fix should be added to cmr:v1.7.1:reviewer=pasha
This commit was SVN r27775.
2013-01-08 20:24:58 +00:00
Matthias Jurenz
9e47ef47ac Changes to VT:
- fixed build error reported in http://www.open-mpi.org/community/lists/users/2013/01/21081.php:
		- removed typo
		- added configure test for declaration of the global 'environ' variable in unistd.h

This commit was SVN r27770.
2013-01-08 14:30:13 +00:00
Alex Mikheev
fe672f255f request signal when sending over SRQ and number of SRQ sd_credits is 0
This commit was SVN r27767.
2013-01-08 14:00:29 +00:00
Samuel Gutierrez
c4acd20eb9 Backout r27739.
This commit was SVN r27745.

The following SVN revision numbers were found above:
  r27739 --> open-mpi/ompi@a159bfaf25
2013-01-05 01:54:23 +00:00
George Bosilca
791f31d688 Remove the unnecessary ompi_modex_recv_pointer function.
This commit was SVN r27744.
2013-01-04 20:50:25 +00:00
Nathan Hjelm
84e34ee0d7 Fix a bug in the uGNI btl that could cause certain descriptor callbacks to be called twice.
There was a race condition in the eager get protocol where the RDMA complete message could be received before the local completion of the SMSG message that started the eager get protocol.

cmr:v1.7

This commit was SVN r27740.
2013-01-03 23:11:13 +00:00
Samuel Gutierrez
a159bfaf25 sm BTL initialization via modex, as discussed at last year's meeting.
This commit was SVN r27739.
2013-01-03 21:52:20 +00:00
Ralph Castain
ec2f6abb69 Cleanup minor memory leak
This commit was SVN r27736.
2013-01-03 02:04:54 +00:00
George Bosilca
a1754dfe31 Don't leave the cid registered in any case.
This commit was SVN r27735.
2013-01-02 18:32:46 +00:00
Matthias Jurenz
1de078c888 Changes to OTF:
- configure: fixed warnings from automake 1.12.x

Changes to VT:
	- general:
		- incremented version number to 5.14.2
	- configure:
		- fixed warnings from automake 1.12.x
	- VT libs / MPI wrappers:
		- do initialize VT and enter the dummy main function ("user") if MPI_Initialized is the very first event to be recorded (fixed assertion error)
		- leave the dummy main function on the same thread where it is entered (fixes potential stack underflow)

This commit was SVN r27734.
2013-01-02 13:42:17 +00:00
Mike Dubman
889d46e966 support for FCA v3.0 and up
This commit was SVN r27731.
2012-12-31 05:49:22 +00:00
Mike Dubman
b6d50a5733 Performance optimizations by alexm:
* btl sendi(): if the message can be sent inline, try to avoid a signal
* a signal is requested once per 64, or when:
    - there are no send wqes
    - the message cannot be sent inline
    - any btl method other than sendi() is used

This commit was SVN r27724.
2012-12-26 10:19:12 +00:00
Ralph Castain
b1925d35c7 Silence asprintf warnings in ompi_info, add libs to build under Ubuntu
This commit was SVN r27719.
2012-12-23 19:54:44 +00:00
George Bosilca
ed77868984 No need for event.h in the SM BTL.
This commit was SVN r27718.
2012-12-23 19:33:53 +00:00
Matthias Jurenz
295b04266b Changes to VT:
- configure: pass the MPI configure options (e.g. --with-mpi-lib, --with-mpi-inc-dir) to the OTF configure, even if MPI compiler wrappers were found

This commit was SVN r27710.
2012-12-19 14:46:16 +00:00
George Bosilca
4a581d276d Fix the issues with the MPI_Op and the Fortran90 types.
This commit was SVN r27707.
2012-12-19 11:08:18 +00:00
Jeff Squyres
b29b852281 Consolidate all the opal/orte/ompi .m4 files back to the top-level
config/ directory.  We split them apart a while ago in the hopes that
it would simplify things, but it didn't really (e.g., because there
were still some ompi/opal .m4 files in the top-level config/
directory, resulting in developer confusion about where any given m4 macro
was defined).

So this commit consolidates them back into the top-level directory for
simplicity.  

There are still (at least) two changes that would be nice to make:

 1. Split any generated .m4 file (e.g., autogen-generated .m4 files)
    into a separate directory somewhere so that a top-level -Iconfig/
    will only get our explicitly defined macros, not the autogen stuff
    (e.g., with libevent2019 needing to get the visibility macro, but
    NOT all the autogen-generated inclusion of component configure.m4
    files).
 1. Change configure to be of the form:
{{{
# ...a small amount of preamble/setup...
OPAL_SETUP
m4_ifdef([project_orte], [ORTE_SETUP])
m4_ifdef([project_ompi], [OMPI_SETUP])
# ...a small amount of finishing stuff...
}}}

I doubt we'll ever get anything as clean as that, but that would be
the goal to shoot for.

This commit was SVN r27704.
2012-12-19 00:00:36 +00:00
Matthias Jurenz
3509deecc3 Changes to VT:
- compiler wrappers: removed invocation of pdbcomment; it removes essential information from the PDB file for instrumenting functions

This commit was SVN r27687.
2012-12-18 13:41:21 +00:00
Nathan Hjelm
ef49fcea25 Remove debug printfs.
cmr:v1.7

This commit was SVN r27680.
2012-12-17 16:34:07 +00:00
Matthias Jurenz
9f9fded4de Changes to OTF:
- otfprofile: fixed build error when using the IBM XL C++ compiler

Changes to VT:
	- configure:
		- use AC_CHECK_TYPES instead of AC_CHECK_DECLS to check for PAPI's long_long type
		- use AC_C_INLINE to check whether C 'inline' is present
	- VT libs:
		- set CUPTI tracing as default, if CUDA runtime wrapper has not been built
		- added a common interface for all NVIDIA CUPTI interfaces (events, activity, callbacks)
		- added support for concurrent kernel tracing (since CUDA 5.0)
		- removed almost unused VT_DEBUG env. variable - replaced calls to vt_debug_msg(DBG_LEVEL,...) by vt_cntl_msg (DBG_LEVEL+10,...)
		- added some more ifdefs to new CUDA 5 features
		- added several guards for internal malloc() and free() calls in CUDA related source files
		- revised memory allocation tracing:
			- intercept memory (de)allocation functions by library wrapping (replaces deprecated hook technique from the GNU C library)
			- added support for multi-threaded applications
			- added wrapper functions for memalign, posix_memalign, and valloc
		- revised exec,system,fork tracing
			- retitled to "Child Process Execution Tracing"
			- introduced env. variable VT_EXECTRACE (marked VT_LIBCTRACE as deprecated)
			- added wrapper functions for execvpe, fexecve, waitid, wait3, and wait4
		- changed default function group names for
			- memory (de)allocation functions: "MEM" -> "LIBC-MALLOC"
			- I/O functions: "I/O" -> "LIBC-I/O"
			- child process execution: "LIBC" -> "LIBC-EXEC"
	- plugin counter interface:
		- added check for initialized vt_plugin_cntr_info.info
	- vtdyn:
		- added missing header includes for Dyninst 8
	- vtrun:
		- do not preload the Dyninst Runtime library; it is loaded by Dyninst itself

This commit was SVN r27679.
2012-12-17 14:40:07 +00:00
Jeff Squyres
c5b0bcd9f7 Refs trac:3422
* Add some comments in the *-wrapper-data-txt.in files just so that
   someone doesn't forget in the future why we link in what we do in
   the MPI and ORTE wrapper compilers.
 * Update ompi_wrapper_script.in to match the new behavior.
 * Update orte_wrapper_script.in to support --openmpi:linkall (which
   is a no-op in this case)

This commit was SVN r27672.

The following Trac tickets were found above:
  Ticket 3422 --> https://svn.open-mpi.org/trac/ompi/ticket/3422
2012-12-14 16:34:20 +00:00
Nathan Hjelm
ba5b2b0540 btl/vader: fix bug in single copy code that could cause ob1 sends to not get marked complete.
cmr:v1.7

This commit was SVN r27671.
2012-12-13 23:18:53 +00:00
Jeff Squyres
f779b1ded9 Put back the static-library-detection stuff from r27668, with some
additional functionality.  Rationale (refs trac:3422):

 * Normal MPI applications only ever use the MPI API. Hence, -lmpi is
   sufficient (they'll never directly call ORTE or OPAL
   functions). This is arguably the most common case.
 * That being said, we do have some test programs (e.g., those in
   orte/test/mpi) that call MPI functions but also call ORTE/OPAL
   functions. I've also written the occasional MPI test program that
   calls opal_output, for example (there even might be a few tests in
   the IBM test suite that directly call ORTE/OPAL functions).
   * Even though this is not a common case, these applications should
     also compile/link with mpicc.
   * So we should add a --openmpi:linkall option that will also link
     in whatever is necessary to call ORTE/OPAL functions
   * Yes, we could hard-code "-lopen-rte -lopen-pal" in Makefiles, but
     we do reserve the right to change those library names and/or add
     others someday, so it's better to abstract out the names and let
     the wrapper supply whatever is necessary.
 * ORTE programs, however, are different. They almost always call OPAL
   functions (e.g., if they want to send a message, they must use the
   OPAL DSS). As such, it seems like the ORTE programs should always
   link in OPAL.

Therefore:

 * Add undocumented --openmpi:linkall flag to the wrapper compilers.
   See the comment in opal_wrapper.c for an explanation of what it
   does.  This flag is only intended for Open MPI developers -- not
   end users.  That's why it's undocumented.
 * Update orte/test/mpi/Makefile.am to add --openmpi:linkall
 * Make ortecc/ortec++'s wrapper data text files always explicitly
   link in libopen-pal

This commit was SVN r27670.

The following SVN revision numbers were found above:
  r27668 --> open-mpi/ompi@cf845897aa

The following Trac tickets were found above:
  Ticket 3422 --> https://svn.open-mpi.org/trac/ompi/ticket/3422
2012-12-13 22:31:37 +00:00
Jeff Squyres
18e0c75857 Ensure to set Fortran ierr properly at the end of
MPI_Get_processor_name.  Thanks to LANL for spotting the issue.

This commit was SVN r27669.
2012-12-13 15:03:48 +00:00
Jeff Squyres
cf845897aa Temporarily revert r27662 and r27667 because something wonky is
happening on OS X.  Grumble...

This commit was SVN r27668.

The following SVN revision numbers were found above:
  r27662 --> open-mpi/ompi@97cc916007
  r27667 --> open-mpi/ompi@529f6244ca
2012-12-11 23:08:14 +00:00
Jeff Squyres
97cc916007 Per discussion at the Open MPI developer meeting last week:
 1. Restore libopen-pal.la, libopen-rte.la, and libmpi.la to be
    separate entities (i.e., don't have libopen-rte.la include
    libopen-pal.la, and don't have libmpi.la include libopen-pal.la).
    Yay!
 1. Consequently, make the wrapper compilers look for flags indicating
    that the user wants to compile statically (currently: -static,
    !--static, -Bstatic, and "-Wl," in front of all of those).  If it
    is, follow a 6-way matrix for determining which libraries to
    list on the underlying command line.
 1. To support that, add the name of a token static and dynamic
    library to look for in each of the wrapper compiler data files.
 1. Fix a long-standing typo in the opalcc wrapper data file.

This commit was SVN r27662.
2012-12-11 01:46:59 +00:00
Mike Dubman
a454341e2b add support for mxm 2.0
This commit was SVN r27661.
2012-12-09 22:58:37 +00:00
Nathan Hjelm
3e1b13b13a Re-add support for old flex (2.5.4a and earlier) while still cleaning up properly in new flex.
This commit was SVN r27657.
2012-12-07 00:12:43 +00:00
Brian Barrett
702451111b Remove Portals 3.3 support
This commit was SVN r27656.
2012-12-06 20:11:27 +00:00
Jeff Squyres
c00e6a7abf Remove the OFUD BTL. It doesn't work, and isn't included in 1.7.
An upcoming BTL from Cisco used ofud as a starting point, and should
probably be used as a starting point for any future UD-based BTL.

And this OFUD BTL is obviously still in history if anyone ever wants
to resurrect it.

This commit was SVN r27655.
2012-12-06 17:43:28 +00:00