Ralph Castain
61b09a132b
Fix bynode mapping of multiple app-contexts
...
This commit was SVN r26949.
2012-08-03 21:45:40 +00:00
Ralph Castain
96f6f94c24
Ensure we don't get trapped in an infinite loop when ranking bynode if something isn't right
...
This commit was SVN r26948.
2012-08-03 21:45:10 +00:00
Ralph Castain
0d878937fe
If a callback is set in the state machine, and the state doesn't yet exist, create it
...
This commit was SVN r26947.
2012-08-03 21:43:36 +00:00
Ralph Castain
431d5361ed
For those who really preferred our prior mode of operation that mapped procs and only launched daemons on the nodes that had procs on them, introduce the "novm" state machine component. This recreates the old mode of operation by re-ordering the launch sequence so that we allocate, then map, and then launch daemons only on the reqd nodes (instead of across the entire allocation).
...
This commit was SVN r26946.
2012-08-03 16:30:05 +00:00
Ralph Castain
dc22ea5cde
A little cleaner on the message about repeated ctrl-c, and re-enable the event so we can abort if we see multiple ctrl-c's that don't meet the time requirement
...
This commit was SVN r26945.
2012-08-03 01:26:18 +00:00
Ralph Castain
89f3754559
Bring in the rest of MPI-3 Ticket #125 changes, including changes to mpi.h, function param lists, and associated documentation.
...
This commit was SVN r26944.
2012-08-03 01:09:59 +00:00
Ralph Castain
e6c72bfd53
Ensure we can forcibly exit even when we are stuck inside of an event by replacing the libevent signal handler with a POSIX one that (a) attempts to trip a libevent termination event and (b) if anothe ctrl-c hits within 5 seconds, just calls exit.
...
This commit was SVN r26943.
2012-08-02 21:15:35 +00:00
Ralph Castain
bf701f6c55
Make PMPI match MPI defs
...
This commit was SVN r26942.
2012-08-02 16:58:04 +00:00
Ralph Castain
8b595ddf84
Complete changes specified in MPI-3 Ticket #125 . Still holding on proposed change to MPI_Init as it remains under discussion (and is just plain uuuugly). No action taken on changing string arrays as that part of #125 was not included in the final pdf text.
...
This commit was SVN r26939.
2012-08-02 16:31:02 +00:00
Jeff Squyres
91ccba9643
Minor enhancements to the hwloc base:
...
* NULL's out the hwloc_obj_t->userdata in
hwloc_base_util.c:free_object() and
hwloc_base_util.c:opal_hwloc_base_free_topology() after it has been
OBJ_RELEASE'd.
* Adds a userdata field to opal_hwloc_topo_data_t. This field will
be used in an upcoming rmaps component ("lama") to cache some
associated data during hardware tree traversals.
This commit was SVN r26938.
2012-08-02 16:29:44 +00:00
Ralph Castain
d818c9d407
Includes a patch from Jeff and Josh: update the simulator module to allow specification of multiple slot and max_slot counts for each node group (but don't require it). Remove the requirement that each node group provide its own topology. Adjust verbosities to allow showing some light debug output to see what nodes have been added without getting a bunch of other stuff.
...
This commit was SVN r26936.
2012-08-02 04:57:13 +00:00
Jeff Squyres
62c2ff7ee7
It's actually ''not'' an error to exit if all routes and children are
...
gone. So exit with 0, not ORTE_ERROR_DEFAULT_EXIT_CODE (which is 1).
This fixes a race condition in the rsh launcher upon termination,
where ORTE would sometimes think that a daemon failed to launch.
This commit was SVN r26935.
2012-08-01 19:49:19 +00:00
Ralph Castain
05845214b8
Add missing include file
...
This commit was SVN r26934.
2012-08-01 04:08:47 +00:00
Nathan Hjelm
4557e15c18
oob/ud fix compile error
...
This commit was SVN r26933.
2012-07-31 21:50:34 +00:00
Ralph Castain
6ee35e4977
Add num_local_peers to orte_process_info so we don't keep re-computing it, ensure it is available for direct launch via pmi as well
...
This commit was SVN r26931.
2012-07-31 21:21:50 +00:00
Jeff Squyres
88cbe9c780
.ompi_ignore this component until it can be fixed.
...
This commit was SVN r26930.
2012-07-31 21:02:06 +00:00
Nathan Hjelm
980692804d
oob/ud: don't start listening for ud requests unless we have one usable port
...
This commit was SVN r26929.
2012-07-31 19:00:18 +00:00
Jeff Squyres
92adbc5d72
Fix some problems with the Fortran wrapper compiler flags,
...
particularly with respect to threading flags.
Before this change, the following scenario would fail (e.g., on Linux
with pthreads):
{{{
$ ./configure --disable-shared --enable-static ...
$ make clean install
$ cd examples
$ make clean all
}}}
Linking the Fortran examples would fail with missing pthread symbols.
This commit was SVN r26927.
2012-07-31 18:12:24 +00:00
Matthias Jurenz
79a065290c
Changes to OTF:
...
- general:
- incremented version number to 1.11.2openmpi
- lib OTFAUX:
- speed-up messages matching, if no snapshots should be generated
Changes to VT:
- general:
- incremented version number to 5.13.1openmpi
- compiler wrappers:
- vtnvcc:
- add path to cuda.h to the PDT parser command
- exclude *.cu source files from instrumenting with PDT/TAU; the PDT parser is not (yet) able to handle CUDA statements and kernels
- vtunify:
- fixed timestamp boundary check for merging asynchronous plugin counters (i.e. async. timestamp must >= process' start timestamp)
- fixed timestamp conversion from local to global
- print percentage of message matching bumps (unmatched or reversed messages)
- inlined key-value list "record handler"
- minor optimizations in hook for merging async. events:
- search async. source manager only once per stream
- don't call writeRecHook_Event() if no async. source key is defined
This commit was SVN r26925.
2012-07-31 08:45:37 +00:00
Vasily Filipov
c386847d9a
MTL MXM: Adding MXM version protect for Mprobe, Mrecv resources.
...
This commit was SVN r26922.
2012-07-31 07:57:25 +00:00
Vishwanath Venkatesan
dccfd18481
1. Removing two-phase support functions
...
2. Moving nbc headers to a seperate header file and modifying
io_ompio_nbc.c accordingly.
This commit was SVN r26921.
2012-07-31 04:39:13 +00:00
Vishwanath Venkatesan
539571171b
Moving support functions of two-phase to the two_phase component.
...
This commit was SVN r26920.
2012-07-31 04:37:04 +00:00
Edgar Gabriel
fb64322dc3
this code section was supposed to be commented out...
...
This commit was SVN r26918.
2012-07-30 20:46:07 +00:00
Edgar Gabriel
1078f13ad2
set status->_ucount correctly for collective read and write operations in the module
...
This commit was SVN r26916.
2012-07-30 20:14:36 +00:00
Edgar Gabriel
91c8577d9d
fix in the offset calculation for explicit offset operations.
...
This commit was SVN r26915.
2012-07-30 20:08:00 +00:00
Ralph Castain
23c2a315a9
Add missing line to set flag indicating at least one port found
...
This commit was SVN r26914.
2012-07-30 17:54:38 +00:00
Edgar Gabriel
81a050add9
simplify the individual fcoll module by just calling the
...
mca_io_ompio_file_read/write functions directly. Avoid replicating the code in
both places.
This commit was SVN r26909.
2012-07-30 15:44:22 +00:00
Edgar Gabriel
66c5a80dfd
- get rid of a warning about an unused variable
...
- return MPI_ERR_OTHER instead of MPI_SUCCESS for the functions that are not
yet implemented
- add another field to the mca_io_ompio_file_t structure to point back to the
ompi_file_t structure.
This commit was SVN r26908.
2012-07-30 15:29:59 +00:00
Ralph Castain
6285f7d8c0
Per request of Shiqing, restore the ccp components
...
This commit was SVN r26904.
2012-07-29 23:49:59 +00:00
Ralph Castain
c7f9a0fa34
Check for recursive use of mpirun - issue error message and abort if detected
...
This commit was SVN r26903.
2012-07-28 21:50:56 +00:00
Ralph Castain
94d11e04fd
Add an intermediate state when the VM is ready so that third party tools can take action prior to mapping/launching apps
...
This commit was SVN r26902.
2012-07-28 15:33:09 +00:00
Jeff Squyres
3273e9c658
Added 1.6.1 bullets.
...
This commit was SVN r26901.
2012-07-28 02:21:21 +00:00
Ralph Castain
b2aca614a8
fix typo
...
This commit was SVN r26899.
2012-07-28 01:23:14 +00:00
Jeff Squyres
9f8265eccb
The files for automake to generate are specified via AC_CONFIG_FILES
...
in the */configure.m4 files. configure.params files are obsolete.
This commit was SVN r26897.
2012-07-27 14:33:17 +00:00
Shiqing Fan
660188307c
fix an export declaration name
...
This commit was SVN r26895.
2012-07-27 13:26:24 +00:00
Shiqing Fan
f27251ca13
Include the newly added example VS project file into the tarball.
...
This commit was SVN r26894.
2012-07-27 11:53:15 +00:00
Shiqing Fan
42dfbc7d2f
Another CMake scripts update for:
...
correctly generate hwloc library
automatically define OMPI/OPAL/ORTE_OMPORTS for user applications
update the f77 bindings
This commit was SVN r26893.
2012-07-27 11:49:09 +00:00
Jeff Squyres
46591b0b1a
Clarify a configure warning: we're ''not'' adding to DYLD_LIBRARY_PATH.
...
This commit was SVN r26880.
2012-07-26 21:47:00 +00:00
Jeff Squyres
ce85596bc9
Also show the memcpy framework (if there are any components, which
...
there probably won't be, but...).
This commit was SVN r26879.
2012-07-26 21:28:41 +00:00
Yevgeny Kliteynik
a6458da4ba
Using 8K as a minimal CQ length
...
- For now we'll use 8192 as a base value
- We leave the adjust_cq() as is
- For the long term we can work on an appropriate setting to expose through the INI file.
8K CQEs are 512K per process, which is 8MB for ppn=16
This commit was SVN r26877.
2012-07-26 21:06:18 +00:00
Nathan Hjelm
8736953c7f
btl/openib/connect improve the help message printed when a queue pair can not be created
...
This commit was SVN r26876.
2012-07-26 20:36:46 +00:00
Ralph Castain
8bc6694a62
Ensure the daemons don't incorrectly declare a failed launch
...
This commit was SVN r26875.
2012-07-26 19:05:06 +00:00
Jeff Squyres
a8a5f26bc2
Fix typo in comment.
...
This commit was SVN r26874.
2012-07-26 18:09:33 +00:00
Nathan Hjelm
3c6e2a2a48
Per RFC enable the use of the source directive (relative path) in platform files. The priority of configure options vs platform file options remains the same.
...
This commit was SVN r26873.
2012-07-26 16:19:23 +00:00
Shiqing Fan
204fbfe4b1
update the wv btl component.
...
This commit was SVN r26872.
2012-07-26 15:35:01 +00:00
George Bosilca
3a8478827b
Fix the MPI_Cancel issue identified by Fujitsu. And a typo.
...
This commit was SVN r26871.
2012-07-26 14:06:24 +00:00
Vasily Filipov
4e66ff030b
MTL MXM Mrecv: adding missed return message to a free list.
...
This commit was SVN r26870.
2012-07-26 11:22:22 +00:00
Nathan Hjelm
fe0e7f81df
btl/openib: as discussed remove the per-peer queue pair from the default configuration
...
This commit was SVN r26869.
2012-07-25 22:53:58 +00:00
Ralph Castain
07846f12ae
Reconnect the rsh/ssh error reporting code for remote spawns to report failure to launch. Ensure the HNP correctly reports non-zero exit status when ssh encounters a problem.
...
Thanks to Terry for spotting it!
This commit was SVN r26868.
2012-07-25 21:46:45 +00:00
Jeff Squyres
5ec6a65a72
After I spent a while looking in libibverbs for
...
ibv_get_device_list_compat() and not finding it, I finally realized
that it was a function in OMPI. So let's name it with a proper ompi_
prefix, not an ibv_ prefix.
This commit was SVN r26867.
2012-07-25 16:32:51 +00:00