Split the finalize process into two parts: one that finalizes the orte subsystems, and another that finalizes (what will become) the opal subsystems. Needed to properly restart the workstation process once remote launch accomplished.
This commit was SVN r5758.
launching new processes so that they get all the default unix
behaviors (i.e., become killable, and don't accidentally have some
signals blocked -- left over from the event library).
This commit was SVN r5757.
Remote launch of daemon now working. Bunch of forced diagnostic messages in it, though, which I'll leave until I release this for actual use.
This commit was SVN r5750.
Orted has been modified to take a new parameter - a file descriptor used as a pipe to pass the daemon's contact info back to the probe when the daemon is remotely launched.
This commit was SVN r5748.
You will now receive a message indicating that an existing universe was detected, but connection to it was refused. The system will tell you the name it created for the new universe it will now be using.
This commit was SVN r5747.
(mostly UB/LB related and doing questionable things). Otherwise a lot of changes:
- cleaner code + more comments
- stronger bound checker (DDT error messages if we exceed the buffer capacity)
- differentiate the 3 internal types: loop, basic element and end_loop (using union)
- more MACROS to solve the repetitive problems
- more output in debug mode (and if requested by the user).
- correct some mismaching between usage of true_extent and extent
- improve the special cases (contiguous data, contiguous with gaps, no conversion ...)
- in order to allow gdb to break in the pack/unpack function they became real function (defined
in dt_pack.c and dt_unpack.c) if OMPI_ENABLE_DEBUG is defined. Otherwise they are just macros.
- a new approach for the computation of the lower bound and upper bound. In same time the true_lb and
true_ub has been modified to match the new algorithm.
- handle specific cases in the datatype optimization. In some cases the datatype definition increase,
but the complexity decrease.
This commit was SVN r5729.
is int (mostly 4 bytes) and strdup normally return a char* (usually 8 bytes). The result: stack corrupted
and other weird things ...
This commit was SVN r5725.
Need to do some refining of the component, but it meets basic requirements right now. Nobody else should notice any change - system basically ignores it unless you tell it to do something.
This commit was SVN r5723.
Added a special case under the win_makefile for the gpr/replica directory
since it contains multiple dependant layers of directories.
Added a couple of OMPI_DECLSPECs. Change a conflicting variable name in
gpr_replica_dict_tl.c from 'new' to 'new_dict'.
This commit was SVN r5712.
Fixes for orterun in handling different MCA params for different
processes (reviewed by Brian):
- By design, if you run the following:
mpirun --mca foo aaa --mca foo bbb a.out
a.out will get a single MCA param for foo with value "aaa,bbb".
- However, if you specify multiple apps with different values for the
same MCA param, you should expect to get the different values for
each app. For example:
mpirun --mca foo aaa a.out : --mca foo bbb b.out
Should yield a.out with a "foo" param with value "aaa" and b.out
with a "foo" param with a value "bbb".
- This did not work -- both a.out and b.out would get a "foo" with
"aaa,bbb".
- This commit fixes this behavior -- now a.out will get aaa and b.out
will get bbb.
- Additionally, if you mix --mca and and app file, you can have
"global" params and per-line-in-the-appfile params. For example:
mpirun --mca foo zzzz --app appfile
where "appfile" contains:
-np 1 --mca bar aaa a.out
-np 1 --mca bar bbb b.out
In this case, a.out will get foo=zzzz and bar=aaa, and b.out will
get foo=zzzz and bar=bbb.
Spiffy.
Ok, fortran build is done... back to Fortran... sigh...
This commit was SVN r5710.
* start refactoring duplicate code into inline functions (probably will
have to become macros, but not until debugging is done)
* general code cleanup
This commit was SVN r5706.