4af7a0827f
The fact that application proc called Abort (read failed) doesn't mean that ORTE subsystem has failed - vice versa it does it's work to gracefuly exit the whole application. orted exiting with non-zero status creates a problem for at least plm/slurm environments where orteds are launched via `srun` with "--kill-on-bad-exit" flag. If one of orteds has exited with non- zero status slurm will immediately kill all other orteds. As the result we see a lot of leftover in the `/tmp` directory. Signed-off-by: Artem Polyakov <artpol84@gmail.com> |
||
---|---|---|
.. | ||
Makefile.am | ||
pmix_server_dyn.c | ||
pmix_server_fence.c | ||
pmix_server_gen.c | ||
pmix_server_internal.h | ||
pmix_server_pub.c | ||
pmix_server_register_fns.c | ||
pmix_server.c | ||
pmix_server.h |