..
help-orte-runtime.txt
Complete modifications for failed-to-start of applications. Modifications for failed-to-start of orteds coming next.
2007-04-24 20:53:54 +00:00
Makefile.am
Add a new function to wake orterun up - used in failed-to-start scenarios, but can be used anytime a lower level needs to ensure orterun wakes up
2007-04-23 12:49:25 +00:00
orte_abort.c
Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD).
2007-03-16 23:11:45 +00:00
orte_cr.c
OS X apparently really doesn't like shared libraries with unresolvable
2007-06-05 03:03:59 +00:00
orte_cr.h
Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD).
2007-03-16 23:11:45 +00:00
orte_finalize.c
* Before this commit, if we called ompi_mpi_abort() before MPI_INIT
2007-01-29 22:01:28 +00:00
orte_init_stage1.c
Bring in an updated launch system for the orteds. This commit restores the ability to execute singletons and singleton comm_spawn, both in single node and multi-node environments.
2007-07-12 19:53:18 +00:00
orte_init_stage2.c
Bring in an updated launch system for the orteds. This commit restores the ability to execute singletons and singleton comm_spawn, both in single node and multi-node environments.
2007-07-12 19:53:18 +00:00
orte_init.c
per conversation with Ralph and Jeff take out the opal_init_only logic.
2007-05-24 21:54:58 +00:00
orte_monitor.c
Bring in the code for routing xcast stage gate messages via the local orteds. This code is inactive unless you specifically request it via an mca param oob_xcast_mode (can be set to "linear" or "direct"). Direct mode is the old standard method where we send messages directly to each MPI process. Linear mode sends the xcast message via the orteds, with the HNP sending the message to each orted directly.
2007-04-23 18:41:04 +00:00
orte_params.c
Fix a problem observed by Brian where processes launched local to mpirun lost their environment except for MCA params.
2007-07-13 15:47:57 +00:00
orte_restart.c
Bring over the update to terminate orteds that are generated by a dynamic spawn such as comm_spawn. This introduces the concept of a job "family" - i.e., jobs that have a parent/child relationship. Comm_spawn'ed jobs have a parent (the one that spawned them). We track that relationship throughout the lineage - i.e., if a comm_spawned job in turn calls comm_spawn, then it has a parent (the one that spawned it) and a "root" job (the original job that started things).
2006-11-14 19:34:59 +00:00
orte_setup_hnp.c
Fix a problem introduced by r15390 which was causing strange failures in numerous areas.
2007-07-13 17:52:16 +00:00
orte_setup_hnp.h
And ORTE is ready for prime-time. All Windows tricks are in:
2006-08-23 03:32:36 +00:00
orte_system_finalize.c
per conversation with Ralph and Jeff take out the opal_init_only logic.
2007-05-24 21:54:58 +00:00
orte_system_init.c
Bring in an updated launch system for the orteds. This commit restores the ability to execute singletons and singleton comm_spawn, both in single node and multi-node environments.
2007-07-12 19:53:18 +00:00
orte_universe_exists.c
Bring in the generalized xcast communication system along with the correspondingly revised orted launch. I will send a message out to developers explaining the basic changes. In brief:
2007-06-12 13:28:54 +00:00
orte_wait.c
Update the Windows wait functions. The most important change is for
2007-06-14 04:35:46 +00:00
orte_wait.h
And ORTE is ready for prime-time. All Windows tricks are in:
2006-08-23 03:32:36 +00:00
orte_wakeup.c
Fix mis-declared variable type
2007-06-11 16:48:35 +00:00
orte_wakeup.h
Add a new function to wake orterun up - used in failed-to-start scenarios, but can be used anytime a lower level needs to ensure orterun wakes up
2007-04-23 12:49:25 +00:00
params.h
Fix a problem observed by Brian where processes launched local to mpirun lost their environment except for MCA params.
2007-07-13 15:47:57 +00:00
runtime_internal.h
Next step in the project split, mainly source code re-arranging
2006-02-12 01:33:29 +00:00
runtime_types.h
Update the copyright notices for IU and UTK.
2005-11-05 19:57:48 +00:00
runtime.h
Commit the orted-failed-to-start code. This correctly causes the system to detect the failure of an orted to start and allows the system to terminate all procs/orteds that *did* start.
2007-05-21 18:31:28 +00:00