openmpi/orte/runtime
Ralph Castain fd6a044b7f Cleanup some cruft resulting from the move of the btl's to opal. We had created the ability to delay modex operations, which included a need to delay retrieving hostname info for remote procs. This allowed us to not retrieve the modex info until first message unless required - the hostname is generally only required for debug and error messages.
Properly setup the opal_process_info structure early in the initialization procedure. Define the local hostname right at the beginning of opal_init so all parts of opal can use it. Overlay that during orte_init as the user may choose to remove fqdn and strip prefixes during that time. Setup the job_session_dir and other such info immediately when it becomes available during orte_init.
2014-10-03 16:02:57 -06:00
data_type_support Add a bunch of debug to help track down the problem, and eventually find another place where comparison of signatures was incorrectly performed - use the dss compare operation to be consistent and safe 2014-08-27 19:52:20 +00:00
help-orte-runtime.txt As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: 2013-08-20 18:59:36 +00:00
Makefile.am Use the correct abstraction layer name for the data dirs 2014-05-08 14:32:24 +00:00
orte_cr.c MCA/base: Add new MCA variable system 2013-03-27 21:09:41 +00:00
orte_cr.h Correct several export declarations. 2011-08-15 09:45:51 +00:00
orte_data_server.c As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: 2013-08-22 16:37:40 +00:00
orte_data_server.h Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
orte_finalize.c Per the PMIx RFC: 2014-08-21 18:56:47 +00:00
orte_globals.c Provide a better help message when we are unable to complete a connection due to a firewall. 2014-09-16 16:28:29 +00:00
orte_globals.h [Contribution from Artem - moved it to svn from git for him] 2014-09-15 18:00:46 +00:00
orte_info_support.c version configury: make dist now works 2014-10-02 11:32:54 -07:00
orte_info_support.h Update OMPI frameworks to use the MCA framework system. 2013-03-27 21:17:31 +00:00
orte_init.c Cleanup some cruft resulting from the move of the btl's to opal. We had created the ability to delay modex operations, which included a need to delay retrieving hostname info for remote procs. This allowed us to not retrieve the modex info until first message unless required - the hostname is generally only required for debug and error messages. 2014-10-03 16:02:57 -06:00
orte_locks.c Start reducing our dependency on the event library by removing at least one instance where we use it to redirect the program counter. Rolf reported occasional hangs of mpirun in very specific circumstances after all daemons were done. A review of MTT results indicates this may have been happening more generally in a small fraction of cases. 2010-07-17 21:03:27 +00:00
orte_locks.h Start reducing our dependency on the event library by removing at least one instance where we use it to redirect the program counter. Rolf reported occasional hangs of mpirun in very specific circumstances after all daemons were done. A review of MTT results indicates this may have been happening more generally in a small fraction of cases. 2010-07-17 21:03:27 +00:00
orte_mca_params.c [Contribution from Artem - moved it to svn from git for him] 2014-09-15 18:00:46 +00:00
orte_quit.c Cleanup a set of typos on the orte_get_attribute call 2014-06-03 20:36:38 +00:00
orte_quit.h Sorry for mid-day commit, but I had promised on the call to do this upon my return. 2012-04-06 14:23:13 +00:00
orte_wait.c Remove useless variables. 2014-07-03 00:30:54 +00:00
orte_wait.h Revert r32222, r32210, and r32203 as they created a problem when daemon collectives did not involve app procs on every node. Instead, modify the ompi/mca/rte/orte/rte_orte.h to add a new function that allows apps to request new daemon collective ids for use in barrier and modex operations. This will only appear in ORTE-based installations, but it is only being used by a couple of researchers at the moment. 2014-07-15 03:48:00 +00:00
runtime_internals.h Modify the accounting system to recycle jobids. Properly recover resources from nodes and jobs upon completion. Adjustments in several places were required to deal with sparsely populated job, node, and proc arrays as a result of this change. 2009-03-03 16:39:13 +00:00
runtime.h As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: 2013-08-22 16:37:40 +00:00