Nathan Hjelm
8962ce25b0
fixed some compiler errors caused by seg_key changes. osc/rdma may need to be updated to use btls that use 128 bit segment keys
This commit was SVN r25448.
2011-11-06 20:19:14 +00:00
Samuel Gutierrez
e03bc93fb7
only use pmi grpcomm and pubsub during the direct launch case. use PMI environment variable to setup vpid in ess alps on cray xe systems. add pmi test code.
This commit was SVN r25447.
2011-11-06 17:28:40 +00:00
Ralph Castain
34f0a27cb6
Initialize the locality info - at time of pmap creation, we at least know node locality
This commit was SVN r25446.
2011-11-06 17:06:41 +00:00
Nathan Hjelm
520a7c570e
changes to seg_key needed for a new btl
This commit was SVN r25445.
2011-11-06 16:19:09 +00:00
Ralph Castain
729935dffb
Minor cleanups, mirroring what Jeff did to ompi_info
This commit was SVN r25438.
2011-11-05 00:42:49 +00:00
Jeff Squyres
38451d4972
Add the MPI API version to the ompi_info output. How did we never
have this in there before?
This commit was SVN r25437.
2011-11-04 23:30:59 +00:00
Jeff Squyres
b43deb7091
Update for SVN 1.7.x, which only has a single top-level .svn directory
(no more .svn directories scattered throughout the tree)
This commit was SVN r25435.
2011-11-04 19:52:12 +00:00
Rolf vandeVaart
f777fe8eba
Change tab to spaces.
This commit was SVN r25433.
2011-11-04 17:18:30 +00:00
Jeff Squyres
f08b8bf2d4
Per this thread:
http://www.open-mpi.org/community/lists/devel/2011/10/9878.php
I am making a final decision on what happens when an MCA parameter
is re-registered and changes types. In developer builds (i.e.,
OPAL_ENABLE_DEBUG==1), a show_help message will be displayed. In all
builds, an error status will be returned.
Specifically, the logic looks like this:
{{{
if (detect_re-registration_with_type_change) {
#if OPAL_ENABLE_DEBUG
    opal_show_help(...);
#endif
    return OPAL_ERR_VALUE_OUT_OF_BOUNDS;
}
}}}
If someone would like to change this behavior, they are welcome to do
so. :-) I am committing this so that ''some'' action occurs (rather
than talking about the issue and then nothing happens).
This commit was SVN r25432.
2011-11-04 14:16:49 +00:00
Christopher Yeoh
fb57a74a40
Removes a pointless memmove which, because of a previous memcpy, will
always have identical source and destination pointers. See #2871
Plugs a couple of minor memory leaks related to remote qp info
This commit was SVN r25431.
2011-11-04 00:15:08 +00:00
Christopher Yeoh
7e7701e7fc
Removes misleading debug warning from opal_free when a NULL
pointer is passed to it.
Fixes trac:2884
This commit was SVN r25430.
The following Trac tickets were found above:
Ticket 2884 --> https://svn.open-mpi.org/trac/ompi/ticket/2884
2011-11-03 23:57:26 +00:00
Jeff Squyres
886a9d589b
Custom patch from Brice for the hwloc-1.2.2ompi distro, per an issue
that Chris Yeoh/IBM found. See the thread below for more info:
http://www.open-mpi.org/community/lists/hwloc-devel/2011/11/2521.php
This commit was SVN r25429.
2011-11-03 14:53:22 +00:00
Ralph Castain
fcee46b063
Add an option for printing a diffable process map for testing mappers
This commit was SVN r25428.
2011-11-03 14:22:07 +00:00
Ralph Castain
5f73b874d9
Update ignores
This commit was SVN r25427.
2011-11-03 13:57:09 +00:00
Jeff Squyres
1d6d39d2ea
Missed this free/re-strdup
This commit was SVN r25426.
2011-11-03 11:31:37 +00:00
Jeff Squyres
6139256e45
v may get incremented, so be sure to save the ''original'' strdup'ed
pointer and free ''that'' -- not the (possibly incremented) pointer
This commit was SVN r25425.
2011-11-03 11:23:17 +00:00
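The pointer fix described in r25425 can be illustrated with a minimal sketch. The function name `copy_without_leading_spaces` and the space-skipping loop are hypothetical and are not taken from the Open MPI tree; the point is only that the pointer returned by strdup is kept separately from the working pointer `v`, and that the saved pointer is the one passed to free.

```c
#define _POSIX_C_SOURCE 200809L   /* for strdup on strict ISO C compilers */
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, for illustration only: returns a newly allocated
 * copy of `input` with leading spaces skipped, or NULL if allocation fails.
 * The caller owns the result and must free() it. */
char *copy_without_leading_spaces(const char *input)
{
    assert(input != NULL);

    char *orig = strdup(input);   /* save exactly what strdup returned */
    if (orig == NULL) {
        return NULL;
    }

    char *v = orig;               /* working pointer; may be advanced */
    while (*v == ' ') {
        ++v;
    }

    char *result = strdup(v);     /* NULL here is passed through to the caller */
    free(orig);                   /* free the saved pointer, never the advanced v */
    return result;
}
```

Passing the advanced `v` to free would be undefined behavior whenever the input starts with a space, which is why the original pointer is held in its own variable.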
Mike Dubman
7595a80a63
fix self pid
This commit was SVN r25424.
2011-11-03 06:46:20 +00:00
Samuel Gutierrez
3fe7b3ee54
add PMI support to ess alps module. xt system guys: please yell at me if i missed something in cnos.
This commit was SVN r25423.
2011-11-03 04:04:32 +00:00
Samuel Gutierrez
27b9bcfafd
update ess alps configuration file to include CNOS and PMI checks. some of the features committed here aren't being used, but they will be. also update orte_check_pmi.m4 to include missing call to action-if-not-found if --with-pmi is not specified or is disabled.
This commit was SVN r25422.
2011-11-03 02:14:47 +00:00
Jeff Squyres
7f6f7bd0eb
Remove this component; twitter long ago switched to the oauth
authentication, and no one has ever updated this component to match.
It can be revived out of history if anyone cares.
This commit was SVN r25421.
2011-11-02 21:04:49 +00:00
Ralph Castain
891027c10d
Cleanup error reports
This commit was SVN r25420.
2011-11-02 18:34:19 +00:00
Ralph Castain
b2e2d24726
As in the rsh module, report failed daemons to the errmgr for proper cleanup
This commit was SVN r25419.
2011-11-02 18:30:22 +00:00
Ralph Castain
3e4165fd8d
Cleanup includes
This commit was SVN r25418.
2011-11-02 18:28:28 +00:00
Ralph Castain
1bfc2bb424
Minor cleanup
This commit was SVN r25417.
2011-11-02 18:24:19 +00:00
Ralph Castain
b77552c45d
Cleanup some include files, return a silent error in open/select as the complaining component already output a message
This commit was SVN r25416.
2011-11-02 17:42:06 +00:00
Ralph Castain
198e001554
Add another test
This commit was SVN r25415.
2011-11-02 15:59:16 +00:00
Ralph Castain
55b996678e
Minor indentation changes
This commit was SVN r25414.
2011-11-02 15:56:56 +00:00
Nathan Hjelm
211e2dbdf3
clean up tab characters
This commit was SVN r25413.
2011-11-02 15:07:57 +00:00
Ralph Castain
f00753881e
Handle the case where mpirun -is- of the same topology as the compute nodes.
This commit was SVN r25412.
2011-11-01 22:26:03 +00:00
Jeff Squyres
12d4280d0b
Fix a bunch of memory leaks
This commit was SVN r25411.
2011-11-01 20:22:49 +00:00
Jeff Squyres
4fe26b0392
Fix some minor memory leaks
This commit was SVN r25410.
2011-11-01 20:22:26 +00:00
Jeff Squyres
308b88e1a7
Make use of the opal_dss_initialized variable for finalize protection
This commit was SVN r25409.
2011-11-01 20:22:12 +00:00
Ralph Castain
d28dd55d33
Minimize the amount of topology info returned by the daemons. Most clusters, especially at scale, use the same node topology on every node, so there is no reason to return the topology from every daemon. Borrow a page from --hetero-apps and let users indicate that the node topology differs by adding a --hetero-nodes option to mpirun. If the option is set, then every daemon returns topology info. If not set, then only daemon vpid=1 returns it.
We always want one daemon to return the topology as the head node is often different from the compute nodes. Having one daemon return the compute node topology allows us to detect any such difference. All compute nodes are then set to the same topology.
This commit was SVN r25408.
2011-11-01 18:43:10 +00:00
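The reporting rule from r25408 reduces to a small predicate, sketched here under stated assumptions. The function name `daemon_reports_topology` and its parameters are hypothetical and do not come from the ORTE source; the sketch also assumes that daemons are numbered from vpid 1, with 0 reserved for mpirun itself.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical predicate, for illustration only: should the daemon with
 * this vpid send its node topology back to mpirun?
 * Assumption: daemon vpids start at 1 (0 would be mpirun itself). */
bool daemon_reports_topology(uint32_t vpid, bool hetero_nodes)
{
    assert(vpid >= 1);

    if (hetero_nodes) {
        return true;    /* --hetero-nodes given: every daemon reports */
    }
    return vpid == 1;   /* otherwise only daemon vpid=1 reports */
}
```

In the homogeneous case the single report from vpid=1 would stand in for all compute nodes, which is what lets mpirun notice a head node that differs from them.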
Ralph Castain
14966e0f8f
Cleanup PMI startup - if a component isn't selected, it should finalize PMI IFF it started it. Otherwise, components that aren't selected can finalize PMI when it is in use by other parts of the system.
This commit was SVN r25407.
2011-11-01 16:25:12 +00:00
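The ownership rule from r25407 (finalize PMI if and only if this component started it) can be sketched as follows. Everything here is a stand-in: `pmi_running`, `component_t`, and the three functions are hypothetical names, and the flag assignments mark where real code would initialize or finalize the PMI library.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Stand-in for process-wide PMI state; hypothetical, for illustration only. */
static bool pmi_running = false;

typedef struct {
    bool started_pmi;   /* did *this* component initialize PMI? */
} component_t;

/* Open a component: initialize PMI only if nobody has done so yet,
 * and remember whether this component was the one that did. */
void component_open(component_t *c)
{
    assert(c != NULL);
    c->started_pmi = false;
    if (!pmi_running) {
        pmi_running = true;      /* real code would initialize PMI here */
        c->started_pmi = true;
    }
}

/* Close a component that was not selected: finalize PMI if and only if
 * this component started it. */
void component_close_unselected(component_t *c)
{
    assert(c != NULL);
    if (c->started_pmi) {
        pmi_running = false;     /* real code would finalize PMI here */
        c->started_pmi = false;
    }
}

bool pmi_is_running(void)
{
    return pmi_running;
}
```

With two components opened in sequence, closing the second one leaves PMI running because it never started it; only the component that performed the initialization tears it down.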
Mike Dubman
3edd77ea25
update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag
This commit was SVN r25403.
2011-10-31 22:36:15 +00:00
Ralph Castain
4368199c86
Missing include
This commit was SVN r25402.
2011-10-31 13:39:57 +00:00
Mike Dubman
6b50ba22a6
select mxm ptl based on user preferences
This commit was SVN r25401.
2011-10-31 10:17:43 +00:00
Ralph Castain
96332a2859
Fix typo
This commit was SVN r25400.
2011-10-30 13:23:42 +00:00
Ralph Castain
71ed8e3cd3
Bring back the local node's binding capabilities along with its topology. Clean up indentation.
This commit was SVN r25399.
2011-10-30 13:20:16 +00:00
Ralph Castain
d492b20975
Bozo check for topology info
This commit was SVN r25398.
2011-10-30 11:49:38 +00:00
Ralph Castain
4232115a98
Ensure pruning remains within the current job/app being mapped.
This commit was SVN r25397.
2011-10-30 00:02:20 +00:00
Ralph Castain
648c85b41b
Add a simple pattern mapper as an example of how to use the topology info to create desired mappings. Let the user specify a pattern based on resource types, and map that pattern across all available nodes as resources permit.
Don't automatically display the topology for each node when --display-devel-map is set as it can overwhelm the reader. Use a separate flag --display-topo to get it.
This commit was SVN r25396.
2011-10-29 15:12:45 +00:00
Ralph Castain
12a589130a
Add some debug
This commit was SVN r25395.
2011-10-29 15:07:58 +00:00
Ralph Castain
965b04d1a5
Use the new utilities to get a topology that reflects available cpus
This commit was SVN r25394.
2011-10-29 15:07:36 +00:00
Ralph Castain
7ba4675adf
Bring over some useful utilities and definitions for working with hwloc inside ORTE/OMPI. Cache frequently computed info to save processing time when handling multiple nodes with the same topology. Deal with available cpus as defined by online vs allowed vs user-specified limits. Help deal with hwloc's unfortunate decision to lump all caches in the same object type.
This commit was SVN r25393.
2011-10-29 14:58:58 +00:00
Jeff Squyres
6092b50ebb
Fix the cases where the default values of MCA params were not always
handled properly when MCA parameters are re-registered and their types
change. Specifically, this case was broken:
1. Register an int MCA param with a non-zero default value
1. Re-register the same MCA param as a string with a NULL default value
The 2nd step would cause a segv because the first int default value
wasn't being reset properly. Here's sample code that shows the issue:
{{{
{
    int ibogus;
    char *sbogus;
    opal_init(&argc, &argv);
    mca_base_param_reg_int_name("type", "name", "help", false, false, 3, &ibogus);
    printf("Ibogus: %d\n", ibogus);
    mca_base_param_reg_string_name("type", "name", "help", false, false, NULL, &sbogus);
    printf("Sbogus: %s\n", (NULL == sbogus) ? "NULL" : sbogus);
    exit(0);
}
}}}
This commit fixes the problem from the sample code above as well as
a similar issue for file-set MCA params and override values. It
also resets default values for MCA params initially registered as a
string but then re-registered as an int.
This commit was SVN r25392.
2011-10-29 12:29:31 +00:00
Jeff Squyres
b0bd0b3924
Don't use slashes ('/') in the date field because they'll confuse
other sed scripts in a build of that tarball (e.g., when substituting
the date into man pages).
This commit was SVN r25389.
2011-10-29 11:50:08 +00:00
Ralph Castain
e50bcbf028
Add the ability to specify a topology-containing xml file to describe the simulated nodes to support mapping tests against arbitrary topologies
This commit was SVN r25388.
2011-10-29 02:01:11 +00:00
Ralph Castain
21d45b0807
Just some cleanup in case of error
This commit was SVN r25387.
2011-10-29 01:55:19 +00:00
Ralph Castain
7fa5f82d70
Add simulator component to support testing of large scale mapping methods. Automatically sets do-not-resolve and do-not-launch, and creates however many nodes the user wants to simulate in the system.
This commit was SVN r25386.
2011-10-28 23:48:53 +00:00