Howard Pritchard
dc98b62070
add initial support for an alps odls component
...
It turns out that the support for Open MPI apps on
Cray was hanging on a thin thread of support when
using the mpirun job launcher. It just happened that
with a certain set of configuration options things would
work. This is bound to backfire at some point.
To fix this weakness, as well as to allow for mpirun launched
jobs to benefit from many of the advanced placement features
provided by the Cray Linux Environment (as opposed to the hwloc
only default env of orte), a new odls alps component is introduced.
2014-11-17 14:00:09 -07:00
Todd Kordenbrock
5ac1a94f3f
Merge pull request #274 from jsquyres/topic/delete-opal-check-portals.m4
...
opal_check_portals.m4: remove this unused file
2014-11-17 10:12:40 -06:00
Jeff Squyres
225c684ccc
opal_check_portals.m4: remove this unused file
...
It looks like OPAL_CHECK_PORTALS is not used anywhere; it looks like
it has been completely replaced by opal_check_portals4.m4
(OPAL_CHECK_PORTALS4).
2014-11-16 12:44:14 -08:00
Nathan Hjelm
d495d49b1c
Merge pull request #273 from open-mpi/topic/yoda_rdma_flags
...
OSHMEM: spml yoda: use flags to check if btl is RDMA capable
2014-11-16 12:04:04 -07:00
Alex Mikheev
fbb9dc5b1e
OSHMEM: spml ikrit valgrind fix
...
always initialize request flags
2014-11-16 17:24:16 +02:00
Alex Mikheev
3443c1d5e5
OSHMEM: spml yoda: use flags to check if btl is RDMA capable
2014-11-16 17:20:20 +02:00
Ralph Castain
d9ceb5aea4
Fix C++ builds by removing no-longer-needed type declaration
2014-11-14 11:44:24 -08:00
Gilles Gouaillardet
f3b36fdf6e
orted/pmix: fix pmix_server_release when several jobids are running on the same node
2014-11-14 16:17:28 +09:00
Gilles Gouaillardet
84b21d726e
orte/util: add OPAL_{VPID,JOBID} types to orte_attr_{load,unload}
2014-11-14 15:55:25 +09:00
Gilles Gouaillardet
4de7021acc
configury: fix a typo
2014-11-14 13:39:41 +09:00
Gilles Gouaillardet
2177f9ec3e
fix missing copyright, no code change
2014-11-13 14:56:09 +09:00
Gilles Gouaillardet
cd6e3ecb07
oshmem/yoda: fix a typo in mca_spml_yoda_get_completion
2014-11-13 14:53:32 +09:00
Andrew Friedley
b97cda7fd9
PSM MTL: Don't connect procs already connected
...
PSM has issues when trying calling psm_ep_connect() more than once for a
specific peer. Use the psm_ep_connect mask argument to avoid connecting
to processes that are already connected.
OMPI ticket #268 .
2014-11-12 15:52:02 -08:00
Howard Pritchard
a632b632ca
better way to tell if a process is in a Cray PAGG
...
Use a more reliable way to tell if a process is
1) in a Cray PAGG
2) is actually considered an application process on
a compute node (not for example, a process in a PAGG
on a mom node).
2014-11-12 12:56:15 -07:00
Howard Pritchard
72bb4a2eee
make cray pmi compile again
...
Commit @80f07b65 resulted in changes that
caused cray pmi component to no longer compile.
This commit fixes that issue.
2014-11-12 12:33:30 -07:00
Nathan Hjelm
cfbb9cba16
btl/vader: don't assume the address in the put/get segment is unmodified when
...
using knem
It is valid to modify the remote segment that will be used with the
btl put/get operations as long as the resulting address range falls in
the originally prepared segment. Vader should have been calculating the
offset of the remote address in the registered region. This commit
fixes this issue.
2014-11-12 10:12:52 -07:00
rhc54
1fdb6a62d3
Merge pull request #265 from miked-mellanox/topic/undeprecate_env_x
...
ORTE: undeprecate -x var=val in mpirun
Looks okay to me - thanks!
2014-11-12 08:46:09 -08:00
rhc54
87fa1061d4
Merge pull request #267 from artpol84/s2_fix
...
Fix SLURM PMI2 component. set s2_nrank to the relative position of a pro...
Good catch!
2014-11-12 08:43:03 -08:00
Jeff Squyres
f39b294afe
mca base: fix trivial typos in help message
2014-11-12 08:40:17 -08:00
Artem Polyakov
fce08a3db3
Fix SLURM PMI2 component. set s2_nrank to the relative position of a process inside the node
...
(not relative position of a node inside the allocation).
2014-11-12 16:26:35 +06:00
Gilles Gouaillardet
b088175705
btl/vader: fix a typo in mca_btl_vader_put_knem
2014-11-12 19:00:00 +09:00
Mike Dubman
f83d6045aa
ORTE: undeprecate -x var=val in mpirun
...
mpirun -x var=val is back, actually it is useful alias for -mca mca_base_env_list "var=val"
2014-11-12 10:51:15 +02:00
Ralph Castain
780c93ee57
Per the PR and discussion on today's telecon, extend the process name definition as a two-field struct of uint32_t's down to the OPAL layer. This resolves issues created by prior commits that impacted both heterogeneous and SPARC support. This also simplifies the OMPI code base by removing the need for frequent memcpy's when transitioning between the OMPI/ORTE layers and OPAL.
...
We recognize that this means other users of OPAL will need to "wrap" the opal_process_name_t if they desire to abstract it in some fashion. This is regrettable, and we are looking at possible alternatives that might mitigate that requirement. Meantime, however, we have to put the needs of the OMPI community first, and are taking this step to restore hetero and SPARC support.
2014-11-11 17:00:42 -08:00
Gilles Gouaillardet
40942c62ce
dstore: remove unused variables
2014-11-11 18:14:59 +09:00
Gilles Gouaillardet
43af1e27ca
Merge pull request #245 from ggouaillardet/poc/pmi_slurm
...
pmix: correctly split pmi messages
2014-11-11 03:18:40 -05:00
Gilles Gouaillardet
80f07b65f1
pmix: correctly split pmi messages
...
Thanks to @elenash for all the reviews
2014-11-11 17:16:00 +09:00
Ralph Castain
d0704ef118
Restore handling of physical processors in rankfiles. Note that the prior implementation was likely incorrect as it falsely assumed that physical core indices were unique, which isn't always true. Stipulate that physical rankfiles can only include PU numbers, and bind the result to the core that contains that physical PU. Update the mpirun man page to cover the new use-case.
2014-11-10 14:00:40 -08:00
Ralph Castain
2a90788724
Support physical processor ids in rankfile
2014-11-10 14:00:40 -08:00
Howard Pritchard
6c8c9cb4a3
another fix for --enable-dlopen for ugni btl
...
missed a change to create libmca_common_ugni.la
file correctly.
2014-11-10 13:40:59 -07:00
Jeff Squyres
a904a2deeb
OpenMPI.3in: remove trailing blank lines; no content changes
2014-11-10 08:38:24 -08:00
Jeff Squyres
31e79f469a
NEWS: whitespace cleanup; no content changes
2014-11-10 08:38:24 -08:00
Jeff Squyres
5271ed9ec6
NEWS: sync with 1.8.4 bullets
2014-11-10 08:38:24 -08:00
Alex Mikheev
097b469f61
OSHMEM: sshmem verbs: fix shared_mr detection
...
It seems that 5ce2f10067
changed default flag values but it did not modify detection code.
2014-11-10 13:34:04 +02:00
Gilles Gouaillardet
d2d7f39a4b
btl/vader: use FRAG_ALLOC_USER when single_copy_mechanism is VADER_NONE
2014-11-10 17:02:45 +09:00
Gilles Gouaillardet
df6115aac4
topo/base: fix uninitialized variable
...
this commit fixes a bug introduced by commit open-mpi/ompi@e7c59e3adb
2014-11-10 13:06:50 +09:00
Howard Pritchard
5c08aa8552
enable ugni btl to work without disable-dlopen
...
There were mistakes in the Makefiles for the ugni btl and
mca/common/ugni that prevented the ugni btl from being
used unless one happened to set the --disable-dlopen option
on the config line.
This commit fixes this problem.
2014-11-09 15:19:47 -07:00
Ralph Castain
8c837d3cb3
Doh - if we can't output an entire block, then we need to adjust the number of bytes remaining to be output or else we will output duplicate bytes when next we are able to write.
2014-11-07 13:13:13 -08:00
rolfv
022612c83b
Missed a removal from previous commit
2014-11-07 11:08:41 -08:00
rolfv
cbb43d5ac3
Make sure initialization happens
2014-11-07 11:00:45 -08:00
bosilca
e7c59e3adb
Merge pull request #227 from ggouaillardet/rfc/coll_basic_neighbor
...
RFC/coll basic neighbor
2014-11-07 11:33:25 -05:00
Ralph Castain
b56b744041
Silence some warnings and remove debug output
2014-11-07 07:54:01 -08:00
Elena
eb7872488c
fix incorrect mca param registration in latest commit
2014-11-07 07:31:42 +02:00
elenash
2687637071
Merge pull request #263 from elenash/master
...
dstore sm component implementing shared memory database for pmix client/server communication
2014-11-07 07:56:55 +03:00
Howard Pritchard
b389895c66
fix make dist for pmix/cray
...
Include file was left out of "sources" list that prevented
building for cray from dist tarball.
2014-11-06 15:10:51 -07:00
Howard Pritchard
59f8d0a92d
cleanup ugni compiler warnings
2014-11-06 12:25:10 -07:00
Ralph Castain
8f20b2e170
Remove invalid NEWS item for 1.8.4
2014-11-06 09:33:26 -08:00
Ralph Castain
1808746c1d
Merge remote-tracking branch 'origin'
2014-11-06 08:42:54 -08:00
George Bosilca
8da5dcc22e
Don't release the provided opal_proc in the error path.
2014-11-06 08:42:23 -08:00
Gilles Gouaillardet
e269a52ac7
btl/openib: send openib modex with the PMIX_GLOBAL flag
2014-11-06 08:42:23 -08:00
Ralph Castain
ce92c7c309
Copy 1.8.4 release NEWS entries
2014-11-06 08:41:57 -08:00