Ralph Castain
4e3d12d9c1
Fix suicide operation when MPI app loses connection to its local daemon. In that scenario, we correctly callback up to the MPI layer notifying it of the lost connection. However, when the MPI layer calls back down to tell the RTE to abort, it passes back a flag indicating we should report that error to our local daemon - which is dead. This leads to an infinite loop. Break it by checking the flag indicating that an abnormal termination was ordered by the RTE, and in that case don't attempt to send the message.
...
cmr=v1.7.4:reviewer=jsquyres
This commit was SVN r30475.
2014-01-29 16:56:54 +00:00
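The loop-breaking check described in r30475 can be illustrated with a minimal sketch. This is not the actual ORTE code: `rte_abort`, `on_daemon_connection_lost`, `send_abort_report`, and the `abnormal_term_ordered` flag are hypothetical names chosen for illustration, and the real abort path would terminate the process rather than return.

```c
#include <stdbool.h>

/* Hypothetical flag: set when the RTE itself ordered an abnormal
 * termination, e.g. after losing the connection to the local daemon. */
static bool abnormal_term_ordered = false;

/* Hypothetical counter standing in for "messages sent to the daemon". */
static int reports_sent = 0;

/* Stand-in for sending an abort report to the local daemon. */
static void send_abort_report(void)
{
    reports_sent++;
}

/* Called by the upper (MPI) layer to abort. `report` asks the RTE to
 * notify the local daemon. Returns the number of reports sent so far. */
int rte_abort(int status, bool report)
{
    (void)status;
    /* The guard: never try to report to a daemon we already know is gone. */
    if (report && !abnormal_term_ordered) {
        send_abort_report();
    }
    /* Real code would terminate the process here. */
    return reports_sent;
}

/* Called when the connection to the local daemon is lost. The upper
 * layer is assumed to call back down with report == true. */
int on_daemon_connection_lost(void)
{
    abnormal_term_ordered = true;
    return rte_abort(1, true);
}
```

Without the `!abnormal_term_ordered` check, the report attempt would fail against the dead daemon and re-enter the lost-connection path, which is the loop the commit message describes.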
Ralph Castain
410a3afa7b
Fix --without-hwloc operations - must default to map-by slot in that scenario
...
cmr=v1.7.4:reviewer=jsquyres
This commit was SVN r30474.
2014-01-29 16:54:05 +00:00
George Bosilca
489f093b59
It didn't compile. Clean up the header inclusions a little.
...
This commit was SVN r30473.
2014-01-29 14:30:55 +00:00
Jeff Squyres
a9d665b142
Sync with 1.7 README.
...
This commit was SVN r30467.
2014-01-28 22:40:14 +00:00
Jeff Squyres
3fa9d36aba
Per http://www.open-mpi.org/community/lists/devel/2014/01/13938.php,
Orion Poplawski noticed that we should not be installing mpio.h.
cmr=v1.7.4:reviewer=hjelmn:subject=do not install mpio.h
This commit was SVN r30465.
2014-01-28 21:46:26 +00:00
Ralph Castain
bf73c517ba
Update the search_compare.pl script so it outputs the names of the files being updated
...
This commit was SVN r30464.
2014-01-28 19:37:27 +00:00
Ralph Castain
42eb0bbe1b
Fix --without-hwloc builds
...
cmr=v1.7.4:reviewer=jsquyres
This commit was SVN r30462.
2014-01-28 17:10:32 +00:00
Ralph Castain
c874ce3b61
Don't look for the coretemp file when configuring as it might not be on the head node, but is available on the backend
...
Refs trac:4176
This commit was SVN r30461.
The following Trac tickets were found above:
Ticket 4176 --> https://svn.open-mpi.org/trac/ompi/ticket/4176
2014-01-28 16:15:12 +00:00
Ralph Castain
56a13f134c
Set ignore
...
This commit was SVN r30460.
2014-01-28 16:09:12 +00:00
Ralph Castain
63a52e321a
update platform files
...
cmr=v1.7.4:reviewer=ompi-gk1.7
This commit was SVN r30459.
2014-01-28 15:50:29 +00:00
Jeff Squyres
e098073d62
Merge v1.7 README changes into trunk
...
This commit was SVN r30457.
2014-01-28 15:36:52 +00:00
Jeff Squyres
4edeb229cc
Add MPIEXEC_TIMEOUT environment variable to the man page.
...
cmr=v1.7.4:reviewer=rhc
This commit was SVN r30455.
2014-01-28 14:40:17 +00:00
Mike Dubman
30e1e49a9e
OSHMEM: refactoring to reuse common functions from different components.
...
This is preparation for moving verbs dependent code out from memheap/base component
Refs: #3763
This commit was SVN r30454.
2014-01-28 07:30:36 +00:00
Jeff Squyres
0feddcd92b
Add lots of bullets for 1.7.4.
...
This commit was SVN r30451.
2014-01-28 04:45:01 +00:00
Jeff Squyres
1e3fc4d321
Add --with-libevent configure switch notes.
...
This commit was SVN r30450.
2014-01-28 04:44:19 +00:00
Ralph Castain
84a0ab3a75
Ah @$#!$#% - missed one last help message that needs to be corrected.
...
cmr=v1.7.4:reviewer=jsquyres:subject=correct help message
This commit was SVN r30449.
2014-01-28 04:03:24 +00:00
Ralph Castain
fa64a401c6
Update LICENSE file with Intel info
...
cmr=v1.7.4:reviewer=ompi-gk1.7
This commit was SVN r30441.
2014-01-27 23:43:05 +00:00
Jeff Squyres
6adc16081a
Add a configure test that mimics what OMPI's current mpi_f08
implementation does (that is not quite adherent to the Fortran
standard). If a compiler allows this behavior, build the mpi_f08
wrapper. For example, ifort allows it, but Pathscale/EKOPath 5.0 is
stricter in its Fortran compliance and disallows it.
This test is temporary; the real fix is to make OMPI adhere to Fortran
properly (i.e., see #4157 ). Once we fix #4157 , this test should be
removed. The main reason for committing this test is to put it into
v1.7.4 so that we can release, but with the intent to remove it by
1.7.5 (or 1.8.x at the latest!).
Refs trac:4157
cmr=v1.7.4:reviewer=ompi-rm1.7:subject=Add mpi_f08-(non)compliance configure test
This commit was SVN r30440.
The following Trac tickets were found above:
Ticket 4157 --> https://svn.open-mpi.org/trac/ompi/ticket/4157
2014-01-27 23:37:42 +00:00
Ralph Castain
af4a9a0688
Make clear that --with-pmi can/should be used to specify the path to the pmi installation since at least one person didn't realize it.
...
cmr=v1.7.4:reviewer=jsquyres
This commit was SVN r30439.
2014-01-27 22:50:37 +00:00
Ralph Castain
941bfd4604
Final cleanup of cpus-per-proc for 1.7.4 - provide better checking for cpus-per-proc and mismatched mapping/binding directives, and provide error messages telling the user what to do to get it right.
...
cmr=v1.7.4:reviewer=jsquyres
This commit was SVN r30438.
2014-01-27 22:40:51 +00:00
Rolf vandeVaart
791a3a5ec6
Fix CUDA-aware support with sendi optimization. Need to make sure copy function
is initialized.
This commit was SVN r30437.
2014-01-27 18:35:01 +00:00
Jeff Squyres
0b5a1b5de1
Update svn:ignore -- no need to ignore a file that does not exist
...
This commit was SVN r30436.
2014-01-27 16:35:05 +00:00
Jeff Squyres
0120852a2f
Per Paul Hargrove's point: we shouldn't be doing anything for F77 any more.
...
This commit was SVN r30435.
2014-01-27 15:37:13 +00:00
Ralph Castain
53b1be5067
Only report launch progress when specifically requested to do so. Thanks to Tetsuya Mishima for spotting it.
...
Reviewed by rhc and RM-approved
cmr=v1.7.4:reviewer=ompi-gk1.7
This commit was SVN r30434.
2014-01-27 15:17:42 +00:00
Ralph Castain
956aab03a7
Track the origin of a message so it can be passed across transports
...
Refs trac:4184
This commit was SVN r30433.
The following Trac tickets were found above:
Ticket 4184 --> https://svn.open-mpi.org/trac/ompi/ticket/4184
2014-01-26 21:09:26 +00:00
Ralph Castain
09267731f8
Add another header to be checked
...
cmr=v1.7.5:reviewer=rhc:subject=Add usock oob support
This commit was SVN r30432.
2014-01-26 21:05:45 +00:00
George Bosilca
bde9619386
Various minor cleanups.
...
This commit was SVN r30431.
2014-01-26 17:27:12 +00:00
George Bosilca
ea3cb1e110
Don't forget to call del_procs.
...
This commit was SVN r30430.
2014-01-26 17:26:40 +00:00
George Bosilca
d265981c55
Don't always retain the proc; do it only for new procs. This enforces a strict policy in the BML: it holds one and only one ref on each proc.
...
This commit was SVN r30429.
2014-01-26 17:26:04 +00:00
George Bosilca
18ae20022a
Don't forget to release the bitmaps.
...
This commit was SVN r30428.
2014-01-26 17:24:38 +00:00
Mike Dubman
ffe662c98e
OSHMEM: use shmem barrier instead of internal
...
Refs: 3763
This commit was SVN r30427.
2014-01-25 07:49:56 +00:00
Ralph Castain
11562ab7cb
Ensure we build the sensor components even if the local system doesn't have the required directories and/or access permissions. Backend nodes that get the binary may have them, and aggregators need to load the component so they can log data even if they aren't locally monitoring. Detect that we can't access the required files when we first try to sample and turn the sampling portion of the plugin off at that time.
...
Refs trac:4172
This commit was SVN r30426.
The following Trac tickets were found above:
Ticket 4172 --> https://svn.open-mpi.org/trac/ompi/ticket/4172
2014-01-25 04:34:33 +00:00
Jeff Squyres
967550b3ac
Update the BIND(C, name="foo") test to see if the compiler supports
names longer than 32 characters.
Per discussion on the devel list starting here:
http://www.open-mpi.org/community/lists/devel/2014/01/13799.php we
need a new litmus test to disqualify older Fortran compilers (e.g.,
Pathscale 4.0.12) that *seem* to support all the Right Things, but a)
do not support BIND(C, name="super_long_name") or b) run into an
internal error when compiling our mpi_f08 module.
Testing for b) is sketchy at best. But OMPI has some BIND(C) names
that are >32 characters, and the same compilers that exhibit b) also
seem to not support BIND(C) names that are >32 characters (i.e., a)).
Hence, the following BIND(C) test checks to ensure that BIND(C,
name="foo") works, where "foo" is actually a name >32 characters.
cmr=v1.7.4:reviewer=rhc:subject=Update Fortran configure test to exclude older pathscale/open64 compilers from mpi_f08
This commit was SVN r30421.
2014-01-24 23:17:14 +00:00
Jeff Squyres
01e07f25e0
Ignore compiled Fortran module files
...
This commit was SVN r30419.
2014-01-24 22:44:45 +00:00
Jeff Squyres
21ffddbbd0
Addendum to r30408: if we're going to remove stale kruft, let's remove
all of it. :-)
Refs trac:4175.
This commit was SVN r30417.
The following SVN revision numbers were found above:
r30408 --> open-mpi/ompi@31acdb15bc
The following Trac tickets were found above:
Ticket 4175 --> https://svn.open-mpi.org/trac/ompi/ticket/4175
2014-01-24 22:19:36 +00:00
Ralph Castain
f73d23e723
Correct the location of the counter when tracking process launch for reporting progress
...
cmr=v1.7.4:reviewer=hjelmn
This commit was SVN r30415.
2014-01-24 21:03:05 +00:00
Jeff Squyres
98d67add3c
Updates to the README and wrapper compiler man pages for Fortran.
...
Thanks to Paul Hargrove for spotting these issues.
cmr=v1.7.4:reviewer=rhc:subject=Fortran README+man page updates
This commit was SVN r30414.
2014-01-24 21:00:00 +00:00
Ralph Castain
e3cb4b4a5b
Grant Nathan his wish - add a --disable-getpwuid switch to the configure options and protect all users of that code so it disappears if disabled.
...
cmr=v1.7.5:reviewer=hjelmn:subject=disable getpwuid if requested
This commit was SVN r30413.
2014-01-24 19:18:37 +00:00
Ralph Castain
b32556e6dc
Fixes trac:4143
...
After IM with Nathan, apply patch from ticket after verification by Paul Hargrove that it fixes the problem on non-x86 32-bit platforms
Verified by Paul, RM-approved
cmr=v1.7.4:reviewer=ompi-gk1.7
This commit was SVN r30411.
The following Trac tickets were found above:
Ticket 4143 --> https://svn.open-mpi.org/trac/ompi/ticket/4143
2014-01-24 17:56:52 +00:00
Ralph Castain
c6c10f47b8
Label rows with the table name when printing db logs
...
cmr=v1.7.5:reviewer=rhc
This commit was SVN r30410.
2014-01-24 17:30:12 +00:00
Ralph Castain
e496e348a4
Some cleanup of the sensor system to ensure things go in the right place, avoid segfaults under abnormal conditions, etc.
...
cmr=v1.7.5:reviewer=rhc
This commit was SVN r30409.
2014-01-24 17:29:24 +00:00
Ralph Castain
31acdb15bc
We haven't really supported orteCC in a long time, so let's remove the stale cruft. Thanks to Paul Hargrove for noticing!
...
cmr=v1.7.4:reviewer=jsquyres:subject=remove stale orteCC cruft
This commit was SVN r30408.
2014-01-24 17:26:54 +00:00
Adrian Reber
0af2897c12
removed trailing whitespace in orte-checkpoint.c
...
This commit was SVN r30407.
2014-01-24 17:23:49 +00:00
Adrian Reber
659eb1b10a
silence two compiler warnings
...
This commit was SVN r30406.
2014-01-24 17:22:28 +00:00
Adrian Reber
919260a0d2
fix communication between orte-checkpoint and orterun
...
Right after starting the communication with orterun the buffer
containing the message is deleted. This patch removes the deletion
of the buffer which is now done by orte_rml_send_callback(). This is
now also the callback function used by orte_rml.send_buffer_nb().
The previous callback hnp_receiver() was introduced by an
earlier patch which was only trying to get the code to compile again.
This commit was SVN r30405.
2014-01-24 17:18:28 +00:00
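The buffer-ownership rule behind r30405 can be sketched as follows. This is a simplified model, not the ORTE RML API: `buffer_t`, `send_buffer_nb`, `release_on_completion`, and `progress` are hypothetical stand-ins for a non-blocking send whose completion callback owns the buffer.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical message buffer. */
typedef struct {
    char  *data;
    size_t len;
} buffer_t;

/* Completion callback type for the hypothetical non-blocking send. */
typedef void (*send_cb_t)(int status, buffer_t *buf);

static buffer_t *pending_buf = NULL;  /* one queued send, for illustration */
static send_cb_t pending_cb  = NULL;
static int       released    = 0;     /* counts buffers freed by the callback */

/* Allocate a buffer holding a copy of msg; returns NULL on failure. */
buffer_t *buffer_new(const char *msg)
{
    buffer_t *buf = malloc(sizeof(*buf));
    if (buf == NULL) return NULL;
    buf->len  = strlen(msg) + 1;
    buf->data = malloc(buf->len);
    if (buf->data == NULL) { free(buf); return NULL; }
    memcpy(buf->data, msg, buf->len);
    return buf;
}

/* The callback releases the buffer once the send has completed. */
void release_on_completion(int status, buffer_t *buf)
{
    (void)status;
    free(buf->data);
    free(buf);
    released++;
}

/* Queue the send; the buffer must stay valid until the callback runs. */
int send_buffer_nb(buffer_t *buf, send_cb_t cb)
{
    pending_buf = buf;
    pending_cb  = cb;
    return 0;
}

/* Drive the queued send to completion and invoke its callback. */
void progress(void)
{
    if (pending_cb != NULL) {
        send_cb_t cb  = pending_cb;
        buffer_t *buf = pending_buf;
        pending_cb  = NULL;
        pending_buf = NULL;
        cb(0, buf);
    }
}

/* Caller side: hand the buffer off and do NOT free it afterwards. */
int start_communication(const char *msg)
{
    buffer_t *buf = buffer_new(msg);
    if (buf == NULL) return -1;
    return send_buffer_nb(buf, release_on_completion);
}

int released_count(void) { return released; }
```

Freeing the buffer in `start_communication` right after `send_buffer_nb` returns would release memory that the pending send still needs, which is the kind of bug the commit message describes.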
Adrian Reber
8c93ebffeb
orte_snapc_base_select() wants to know if it is an application
...
The function
int orte_snapc_base_select(bool seed, bool app);
wants to know if it is called by an application or not. Therefore
it expects 'bool app' as its second parameter. It used to be
'!ORTE_PROC_IS_DAEMON', which is not always correct if it is
a tool or an HNP. This patch changes it to ORTE_PROC_IS_APP, which
has the correct information if it is an application.
This commit was SVN r30404.
2014-01-24 17:14:41 +00:00
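Why "not a daemon" differs from "is an application" (r30404) can be shown with a small sketch. The enum and helper functions below are hypothetical and only model the four process roles named in the commit message; they are not the ORTE macros themselves.

```c
#include <stdbool.h>

/* Hypothetical process roles, modeled on those named in the commit message. */
typedef enum {
    PROC_APP,
    PROC_DAEMON,
    PROC_TOOL,
    PROC_HNP
} proc_type_t;

/* Old test: "anything that is not a daemon is an application".
 * Wrong for tools and the HNP. */
bool is_app_old(proc_type_t t)
{
    return t != PROC_DAEMON;
}

/* New test: ask directly whether the process is an application. */
bool is_app_new(proc_type_t t)
{
    return t == PROC_APP;
}
```

The two predicates agree for applications and daemons and disagree only for tools and the HNP, which are the cases the commit fixes.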
Ralph Castain
883c1a1c57
Fix ia64 operations by correcting a couple of bugs in the ia64 atomics. Thanks to Paul Hargrove for the patch!
...
Since Paul is the only one of the team with the required hardware to test it, and he has done so, consider this RM-approved.
cmr=v1.7.5:reviewer=ompi-gk1.7
This commit was SVN r30401.
2014-01-24 00:14:37 +00:00
Ralph Castain
14bf1c9463
Some minor cleanups:
...
* don't return null if someone wants to print ORTE_SUCCESS
* rename some stale process types
* keep show_help local if we are in standalone operation as there is nobody to send it to
cmr=v1.7.5:reviewer=jsquyres
This commit was SVN r30400.
2014-01-23 21:35:20 +00:00
Ralph Castain
32996cd705
Add new sensors for chip frequency and power (when permissions allow). Note that we don't support all chipsets at this time, but others are welcome to extend as desired.
...
cmr=v1.7.5:reviewer=rhc
This commit was SVN r30399.
2014-01-23 21:33:21 +00:00
Nathan Hjelm
2435057a57
ignore the iboffload component for now.
...
This commit was SVN r30398.
2014-01-23 16:06:21 +00:00