Terry Dontje
f3aaa885a3
corrected a couple places in orte where it said cpu_model when it should have been cpu_type.
...
This commit was SVN r24221.
2011-01-11 19:56:26 +00:00
Abhishek Kulkarni
11ffa854ff
Update the FTB notifier
...
* fix indentation issues
* update the name of one of the fault events published to the FTB (per the FTB MPI standard)
This commit was SVN r24213.
2011-01-10 18:58:31 +00:00
Nathan Hjelm
c082d05ecb
Reset the timer on MPIR_being_debugged only if MPIR_being_debugged is not set. Fix typo in return code.
...
This commit was SVN r24187.
2010-12-20 21:00:49 +00:00
Ralph Castain
2dc5cbb483
Remove stale code and API from the RML/OOB frameworks. Stopped using this code years ago.
...
This commit was SVN r24153.
2010-12-05 15:58:21 +00:00
Rolf vandeVaart
b67d3398da
It is convention to have orte_config.h included at top of file.
...
This commit was SVN r24146.
2010-12-03 16:13:31 +00:00
Shiqing Fan
f43862420c
Convert the bad dos line endings to unix style for all windows related files.
...
This commit was SVN r24137.
2010-12-02 12:08:08 +00:00
Ralph Castain
aaad8ae891
Remove unused var
...
This commit was SVN r24136.
2010-12-02 02:38:13 +00:00
Ralph Castain
f9ffff59f8
Ensure clean termination of threads and tcp multicast
...
This commit was SVN r24134.
2010-12-02 00:23:42 +00:00
Nathan Hjelm
75605faa75
added support for reattaching a debugger using the MPIR_attach_fifo
...
This commit was SVN r24132.
2010-12-01 20:13:58 +00:00
Ralph Castain
ad814f26cd
One more time, into the breach!
...
Restore the use of override_oversubscribe to indicate that the data source for resources on the backend nodes used in mapping is unreliable. In this situation (e.g., data came from hostfile, or we are just using localhost because nothing was provided), we don't trust the oversubscribe condition passed by the mapper. Instead, we check locally to ensure we set sched_yield correctly.
This commit was SVN r24130.
2010-12-01 15:15:26 +00:00
Ralph Castain
eba65e97f3
Extend the rmcast APIs to allow enable/disable of comm, required for clean termination by upper layer users.
...
Point the recv thread event base to the right place so it can wakeup when required.
Add a new error code for "comm disabled" when attempting to communicate after disabling comm.
This commit was SVN r24129.
2010-12-01 13:41:19 +00:00
Ralph Castain
9224302c10
Remove debug
...
This commit was SVN r24128.
2010-12-01 13:12:24 +00:00
Ralph Castain
30c37ea536
Ensure that the oversubscribed condition of nodes is accurately reported by the mapper, and that the results are communicated and used by the backend orteds when setting sched_yield on local procs. Restores prior behavior that was somehow lost along the way.
...
Includes a patch from Damien Guinier to fix vpid assignments when cpus-per-task is specified.
This commit was SVN r24126.
2010-12-01 12:51:39 +00:00
Ralph Castain
85a974b0de
Better check for NULL before using the value
...
This commit was SVN r24122.
2010-12-01 04:48:50 +00:00
Ralph Castain
c56185887b
Change the event base "wakeup" support to enable the passing of events to the central thread for add/del. Add a macro OPAL_UPDATE_EVBASE for this purpose as it will likely be widely used.
...
Update the ORTE thread support to utilize this capability. Update the rmcast framework to track the change.
This commit was SVN r24121.
2010-12-01 04:26:43 +00:00
Ralph Castain
0441e81882
Oops - ensure that multicast msgs get circulated properly with the tcp module
...
This commit was SVN r24118.
2010-11-30 21:13:53 +00:00
Ralph Castain
d20c023348
Checkpoint the threading support for multicast - will be revised shortly, but this version currently works.
...
This commit was SVN r24117.
2010-11-30 17:30:16 +00:00
Ralph Castain
71669720a3
Just get the output once on sigpipe error, and include the fd
...
This commit was SVN r24092.
2010-11-25 15:32:48 +00:00
Ralph Castain
30c635fd4d
Don't endlessly output sigpipe errors. Count the number of times we trap it, and abort if we get more than 10 of them.
...
This commit was SVN r24091.
2010-11-25 15:25:24 +00:00
Rolf vandeVaart
09fdd5cc23
Include fcntl.h, not sys/fcntl.h so we get the definition
...
of the open system call. That is what man page says to do.
Fixes warning on Solaris.
This commit was SVN r24073.
2010-11-19 17:40:02 +00:00
Ethan Mallove
66f2301170
Just plain "Grid Engine" instead of "Sun Grid Engine"
...
This commit was SVN r24068.
2010-11-18 19:30:04 +00:00
Abhishek Kulkarni
78a67654d4
add notifier events for process migration
...
This commit was SVN r24058.
2010-11-16 17:57:44 +00:00
Abhishek Kulkarni
6e6ccae082
Update the checkpoint notification events that we throw out over the FTB with a payload embedded in {}
...
This commit was SVN r24057.
2010-11-16 17:55:57 +00:00
Jeff Squyres
e4744b4ed5
Per http://www.open-mpi.org/community/lists/devel/2010/11/8671.php ,
...
change a bunch of OMPI_<foo> names to OPAL_<foo>.
This commit was SVN r24046.
2010-11-12 23:22:11 +00:00
Ralph Castain
bb521c6b7e
Properly count local procs to set oversubscribed condition
...
This commit was SVN r24037.
2010-11-10 21:59:35 +00:00
Ralph Castain
021bd77bf1
Don't free the event base if we aren't using progress threads
...
This commit was SVN r24036.
2010-11-10 21:58:58 +00:00
Ralph Castain
57257ab9b4
Use the right event base if threads are disabled. Always update the seq num
...
This commit was SVN r24034.
2010-11-10 21:26:04 +00:00
Ralph Castain
cbb758c4fb
Allow mcast threads to be disabled
...
This commit was SVN r24032.
2010-11-10 20:16:41 +00:00
Ralph Castain
22e40d92a0
Cleanup thread termination
...
This commit was SVN r24031.
2010-11-10 19:36:44 +00:00
Ralph Castain
01347926d1
Be a little more thorough about cleaning up during finalize
...
This commit was SVN r24014.
2010-11-09 14:56:27 +00:00
Shiqing Fan
d3701ccba8
type casts.
...
This commit was SVN r24013.
2010-11-09 09:17:22 +00:00
Ralph Castain
f2f41d1ca9
Be nice to those who don't enable-multicast...poor wretches.
...
This commit was SVN r24011.
2010-11-09 05:08:55 +00:00
Ralph Castain
a47b33678b
Add orte-level thread support to avoid some of the opal_if_threads protection used solely for ompi.
...
Use threads to help process multicast messages.
This commit was SVN r24009.
2010-11-08 19:09:23 +00:00
Ralph Castain
bf665692c3
Update the rmcast callback function API to return message sequence number. Update orte_mcast test to stress the system.
...
This commit was SVN r24004.
2010-11-07 23:29:52 +00:00
Abhishek Kulkarni
e0660101d3
Throw notifier events for checkpointing status (success or failure)
...
This commit was SVN r24003.
2010-11-07 22:12:09 +00:00
Abhishek Kulkarni
8cd3759f21
use the saved value of PID, saving some calls to getpid()
...
This commit was SVN r24002.
2010-11-07 22:09:49 +00:00
Abhishek Kulkarni
d1a4cc33dd
Update the FTB notifier wrt events decided by the CIFTS working group
...
This commit was SVN r24001.
2010-11-07 22:06:32 +00:00
Abhishek Kulkarni
ac2768ca7c
LOG_SYSLOG is a syslog facility. take it off the syslog options
...
This commit was SVN r24000.
2010-11-06 22:05:45 +00:00
Ralph Castain
875a6d61a4
Return correct status code
...
This commit was SVN r23969.
2010-10-29 00:43:50 +00:00
Ralph Castain
9ea2b196ce
Convert the opal_event framework to use direct function calls instead of hiding functions behind function pointers. Eliminate the opal_object_t abstraction of libevent's event struct so it can be directly passed to the libevent functions.
...
Note: the ompi_check_libfca.m4 file had to be modified to avoid it stomping on global CPPFLAGS and the like. The file was also relocated to the ompi/config directory as it pertains solely to an ompi-layer component.
Forgive the mid-day configure change, but I know Shiqing is working the windows issues and don't want to cause him unnecessary redo work.
This commit was SVN r23966.
2010-10-28 15:22:46 +00:00
Ralph Castain
c13b0bb668
Update some debugger attachment code per LLNL request
...
This commit was SVN r23965.
2010-10-28 03:06:20 +00:00
Brian Barrett
3ed00ba148
More fixes to make OMPI compile with minimal ORTE support again
...
This commit was SVN r23962.
2010-10-27 20:40:39 +00:00
Shiqing Fan
199df1eadf
Rename a few var names.
...
This commit was SVN r23959.
2010-10-27 11:52:57 +00:00
Nathan Hjelm
e7bfbe1d1a
added missing object initialization/destruction of mca_oob_tcp_component.tcp_listen_thread_event
...
This commit was SVN r23958.
2010-10-26 22:09:37 +00:00
Shiqing Fan
a3d9c91ff7
Exclude stdbool.h for Windows, and use the definition in opal. Immigrate the socket pair support from libevent. Fix other minor things and make it compile.
...
This commit was SVN r23951.
2010-10-26 14:53:50 +00:00
Ralph Castain
894230b121
This stuff is soooo out-of-date that a complete rewrite would be required - thankfully, nobody cares
...
This commit was SVN r23944.
2010-10-26 06:22:31 +00:00
Ralph Castain
86c7365e8e
Clean up a few initialization issues - don't think these are impacting the shared memory situation as it didn't fix the problem.
...
Setup the event API to support multiple bases in preparation for splitting the OMPI and ORTE events. Holding here pending shared memory resolution.
This commit was SVN r23943.
2010-10-26 02:41:42 +00:00
Ralph Castain
fc46dfa78a
Remove stale code
...
This commit was SVN r23942.
2010-10-26 02:37:56 +00:00
George Bosilca
5882290cdd
We need a default value or the compiler will whine.
...
This commit was SVN r23940.
2010-10-25 19:05:45 +00:00
Abhishek Kulkarni
c671ec52d1
Fix broken trunk compile after the libevent changes.
...
This commit was SVN r23929.
2010-10-25 14:11:48 +00:00