Jeff Squyres
fd61d78599
Add two MCA parameters to the MPI level to control behavior during
...
MPI_ABORT. From the ompi_info output:
MCA mpi: parameter "mpi_abort_delay" (current value: "0")
If nonzero, print out an identifying message when
MPI_ABORT is invoked (hostname, PID of the process
that called MPI_ABORT) and delay for that many seconds
before exiting (a negative delay value means to never
abort). This allows attaching of a debugger before
quitting the job.
MCA mpi: parameter "mpi_abort_print_stack" (current value: "0")
If nonzero, print out a stack trace when MPI_ABORT is
invoked
This commit was SVN r9487.
2006-03-31 00:31:15 +00:00
Galen Shipman
1d67917b69
must handle header validation correctly for each case, not enough in common
...
for the MACRO
This commit was SVN r9486.
2006-03-30 21:27:21 +00:00
Ralph Castain
8ba453b866
Modify the rmgr_proxy component so it includes the automatic wire-up of stdio.
...
This commit was SVN r9483.
2006-03-30 19:44:28 +00:00
Tim Woodall
9a73fe8beb
check for valid sequence number before attempting to use communicator
...
This commit was SVN r9482.
2006-03-30 19:36:15 +00:00
George Bosilca
2b3779cd6e
Correct some of the casting issues. By default the compilers attach an signed type
...
to the defines. As our internal types (job_id and co.) are unsigned that generate
several errors (integer overflow in expression and comparison between signed and
unsigned). Casting the defines to the correct type solve these problems.
This commit was SVN r9481.
2006-03-30 19:28:17 +00:00
Gleb Natapov
256bf70530
Forgot to add file to previous commit
...
This commit was SVN r9480.
2006-03-30 17:37:52 +00:00
Tim Woodall
637511e759
correct cleanup of callbacks
...
This commit was SVN r9479.
2006-03-30 16:55:02 +00:00
Gleb Natapov
79bcfb096f
Add type to frag. Sometimes we need to know that a frag is from short rdma area.
...
I used hack for this that doesn't work for mvapi, so changing it to something more sane.
This commit was SVN r9477.
2006-03-30 15:26:21 +00:00
Brian Barrett
e0eb9a19e7
* make orte_process_name_t be of fixed size (rather than depending on the size of
...
size_t). This should be the last piece of the puzzle required to get 32/64
interoperability working for ORTE.
This commit was SVN r9476.
2006-03-30 14:59:41 +00:00
Brian Barrett
6be35fb604
* Use the ORTE_<type> constants instead of internal DSS_TYPE_<type>_T constants
...
for the type to be packed / unpacked when dealing with sized types (like
size_t) so that the dss_unpack code to deal with types of different sizes is
activated. Necessary for proper 32/64 interoperability.
This commit was SVN r9475.
2006-03-30 14:33:25 +00:00
Gleb Natapov
ea11582191
Porting of short message RDMA from openib BTL. Endpoint registers circular buffer and sends its address and rkey to the peer. Peer uses this buffer to eagerly RDMA small message into it. Endpoint polls the buffer for message arrival before checking HP/LP QPs. Set btl_mvapi_use_eager_rdma to 1 to enable it.
...
This commit was SVN r9474.
2006-03-30 12:55:31 +00:00
George Bosilca
99ba9bea10
Correct the CRAY define (here is a interesting webpage with all the compilers defines
...
http://predef.sourceforge.net/precomp.html ).
This commit was SVN r9472.
2006-03-30 00:20:43 +00:00
Jeff Squyres
1597ee94b7
Update notes on the Portland compiler
...
This commit was SVN r9471.
2006-03-29 23:09:02 +00:00
Galen Shipman
641fa6c0d2
more fixes, reset state on completion..
...
This commit was SVN r9469.
2006-03-29 22:21:35 +00:00
Galen Shipman
2945f77f9e
randomly drop fragments without local completion, currently commented out as
...
we must handle the other cases first..
This commit was SVN r9468.
2006-03-29 22:19:58 +00:00
Andrew Friedley
0eba366b07
Various pieces all over to make basic small message send/recv work. Next step
...
is clean up the code.. it is in need of refactoring and testing.
Thanks to Brian for help in troubleshooting!
This commit was SVN r9466.
2006-03-29 21:55:41 +00:00
Jeff Squyres
142f59a435
Add release note about PGI compilers.
...
This commit was SVN r9459.
2006-03-29 19:45:45 +00:00
Brian Barrett
02c8a51b76
* fix endian encoding for 64 bit numbers to use hton64
...
* cleanup the unpack_size_mismatch macros a little bit
* ad comment about endianness of size_mismatch cleanup code so that I don't
think I've found a bug that really isn't and lose an hour tracking it down
again...
This commit was SVN r9458.
2006-03-29 18:58:02 +00:00
Galen Shipman
5271948ec0
--- opal object changes
...
add object size to opal class
no longer need the size when allocating a new object as this is stored in
the class structure
--- dr changes
Previous rev. maintained state on the communicator used for acking duplicate
fragments, but the communicator may be destroyed prior to successfull
delivery of an ack to the peer. We must therefore maintain this state
globally on a per peer, not a per peer, per communicator basis.
This requires that we use a global rank on the wire and translate this as
appropriate to a local rank within the communicator.
This commit was SVN r9454.
2006-03-29 16:19:17 +00:00
Gleb Natapov
2d9757e81b
Fix deadlock in opal_condition. If broadcast is called on condition with more than one thread waiting on it the first awaken thread may consume all signals so other threads will never run.
...
This commit was SVN r9451.
2006-03-29 14:54:58 +00:00
Sven Stork
711b30fbac
- introduction of user buffer checking for pt2pt operations
...
This commit was SVN r9449.
2006-03-29 09:26:27 +00:00
Brian Barrett
cf425f6289
* trick the stupid compiler (GCC in this case) into shutting up about not
...
being able to convert from object pointer to function pointer.
This commit was SVN r9445.
2006-03-29 01:26:16 +00:00
Brian Barrett
99e4c89183
* some typo fixes for orterun manpage
...
* Install orterun manpage as mpirun.1 and mpiexec.1 as well as orterun.1
This commit was SVN r9444.
2006-03-29 01:04:43 +00:00
Jeff Squyres
07b0e559f2
Fix copyright
...
This commit was SVN r9443.
2006-03-29 00:53:11 +00:00
George Bosilca
50b5a02f8b
Let the oob to call opal_progress instead of opal_progress_event. Now, the MPI
...
communications will be advanced in MPI_Finalize.
This commit was SVN r9442.
2006-03-28 22:09:40 +00:00
Brian Barrett
ae85aae6e5
* add some debugging if conftestval the file doesn't exist
...
* Try fclose()ing the file to see if that helps with the errors reported
with Absoft 8.2
This commit was SVN r9439.
2006-03-28 13:55:39 +00:00
George Bosilca
5d465cf118
Call the constructor on the DR lock.
...
This commit was SVN r9438.
2006-03-28 07:34:02 +00:00
George Bosilca
af7930a498
Don't forget to add the new .h file to the distribution list.
...
This commit was SVN r9437.
2006-03-28 07:16:39 +00:00
Graham Fagg
19906e66dc
missing lock?
...
This commit was SVN r9436.
2006-03-28 06:15:48 +00:00
George Bosilca
c93680074f
Enable the master convertor and add a function to clean them all up. Without talking
...
about the comments :)
This commit was SVN r9435.
2006-03-28 05:11:48 +00:00
George Bosilca
46c442fe0d
We do not have direct access to the module. Grab the one attached to the
...
window instead.
This commit was SVN r9434.
2006-03-28 05:06:40 +00:00
George Bosilca
deb1b237a7
First cut of a unique architecture base master convertor. It will hold the sizes
...
of the predefined types as seen by the remote architecture, as well as some
pointers to the conversion functions. Still in progress, but it should just fine
for what we have today in Open MPI.
This commit was SVN r9433.
2006-03-28 03:16:53 +00:00
Josh Hursey
35eb1a2970
Added a section on "Specifying Hosts" to the man page.
...
This commit was SVN r9432.
2006-03-27 23:46:38 +00:00
Tim Woodall
c1bf71b1be
- updated copyrights
...
- removed unused state
- starting to add support for btl failover
This commit was SVN r9431.
2006-03-27 22:48:12 +00:00
Tim Woodall
c724e4c804
- removed unused flags
...
- updated copyrights
This commit was SVN r9430.
2006-03-27 22:44:26 +00:00
Galen Shipman
af02867322
Use ones compiliment so we don't kill previously set flags..
...
This commit was SVN r9428.
2006-03-27 16:12:22 +00:00
Gleb Natapov
590c992a7e
fix recursive lock of openib_btl->ib_lock.
...
This commit was SVN r9427.
2006-03-26 15:02:43 +00:00
Gleb Natapov
01a119c3c5
fix compilation bug with --enable-mpi-threads
...
This commit was SVN r9426.
2006-03-26 13:24:10 +00:00
Gleb Natapov
a5a78b10cc
Implementation of short message RDMA. Endpoint registers circular buffer and sends its address and rkey to the peer. Peer uses this buffer to eagerly RDMA small message into it. Endpoint polls the buffer for message arrival before checking HP/LP QPs. Set btl_openib_use_eager_rdma to 1 to enable it.
...
This commit was SVN r9425.
2006-03-26 08:30:50 +00:00
George Bosilca
52e5d90a53
ompi_dt_swap_bytes is the way to go.
...
This commit was SVN r9424.
2006-03-25 07:12:07 +00:00
Brian Barrett
8258a998ab
* added NEWS items for the one-sided implementation and the heterogeneous
...
support.
This commit was SVN r9423.
2006-03-25 03:02:44 +00:00
Brian Barrett
b6ddded1fa
* Fix bug in determining size of C++ bool when creating the predefined
...
datatype MPI::BOOL.
* Add first round of support for heterogeneous platforms. This includes
endianness and size difference for C++ bool and Fortran LOGICAL. It
does not include differences in sizes for any other datatype or for
different representations of floating point numbers.
This commit was SVN r9422.
2006-03-25 02:53:41 +00:00
Galen Shipman
1677ca1cd4
continue to debug retransmission of incorrect offset,
...
only occurs on vfrag timeout..
This commit was SVN r9421.
2006-03-24 22:28:43 +00:00
George Bosilca
b22836ea93
Reset the completed flag when we roll-back the convertor.
...
This commit was SVN r9419.
2006-03-24 20:33:53 +00:00
Brian Barrett
01671f2991
* allow user to set "no_locks" info argument as MCA parameter to override the
...
default
* Add ability to start Put and Get requests immediately instead of queuing
until synchronizaion when using Fence. Not entirely sure this is
completely safe, so it must be explicitly enabled by the user, either with
an MCA parameter or info argument to Win_create.
This commit was SVN r9418.
2006-03-24 18:56:59 +00:00
Jeff Squyres
bc96040e1c
- Add Cisco copyright
...
- Add comment explaining why we used INT_MAX
- Update NEWS
This commit was SVN r9415.
2006-03-24 15:39:09 +00:00
Brian Barrett
14ef67934b
* update ignore files
...
This commit was SVN r9414.
2006-03-24 15:34:02 +00:00
Jeff Squyres
a843ce4c23
Clean up a minor memory leak
...
This commit was SVN r9413.
2006-03-24 15:28:42 +00:00
Brian Barrett
29c70291a9
* properly distribute the peruse test directory so that nightly builds
...
happen
This commit was SVN r9412.
2006-03-24 13:42:01 +00:00
Tim Woodall
2e376e0ee8
misc cleanup
...
This commit was SVN r9410.
2006-03-24 06:49:45 +00:00