2013-07-19 22:13:58 +00:00
|
|
|
/*
|
2016-02-05 13:04:34 -08:00
|
|
|
* Copyright (c) 2013-2016 Cisco Systems, Inc. All rights reserved.
|
2013-07-19 22:13:58 +00:00
|
|
|
* $COPYRIGHT$
|
|
|
|
*
|
|
|
|
* Additional copyrights may follow
|
|
|
|
*
|
|
|
|
* $HEADER$
|
|
|
|
*/
|
|
|
|
|
George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-)
WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL
All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic.
This commit was SVN r32317.
2014-07-26 00:47:28 +00:00
|
|
|
#include "opal_config.h"
|
2013-07-19 22:13:58 +00:00
|
|
|
|
|
|
|
#include <stdio.h>
|
|
|
|
#include <unistd.h>
|
|
|
|
|
2013-07-22 17:28:23 +00:00
|
|
|
#include "opal/util/show_help.h"
|
George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-)
WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL
All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic.
This commit was SVN r32317.
2014-07-26 00:47:28 +00:00
|
|
|
#include "opal/constants.h"
|
|
|
|
#include "opal/util/if.h"
|
2013-07-19 22:13:58 +00:00
|
|
|
|
2014-12-02 13:09:46 -08:00
|
|
|
#include "btl_usnic_module.h"
|
2013-07-19 22:13:58 +00:00
|
|
|
#include "btl_usnic_util.h"
|
|
|
|
|
|
|
|
|
2014-07-30 20:52:06 +00:00
|
|
|
void opal_btl_usnic_exit(opal_btl_usnic_module_t *module)
|
2013-07-19 22:13:58 +00:00
|
|
|
{
|
2014-07-30 20:52:06 +00:00
|
|
|
if (NULL == module) {
|
|
|
|
/* Find the first module with an error callback */
|
2014-12-02 13:09:46 -08:00
|
|
|
for (int i = 0; i < mca_btl_usnic_component.num_modules; ++i) {
|
2015-08-06 10:54:28 -07:00
|
|
|
if (NULL != mca_btl_usnic_component.usnic_active_modules &&
|
|
|
|
NULL != mca_btl_usnic_component.usnic_active_modules[i] &&
|
|
|
|
NULL != mca_btl_usnic_component.usnic_active_modules[i]->pml_error_callback) {
|
2014-07-30 20:52:06 +00:00
|
|
|
module = mca_btl_usnic_component.usnic_active_modules[i];
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
/* If we didn't find a PML error callback, just exit. */
|
|
|
|
if (NULL == module) {
|
|
|
|
exit(1);
|
|
|
|
}
|
|
|
|
}
|
2013-07-19 22:13:58 +00:00
|
|
|
|
2014-07-30 20:52:06 +00:00
|
|
|
/* After discussion with George, we decided that it was safe to
|
|
|
|
cast away the const from opal_proc_local_get() -- the error
|
|
|
|
function needs to be smart enough to not take certain actions
|
|
|
|
if the passed proc is yourself (e.g., don't call del_procs() on
|
|
|
|
yourself). */
|
|
|
|
if (NULL != module->pml_error_callback) {
|
|
|
|
module->pml_error_callback(&module->super,
|
|
|
|
MCA_BTL_ERROR_FLAGS_FATAL,
|
|
|
|
(opal_proc_t*) opal_proc_local_get(),
|
|
|
|
"usnic");
|
2013-07-19 22:13:58 +00:00
|
|
|
}
|
2014-07-30 20:52:06 +00:00
|
|
|
|
|
|
|
/* If the PML error callback returns (or if there wasn't one),
|
|
|
|
just exit. Shrug. */
|
|
|
|
exit(1);
|
2013-07-19 22:13:58 +00:00
|
|
|
}
|
|
|
|
|
|
|
|
|
2014-12-02 13:09:46 -08:00
|
|
|
/*
|
|
|
|
* Simple utility in a .c file, mainly so that inline functions in .h
|
|
|
|
* files don't need to include the show_help header file.
|
|
|
|
*/
|
|
|
|
void opal_btl_usnic_util_abort(const char *msg, const char *file, int line)
|
|
|
|
{
|
|
|
|
opal_show_help("help-mpi-btl-usnic.txt", "internal error after init",
|
|
|
|
true,
|
|
|
|
opal_process_info.nodename,
|
2015-12-15 19:01:19 -08:00
|
|
|
file, line, msg);
|
2014-12-02 13:09:46 -08:00
|
|
|
|
|
|
|
opal_btl_usnic_exit(NULL);
|
|
|
|
/* Never returns */
|
|
|
|
}
|
|
|
|
|
|
|
|
|
2013-07-19 22:13:58 +00:00
|
|
|
void
|
2014-12-02 13:09:46 -08:00
|
|
|
opal_btl_usnic_dump_hex(void *vaddr, int len)
|
2013-07-19 22:13:58 +00:00
|
|
|
{
|
|
|
|
char buf[128];
|
|
|
|
size_t bufspace;
|
|
|
|
int i, ret;
|
|
|
|
char *p;
|
|
|
|
uint32_t sum=0;
|
2014-12-02 13:09:46 -08:00
|
|
|
uint8_t *addr;
|
2013-07-19 22:13:58 +00:00
|
|
|
|
2014-12-02 13:09:46 -08:00
|
|
|
addr = vaddr;
|
2013-07-19 22:13:58 +00:00
|
|
|
p = buf;
|
|
|
|
memset(buf, 0, sizeof(buf));
|
|
|
|
bufspace = sizeof(buf) - 1;
|
|
|
|
|
|
|
|
for (i=0; i<len; ++i) {
|
|
|
|
ret = snprintf(p, bufspace, "%02x ", addr[i]);
|
|
|
|
p += ret;
|
|
|
|
bufspace -= ret;
|
|
|
|
|
|
|
|
sum += addr[i];
|
|
|
|
if ((i&15) == 15) {
|
|
|
|
opal_output(0, "%4x: %s\n", i&~15, buf);
|
|
|
|
|
|
|
|
p = buf;
|
|
|
|
memset(buf, 0, sizeof(buf));
|
|
|
|
bufspace = sizeof(buf) - 1;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
if ((i&15) != 0) {
|
|
|
|
opal_output(0, "%4x: %s\n", i&~15, buf);
|
|
|
|
}
|
|
|
|
/*opal_output(0, "buffer sum = %x\n", sum); */
|
|
|
|
}
|
|
|
|
|
|
|
|
|
2014-02-26 22:21:25 +00:00
|
|
|
/*
|
|
|
|
* Trivial wrapper around snprintf'ing an IPv4 address, with or
|
|
|
|
* without a CIDR mask (we don't usually carry around addresses in
|
|
|
|
* struct sockaddr form, so this wrapper is marginally easier than
|
|
|
|
* using inet_ntop()).
|
|
|
|
*/
|
George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-)
WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL
All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic.
This commit was SVN r32317.
2014-07-26 00:47:28 +00:00
|
|
|
void opal_btl_usnic_snprintf_ipv4_addr(char *out, size_t maxlen,
|
2016-02-05 13:04:34 -08:00
|
|
|
uint32_t addr_be, uint32_t netmask_be)
|
2014-02-26 22:21:25 +00:00
|
|
|
{
|
2014-12-02 13:09:46 -08:00
|
|
|
int prefixlen;
|
2016-02-05 13:04:34 -08:00
|
|
|
uint32_t netmask = ntohl(netmask_be);
|
|
|
|
uint32_t addr = ntohl(addr_be);
|
2014-02-26 22:21:25 +00:00
|
|
|
uint8_t *p = (uint8_t*) &addr;
|
2016-02-05 13:04:34 -08:00
|
|
|
|
2014-12-02 13:09:46 -08:00
|
|
|
if (netmask != 0) {
|
|
|
|
prefixlen = 33 - ffs(netmask);
|
2014-02-26 22:21:25 +00:00
|
|
|
snprintf(out, maxlen, "%u.%u.%u.%u/%u",
|
|
|
|
p[3],
|
2016-02-05 13:04:34 -08:00
|
|
|
p[2],
|
|
|
|
p[1],
|
|
|
|
p[0],
|
2014-12-02 13:09:46 -08:00
|
|
|
prefixlen);
|
2014-02-26 22:21:25 +00:00
|
|
|
} else {
|
|
|
|
snprintf(out, maxlen, "%u.%u.%u.%u",
|
2016-02-05 13:04:34 -08:00
|
|
|
p[3],
|
2014-02-26 22:21:25 +00:00
|
|
|
p[2],
|
2016-02-05 13:04:34 -08:00
|
|
|
p[1],
|
|
|
|
p[0]);
|
2014-02-26 22:21:25 +00:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
|
2013-10-23 15:51:11 +00:00
|
|
|
/* Pretty-print the given boolean array as a hexadecimal string. slen should
|
|
|
|
* include space for any null terminator. */
|
2014-12-02 13:09:46 -08:00
|
|
|
void opal_btl_usnic_snprintf_bool_array(char *s, size_t slen, bool a[],
|
|
|
|
size_t alen)
|
2013-10-23 15:51:11 +00:00
|
|
|
{
|
|
|
|
size_t i = 0;
|
|
|
|
size_t j = 0;
|
|
|
|
|
|
|
|
/* could accommodate other cases, but not needed right now */
|
|
|
|
assert(slen % 4 == 0);
|
|
|
|
|
|
|
|
/* compute one nybble at a time */
|
|
|
|
while (i < alen && (j < slen - 1)) {
|
|
|
|
unsigned char tmp = 0;
|
|
|
|
|
|
|
|
/* first bool is the leftmost (most significant) bit of the nybble */
|
|
|
|
tmp |= !!a[i+0] << 3;
|
|
|
|
tmp |= !!a[i+1] << 2;
|
|
|
|
tmp |= !!a[i+2] << 1;
|
|
|
|
tmp |= !!a[i+3] << 0;
|
|
|
|
tmp += '0';
|
|
|
|
s[j] = tmp;
|
|
|
|
|
|
|
|
++j;
|
|
|
|
i += 4;
|
|
|
|
}
|
|
|
|
|
|
|
|
s[j++] = '\0';
|
|
|
|
assert(i <= alen);
|
|
|
|
assert(j <= slen);
|
|
|
|
}
|
|
|
|
|
2013-11-04 22:52:03 +00:00
|
|
|
/* Return the largest size data size that can be packed into max_len using the
|
|
|
|
* given convertor. For example, a 1000 byte max_len buffer may only be able
|
|
|
|
* to hold 998 bytes if an indivisible convertor element straddles the 1000
|
|
|
|
* byte boundary.
|
|
|
|
*
|
|
|
|
* This routine internally clones the convertor and does not mutate it!
|
|
|
|
*/
|
George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-)
WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL
All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic.
This commit was SVN r32317.
2014-07-26 00:47:28 +00:00
|
|
|
size_t opal_btl_usnic_convertor_pack_peek(
|
2013-11-04 22:52:03 +00:00
|
|
|
const opal_convertor_t *conv,
|
|
|
|
size_t max_len)
|
|
|
|
{
|
|
|
|
int rc;
|
|
|
|
size_t packable_len, position;
|
|
|
|
opal_convertor_t temp;
|
|
|
|
|
|
|
|
OBJ_CONSTRUCT(&temp, opal_convertor_t);
|
|
|
|
position = conv->bConverted + max_len;
|
|
|
|
rc = opal_convertor_clone_with_position(conv, &temp, 1, &position);
|
|
|
|
if (OPAL_UNLIKELY(rc < 0)) {
|
|
|
|
BTL_ERROR(("unexpected convertor error"));
|
|
|
|
abort(); /* XXX */
|
|
|
|
}
|
|
|
|
assert(position >= conv->bConverted);
|
|
|
|
packable_len = position - conv->bConverted;
|
|
|
|
OBJ_DESTRUCT(&temp);
|
|
|
|
return packable_len;
|
|
|
|
}
|