[Gluster-users] Starting RDMA volume fails
Jochen Becker
j.becker at uvensys.de
Mon Nov 9 22:31:50 UTC 2015
Hi folks,
I have some problems starting a replica volume on a two node infiniband
setup.
Both systems are running the same hardware and Infiniband (ipoib,
ibverbs) seems to work well.
OS is Centos 7.1 fresh install and updated, Mellanox Ofed is in use,
openibd is running both gluster peers are in connected state with each
other. Creating the volume was no problem at all, but starting always
fails. Using the force option the volume seems to be started but cannot
be mounted.
Here are the entries of mnt-bricks-instances.log that happen during the
command gluster volume start instances:
[2015-11-09 22:01:00.153360] I [MSGID: 100030] [glusterfsd.c:2318:main]
0-/usr/sbin/glusterfsd: Started running /usr/sbin/glusterfsd version
3.7.5 (args: /usr/sbin/glusterfsd -s compute02 --volfile-id
instances.compute02.mnt-bricks-instances -p
/var/lib/glusterd/vols/instances/run/compute02-mnt-bricks-instances.pid
-S /var/run/gluster/8f5e59a0b8d5949b51b4c276192b0725.socket --brick-name
/mnt/bricks/instances -l
/var/log/glusterfs/bricks/mnt-bricks-instances.log --xlator-option
*-posix.glusterd-uuid=52109ce5-6173-4d22-bffc-a03c71d24791 --brick-port
49152 --xlator-option instances-server.listen-port=49152
--volfile-server-transport=rdma)
[2015-11-09 22:01:00.169326] I [MSGID: 101190]
[event-epoll.c:632:event_dispatch_epoll_worker] 0-epoll: Started thread
with index 1
[2015-11-09 22:01:00.177472] I [graph.c:269:gf_add_cmdline_options]
0-instances-server: adding option 'listen-port' for volume
'instances-server' with value '49152'
[2015-11-09 22:01:00.177519] I [graph.c:269:gf_add_cmdline_options]
0-instances-posix: adding option 'glusterd-uuid' for volume
'instances-posix' with value '52109ce5-6173-4d22-bffc-a03c71d24791'
[2015-11-09 22:01:00.177826] I [MSGID: 115034]
[server.c:403:_check_for_auth_option] 0-/mnt/bricks/instances: skip
format check for non-addr auth option auth.login./mnt/bricks/instances.allow
[2015-11-09 22:01:00.177913] I [MSGID: 115034]
[server.c:403:_check_for_auth_option] 0-/mnt/bricks/instances: skip
format check for non-addr auth option
auth.login.9d64b3ec-9d24-41ac-ba84-cf58c67c9b21.password
[2015-11-09 22:01:00.177916] I [MSGID: 101190]
[event-epoll.c:632:event_dispatch_epoll_worker] 0-epoll: Started thread
with index 2
[2015-11-09 22:01:00.179299] I
[rpcsvc.c:2215:rpcsvc_set_outstanding_rpc_limit] 0-rpc-service:
Configured rpc.outstanding-rpc-limit with value 64
[2015-11-09 22:01:00.181636] W [MSGID: 101002]
[options.c:957:xl_opt_validate] 0-instances-server: option 'listen-port'
is deprecated, preferred is 'transport.rdma.listen-port', continuing
with correction
[2015-11-09 22:01:00.183742] W [MSGID: 103071]
[rdma.c:4592:__gf_rdma_ctx_create] 0-rpc-transport/rdma: rdma_cm event
channel creation failed [Keine Berechtigung]
[2015-11-09 22:01:00.183782] W [MSGID: 103055] [rdma.c:4899:init]
0-rdma.instances-server: Failed to initialize IB Device
[2015-11-09 22:01:00.183796] W [rpc-transport.c:359:rpc_transport_load]
0-rpc-transport: 'rdma' initialization failed
[2015-11-09 22:01:00.183866] W [rpcsvc.c:1597:rpcsvc_transport_create]
0-rpc-service: cannot create listener, initing the transport failed
[2015-11-09 22:01:00.183884] W [MSGID: 115045] [server.c:1019:init]
0-instances-server: creation of listener failed
[2015-11-09 22:01:00.183898] E [MSGID: 101019]
[xlator.c:428:xlator_init] 0-instances-server: Initialization of volume
'instances-server' failed, review your volfile again
[2015-11-09 22:01:00.183912] E [graph.c:322:glusterfs_graph_init]
0-instances-server: initializing translator failed
[2015-11-09 22:01:00.183921] E [graph.c:661:glusterfs_graph_activate]
0-graph: init failed
[2015-11-09 22:01:00.184429] W [glusterfsd.c:1236:cleanup_and_exit]
(-->/usr/sbin/glusterfsd(mgmt_getspec_cbk+0x331) [0x7f7b6aee02f1]
-->/usr/sbin/glusterfsd(glusterfs_process_volfp+0x126) [0x7f7b6aedb0f6]
-->/usr/sbin/glusterfsd(cleanup_and_exit+0x69) [0x7f7b6aeda6d9] ) 0-:
received signum (0), shutting down
I think the problem starts at 2015-11-09 22:01:00.183742 where the
channel creation with rdma_cm fails. "Keine Berechtigung" means
something like missing permissions/rights. Module rdma_cm is loaded and
I can't find any other problem with the Infiniband or rdma. I have no
clue what is going wrong here, so any hints on how to proceed are
appreciated.
Cheers,
Jochen
More information about the Gluster-users
mailing list