[Bugs] [Bug 1253303] New: brick crashes cause of RDMA

bugzilla at redhat.com bugzilla at redhat.com
Thu Aug 13 12:42:34 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1253303

            Bug ID: 1253303
           Summary: brick crashes cause of RDMA
           Product: GlusterFS
           Version: 3.7.2
         Component: rdma
          Severity: high
          Assignee: bugs at gluster.org
          Reporter: geoffrey.letessier at cnrs.fr
                CC: bugs at gluster.org, gluster-bugs at redhat.com



Created attachment 1062515
  --> https://bugzilla.redhat.com/attachment.cgi?id=1062515&action=edit
2 of my 4 storage brick logs

Description of problem:
Sometimes a few minutes after having [re]start a volume, sometimes more, i see
some bricks in a down state.

Version-Release number of selected component (if applicable):
GlusterFS 3.7.2

How reproducible:
really often

Steps to Reproduce:
1. start the volume
2. wait a moment
3. check to volume status

Actual results:
1 (or more) brick is down

Expected results:
all bricks should be UP.

Additional info:
Here is an extract of one brick log:
==
[2015-07-21 15:31:28.870310] I [MSGID: 115034]
[server.c:397:_check_for_auth_option] 0-/export/brick_workdir/brick1/data: skip
format check for non-addr auth option
auth.login./export/brick_workdir/brick1/data.allow
[2015-07-21 15:31:28.870342] I [event-epoll.c:629:event_dispatch_epoll_worker]
0-epoll: Started thread with index 2
[2015-07-21 15:31:28.870367] I [MSGID: 115034]
[server.c:397:_check_for_auth_option] 0-/export/brick_workdir/brick1/data: skip
format check for non-addr auth option
auth.login.4f1596d6-a806-4b21-9efa-c6a824b756e7.password
[2015-07-21 15:31:28.882071] I [rpcsvc.c:2213:rpcsvc_set_outstanding_rpc_limit]
0-rpc-service: Configured rpc.outstanding-rpc-limit with value 64
[2015-07-21 15:31:28.882166] W [options.c:936:xl_opt_validate]
0-vol_workdir_amd-server: option 'listen-port' is deprecated, preferred is
'transport.socket.listen-port', continuing with correction
pending frames:
patchset: git://git.gluster.com/glusterfs.git
signal received: 11
time of crash:
2015-07-21 15:33:21
configuration details:
argp 1
backtrace 1
dlfcn 1
libpthread 1
llistxattr 1
setfsid 1
spinlock 1
epoll.h 1
xattr.h 1
st_atim.tv_nsec 1
package-string: glusterfs 3.7.2
/usr/lib64/libglusterfs.so.0(_gf_msg_backtrace_nomem+0xb6)[0x3386824b76]
/usr/lib64/libglusterfs.so.0(gf_print_trace+0x33f)[0x33868435af]
/lib64/libc.so.6[0x3c432326a0]
/usr/lib64/glusterfs/3.7.2/rpc-transport/rdma.so(+0x67e0)[0x7ff76edb17e0]
/usr/lib64/glusterfs/3.7.2/rpc-transport/rdma.so(+0xbf7b)[0x7ff76edb6f7b]
/lib64/libpthread.so.0[0x3c436079d1]
/lib64/libc.so.6(clone+0x6d)[0x3c432e89dd]
==

In attachments you can find all my brick logs from 2 of my storage nodes.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list