[Bugs] [Bug 1432542] New: Glusterd crashes when restarted with many volumes
bugzilla at redhat.com
Wed Mar 15 15:39:03 UTC 2017
https://bugzilla.redhat.com/show_bug.cgi?id=1432542
Bug ID: 1432542
Summary: Glusterd crashes when restarted with many volumes
Product: GlusterFS
Version: mainline
Component: glusterd
Assignee: bugs at gluster.org
Reporter: jdarcy at redhat.com
CC: bugs at gluster.org
This was actually found in a test for bug 1430860, which was about *glusterfsd*
crashing under the same load. That test never reproduced the original bug, but
it could reliably reproduce this one. Examples include:
https://build.gluster.org/job/centos6-regression/3607/consoleFull
    two crashes in attach_brick/send_attach_req
https://build.gluster.org/job/centos6-regression/3608/consoleFull
    first crash in attach_brick/send_attach_req
    second crash in __synclock_unlock/list_del_init
https://build.gluster.org/job/centos6-regression/3609/consoleFull
    one crash in __synclock_unlock/list_del_init
Investigation showed that the problem lies in the lock manipulation done
around retries in attach_brick. It turns out that this is not safe after all,
especially when it allows a new operation to overlap with one already in
progress, as the test does. A fix, which moves retries into a separate thread
and tracks connection states more carefully, is forthcoming.
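
To illustrate the general idea only (this is not the glusterd patch, and it is
Python rather than glusterd's C), here is a minimal sketch of moving retries
into a separate thread while tracking connection state so that a second attach
cannot overlap one already in progress. Only the names attach_brick and
send_attach_req come from the crash reports above; Brick, ConnState,
attach_brick_async, the retry count, and the delay are all hypothetical.

```python
import threading
import time
from enum import Enum


class ConnState(Enum):
    """Hypothetical connection states for one brick."""
    DISCONNECTED = 1
    CONNECTING = 2
    CONNECTED = 3


class Brick:
    """Hypothetical stand-in for a brick; not glusterd's data structure."""

    def __init__(self, name, fail_first=0):
        self.name = name
        self.state = ConnState.DISCONNECTED
        self.state_lock = threading.Lock()   # guards self.state only
        self.remaining_failures = fail_first  # simulated transient failures
        self.attach_count = 0                 # successful attach requests


def send_attach_req(brick):
    """Simulated request: fails fail_first times, then succeeds."""
    if brick.remaining_failures > 0:
        brick.remaining_failures -= 1
        return False
    brick.attach_count += 1
    return True


def attach_brick_async(brick, retries=5, delay=0.01):
    """Start an attach in a worker thread, or return None if one is
    already in progress (or already completed) for this brick."""
    with brick.state_lock:
        if brick.state is not ConnState.DISCONNECTED:
            return None  # refuse to overlap with an existing operation
        brick.state = ConnState.CONNECTING

    def worker():
        # Retries sleep here, in the worker, so the caller never has to
        # drop and re-take a shared lock around the wait.
        ok = False
        for _ in range(retries):
            if send_attach_req(brick):
                ok = True
                break
            time.sleep(delay)
        with brick.state_lock:
            brick.state = ConnState.CONNECTED if ok else ConnState.DISCONNECTED

    thread = threading.Thread(target=worker)
    thread.start()
    return thread
```

In this sketch the state check and the transition to CONNECTING happen under
one short-lived lock, so a caller that arrives while retries are still running
is turned away instead of interleaving with them.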
--
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.