[Bugs] [Bug 1388221] New: gluster brick daemon segfaulted in pairs
bugzilla at redhat.com
Mon Oct 24 18:59:45 UTC 2016
https://bugzilla.redhat.com/show_bug.cgi?id=1388221
Bug ID: 1388221
Summary: gluster brick daemon segfaulted in pairs
Product: GlusterFS
Version: 3.8
Component: unclassified
Severity: high
Assignee: bugs at gluster.org
Reporter: jackie at drive.ai
CC: bugs at gluster.org
Description of problem:
We are running a distributed-replicated volume: 16 pairs of bricks (replica
count 2) across 2 nodes.
On Friday, 2 pairs of brick daemons segfaulted within minutes of each other,
leaving 2 subvolumes down (no replicas left). We brought them back up with
“volume start force”, which worked, but roughly 4 hours later the same thing
happened to two other pairs of bricks.
There is nothing of note in the brick logs for the downed bricks, except that
they suddenly stop logging. In the other logs (nfs, glustershd, etc.), we
simply start seeing errors saying “All sub volumes down” for those replica
sets.
This is on Ubuntu 16.04.
Version-Release number of selected component (if applicable):
3.8.2
How reproducible:
It has happened three times so far.
Steps to Reproduce:
1. Force-start the volume.
2. Wait for the crash.
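For reference, step 1 corresponds to the gluster CLI's force start. A minimal sketch, assuming a placeholder volume name `myvol` (the volume's real name is not given in this report). The `run` helper is hypothetical: by default it only prints each command, and it executes them when RUN=1 is set.

```shell
# Placeholder volume name; the real one is not stated in this report.
VOLNAME="${VOLNAME:-myvol}"

# Print a command, and execute it only when RUN=1 (dry run by default).
run() {
    echo "+ $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    fi
}

# Step 1: restart any brick processes of the volume that are down.
force_start() {
    run gluster volume start "$1" force
}

# Show which bricks are online, to compare before and after.
brick_status() {
    run gluster volume status "$1"
}

brick_status "$VOLNAME"
force_start "$VOLNAME"
```

Running it once as a dry run and then again with RUN=1 makes it easy to confirm the exact commands before they touch the cluster.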
Additional info:
The core file is too large to attach here (60-70 MB). Is there an alternative
way to submit it?
We did not see any stack traces anywhere.
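A text backtrace extracted from the core is usually far smaller than the core itself. A sketch using gdb's batch mode, assuming the brick binary is /usr/sbin/glusterfsd and using a placeholder core path (both are assumptions, not taken from this report). Matching debug symbols would need to be installed for the frames to be readable.

```shell
# Assumed locations; adjust both to the actual system.
BINARY="${BINARY:-/usr/sbin/glusterfsd}"
CORE="${CORE:-/path/to/core}"

# Build the gdb command line that dumps full backtraces for all threads.
bt_cmd() {
    printf 'gdb -batch -ex "thread apply all bt full" %s %s\n' "$1" "$2"
}

# Print the command. To run it and save the result, pipe it to sh, e.g.
#   bt_cmd "$BINARY" "$CORE" | sh > glusterfsd-backtrace.txt 2>&1
bt_cmd "$BINARY" "$CORE"
```

The block only prints the command, so it can be reviewed before being run against the real core file.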
--
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.