[Bugs] [Bug 1241882] New: GlusterD cannot restart after being probed into a cluster.

bugzilla at redhat.com bugzilla at redhat.com
Fri Jul 10 10:16:04 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1241882

            Bug ID: 1241882
           Summary: GlusterD cannot restart after being probed into a
                    cluster.
           Product: GlusterFS
           Version: mainline
         Component: glusterd
          Assignee: kaushal at redhat.com
          Reporter: kaushal at redhat.com
                CC: bugs at gluster.org, gluster-bugs at redhat.com



In a cluster having 2 networks, sometimes when a new peer is added into the
cluster, glusterd on the new peer cannot restart. The restart fails because, it
cannot resolve bricks belonging the peer which probed the new peer into the
cluster. The resolution only fails if the bricks were created on the 2nd
network of the initiator peer, because the new peer doesn't know about the 2nd
network of the initiator. 

This is caused by race which hadn't been encountered before. The analysis is as
follows.

Assuming A, B and C as the peers. A and B are a cluster and have probed each
other on the 2 networks. C is probed from A.

During the probe, C is first validate by A. Once C is accepted, A sends and
update to both B and C to inform them of the each other. The update C gets from
A doesn't have A's second network information. C can only get this information
when B sends an update to C.

The problem faced here was that B didn't send an update to C. This happens
because B sending an update to C depends on the ordering of connection
establishment between B and C.

B and C both try to establish connections to each other once they receive A's
update and get to know of each other. If B establishes the connection first
then it sends and update to C. But if C establishes the connection first, B
will not send an update to C.

This is the first time this situation was observed. This doesn't happen always.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=f1xX0dQ4rV&a=cc_unsubscribe


More information about the Bugs mailing list