[Bugs] [Bug 1367270] [HC]: After bringing down and up of the bricks VM' s are getting paused

bugzilla at redhat.com bugzilla at redhat.com
Mon Aug 22 06:40:42 UTC 2016


https://bugzilla.redhat.com/show_bug.cgi?id=1367270



--- Comment #4 from Worker Ant <bugzilla-bot at gluster.org> ---
COMMIT: http://review.gluster.org/15222 committed in release-3.7 by Pranith
Kumar Karampuri (pkarampu at redhat.com) 
------
commit febaa1e46d3a91a29c4786a17abf29cfc7178254
Author: Krutika Dhananjay <kdhananj at redhat.com>
Date:   Thu Jul 28 21:29:59 2016 +0530

    cluster/afr: Prevent split-brain when bricks are brought off and on in
cyclic order

            Backport of: http://review.gluster.org/15080

    When the bricks are brought offline and then online in cyclic
    order while writes are in progress on a file, thanks to inode
    refresh in write txns, AFR will mostly fail the write attempt
    when the only good copy is offline. However, there is still a
    remote possibility that the file will run into split-brain if
    the brick that has the lone good copy goes offline *after* the
    inode refresh but *before* the write txn completes (I call it
    in-flight split-brain in the patch for ease of reference),
    requiring intervention from admin to resolve the split-brain
    before the IO can resume normally on the file. To get around this,
    the patch does the following things:
    i) retains the dirty xattrs on the file
    ii) avoids marking the last of the good copies as bad (or accused)
        in case it is the one to go down during the course of a write.
    iii) fails that particular write with the appropriate errno.

    This way, we still have one good copy left despite the split-brain
situation
    which when it is back online, will be chosen as source to do the heal.

    Change-Id: I7c13c6ddd5b8fe88b0f2684e8ce5f4a9c3a24a08
    BUG: 1367270
    Signed-off-by: Krutika Dhananjay <kdhananj at redhat.com>
    Reviewed-on: http://review.gluster.org/15222
    Smoke: Gluster Build System <jenkins at build.gluster.org>
    NetBSD-regression: NetBSD Build System <jenkins at build.gluster.org>
    CentOS-regression: Gluster Build System <jenkins at build.gluster.org>
    Reviewed-by: Oleksandr Natalenko <oleksandr at natalenko.name>
    Reviewed-by: Pranith Kumar Karampuri <pkarampu at redhat.com>

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list