[Bugs] [Bug 1054694] A replicated volume takes too much to come online when one server is down

Wed Apr 27 07:35:22 UTC 2016

https://bugzilla.redhat.com/show_bug.cgi?id=1054694


--- Comment #14 from Vijay Bellur <vbellur at redhat.com> ---
COMMIT: http://review.gluster.org/11113 committed in master by Pranith Kumar
Karampuri (pkarampu at redhat.com) 
------
commit 3c35329feb4dd479c9e4856ee27fa4b12c708db2
Author: Ravishankar N <ravishankar at redhat.com>
Date:   Wed Dec 23 13:49:14 2015 +0530

    afr: propagate child up event after timeout

    Problem: During mount, afr waits for response from all its children before
    notifying the parent xlator. In a 1x2 replica volume , if one of the nodes
is
    down, the mount will hang for more than a minute until child down is
received
    from the client xlator for that node.

    Fix:
    When parent up is received by afr, start a 10 second timer. In the timer
call
    back, if we receive a successful child up from atleast one brick, propagate
the
    event to the parent xlator.

    Change-Id: I31e57c8802c1a03a4a5d581ee4ab82f3a9c8799d
    BUG: 1054694
    Signed-off-by: Ravishankar N <ravishankar at redhat.com>
    Signed-off-by: Pranith Kumar K <pkarampu at redhat.com>
    Reviewed-on: http://review.gluster.org/11113
    NetBSD-regression: NetBSD Build System <jenkins at build.gluster.org>
    Smoke: Gluster Build System <jenkins at build.gluster.com>
    CentOS-regression: Gluster Build System <jenkins at build.gluster.com>

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=k6Sd6Xrr0f&a=cc_unsubscribe