[Bugs] [Bug 1501154] Brick Multiplexing: Gluster volume start force complains with command "Error : Request timed out" when there are multiple volumes

bugzilla at redhat.com bugzilla at redhat.com
Thu Oct 12 17:18:56 UTC 2017


https://bugzilla.redhat.com/show_bug.cgi?id=1501154



--- Comment #4 from Worker Ant <bugzilla-bot at gluster.org> ---
COMMIT: https://review.gluster.org/18499 committed in release-3.12 by jiffin
tony Thottan (jthottan at redhat.com) 
------
commit 8aa0c34c5301a15a87c0cb168a89cb291e85d741
Author: Sanju Rakonde <srakonde at redhat.com>
Date:   Sat Oct 7 03:33:40 2017 +0530

    glusterd:Marking all the brick status as stopped when a process goes down
in brick multiplexing

    In brick multiplexing environment, if a brick process goes down
    i.e., if we kill it with SIGKILL, the status of the brick for which
    the process came up for the first time is only changing to stopped.
    all other brick statuses are remain started. This is happening because
    the process was killed abruptly using SIGKILL signal and signal
    handler wasn't invoked and further cleanup wasn't triggered.

    When we try to start a volume using force, it shows error saying
    "Request timed out", since all the brickinfo->status are still in
    started state, we're waiting for one of the brick process to come up
    which never going to happen since the brick process was killed.

    To resolve this, In the disconnect event, We are checking all the
    processes that whether the brick which got disconnected belongs the
    process. Once we get the process we are calling a function named
    glusterd_mark_bricks_stopped_by_proc() and sending brick_proc_t object as
    an argument.

    From the glusterd_brick_proc_t we can get all the bricks attached
    to that process. but these are duplicated ones. To get the original
    brickinfo we are reading volinfo from brick. In volinfo we will have
    original brickinfo copies. We are changing brickinfo->status to
    stopped for all the bricks.

    >Change-Id: Ifb9054b3ee081ef56b39b2903ae686984fe827e7
    >BUG: 1499509
    >Signed-off-by: Sanju Rakonde <srakonde at redhat.com>
    >Reviewed-on: https://review.gluster.org/#/c/18444/
    >Smoke: Gluster Build System <jenkins at build.gluster.org>
    >CentOS-regression: Gluster Build System <jenkins at build.gluster.org>
    >Reviewed-by: Atin Mukherjee <amukherj at redhat.com>
    >cherry picked from commit 9422446d72bc054962d72ace9912ecb885946d49)

    Change-Id: Ifb9054b3ee081ef56b39b2903ae686984fe827e7
    BUG: 1501154
    Signed-off-by: Sanju Rakonde <srakonde at redhat.com>

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list