[Bugs] [Bug 1430148] New: USS is broken when multiplexing is on

bugzilla at redhat.com bugzilla at redhat.com
Tue Mar 7 23:45:32 UTC 2017


https://bugzilla.redhat.com/show_bug.cgi?id=1430148

            Bug ID: 1430148
           Summary: USS is broken when multiplexing is on
           Product: GlusterFS
           Version: mainline
         Component: glusterd
          Assignee: bugs at gluster.org
          Reporter: jdarcy at redhat.com
                CC: bugs at gluster.org



This manifests as a test failure in uss.t when we first try to access snap1
through USS.  The underlying problem is described in the commit message for the
patch I'll submit as soon as I have a bug number.

    This was causing USS tests to fail.  The underlying problem here is
    that if we try to queue the attach request too soon after starting a
    brick process then the socket code will get an error trying to write
    to the still-unconnected socket.  Its response is to shut down the
    socket, which causes the queued attach requests to be force-unwound.
    There's nothing to retry them, so they effectively never happen and
    those bricks (second and succeeding for a snapshot) never become
    available.

    We *do* have a retry loop for attach requests, but currently break out
    as soon as a request is queued - not actually sent.  The fix is to
    modify that loop so it will wait some more if the rpc connection isn't
    even complete yet.  Now we break out only when we have a completed
    connection *and* a queued request.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list