[Bugs] [Bug 1430148] New: USS is broken when multiplexing is on
bugzilla at redhat.com
bugzilla at redhat.com
Tue Mar 7 23:45:32 UTC 2017
https://bugzilla.redhat.com/show_bug.cgi?id=1430148
Bug ID: 1430148
Summary: USS is broken when multiplexing is on
Product: GlusterFS
Version: mainline
Component: glusterd
Assignee: bugs at gluster.org
Reporter: jdarcy at redhat.com
CC: bugs at gluster.org
This manifests as a test failure in uss.t when we first try to access snap1
through USS. The underlying problem is described in the commit message for the
patch I'll submit as soon as I have a bug number.
This was causing USS tests to fail. The underlying problem here is
that if we try to queue the attach request too soon after starting a
brick process then the socket code will get an error trying to write
to the still-unconnected socket. Its response is to shut down the
socket, which causes the queued attach requests to be force-unwound.
There's nothing to retry them, so they effectively never happen and
those bricks (second and succeeding for a snapshot) never become
available.
We *do* have a retry loop for attach requests, but currently break out
as soon as a request is queued - not actually sent. The fix is to
modify that loop so it will wait some more if the rpc connection isn't
even complete yet. Now we break out only when we have a completed
connection *and* a queued request.
--
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
More information about the Bugs
mailing list