<div dir="ltr"><div>Hey,<br></div>     3.9.1 has reached its end of life. You can use either 3.8.x or 3.10.x, which are the active releases at the moment.<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, May 16, 2017 at 11:03 AM, Rafał Radecki <span dir="ltr">&lt;<a href="mailto:radecki.rafal@gmail.com" target="_blank">radecki.rafal@gmail.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Hi All.<div><br></div><div>I have a 9-node dockerized glusterfs cluster and I am seeing the following situation:</div><div>1) the docker daemon on the 8th node fails and, as a result, glusterd on this node leaves the cluster</div><div>2) as a result, on the 1st node I see a message about the 8th node being unavailable:</div><div><br></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><div>[2017-05-15 12:48:22.142865] I [MSGID: 106004] [glusterd-handler.c:5808:__<wbr>glusterd_peer_rpc_notify] 0-management: Peer &lt;10.10.10.8&gt; (&lt;5cb55b7a-1e04-4fb8-bd1d-<wbr>55ee647719d2&gt;), in state &lt;Peer in Cluster&gt;, has disconnected from glusterd.</div></div><div><div>[2017-05-15 12:48:22.167746] W [glusterd-locks.c:675:<wbr>glusterd_mgmt_v3_unlock] (--&gt;/usr/lib64/glusterfs/3.9.<wbr>1/xlator/mgmt/glusterd.so(+<wbr>0x2035a) [0x7f7d9d62535a] --&gt;/usr/lib64/glusterfs/3.9.1/<wbr>xlator/mgmt/glusterd.so(+<wbr>0x29f48) [0x7f7d9d62ef48] --&gt;/usr/lib64/glusterfs/3.9.1/xlator/mgmt/<wbr>glusterd.so(+0xd50aa) [0x7f7d9d6da0aa] ) 0-management: Lock for vol csv not held</div></div><div><div>[2017-05-15 12:48:22.167767] W [MSGID: 106118] [glusterd-handler.c:5833:__<wbr>glusterd_peer_rpc_notify] 0-management: Lock not released for csv</div></div></blockquote><div><br></div><div>and the gluster share is unavailable; when I try to list it I get:</div><div><br></div><div>Transport endpoint is not connected<br></div><div>3) then on the 5th node I see a message similar to 2) about 
the 1st node being unavailable, and the 5th node also disconnects from the cluster</div><div><br></div><blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><div>[2017-05-15 12:52:54.321189] W [glusterd-locks.c:675:<wbr>glusterd_mgmt_v3_unlock] (--&gt;/usr/lib64/glusterfs/3.9.<wbr>1/xlator/mgmt/glusterd.so(+<wbr>0x2035a) [0x7f7fda22335a] --&gt;/usr/lib64/glusterfs/3.9.1/<wbr>xlator/mgmt/glusterd.so(+<wbr>0x29f48) [0x7f7fda22cf48] --&gt;/usr/lib64/glusterfs/3.9.1/xlator/mgmt/<wbr>glusterd.so(+0xd50aa) [0x7f7fda2d80aa] ) 0-management: Lock for vol csv not held</div></div><div><div><br></div></div><div><div>[2017-05-15 12:52:54.321200] W [MSGID: 106118] [glusterd-handler.c:5833:__<wbr>glusterd_peer_rpc_notify] 0-management: Lock not released for csv</div></div><div><div><br></div></div><div><div>[2017-05-15 12:53:04.659418] E [socket.c:2307:socket_connect_<wbr>finish] 0-management: connection to <a href="http://10.10.10.:24007" target="_blank">10.10.10.:24007</a> failed (Connection refused)</div></div></blockquote><div><br></div><div>I am quite new to gluster, but as far as I can see this is a chain reaction in which the failure of the first node leads to the disconnection of two other nodes. Any hints on how to solve this? Are there any settings for retries/timeouts/reconnects in gluster that could help in my case?</div><div><br></div><div>Thanks for any help!</div><div><br></div><div>BR,</div><div>Rafal.</div></div>

<br>______________________________<wbr>_________________<br>

Gluster-users mailing list<br>

<a href="mailto:Gluster-users@gluster.org">Gluster-users@gluster.org</a><br>

<a href="http://lists.gluster.org/mailman/listinfo/gluster-users" rel="noreferrer" target="_blank">http://lists.gluster.org/<wbr>mailman/listinfo/gluster-users</a><br></blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr">Pranith<br></div></div>

</div>