<div dir="ltr"><div><div>Can you please provide the output of the following commands from all the nodes:<br><br></div>cat /var/lib/glusterd/glusterd.info<br></div>cat /var/lib/glusterd/peers/*<br><br></div><div class="gmail_extra"><br><div class="gmail_quote">On Wed, May 10, 2017 at 5:02 PM, Pawan Alwandi <span dir="ltr">&lt;<a href="mailto:pawan@platform.sh" target="_blank">pawan@platform.sh</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div><div>Hello,<br><br></div>I'm trying to upgrade gluster from 3.6.2 to 3.10.1 but don't see the glusterfsd and glusterfs processes coming up. <a href="http://gluster.readthedocs.io/en/latest/Upgrade-Guide/upgrade_to_3.10/" target="_blank">http://gluster.readthedocs.io/<wbr>en/latest/Upgrade-Guide/<wbr>upgrade_to_3.10/</a> is the process that I'm trying to follow.<br><br></div>This is a 3-node server setup with a replicated volume having a replica count of 3.<br><br></div>Logs below:<br><div><div><div><div><br><span style="font-family:monospace,monospace">[2017-05-10 09:07:03.507959] I [MSGID: 100030] [glusterfsd.c:2460:main] 0-/usr/sbin/glusterd: Started running /usr/sbin/glusterd version 3.10.1 (args: /usr/sbin/glusterd -p /var/run/glusterd.pid)<br>[2017-05-10 09:07:03.512827] I [MSGID: 106478] [glusterd.c:1449:init] 0-management: Maximum allowed open file descriptors set to 65536<br>[2017-05-10 09:07:03.512855] I [MSGID: 106479] [glusterd.c:1496:init] 0-management: Using /var/lib/glusterd as working directory<br>[2017-05-10 09:07:03.520426] W [MSGID: 103071] [rdma.c:4590:__gf_rdma_ctx_<wbr>create] 0-rpc-transport/rdma: rdma_cm event channel creation failed [No such device]<br>[2017-05-10 09:07:03.520452] W [MSGID: 103055] [rdma.c:4897:init] 0-rdma.management: Failed to initialize IB Device<br>[2017-05-10 09:07:03.520465] W [rpc-transport.c:350:rpc_<wbr>transport_load] 0-rpc-transport: 'rdma' initialization 
failed<br>[2017-05-10 09:07:03.520518] W [rpcsvc.c:1661:rpcsvc_create_<wbr>listener] 0-rpc-service: cannot create listener, initing the transport failed<br>[2017-05-10 09:07:03.520534] E [MSGID: 106243] [glusterd.c:1720:init] 0-management: creation of 1 listeners failed, continuing with succeeded transport<br>[2017-05-10 09:07:04.931764] I [MSGID: 106513] [glusterd-store.c:2197:<wbr>glusterd_restore_op_version] 0-glusterd: retrieved op-version: 30600<br>[2017-05-10 09:07:04.964354] I [MSGID: 106544] [glusterd.c:158:glusterd_uuid_<wbr>init] 0-management: retrieved UUID: 7f2a6e11-2a53-4ab4-9ceb-<wbr>8be6a9f2d073<br>[2017-05-10 09:07:04.993944] I [MSGID: 106498] [glusterd-handler.c:3669:<wbr>glusterd_friend_add_from_<wbr>peerinfo] 0-management: connect returned 0<br>[2017-05-10 09:07:04.995864] I [MSGID: 106498] [glusterd-handler.c:3669:<wbr>glusterd_friend_add_from_<wbr>peerinfo] 0-management: connect returned 0<br>[2017-05-10 09:07:04.995879] W [MSGID: 106062] [glusterd-handler.c:3466:<wbr>glusterd_transport_inet_<wbr>options_build] 0-glusterd: Failed to get tcp-user-timeout<br>[2017-05-10 09:07:04.995903] I [rpc-clnt.c:1059:rpc_clnt_<wbr>connection_init] 0-management: setting frame-timeout to 600<br>[2017-05-10 09:07:04.996325] I [rpc-clnt.c:1059:rpc_clnt_<wbr>connection_init] 0-management: setting frame-timeout to 600<br>Final graph:<br>+-----------------------------<wbr>------------------------------<wbr>-------------------+<br> 1: volume management<br> 2: type mgmt/glusterd<br> 3: option rpc-auth.auth-glusterfs on<br> 4: option rpc-auth.auth-unix on<br> 5: option rpc-auth.auth-null on<br> 6: option rpc-auth-allow-insecure on<br> 7: option transport.socket.listen-<wbr>backlog 128<br> 8: option event-threads 1<br> 9: option ping-timeout 0<br> 10: option transport.socket.read-fail-log off<br> 11: option transport.socket.keepalive-<wbr>interval 2<br> 12: option transport.socket.keepalive-<wbr>time 10<br> 13: option transport-type rdma<br> 14: option 
working-directory /var/lib/glusterd<br> 15: end-volume<br> 16: <br>+-----------------------------<wbr>------------------------------<wbr>-------------------+<br>[2017-05-10 09:07:04.996310] W [MSGID: 106062] [glusterd-handler.c:3466:<wbr>glusterd_transport_inet_<wbr>options_build] 0-glusterd: Failed to get tcp-user-timeout<br>[2017-05-10 09:07:05.000461] I [MSGID: 101190] [event-epoll.c:629:event_<wbr>dispatch_epoll_worker] 0-epoll: Started thread with index 1<br>[2017-05-10 09:07:05.001493] W [socket.c:593:__socket_rwv] 0-management: readv on 192.168.0.7:24007 failed (No data available)<br>[2017-05-10 09:07:05.001513] I [MSGID: 106004] [glusterd-handler.c:5882:__<wbr>glusterd_peer_rpc_notify] 0-management: Peer &lt;192.168.0.7&gt; (&lt;5ec54b4f-f60c-48c6-9e55-<wbr>95f2bb58f633&gt;), in state &lt;Peer in Cluster&gt;, has disconnected from glusterd.<br>[2017-05-10 09:07:05.001677] W [glusterd-locks.c:675:<wbr>glusterd_mgmt_v3_unlock] (-->/usr/lib/x86_64-linux-gnu/<wbr>glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0x20559) [0x7f0bf9d74559] -->/usr/lib/x86_64-linux-gnu/glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0x29cf0) [0x7f0bf9d7dcf0] -->/usr/lib/x86_64-linux-gnu/<wbr>glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0xd5ba3) [0x7f0bf9e29ba3] ) 0-management: Lock for vol shared not held<br>[2017-05-10 09:07:05.001696] W [MSGID: 106118] [glusterd-handler.c:5907:__<wbr>glusterd_peer_rpc_notify] 0-management: Lock not released for shared<br>[2017-05-10 09:07:05.003099] E [rpc-clnt.c:365:saved_frames_<wbr>unwind] (--> /usr/lib/x86_64-linux-gnu/<wbr>libglusterfs.so.0(_gf_log_<wbr>callingfn+0x13c)[<wbr>0x7f0bfeeca73c] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(saved_frames_unwind+0x1cf)[<wbr>0x7f0bfec904bf] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(saved_frames_<wbr>destroy+0xe)[0x7f0bfec905de] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(rpc_clnt_<wbr>connection_cleanup+0x91)[0x7f0bfec91c21] 
(--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(rpc_clnt_notify+<wbr>0x290)[0x7f0bfec92710] ))))) 0-management: forced unwinding frame type(GLUSTERD-DUMP) op(DUMP(1)) called at 2017-05-10 09:07:05.000627 (xid=0x1)<br>[2017-05-10 09:07:05.003129] E [MSGID: 106167] [glusterd-handshake.c:2181:__<wbr>glusterd_peer_dump_version_<wbr>cbk] 0-management: Error through RPC layer, retry again later<br>[2017-05-10 09:07:05.003251] W [socket.c:593:__socket_rwv] 0-management: readv on 192.168.0.6:24007 failed (No data available)<br>[2017-05-10 09:07:05.003267] I [MSGID: 106004] [glusterd-handler.c:5882:__<wbr>glusterd_peer_rpc_notify] 0-management: Peer &lt;192.168.0.6&gt; (&lt;83e9a0b9-6bd5-483b-8516-<wbr>d8928805ed95&gt;), in state &lt;Peer in Cluster&gt;, has disconnected from glusterd.<br>[2017-05-10 09:07:05.003318] W [glusterd-locks.c:675:<wbr>glusterd_mgmt_v3_unlock] (-->/usr/lib/x86_64-linux-gnu/<wbr>glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0x20559) [0x7f0bf9d74559] -->/usr/lib/x86_64-linux-gnu/glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0x29cf0) [0x7f0bf9d7dcf0] -->/usr/lib/x86_64-linux-gnu/<wbr>glusterfs/3.10.1/xlator/mgmt/<wbr>glusterd.so(+0xd5ba3) [0x7f0bf9e29ba3] ) 0-management: Lock for vol shared not held<br>[2017-05-10 09:07:05.003329] W [MSGID: 106118] [glusterd-handler.c:5907:__<wbr>glusterd_peer_rpc_notify] 0-management: Lock not released for shared<br>[2017-05-10 09:07:05.003457] E [rpc-clnt.c:365:saved_frames_<wbr>unwind] (--> /usr/lib/x86_64-linux-gnu/<wbr>libglusterfs.so.0(_gf_log_<wbr>callingfn+0x13c)[<wbr>0x7f0bfeeca73c] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(saved_frames_unwind+0x1cf)[<wbr>0x7f0bfec904bf] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(saved_frames_<wbr>destroy+0xe)[0x7f0bfec905de] (--> /usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(rpc_clnt_<wbr>connection_cleanup+0x91)[0x7f0bfec91c21] (--> 
/usr/lib/x86_64-linux-gnu/<wbr>libgfrpc.so.0(rpc_clnt_notify+<wbr>0x290)[0x7f0bfec92710] ))))) 0-management: forced unwinding frame type(GLUSTERD-DUMP) op(DUMP(1)) called at 2017-05-10 09:07:05.001407 (xid=0x1)</span><br><br></div><div>There are a bunch of errors reported, but I'm not sure which are signal and which are noise. Does anyone have any idea what's going on here?<br><br></div><div>Thanks,<br></div><div>Pawan<br></div><div><br></div></div></div></div></div>
</blockquote></div><br></div>