[Gluster-users] Transport Endpoint Not Connected When Writing a Lot of Files

Amar Tumballi amarts at gmail.com
Mon Oct 14 10:47:07 UTC 2019


One of the host ( 134.21.57.122 <http://134.21.57.122:24007/>) is not
reachable from your network. Also checking at the IP, it would have gotten
resolved to something else than expected. Can you check if 'diufnas22' is
properly resolved?

-Amar

On Mon, Oct 14, 2019 at 3:44 PM DUCARROZ Birgit <birgit.ducarroz at unifr.ch>
wrote:

> Thank you.
> I checked the logs but the information was not clear to me.
>
> I add the log of two different crashes. I will do an upgrade to
> glusterFS 6 in some weeks. Actually I cannot interrupt user activity on
> these servers since we are in the middle of the uni-semester.
>
> If these logfiles reveal something interesting to you, would be nice to
> get a hint.
>
>
> ol-data-client-2. Client process will keep trying to connect to glusterd
> until brick's port is available
> [2019-09-16 19:05:34.028164] E [rpc-clnt.c:348:saved_frames_unwind] (-->
> /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(_gf_log_callingfn+0x13b)[0x7ff167753ddb]
>
> (--> /usr/lib/x86_64-linux-gnu/libgfrpc.so.0(+0xc021)[0x7ff167523021]
> (--> /usr/lib/x86_64-linux-gnu/libgfrpc.so.0(+0xc14e)[0x7ff16752314e]
> (-->
> /usr/lib/x86_64-linux-gnu/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x8e)[0x7ff1675246be]
>
> (--> /usr/lib/x86_64-linux-gnu/libgfrpc.so.0(+0xe268)[0x7ff167525268]
> ))))) 0-vol-data-client-2: forced unwinding frame type(GlusterFS 4.x v1)
> op(FSTAT(25)) called at 2019-09-16 19:05:28.736873 (xid=0x113aecf)
> [2019-09-16 19:05:34.028206] W [MSGID: 114031]
> [client-rpc-fops_v2.c:1260:client4_0_fstat_cbk] 0-vol-data-client-2:
> remote operation failed [Transport endpoint is not connected]
> [2019-09-16 19:05:44.970828] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-09-16 19:05:44.971030] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-09-16 19:05:44.971165] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-2:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-09-16 19:05:47.971375] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
>
> [2019-09-16 19:05:44.971200] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-2: disconnected from
> vol-data-client-2. Client process will keep trying to connect to
> glusterd until brick's port is available
>
>
>
> [2019-09-17 07:43:44.807182] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-09-17 07:43:44.807217] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-0: disconnected from
> vol-data-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-09-17 07:43:44.807228] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> Final graph:
>
> +------------------------------------------------------------------------------+
>    1: volume vol-data-client-0
>    2:     type protocol/client
>    3:     option ping-timeout 42
>    4:     option remote-host diufnas20
>    5:     option remote-subvolume /bigdisk/brick1/vol-data
>    6:     option transport-type socket
>    7:     option transport.address-family inet
>    8:     option username a14ffa1b-b64e-410c-894d-435c18e81b2d
>    9:     option password 37ba4281-166d-40fd-9ef0-08a187d1107b
>   10:     option transport.tcp-user-timeout 0
>   11:     option transport.socket.keepalive-time 20
>   12:     option transport.socket.keepalive-interval 2
>   13:     option transport.socket.keepalive-count 9
>   14:     option send-gids true
>   15: end-volume
>   16:
>   17: volume vol-data-client-1
>   18:     type protocol/client
>   19:     option ping-timeout 42
>   20:     option remote-host diufnas21
>   21:     option remote-subvolume /bigdisk/brick2/vol-data
>   22:     option transport-type socket
>   23:     option transport.address-family inet
>   24:     option username a14ffa1b-b64e-410c-894d-435c18e81b2d
>   25:     option password 37ba4281-166d-40fd-9ef0-08a187d1107b
>   26:     option transport.tcp-user-timeout 0
>   27:     option transport.socket.keepalive-time 20
> 29:     option transport.socket.keepalive-count 9
>   30:     option send-gids true
>   31: end-volume
>   32:
>   33: volume vol-data-client-2
>   34:     type protocol/client
>   35:     option ping-timeout 42
>   36:     option remote-host diufnas22
>   37:     option remote-subvolume /bigdisk/brick3/vol-data
>   38:     option transport-type socket
>   39:     option transport.address-family inet
>   40:     option username a14ffa1b-b64e-410c-894d-435c18e81b2d
>   41:     option password 37ba4281-166d-40fd-9ef0-08a187d1107b
>   42:     option transport.tcp-user-timeout 0
>   43:     option transport.socket.keepalive-time 20
>   44:     option transport.socket.keepalive-interval 2
>   45:     option transport.socket.keepalive-count 9
>   46:     option send-gids true
>   47: end-volume
>   48:
> 49: volume vol-data-replicate-0
>   50:     type cluster/replicate
>   51:     option afr-pending-xattr
> vol-data-client-0,vol-data-client-1,vol-data-client-2
>   52:     option arbiter-count 1
>   53:     option use-compound-fops off
>   54:     subvolumes vol-data-client-0 vol-data-client-1 vol-data-client-2
>   55: end-volume
>   56:
>   57: volume vol-data-dht
>   58:     type cluster/distribute
>   59:     option min-free-disk 10%
>   60:     option lock-migration off
>   61:     option force-migration off
>   62:     subvolumes vol-data-replicate-0
>   63: end-volume
>   64:
>   65: volume vol-data-write-behind
>   66:     type performance/write-behind
>   67:     subvolumes vol-data-dht
>   68: end-volume
>   69:
>   70: volume vol-data-read-ahead
>   71:     type performance/read-ahead
>   72:     subvolumes vol-data-write-behind
>   73: end-volume
>   74:
>   75: volume vol-data-readdir-ahead
>   76:     type performance/readdir-ahead
>   77:     option parallel-readdir off
>   78:     option rda-request-size 131072
>   79:     option rda-cache-limit 10MB
>   80:     subvolumes vol-data-read-ahead
>   81: end-volume
>   82:
>   83: volume vol-data-io-cache
>   84:     type performance/io-cache
>   85:     option max-file-size 256MB
>   86:     option cache-size 28GB
>   87:     subvolumes vol-data-readdir-ahead
>   88: end-volume
>   89:
>   90: volume vol-data-quick-read
>   91:     type performance/quick-read
>   92:     option cache-size 28GB
>   93:     subvolumes vol-data-io-cache
>   94: end-volume
>   95:
>   96: volume vol-data-open-behind
>   97:     type performance/open-behind
>   98:     subvolumes vol-data-quick-read
>   99: end-volume
> 100:
> 101: volume vol-data-md-cache
> 102:     type performance/md-cache
> 103:     subvolumes vol-data-open-behind
> 104: end-volume
> 105:
> 106: volume vol-data-io-threads
> 107:     type performance/io-threads
> 108:     subvolumes vol-data-md-cache
> 109: end-volume
> 110:
> 111: volume vol-data
> 112:     type debug/io-stats
> 113:     option log-level INFO
> 114:     option latency-measurement off
> 115:     option count-fop-hits off
> 116:     subvolumes vol-data-io-threads
> 117: end-volume
> 118:
> 119: volume meta-autoload
> 120:     type meta
> 121:     subvolumes vol-data
> 122: end-volume
> 123:
>
> +------------------------------------------------------------------------------+
> [2019-09-17 07:43:47.249546] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-09-17 07:43:48.801700] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
>
>
>
> root at nas20:/var/log/glusterfs# dmesg |grep error
> [    2.463658] i8042: probe of i8042 failed with error -5
> [    8.180404] EXT4-fs (sdb1): re-mounted. Opts: errors=remount-ro
> [   10.024111] EXT4-fs (sda): mounted filesystem with ordered data mode.
> Opts: errors=remount-ro
> [   64.432042] ureadahead[1478]: segfault at 7f4b99d3d2c0 ip
> 00005629096fe2d1 sp 00007fff9dc98250 error 6 in
> ureadahead[5629096fa000+8000]
>
>
> root at nas20:/var/log/glusterfs# cat export-users.log | grep "2019-10-08 20"
> [2019-10-08 20:10:33.695082] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-users /export/users)
> [2019-10-08 20:10:33.712430] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-08 20:10:33.816594] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-08 20:10:33.820975] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-0: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:33.821257] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-1: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:33.821466] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-2: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:33.822271] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:33.822425] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:33.822484] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-users-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-08 20:10:33.822518] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-users-client-0: disconnected
> from vol-users-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-08 20:10:33.822528] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-users-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-08 20:10:36.387074] E [socket.c:2524:socket_connect_finish]
> 0-vol-users-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-10-08 20:10:36.387120] E [socket.c:2524:socket_connect_finish]
> 0-vol-users-client-1: connection to 192.168.1.121:24007 failed (No route
> to host); disconnecting socket
> [2019-10-08 20:10:36.388236] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-08 20:10:36.388254] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> The message "E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-users-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up." repeated 2 times between [2019-10-08
> 20:10:33.822528] and [2019-10-08 20:10:36.387272]
> [2019-10-08 20:10:36.388596] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-users-replicate-0: no subvolumes
> up
> [2019-10-08 20:10:36.388667] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-users-dht: dict is null
> [2019-10-08 20:10:36.388724] E [fuse-bridge.c:4362:fuse_first_lookup]
> 0-fuse: first lookup on root failed (Transport endpoint is not connected)
> [2019-10-08 20:10:36.388847] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-users-replicate-0: no subvolumes
> up
> [2019-10-08 20:10:36.388864] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-users-dht: dict is null
> [2019-10-08 20:10:36.388883] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.388893] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 2: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.391191] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-users-replicate-0: no subvolumes
> up
> [2019-10-08 20:10:36.391218] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-users-dht: dict is null
> [2019-10-08 20:10:36.391241] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.391250] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 3: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.391317] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-users-replicate-0: no subvolumes
> up
> [2019-10-08 20:10:36.391333] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-users-dht: dict is null
> [2019-10-08 20:10:36.391352] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.391360] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 4: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.406967] I [fuse-bridge.c:5199:fuse_thread_proc]
> 0-fuse: initating unmount of /export/users
> [2019-10-08 20:10:36.407298] W [glusterfsd.c:1514:cleanup_and_exit]
> (-->/lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7f88cc59b6ba]
> -->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xed) [0x55c01427f70d]
> -->/usr/sbin/glusterfs(cleanup_and_exit+0x54) [0x55c01427f524] ) 0-:
> received signum (15), shutting down
> [2019-10-08 20:10:36.407318] I [fuse-bridge.c:5981:fini] 0-fuse:
> Unmounting '/export/users'.
> [2019-10-08 20:10:36.407326] I [fuse-bridge.c:5986:fini] 0-fuse: Closing
> fuse connection to '/export/users'.
> [2019-10-08 20:10:43.925719] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-users /export/users)
> [2019-10-08 20:10:43.929529] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-08 20:10:43.933210] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-08 20:10:43.933789] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-0: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:43.934151] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-1: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:43.934174] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.934269] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.934331] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-users-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-08 20:10:43.934369] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-users-client-0: disconnected
> from vol-users-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-08 20:10:43.934379] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-users-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-08 20:10:43.934434] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-users-client-2: parent translators are ready, attempting connect
> on transport
> [2019-10-08 20:10:43.934574] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.934782] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.934859] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.934931] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-users-client-1: changing port to 49154 (from 0)
> [2019-10-08 20:10:43.935152] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.935286] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-users-client-2: changing port to 49154 (from 0)
> [2019-10-08 20:10:43.935314] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.935515] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.935711] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.935919] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:43.936354] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-users-client-1:
> Connected to vol-users-client-1, attached to remote volume
> '/bigdisk/brick2/vol-users'.
> [2019-10-08 20:10:43.936375] I [MSGID: 108005]
> [afr-common.c:5336:__afr_handle_child_up_event] 0-vol-users-replicate-0:
> Subvolume 'vol-users-client-1' came back up; going online.
> [2019-10-08 20:10:43.936728] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-users-client-2:
> Connected to vol-users-client-2, attached to remote volume
> '/bigdisk/brick3/vol-users'.
> [2019-10-08 20:10:43.936742] I [MSGID: 108002]
> [afr-common.c:5611:afr_notify] 0-vol-users-replicate-0: Client-quorum is
> met
> [2019-10-08 20:10:43.937579] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-08 20:10:43.937595] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> [2019-10-08 20:10:43.939789] I [MSGID: 109005]
> [dht-selfheal.c:2342:dht_selfheal_directory] 0-vol-users-dht: Directory
> selfheal failed: Unable to form layout for directory /
> [2019-10-08 20:10:47.927439] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.927555] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.927627] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-users-client-0: changing port to 49152 (from 0)
> [2019-10-08 20:10:47.928087] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.928201] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-users-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.928717] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-users-client-0:
> Connected to vol-users-client-0, attached to remote volume
> '/bigdisk/brick1/vol-users'.
> root at nas20:/var/log/glusterfs# cat export-users.log | grep "2019-10-08 22"
> root at nas20:/var/log/glusterfs# cat export-users.log | grep "2019-10-08 21"
> root at nas20:/var/log/glusterfs# cat export-users.log | grep "2019-10-08 23"
> root at nas20:/var/log/glusterfs# cat export-data.log.log | grep
> "2019-10-08 23"
> cat: export-data.log.log: No such file or directory
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 15"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 16"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 17"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 19"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 1"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-08 20"
> [2019-10-08 20:10:33.695000] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-data /export/data)
> [2019-10-08 20:10:33.737302] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-08 20:10:33.816578] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-08 20:10:33.820946] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-0: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:33.821255] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-1: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:33.821467] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-2: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:33.822144] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:33.822243] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:33.822374] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-08 20:10:33.822412] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-0: disconnected from
> vol-data-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-08 20:10:33.822423] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-08 20:10:36.387062] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-10-08 20:10:36.387091] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-1: connection to 192.168.1.121:24007 failed (No route
> to host); disconnecting socket
> [2019-10-08 20:10:36.388218] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-08 20:10:36.388237] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> The message "E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up." repeated 2 times between [2019-10-08
> 20:10:33.822423] and [2019-10-08 20:10:36.387268]
> [2019-10-08 20:10:36.388590] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-08 20:10:36.388630] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-08 20:10:36.388723] E [fuse-bridge.c:4362:fuse_first_lookup]
> 0-fuse: first lookup on root failed (Transport endpoint is not connected)
> [2019-10-08 20:10:36.388855] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-08 20:10:36.388871] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-08 20:10:36.388892] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.388902] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 2: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.390447] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-08 20:10:36.390480] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-08 20:10:36.390503] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.390513] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 3: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.390580] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-08 20:10:36.390595] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-08 20:10:36.390614] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:36.390622] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 4: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:36.410905] I [fuse-bridge.c:5199:fuse_thread_proc]
> 0-fuse: initating unmount of /export/data
> [2019-10-08 20:10:36.411091] W [glusterfsd.c:1514:cleanup_and_exit]
> (-->/lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7ff189f586ba]
> -->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xed) [0x55946f24b70d]
> -->/usr/sbin/glusterfs(cleanup_and_exit+0x54) [0x55946f24b524] ) 0-:
> received signum (15), shutting down
> [2019-10-08 20:10:36.411113] I [fuse-bridge.c:5981:fini] 0-fuse:
> Unmounting '/export/data'.
> [2019-10-08 20:10:36.411122] I [fuse-bridge.c:5986:fini] 0-fuse: Closing
> fuse connection to '/export/data'.
> [2019-10-08 20:10:36.845106] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-data /export/data)
> [2019-10-08 20:10:36.848865] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-08 20:10:36.852064] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-08 20:10:36.852477] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-0: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:36.852694] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-1: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:36.852773] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:36.852877] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:36.852917] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-2: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:36.852947] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-08 20:10:36.852980] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-0: disconnected from
> vol-data-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-08 20:10:36.852990] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-08 20:10:37.387355] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:37.387579] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:37.387706] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-1: changing port to 49156 (from 0)
> [2019-10-08 20:10:37.388065] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:37.388253] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:37.389087] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-1:
> Connected to vol-data-client-1, attached to remote volume
> '/bigdisk/brick2/vol-data'.
> [2019-10-08 20:10:37.389102] I [MSGID: 108005]
> [afr-common.c:5336:__afr_handle_child_up_event] 0-vol-data-replicate-0:
> Subvolume 'vol-data-client-1' came back up; going online.
> [2019-10-08 20:10:39.387062] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-10-08 20:10:39.389703] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-08 20:10:39.389740] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> [2019-10-08 20:10:39.411859] I [glusterfsd-mgmt.c:53:mgmt_cbk_spec]
> 0-mgmt: Volume file changed
> [2019-10-08 20:10:40.832633] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-08 20:10:40.832712] E [fuse-bridge.c:4362:fuse_first_lookup]
> 0-fuse: first lookup on root failed (Transport endpoint is not connected)
> [2019-10-08 20:10:40.834248] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:40.834281] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 2: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:40.837624] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:40.837659] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 3: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:40.839468] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-08 20:10:40.839503] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 4: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-08 20:10:40.847013] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:40.847219] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:40.847368] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-2: changing port to 49158 (from 0)
> [2019-10-08 20:10:40.847725] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:40.847906] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> The message "E [MSGID: 101046] [dht-common.c:1502:dht_lookup_dir_cbk]
> 0-vol-data-dht: dict is null" repeated 3 times between [2019-10-08
> 20:10:40.832633] and [2019-10-08 20:10:40.839454]
> [2019-10-08 20:10:40.848759] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-2:
> Connected to vol-data-client-2, attached to remote volume
> '/bigdisk/brick3/vol-data'.
> [2019-10-08 20:10:40.848785] I [MSGID: 108002]
> [afr-common.c:5611:afr_notify] 0-vol-data-replicate-0: Client-quorum is met
> [2019-10-08 20:10:40.874884] I [fuse-bridge.c:5199:fuse_thread_proc]
> 0-fuse: initating unmount of /export/data
> [2019-10-08 20:10:40.875054] W [glusterfsd.c:1514:cleanup_and_exit]
> (-->/lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7fdc50b646ba]
> -->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xed) [0x563108ee670d]
> -->/usr/sbin/glusterfs(cleanup_and_exit+0x54) [0x563108ee6524] ) 0-:
> received signum (15), shutting down
> [2019-10-08 20:10:40.875079] I [fuse-bridge.c:5981:fini] 0-fuse:
> Unmounting '/export/data'.
> [2019-10-08 20:10:40.875087] I [fuse-bridge.c:5986:fini] 0-fuse: Closing
> fuse connection to '/export/data'.
> [2019-10-08 20:10:47.464875] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-data /export/data)
> [2019-10-08 20:10:47.468743] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-08 20:10:47.472050] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-08 20:10:47.472465] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-0: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:47.472803] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-1: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:47.472865] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.472968] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.473036] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-0: changing port to 49156 (from 0)
> [2019-10-08 20:10:47.473121] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-2: parent translators are ready, attempting connect on
> transport
> [2019-10-08 20:10:47.473466] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.473511] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.473681] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.473850] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.473928] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.474019] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-1: changing port to 49156 (from 0)
> [2019-10-08 20:10:47.474072] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.474309] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-2: changing port to 49158 (from 0)
> [2019-10-08 20:10:47.474621] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-0:
> Connected to vol-data-client-0, attached to remote volume
> '/bigdisk/brick1/vol-data'.
> [2019-10-08 20:10:47.474638] I [MSGID: 108005]
> [afr-common.c:5336:__afr_handle_child_up_event] 0-vol-data-replicate-0:
> Subvolume 'vol-data-client-0' came back up; going online.
> [2019-10-08 20:10:47.474750] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.474927] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.474958] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.475216] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-08 20:10:47.476030] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-1:
> Connected to vol-data-client-1, attached to remote volume
> '/bigdisk/brick2/vol-data'.
> [2019-10-08 20:10:47.476052] I [MSGID: 108002]
> [afr-common.c:5611:afr_notify] 0-vol-data-replicate-0: Client-quorum is met
> [2019-10-08 20:10:47.476152] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-2:
> Connected to vol-data-client-2, attached to remote volume
> '/bigdisk/brick3/vol-data'.
> [2019-10-08 20:10:47.477159] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-08 20:10:47.477210] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> [2019-10-08 20:10:47.478960] I [MSGID: 108031]
> [afr-common.c:2597:afr_local_discovery_cbk] 0-vol-data-replicate-0:
> selecting local read_child vol-data-client-0
> [2019-10-08 20:10:47.479971] I [MSGID: 108031]
> [afr-common.c:2597:afr_local_discovery_cbk] 0-vol-data-replicate-0:
> selecting local read_child vol-data-client-0
> [2019-10-08 20:10:47.480094] I [MSGID: 109005]
> [dht-selfheal.c:2342:dht_selfheal_directory] 0-vol-data-dht: Directory
> selfheal failed: Unable to form layout for directory /
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-09 1"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-09 7"
> root at nas20:/var/log/glusterfs# cat export-data.log | grep "2019-10-09 0"
> [2019-10-09 04:25:02.165330] I [MSGID: 100011]
> [glusterfsd.c:1599:reincarnate] 0-glusterfsd: Fetching the volume file
> from server...
> [2019-10-09 04:25:02.191948] I [glusterfsd-mgmt.c:1953:mgmt_getspec_cbk]
> 0-glusterfs: No change in volfile,continuing
> [2019-10-09 07:12:03.955619] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-data /export/data)
> [2019-10-09 07:12:03.981652] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-09 07:12:04.002485] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-09 07:12:04.003899] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-0: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:04.004147] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-1: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:04.004366] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-2: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:04.004628] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:04.004923] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:04.005244] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-09 07:12:04.005286] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-0: disconnected from
> vol-data-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-09 07:12:04.005297] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-09 07:12:06.690631] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-10-09 07:12:06.690792] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-1: connection to 192.168.1.121:24007 failed (No route
> to host); disconnecting socket
> [2019-10-09 07:12:06.691746] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-09 07:12:06.691771] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> The message "E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up." repeated 2 times between [2019-10-09
> 07:12:04.005297] and [2019-10-09 07:12:06.690811]
> [2019-10-09 07:12:06.692647] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-09 07:12:06.692695] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-09 07:12:06.692807] E [fuse-bridge.c:4362:fuse_first_lookup]
> 0-fuse: first lookup on root failed (Transport endpoint is not connected)
> [2019-10-09 07:12:06.692955] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-09 07:12:06.692980] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-09 07:12:06.693003] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-09 07:12:06.693013] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 2: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-09 07:12:06.695503] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-09 07:12:06.695526] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-09 07:12:06.695547] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-09 07:12:06.695556] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 3: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-09 07:12:06.695619] I [MSGID: 108006]
> [afr-common.c:5677:afr_local_init] 0-vol-data-replicate-0: no subvolumes up
> [2019-10-09 07:12:06.695633] E [MSGID: 101046]
> [dht-common.c:1502:dht_lookup_dir_cbk] 0-vol-data-dht: dict is null
> [2019-10-09 07:12:06.695650] W
> [fuse-resolve.c:132:fuse_resolve_gfid_cbk] 0-fuse:
> 00000000-0000-0000-0000-000000000001: failed to resolve (Transport
> endpoint is not connected)
> [2019-10-09 07:12:06.695658] E [fuse-bridge.c:928:fuse_getattr_resume]
> 0-glusterfs-fuse: 4: GETATTR 1 (00000000-0000-0000-0000-000000000001)
> resolution failed
> [2019-10-09 07:12:06.714499] I [fuse-bridge.c:5199:fuse_thread_proc]
> 0-fuse: initating unmount of /export/data
> [2019-10-09 07:12:06.714753] W [glusterfsd.c:1514:cleanup_and_exit]
> (-->/lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7f133ffef6ba]
> -->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xed) [0x562b2312c70d]
> -->/usr/sbin/glusterfs(cleanup_and_exit+0x54) [0x562b2312c524] ) 0-:
> received signum (15), shutting down
> [2019-10-09 07:12:06.714773] I [fuse-bridge.c:5981:fini] 0-fuse:
> Unmounting '/export/data'.
> [2019-10-09 07:12:06.714779] I [fuse-bridge.c:5986:fini] 0-fuse: Closing
> fuse connection to '/export/data'.
> [2019-10-09 07:12:07.109206] I [MSGID: 100030] [glusterfsd.c:2741:main]
> 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 4.1.8
> (args: /usr/sbin/glusterfs --process-name fuse
> --volfile-server=localhost --volfile-id=/vol-data /export/data)
> [2019-10-09 07:12:07.112870] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 1
> [2019-10-09 07:12:07.116011] I [MSGID: 101190]
> [event-epoll.c:617:event_dispatch_epoll_worker] 0-epoll: Started thread
> with index 2
> [2019-10-09 07:12:07.116421] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-0: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:07.116655] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-1: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:07.116676] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:07.116767] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:07.116833] E [MSGID: 114058]
> [client-handshake.c:1442:client_query_portmap_cbk] 0-vol-data-client-0:
> failed to get the port number for remote subvolume. Please run 'gluster
> volume status' on server to see if brick process is running.
> [2019-10-09 07:12:07.116835] I [MSGID: 114020] [client.c:2328:notify]
> 0-vol-data-client-2: parent translators are ready, attempting connect on
> transport
> [2019-10-09 07:12:07.116887] I [MSGID: 114018]
> [client.c:2254:client_rpc_notify] 0-vol-data-client-0: disconnected from
> vol-data-client-0. Client process will keep trying to connect to
> glusterd until brick's port is available
> [2019-10-09 07:12:07.116898] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-09 07:12:07.691005] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:09.690613] E [socket.c:2524:socket_connect_finish]
> 0-vol-data-client-2: connection to 134.21.57.122:24007 failed (No route
> to host); disconnecting socket
> [2019-10-09 07:12:11.111975] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.112083] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.112200] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.112397] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.112518] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-2: changing port to 49158 (from 0)
> [2019-10-09 07:12:11.112820] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.113013] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-2: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:09.690664] E [MSGID: 108006]
> [afr-common.c:5413:__afr_handle_child_down_event]
> 0-vol-data-replicate-0: All subvolumes are down. Going offline until
> atleast one of them comes back up.
> [2019-10-09 07:12:11.114003] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-2:
> Connected to vol-data-client-2, attached to remote volume
> '/bigdisk/brick3/vol-data'.
> [2019-10-09 07:12:11.114045] I [MSGID: 108005]
> [afr-common.c:5336:__afr_handle_child_up_event] 0-vol-data-replicate-0:
> Subvolume 'vol-data-client-2' came back up; going online.
> [2019-10-09 07:12:11.290914] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.291239] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-1: changing port to 49156 (from 0)
> [2019-10-09 07:12:11.291676] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.291919] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-1: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:11.293266] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-1:
> Connected to vol-data-client-1, attached to remote volume
> '/bigdisk/brick2/vol-data'.
> [2019-10-09 07:12:11.293306] I [MSGID: 108002]
> [afr-common.c:5611:afr_notify] 0-vol-data-replicate-0: Client-quorum is met
> [2019-10-09 07:12:11.295955] I [fuse-bridge.c:4294:fuse_init]
> 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.24
> kernel 7.23
> [2019-10-09 07:12:11.296014] I [fuse-bridge.c:4927:fuse_graph_sync]
> 0-fuse: switched to graph 0
> [2019-10-09 07:12:11.299181] I [MSGID: 109005]
> [dht-selfheal.c:2342:dht_selfheal_directory] 0-vol-data-dht: Directory
> selfheal failed: Unable to form layout for directory /
> [2019-10-09 07:12:14.112691] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:14.112772] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:17.113224] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:17.113319] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:20.113917] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:20.114031] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:24.393064] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:24.393253] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:26.393776] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:26.393880] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:29.394504] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:29.394614] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:32.395375] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:32.395534] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:35.395920] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:35.396027] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:38.396531] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:38.396618] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:41.397419] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:41.397526] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:44.398189] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:44.398312] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:47.399045] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:47.399166] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:50.399735] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:50.399855] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:53.400507] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:53.400616] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:56.401284] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:56.401402] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:59.402080] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:12:59.402200] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:02.402863] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:02.402984] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:05.404125] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:05.404320] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:08.404977] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:08.405172] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:11.405694] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:11.405884] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:14.406443] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:14.406629] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:17.407255] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:17.407445] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:20.408092] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:20.408277] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:23.409546] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:23.409735] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:26.410420] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:26.410600] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:29.411353] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:29.411528] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:32.412325] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:32.412505] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:35.413311] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:35.413491] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:38.414345] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:38.414540] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:41.415407] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:41.415597] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:44.416490] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:44.416672] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:47.417664] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:47.417851] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:50.418814] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:50.419005] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:53.419982] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:53.420166] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:56.421200] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:56.421388] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:59.422450] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:13:59.422630] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:02.423757] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:02.423952] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:05.425051] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:05.425243] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:08.425832] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:08.426011] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:11.426636] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:11.426846] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:16.310279] I [glusterfsd-mgmt.c:53:mgmt_cbk_spec]
> 0-mgmt: Volume file changed
> [2019-10-09 07:14:19.393266] I [glusterfsd-mgmt.c:53:mgmt_cbk_spec]
> 0-mgmt: Volume file changed
> [2019-10-09 07:14:19.465709] I [glusterfsd-mgmt.c:1953:mgmt_getspec_cbk]
> 0-glusterfs: No change in volfile,continuing
> [2019-10-09 07:14:19.467466] I [glusterfsd-mgmt.c:1953:mgmt_getspec_cbk]
> 0-glusterfs: No change in volfile,continuing
> [2019-10-09 07:14:29.457122] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:29.457312] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:29.457431] I [rpc-clnt.c:2105:rpc_clnt_reconfig]
> 0-vol-data-client-0: changing port to 49157 (from 0)
> [2019-10-09 07:14:29.458078] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:29.458264] W [rpc-clnt.c:1753:rpc_clnt_submit]
> 0-vol-data-client-0: error returned while attempting to connect to
> host:(null), port:0
> [2019-10-09 07:14:29.459212] I [MSGID: 114046]
> [client-handshake.c:1095:client_setvolume_cbk] 0-vol-data-client-0:
> Connected to vol-data-client-0, attached to remote volume
> '/bigdisk/brick1/vol-data'.
>
> Regards,
> Birgit
>
> On 13/10/19 08:13, Amar Tumballi wrote:
> > 'Transport endpoint not connected' (ie, ENOTCONN) comes when the n/w
> > connection is not established between client and the server. I recommend
> > checking the logs for particular reason. Specially the brick (server
> > side) logs will have some hints on this.
> >
> > About the crash, we treat it as a bug. Considering there is no specific
> > backtrace, or logs shared with the email, it is hard to tell if it is
> > already fixed in higher version or not.
> >
> > Considering you are in 4.1.8 version, and there are many releases done
> > after that, upgrading also can be an option.
> >
> > Regards,
> > Amar
> >
> >
> > On Fri, Oct 11, 2019 at 4:13 PM DUCARROZ Birgit
> > <birgit.ducarroz at unifr.ch <mailto:birgit.ducarroz at unifr.ch>> wrote:
> >
> >     Hi list,
> >
> >     Does anyone know what I can do to avoid "Transport Endpoint not
> >     connected" (and then to get a blocked server) when writing a lot of
> >     small files on a volume?
> >
> >     I'm running glusterfs 4.1.8 on 6 servers. With 3 servers I never have
> >     problems, but the other 3 servers are acting as HA storage for people
> >     who write sometimes a thousands of small files. This seems to
> provoke a
> >     crash of the gluster daemon.
> >
> >     I have 3 bricks whereas the 3rd brick acts as arbiter.
> >
> >
> >     # Location of the bricks:
> >     #-------$HOST1-------  -------$HOST3-------
> >     # brick1            |  | brick3           | brick3 = arbiter
> >     #                   |  |                  |
> >     #-------$HOST2-------  --------------------
> >     # brick2            |
> >     #--------------------
> >
> >     Checked:
> >     The underlying ext4 filesystem and the HD's seem to be without
> errors.
> >     The ports in the firewall should not be the problem since it occurs
> >     also
> >     when the firewall is disabled.
> >
> >     Any help appreciated!
> >     Kind regards,
> >     Birgit
> >     ________
> >
> >     Community Meeting Calendar:
> >
> >     APAC Schedule -
> >     Every 2nd and 4th Tuesday at 11:30 AM IST
> >     Bridge: https://bluejeans.com/118564314
> >
> >     NA/EMEA Schedule -
> >     Every 1st and 3rd Tuesday at 01:00 PM EDT
> >     Bridge: https://bluejeans.com/118564314
> >
> >     Gluster-users mailing list
> >     Gluster-users at gluster.org <mailto:Gluster-users at gluster.org>
> >     https://lists.gluster.org/mailman/listinfo/gluster-users
> >
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.gluster.org/pipermail/gluster-users/attachments/20191014/054bec4c/attachment.html>


More information about the Gluster-users mailing list