[Gluster-users] One node goes offline, the other node can't see the replicated volume anymore

Greg Scott GregScott at infrasupport.com
Tue Jul 16 01:16:21 UTC 2013


Here is a tail from /var/log/glusterfs/firewall-scripts.log.   The entries at 00:39:56 GMT must correspond to /var/log/messages at 19:39:56 USA Central time.   There are no firewall rules and the default policy is ACCEPT.   With no rc.local file, my application is out of the picture right now, so nobody is sleeping or taking themselves offline or anything like that.  Just a straight up boot using fstab to mount the filesystems.   
.
.
.
[2013-07-16 00:36:40.495183] I [afr-common.c:2120:afr_discovery_cbk] 0-firewall-scripts-replicate-0: selecting local read_child
firewall-scripts-client-0
[2013-07-16 00:39:12.584585] W [socket.c:514:__socket_rwv] 0-glusterfs: readv failed (No data available)
[2013-07-16 00:39:12.584853] W [socket.c:1962:__socket_proto_state_machine] 0-glusterfs: reading from socket failed. Error (No d
ata available), peer (127.0.0.1:24007)
[2013-07-16 00:39:12.714225] W [glusterfsd.c:970:cleanup_and_exit] (-->/usr/lib64/libc.so.6(clone+0x6d) [0x7fd5c27f913d] (-->/us
r/lib64/libpthread.so.0(+0x33c1607c53) [0x7fd5c2e8fc53] (-->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xd5) [0x7fd5c3b7de35]))) 0-
: received signum (15), shutting down
[2013-07-16 00:39:12.714289] I [fuse-bridge.c:5212:fini] 0-fuse: Unmounting '/firewall-scripts'.
[2013-07-16 00:39:12.715170] I [fuse-bridge.c:4583:fuse_thread_proc] 0-fuse: unmounting /firewall-scripts
[2013-07-16 00:39:56.667185] I [glusterfsd.c:1878:main] 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.4.0
beta4 (/usr/sbin/glusterfs --volfile-id=/firewall-scripts --volfile-server=localhost /firewall-scripts)
[2013-07-16 00:39:56.726785] I [socket.c:3480:socket_init] 0-glusterfs: SSL support is NOT enabled
[2013-07-16 00:39:56.726993] I [socket.c:3495:socket_init] 0-glusterfs: using system polling thread
[2013-07-16 00:40:05.794794] I [socket.c:3480:socket_init] 0-firewall-scripts-client-1: SSL support is NOT enabled
[2013-07-16 00:40:05.794927] I [socket.c:3495:socket_init] 0-firewall-scripts-client-1: using system polling thread
[2013-07-16 00:40:05.801351] I [socket.c:3480:socket_init] 0-firewall-scripts-client-0: SSL support is NOT enabled
[2013-07-16 00:40:05.801486] I [socket.c:3495:socket_init] 0-firewall-scripts-client-0: using system polling thread
[2013-07-16 00:40:05.801611] I [client.c:2154:notify] 0-firewall-scripts-client-0: parent translators are ready, attempting conn
ect on transport
[2013-07-16 00:40:05.817724] I [client.c:2154:notify] 0-firewall-scripts-client-1: parent translators are ready, attempting conn
ect on transport
Given volfile:
+------------------------------------------------------------------------------+
  1: volume firewall-scripts-client-0
  2:     type protocol/client
  3:     option password fb3955b7-a6ca-49bb-b886-d4b6609392f8
  4:     option username de6eacd1-31bc-4bdb-a049-776cd840059e
  5:     option transport-type tcp
  6:     option remote-subvolume /gluster-fw1
  7:     option remote-host 192.168.253.1
  8: end-volume
  9:
 10: volume firewall-scripts-client-1
 11:     type protocol/client
 12:     option password fb3955b7-a6ca-49bb-b886-d4b6609392f8
 13:     option username de6eacd1-31bc-4bdb-a049-776cd840059e
 14:     option transport-type tcp
 15:     option remote-subvolume /gluster-fw2
 16:     option remote-host 192.168.253.2
 17: end-volume
 18:
 19: volume firewall-scripts-replicate-0
 20:     type cluster/replicate
 21:     subvolumes firewall-scripts-client-0 firewall-scripts-client-1
 22: end-volume
 23:
 24: volume firewall-scripts-dht
 25:     type cluster/distribute
 26:     subvolumes firewall-scripts-replicate-0
 27: end-volume
 28:
 29: volume firewall-scripts-write-behind
 30:     type performance/write-behind
 31:     subvolumes firewall-scripts-dht
 32: end-volume
 33:
 34: volume firewall-scripts-read-ahead
 35:     type performance/read-ahead
 36:     subvolumes firewall-scripts-write-behind
 37: end-volume
 38:
 39: volume firewall-scripts-io-cache
 40:     type performance/io-cache
 41:     subvolumes firewall-scripts-read-ahead
 42: end-volume
 43:
 44: volume firewall-scripts-quick-read
 45:     type performance/quick-read
 46:     subvolumes firewall-scripts-io-cache
 47: end-volume
 48:
 49: volume firewall-scripts-open-behind
 50:     type performance/open-behind
 51:     subvolumes firewall-scripts-quick-read
 52: end-volume
 53:
 54: volume firewall-scripts-md-cache
 55:     type performance/md-cache
 56:     subvolumes firewall-scripts-open-behind
 57: end-volume
 58:
 59: volume firewall-scripts
 60:     type debug/io-stats
 61:     option count-fop-hits off
 62:     option latency-measurement off
 63:     subvolumes firewall-scripts-md-cache
 64: end-volume

+------------------------------------------------------------------------------+
[2013-07-16 00:40:05.975356] E [client-handshake.c:1741:client_query_portmap_cbk] 0-firewall-scripts-client-0: failed to get the
 port number for remote subvolume. Please run 'gluster volume status' on server to see if brick process is running.
[2013-07-16 00:40:05.975588] W [socket.c:514:__socket_rwv] 0-firewall-scripts-client-0: readv failed (No data available)
[2013-07-16 00:40:05.975708] I [client.c:2097:client_rpc_notify] 0-firewall-scripts-client-0: disconnected
[2013-07-16 00:40:06.027821] E [client-handshake.c:1741:client_query_portmap_cbk] 0-firewall-scripts-client-1: failed to get the
 port number for remote subvolume. Please run 'gluster volume status' on server to see if brick process is running.
[2013-07-16 00:40:06.028010] W [socket.c:514:__socket_rwv] 0-firewall-scripts-client-1: readv failed (No data available)
[2013-07-16 00:40:06.028103] I [client.c:2097:client_rpc_notify] 0-firewall-scripts-client-1: disconnected
[2013-07-16 00:40:06.028148] E [afr-common.c:3735:afr_notify] 0-firewall-scripts-replicate-0: All subvolumes are down. Going off
line until atleast one of them comes back up.
[2013-07-16 00:40:06.048172] I [fuse-bridge.c:4723:fuse_graph_setup] 0-fuse: switched to graph 0
[2013-07-16 00:40:06.049068] I [fuse-bridge.c:3680:fuse_init] 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.
13 kernel 7.21
[2013-07-16 00:40:06.051158] W [fuse-bridge.c:665:fuse_attr_cbk] 0-glusterfs-fuse: 2: LOOKUP() / => -1 (No such file or director
y)
[2013-07-16 00:40:06.077420] I [fuse-bridge.c:4583:fuse_thread_proc] 0-fuse: unmounting /firewall-scripts
[2013-07-16 00:40:06.078427] W [glusterfsd.c:970:cleanup_and_exit] (-->/usr/lib64/libc.so.6(clone+0x6d) [0x7f1f0078d13d] (-->/us
r/lib64/libpthread.so.0(+0x33c1607c53) [0x7f1f00e23c53] (-->/usr/sbin/glusterfs(glusterfs_sigwaiter+0xd5) [0x7f1f01b11e35]))) 0-
: received signum (15), shutting down
[2013-07-16 00:40:06.078501] I [fuse-bridge.c:5212:fini] 0-fuse: Unmounting '/firewall-scripts'.
[2013-07-16 00:53:39.844556] I [glusterfsd.c:1878:main] 0-/usr/sbin/glusterfs: Started running /usr/sbin/glusterfs version 3.4.0
beta4 (/usr/sbin/glusterfs --volfile-id=/firewall-scripts --volfile-server=localhost /firewall-scripts)
[2013-07-16 00:53:39.858957] I [socket.c:3480:socket_init] 0-glusterfs: SSL support is NOT enabled
[2013-07-16 00:53:39.859117] I [socket.c:3495:socket_init] 0-glusterfs: using system polling thread
[2013-07-16 00:53:39.907716] I [socket.c:3480:socket_init] 0-firewall-scripts-client-1: SSL support is NOT enabled
[2013-07-16 00:53:39.907881] I [socket.c:3495:socket_init] 0-firewall-scripts-client-1: using system polling thread
[2013-07-16 00:53:39.909563] I [socket.c:3480:socket_init] 0-firewall-scripts-client-0: SSL support is NOT enabled
[2013-07-16 00:53:39.909655] I [socket.c:3495:socket_init] 0-firewall-scripts-client-0: using system polling thread
[2013-07-16 00:53:39.909778] I [client.c:2154:notify] 0-firewall-scripts-client-0: parent translators are ready, attempting conn
ect on transport
[2013-07-16 00:53:39.920933] I [client.c:2154:notify] 0-firewall-scripts-client-1: parent translators are ready, attempting conn
ect on transport
Given volfile:
+------------------------------------------------------------------------------+
  1: volume firewall-scripts-client-0
  2:     type protocol/client
  3:     option password fb3955b7-a6ca-49bb-b886-d4b6609392f8
  4:     option username de6eacd1-31bc-4bdb-a049-776cd840059e
  5:     option transport-type tcp
  6:     option remote-subvolume /gluster-fw1
  7:     option remote-host 192.168.253.1
  8: end-volume
  9:
 10: volume firewall-scripts-client-1
 11:     type protocol/client
 12:     option password fb3955b7-a6ca-49bb-b886-d4b6609392f8
 13:     option username de6eacd1-31bc-4bdb-a049-776cd840059e
 14:     option transport-type tcp
 15:     option remote-subvolume /gluster-fw2
 16:     option remote-host 192.168.253.2
 17: end-volume
 18:
 19: volume firewall-scripts-replicate-0
 20:     type cluster/replicate
 21:     subvolumes firewall-scripts-client-0 firewall-scripts-client-1
 22: end-volume
 23:
 24: volume firewall-scripts-dht
 25:     type cluster/distribute
 26:     subvolumes firewall-scripts-replicate-0
 27: end-volume
 28:
 29: volume firewall-scripts-write-behind
 30:     type performance/write-behind
 31:     subvolumes firewall-scripts-dht
 32: end-volume
 33:
 34: volume firewall-scripts-read-ahead
 35:     type performance/read-ahead
 36:     subvolumes firewall-scripts-write-behind
 37: end-volume
 38:
 39: volume firewall-scripts-io-cache
 40:     type performance/io-cache
 41:     subvolumes firewall-scripts-read-ahead
 42: end-volume
 43:
 44: volume firewall-scripts-quick-read
 45:     type performance/quick-read
 46:     subvolumes firewall-scripts-io-cache
 47: end-volume
 48:
 49: volume firewall-scripts-open-behind
 50:     type performance/open-behind
 51:     subvolumes firewall-scripts-quick-read
 52: end-volume
 53:
 54: volume firewall-scripts-md-cache
 55:     type performance/md-cache
 56:     subvolumes firewall-scripts-open-behind
 57: end-volume
 58:
 59: volume firewall-scripts
 60:     type debug/io-stats
 61:     option count-fop-hits off
 62:     option latency-measurement off
 63:     subvolumes firewall-scripts-md-cache
 64: end-volume

+------------------------------------------------------------------------------+
[2013-07-16 00:53:39.933009] I [rpc-clnt.c:1676:rpc_clnt_reconfig] 0-firewall-scripts-client-0: changing port to 49152 (from 0)
[2013-07-16 00:53:39.933178] W [socket.c:514:__socket_rwv] 0-firewall-scripts-client-0: readv failed (No data available)
[2013-07-16 00:53:39.950457] I [rpc-clnt.c:1676:rpc_clnt_reconfig] 0-firewall-scripts-client-1: changing port to 49152 (from 0)
[2013-07-16 00:53:39.950621] W [socket.c:514:__socket_rwv] 0-firewall-scripts-client-1: readv failed (No data available)
[2013-07-16 00:53:39.966646] I [client-handshake.c:1658:select_server_supported_programs] 0-firewall-scripts-client-0: Using Pro
gram GlusterFS 3.3, Num (1298437), Version (330)
[2013-07-16 00:53:39.966994] I [client-handshake.c:1658:select_server_supported_programs] 0-firewall-scripts-client-1: Using Pro
gram GlusterFS 3.3, Num (1298437), Version (330)
[2013-07-16 00:53:39.967417] I [client-handshake.c:1456:client_setvolume_cbk] 0-firewall-scripts-client-0: Connected to 192.168.
253.1:49152, attached to remote volume '/gluster-fw1'.
[2013-07-16 00:53:39.967498] I [client-handshake.c:1468:client_setvolume_cbk] 0-firewall-scripts-client-0: Server and Client lk-
version numbers are not same, reopening the fds
[2013-07-16 00:53:39.967800] I [afr-common.c:3698:afr_notify] 0-firewall-scripts-replicate-0: Subvolume 'firewall-scripts-client
-0' came back up; going online.
[2013-07-16 00:53:39.967959] I [client-handshake.c:450:client_set_lk_version_cbk] 0-firewall-scripts-client-0: Server lk version
 = 1
[2013-07-16 00:53:39.968068] I [client-handshake.c:1456:client_setvolume_cbk] 0-firewall-scripts-client-1: Connected to 192.168.
253.2:49152, attached to remote volume '/gluster-fw2'.
[2013-07-16 00:53:39.968114] I [client-handshake.c:1468:client_setvolume_cbk] 0-firewall-scripts-client-1: Server and Client lk-
version numbers are not same, reopening the fds
[2013-07-16 00:53:39.982434] I [fuse-bridge.c:4723:fuse_graph_setup] 0-fuse: switched to graph 0
[2013-07-16 00:53:39.982877] I [client-handshake.c:450:client_set_lk_version_cbk] 0-firewall-scripts-client-1: Server lk version
 = 1
[2013-07-16 00:53:39.983205] I [fuse-bridge.c:3680:fuse_init] 0-glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.
13 kernel 7.21
[2013-07-16 00:53:39.984516] I [afr-common.c:2057:afr_set_root_inode_on_first_lookup] 0-firewall-scripts-replicate-0: added root
 inode
[2013-07-16 00:53:39.985412] I [afr-common.c:2120:afr_discovery_cbk] 0-firewall-scripts-replicate-0: selecting local read_child
firewall-scripts-client-0
[root at chicago-fw1 ~]#



More information about the Gluster-users mailing list