[Gluster-users] and again peer probe node1 hangs
free.aaa
free.aaa at gmail.com
Thu Apr 23 14:35:38 UTC 2015
Hi everybody!
The "gluster peer probe gfs1" command hangs, and the peer is left in the
state "Probe Sent to Peer (Connected)":
> gfs3#gluster peer status
> Number of Peers: 3
>
> Hostname: gfs6
> Uuid: 6bd6ee25-e257-4703-b500-330741b90471
> State: Peer in Cluster (Connected)
>
> Hostname: gfs4
> Uuid: bb1bed20-25bf-43b0-8faa-49f1b5b9ae59
> State: Peer in Cluster (Connected)
>
> Hostname: gfs1
> Uuid: bb67c1da-2698-4c35-b29d-f80f8eb814a6
> State: Probe Sent to Peer (Connected)
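To spot the stuck peer without reading the whole listing, the output above can be parsed. This is only a minimal sketch: it assumes the "Hostname / Uuid / State" layout shown above, and the function names `parse_peer_status` and `not_in_cluster` are made up for illustration and are not part of gluster.

```python
def parse_peer_status(text):
    """Parse 'gluster peer status' text into a list of dicts, one per peer."""
    peers = []
    current = None
    for raw in text.splitlines():
        # Drop mail-style '> ' quoting and surrounding whitespace.
        line = raw.lstrip("> ").strip()
        if ":" not in line:
            continue
        key, value = (part.strip() for part in line.split(":", 1))
        if key == "Hostname":
            # A Hostname line starts a new peer record.
            current = {"Hostname": value}
            peers.append(current)
        elif current is not None:
            current[key] = value
    return peers


def not_in_cluster(peers):
    """Return hostnames whose state is anything other than 'Peer in Cluster'."""
    return [p["Hostname"] for p in peers
            if not p.get("State", "").startswith("Peer in Cluster")]


sample = """Number of Peers: 3

Hostname: gfs6
Uuid: 6bd6ee25-e257-4703-b500-330741b90471
State: Peer in Cluster (Connected)

Hostname: gfs4
Uuid: bb1bed20-25bf-43b0-8faa-49f1b5b9ae59
State: Peer in Cluster (Connected)

Hostname: gfs1
Uuid: bb67c1da-2698-4c35-b29d-f80f8eb814a6
State: Probe Sent to Peer (Connected)
"""

print(not_in_cluster(parse_peer_status(sample)))
```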
I double-checked DNS resolution. Forward and reverse resolution both
work fine.
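For reference, a forward and reverse check of this kind can be scripted roughly as follows. This is a sketch under assumptions: `check_resolution` is a hypothetical helper, the address 192.0.2.1 and the domain example.com are placeholders rather than the real values for gfs1, and stub resolvers are used in the demo so that it runs without touching DNS. By default the function calls the standard `socket.gethostbyname` and `socket.gethostbyaddr`.

```python
import socket


def check_resolution(hostname,
                     forward=socket.gethostbyname,
                     reverse=socket.gethostbyaddr):
    """Resolve hostname -> IP -> name and report whether the names agree.

    Returns (ip, reverse_name, consistent).
    """
    ip = forward(hostname)
    reverse_name, aliases, _addresses = reverse(ip)
    names = {reverse_name, *aliases}
    # Accept a match on either the full name or the short (first label) name.
    short_names = {name.split(".")[0] for name in names}
    consistent = hostname in names or hostname.split(".")[0] in short_names
    return ip, reverse_name, consistent


# Demo with stub resolvers (placeholder data, not the real cluster values).
def fake_forward(host):
    return "192.0.2.1"


def fake_reverse(ip):
    return ("gfs1.example.com", ["gfs1"], [ip])


print(check_resolution("gfs1", forward=fake_forward, reverse=fake_reverse))
```

Calling `check_resolution("gfs1")` with no extra arguments would query the system resolver on the node where it runs, so it would need to be repeated on every peer.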
I tried debugging on gfs1 and gfs3 while the probe command was running, and
it seems to me that gfs1 does not sync the full gluster configuration from
gfs3. In particular, the peers folder on gfs1 contains only one peer file,
with the following content:
> gfs1#cat /var/lib/glusterd/peers/192.168.9.53
> uuid=00000000-0000-0000-0000-000000000000
> state=8
> hostname1=192.168.9.53
So there is no real UUID either in the content of the file (it is all
zeros) or in the filename itself (which is an IP address). Besides, there
is no info about the other peers, gfs4 and gfs6, in the folder.
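The peer file shown above is plain key=value text, so the all-zero UUID can be detected mechanically. A minimal sketch, assuming only the three keys visible in that file; `parse_peer_file` and `has_null_uuid` are illustrative names, not glusterd functions:

```python
NULL_UUID = "00000000-0000-0000-0000-000000000000"


def parse_peer_file(text):
    """Parse key=value lines from a peer file into a dict."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        info[key.strip()] = value.strip()
    return info


def has_null_uuid(info):
    """True if the uuid key is missing or set to the all-zero UUID."""
    return info.get("uuid", NULL_UUID) == NULL_UUID


# Content copied from the file on gfs1 quoted above.
sample = """uuid=00000000-0000-0000-0000-000000000000
state=8
hostname1=192.168.9.53
"""

peer = parse_peer_file(sample)
print(peer["hostname1"], "null uuid:", has_null_uuid(peer))
```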
The last lines of the debug log on gfs1 look like this:
> [2015-04-23 10:12:28.334239] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.334413] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.334557] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.334697] D [common-utils.c:2946:gf_is_local_addr]
> 0-management: gfs4 is not local
> [2015-04-23 10:12:28.334707] D
> [glusterd-utils.c:5567:glusterd_hostname_to_uuid] 0-management:
> returning -1
> [2015-04-23 10:12:28.334714] D
> [glusterd-utils.c:1035:glusterd_volume_brickinfo_get] 0-management:
> Returning -1
> [2015-04-23 10:12:28.336881] D
> [glusterd-utils.c:5532:glusterd_friend_find_by_hostname] 0-management:
> Unable to find friend: gfs6
> [2015-04-23 10:12:28.337490] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.337697] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.337841] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.337981] D [common-utils.c:2946:gf_is_local_addr]
> 0-management: gfs6 is not local
> [2015-04-23 10:12:28.337991] D
> [glusterd-utils.c:5567:glusterd_hostname_to_uuid] 0-management:
> returning -1
> [2015-04-23 10:12:28.337998] D
> [glusterd-utils.c:1035:glusterd_volume_brickinfo_get] 0-management:
> Returning -1
> [2015-04-23 10:12:28.338858] D
> [glusterd-utils.c:5523:glusterd_friend_find_by_hostname] 0-management:
> Friend gfs3 found.. state: 8
> [2015-04-23 10:12:28.338873] D
> [glusterd-utils.c:5567:glusterd_hostname_to_uuid] 0-management:
> returning 0
> [2015-04-23 10:12:28.340989] D
> [glusterd-utils.c:5532:glusterd_friend_find_by_hostname] 0-management:
> Unable to find friend: gfs6
> [2015-04-23 10:12:28.341545] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.341697] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.341840] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.56
> [2015-04-23 10:12:28.341980] D [common-utils.c:2946:gf_is_local_addr]
> 0-management: gfs6 is not local
> [2015-04-23 10:12:28.341991] D
> [glusterd-utils.c:5567:glusterd_hostname_to_uuid] 0-management:
> returning -1
> [2015-04-23 10:12:28.341997] D
> [glusterd-utils.c:685:glusterd_resolve_brick] 0-management: Returning -1
> [2015-04-23 10:12:28.342003] D
> [glusterd-utils.c:1035:glusterd_volume_brickinfo_get] 0-management:
> Returning -1
> [2015-04-23 10:12:28.344065] D
> [glusterd-utils.c:5532:glusterd_friend_find_by_hostname] 0-management:
> Unable to find friend: gfs4
> [2015-04-23 10:12:28.344620] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.344772] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.344914] D [common-utils.c:2930:gf_is_local_addr]
> 0-management: 192.168.9.54
> [2015-04-23 10:12:28.345054] D [common-utils.c:2946:gf_is_local_addr]
> 0-management: gfs4 is not local
> [2015-04-23 10:12:28.345064] D
> [glusterd-utils.c:5567:glusterd_hostname_to_uuid] 0-management:
> returning -1
> [2015-04-23 10:12:28.345071] D
> [glusterd-utils.c:1035:glusterd_volume_brickinfo_get] 0-management:
> Returning -1
> [2015-04-23 10:12:28.345543] D [run.c:190:runner_log] 0-: Starting the
> nfs/glustershd services: /usr/sbin/glusterfs -s localhost --volfile-id
> gluster/quotad -p /var/lib/glusterd/quotad/run/quotad.pid -l
> /var/log/glusterfs/quotad.log -S
> /var/run/3e619fbfe69c96b1dbc7486a7d38a7be.socket --xlator-option
> *replicate*.data-self-heal=off --xlator-option
> *replicate*.metadata-self-heal=off --xlator-option
> *replicate*.entry-self-heal=off
> ^C[2015-04-23 10:20:50.237403] W [glusterfsd.c:1095:cleanup_and_exit]
> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(runner_end_reuse+0x26)
> [0x7fa81cd3dfb6]
> (-->/lib/x86_64-linux-gnu/libpthread.so.0(waitpid+0x5b)
> [0x7fa81c499c8b] (-->/lib/x86_64-linux-gnu/libc.so.6(+0x321e0)
> [0x7fa81bd3a1e0]))) 0-: received signum (2), shutting down
> [2015-04-23 10:20:50.237436] D
> [glusterfsd-mgmt.c:2025:glusterfs_mgmt_pmap_signout] 0-fsd-mgmt:
> portmapper signout arguments not given
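The log above is long, so here is a rough way to list which hostnames glusterd reported as unknown. This is a sketch only: it assumes the "Unable to find friend: <host>" message format seen in the log, handles the mail quoting and line wrapping by flattening the text first, and `unknown_friends` is a made-up helper name.

```python
import re
from collections import Counter


def unquote(log_text):
    """Strip '> ' mail quoting and join wrapped lines into one string."""
    parts = []
    for line in log_text.splitlines():
        parts.append(re.sub(r"^>\s?", "", line).strip())
    return " ".join(parts)


def unknown_friends(log_text):
    """Count 'Unable to find friend: <host>' messages per hostname."""
    flat = unquote(log_text)
    return dict(Counter(re.findall(r"Unable to find friend: (\S+)", flat)))


# A few entries copied from the log quoted above.
sample = """> [2015-04-23 10:12:28.336881] D
> [glusterd-utils.c:5532:glusterd_friend_find_by_hostname] 0-management:
> Unable to find friend: gfs6
> [2015-04-23 10:12:28.338858] D
> [glusterd-utils.c:5523:glusterd_friend_find_by_hostname] 0-management:
> Friend gfs3 found.. state: 8
> [2015-04-23 10:12:28.344065] D
> [glusterd-utils.c:5532:glusterd_friend_find_by_hostname] 0-management:
> Unable to find friend: gfs4
"""

print(unknown_friends(sample))
```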
So I think the server hangs when trying to start a volume that references
gfs4 and gfs6, which it does not know about. But why is the peers
configuration on gfs1 empty?
Any help?
Thanks in advance!
Best regards,
Alex.