[Bugs] [Bug 1209831] peer probe fails because of missing glusterd.info file

bugzilla at redhat.com bugzilla at redhat.com
Fri Apr 10 07:18:11 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1209831



--- Comment #3 from ssamanta at redhat.com ---
This problem is always reproducible. I tried to create a new cluster and the
peer probe failed with missing glusterd.info file although glusterd is running
on all nodes.


Node-1
======

[root at gqas006 ssl]# service glusterd status
Redirecting to /bin/systemctl status  glusterd.service
glusterd.service - GlusterFS, a clustered file-system server
   Loaded: loaded (/usr/lib/systemd/system/glusterd.service; enabled)
   Active: active (running) since Thu 2015-04-09 10:29:02 EDT; 16h ago
 Main PID: 2068 (glusterd)
   CGroup: /system.slice/glusterd.service
           └─2068 /usr/sbin/glusterd -p /var/run/glusterd.pid

Apr 09 10:29:02 gqas006.sbu.lab.eng.bos.redhat.com systemd[1]: Started
GlusterFS, a clustered file-system server.
[root at gqas006 ssl]# hostname
gqas006.sbu.lab.eng.bos.redhat.com
[root at gqas006 ssl]# 

Node-2
======

[root at gqas005 ssl]# service glusterd status
Redirecting to /bin/systemctl status  glusterd.service
glusterd.service - GlusterFS, a clustered file-system server
   Loaded: loaded (/usr/lib/systemd/system/glusterd.service; enabled)
   Active: active (running) since Thu 2015-04-09 10:29:14 EDT; 16h ago
 Main PID: 2066 (glusterd)
   CGroup: /system.slice/glusterd.service
           └─2066 /usr/sbin/glusterd -p /var/run/glusterd.pid

Apr 09 10:29:14 gqas005.sbu.lab.eng.bos.redhat.com systemd[1]: Started
GlusterFS, a clustered file-system server.
[root at gqas005 ssl]

Node-3
======
[root at gqas009 ~]# service glusterd status
Redirecting to /bin/systemctl status  glusterd.service
glusterd.service - GlusterFS, a clustered file-system server
   Loaded: loaded (/usr/lib/systemd/system/glusterd.service; enabled)
   Active: active (running) since Fri 2015-04-10 02:49:29 EDT; 18min ago
  Process: 868 ExecStart=/usr/sbin/glusterd -p /var/run/glusterd.pid
(code=exited, status=0/SUCCESS)
 Main PID: 880 (glusterd)
   CGroup: /system.slice/glusterd.service
           └─880 /usr/sbin/glusterd -p /var/run/glusterd.pid

Apr 10 02:49:23 gqas009.sbu.lab.eng.bos.redhat.com systemd[1]: Starting
GlusterFS, a clustered file-system server...
Apr 10 02:49:29 gqas009.sbu.lab.eng.bos.redhat.com systemd[1]: Started
GlusterFS, a clustered file-system server.
[root at gqas009 ~]#

>From Node1 adding the peers fails

[root at gqas005 ssl]# gluster peer status
Connection failed. Please check if gluster daemon is operational.
[root at gqas005 ssl]# 
[root at gqas005 ssl]# 
[root at gqas005 ssl]# 
[root at gqas005 ssl]# gluster peer probe gqas006.sbu.lab.eng.bos.redhat.com
Connection failed. Please check if gluster daemon is operational.
[root at gqas005 ssl]# 


[2015-04-09 14:29:11.205892] I [glusterd.c:1214:init] 0-management: Maximum
allowed open file descriptors set to 65536
[2015-04-09 14:29:11.205926] I [glusterd.c:1259:init] 0-management: Using
/var/lib/glusterd as working directory
[2015-04-09 14:29:11.210274] W [rdma.c:4221:__gf_rdma_ctx_create]
0-rpc-transport/rdma: rdma_cm event channel creation failed (No such device)
[2015-04-09 14:29:11.210297] E [rdma.c:4519:init] 0-rdma.management: Failed to
initialize IB Device
[2015-04-09 14:29:11.210309] E [rpc-transport.c:333:rpc_transport_load]
0-rpc-transport: 'rdma' initialization failed
[2015-04-09 14:29:11.210374] W [rpcsvc.c:1524:rpcsvc_transport_create]
0-rpc-service: cannot create listener, initing the transport failed
[2015-04-09 14:29:11.210741] E [socket.c:792:__socket_server_bind]
0-socket.management: binding to  failed: Address already in use
[2015-04-09 14:29:11.210762] E [socket.c:795:__socket_server_bind]
0-socket.management: Port is already in use
[2015-04-09 14:29:11.210777] W [rpcsvc.c:1531:rpcsvc_transport_create]
0-rpc-service: listening on transport failed
[2015-04-09 14:29:14.064473] E [store.c:432:gf_store_handle_retrieve] 0-: Path
corresponding to /var/lib/glusterd/glusterd.info, returned error: (No such file
or directory)
[2015-04-09 14:29:14.064515] E [store.c:432:gf_store_handle_retrieve] 0-: Path
corresponding to /var/lib/glusterd/glusterd.info, returned error: (No such file
or directory)
[2015-04-09 14:29:14.064528] I
[glusterd-store.c:2063:glusterd_restore_op_version] 0-management: Detected new
install. Setting op-version to maximum : 30600
[2015-04-09 14:29:14.064644] I
[glusterd-store.c:3497:glusterd_store_retrieve_missed_snaps_list] 0-management:
No missed snaps list.
Final graph:

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list