[Bugs] [Bug 1213352] nfs-ganesha: HA issue, the iozone process is not moving ahead, once the nfs-ganesha is killed
bugzilla at redhat.com
bugzilla at redhat.com
Mon Apr 20 12:29:28 UTC 2015
https://bugzilla.redhat.com/show_bug.cgi?id=1213352
Saurabh <saujain at redhat.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Flags|needinfo?(saujain at redhat.co |
|m) |
--- Comment #2 from Saurabh <saujain at redhat.com> ---
So I am having four nodes, namely nfs[1,2,3,4]
nfs-ganehsa came up only on nfs2 and nfs3
and presently I killed nfs-ganesha process on nfs2
so collected the showmount output from nfs3,
[root at nfs3 ~]# showmount -e 10.70.36.217
Export list for 10.70.36.217:
/vol0 (everyone)
[root at nfs3 ~]# showmount -e 10.70.36.218
Export list for 10.70.36.218:
/vol0 (everyone)
[root at nfs3 ~]# showmount -e 10.70.36.219
Export list for 10.70.36.219:
/vol0 (everyone)
[root at nfs3 ~]# showmount -e 10.70.36.220
Export list for 10.70.36.220:
/vol0 (everyone)
node 1,
#####################################
[root at nfs1 ~]# ps -eaf | grep nfs
root 5338 6760 0 14:57 pts/0 00:00:00 grep nfs
[root at nfs1 ~]# pcs status
Cluster name: ganesha-ha-2
Last updated: Mon Apr 20 14:58:03 2015
Last change: Mon Apr 20 12:28:04 2015
Stack: cman
Current DC: nfs1 - partition with quorum
Version: 1.1.11-97629de
4 Nodes configured
22 Resources configured
Online: [ nfs1 nfs2 nfs3 nfs4 ]
Full list of resources:
Clone Set: nfs_start-clone [nfs_start]
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs3 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs1 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs2 (unmanaged)
Stopped: [ nfs4 ]
nfs1-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs4
Clone Set: nfs-mon-clone [nfs-mon]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
Clone Set: nfs-grace-clone [nfs-grace]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
nfs1-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs1-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs2-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs2
nfs2-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs2
nfs3-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs3-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs4-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs1
Failed actions:
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
node 2,
##########################################
[root at nfs2 ~]# ps -eaf | grep nfs
root 5260 16826 0 14:58 pts/0 00:00:00 grep nfs
root 6216 1 0 12:27 ? 00:00:05 /usr/bin/ganesha.nfsd -L
/var/log/ganesha.log -f /etc/ganesha/ganesha.conf -N NIV_EVENT -p
/var/run/ganesha.nfsd.pid
[root at nfs2 ~]# pcs status
Cluster name: ganesha-ha-2
Last updated: Mon Apr 20 14:58:49 2015
Last change: Mon Apr 20 12:28:04 2015
Stack: cman
Current DC: nfs1 - partition with quorum
Version: 1.1.11-97629de
4 Nodes configured
22 Resources configured
Online: [ nfs1 nfs2 nfs3 nfs4 ]
Full list of resources:
Clone Set: nfs_start-clone [nfs_start]
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs3 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs1 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs2 (unmanaged)
Stopped: [ nfs4 ]
nfs1-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs4
Clone Set: nfs-mon-clone [nfs-mon]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
Clone Set: nfs-grace-clone [nfs-grace]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
nfs1-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs1-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs2-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs2
nfs2-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs2
nfs3-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs3-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs4-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs1
Failed actions:
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
node 3,
#############################################
[root at nfs3 ~]# ps -eaf | grep nfs
root 20901 18085 0 14:59 pts/0 00:00:00 grep nfs
root 26369 1 0 12:27 ? 00:00:05 /usr/bin/ganesha.nfsd -L
/var/log/ganesha.log -f /etc/ganesha/ganesha.conf -N NIV_EVENT -p
/var/run/ganesha.nfsd.pid
[root at nfs3 ~]# pcs status
Cluster name: ganesha-ha-2
Last updated: Mon Apr 20 14:59:22 2015
Last change: Mon Apr 20 12:28:04 2015
Stack: cman
Current DC: nfs1 - partition with quorum
Version: 1.1.11-97629de
4 Nodes configured
22 Resources configured
Online: [ nfs1 nfs2 nfs3 nfs4 ]
Full list of resources:
Clone Set: nfs_start-clone [nfs_start]
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs3 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs1 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs2 (unmanaged)
Stopped: [ nfs4 ]
nfs1-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs4
Clone Set: nfs-mon-clone [nfs-mon]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
Clone Set: nfs-grace-clone [nfs-grace]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
nfs1-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs1-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs2-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs2
nfs2-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs2
nfs3-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs3-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs4-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs1
Failed actions:
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
node 4,
######################################
[root at nfs4 ~]# ps -eaf | grep nfs
root 16073 27004 0 04:12 pts/0 00:00:00 grep nfs
[root at nfs4 ~]# pcs status
Cluster name: ganesha-ha-2
Last updated: Mon Apr 20 04:13:00 2015
Last change: Mon Apr 20 01:41:11 2015
Stack: cman
Current DC: nfs1 - partition with quorum
Version: 1.1.11-97629de
4 Nodes configured
22 Resources configured
Online: [ nfs1 nfs2 nfs3 nfs4 ]
Full list of resources:
Clone Set: nfs_start-clone [nfs_start]
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs3 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs1 (unmanaged)
nfs_start (ocf::heartbeat:ganesha_nfsd): FAILED nfs2 (unmanaged)
Stopped: [ nfs4 ]
nfs1-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs4
Clone Set: nfs-mon-clone [nfs-mon]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
Clone Set: nfs-grace-clone [nfs-grace]
Started: [ nfs1 nfs2 nfs3 nfs4 ]
nfs1-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs1-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs2-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs2
nfs2-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs2
nfs3-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs3-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-cluster_ip-1 (ocf::heartbeat:IPaddr): Started nfs3
nfs4-trigger_ip-1 (ocf::heartbeat:Dummy): Started nfs3
nfs4-dead_ip-1 (ocf::heartbeat:Dummy): Started nfs1
Failed actions:
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs3 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs1 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40001ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
nfs_start_stop_0 on nfs2 'unknown error' (1): call=20, status=Timed Out,
last-rc-change='Mon Apr 20 12:27:09 2015', queued=0ms, exec=40002ms
--
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=8i71Vz9XRR&a=cc_unsubscribe
More information about the Bugs
mailing list