[Bugs] [Bug 1375541] New: [geo-rep]: defunct tar process while using tar+ssh sync
bugzilla at redhat.com
bugzilla at redhat.com
Tue Sep 13 11:01:21 UTC 2016
https://bugzilla.redhat.com/show_bug.cgi?id=1375541
Bug ID: 1375541
Summary: [geo-rep]: defunct tar process while using tar+ssh
sync
Product: GlusterFS
Version: 3.7.15
Component: geo-replication
Severity: high
Assignee: bugs at gluster.org
Reporter: avishwan at redhat.com
CC: bugs at gluster.org, csaba at redhat.com,
rhinduja at redhat.com, rhs-bugs at redhat.com,
storage-qa-internal at redhat.com
Depends On: 1373976, 1374286
Blocks: 1351528
+++ This bug was initially created as a clone of Bug #1374286 +++
+++ This bug was initially created as a clone of Bug #1373976 +++
Description of problem:
=======================
While syncing data using tar, the sync completes but lots of tar process
becomes defunct.
[root at dhcp41-167 ~]# ps -eaf | grep tar
root 12520 4519 1 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12521 4522 1 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12522 4519 6 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22664/cwd
root 12523 4522 6 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22663/cwd
root 12524 4510 1 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12525 4510 10 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22665/cwd
root 12526 4498 0 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12527 4498 0 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22662/cwd
root 12529 4186 0 17:19 pts/0 00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root 12520 4519 1 17:19 ? 00:00:00 [tar] <defunct>
root 12521 4522 0 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12523 4522 5 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22663/cwd
root 12524 4510 1 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12525 4510 7 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22665/cwd
root 12526 4498 0 17:19 ? 00:00:00 tar --sparse -cf - --files-from
-
root 12527 4498 1 17:19 ? 00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22662/cwd
root 12531 4186 0 17:19 pts/0 00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root 12520 4519 0 17:19 ? 00:00:00 [tar] <defunct>
root 12521 4522 0 17:19 ? 00:00:00 [tar] <defunct>
root 12524 4510 0 17:19 ? 00:00:00 [tar] <defunct>
root 12526 4498 0 17:19 ? 00:00:00 [tar] <defunct>
root 12533 4186 0 17:19 pts/0 00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root 12520 4519 0 17:19 ? 00:00:00 [tar] <defunct>
root 12521 4522 0 17:19 ? 00:00:00 [tar] <defunct>
root 12524 4510 0 17:19 ? 00:00:00 [tar] <defunct>
root 12526 4498 0 17:19 ? 00:00:00 [tar] <defunct>
root 12543 4186 0 17:19 pts/0 00:00:00 grep tar
[root at dhcp41-167 ~]#
Steps to Reproduce:
===================
1. Setup geo-rep between master and slave
2. Set config parameter use-tarssh true
3. Start geo-replication
4. Write some data on master volume
5, Monitor tar process on master nodes using "ps -eaf | grep tar"
Actual results:
===============
Data at master and slave is synced and arequal checksum matches, However, lots
of process gets defunct.
[root at dhcp41-167 ~]# ps -eaf | grep tar
root 12520 4519 0 17:19 ? 00:00:00 [tar] <defunct>
root 12521 4522 0 17:19 ? 00:00:00 [tar] <defunct>
root 12524 4510 0 17:19 ? 00:00:00 [tar] <defunct>
root 12526 4498 0 17:19 ? 00:00:00 [tar] <defunct>
root 12543 4186 0 17:19 pts/0 00:00:00 grep tar
[root at dhcp41-167 ~]#
Expected results:
=================
No tar process should be defunct
--- Additional comment from Worker Ant on 2016-09-08 08:02:46 EDT ---
REVIEW: http://review.gluster.org/15426 (geo-rep: Defunct tar process after
sync) posted (#1) for review on master by Aravinda VK (avishwan at redhat.com)
--- Additional comment from Worker Ant on 2016-09-08 12:13:22 EDT ---
REVIEW: http://review.gluster.org/15426 (geo-rep: Defunct tar process after
sync) posted (#2) for review on master by Aravinda VK (avishwan at redhat.com)
--- Additional comment from Worker Ant on 2016-09-13 06:59:31 EDT ---
COMMIT: http://review.gluster.org/15426 committed in master by Aravinda VK
(avishwan at redhat.com)
------
commit 6b30e9bf5a612e105eb7ded0a89ef25fd8530ba5
Author: Aravinda VK <avishwan at redhat.com>
Date: Thu Sep 8 17:30:37 2016 +0530
geo-rep: Defunct tar process after sync
After every sync iteration with tarssh mode leaves defunct tar
process.
Added wait for tar process to prevent this issue.
BUG: 1374286
Change-Id: I9953239ef601cc1970c814b00074b45eb00f481e
Signed-off-by: Aravinda VK <avishwan at redhat.com>
Reviewed-on: http://review.gluster.org/15426
Smoke: Gluster Build System <jenkins at build.gluster.org>
NetBSD-regression: NetBSD Build System <jenkins at build.gluster.org>
Reviewed-by: Saravanakumar Arumugam <sarumuga at redhat.com>
CentOS-regression: Gluster Build System <jenkins at build.gluster.org>
Reviewed-by: Kotresh HR <khiremat at redhat.com>
Referenced Bugs:
https://bugzilla.redhat.com/show_bug.cgi?id=1373976
[Bug 1373976] [geo-rep]: defunct tar process while using tar+ssh sync
https://bugzilla.redhat.com/show_bug.cgi?id=1374286
[Bug 1374286] [geo-rep]: defunct tar process while using tar+ssh sync
--
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
More information about the Bugs
mailing list