[Bugs] [Bug 1375541] New: [geo-rep]: defunct tar process while using tar+ssh sync

bugzilla at redhat.com bugzilla at redhat.com
Tue Sep 13 11:01:21 UTC 2016


https://bugzilla.redhat.com/show_bug.cgi?id=1375541

            Bug ID: 1375541
           Summary: [geo-rep]: defunct tar process while using tar+ssh
                    sync
           Product: GlusterFS
           Version: 3.7.15
         Component: geo-replication
          Severity: high
          Assignee: bugs at gluster.org
          Reporter: avishwan at redhat.com
                CC: bugs at gluster.org, csaba at redhat.com,
                    rhinduja at redhat.com, rhs-bugs at redhat.com,
                    storage-qa-internal at redhat.com
        Depends On: 1373976, 1374286
            Blocks: 1351528



+++ This bug was initially created as a clone of Bug #1374286 +++

+++ This bug was initially created as a clone of Bug #1373976 +++

Description of problem:
=======================

While syncing data using tar, the sync completes but lots of tar process
becomes defunct. 

[root at dhcp41-167 ~]# ps -eaf | grep tar
root     12520  4519  1 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12521  4522  1 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12522  4519  6 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22664/cwd
root     12523  4522  6 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22663/cwd
root     12524  4510  1 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12525  4510 10 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22665/cwd
root     12526  4498  0 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12527  4498  0 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22662/cwd
root     12529  4186  0 17:19 pts/0    00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root     12520  4519  1 17:19 ?        00:00:00 [tar] <defunct>
root     12521  4522  0 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12523  4522  5 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22663/cwd
root     12524  4510  1 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12525  4510  7 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22665/cwd
root     12526  4498  0 17:19 ?        00:00:00 tar --sparse -cf - --files-from
-
root     12527  4498  1 17:19 ?        00:00:00 ssh -oPasswordAuthentication=no
-oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/tar_ssh.pem -p
22 root at 10.70.41.203 tar --overwrite -xf - -C /proc/22662/cwd
root     12531  4186  0 17:19 pts/0    00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root     12520  4519  0 17:19 ?        00:00:00 [tar] <defunct>
root     12521  4522  0 17:19 ?        00:00:00 [tar] <defunct>
root     12524  4510  0 17:19 ?        00:00:00 [tar] <defunct>
root     12526  4498  0 17:19 ?        00:00:00 [tar] <defunct>
root     12533  4186  0 17:19 pts/0    00:00:00 grep tar
[root at dhcp41-167 ~]#
[root at dhcp41-167 ~]# ps -eaf | grep tar
root     12520  4519  0 17:19 ?        00:00:00 [tar] <defunct>
root     12521  4522  0 17:19 ?        00:00:00 [tar] <defunct>
root     12524  4510  0 17:19 ?        00:00:00 [tar] <defunct>
root     12526  4498  0 17:19 ?        00:00:00 [tar] <defunct>
root     12543  4186  0 17:19 pts/0    00:00:00 grep tar
[root at dhcp41-167 ~]# 


Steps to Reproduce:
===================
1. Setup geo-rep between master and slave
2. Set config parameter use-tarssh true
3. Start geo-replication
4. Write some data on master volume 
5, Monitor tar process on master nodes using "ps -eaf | grep tar" 

Actual results:
===============

Data at master and slave is synced and arequal checksum matches, However, lots
of process gets defunct. 
[root at dhcp41-167 ~]# ps -eaf | grep tar
root     12520  4519  0 17:19 ?        00:00:00 [tar] <defunct>
root     12521  4522  0 17:19 ?        00:00:00 [tar] <defunct>
root     12524  4510  0 17:19 ?        00:00:00 [tar] <defunct>
root     12526  4498  0 17:19 ?        00:00:00 [tar] <defunct>
root     12543  4186  0 17:19 pts/0    00:00:00 grep tar
[root at dhcp41-167 ~]# 


Expected results:
=================
No tar process should be defunct

--- Additional comment from Worker Ant on 2016-09-08 08:02:46 EDT ---

REVIEW: http://review.gluster.org/15426 (geo-rep: Defunct tar process after
sync) posted (#1) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Worker Ant on 2016-09-08 12:13:22 EDT ---

REVIEW: http://review.gluster.org/15426 (geo-rep: Defunct tar process after
sync) posted (#2) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Worker Ant on 2016-09-13 06:59:31 EDT ---

COMMIT: http://review.gluster.org/15426 committed in master by Aravinda VK
(avishwan at redhat.com) 
------
commit 6b30e9bf5a612e105eb7ded0a89ef25fd8530ba5
Author: Aravinda VK <avishwan at redhat.com>
Date:   Thu Sep 8 17:30:37 2016 +0530

    geo-rep: Defunct tar process after sync

    After every sync iteration with tarssh mode leaves defunct tar
    process.

    Added wait for tar process to prevent this issue.

    BUG: 1374286
    Change-Id: I9953239ef601cc1970c814b00074b45eb00f481e
    Signed-off-by: Aravinda VK <avishwan at redhat.com>
    Reviewed-on: http://review.gluster.org/15426
    Smoke: Gluster Build System <jenkins at build.gluster.org>
    NetBSD-regression: NetBSD Build System <jenkins at build.gluster.org>
    Reviewed-by: Saravanakumar Arumugam <sarumuga at redhat.com>
    CentOS-regression: Gluster Build System <jenkins at build.gluster.org>
    Reviewed-by: Kotresh HR <khiremat at redhat.com>


Referenced Bugs:

https://bugzilla.redhat.com/show_bug.cgi?id=1373976
[Bug 1373976] [geo-rep]: defunct tar process while using tar+ssh sync
https://bugzilla.redhat.com/show_bug.cgi?id=1374286
[Bug 1374286] [geo-rep]: defunct tar process while using tar+ssh sync
-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list