From alan.orth at gmail.com  Sat Jun  1 16:07:44 2019
From: alan.orth at gmail.com (Alan Orth)
Date: Sat, 1 Jun 2019 19:07:44 +0300
Subject: [Gluster-users] Does replace-brick migrate data?
In-Reply-To: 
References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com>
 <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com>
 <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com>
Message-ID: 

Dear Ravi,

The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I could
verify them for six bricks and millions of files, though... :\

I had a small success in fixing some issues with duplicated files on the
FUSE mount point yesterday. I read quite a bit about the elastic hashing
algorithm that determines which files get placed on which bricks based on
the hash of their filename and the trusted.glusterfs.dht xattr on brick
directories (thanks to Joe Julian's blog post and Python script for
showing how it works¹). With that knowledge I looked closer at one of the
files that was appearing as duplicated on the FUSE mount and found that it
was also duplicated on more than `replica 2` bricks. For this particular
file I found two "real" files and several zero-size files with
trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files was on
the correct brick as far as the DHT layout is concerned, so I copied one
of them to the correct brick, deleted the others and their hard links, and
did a `stat` on the file from the FUSE mount point and it fixed itself.
Yay!

Could this have been caused by a replace-brick that got interrupted and
didn't finish re-labeling the xattrs?

Should I be thinking of some heuristics to identify and fix these issues
(incorrect brick placement) with a script, or is this something a
fix-layout or repeated volume heals can fix? I've already completed a
whole heal on this particular volume this week and it did heal about
1,000,000 files (mostly data and metadata, but about 20,000 entry heals as
well).

Thanks for your support,

¹ https://joejulian.name/post/dht-misses-are-expensive/

On Fri, May 31, 2019 at 7:57 AM Ravishankar N wrote:

>
> On 31/05/19 3:20 AM, Alan Orth wrote:
>
> Dear Ravi,
>
> I spent a bit of time inspecting the xattrs on some files and directories
> on a few bricks for this volume and it looks a bit messy. Even if I could
> make sense of it for a few and potentially heal them manually, there are
> millions of files and directories in total so that's definitely not a
> scalable solution. After a few missteps with `replace-brick ... commit
> force` in the last week (one of which on a brick that was dead/offline) as
> well as some premature `remove-brick` commands, I'm unsure how to proceed
> and I'm getting demotivated. It's scary how quickly things get out of hand
> in distributed systems...
>
> Hi Alan,
> The one good thing about gluster is that the data is always available
> directly on the backend bricks even if your volume has inconsistencies at
> the gluster level. So theoretically, if your cluster is FUBAR, you could
> just create a new volume and copy all data onto it via its mount from the
> old volume's bricks.
>
>
> I had hoped that bringing the old brick back up would help, but by the
> time I added it again a few days had passed and all the brick-id's had
> changed due to the replace/remove brick commands, not to mention that the
> trusted.afr.$volume-client-xx values were now probably pointing to the
> wrong bricks (?).
> > Anyways, a few hours ago I started a full heal on the volume and I see > that there is a sustained 100MiB/sec of network traffic going from the old > brick's host to the new one. The completed heals reported in the logs look > promising too: > > Old brick host: > > # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E > 'Completed (data|metadata|entry) selfheal' | sort | uniq -c > 281614 Completed data selfheal > 84 Completed entry selfheal > 299648 Completed metadata selfheal > > New brick host: > > # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E > 'Completed (data|metadata|entry) selfheal' | sort | uniq -c > 198256 Completed data selfheal > 16829 Completed entry selfheal > 229664 Completed metadata selfheal > > So that's good I guess, though I have no idea how long it will take or if > it will fix the "missing files" issue on the FUSE mount. I've increased > cluster.shd-max-threads to 8 to hopefully speed up the heal process. > > The afr xattrs should not cause files to disappear from mount. If the > xattr names do not match what each AFR subvol expects (for eg. in a replica > 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd > subvol and so on - ) for its children then it won't heal the data, that is > all. But in your case I see some inconsistencies like one brick having the > actual file (licenseserver.cfg) and the other having a linkto file (the > one with the dht.linkto xattr) *in the same replica pair*. > > > I'd be happy for any advice or pointers, > > Did you check if the .glusterfs hardlinks/symlinks exist and are in order > for all bricks? > > -Ravi > > > On Wed, May 29, 2019 at 5:20 PM Alan Orth wrote: > >> Dear Ravi, >> >> Thank you for the link to the blog post series?it is very informative and >> current! If I understand your blog post correctly then I think the answer >> to your previous question about pending AFRs is: no, there are no pending >> AFRs. I have identified one file that is a good test case to try to >> understand what happened after I issued the `gluster volume replace-brick >> ... commit force` a few days ago and then added the same original brick >> back to the volume later. This is the current state of the replica 2 >> distribute/replicate volume: >> >> [root at wingu0 ~]# gluster volume info apps >> >> Volume Name: apps >> Type: Distributed-Replicate >> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >> Status: Started >> Snapshot Count: 0 >> Number of Bricks: 3 x 2 = 6 >> Transport-type: tcp >> Bricks: >> Brick1: wingu3:/mnt/gluster/apps >> Brick2: wingu4:/mnt/gluster/apps >> Brick3: wingu05:/data/glusterfs/sdb/apps >> Brick4: wingu06:/data/glusterfs/sdb/apps >> Brick5: wingu0:/mnt/gluster/apps >> Brick6: wingu05:/data/glusterfs/sdc/apps >> Options Reconfigured: >> diagnostics.client-log-level: DEBUG >> storage.health-check-interval: 10 >> nfs.disable: on >> >> I checked the xattrs of one file that is missing from the volume's FUSE >> mount (though I can read it if I access its full path explicitly), but is >> present in several of the volume's bricks (some with full size, others >> empty): >> >> [root at wingu0 ~]# getfattr -d -m. 
-e hex >> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >> >> getfattr: Removing leading '/' from absolute path names >> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.afr.apps-client-3=0x000000000000000000000000 >> trusted.afr.apps-client-5=0x000000000000000000000000 >> trusted.afr.dirty=0x000000000000000000000000 >> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> >> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names >> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >> >> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names >> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> >> [root at wingu06 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names >> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >> >> According to the trusted.afr.apps-client-xx xattrs this particular file >> should be on bricks with id "apps-client-3" and "apps-client-5". It took me >> a few hours to realize that the brick-id values are recorded in the >> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >> those brick-id values with a volfile backup from before the replace-brick, >> I realized that the files are simply on the wrong brick now as far as >> Gluster is concerned. This particular file is now on the brick for >> "apps-client-4". As an experiment I copied this one file to the two >> bricks listed in the xattrs and I was then able to see the file from the >> FUSE mount (yay!). >> >> Other than replacing the brick, removing it, and then adding the old >> brick on the original server back, there has been no change in the data >> this entire time. Can I change the brick IDs in the volfiles so they >> reflect where the data actually is? Or perhaps script something to reset >> all the xattrs on the files/directories to point to the correct bricks? 
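
A first pass at such a script could simply inventory the suspect entries
before changing anything. The following is only a read-only sketch: the
brick path is a placeholder taken from the examples above, and it lists
zero-size files carrying a trusted.glusterfs.dht.linkto xattr, i.e. DHT
link-to entries whose data lives on another subvolume:

# Sketch: find DHT link-to entries on one brick (run on the brick host).
# BRICK is a placeholder; substitute each brick path in turn.
BRICK=/data/glusterfs/sdb/apps
find "$BRICK" -path "$BRICK/.glusterfs" -prune -o -type f -size 0 -print |
while read -r f; do
    linkto=$(getfattr --absolute-names --only-values \
             -n trusted.glusterfs.dht.linkto "$f" 2>/dev/null) || continue
    # The xattr value names the replica subvolume that holds the real data.
    printf '%s -> %s\n' "$f" "$linkto"
done
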
>> >> Thank you for any help or pointers, >> >> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >> wrote: >> >>> >>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>> >>> >>> On 29/05/19 3:59 AM, Alan Orth wrote: >>> >>> Dear Ravishankar, >>> >>> I'm not sure if Brick4 had pending AFRs because I don't know what that >>> means and it's been a few days so I am not sure I would be able to find >>> that information. >>> >>> When you find some time, have a look at a blog >>> series I wrote about AFR- I've tried to explain what one needs to know to >>> debug replication related issues in it. >>> >>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>> >>> -Ravi >>> >>> >>> Anyways, after wasting a few days rsyncing the old brick to a new host I >>> decided to just try to add the old brick back into the volume instead of >>> bringing it up on the new host. I created a new brick directory on the old >>> host, moved the old brick's contents into that new directory (minus the >>> .glusterfs directory), added the new brick to the volume, and then did >>> Vlad's find/stat trick? from the brick to the FUSE mount point. >>> >>> The interesting problem I have now is that some files don't appear in >>> the FUSE mount's directory listings, but I can actually list them directly >>> and even read them. What could cause that? >>> >>> Not sure, too many variables in the hacks that you did to take a guess. >>> You can check if the contents of the .glusterfs folder are in order on the >>> new brick (example hardlink for files and symlinks for directories are >>> present etc.) . >>> Regards, >>> Ravi >>> >>> >>> Thanks, >>> >>> ? >>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>> >>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N >>> wrote: >>> >>>> >>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>> >>>> Dear list, >>>> >>>> I seem to have gotten into a tricky situation. Today I brought up a >>>> shiny new server with new disk arrays and attempted to replace one brick of >>>> a replica 2 distribute/replicate volume on an older server using the >>>> `replace-brick` command: >>>> >>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>> wingu06:/data/glusterfs/sdb/homes commit force >>>> >>>> The command was successful and I see the new brick in the output of >>>> `gluster volume info`. The problem is that Gluster doesn't seem to be >>>> migrating the data, >>>> >>>> `replace-brick` definitely must heal (not migrate) the data. In your >>>> case, data must have been healed from Brick-4 to the replaced Brick-3. Are >>>> there any errors in the self-heal daemon logs of Brick-4's node? Does >>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>> date. replace-brick command internally does all the setfattr steps that are >>>> mentioned in the doc. >>>> >>>> -Ravi >>>> >>>> >>>> and now the original brick that I replaced is no longer part of the >>>> volume (and a few terabytes of data are just sitting on the old brick): >>>> >>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>> Brick1: wingu4:/mnt/gluster/homes >>>> Brick2: wingu3:/mnt/gluster/homes >>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>> >>>> I see the Gluster docs have a more complicated procedure for replacing >>>> bricks that involves getfattr/setfattr?. How can I tell Gluster about the >>>> old brick? 
I see that I have a backup of the old volfile thanks to yum's >>>> rpmsave function if that helps. >>>> >>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you can >>>> give. >>>> >>>> ? >>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>>> >>>> _______________________________________________ >>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>> >>> -- >>> Alan Orth >>> alan.orth at gmail.com >>> https://picturingjordan.com >>> https://englishbulgaria.net >>> https://mjanja.ch >>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>> >>> >>> _______________________________________________ >>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >> >> -- >> Alan Orth >> alan.orth at gmail.com >> https://picturingjordan.com >> https://englishbulgaria.net >> https://mjanja.ch >> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >> > > > -- > Alan Orth > alan.orth at gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > "In heaven all the interesting people are missing." ?Friedrich Nietzsche > > -- Alan Orth alan.orth at gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch "In heaven all the interesting people are missing." ?Friedrich Nietzsche -------------- next part -------------- An HTML attachment was scrubbed... URL: From zgrep at 139.com Mon Jun 3 06:27:43 2019 From: zgrep at 139.com (=?utf-8?B?WGllIENoYW5nbG9uZw==?=) Date: 03 Jun 2019 14:27:43 +0800 Subject: [Gluster-users] write request hung in write-behind Message-ID: 2019060314274320643802@139.com> Hi all Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in write-behind followed by 1545 FLUSH requests. I found a similar bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not sure if it's the right one. [xlator.performance.write-behind.wb_inode] path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg inode=0x7f51775b71a0 window_conf=1073741824 window_current=293822 transit-size=293822 dontsync=0 [.WRITE] request-ptr=0x7f516eec2060 refcount=1 wound=yes generation-number=1 req->op_ret=293822 req->op_errno=0 sync-attempts=1 sync-in-progress=yes size=293822 offset=1048576 lied=-1 append=0 fulfilled=0 go=-1 [.FLUSH] request-ptr=0x7f517c2badf0 refcount=1 wound=no generation-number=2 req->op_ret=-1 req->op_errno=116 sync-attempts=0 [.FLUSH] request-ptr=0x7f5173e9f7b0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f51640b8ca0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f3979d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f6ac8d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 Any comments would be appreciated! Thanks -Xie -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From rgowdapp at redhat.com Mon Jun 3 06:46:07 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Mon, 3 Jun 2019 12:16:07 +0530 Subject: [Gluster-users] write request hung in write-behind In-Reply-To: <5cf4bde4.1c69fb81.42b08.1408SMTPIN_ADDED_BROKEN@mx.google.com> References: <5cf4bde4.1c69fb81.42b08.1408SMTPIN_ADDED_BROKEN@mx.google.com> Message-ID: On Mon, Jun 3, 2019 at 11:57 AM Xie Changlong wrote: > Hi all > > Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in write-behind > followed by 1545 FLUSH requests. I found a similar > bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not sure > if it's the right one. > > [xlator.performance.write-behind.wb_inode] > path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg > inode=0x7f51775b71a0 > window_conf=1073741824 > window_current=293822 > transit-size=293822 > dontsync=0 > > [.WRITE] > request-ptr=0x7f516eec2060 > refcount=1 > wound=yes > generation-number=1 > req->op_ret=293822 > req->op_errno=0 > sync-attempts=1 > sync-in-progress=yes > Note that the sync is still in progress. This means, write-behind has wound the write-request to its children and yet to receive the response (unless there is a bug in accounting of sync-in-progress). So, its likely that there are callstacks into children of write-behind, which are not complete yet. Are you sure the deepest hung call-stack is in write-behind? Can you check for frames with "complete=0"? size=293822 > offset=1048576 > lied=-1 > append=0 > fulfilled=0 > go=-1 > > [.FLUSH] > request-ptr=0x7f517c2badf0 > refcount=1 > wound=no > generation-number=2 > req->op_ret=-1 > req->op_errno=116 > sync-attempts=0 > > [.FLUSH] > request-ptr=0x7f5173e9f7b0 > refcount=1 > wound=no > generation-number=2 > req->op_ret=0 > req->op_errno=0 > sync-attempts=0 > > [.FLUSH] > request-ptr=0x7f51640b8ca0 > refcount=1 > wound=no > generation-number=2 > req->op_ret=0 > req->op_errno=0 > sync-attempts=0 > > [.FLUSH] > request-ptr=0x7f516f3979d0 > refcount=1 > wound=no > generation-number=2 > req->op_ret=0 > req->op_errno=0 > sync-attempts=0 > > [.FLUSH] > request-ptr=0x7f516f6ac8d0 > refcount=1 > wound=no > generation-number=2 > req->op_ret=0 > req->op_errno=0 > sync-attempts=0 > > > Any comments would be appreciated! > > Thanks > -Xie > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From zgrep at 139.com Mon Jun 3 07:40:47 2019 From: zgrep at 139.com (=?utf-8?B?WGllIENoYW5nbG9uZw==?=) Date: 03 Jun 2019 15:40:47 +0800 Subject: [Gluster-users] write request hung in write-behind Message-ID: 201906031540473580561@139.com> Firstly i correct myself, write request followed by 771(not 1545) FLUSH requests. I've attach gnfs dump file, totally 774 pending call-stacks, 771 of them pending on write-behind and the deepest call-stack is afr. 
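
For reference, a dump like the excerpt below comes from gluster's
statedump facility; a rough sketch of capturing one for the gNFS process
(volume name as in this thread, output landing in the default statedump
directory) would be:

# Sketch: request a statedump of the gNFS server (not the bricks).
# Files appear in /var/run/gluster as glusterdump.<pid>.dump.<timestamp>.
gluster volume statedump cl35vol01 nfs
ls -t /var/run/gluster/glusterdump.*.dump.*
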
[global.callpool.stack.771] stack=0x7f517f557f60 uid=0 gid=0 pid=0 unique=0 lk-owner= op=stack type=0 cnt=3 [global.callpool.stack.771.frame.1] frame=0x7f517f655880 ref_count=0 translator=cl35vol01-replicate-7 complete=0 parent=cl35vol01-dht wind_from=dht_writev wind_to=subvol->fops->writev unwind_to=dht_writev_cbk [global.callpool.stack.771.frame.2] frame=0x7f518ed90340 ref_count=1 translator=cl35vol01-dht complete=0 parent=cl35vol01-write-behind wind_from=wb_fulfill_head wind_to=FIRST_CHILD (frame->this)->fops->writev unwind_to=wb_fulfill_cbk [global.callpool.stack.771.frame.3] frame=0x7f516d3baf10 ref_count=1 translator=cl35vol01-write-behind complete=0 [global.callpool.stack.772] stack=0x7f51607a5a20 uid=0 gid=0 pid=0 unique=0 lk-owner=a0715b77517f0000 op=stack type=0 cnt=1 [global.callpool.stack.772.frame.1] frame=0x7f516ca2d1b0 ref_count=0 translator=cl35vol01-replicate-7 complete=0 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep complete |wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep -E "complete=0" |wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep write-behind |wc -l 771 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep replicate-7 | wc -l 2 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep glusterfs | wc -l 1 ???: Raghavendra Gowdappa ??: 2019/06/03(???)14:46 ???: Xie Changlong; ???: gluster-users; ??: Re: write request hung in write-behind On Mon, Jun 3, 2019 at 11:57 AM Xie Changlong wrote: Hi all Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in write-behind followed by 1545 FLUSH requests. I found a similar bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not sure if it's the right one. [xlator.performance.write-behind.wb_inode] path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg inode=0x7f51775b71a0 window_conf=1073741824 window_current=293822 transit-size=293822 dontsync=0 [.WRITE] request-ptr=0x7f516eec2060 refcount=1 wound=yes generation-number=1 req->op_ret=293822 req->op_errno=0 sync-attempts=1 sync-in-progress=yes Note that the sync is still in progress. This means, write-behind has wound the write-request to its children and yet to receive the response (unless there is a bug in accounting of sync-in-progress). So, its likely that there are callstacks into children of write-behind, which are not complete yet. Are you sure the deepest hung call-stack is in write-behind? Can you check for frames with "complete=0"? 
size=293822 offset=1048576 lied=-1 append=0 fulfilled=0 go=-1 [.FLUSH] request-ptr=0x7f517c2badf0 refcount=1 wound=no generation-number=2 req->op_ret=-1 req->op_errno=116 sync-attempts=0 [.FLUSH] request-ptr=0x7f5173e9f7b0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f51640b8ca0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f3979d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f6ac8d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 Any comments would be appreciated! Thanks -Xie -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: glusterdump.20106.dump.1559038081 Type: application/octet-stream Size: 678986 bytes Desc: not available URL: From ravishankar at redhat.com Mon Jun 3 16:40:00 2019 From: ravishankar at redhat.com (Ravishankar N) Date: Mon, 3 Jun 2019 22:10:00 +0530 Subject: [Gluster-users] Does replace-brick migrate data? In-Reply-To: References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> Message-ID: <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com> On 01/06/19 9:37 PM, Alan Orth wrote: > Dear Ravi, > > The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I > could verify them for six bricks and millions of files, though... :\ Hi Alan, The reason I asked this is because you had mentioned in one of your earlier emails that when you moved content from the old brick to the new one, you had skipped the .glusterfs directory. So I was assuming that when you added back this new brick to the cluster, it might have been missing the .glusterfs entries. If that is the cae, one way to verify could be to check using a script if all files on the brick have a link-count of at least 2 and all dirs have valid symlinks inside .glusterfs pointing to themselves. > > I had a small success in fixing some issues with duplicated files on > the FUSE mount point yesterday. I read quite a bit about the elastic > hashing algorithm that determines which files get placed on which > bricks based on the hash of their filename and the > trusted.glusterfs.dht xattr on brick directories (thanks to Joe > Julian's blog post and Python script for showing how it works?). With > that knowledge I looked closer at one of the files that was appearing > as duplicated on the FUSE mount and found that it was also duplicated > on more than `replica 2` bricks. For this particular file I found two > "real" files and several zero-size files with > trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were > on the correct brick as far as the DHT layout is concerned, so I > copied one of them to the correct brick, deleted the others and their > hard links, and did a `stat` on the file from the FUSE mount point and > it fixed itself. Yay! > > Could this have been caused by a replace-brick that got interrupted > and didn't finish re-labeling the xattrs? No, replace-brick only initiates AFR self-heal, which just copies the contents from the other brick(s) of the *same* replica pair into the replaced brick.? The link-to files are created by DHT when you rename a file from the client. If the new name hashes to a different? 
brick, DHT does not move the entire file there. It instead creates the link-to file (the one with the dht.linkto xattrs) on the hashed subvol. The value of this xattr points to the brick where the actual data is there (`getfattr -e text` to see it for yourself).? Perhaps you had attempted a rebalance or remove-brick earlier and interrupted that? > Should I be thinking of some heuristics to identify and fix these > issues with a script (incorrect brick placement), or is this something > a fix layout or repeated volume heals can fix? I've already completed > a whole heal on this particular volume this week and it did heal about > 1,000,000 files (mostly data and metadata, but about 20,000 entry > heals as well). > Maybe you should let the AFR self-heals complete first and then attempt a full rebalance to take care of the dht link-to files. But? if the files are in millions, it could take quite some time to complete. Regards, Ravi > Thanks for your support, > > ? https://joejulian.name/post/dht-misses-are-expensive/ > > On Fri, May 31, 2019 at 7:57 AM Ravishankar N > wrote: > > > On 31/05/19 3:20 AM, Alan Orth wrote: >> Dear Ravi, >> >> I spent a bit of time inspecting the xattrs on some files and >> directories on a few bricks for this volume and it looks a bit >> messy. Even if I could make sense of it for a few and potentially >> heal them manually, there are millions of files and directories >> in total so that's definitely not a scalable solution. After a >> few missteps with `replace-brick ... commit force` in the last >> week?one of which on a brick that was dead/offline?as well as >> some premature `remove-brick` commands, I'm unsure how how to >> proceed and I'm getting demotivated. It's scary how quickly >> things get out of hand in distributed systems... > Hi Alan, > The one good thing about gluster is it that the data is always > available directly on the backed bricks even if your volume has > inconsistencies at the gluster level. So theoretically, if your > cluster is FUBAR, you could just create a new volume and copy all > data onto it via its mount from the old volume's bricks. >> >> I had hoped that bringing the old brick back up would help, but >> by the time I added it again a few days had passed and all the >> brick-id's had changed due to the replace/remove brick commands, >> not to mention that the trusted.afr.$volume-client-xx values were >> now probably pointing to the wrong bricks (?). >> >> Anyways, a few hours ago I started a full heal on the volume and >> I see that there is a sustained 100MiB/sec of network traffic >> going from the old brick's host to the new one. The completed >> heals reported in the logs look promising too: >> >> Old brick host: >> >> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o >> -E 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >> ?281614 Completed data selfheal >> ? ? ?84 Completed entry selfheal >> ?299648 Completed metadata selfheal >> >> New brick host: >> >> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o >> -E 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >> ?198256 Completed data selfheal >> ? 16829 Completed entry selfheal >> ?229664 Completed metadata selfheal >> >> So that's good I guess, though I have no idea how long it will >> take or if it will fix the "missing files" issue on the FUSE >> mount. I've increased cluster.shd-max-threads to 8 to hopefully >> speed up the heal process. > The afr xattrs should not cause files to disappear from mount. 
If > the xattr names do not match what each AFR subvol expects (for eg. > in a replica 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, > client-{2,3} for 2nd subvol and so on - ) for its children then it > won't heal the data, that is all. But in your case I see some > inconsistencies like one brick having the actual file > (licenseserver.cfg) and the other having a linkto file (the one > with thedht.linkto xattr) /in the same replica pair/. >> >> I'd be happy for any advice or pointers, > > Did you check if the .glusterfs hardlinks/symlinks exist and are > in order for all bricks? > > -Ravi > >> >> On Wed, May 29, 2019 at 5:20 PM Alan Orth > > wrote: >> >> Dear Ravi, >> >> Thank you for the link to the blog post series?it is very >> informative and current! If I understand your blog post >> correctly then I think the answer to your previous question >> about pending AFRs is: no, there are no pending AFRs. I have >> identified one file that is a good test case to try to >> understand what happened after I issued the `gluster volume >> replace-brick ... commit force` a few days ago and then added >> the same original brick back to the volume later. This is the >> current state of the replica 2 distribute/replicate volume: >> >> [root at wingu0 ~]# gluster volume info apps >> >> Volume Name: apps >> Type: Distributed-Replicate >> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >> Status: Started >> Snapshot Count: 0 >> Number of Bricks: 3 x 2 = 6 >> Transport-type: tcp >> Bricks: >> Brick1: wingu3:/mnt/gluster/apps >> Brick2: wingu4:/mnt/gluster/apps >> Brick3: wingu05:/data/glusterfs/sdb/apps >> Brick4: wingu06:/data/glusterfs/sdb/apps >> Brick5: wingu0:/mnt/gluster/apps >> Brick6: wingu05:/data/glusterfs/sdc/apps >> Options Reconfigured: >> diagnostics.client-log-level: DEBUG >> storage.health-check-interval: 10 >> nfs.disable: on >> >> I checked the xattrs of one file that is missing from the >> volume's FUSE mount (though I can read it if I access its >> full path explicitly), but is present in several of the >> volume's bricks (some with full size, others empty): >> >> [root at wingu0 ~]# getfattr -d -m. -e hex >> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >> >> getfattr: Removing leading '/' from absolute path names # >> file: >> mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.afr.apps-client-3=0x000000000000000000000000 >> trusted.afr.apps-client-5=0x000000000000000000000000 >> trusted.afr.dirty=0x000000000000000000000000 >> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd [root at wingu05 >> ~]# getfattr -d -m. -e hex >> /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names # >> file: >> data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >> [root at wingu05 ~]# getfattr -d -m. 
-e hex >> /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names # >> file: >> data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> [root at wingu06 ~]# getfattr -d -m. -e hex >> /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> getfattr: Removing leading '/' from absolute path names # >> file: >> data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >> >> According to the trusted.afr.apps-client-xxxattrs this >> particular file should be on bricks with id "apps-client-3" >> and "apps-client-5". It took me a few hours to realize that >> the brick-id values are recorded in the volume's volfiles in >> /var/lib/glusterd/vols/apps/bricks. After comparing those >> brick-id values with a volfile backup from before the >> replace-brick, I realized that the files are simply on the >> wrong brick now as far as Gluster is concerned. This >> particular file is now on the brick for "apps-client-4". As >> an experiment I copied this one file to the two bricks listed >> in the xattrs and I was then able to see the file from the >> FUSE mount (yay!). >> >> Other than replacing the brick, removing it, and then adding >> the old brick on the original server back, there has been no >> change in the data this entire time. Can I change the brick >> IDs in the volfiles so they reflect where the data actually >> is? Or perhaps script something to reset all the xattrs on >> the files/directories to point to the correct bricks? >> >> Thank you for any help or pointers, >> >> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >> > wrote: >> >> >> On 29/05/19 9:50 AM, Ravishankar N wrote: >>> >>> >>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>> Dear Ravishankar, >>>> >>>> I'm not sure if Brick4 had pending AFRs because I don't >>>> know what that means and it's been a few days so I am >>>> not sure I would be able to find that information. >>> When you find some time, have a look at a blog >>> series I wrote about AFR- I've >>> tried to explain what one needs to know to debug >>> replication related issues in it. >> >> Made a typo error. The URL for the blog is >> https://wp.me/peiBB-6b >> >> -Ravi >> >>>> >>>> Anyways, after wasting a few days rsyncing the old >>>> brick to a new host I decided to just try to add the >>>> old brick back into the volume instead of bringing it >>>> up on the new host. I created a new brick directory on >>>> the old host, moved the old brick's contents into that >>>> new directory (minus the .glusterfs directory), added >>>> the new brick to the volume, and then did Vlad's >>>> find/stat trick? from the brick to the FUSE mount point. >>>> >>>> The interesting problem I have now is that some files >>>> don't appear in the FUSE mount's directory listings, >>>> but I can actually list them directly and even read >>>> them. What could cause that? 
>>> Not sure, too many variables in the hacks that you did >>> to take a guess. You can check if the contents of the >>> .glusterfs folder are in order on the new brick (example >>> hardlink for files and symlinks for directories are >>> present etc.) . >>> Regards, >>> Ravi >>>> >>>> Thanks, >>>> >>>> ? >>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>> >>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N >>>> >>> > wrote: >>>> >>>> >>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>> Dear list, >>>>> >>>>> I seem to have gotten into a tricky situation. >>>>> Today I brought up a shiny new server with new >>>>> disk arrays and attempted to replace one brick of >>>>> a replica 2 distribute/replicate volume on an >>>>> older server using the `replace-brick` command: >>>>> >>>>> # gluster volume replace-brick homes >>>>> wingu0:/mnt/gluster/homes >>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>> >>>>> The command was successful and I see the new brick >>>>> in the output of `gluster volume info`. The >>>>> problem is that Gluster doesn't seem to be >>>>> migrating the data, >>>> >>>> `replace-brick` definitely must heal (not migrate) >>>> the data. In your case, data must have been healed >>>> from Brick-4 to the replaced Brick-3. Are there any >>>> errors in the self-heal daemon logs of Brick-4's >>>> node? Does Brick-4 have pending AFR xattrs blaming >>>> Brick-3? The doc is a bit out of date. >>>> replace-brick command internally does all the >>>> setfattr steps that are mentioned in the doc. >>>> >>>> -Ravi >>>> >>>> >>>>> and now the original brick that I replaced is no >>>>> longer part of the volume (and a few terabytes of >>>>> data are just sitting on the old brick): >>>>> >>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>> Brick1: wingu4:/mnt/gluster/homes >>>>> Brick2: wingu3:/mnt/gluster/homes >>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>> >>>>> I see the Gluster docs have a more complicated >>>>> procedure for replacing bricks that involves >>>>> getfattr/setfattr?. How can I tell Gluster about >>>>> the old brick? I see that I have a backup of the >>>>> old volfile thanks to yum's rpmsave function if >>>>> that helps. >>>>> >>>>> We are using Gluster 5.6 on CentOS 7. Thank you >>>>> for any advice you can give. >>>>> >>>>> ? >>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are >>>>> missing." ?Friedrich Nietzsche >>>>> >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." 
>>>> -Friedrich Nietzsche
>>>>
>>>> _______________________________________________
>>>> Gluster-users mailing list
>>>> Gluster-users at gluster.org
>>>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>>>
>>>>
>>>
>>> --
>>> Alan Orth
>>> alan.orth at gmail.com
>>> https://picturingjordan.com
>>> https://englishbulgaria.net
>>> https://mjanja.ch
>>> "In heaven all the interesting people are missing." -Friedrich Nietzsche
>>>
>>>
>>> _______________________________________________
>>> Gluster-users mailing list
>>> Gluster-users at gluster.org
>>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>
>>
>> --
>> Alan Orth
>> alan.orth at gmail.com
>> https://picturingjordan.com
>> https://englishbulgaria.net
>> https://mjanja.ch
>> "In heaven all the interesting people are missing." -Friedrich Nietzsche
>
>
> --
> Alan Orth
> alan.orth at gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
> "In heaven all the interesting people are missing." -Friedrich Nietzsche

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." -Friedrich Nietzsche
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From snowmailer at gmail.com  Mon Jun  3 16:58:01 2019
From: snowmailer at gmail.com (Martin)
Date: Mon, 3 Jun 2019 18:58:01 +0200
Subject: [Gluster-users] No healing on peer disconnect - is it correct?
Message-ID: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>

Hi all,

I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have a simple Replica 3 - Number of Bricks: 1 x 3 = 3.

When one of my hypervisors is disconnected as a peer, i.e. the gluster process is down but the bricks are running, the other two healthy nodes start signalling that they lost one peer. This is correct.
Next, I restart the gluster process on the node where it failed. I thought this should trigger healing of files on the failed node, but nothing is happening.

I run VM disks on this gluster volume. No healing is triggered after the gluster restart, the remaining two nodes get the peer back after the restart, and everything is running without downtime.
Even VMs that are running on the "failed" node where the gluster process was down (bricks were up) are running without downtime.

Is this behaviour correct?
I mean No healing is triggered after peer is reconnected back and VMs. > > Thanks for explanation. > > BR! > Martin > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From dcunningham at voisonics.com Mon Jun 3 22:15:21 2019 From: dcunningham at voisonics.com (David Cunningham) Date: Tue, 4 Jun 2019 10:15:21 +1200 Subject: [Gluster-users] Transport endpoint is not connected In-Reply-To: <20r8rlguxb86gpnxjwe3wpqw.1559189511842@email.android.com> References: <20r8rlguxb86gpnxjwe3wpqw.1559189511842@email.android.com> Message-ID: Hello all, We confirmed that the network provider blocking port 49152 was the issue. Thanks for all the help. On Thu, 30 May 2019 at 16:11, Strahil wrote: > You can try to run a ncat from gfs3: > > ncat -z -v gfs1 49152 > ncat -z -v gfs2 49152 > > If ncat fails to connect -> it's definately a firewall. > > Best Regards, > Strahil Nikolov > On May 30, 2019 01:33, David Cunningham wrote: > > Hi Ravi, > > I think it probably is a firewall issue with the network provider. I was > hoping to see a specific connection failure message we could send to them, > but will take it up with them anyway. > > Thanks for your help. > > > On Wed, 29 May 2019 at 23:10, Ravishankar N > wrote: > > I don't see a "Connected to gvol0-client-1" in the log. Perhaps a > firewall issue like the last time? Even in the earlier add-brick log from > the other email thread, connection to the 2nd brick was not established. > > -Ravi > On 29/05/19 2:26 PM, David Cunningham wrote: > > Hi Ravi and Joe, > > The command "gluster volume status gvol0" shows all 3 nodes as being > online, even on gfs3 as below. I've attached the glfsheal-gvol0.log, in > which I can't see anything like a connection error. Would you have any > further suggestions? Thank you. > > [root at gfs3 glusterfs]# gluster volume status gvol0 > Status of volume: gvol0 > Gluster process TCP Port RDMA Port Online > Pid > > ------------------------------------------------------------------------------ > Brick gfs1:/nodirectwritedata/gluster/gvol0 49152 0 Y > 7706 > Brick gfs2:/nodirectwritedata/gluster/gvol0 49152 0 Y > 7625 > Brick gfs3:/nodirectwritedata/gluster/gvol0 49152 0 Y > 7307 > Self-heal Daemon on localhost N/A N/A Y > 7316 > Self-heal Daemon on gfs1 N/A N/A Y > 40591 > Self-heal Daemon on gfs2 N/A N/A Y > 7634 > > Task Status of Volume gvol0 > > ------------------------------------------------------------------------------ > There are no active volume tasks > > > On Wed, 29 May 2019 at 16:26, Ravishankar N > wrote: > > > On 29/05/19 6:21 AM, David Cunningham wrote: > > -- David Cunningham, Voisonics Limited http://voisonics.com/ USA: +1 213 221 1092 New Zealand: +64 (0)28 2558 3782 -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Tue Jun 4 01:55:25 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Tue, 4 Jun 2019 07:25:25 +0530 Subject: [Gluster-users] write request hung in write-behind In-Reply-To: <5cf4cf0d.1c69fb81.9003f.c502SMTPIN_ADDED_BROKEN@mx.google.com> References: <5cf4cf0d.1c69fb81.9003f.c502SMTPIN_ADDED_BROKEN@mx.google.com> Message-ID: On Mon, Jun 3, 2019 at 1:11 PM Xie Changlong wrote: > Firstly i correct myself, write request followed by 771(not 1545) FLUSH > requests. I've attach gnfs dump file, totally 774 pending call-stacks, > 771 of them pending on write-behind and the deepest call-stack is afr. 
> +Ravishankar Narayanankutty +Karampuri, Pranith Are you sure these were not call-stacks of in-progress ops? One way of confirming that would be to take statedumps periodically (say 3 min apart). Hung call stacks will be common to all the statedumps. > [global.callpool.stack.771] > stack=0x7f517f557f60 > uid=0 > gid=0 > pid=0 > unique=0 > lk-owner= > op=stack > type=0 > cnt=3 > > [global.callpool.stack.771.frame.1] > frame=0x7f517f655880 > ref_count=0 > translator=cl35vol01-replicate-7 > complete=0 > parent=cl35vol01-dht > wind_from=dht_writev > wind_to=subvol->fops->writev > unwind_to=dht_writev_cbk > > [global.callpool.stack.771.frame.2] > frame=0x7f518ed90340 > ref_count=1 > translator=cl35vol01-dht > complete=0 > parent=cl35vol01-write-behind > wind_from=wb_fulfill_head > wind_to=FIRST_CHILD (frame->this)->fops->writev > unwind_to=wb_fulfill_cbk > > [global.callpool.stack.771.frame.3] > frame=0x7f516d3baf10 > ref_count=1 > translator=cl35vol01-write-behind > complete=0 > > [global.callpool.stack.772] > stack=0x7f51607a5a20 > uid=0 > gid=0 > pid=0 > unique=0 > lk-owner=a0715b77517f0000 > op=stack > type=0 > cnt=1 > > [global.callpool.stack.772.frame.1] > frame=0x7f516ca2d1b0 > ref_count=0 > translator=cl35vol01-replicate-7 > complete=0 > > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep translator | wc -l > 774 > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep complete |wc -l > 774 > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep -E "complete=0" |wc -l > 774 > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep translator | grep write-behind > |wc -l > 771 > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep translator | grep replicate-7 | > wc -l > 2 > [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 > glusterdump.20106.dump.1559038081 |grep translator | grep glusterfs | wc > -l > 1 > > > > > ???: Raghavendra Gowdappa > ??: 2019/06/03(???)14:46 > ???: Xie Changlong ; > ???: gluster-users ; > ??: Re: write request hung in write-behind > > > > On Mon, Jun 3, 2019 at 11:57 AM Xie Changlong wrote: > >> Hi all >> >> Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in write-behind >> followed by 1545 FLUSH requests. I found a similar >> bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not sure >> if it's the right one. >> >> [xlator.performance.write-behind.wb_inode] >> path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg >> inode=0x7f51775b71a0 >> window_conf=1073741824 >> window_current=293822 >> transit-size=293822 >> dontsync=0 >> >> [.WRITE] >> request-ptr=0x7f516eec2060 >> refcount=1 >> wound=yes >> generation-number=1 >> req->op_ret=293822 >> req->op_errno=0 >> sync-attempts=1 >> sync-in-progress=yes >> > > Note that the sync is still in progress. This means, write-behind has > wound the write-request to its children and yet to receive the response > (unless there is a bug in accounting of sync-in-progress). So, its likely > that there are callstacks into children of write-behind, which are not > complete yet. Are you sure the deepest hung call-stack is in write-behind? > Can you check for frames with "complete=0"? 
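
One way to answer that against the attached dump file (a sketch, using the
dump name quoted earlier in this thread) is:

# Count the frames that never completed ...
grep -c 'complete=0' glusterdump.20106.dump.1559038081
# ... and see which translator each incomplete frame is parked in.
grep -E '^(translator|complete)=' glusterdump.20106.dump.1559038081 \
    | grep -B1 'complete=0' | grep '^translator=' | sort | uniq -c
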
> > size=293822 >> offset=1048576 >> lied=-1 >> append=0 >> fulfilled=0 >> go=-1 >> >> [.FLUSH] >> request-ptr=0x7f517c2badf0 >> refcount=1 >> wound=no >> generation-number=2 >> req->op_ret=-1 >> req->op_errno=116 >> sync-attempts=0 >> >> [.FLUSH] >> request-ptr=0x7f5173e9f7b0 >> refcount=1 >> wound=no >> generation-number=2 >> req->op_ret=0 >> req->op_errno=0 >> sync-attempts=0 >> >> [.FLUSH] >> request-ptr=0x7f51640b8ca0 >> refcount=1 >> wound=no >> generation-number=2 >> req->op_ret=0 >> req->op_errno=0 >> sync-attempts=0 >> >> [.FLUSH] >> request-ptr=0x7f516f3979d0 >> refcount=1 >> wound=no >> generation-number=2 >> req->op_ret=0 >> req->op_errno=0 >> sync-attempts=0 >> >> [.FLUSH] >> request-ptr=0x7f516f6ac8d0 >> refcount=1 >> wound=no >> generation-number=2 >> req->op_ret=0 >> req->op_errno=0 >> sync-attempts=0 >> >> >> Any comments would be appreciated! >> >> Thanks >> -Xie >> >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From zgrep at 139.com Tue Jun 4 02:06:24 2019 From: zgrep at 139.com (=?utf-8?B?WGllIENoYW5nbG9uZw==?=) Date: 04 Jun 2019 10:06:24 +0800 Subject: [Gluster-users] write request hung in write-behind Message-ID: 201906041006244014963@139.com> To me, all 'df' commands on specific(not all) nfs client hung forever. The temporary solution is disable performance.nfs.write-behind and cluster.eager-lock. I'll try to get more info back if encounter this problem again . ???: Raghavendra Gowdappa ??: 2019/06/04(???)09:55 ???: Xie Changlong;Ravishankar Narayanankutty;Karampuri, Pranith; ???: gluster-users; ??: Re: Re: write request hung in write-behind On Mon, Jun 3, 2019 at 1:11 PM Xie Changlong wrote: Firstly i correct myself, write request followed by 771(not 1545) FLUSH requests. I've attach gnfs dump file, totally 774 pending call-stacks, 771 of them pending on write-behind and the deepest call-stack is afr. +Ravishankar Narayanankutty +Karampuri, Pranith Are you sure these were not call-stacks of in-progress ops? One way of confirming that would be to take statedumps periodically (say 3 min apart). Hung call stacks will be common to all the statedumps. 
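
As a sketch, that check could be scripted like so (the volume name and the
gNFS target are assumptions based on this thread; statedumps land in
/var/run/gluster by default, and the request pointers give a rough key for
matching frames across dumps):

# Take two statedumps of the gNFS process a few minutes apart ...
gluster volume statedump cl35vol01 nfs
sleep 180
gluster volume statedump cl35vol01 nfs
# ... then keep only the request pointers present in both dumps:
cd /var/run/gluster
new=$(ls -t glusterdump.*.dump.* | sed -n 1p)
old=$(ls -t glusterdump.*.dump.* | sed -n 2p)
comm -12 <(grep 'request-ptr=' "$old" | sort) \
         <(grep 'request-ptr=' "$new" | sort)
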
[global.callpool.stack.771] stack=0x7f517f557f60 uid=0 gid=0 pid=0 unique=0 lk-owner= op=stack type=0 cnt=3 [global.callpool.stack.771.frame.1] frame=0x7f517f655880 ref_count=0 translator=cl35vol01-replicate-7 complete=0 parent=cl35vol01-dht wind_from=dht_writev wind_to=subvol->fops->writev unwind_to=dht_writev_cbk [global.callpool.stack.771.frame.2] frame=0x7f518ed90340 ref_count=1 translator=cl35vol01-dht complete=0 parent=cl35vol01-write-behind wind_from=wb_fulfill_head wind_to=FIRST_CHILD (frame->this)->fops->writev unwind_to=wb_fulfill_cbk [global.callpool.stack.771.frame.3] frame=0x7f516d3baf10 ref_count=1 translator=cl35vol01-write-behind complete=0 [global.callpool.stack.772] stack=0x7f51607a5a20 uid=0 gid=0 pid=0 unique=0 lk-owner=a0715b77517f0000 op=stack type=0 cnt=1 [global.callpool.stack.772.frame.1] frame=0x7f516ca2d1b0 ref_count=0 translator=cl35vol01-replicate-7 complete=0 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep complete |wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep -E "complete=0" |wc -l 774 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep write-behind |wc -l 771 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep replicate-7 | wc -l 2 [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 glusterdump.20106.dump.1559038081 |grep translator | grep glusterfs | wc -l 1 ???: Raghavendra Gowdappa ??: 2019/06/03(???)14:46 ???: Xie Changlong; ???: gluster-users; ??: Re: write request hung in write-behind On Mon, Jun 3, 2019 at 11:57 AM Xie Changlong wrote: Hi all Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in write-behind followed by 1545 FLUSH requests. I found a similar bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not sure if it's the right one. [xlator.performance.write-behind.wb_inode] path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg inode=0x7f51775b71a0 window_conf=1073741824 window_current=293822 transit-size=293822 dontsync=0 [.WRITE] request-ptr=0x7f516eec2060 refcount=1 wound=yes generation-number=1 req->op_ret=293822 req->op_errno=0 sync-attempts=1 sync-in-progress=yes Note that the sync is still in progress. This means, write-behind has wound the write-request to its children and yet to receive the response (unless there is a bug in accounting of sync-in-progress). So, its likely that there are callstacks into children of write-behind, which are not complete yet. Are you sure the deepest hung call-stack is in write-behind? Can you check for frames with "complete=0"? 
size=293822 offset=1048576 lied=-1 append=0 fulfilled=0 go=-1 [.FLUSH] request-ptr=0x7f517c2badf0 refcount=1 wound=no generation-number=2 req->op_ret=-1 req->op_errno=116 sync-attempts=0 [.FLUSH] request-ptr=0x7f5173e9f7b0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f51640b8ca0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f3979d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 [.FLUSH] request-ptr=0x7f516f6ac8d0 refcount=1 wound=no generation-number=2 req->op_ret=0 req->op_errno=0 sync-attempts=0 Any comments would be appreciated! Thanks -Xie -------------- next part -------------- An HTML attachment was scrubbed... URL: From abhishpaliwal at gmail.com Tue Jun 4 10:09:59 2019 From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL) Date: Tue, 4 Jun 2019 15:39:59 +0530 Subject: [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Team, Please respond on the issue which I raised. Regards, Abhishek On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL wrote: > Anyone please reply.... > > On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL > wrote: > >> Hi Team, >> >> I upload some valgrind logs from my gluster 5.4 setup. This is writing to >> the volume every 15 minutes. I stopped glusterd and then copy away the >> logs. The test was running for some simulated days. They are zipped in >> valgrind-54.zip. >> >> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >> glusterfs and even some definitely lost bytes. >> >> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record 391 >> of 391 >> ==2737== at 0x4C29C25: calloc (in >> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >> ==2737== by 0xA22485E: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA217C94: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21D9F8: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21DED9: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21E685: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA1B9D8C: init (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >> /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >> ==2737== >> ==2737== LEAK SUMMARY: >> ==2737== definitely lost: 1,053 bytes in 10 blocks >> ==2737== indirectly lost: 317 bytes in 3 blocks >> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >> ==2737== still reachable: 53,277 bytes in 201 blocks >> ==2737== suppressed: 0 bytes in 0 blocks >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> > -- Regards Abhishek Paliwal -------------- next part -------------- An HTML attachment was scrubbed... URL: From zgrep at 139.com Tue Jun 4 11:33:54 2019 From: zgrep at 139.com (=?utf-8?B?WGllIENoYW5nbG9uZw==?=) Date: 04 Jun 2019 19:33:54 +0800 Subject: [Gluster-users] GETXATTR op pending on index xlator for more than 10 hours Message-ID: 2019060419335438074695@139.com> Hi all, Today, i found gnfs GETXATTR bailing out on gluster release 3.12.0. 
I have a simple 4*2 Distributed-Replicate volume.

[2019-06-03 19:58:33.085880] E [rpc-clnt.c:185:Call_bail] 0-cl25vol01-client-4: bailing out frame type(GlusterFS 3.3) op(GETXATTR(18)) xid=0x21de4275 sent = 2019-06-03 19:28:30.552356. timeout = 1800 for 10.3.133.57:49153

xid = 0x21de4275 = 568214133

Then I dumped brick 10.3.133.57:49153 and found the GETXATTR op pending on the index xlator for more than 10 hours!

[root at node0001 gluster]# grep -rn 568214133 gluster-brick-1-cl25vol01.6078.dump.15596*
gluster-brick-1-cl25vol01.6078.dump.1559617125:5093:unique=568214133
gluster-brick-1-cl25vol01.6078.dump.1559618121:5230:unique=568214133
gluster-brick-1-cl25vol01.6078.dump.1559618912:5434:unique=568214133
gluster-brick-1-cl25vol01.6078.dump.1559628467:6921:unique=568214133
[root at node0001 gluster]# date -d @1559617125
Tue Jun 4 10:58:45 CST 2019
[root at node0001 gluster]# date -d @1559628467
Tue Jun 4 14:07:47 CST 2019

[global.callpool.stack.115]
stack=0x7f8b342623c0
uid=500
gid=500
pid=-6
unique=568214133
lk-owner=faffffff
op=stack
type=0
cnt=4

[global.callpool.stack.115.frame.1]
frame=0x7f8b1d6fb540
ref_count=0
translator=cl25vol01-index
complete=0
parent=cl25vol01-quota
wind_from=quota_getxattr
wind_to=(this->children->xlator)->fops->getxattr
unwind_to=default_getxattr_cbk

[global.callpool.stack.115.frame.2]
frame=0x7f8b30a14da0
ref_count=1
translator=cl25vol01-quota
complete=0
parent=cl25vol01-io-stats
wind_from=io_stats_getxattr
wind_to=(this->children->xlator)->fops->getxattr
unwind_to=io_stats_getxattr_cbk

[global.callpool.stack.115.frame.3]
frame=0x7f8b6debada0
ref_count=1
translator=cl25vol01-io-stats
complete=0
parent=cl25vol01-server
wind_from=server_getxattr_resume
wind_to=FIRST_CHILD(this)->fops->getxattr
unwind_to=server_getxattr_cbk

[global.callpool.stack.115.frame.4]
frame=0x7f8b21962a60
ref_count=1
translator=cl25vol01-server
complete=0

I've checked the code logic and got nothing; any advice? I still have the scene on my side, so we can dig more.

Thanks

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From hunter86_bg at yahoo.com Tue Jun 4 11:48:02 2019
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Tue, 4 Jun 2019 11:48:02 +0000 (UTC)
Subject: [Gluster-users] Transport endpoint is not connected
In-Reply-To: 
References: <20r8rlguxb86gpnxjwe3wpqw.1559189511842@email.android.com>
Message-ID: <863936144.3309002.1559648882741@mail.yahoo.com>

Hi David,

You can ensure that 49152-49160 are opened in advance... You never know when you will need to deploy another Gluster volume.

Best Regards,
Strahil Nikolov

On Monday, June 3, 2019, 18:16:00 GMT-4, David Cunningham wrote:

Hello all,

We confirmed that the network provider blocking port 49152 was the issue. Thanks for all the help.

On Thu, 30 May 2019 at 16:11, Strahil wrote:

You can try to run ncat from gfs3:

ncat -z -v gfs1 49152
ncat -z -v gfs2 49152

If ncat fails to connect -> it's definitely a firewall.

Best Regards,
Strahil Nikolov

On May 30, 2019 01:33, David Cunningham wrote:

Hi Ravi,

I think it probably is a firewall issue with the network provider. I was hoping to see a specific connection failure message we could send to them, but will take it up with them anyway. Thanks for your help.

On Wed, 29 May 2019 at 23:10, Ravishankar N wrote:

I don't see a "Connected to gvol0-client-1" in the log.
Perhaps a firewall issue like the last time? Even in the earlier add-brick log from the other email thread, the connection to the 2nd brick was not established.
-Ravi

On 29/05/19 2:26 PM, David Cunningham wrote:

Hi Ravi and Joe,

The command "gluster volume status gvol0" shows all 3 nodes as being online, even on gfs3 as below. I've attached the glfsheal-gvol0.log, in which I can't see anything like a connection error. Would you have any further suggestions? Thank you.

[root at gfs3 glusterfs]# gluster volume status gvol0
Status of volume: gvol0
Gluster process                             TCP Port  RDMA Port  Online  Pid
------------------------------------------------------------------------------
Brick gfs1:/nodirectwritedata/gluster/gvol0 49152     0          Y       7706
Brick gfs2:/nodirectwritedata/gluster/gvol0 49152     0          Y       7625
Brick gfs3:/nodirectwritedata/gluster/gvol0 49152     0          Y       7307
Self-heal Daemon on localhost               N/A       N/A        Y       7316
Self-heal Daemon on gfs1                    N/A       N/A        Y       40591
Self-heal Daemon on gfs2                    N/A       N/A        Y       7634

Task Status of Volume gvol0
------------------------------------------------------------------------------
There are no active volume tasks

On Wed, 29 May 2019 at 16:26, Ravishankar N wrote:

On 29/05/19 6:21 AM, David Cunningham wrote:

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From khiremat at redhat.com Tue Jun 4 11:57:26 2019
From: khiremat at redhat.com (Kotresh Hiremath Ravishankar)
Date: Tue, 4 Jun 2019 17:27:26 +0530
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Could you please try adding /usr/sbin to $PATH for user 'sas'? If it's bash, add 'export PATH=/usr/sbin:$PATH' in /home/sas/.bashrc

On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan wrote:

> Hi Kotresh
> Please find the logs for the above error.
> *Master log snippet*
>
>> [2019-06-04 11:52:09.254731] I [resource(worker /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing SSH connection between master and slave...
>> [2019-06-04 11:52:09.308923] D [repce(worker /home/sas/gluster/data/code-misc):196:push] RepceClient: call 89724:139652759443264:1559649129.31 __repce_version__() ...
>> [2019-06-04 11:52:09.602792] E [syncdutils(worker >> /home/sas/gluster/data/code-misc):311:log_raise_exception] : >> connection to peer is broken >> [2019-06-04 11:52:09.603312] E [syncdutils(worker >> /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error >> cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i >> /var/lib/ glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S >> /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock >> sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas@ >> 192.168.185.107::code-misc --master-node 192.168.185.106 >> --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick >> /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node- >> id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 >> --slave-log-level DEBUG --slave-gluster-log-level INFO >> --slave-gluster-command-dir /usr/sbin error=1 >> [2019-06-04 11:52:09.614996] I [repce(agent >> /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating >> on reaching EOF. >> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor: >> worker(/home/sas/gluster/data/code-misc) connected >> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor: >> worker died in startup phase brick=/home/sas/gluster/data/code-misc >> [2019-06-04 11:52:09.619391] I >> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status >> Change status=Faulty >> > > *Slave log snippet* > >> [2019-06-04 11:50:09.782668] E [syncdutils(slave >> 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen: >> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) >> [2019-06-04 11:50:11.188167] W [gsyncd(slave >> 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : >> Session config file not exists, using the default config >> path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf >> [2019-06-04 11:50:11.201070] I [resource(slave >> 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] GLUSTER: >> Mounting gluster volume locally... >> [2019-06-04 11:50:11.271231] E [resource(slave >> 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] >> MountbrokerMounter: glusterd answered mnt= >> [2019-06-04 11:50:11.271998] E [syncdutils(slave >> 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen: >> command returned error cmd=/usr/sbin/gluster --remote-host=localhost >> system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO >> log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log >> volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1 >> [2019-06-04 11:50:11.272113] E [syncdutils(slave >> 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen: >> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) > > > On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan > wrote: > >> Hi >> As discussed I have upgraded gluster from 4.1 to 6.2 version. But the Geo >> replication failed to start. >> Stays in faulty state >> >> On Fri, May 31, 2019, 5:32 PM deepu srinivasan >> wrote: >> >>> Checked the data. It remains in 2708. No progress. >>> >>> On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar < >>> khiremat at redhat.com> wrote: >>> >>>> That means it could be working and the defunct process might be some >>>> old zombie one. 
Could you check, that data progress ? >>>> >>>> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan >>>> wrote: >>>> >>>>> Hi >>>>> When i change the rsync option the rsync process doesnt seem to start >>>>> . Only a defunt process is listed in ps aux. Only when i set rsync option >>>>> to " " and restart all the process the rsync process is listed in ps aux. >>>>> >>>>> >>>>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar < >>>>> khiremat at redhat.com> wrote: >>>>> >>>>>> Yes, rsync config option should have fixed this issue. >>>>>> >>>>>> Could you share the output of the following? >>>>>> >>>>>> 1. gluster volume geo-replication :: >>>>>> config rsync-options >>>>>> 2. ps -ef | grep rsync >>>>>> >>>>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan >>>>>> wrote: >>>>>> >>>>>>> Done. >>>>>>> We got the following result . >>>>>>> >>>>>>>> 1559298781.338234 write(2, "rsync: link_stat >>>>>>>> \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" >>>>>>>> failed: No such file or directory (2)", 128 >>>>>>> >>>>>>> seems like a file is missing ? >>>>>>> >>>>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath Ravishankar < >>>>>>> khiremat at redhat.com> wrote: >>>>>>> >>>>>>>> Hi, >>>>>>>> >>>>>>>> Could you take the strace with with more string size? The argument >>>>>>>> strings are truncated. >>>>>>>> >>>>>>>> strace -s 500 -ttt -T -p >>>>>>>> >>>>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan < >>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>> >>>>>>>>> Hi Kotresh >>>>>>>>> The above-mentioned work around did not work properly. >>>>>>>>> >>>>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan < >>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>> >>>>>>>>>> Hi Kotresh >>>>>>>>>> We have tried the above-mentioned rsync option and we are >>>>>>>>>> planning to have the version upgrade to 6.0. >>>>>>>>>> >>>>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh Hiremath Ravishankar < >>>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi, >>>>>>>>>>> >>>>>>>>>>> This looks like the hang because stderr buffer filled up with >>>>>>>>>>> errors messages and no one reading it. >>>>>>>>>>> I think this issue is fixed in latest releases. As a workaround, >>>>>>>>>>> you can do following and check if it works. >>>>>>>>>>> >>>>>>>>>>> Prerequisite: >>>>>>>>>>> rsync version should be > 3.1.0 >>>>>>>>>>> >>>>>>>>>>> Workaround: >>>>>>>>>>> gluster volume geo-replication >>>>>>>>>>> :: config rsync-options "--ignore-missing- >>>>>>>>>>> args" >>>>>>>>>>> >>>>>>>>>>> Thanks, >>>>>>>>>>> Kotresh HR >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu srinivasan < >>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>> >>>>>>>>>>>> Hi >>>>>>>>>>>> We were evaluating Gluster geo Replication between two DCs one >>>>>>>>>>>> is in US west and one is in US east. We took multiple trials for different >>>>>>>>>>>> file size. >>>>>>>>>>>> The Geo Replication tends to stop replicating but while >>>>>>>>>>>> checking the status it appears to be in Active state. But the slave volume >>>>>>>>>>>> did not increase in size. >>>>>>>>>>>> So we have restarted the geo-replication session and checked >>>>>>>>>>>> the status. The status was in an active state and it was in History Crawl >>>>>>>>>>>> for a long time. We have enabled the DEBUG mode in logging and checked for >>>>>>>>>>>> any error. >>>>>>>>>>>> There was around 2000 file appeared for syncing candidate. 
The >>>>>>>>>>>> Rsync process starts but the rsync did not happen in the slave volume. >>>>>>>>>>>> Every time the rsync process appears in the "ps auxxx" list but the >>>>>>>>>>>> replication did not happen in the slave end. What would be the cause of >>>>>>>>>>>> this problem? Is there anyway to debug it? >>>>>>>>>>>> >>>>>>>>>>>> We have also checked the strace of the rync program. >>>>>>>>>>>> it displays something like this >>>>>>>>>>>> >>>>>>>>>>>> "write(2, "rsync: link_stat \"/tmp/gsyncd-au"..., 128" >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> We are using the below specs >>>>>>>>>>>> >>>>>>>>>>>> Gluster version - 4.1.7 >>>>>>>>>>>> Sync mode - rsync >>>>>>>>>>>> Volume - 1x3 in each end (master and slave) >>>>>>>>>>>> Intranet Bandwidth - 10 Gig >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> Thanks and Regards, >>>>>>>>>>> Kotresh H R >>>>>>>>>>> >>>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Thanks and Regards, >>>>>>>> Kotresh H R >>>>>>>> >>>>>>> >>>>>> >>>>>> -- >>>>>> Thanks and Regards, >>>>>> Kotresh H R >>>>>> >>>>> >>>> >>>> -- >>>> Thanks and Regards, >>>> Kotresh H R >>>> >>> -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: From khiremat at redhat.com Tue Jun 4 17:49:55 2019 From: khiremat at redhat.com (Kotresh Hiremath Ravishankar) Date: Tue, 4 Jun 2019 23:19:55 +0530 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Ccing Sunny, who was investing similar issue. On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan wrote: > Have already added the path in bashrc . Still in faulty state > > On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> could you please try adding /usr/sbin to $PATH for user 'sas'? If it's >> bash, add 'export PATH=/usr/sbin:$PATH' in >> /home/sas/.bashrc >> >> On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan >> wrote: >> >>> Hi Kortesh >>> Please find the logs of the above error >>> *Master log snippet* >>> >>>> [2019-06-04 11:52:09.254731] I [resource(worker >>>> /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing >>>> SSH connection between master and slave... >>>> [2019-06-04 11:52:09.308923] D [repce(worker >>>> /home/sas/gluster/data/code-misc):196:push] RepceClient: call >>>> 89724:139652759443264:1559649129.31 __repce_version__() ... >>>> [2019-06-04 11:52:09.602792] E [syncdutils(worker >>>> /home/sas/gluster/data/code-misc):311:log_raise_exception] : >>>> connection to peer is broken >>>> [2019-06-04 11:52:09.603312] E [syncdutils(worker >>>> /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error >>>> cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i >>>> /var/lib/ glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S >>>> /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock >>>> sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas@ >>>> 192.168.185.107::code-misc --master-node 192.168.185.106 >>>> --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick >>>> /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node- >>>> id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 >>>> --slave-log-level DEBUG --slave-gluster-log-level INFO >>>> --slave-gluster-command-dir /usr/sbin error=1 >>>> [2019-06-04 11:52:09.614996] I [repce(agent >>>> /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating >>>> on reaching EOF. 
>>>> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor: >>>> worker(/home/sas/gluster/data/code-misc) connected >>>> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor: >>>> worker died in startup phase brick=/home/sas/gluster/data/code-misc >>>> [2019-06-04 11:52:09.619391] I >>>> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status >>>> Change status=Faulty >>>> >>> >>> *Slave log snippet* >>> >>>> [2019-06-04 11:50:09.782668] E [syncdutils(slave >>>> 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen: >>>> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) >>>> [2019-06-04 11:50:11.188167] W [gsyncd(slave >>>> 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : >>>> Session config file not exists, using the default config >>>> path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf >>>> [2019-06-04 11:50:11.201070] I [resource(slave >>>> 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] >>>> GLUSTER: Mounting gluster volume locally... >>>> [2019-06-04 11:50:11.271231] E [resource(slave >>>> 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] >>>> MountbrokerMounter: glusterd answered mnt= >>>> [2019-06-04 11:50:11.271998] E [syncdutils(slave >>>> 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen: >>>> command returned error cmd=/usr/sbin/gluster --remote-host=localhost >>>> system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO >>>> log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log >>>> volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1 >>>> [2019-06-04 11:50:11.272113] E [syncdutils(slave >>>> 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen: >>>> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) >>> >>> >>> On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan >>> wrote: >>> >>>> Hi >>>> As discussed I have upgraded gluster from 4.1 to 6.2 version. But the >>>> Geo replication failed to start. >>>> Stays in faulty state >>>> >>>> On Fri, May 31, 2019, 5:32 PM deepu srinivasan >>>> wrote: >>>> >>>>> Checked the data. It remains in 2708. No progress. >>>>> >>>>> On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar < >>>>> khiremat at redhat.com> wrote: >>>>> >>>>>> That means it could be working and the defunct process might be some >>>>>> old zombie one. Could you check, that data progress ? >>>>>> >>>>>> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan >>>>>> wrote: >>>>>> >>>>>>> Hi >>>>>>> When i change the rsync option the rsync process doesnt seem to >>>>>>> start . Only a defunt process is listed in ps aux. Only when i set rsync >>>>>>> option to " " and restart all the process the rsync process is listed in ps >>>>>>> aux. >>>>>>> >>>>>>> >>>>>>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar < >>>>>>> khiremat at redhat.com> wrote: >>>>>>> >>>>>>>> Yes, rsync config option should have fixed this issue. >>>>>>>> >>>>>>>> Could you share the output of the following? >>>>>>>> >>>>>>>> 1. gluster volume geo-replication >>>>>>>> :: config rsync-options >>>>>>>> 2. ps -ef | grep rsync >>>>>>>> >>>>>>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan < >>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>> >>>>>>>>> Done. >>>>>>>>> We got the following result . 
>>>>>>>>> >>>>>>>>>> 1559298781.338234 write(2, "rsync: link_stat >>>>>>>>>> \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" >>>>>>>>>> failed: No such file or directory (2)", 128 >>>>>>>>> >>>>>>>>> seems like a file is missing ? >>>>>>>>> >>>>>>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath Ravishankar < >>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>> >>>>>>>>>> Hi, >>>>>>>>>> >>>>>>>>>> Could you take the strace with with more string size? The >>>>>>>>>> argument strings are truncated. >>>>>>>>>> >>>>>>>>>> strace -s 500 -ttt -T -p >>>>>>>>>> >>>>>>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan < >>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi Kotresh >>>>>>>>>>> The above-mentioned work around did not work properly. >>>>>>>>>>> >>>>>>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan < >>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>> >>>>>>>>>>>> Hi Kotresh >>>>>>>>>>>> We have tried the above-mentioned rsync option and we are >>>>>>>>>>>> planning to have the version upgrade to 6.0. >>>>>>>>>>>> >>>>>>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh Hiremath Ravishankar < >>>>>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> Hi, >>>>>>>>>>>>> >>>>>>>>>>>>> This looks like the hang because stderr buffer filled up with >>>>>>>>>>>>> errors messages and no one reading it. >>>>>>>>>>>>> I think this issue is fixed in latest releases. As a >>>>>>>>>>>>> workaround, you can do following and check if it works. >>>>>>>>>>>>> >>>>>>>>>>>>> Prerequisite: >>>>>>>>>>>>> rsync version should be > 3.1.0 >>>>>>>>>>>>> >>>>>>>>>>>>> Workaround: >>>>>>>>>>>>> gluster volume geo-replication >>>>>>>>>>>>> :: config rsync-options "--ignore-missing >>>>>>>>>>>>> -args" >>>>>>>>>>>>> >>>>>>>>>>>>> Thanks, >>>>>>>>>>>>> Kotresh HR >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu srinivasan < >>>>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> Hi >>>>>>>>>>>>>> We were evaluating Gluster geo Replication between two DCs >>>>>>>>>>>>>> one is in US west and one is in US east. We took multiple trials for >>>>>>>>>>>>>> different file size. >>>>>>>>>>>>>> The Geo Replication tends to stop replicating but while >>>>>>>>>>>>>> checking the status it appears to be in Active state. But the slave volume >>>>>>>>>>>>>> did not increase in size. >>>>>>>>>>>>>> So we have restarted the geo-replication session and checked >>>>>>>>>>>>>> the status. The status was in an active state and it was in History Crawl >>>>>>>>>>>>>> for a long time. We have enabled the DEBUG mode in logging and checked for >>>>>>>>>>>>>> any error. >>>>>>>>>>>>>> There was around 2000 file appeared for syncing candidate. >>>>>>>>>>>>>> The Rsync process starts but the rsync did not happen in the slave volume. >>>>>>>>>>>>>> Every time the rsync process appears in the "ps auxxx" list but the >>>>>>>>>>>>>> replication did not happen in the slave end. What would be the cause of >>>>>>>>>>>>>> this problem? Is there anyway to debug it? >>>>>>>>>>>>>> >>>>>>>>>>>>>> We have also checked the strace of the rync program. 
>>>>>>>>>>>>>> it displays something like this >>>>>>>>>>>>>> >>>>>>>>>>>>>> "write(2, "rsync: link_stat \"/tmp/gsyncd-au"..., 128" >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> We are using the below specs >>>>>>>>>>>>>> >>>>>>>>>>>>>> Gluster version - 4.1.7 >>>>>>>>>>>>>> Sync mode - rsync >>>>>>>>>>>>>> Volume - 1x3 in each end (master and slave) >>>>>>>>>>>>>> Intranet Bandwidth - 10 Gig >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> -- >>>>>>>>>>>>> Thanks and Regards, >>>>>>>>>>>>> Kotresh H R >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> Thanks and Regards, >>>>>>>>>> Kotresh H R >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Thanks and Regards, >>>>>>>> Kotresh H R >>>>>>>> >>>>>>> >>>>>> >>>>>> -- >>>>>> Thanks and Regards, >>>>>> Kotresh H R >>>>>> >>>>> >> >> -- >> Thanks and Regards, >> Kotresh H R >> > -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.orth at gmail.com Tue Jun 4 22:08:34 2019 From: alan.orth at gmail.com (Alan Orth) Date: Wed, 5 Jun 2019 01:08:34 +0300 Subject: [Gluster-users] Does replace-brick migrate data? In-Reply-To: <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com> References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com> Message-ID: Hi Ravi, You're right that I had mentioned using rsync to copy the brick content to a new host, but in the end I actually decided not to bring it up on a new brick. Instead I added the original brick back into the volume. So the xattrs and symlinks to .glusterfs on the original brick are fine. I think the problem probably lies with a remove-brick that got interrupted. A few weeks ago during the maintenance I had tried to remove a brick and then after twenty minutes and no obvious progress I stopped it?after that the bricks were still part of the volume. In the last few days I have run a fix-layout that took 26 hours and finished successfully. Then I started a full index heal and it has healed about 3.3 million files in a few days and I see a clear increase of network traffic from old brick host to new brick host over that time. Once the full index heal completes I will try to do a rebalance. Thank you, On Mon, Jun 3, 2019 at 7:40 PM Ravishankar N wrote: > > On 01/06/19 9:37 PM, Alan Orth wrote: > > Dear Ravi, > > The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I could > verify them for six bricks and millions of files, though... :\ > > Hi Alan, > > The reason I asked this is because you had mentioned in one of your > earlier emails that when you moved content from the old brick to the new > one, you had skipped the .glusterfs directory. So I was assuming that when > you added back this new brick to the cluster, it might have been missing > the .glusterfs entries. If that is the cae, one way to verify could be to > check using a script if all files on the brick have a link-count of at > least 2 and all dirs have valid symlinks inside .glusterfs pointing to > themselves. > > > I had a small success in fixing some issues with duplicated files on the > FUSE mount point yesterday. 
I read quite a bit about the elastic hashing > algorithm that determines which files get placed on which bricks based on > the hash of their filename and the trusted.glusterfs.dht xattr on brick > directories (thanks to Joe Julian's blog post and Python script for showing > how it works?). With that knowledge I looked closer at one of the files > that was appearing as duplicated on the FUSE mount and found that it was > also duplicated on more than `replica 2` bricks. For this particular file I > found two "real" files and several zero-size files with > trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were on > the correct brick as far as the DHT layout is concerned, so I copied one of > them to the correct brick, deleted the others and their hard links, and did > a `stat` on the file from the FUSE mount point and it fixed itself. Yay! > > Could this have been caused by a replace-brick that got interrupted and > didn't finish re-labeling the xattrs? > > No, replace-brick only initiates AFR self-heal, which just copies the > contents from the other brick(s) of the *same* replica pair into the > replaced brick. The link-to files are created by DHT when you rename a > file from the client. If the new name hashes to a different brick, DHT > does not move the entire file there. It instead creates the link-to file > (the one with the dht.linkto xattrs) on the hashed subvol. The value of > this xattr points to the brick where the actual data is there (`getfattr -e > text` to see it for yourself). Perhaps you had attempted a rebalance or > remove-brick earlier and interrupted that? > > Should I be thinking of some heuristics to identify and fix these issues > with a script (incorrect brick placement), or is this something a fix > layout or repeated volume heals can fix? I've already completed a whole > heal on this particular volume this week and it did heal about 1,000,000 > files (mostly data and metadata, but about 20,000 entry heals as well). > > Maybe you should let the AFR self-heals complete first and then attempt a > full rebalance to take care of the dht link-to files. But if the files are > in millions, it could take quite some time to complete. > Regards, > Ravi > > Thanks for your support, > > ? https://joejulian.name/post/dht-misses-are-expensive/ > > On Fri, May 31, 2019 at 7:57 AM Ravishankar N > wrote: > >> >> On 31/05/19 3:20 AM, Alan Orth wrote: >> >> Dear Ravi, >> >> I spent a bit of time inspecting the xattrs on some files and directories >> on a few bricks for this volume and it looks a bit messy. Even if I could >> make sense of it for a few and potentially heal them manually, there are >> millions of files and directories in total so that's definitely not a >> scalable solution. After a few missteps with `replace-brick ... commit >> force` in the last week?one of which on a brick that was dead/offline?as >> well as some premature `remove-brick` commands, I'm unsure how how to >> proceed and I'm getting demotivated. It's scary how quickly things get out >> of hand in distributed systems... >> >> Hi Alan, >> The one good thing about gluster is it that the data is always available >> directly on the backed bricks even if your volume has inconsistencies at >> the gluster level. So theoretically, if your cluster is FUBAR, you could >> just create a new volume and copy all data onto it via its mount from the >> old volume's bricks. 
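In rough strokes that recovery could look like the following sketch (the volume name, hosts and brick paths here are made up, and it is untested):

# create and start a fresh volume, then copy into it via its mount
gluster volume create newvol replica 3 host1:/bricks/newvol host2:/bricks/newvol host3:/bricks/newvol
gluster volume start newvol
mount -t glusterfs localhost:/newvol /mnt/newvol
# copy from ONE brick of each replica set so files are not duplicated,
# and skip the internal .glusterfs directory; gluster's own trusted.*
# xattrs must not be carried over to the new volume
rsync -aH --exclude=.glusterfs /mnt/gluster/apps/ /mnt/newvol/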
>> >> >> I had hoped that bringing the old brick back up would help, but by the >> time I added it again a few days had passed and all the brick-id's had >> changed due to the replace/remove brick commands, not to mention that the >> trusted.afr.$volume-client-xx values were now probably pointing to the >> wrong bricks (?). >> >> Anyways, a few hours ago I started a full heal on the volume and I see >> that there is a sustained 100MiB/sec of network traffic going from the old >> brick's host to the new one. The completed heals reported in the logs look >> promising too: >> >> Old brick host: >> >> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >> 281614 Completed data selfheal >> 84 Completed entry selfheal >> 299648 Completed metadata selfheal >> >> New brick host: >> >> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >> 198256 Completed data selfheal >> 16829 Completed entry selfheal >> 229664 Completed metadata selfheal >> >> So that's good I guess, though I have no idea how long it will take or if >> it will fix the "missing files" issue on the FUSE mount. I've increased >> cluster.shd-max-threads to 8 to hopefully speed up the heal process. >> >> The afr xattrs should not cause files to disappear from mount. If the >> xattr names do not match what each AFR subvol expects (for eg. in a replica >> 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd >> subvol and so on - ) for its children then it won't heal the data, that is >> all. But in your case I see some inconsistencies like one brick having the >> actual file (licenseserver.cfg) and the other having a linkto file (the >> one with the dht.linkto xattr) *in the same replica pair*. >> >> >> I'd be happy for any advice or pointers, >> >> Did you check if the .glusterfs hardlinks/symlinks exist and are in order >> for all bricks? >> >> -Ravi >> >> >> On Wed, May 29, 2019 at 5:20 PM Alan Orth wrote: >> >>> Dear Ravi, >>> >>> Thank you for the link to the blog post series?it is very informative >>> and current! If I understand your blog post correctly then I think the >>> answer to your previous question about pending AFRs is: no, there are no >>> pending AFRs. I have identified one file that is a good test case to try to >>> understand what happened after I issued the `gluster volume replace-brick >>> ... commit force` a few days ago and then added the same original brick >>> back to the volume later. 
This is the current state of the replica 2 >>> distribute/replicate volume: >>> >>> [root at wingu0 ~]# gluster volume info apps >>> >>> Volume Name: apps >>> Type: Distributed-Replicate >>> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >>> Status: Started >>> Snapshot Count: 0 >>> Number of Bricks: 3 x 2 = 6 >>> Transport-type: tcp >>> Bricks: >>> Brick1: wingu3:/mnt/gluster/apps >>> Brick2: wingu4:/mnt/gluster/apps >>> Brick3: wingu05:/data/glusterfs/sdb/apps >>> Brick4: wingu06:/data/glusterfs/sdb/apps >>> Brick5: wingu0:/mnt/gluster/apps >>> Brick6: wingu05:/data/glusterfs/sdc/apps >>> Options Reconfigured: >>> diagnostics.client-log-level: DEBUG >>> storage.health-check-interval: 10 >>> nfs.disable: on >>> >>> I checked the xattrs of one file that is missing from the volume's FUSE >>> mount (though I can read it if I access its full path explicitly), but is >>> present in several of the volume's bricks (some with full size, others >>> empty): >>> >>> [root at wingu0 ~]# getfattr -d -m. -e hex >>> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> >>> getfattr: Removing leading '/' from absolute path names >>> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>> trusted.afr.apps-client-3=0x000000000000000000000000 >>> trusted.afr.apps-client-5=0x000000000000000000000000 >>> trusted.afr.dirty=0x000000000000000000000000 >>> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>> >>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> getfattr: Removing leading '/' from absolute path names >>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>> >>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> getfattr: Removing leading '/' from absolute path names >>> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>> >>> [root at wingu06 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> getfattr: Removing leading '/' from absolute path names >>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>> >>> According to the trusted.afr.apps-client-xx xattrs this particular file >>> should be on bricks with id "apps-client-3" and "apps-client-5". 
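To compare all six copies side by side, a loop along these lines (an untested sketch; it assumes passwordless SSH between the nodes) gathers the same xattrs from every brick in one pass:

for brick in wingu3:/mnt/gluster/apps wingu4:/mnt/gluster/apps \
    wingu05:/data/glusterfs/sdb/apps wingu06:/data/glusterfs/sdb/apps \
    wingu0:/mnt/gluster/apps wingu05:/data/glusterfs/sdc/apps; do
  host=${brick%%:*} path=${brick#*:}
  echo "== $brick =="
  ssh "$host" getfattr -d -m. -e hex "$path/clcgenomics/clclicsrv/licenseserver.cfg"
done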
It took me >>> a few hours to realize that the brick-id values are recorded in the >>> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >>> those brick-id values with a volfile backup from before the replace-brick, >>> I realized that the files are simply on the wrong brick now as far as >>> Gluster is concerned. This particular file is now on the brick for >>> "apps-client-4". As an experiment I copied this one file to the two >>> bricks listed in the xattrs and I was then able to see the file from the >>> FUSE mount (yay!). >>> >>> Other than replacing the brick, removing it, and then adding the old >>> brick on the original server back, there has been no change in the data >>> this entire time. Can I change the brick IDs in the volfiles so they >>> reflect where the data actually is? Or perhaps script something to reset >>> all the xattrs on the files/directories to point to the correct bricks? >>> >>> Thank you for any help or pointers, >>> >>> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >>> wrote: >>> >>>> >>>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>>> >>>> >>>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>> >>>> Dear Ravishankar, >>>> >>>> I'm not sure if Brick4 had pending AFRs because I don't know what that >>>> means and it's been a few days so I am not sure I would be able to find >>>> that information. >>>> >>>> When you find some time, have a look at a blog >>>> series I wrote about AFR- I've tried to explain what one needs to know to >>>> debug replication related issues in it. >>>> >>>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>>> >>>> -Ravi >>>> >>>> >>>> Anyways, after wasting a few days rsyncing the old brick to a new host >>>> I decided to just try to add the old brick back into the volume instead of >>>> bringing it up on the new host. I created a new brick directory on the old >>>> host, moved the old brick's contents into that new directory (minus the >>>> .glusterfs directory), added the new brick to the volume, and then did >>>> Vlad's find/stat trick? from the brick to the FUSE mount point. >>>> >>>> The interesting problem I have now is that some files don't appear in >>>> the FUSE mount's directory listings, but I can actually list them directly >>>> and even read them. What could cause that? >>>> >>>> Not sure, too many variables in the hacks that you did to take a guess. >>>> You can check if the contents of the .glusterfs folder are in order on the >>>> new brick (example hardlink for files and symlinks for directories are >>>> present etc.) . >>>> Regards, >>>> Ravi >>>> >>>> >>>> Thanks, >>>> >>>> ? >>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>> >>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N >>>> wrote: >>>> >>>>> >>>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>> >>>>> Dear list, >>>>> >>>>> I seem to have gotten into a tricky situation. Today I brought up a >>>>> shiny new server with new disk arrays and attempted to replace one brick of >>>>> a replica 2 distribute/replicate volume on an older server using the >>>>> `replace-brick` command: >>>>> >>>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>> >>>>> The command was successful and I see the new brick in the output of >>>>> `gluster volume info`. The problem is that Gluster doesn't seem to be >>>>> migrating the data, >>>>> >>>>> `replace-brick` definitely must heal (not migrate) the data. 
In your >>>>> case, data must have been healed from Brick-4 to the replaced Brick-3. Are >>>>> there any errors in the self-heal daemon logs of Brick-4's node? Does >>>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>>> date. replace-brick command internally does all the setfattr steps that are >>>>> mentioned in the doc. >>>>> >>>>> -Ravi >>>>> >>>>> >>>>> and now the original brick that I replaced is no longer part of the >>>>> volume (and a few terabytes of data are just sitting on the old brick): >>>>> >>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>> Brick1: wingu4:/mnt/gluster/homes >>>>> Brick2: wingu3:/mnt/gluster/homes >>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>> >>>>> I see the Gluster docs have a more complicated procedure for replacing >>>>> bricks that involves getfattr/setfattr?. How can I tell Gluster about the >>>>> old brick? I see that I have a backup of the old volfile thanks to yum's >>>>> rpmsave function if that helps. >>>>> >>>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you can >>>>> give. >>>>> >>>>> ? >>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>> Nietzsche >>>>> >>>>> _______________________________________________ >>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>>> >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>>> >>>> >>>> _______________________________________________ >>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>> >>> -- >>> Alan Orth >>> alan.orth at gmail.com >>> https://picturingjordan.com >>> https://englishbulgaria.net >>> https://mjanja.ch >>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>> >> >> >> -- >> Alan Orth >> alan.orth at gmail.com >> https://picturingjordan.com >> https://englishbulgaria.net >> https://mjanja.ch >> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >> >> > > -- > Alan Orth > alan.orth at gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > "In heaven all the interesting people are missing." ?Friedrich Nietzsche > > -- Alan Orth alan.orth at gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch "In heaven all the interesting people are missing." ?Friedrich Nietzsche -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Wed Jun 5 07:00:16 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Wed, 5 Jun 2019 12:30:16 +0530 Subject: [Gluster-users] Update: GlusterFS code coverage Message-ID: All, I just wanted to update everyone about one of the initiatives we have undertaken, ie, increasing the overall code coverage of GlusterFS above 70%. 
You can have a look at the current code coverage here: https://build.gluster.org/job/line-coverage/lastCompletedBuild/Line_20Coverage_20Report/ (this always shows the latest run). The daily job and its details are captured @ https://build.gluster.org/job/line-coverage/

When we started focusing on code coverage 3 months back, our code coverage was around 60% overall. We set the ambitious goal of increasing the code coverage by 10% before the glusterfs-7.0 release, and I am happy to announce that we met this goal before the branching.

Before talking about the next goals, I want to thank and call out a few developers who made this happen.

* Xavier Hernandez - Made EC cross 90% from < 70%.
* Glusterd Team (Sanju, Rishub, Mohit, Atin) - Increased CLI/glusterd coverage.
* Geo-Rep Team (Kotresh, Sunny, Shwetha, Aravinda).
* Sheetal - Helped to increase the glfs-api test cases, which indirectly helped cover more code across the board.

Also note that some components, like AFR/replicate, were already at 80%+ before we started these efforts.

Now, our next goal is to make sure we have above 80% function coverage in all of the top-level components shown. Once that is done, we will focus on 75% code coverage across all components (i.e., no 'Red' in the top-level page).

While it was possible to meet our goal of increasing the overall code coverage from 60% to 70%, increasing it above 70% is not going to be easy, mainly because it involves adding more tests for negative test cases, and adding tests with different options (currently >300 of them across the codebase). We also need to look at the details from the code coverage tests, and reverse engineer how to write a test that hits a particular line in the code.

I personally invite everyone who is interested in contributing to the gluster project to get involved in this effort. Help us write test cases, and suggest how to improve them. Help by assigning interns to write them for us (if your team has some). This is a good way to understand the glusterfs code too. We are happy to organize sessions on how to walk through the code, etc., if required.

Happy to hear feedback and see more contribution in this area.

Regards,
Amar

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From emayoral at arsys.es Wed Jun 5 09:27:16 2019
From: emayoral at arsys.es (Eduardo Mayoral)
Date: Wed, 5 Jun 2019 11:27:16 +0200
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
Message-ID: 

Hi,

I am looking into a new gluster deployment to replace an ancient one.

For this deployment I will be using some repurposed servers I already have in stock. The disk specs are 12 * 3 TB SATA disks. No HW RAID controller. They also have some SSDs which would be nice to leverage as cache or similar to improve performance, since they are already there. Advice on how to leverage the SSDs would be greatly appreciated.

One of the design choices I have to make is using 3 nodes for a replica-3 with JBOD, or using 2 nodes with a replica-2 and using SW RAID 6 for the disks, maybe adding a 3rd node with a smaller amount of disk as a metadata node for the replica set. I would love to hear advice on the pros and cons of each setup from the gluster experts.

The data will be accessed from 4 to 6 systems with native gluster, not sure if that makes any difference.

The amount of data I have to store there is currently 20 TB, with moderate growth. iops are quite low, so high performance is not an issue. The data will fit in either of the two setups.

Thanks in advance for your advice!
--
Eduardo Mayoral Jimeno
Systems engineer, platform department. Arsys Internet.
emayoral at arsys.es - +34 941 620 105 - ext 2153

From hunter86_bg at yahoo.com Wed Jun 5 12:15:34 2019
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Wed, 5 Jun 2019 12:15:34 +0000 (UTC)
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
In-Reply-To: 
References: 
Message-ID: <1735787204.221988.1559736934501@mail.yahoo.com>

Hi Eduardo,

> I am looking into a new gluster deployment to replace an ancient one.
> For this deployment I will be using some repurposed servers I already
> have in stock. The disk specs are 12 * 3 TB SATA disks. No HW RAID
> controller. They also have some SSDs which would be nice to leverage
> as cache or similar to improve performance, since they are already there.
> Advice on how to leverage the SSDs would be greatly appreciated.

Gluster tiering was dropped in favour of LVM cache. Keep in mind that in RHEL/CentOS 7 you should be careful: the migration_threshold value is sometimes smaller than the chunk size. For details check: https://bugzilla.redhat.com/show_bug.cgi?id=1668163

> One of the design choices I have to make is using 3 nodes for a
> replica-3 with JBOD, or using 2 nodes with a replica-2 and using SW RAID
> 6 for the disks, maybe adding a 3rd node with a smaller amount of disk
> as a metadata node for the replica set. I would love to hear advice on the
> pros and cons of each setup from the gluster experts.

If you go with replica 3 - your reads will be from 3 servers - thus higher speeds.
If you choose replica 2 - you will eventually enter a split brain (not a good one).
If you choose replica 2 arbiter 1 (the old replica 3 arbiter 1) - you will read from only 2 servers, but save bandwidth.

Keep in mind that you need high-bandwidth NICs (as bonding/teaming balances based on MAC, IP and port, which in your case will all be the same). Another option is to use GlusterD2 with replica 2 and a remote arbiter (for example in the cloud or somewhere away). This setup does not require the arbiter to respond in a timely manner and it is used only if 1 data brick is down.

> The data will be accessed from 4 to 6 systems with native gluster,
> not sure if that makes any difference.
> The amount of data I have to store there is currently 20 TB, with
> moderate growth. iops are quite low, so high performance is not an issue.
> The data will fit in either of the two setups.

I would go with replica 3 if the NICs are 10 Gbit/s or bigger, and replica 2 arbiter 1 if the NICs are smaller. GlusterD2 is still new and might be too risky for production (Gluster devs can correct me here).

My current setup is Gluster v6.1 on oVirt in a replica 2 arbiter 1 with 6 x 1 Gbit/s NIC ports (consumer grade), and in order to overcome the load-balancing issue I'm using multiple thin LVs on top of a single NVMe - each LV is a gluster brick. Each gluster volume has a separate TCP port, and thus the teaming device load-balances traffic onto another NIC. This allows me to stripe my data at the VM level, but this setup is only OK for labs.

Best Regards,
Strahil Nikolov

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From nbalacha at redhat.com Wed Jun 5 13:52:56 2019
From: nbalacha at redhat.com (Nithya Balachandran)
Date: Wed, 5 Jun 2019 19:22:56 +0530
Subject: [Gluster-users] Memory leak in glusterfs
In-Reply-To: 
References: 
Message-ID: 

Hi,

Writing to a volume should not affect glusterd.
The stack you have shown in the valgrind log looks like the memory used to initialise the structures glusterd uses; it will be freed only when glusterd is stopped. Can you provide more details about what it is you are trying to test?

Regards,
Nithya

On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL wrote:

> Hi Team,
>
> Please respond to the issue I raised.
>
> Regards,
> Abhishek
>
> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL wrote:
>
>> Anyone please reply....
>>
>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL wrote:
>>
>>> Hi Team,
>>>
>>> I uploaded some valgrind logs from my gluster 5.4 setup. This setup is
>>> writing to the volume every 15 minutes. I stopped glusterd and then copied
>>> away the logs. The test was running for some simulated days. They are
>>> zipped in valgrind-54.zip.
>>>
>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in
>>> glusterfs and even some definitely lost bytes.
>>>
>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record 391 of 391
>>> ==2737== at 0x4C29C25: calloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
>>> ==2737== by 0xA22485E: ??? (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0xA217C94: ??? (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0xA21D9F8: ??? (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0xA21DED9: ??? (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0xA21E685: ??? (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0xA1B9D8C: init (in /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1)
>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1)
>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in /usr/lib64/libglusterfs.so.0.0.1)
>>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd)
>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd)
>>> ==2737==
>>> ==2737== LEAK SUMMARY:
>>> ==2737== definitely lost: 1,053 bytes in 10 blocks
>>> ==2737== indirectly lost: 317 bytes in 3 blocks
>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks
>>> ==2737== still reachable: 53,277 bytes in 201 blocks
>>> ==2737== suppressed: 0 bytes in 0 blocks
>>>
>>> --
>>> Regards
>>> Abhishek Paliwal

--
Regards
Abhishek Paliwal

_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From khiremat at redhat.com Thu Jun 6 04:58:43 2019
From: khiremat at redhat.com (Kotresh Hiremath Ravishankar)
Date: Thu, 6 Jun 2019 10:28:43 +0530
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Hi,

I think the steps to set up non-root geo-rep were not followed properly. The following entry, which is required, is missing in the glusterd vol file:

The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]

Could you please follow the steps from the link below?
https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave And let us know if you still face the issue. On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan wrote: > Hi Kotresh, Sunny > I Have mailed the logs I found in one of the slave machines. Is there > anything to do with permission? Please help. > > On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan > wrote: > >> Hi Kotresh, Sunny >> Found this log in the slave machine. >> >>> [2019-06-05 08:49:10.632583] I [MSGID: 106488] >>> [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: >>> Received get vol req >>> >>> The message "I [MSGID: 106488] >>> [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: >>> Received get vol req" repeated 2 times between [2019-06-05 08:49:10.632583] >>> and [2019-06-05 08:49:10.670863] >>> >>> The message "I [MSGID: 106496] >>> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received >>> mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and >>> [2019-06-05 08:50:37.254063] >>> >>> The message "E [MSGID: 106061] >>> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option >>> mountbroker-root' missing in glusterd vol file" repeated 34 times between >>> [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079] >>> >>> The message "W [MSGID: 106176] >>> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful >>> mount request [No such file or directory]" repeated 34 times between >>> [2019-06-05 08:48:41.005444] and [2019-06-05 08:50:37.254080] >>> >>> [2019-06-05 08:50:46.361347] I [MSGID: 106496] >>> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received >>> mount req >>> >>> [2019-06-05 08:50:46.361384] E [MSGID: 106061] >>> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option >>> mountbroker-root' missing in glusterd vol file >>> >>> [2019-06-05 08:50:46.361419] W [MSGID: 106176] >>> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful >>> mount request [No such file or directory] >>> >>> The message "I [MSGID: 106496] >>> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received >>> mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and >>> [2019-06-05 08:52:34.019741] >>> >>> The message "E [MSGID: 106061] >>> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option >>> mountbroker-root' missing in glusterd vol file" repeated 33 times between >>> [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757] >>> >>> The message "W [MSGID: 106176] >>> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful >>> mount request [No such file or directory]" repeated 33 times between >>> [2019-06-05 08:50:46.361419] and [2019-06-05 08:52:34.019758] >>> >>> [2019-06-05 08:52:44.426839] I [MSGID: 106496] >>> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received >>> mount req >>> >>> [2019-06-05 08:52:44.426886] E [MSGID: 106061] >>> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option >>> mountbroker-root' missing in glusterd vol file >>> >>> [2019-06-05 08:52:44.426896] W [MSGID: 106176] >>> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful >>> mount request [No such file or directory] >>> >> >> On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan >> wrote: >> >>> Thankyou Kotresh >>> >>> On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath 
Ravishankar < >>> khiremat at redhat.com> wrote: >>> >>>> Ccing Sunny, who was investing similar issue. >>>> >>>> On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan >>>> wrote: >>>> >>>>> Have already added the path in bashrc . Still in faulty state >>>>> >>>>> On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar < >>>>> khiremat at redhat.com> wrote: >>>>> >>>>>> could you please try adding /usr/sbin to $PATH for user 'sas'? If >>>>>> it's bash, add 'export PATH=/usr/sbin:$PATH' in >>>>>> /home/sas/.bashrc >>>>>> >>>>>> On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan >>>>>> wrote: >>>>>> >>>>>>> Hi Kortesh >>>>>>> Please find the logs of the above error >>>>>>> *Master log snippet* >>>>>>> >>>>>>>> [2019-06-04 11:52:09.254731] I [resource(worker >>>>>>>> /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing >>>>>>>> SSH connection between master and slave... >>>>>>>> [2019-06-04 11:52:09.308923] D [repce(worker >>>>>>>> /home/sas/gluster/data/code-misc):196:push] RepceClient: call >>>>>>>> 89724:139652759443264:1559649129.31 __repce_version__() ... >>>>>>>> [2019-06-04 11:52:09.602792] E [syncdutils(worker >>>>>>>> /home/sas/gluster/data/code-misc):311:log_raise_exception] : >>>>>>>> connection to peer is broken >>>>>>>> [2019-06-04 11:52:09.603312] E [syncdutils(worker >>>>>>>> /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error >>>>>>>> cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i >>>>>>>> /var/lib/ glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S >>>>>>>> /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock >>>>>>>> sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc >>>>>>>> sas@ 192.168.185.107::code-misc --master-node 192.168.185.106 >>>>>>>> --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick >>>>>>>> /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node- >>>>>>>> id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 >>>>>>>> --slave-log-level DEBUG --slave-gluster-log-level INFO >>>>>>>> --slave-gluster-command-dir /usr/sbin error=1 >>>>>>>> [2019-06-04 11:52:09.614996] I [repce(agent >>>>>>>> /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating >>>>>>>> on reaching EOF. >>>>>>>> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] >>>>>>>> Monitor: worker(/home/sas/gluster/data/code-misc) connected >>>>>>>> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] >>>>>>>> Monitor: worker died in startup phase brick=/home/sas/gluster/data/code-misc >>>>>>>> [2019-06-04 11:52:09.619391] I >>>>>>>> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status >>>>>>>> Change status=Faulty >>>>>>>> >>>>>>> >>>>>>> *Slave log snippet* >>>>>>> >>>>>>>> [2019-06-04 11:50:09.782668] E [syncdutils(slave >>>>>>>> 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] >>>>>>>> Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or >>>>>>>> directory) >>>>>>>> [2019-06-04 11:50:11.188167] W [gsyncd(slave >>>>>>>> 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : >>>>>>>> Session config file not exists, using the default config >>>>>>>> path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf >>>>>>>> [2019-06-04 11:50:11.201070] I [resource(slave >>>>>>>> 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] >>>>>>>> GLUSTER: Mounting gluster volume locally... 
>>>>>>>> [2019-06-04 11:50:11.271231] E [resource(slave >>>>>>>> 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] >>>>>>>> MountbrokerMounter: glusterd answered mnt= >>>>>>>> [2019-06-04 11:50:11.271998] E [syncdutils(slave >>>>>>>> 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] >>>>>>>> Popen: command returned error cmd=/usr/sbin/gluster --remote-host=localhost >>>>>>>> system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO >>>>>>>> log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log >>>>>>>> volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1 >>>>>>>> [2019-06-04 11:50:11.272113] E [syncdutils(slave >>>>>>>> 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] >>>>>>>> Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or >>>>>>>> directory) >>>>>>> >>>>>>> >>>>>>> On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan >>>>>>> wrote: >>>>>>> >>>>>>>> Hi >>>>>>>> As discussed I have upgraded gluster from 4.1 to 6.2 version. But >>>>>>>> the Geo replication failed to start. >>>>>>>> Stays in faulty state >>>>>>>> >>>>>>>> On Fri, May 31, 2019, 5:32 PM deepu srinivasan >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Checked the data. It remains in 2708. No progress. >>>>>>>>> >>>>>>>>> On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar < >>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>> >>>>>>>>>> That means it could be working and the defunct process might be >>>>>>>>>> some old zombie one. Could you check, that data progress ? >>>>>>>>>> >>>>>>>>>> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan < >>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi >>>>>>>>>>> When i change the rsync option the rsync process doesnt seem to >>>>>>>>>>> start . Only a defunt process is listed in ps aux. Only when i set rsync >>>>>>>>>>> option to " " and restart all the process the rsync process is listed in ps >>>>>>>>>>> aux. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar < >>>>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>>>> >>>>>>>>>>>> Yes, rsync config option should have fixed this issue. >>>>>>>>>>>> >>>>>>>>>>>> Could you share the output of the following? >>>>>>>>>>>> >>>>>>>>>>>> 1. gluster volume geo-replication >>>>>>>>>>>> :: config rsync-options >>>>>>>>>>>> 2. ps -ef | grep rsync >>>>>>>>>>>> >>>>>>>>>>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan < >>>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> Done. >>>>>>>>>>>>> We got the following result . >>>>>>>>>>>>> >>>>>>>>>>>>>> 1559298781.338234 write(2, "rsync: link_stat >>>>>>>>>>>>>> \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" >>>>>>>>>>>>>> failed: No such file or directory (2)", 128 >>>>>>>>>>>>> >>>>>>>>>>>>> seems like a file is missing ? >>>>>>>>>>>>> >>>>>>>>>>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath Ravishankar < >>>>>>>>>>>>> khiremat at redhat.com> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>> >>>>>>>>>>>>>> Could you take the strace with with more string size? The >>>>>>>>>>>>>> argument strings are truncated. >>>>>>>>>>>>>> >>>>>>>>>>>>>> strace -s 500 -ttt -T -p >>>>>>>>>>>>>> >>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan < >>>>>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Hi Kotresh >>>>>>>>>>>>>>> The above-mentioned work around did not work properly. 
>>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan < >>>>>>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Hi Kotresh >>>>>>>>>>>>>>>> We have tried the above-mentioned rsync option and we are >>>>>>>>>>>>>>>> planning to have the version upgrade to 6.0. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh Hiremath >>>>>>>>>>>>>>>> Ravishankar wrote: >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> This looks like the hang because stderr buffer filled up >>>>>>>>>>>>>>>>> with errors messages and no one reading it. >>>>>>>>>>>>>>>>> I think this issue is fixed in latest releases. As a >>>>>>>>>>>>>>>>> workaround, you can do following and check if it works. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Prerequisite: >>>>>>>>>>>>>>>>> rsync version should be > 3.1.0 >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Workaround: >>>>>>>>>>>>>>>>> gluster volume geo-replication >>>>>>>>>>>>>>>>> :: config rsync-options "--ignore- >>>>>>>>>>>>>>>>> missing-args" >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Thanks, >>>>>>>>>>>>>>>>> Kotresh HR >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu srinivasan < >>>>>>>>>>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Hi >>>>>>>>>>>>>>>>>> We were evaluating Gluster geo Replication between two >>>>>>>>>>>>>>>>>> DCs one is in US west and one is in US east. We took multiple trials for >>>>>>>>>>>>>>>>>> different file size. >>>>>>>>>>>>>>>>>> The Geo Replication tends to stop replicating but while >>>>>>>>>>>>>>>>>> checking the status it appears to be in Active state. But the slave volume >>>>>>>>>>>>>>>>>> did not increase in size. >>>>>>>>>>>>>>>>>> So we have restarted the geo-replication session and >>>>>>>>>>>>>>>>>> checked the status. The status was in an active state and it was in History >>>>>>>>>>>>>>>>>> Crawl for a long time. We have enabled the DEBUG mode in logging and >>>>>>>>>>>>>>>>>> checked for any error. >>>>>>>>>>>>>>>>>> There was around 2000 file appeared for syncing >>>>>>>>>>>>>>>>>> candidate. The Rsync process starts but the rsync did not happen in the >>>>>>>>>>>>>>>>>> slave volume. Every time the rsync process appears in the "ps auxxx" list >>>>>>>>>>>>>>>>>> but the replication did not happen in the slave end. What would be the >>>>>>>>>>>>>>>>>> cause of this problem? Is there anyway to debug it? >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> We have also checked the strace of the rync program. 
>>>>>>>>>>>>>>>>>> it displays something like this >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> "write(2, "rsync: link_stat \"/tmp/gsyncd-au"..., 128" >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> We are using the below specs >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Gluster version - 4.1.7 >>>>>>>>>>>>>>>>>> Sync mode - rsync >>>>>>>>>>>>>>>>>> Volume - 1x3 in each end (master and slave) >>>>>>>>>>>>>>>>>> Intranet Bandwidth - 10 Gig >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>> Thanks and Regards, >>>>>>>>>>>>>>>>> Kotresh H R >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> -- >>>>>>>>>>>>>> Thanks and Regards, >>>>>>>>>>>>>> Kotresh H R >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> -- >>>>>>>>>>>> Thanks and Regards, >>>>>>>>>>>> Kotresh H R >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> Thanks and Regards, >>>>>>>>>> Kotresh H R >>>>>>>>>> >>>>>>>>> >>>>>> >>>>>> -- >>>>>> Thanks and Regards, >>>>>> Kotresh H R >>>>>> >>>>> >>>> >>>> -- >>>> Thanks and Regards, >>>> Kotresh H R >>>> >>> -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: From sunkumar at redhat.com Thu Jun 6 05:04:46 2019 From: sunkumar at redhat.com (Sunny Kumar) Date: Thu, 6 Jun 2019 10:34:46 +0530 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Hi, Updated link for documentation : -- https://docs.gluster.org/en/latest/Administrator%20Guide/Geo%20Replication/ You can use this tool as well: http://aravindavk.in/blog/gluster-georep-tools/ -Sunny On Thu, Jun 6, 2019 at 10:29 AM Kotresh Hiremath Ravishankar wrote: > > Hi, > > I think the steps to setup non-root geo-rep is not followed properly. The following entry is missing in glusterd vol file which is required. > > The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757] > > Could you please the steps from below? > > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave > > And let us know if you still face the issue. > > > > > On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan wrote: >> >> Hi Kotresh, Sunny >> I Have mailed the logs I found in one of the slave machines. Is there anything to do with permission? Please help. >> >> On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan wrote: >>> >>> Hi Kotresh, Sunny >>> Found this log in the slave machine. 
From abhishpaliwal at gmail.com  Thu Jun  6 06:38:20 2019
From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL)
Date: Thu, 6 Jun 2019 12:08:20 +0530
Subject: [Gluster-users] Memory leak in glusterfs
In-Reply-To:
References:
Message-ID:

Hi Nithya,

Here is the Setup details and test which we are doing as below:

One client, two gluster Server.
The client is writing and deleting one file each 15 minutes by script
test_v4.15.sh.
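A minimal sketch of what such a write/delete loop might look like (an
assumed reconstruction, not the attached test_v4.15.sh itself; the file
name and size below are placeholders):

#!/bin/bash
# Sketch: write one file to the mount, wait 15 minutes, remove it, repeat.
MOUNT=/gluster_mount            # the client-side mount point used below
while true; do
    dd if=/dev/urandom of="$MOUNT/testfile.bin" bs=1M count=1 2>/dev/null
    sleep 900                   # 15 minutes
    rm -f "$MOUNT/testfile.bin"
    sleep 900
done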
IP
Server side:
128.224.98.157   /gluster/gv0/
128.224.98.159   /gluster/gv0/

Client side:
128.224.98.160   /gluster_mount/

Server side:
gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/
128.224.98.159:/gluster/gv0/ force
gluster volume start gv0

root at 128:/tmp/brick/gv0# gluster volume info

Volume Name: gv0
Type: Replicate
Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x 2 = 2
Transport-type: tcp
Bricks:
Brick1: 128.224.98.157:/gluster/gv0
Brick2: 128.224.98.159:/gluster/gv0
Options Reconfigured:
transport.address-family: inet
nfs.disable: on
performance.client-io-threads: off

exec script: ./ps_mem.py -p 605 -w 61 > log
root at 128:/# ./ps_mem.py -p 605
 Private  +   Shared  =  RAM used	Program
23668.0 KiB + 1188.0 KiB = 24856.0 KiB	glusterfsd
---------------------------------
                        24856.0 KiB
=================================


Client side:
mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0
/gluster_mount


We are using the below script to write and delete the file:

*test_v4.15.sh*

Also the below script to watch the memory increase while the above
script is running in the background:

*ps_mem.py*

I am attaching the script files as well as the result we got after
testing the scenario.

On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran 
wrote:

> Hi,
>
> Writing to a volume should not affect glusterd. The stack you have shown
> in the valgrind looks like the memory used to initialise the structures
> glusterd uses and will free only when it is stopped.
>
> Can you provide more details to what it is you are trying to test?
>
> Regards,
> Nithya
>
>
> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL 
> wrote:
>
>> Hi Team,
>>
>> Please respond on the issue which I raised.
>>
>> Regards,
>> Abhishek
>>
>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL 
>> wrote:
>>
>>> Anyone please reply....
>>>
>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL 
>>> wrote:
>>>
>>>> Hi Team,
>>>>
>>>> I upload some valgrind logs from my gluster 5.4 setup. This is writing
>>>> to the volume every 15 minutes. I stopped glusterd and then copy away the
>>>> logs. The test was running for some simulated days. They are zipped in
>>>> valgrind-54.zip.
>>>>
>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in
>>>> glusterfs and even some definitely lost bytes.
>>>>
>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record
>>>> 391 of 391
>>>> ==2737== at 0x4C29C25: calloc (in
>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
>>>> ==2737== by 0xA22485E: ??? (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0xA217C94: ??? (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0xA21D9F8: ??? (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0xA21DED9: ??? (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0xA21E685: ??? (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0xA1B9D8C: init (in
>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so)
>>>> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1)
>>>> ==2737== by 0x4E8A2B8: ???
(in /usr/lib64/libglusterfs.so.0.0.1) >>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>> /usr/lib64/libglusterfs.so.0.0.1) >>>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>> ==2737== >>>> ==2737== LEAK SUMMARY: >>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>> ==2737== suppressed: 0 bytes in 0 blocks >>>> >>>> -- >>>> >>>> >>>> >>>> >>>> Regards >>>> Abhishek Paliwal >>>> >>> >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- Regards Abhishek Paliwal -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem.py Type: text/x-python Size: 18465 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: test_v4.15.sh Type: application/x-shellscript Size: 660 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem_server1.log Type: text/x-log Size: 135168 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem_server2.log Type: text/x-log Size: 135168 bytes Desc: not available URL: From revirii at googlemail.com Thu Jun 6 06:53:49 2019 From: revirii at googlemail.com (Hu Bert) Date: Thu, 6 Jun 2019 08:53:49 +0200 Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD In-Reply-To: References: Message-ID: Good morning, my comment won't help you directly, but i thought i'd send it anyway... Our first glusterfs setup had 3 servers withs 4 disks=bricks (10TB, JBOD) each. Was running fine in the beginning, but then 1 disk failed. The following heal took ~1 month, with a bad performance (quite high IO). Shortly after the heal hat finished another disk failed -> same problems again. Not funny. For our new system we decided to use 3 servers with 10 disks (10 TB) each, but now the 10 disks in a SW RAID 10 (well, we split the 10 disks into 2 SW RAID 10, each of them is a brick, we have 2 gluster volumes). A lot of disk space "wasted", with this type of SW RAID and a replicate 3 setup, but we wanted to avoid the "healing takes a long time with bad performance" problems. Now mdadm takes care of replicating data, glusterfs should always see "good" bricks. And the decision may depend on what kind of data you have. Many small files, like tens of millions? Or not that much, but bigger files? I once watched a video (i think it was this one: https://www.youtube.com/watch?v=61HDVwttNYI). Recommendation there: RAID 6 or 10 for small files, for big files... well, already 2 years "old" ;-) As i said, this won't help you directly. You have to identify what's most important for your scenario; as you said, high performance is not an issue - if this is true even when you have slight performance issues after a disk fail then ok. My experience so far: the bigger and slower the disks are and the more data you have -> healing will hurt -> try to avoid this. If the disks are small and fast (SSDs), healing will be faster -> JBOD is an option. 
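To make that concrete, the two-RAID-10-bricks-per-server layout looks
roughly like this (a sketch only: device names, paths and the volume
name are placeholders, not our exact commands):

# on each of the three servers: two 5-disk RAID 10 arrays, one brick each
mdadm --create /dev/md0 --level=10 --raid-devices=5 /dev/sd[b-f]
mdadm --create /dev/md1 --level=10 --raid-devices=5 /dev/sd[g-k]
mkfs.xfs -i size=512 /dev/md0
mkfs.xfs -i size=512 /dev/md1
mkdir -p /gluster/brick1 /gluster/brick2
mount /dev/md0 /gluster/brick1
mount /dev/md1 /gluster/brick2
# then one replica-3 volume per brick set:
gluster volume create vol1 replica 3 server1:/gluster/brick1/data \
  server2:/gluster/brick1/data server3:/gluster/brick1/data

This way a failed disk is rebuilt locally by mdadm from its mirror, and
gluster itself never has to heal a whole brick.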
hth, Hubert Am Mi., 5. Juni 2019 um 11:33 Uhr schrieb Eduardo Mayoral : > > Hi, > > I am looking into a new gluster deployment to replace an ancient one. > > For this deployment I will be using some repurposed servers I > already have in stock. The disk specs are 12 * 3 TB SATA disks. No HW > RAID controller. They also have some SSD which would be nice to leverage > as cache or similar to improve performance, since it is already there. > Advice on how to leverage the SSDs would be greatly appreciated. > > One of the design choices I have to make is using 3 nodes for a > replica-3 with JBOD, or using 2 nodes with a replica-2 and using SW RAID > 6 for the disks, maybe adding a 3rd node with a smaller amount of disk > as metadata node for the replica set. I would love to hear advice on the > pros and cons of each setup from the gluster experts. > > The data will be accessed from 4 to 6 systems with native gluster, > not sure if that makes any difference. > > The amount of data I have to store there is currently 20 TB, with > moderate growth. iops are quite low so high performance is not an issue. > The data will fit in any of the two setups. > > Thanks in advance for your advice! > > -- > Eduardo Mayoral Jimeno > Systems engineer, platform department. Arsys Internet. > emayoral at arsys.es - +34 941 620 105 - ext 2153 > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From pkarampu at redhat.com Thu Jun 6 07:02:46 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Thu, 6 Jun 2019 12:32:46 +0530 Subject: [Gluster-users] write request hung in write-behind In-Reply-To: <5cf5d239.1c69fb81.50f5a.c9f5SMTPIN_ADDED_BROKEN@mx.google.com> References: <5cf5d239.1c69fb81.50f5a.c9f5SMTPIN_ADDED_BROKEN@mx.google.com> Message-ID: On Tue, Jun 4, 2019 at 7:36 AM Xie Changlong wrote: > To me, all 'df' commands on specific(not all) nfs client hung forever. > The temporary solution is disable performance.nfs.write-behind and > cluster.eager-lock. > > I'll try to get more info back if encounter this problem again . > If you observe this issue again, take successive (at least a minute apart) statedumps of the processes and run https://github.com/gluster/glusterfs/blob/master/extras/identify-hangs.sh on it which will give the information about the hangs. > > > > ???: Raghavendra Gowdappa > ??: 2019/06/04(???)09:55 > ???: Xie Changlong ;Ravishankar Narayanankutty > ;Karampuri, Pranith ; > ???: gluster-users ; > ??: Re: Re: write request hung in write-behind > > > > On Mon, Jun 3, 2019 at 1:11 PM Xie Changlong wrote: > >> Firstly i correct myself, write request followed by 771(not 1545) FLUSH >> requests. I've attach gnfs dump file, totally 774 pending call-stacks, >> 771 of them pending on write-behind and the deepest call-stack is afr. >> > > +Ravishankar Narayanankutty +Karampuri, Pranith > > > Are you sure these were not call-stacks of in-progress ops? One way of > confirming that would be to take statedumps periodically (say 3 min apart). > Hung call stacks will be common to all the statedumps. 
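(A sketch of that procedure; the PID lookup is only an example. Gluster
processes write a statedump on SIGUSR1, by default into /var/run/gluster/:)

PID=$(pgrep -f "glusterfs.*nfs")    # example: the gnfs process
for i in 1 2 3; do
  kill -USR1 "$PID"                 # one statedump per signal
  sleep 60                          # at least a minute apart
done
ls -l /var/run/gluster/*.dump.*
# then run extras/identify-hangs.sh (link above) on those dump files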
> > >> [global.callpool.stack.771] >> stack=0x7f517f557f60 >> uid=0 >> gid=0 >> pid=0 >> unique=0 >> lk-owner= >> op=stack >> type=0 >> cnt=3 >> >> [global.callpool.stack.771.frame.1] >> frame=0x7f517f655880 >> ref_count=0 >> translator=cl35vol01-replicate-7 >> complete=0 >> parent=cl35vol01-dht >> wind_from=dht_writev >> wind_to=subvol->fops->writev >> unwind_to=dht_writev_cbk >> >> [global.callpool.stack.771.frame.2] >> frame=0x7f518ed90340 >> ref_count=1 >> translator=cl35vol01-dht >> complete=0 >> parent=cl35vol01-write-behind >> wind_from=wb_fulfill_head >> wind_to=FIRST_CHILD (frame->this)->fops->writev >> unwind_to=wb_fulfill_cbk >> >> [global.callpool.stack.771.frame.3] >> frame=0x7f516d3baf10 >> ref_count=1 >> translator=cl35vol01-write-behind >> complete=0 >> >> [global.callpool.stack.772] >> stack=0x7f51607a5a20 >> uid=0 >> gid=0 >> pid=0 >> unique=0 >> lk-owner=a0715b77517f0000 >> op=stack >> type=0 >> cnt=1 >> >> [global.callpool.stack.772.frame.1] >> frame=0x7f516ca2d1b0 >> ref_count=0 >> translator=cl35vol01-replicate-7 >> complete=0 >> >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep translator | wc -l >> 774 >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep complete |wc -l >> 774 >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep -E "complete=0" |wc -l >> 774 >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep translator | grep write-behind >> |wc -l >> 771 >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep translator | grep replicate-7 | >> wc -l >> 2 >> [root at rhel-201 35]# grep -rn "global.callpool.stack.*.frame.1" -A 5 >> glusterdump.20106.dump.1559038081 |grep translator | grep glusterfs | wc >> -l >> 1 >> >> >> >> >> ???: Raghavendra Gowdappa >> ??: 2019/06/03(???)14:46 >> ???: Xie Changlong ; >> ???: gluster-users ; >> ??: Re: write request hung in write-behind >> >> >> >> On Mon, Jun 3, 2019 at 11:57 AM Xie Changlong wrote: >> >>> Hi all >>> >>> Test gluster 3.8.4-54.15 gnfs, i saw a write request hung in >>> write-behind followed by 1545 FLUSH requests. I found a similar >>> bugfix https://bugzilla.redhat.com/show_bug.cgi?id=1626787, but not >>> sure if it's the right one. >>> >>> [xlator.performance.write-behind.wb_inode] >>> path=/575/1e/5751e318f21f605f2aac241bf042e7a8.jpg >>> inode=0x7f51775b71a0 >>> window_conf=1073741824 >>> window_current=293822 >>> transit-size=293822 >>> dontsync=0 >>> >>> [.WRITE] >>> request-ptr=0x7f516eec2060 >>> refcount=1 >>> wound=yes >>> generation-number=1 >>> req->op_ret=293822 >>> req->op_errno=0 >>> sync-attempts=1 >>> sync-in-progress=yes >>> >> >> Note that the sync is still in progress. This means, write-behind has >> wound the write-request to its children and yet to receive the response >> (unless there is a bug in accounting of sync-in-progress). So, its likely >> that there are callstacks into children of write-behind, which are not >> complete yet. Are you sure the deepest hung call-stack is in write-behind? >> Can you check for frames with "complete=0"? 
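(Along the lines of the greps shown earlier in this thread, for example:)

# count frames that never completed, then see which translators they sit in
grep -rn "global.callpool.stack.*.frame" -A 5 glusterdump.20106.dump.1559038081 \
  | grep -E "complete=0" | wc -l
grep -rn "global.callpool.stack.*.frame" -A 5 glusterdump.20106.dump.1559038081 \
  | grep translator | sort | uniq -c | sort -rn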
>> >> size=293822 >>> offset=1048576 >>> lied=-1 >>> append=0 >>> fulfilled=0 >>> go=-1 >>> >>> [.FLUSH] >>> request-ptr=0x7f517c2badf0 >>> refcount=1 >>> wound=no >>> generation-number=2 >>> req->op_ret=-1 >>> req->op_errno=116 >>> sync-attempts=0 >>> >>> [.FLUSH] >>> request-ptr=0x7f5173e9f7b0 >>> refcount=1 >>> wound=no >>> generation-number=2 >>> req->op_ret=0 >>> req->op_errno=0 >>> sync-attempts=0 >>> >>> [.FLUSH] >>> request-ptr=0x7f51640b8ca0 >>> refcount=1 >>> wound=no >>> generation-number=2 >>> req->op_ret=0 >>> req->op_errno=0 >>> sync-attempts=0 >>> >>> [.FLUSH] >>> request-ptr=0x7f516f3979d0 >>> refcount=1 >>> wound=no >>> generation-number=2 >>> req->op_ret=0 >>> req->op_errno=0 >>> sync-attempts=0 >>> >>> [.FLUSH] >>> request-ptr=0x7f516f6ac8d0 >>> refcount=1 >>> wound=no >>> generation-number=2 >>> req->op_ret=0 >>> req->op_errno=0 >>> sync-attempts=0 >>> >>> >>> Any comments would be appreciated! >>> >>> Thanks >>> -Xie >>> >>> >>> -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Thu Jun 6 10:38:17 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Thu, 6 Jun 2019 16:08:17 +0530 Subject: [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Abhishek, I am still not clear as to the purpose of the tests. Can you clarify why you are using valgrind and why you think there is a memory leak? Regards, Nithya On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL wrote: > Hi Nithya, > > Here is the Setup details and test which we are doing as below: > > > One client, two gluster Server. > The client is writing and deleting one file each 15 minutes by script > test_v4.15.sh. > > IP > Server side: > 128.224.98.157 /gluster/gv0/ > 128.224.98.159 /gluster/gv0/ > > Client side: > 128.224.98.160 /gluster_mount/ > > Server side: > gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ > 128.224.98.159:/gluster/gv0/ force > gluster volume start gv0 > > root at 128:/tmp/brick/gv0# gluster volume info > > Volume Name: gv0 > Type: Replicate > Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 > Status: Started > Snapshot Count: 0 > Number of Bricks: 1 x 2 = 2 > Transport-type: tcp > Bricks: > Brick1: 128.224.98.157:/gluster/gv0 > Brick2: 128.224.98.159:/gluster/gv0 > Options Reconfigured: > transport.address-family: inet > nfs.disable: on > performance.client-io-threads: off > > exec script: ./ps_mem.py -p 605 -w 61 > log > root at 128:/# ./ps_mem.py -p 605 > Private + Shared = RAM used Program > 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd > --------------------------------- > 24856.0 KiB > ================================= > > > Client side: > mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 > /gluster_mount > > > We are using the below script write and delete the file. > > *test_v4.15.sh * > > Also the below script to see the memory increase whihle the script is > above script is running in background. > > *ps_mem.py* > > I am attaching the script files as well as the result got after testing > the scenario. > > On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran > wrote: > >> Hi, >> >> Writing to a volume should not affect glusterd. The stack you have shown >> in the valgrind looks like the memory used to initialise the structures >> glusterd uses and will free only when it is stopped. >> >> Can you provide more details to what it is you are trying to test? 
>> >> Regards, >> Nithya >> >> >> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >> wrote: >> >>> Hi Team, >>> >>> Please respond on the issue which I raised. >>> >>> Regards, >>> Abhishek >>> >>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>> abhishpaliwal at gmail.com> wrote: >>> >>>> Anyone please reply.... >>>> >>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>> wrote: >>>> >>>>> Hi Team, >>>>> >>>>> I upload some valgrind logs from my gluster 5.4 setup. This is writing >>>>> to the volume every 15 minutes. I stopped glusterd and then copy away the >>>>> logs. The test was running for some simulated days. They are zipped in >>>>> valgrind-54.zip. >>>>> >>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>> glusterfs and even some definitely lost bytes. >>>>> >>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>>>> 391 of 391 >>>>> ==2737== at 0x4C29C25: calloc (in >>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>> ==2737== by 0xA22485E: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA217C94: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21D9F8: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21DED9: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21E685: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA1B9D8C: init (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>>> ==2737== >>>>> ==2737== LEAK SUMMARY: >>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>> >>>>> -- >>>>> >>>>> >>>>> >>>>> >>>>> Regards >>>>> Abhishek Paliwal >>>>> >>>> >>> >>> -- >>> >>> >>> >>> >>> Regards >>> Abhishek Paliwal >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> > > -- > > > > > Regards > Abhishek Paliwal > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sunkumar at redhat.com Thu Jun 6 10:40:16 2019 From: sunkumar at redhat.com (Sunny Kumar) Date: Thu, 6 Jun 2019 16:10:16 +0530 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Above error can be tracked here: https://bugzilla.redhat.com/show_bug.cgi?id=1709248 and patch link: https://review.gluster.org/#/c/glusterfs/+/22716/ You can apply patch and test it however its waiting on regression to pass and merge. -Sunny On Thu, Jun 6, 2019 at 4:00 PM deepu srinivasan wrote: > > Hi > I have followed the following steps to create the geo-replication but the status seems to be in a faulty state. > > Steps : > > Installed cluster version 5.6 in totally six nodes. 
>> >> glusterfs 5.6 >> >> Repository revision: git://git.gluster.org/glusterfs.git >> >> Copyright (c) 2006-2016 Red Hat, Inc. >> >> GlusterFS comes with ABSOLUTELY NO WARRANTY. >> >> It is licensed to you under your choice of the GNU Lesser >> >> General Public License, version 3 or any later version (LGPLv3 >> >> or later), or the GNU General Public License, version 2 (GPLv2), >> >> in all cases as published by the Free Software Foundation > > > peer_probed the first three nodes and second three nodes. > > > > Added new volume in both the clusters > > > > execute gluster-mountbroker commands and restarted glusterd. >> >> gluster-mountbroker setup /var/mountbroker-root sas >> >> gluster-mountbroker remove --volume code-misc --user sas > > > configured a passwordless sssh from master to slave >> >> ssh-keygen; ssh-copy-id sas at 192.168.185.107 > > created a common pem pub file >> >> gluster system:: execute gsec_create > > created geo-replication session. >> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc create push-pem > > executed the following command in slave >> >> /usr/libexec/glusterfs/set_geo_rep_pem_keys.sh sas code-misc code-misc > > started the gluster geo-replication. >> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc start > > > Now the geo-replication works fine. > Tested with 2000 files All seems to sync finely. > > Now I updated all the node to version 6.2 by using rpms which were built by the source code in a docker container in my personal machine. > > >> gluster --version >> >> glusterfs 6.2 >> >> Repository revision: git://git.gluster.org/glusterfs.git >> >> Copyright (c) 2006-2016 Red Hat, Inc. >> >> GlusterFS comes with ABSOLUTELY NO WARRANTY. >> >> It is licensed to you under your choice of the GNU Lesser >> >> General Public License, version 3 or any later version (LGPLv3 >> >> or later), or the GNU General Public License, version 2 (GPLv2), >> >> in all cases as published by the Free Software Foundation. > > > I have stopped the glusterd daemons in all the node along with the volume and geo-replication. > Now I started the daemons, volume and geo-replication session the status seems to be faulty. > Also noted that the result of "gluster-mountbroker status" command always end in python exception like this >> >> Traceback (most recent call last): >> >> File "/usr/sbin/gluster-mountbroker", line 396, in >> >> runcli() >> >> File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 225, in runcli >> >> cls.run(args) >> >> File "/usr/sbin/gluster-mountbroker", line 275, in run >> >> out = execute_in_peers("node-status") >> >> File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 127, in execute_in_peers >> >> raise GlusterCmdException((rc, out, err, " ".join(cmd))) >> >> gluster.cliutils.cliutils.GlusterCmdException: (1, '', 'Unable to end. Error : Success\n', 'gluster system:: execute mountbroker.py node-status') > > > Is it I or everyone gets an error for gluster-mountbroker command for gluster version greater than 6.0?. Please help. 
> > Thank you > Deepak > > > On Thu, Jun 6, 2019 at 10:35 AM Sunny Kumar wrote: >> >> Hi, >> >> Updated link for documentation : >> >> -- https://docs.gluster.org/en/latest/Administrator%20Guide/Geo%20Replication/ >> >> You can use this tool as well: >> http://aravindavk.in/blog/gluster-georep-tools/ >> >> -Sunny >> >> On Thu, Jun 6, 2019 at 10:29 AM Kotresh Hiremath Ravishankar >> wrote: >> > >> > Hi, >> > >> > I think the steps to setup non-root geo-rep is not followed properly. The following entry is missing in glusterd vol file which is required. >> > >> > The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757] >> > >> > Could you please the steps from below? >> > >> > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave >> > >> > And let us know if you still face the issue. >> > >> > >> > >> > >> > On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan wrote: >> >> >> >> Hi Kotresh, Sunny >> >> I Have mailed the logs I found in one of the slave machines. Is there anything to do with permission? Please help. >> >> >> >> On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan wrote: >> >>> >> >>> Hi Kotresh, Sunny >> >>> Found this log in the slave machine. >> >>>> >> >>>> [2019-06-05 08:49:10.632583] I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req >> >>>> >> >>>> The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-05 08:49:10.632583] and [2019-06-05 08:49:10.670863] >> >>>> >> >>>> The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and [2019-06-05 08:50:37.254063] >> >>>> >> >>>> The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 34 times between [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079] >> >>>> >> >>>> The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 34 times between [2019-06-05 08:48:41.005444] and [2019-06-05 08:50:37.254080] >> >>>> >> >>>> [2019-06-05 08:50:46.361347] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req >> >>>> >> >>>> [2019-06-05 08:50:46.361384] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file >> >>>> >> >>>> [2019-06-05 08:50:46.361419] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory] >> >>>> >> >>>> The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and [2019-06-05 08:52:34.019741] >> >>>> >> >>>> The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757] >> >>>> 
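(For reference, a sketch of the entries that the mountbroker setup is
supposed to leave in /etc/glusterfs/glusterd.vol on the slave nodes;
the user and volume names are the ones from this thread, the log group
name is an assumption:)

volume management
    ...
    option mountbroker-root /var/mountbroker-root
    option mountbroker-geo-replication.sas code-misc
    option geo-replication-log-group geogroup
    option rpc-auth-allow-insecure on
end-volume

# normally written by gluster-mountbroker, not by hand,
# followed by a glusterd restart on the slave nodes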
>> >>>> The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 33 times between [2019-06-05 08:50:46.361419] and [2019-06-05 08:52:34.019758] >> >>>> >> >>>> [2019-06-05 08:52:44.426839] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req >> >>>> >> >>>> [2019-06-05 08:52:44.426886] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file >> >>>> >> >>>> [2019-06-05 08:52:44.426896] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory] >> >>> >> >>> >> >>> On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan wrote: >> >>>> >> >>>> Thankyou Kotresh >> >>>> >> >>>> On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath Ravishankar wrote: >> >>>>> >> >>>>> Ccing Sunny, who was investing similar issue. >> >>>>> >> >>>>> On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan wrote: >> >>>>>> >> >>>>>> Have already added the path in bashrc . Still in faulty state >> >>>>>> >> >>>>>> On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar wrote: >> >>>>>>> >> >>>>>>> could you please try adding /usr/sbin to $PATH for user 'sas'? If it's bash, add 'export PATH=/usr/sbin:$PATH' in >> >>>>>>> /home/sas/.bashrc >> >>>>>>> >> >>>>>>> On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan wrote: >> >>>>>>>> >> >>>>>>>> Hi Kortesh >> >>>>>>>> Please find the logs of the above error >> >>>>>>>> Master log snippet >> >>>>>>>>> >> >>>>>>>>> [2019-06-04 11:52:09.254731] I [resource(worker /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing SSH connection between master and slave... >> >>>>>>>>> [2019-06-04 11:52:09.308923] D [repce(worker /home/sas/gluster/data/code-misc):196:push] RepceClient: call 89724:139652759443264:1559649129.31 __repce_version__() ... >> >>>>>>>>> [2019-06-04 11:52:09.602792] E [syncdutils(worker /home/sas/gluster/data/code-misc):311:log_raise_exception] : connection to peer is broken >> >>>>>>>>> [2019-06-04 11:52:09.603312] E [syncdutils(worker /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i /var/lib/ glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas@ 192.168.185.107::code-misc --master-node 192.168.185.106 --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node- id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 --slave-log-level DEBUG --slave-gluster-log-level INFO --slave-gluster-command-dir /usr/sbin error=1 >> >>>>>>>>> [2019-06-04 11:52:09.614996] I [repce(agent /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating on reaching EOF. 
>> >>>>>>>>> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor: worker(/home/sas/gluster/data/code-misc) connected >> >>>>>>>>> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor: worker died in startup phase brick=/home/sas/gluster/data/code-misc >> >>>>>>>>> [2019-06-04 11:52:09.619391] I [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status Change status=Faulty >> >>>>>>>> >> >>>>>>>> >> >>>>>>>> Slave log snippet >> >>>>>>>>> >> >>>>>>>>> [2019-06-04 11:50:09.782668] E [syncdutils(slave 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) >> >>>>>>>>> [2019-06-04 11:50:11.188167] W [gsyncd(slave 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : Session config file not exists, using the default config path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf >> >>>>>>>>> [2019-06-04 11:50:11.201070] I [resource(slave 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] GLUSTER: Mounting gluster volume locally... >> >>>>>>>>> [2019-06-04 11:50:11.271231] E [resource(slave 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] MountbrokerMounter: glusterd answered mnt= >> >>>>>>>>> [2019-06-04 11:50:11.271998] E [syncdutils(slave 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error cmd=/usr/sbin/gluster --remote-host=localhost system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1 >> >>>>>>>>> [2019-06-04 11:50:11.272113] E [syncdutils(slave 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) >> >>>>>>>> >> >>>>>>>> >> >>>>>>>> On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan wrote: >> >>>>>>>>> >> >>>>>>>>> Hi >> >>>>>>>>> As discussed I have upgraded gluster from 4.1 to 6.2 version. But the Geo replication failed to start. >> >>>>>>>>> Stays in faulty state >> >>>>>>>>> >> >>>>>>>>> On Fri, May 31, 2019, 5:32 PM deepu srinivasan wrote: >> >>>>>>>>>> >> >>>>>>>>>> Checked the data. It remains in 2708. No progress. >> >>>>>>>>>> >> >>>>>>>>>> On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar wrote: >> >>>>>>>>>>> >> >>>>>>>>>>> That means it could be working and the defunct process might be some old zombie one. Could you check, that data progress ? >> >>>>>>>>>>> >> >>>>>>>>>>> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan wrote: >> >>>>>>>>>>>> >> >>>>>>>>>>>> Hi >> >>>>>>>>>>>> When i change the rsync option the rsync process doesnt seem to start . Only a defunt process is listed in ps aux. Only when i set rsync option to " " and restart all the process the rsync process is listed in ps aux. >> >>>>>>>>>>>> >> >>>>>>>>>>>> >> >>>>>>>>>>>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar wrote: >> >>>>>>>>>>>>> >> >>>>>>>>>>>>> Yes, rsync config option should have fixed this issue. >> >>>>>>>>>>>>> >> >>>>>>>>>>>>> Could you share the output of the following? >> >>>>>>>>>>>>> >> >>>>>>>>>>>>> 1. gluster volume geo-replication :: config rsync-options >> >>>>>>>>>>>>> 2. 
ps -ef | grep rsync >> >>>>>>>>>>>>> >> >>>>>>>>>>>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan wrote: >> >>>>>>>>>>>>>> >> >>>>>>>>>>>>>> Done. >> >>>>>>>>>>>>>> We got the following result . >> >>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>> 1559298781.338234 write(2, "rsync: link_stat \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" failed: No such file or directory (2)", 128 >> >>>>>>>>>>>>>> >> >>>>>>>>>>>>>> seems like a file is missing ? >> >>>>>>>>>>>>>> >> >>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath Ravishankar wrote: >> >>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>> Hi, >> >>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>> Could you take the strace with with more string size? The argument strings are truncated. >> >>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>> strace -s 500 -ttt -T -p >> >>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan wrote: >> >>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>> Hi Kotresh >> >>>>>>>>>>>>>>>> The above-mentioned work around did not work properly. >> >>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan wrote: >> >>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>> Hi Kotresh >> >>>>>>>>>>>>>>>>> We have tried the above-mentioned rsync option and we are planning to have the version upgrade to 6.0. >> >>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh Hiremath Ravishankar wrote: >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> Hi, >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> This looks like the hang because stderr buffer filled up with errors messages and no one reading it. >> >>>>>>>>>>>>>>>>>> I think this issue is fixed in latest releases. As a workaround, you can do following and check if it works. >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> Prerequisite: >> >>>>>>>>>>>>>>>>>> rsync version should be > 3.1.0 >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> Workaround: >> >>>>>>>>>>>>>>>>>> gluster volume geo-replication :: config rsync-options "--ignore-missing-args" >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> Thanks, >> >>>>>>>>>>>>>>>>>> Kotresh HR >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu srinivasan wrote: >> >>>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>>> Hi >> >>>>>>>>>>>>>>>>>>> We were evaluating Gluster geo Replication between two DCs one is in US west and one is in US east. We took multiple trials for different file size. >> >>>>>>>>>>>>>>>>>>> The Geo Replication tends to stop replicating but while checking the status it appears to be in Active state. But the slave volume did not increase in size. >> >>>>>>>>>>>>>>>>>>> So we have restarted the geo-replication session and checked the status. The status was in an active state and it was in History Crawl for a long time. We have enabled the DEBUG mode in logging and checked for any error. >> >>>>>>>>>>>>>>>>>>> There was around 2000 file appeared for syncing candidate. The Rsync process starts but the rsync did not happen in the slave volume. Every time the rsync process appears in the "ps auxxx" list but the replication did not happen in the slave end. What would be the cause of this problem? Is there anyway to debug it? >> >>>>>>>>>>>>>>>>>>> >> >>>>>>>>>>>>>>>>>>> We have also checked the strace of the rync program. 
>> >>>>>>>>>>>>>>>>>>> It displays something like this:
>> >>>>>>>>>>>>>>>>>>>
>> >>>>>>>>>>>>>>>>>>> "write(2, "rsync: link_stat \"/tmp/gsyncd-au"..., 128"
>> >>>>>>>>>>>>>>>>>>>
>> >>>>>>>>>>>>>>>>>>> We are using the below specs:
>> >>>>>>>>>>>>>>>>>>>
>> >>>>>>>>>>>>>>>>>>> Gluster version - 4.1.7
>> >>>>>>>>>>>>>>>>>>> Sync mode - rsync
>> >>>>>>>>>>>>>>>>>>> Volume - 1x3 on each end (master and slave)
>> >>>>>>>>>>>>>>>>>>> Intranet bandwidth - 10 Gig
>> >>>>>>>>>>>>>>>>>>
>> >>>>>>>>>>>>>>>>>> --
>> >>>>>>>>>>>>>>>>>> Thanks and Regards,
>> >>>>>>>>>>>>>>>>>> Kotresh H R
>> >>>>>>>>>>>>>>>
>> >>>>>>>>>>>>>>> --
>> >>>>>>>>>>>>>>> Thanks and Regards,
>> >>>>>>>>>>>>>>> Kotresh H R
>> >>>>>>>>>>>>>
>> >>>>>>>>>>>>> --
>> >>>>>>>>>>>>> Thanks and Regards,
>> >>>>>>>>>>>>> Kotresh H R
>> >>>>>>>>>>>
>> >>>>>>>>>>> --
>> >>>>>>>>>>> Thanks and Regards,
>> >>>>>>>>>>> Kotresh H R
>> >>>>>>>
>> >>>>>>> --
>> >>>>>>> Thanks and Regards,
>> >>>>>>> Kotresh H R
>> >>>>>
>> >>>>> --
>> >>>>> Thanks and Regards,
>> >>>>> Kotresh H R
>> >
>> > --
>> > Thanks and Regards,
>> > Kotresh H R

From sunkumar at redhat.com Thu Jun 6 11:38:31 2019
From: sunkumar at redhat.com (Sunny Kumar)
Date: Thu, 6 Jun 2019 17:08:31 +0530
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: References: Message-ID:

What's the current traceback? Please share.

-Sunny

On Thu, Jun 6, 2019 at 4:53 PM deepu srinivasan wrote:
>
> Hi Sunny
> I have changed the file in /usr/libexec/glusterfs/peer_mountbroker.py as mentioned in the patch.
> Now the "gluster-mountbroker status" command is working fine, but the geo-replication still seems to be in the faulty state.
>
> Thank you
> Deepak
>
> On Thu, Jun 6, 2019 at 4:10 PM Sunny Kumar wrote:
>>
>> The above error can be tracked here:
>>
>> https://bugzilla.redhat.com/show_bug.cgi?id=1709248
>>
>> and patch link:
>> https://review.gluster.org/#/c/glusterfs/+/22716/
>>
>> You can apply the patch and test it; however, it is waiting on regression to
>> pass and merge.
>>
>> -Sunny
>>
>> On Thu, Jun 6, 2019 at 4:00 PM deepu srinivasan wrote:
>> >
>> > Hi
>> > I followed the steps below to create the geo-replication, but the status seems to be faulty.
>> >
>> > Steps:
>> >
>> > Installed gluster version 5.6 on all six nodes.
>> >>
>> >> glusterfs 5.6
>> >>
>> >> Repository revision: git://git.gluster.org/glusterfs.git
>> >>
>> >> Copyright (c) 2006-2016 Red Hat, Inc.
>> >>
>> >> GlusterFS comes with ABSOLUTELY NO WARRANTY.
>> >>
>> >> It is licensed to you under your choice of the GNU Lesser
>> >>
>> >> General Public License, version 3 or any later version (LGPLv3
>> >>
>> >> or later), or the GNU General Public License, version 2 (GPLv2),
>> >>
>> >> in all cases as published by the Free Software Foundation.
>> >
>> > Peer-probed the first three nodes and the second three nodes.
>> >
>> > Added a new volume in both clusters.
>> >
>> > Executed the gluster-mountbroker commands and restarted glusterd.
>> >>
>> >> gluster-mountbroker setup /var/mountbroker-root sas
>> >>
>> >> gluster-mountbroker remove --volume code-misc --user sas
>> >
>> > Configured passwordless ssh from master to slave.
>> >>
>> >> ssh-keygen; ssh-copy-id sas at 192.168.185.107
>> >
>> > Created a common pem pub file.
>> >>
>> >> gluster system:: execute gsec_create
>> >
>> > Created the geo-replication session.
>> >>
>> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc create push-pem
>> >
>> > Executed the following command on the slave.
>> >>
>> >> /usr/libexec/glusterfs/set_geo_rep_pem_keys.sh sas code-misc code-misc
>> >
>> > Started the gluster geo-replication.
>> >>
>> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc start
>> >
>> > Now the geo-replication works fine.
>> > Tested with 2000 files; all seem to sync fine.
>> >
>> > Then I updated all the nodes to version 6.2, using rpms built from the source code in a docker container on my personal machine.
>> >
>> >> gluster --version
>> >>
>> >> glusterfs 6.2
>> >>
>> >> Repository revision: git://git.gluster.org/glusterfs.git
>> >>
>> >> Copyright (c) 2006-2016 Red Hat, Inc.
>> >>
>> >> GlusterFS comes with ABSOLUTELY NO WARRANTY.
>> >>
>> >> It is licensed to you under your choice of the GNU Lesser
>> >>
>> >> General Public License, version 3 or any later version (LGPLv3
>> >>
>> >> or later), or the GNU General Public License, version 2 (GPLv2),
>> >>
>> >> in all cases as published by the Free Software Foundation.
>> >
>> > I stopped the glusterd daemons on all the nodes, along with the volume and geo-replication.
>> > Then I started the daemons, the volume and the geo-replication session; the status seems to be faulty.
>> > Also note that the "gluster-mountbroker status" command always ends in a Python exception like this:
>> >>
>> >> Traceback (most recent call last):
>> >>   File "/usr/sbin/gluster-mountbroker", line 396, in <module>
>> >>     runcli()
>> >>   File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 225, in runcli
>> >>     cls.run(args)
>> >>   File "/usr/sbin/gluster-mountbroker", line 275, in run
>> >>     out = execute_in_peers("node-status")
>> >>   File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 127, in execute_in_peers
>> >>     raise GlusterCmdException((rc, out, err, " ".join(cmd)))
>> >> gluster.cliutils.cliutils.GlusterCmdException: (1, '', 'Unable to end. Error : Success\n', 'gluster system:: execute mountbroker.py node-status')
>> >
>> > Is it just me, or does everyone get an error from the gluster-mountbroker command with gluster versions greater than 6.0? Please help.
>> >
>> > Thank you
>> > Deepak
>> >
>> > On Thu, Jun 6, 2019 at 10:35 AM Sunny Kumar wrote:
>> >>
>> >> Hi,
>> >>
>> >> Updated link for documentation:
>> >>
>> >> -- https://docs.gluster.org/en/latest/Administrator%20Guide/Geo%20Replication/
>> >>
>> >> You can use this tool as well:
>> >> http://aravindavk.in/blog/gluster-georep-tools/
>> >>
>> >> -Sunny
>> >>
>> >> On Thu, Jun 6, 2019 at 10:29 AM Kotresh Hiremath Ravishankar
>> >> wrote:
>> >> >
>> >> > Hi,
>> >> >
>> >> > I think the steps to set up non-root geo-rep were not followed properly. The following entry is missing in the glusterd vol file, and it is required.
>> >> >
>> >> > The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
>> >> >
>> >> > Could you please follow the steps from the link below?
>> >> >
>> >> > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave
>> >> >
>> >> > And let us know if you still face the issue.
>> >> >
>> >> > On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan wrote:
>> >> >>
>> >> >> Hi Kotresh, Sunny
>> >> >> I have mailed the logs I found on one of the slave machines. Does it have anything to do with permissions? Please help.
>> >> >>
>> >> >> On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan wrote:
>> >> >>>
>> >> >>> Hi Kotresh, Sunny
>> >> >>> Found this log on the slave machine.
>> >> >>>>
>> >> >>>> [2019-06-05 08:49:10.632583] I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req
>> >> >>>>
>> >> >>>> The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-05 08:49:10.632583] and [2019-06-05 08:49:10.670863]
>> >> >>>>
>> >> >>>> The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and [2019-06-05 08:50:37.254063]
>> >> >>>>
>> >> >>>> The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 34 times between [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079]
>> >> >>>>
>> >> >>>> The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 34 times between [2019-06-05 08:48:41.005444] and [2019-06-05 08:50:37.254080]
>> >> >>>>
>> >> >>>> [2019-06-05 08:50:46.361347] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>> >> >>>>
>> >> >>>> [2019-06-05 08:50:46.361384] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>> >> >>>>
>> >> >>>> [2019-06-05 08:50:46.361419] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>> >> >>>>
>> >> >>>> The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and [2019-06-05 08:52:34.019741]
>> >> >>>>
>> >> >>>> The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
>> >> >>>>
>> >> >>>> The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 33 times between [2019-06-05 08:50:46.361419] and [2019-06-05 08:52:34.019758]
>> >> >>>>
>> >> >>>> [2019-06-05 08:52:44.426839] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>> >> >>>>
>> >> >>>> [2019-06-05 08:52:44.426886] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>> >> >>>>
>> >> >>>> [2019-06-05 08:52:44.426896] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>> >> >>>
>> >> >>> On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan wrote:
>> >> >>>>
>> >> >>>> Thank you, Kotresh
>> >> >>>>
>> >> >>>> On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath Ravishankar wrote:
>> >> >>>>>
>> >> >>>>> Ccing Sunny, who was investigating a similar issue.
>> >> >>>>>
>> >> >>>>> On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan wrote:
>> >> >>>>>>
>> >> >>>>>> I have already added the path in bashrc. Still in the faulty state.
>> >> >>>>>>
>> >> >>>>>> On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar wrote:
>> >> >>>>>>>
>> >> >>>>>>> Could you please try adding /usr/sbin to $PATH for user 'sas'? If it's bash, add 'export PATH=/usr/sbin:$PATH' in
>> >> >>>>>>> /home/sas/.bashrc
>> >> >>>>>>>
>> >> >>>>>>> On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan wrote:
>> >> >>>>>>>>
>> >> >>>>>>>> Hi Kotresh
>> >> >>>>>>>> Please find the logs for the above error.
>> >> >>>>>>>> Master log snippet
>> >> >>>>>>>>>
>> >> >>>>>>>>> [2019-06-04 11:52:09.254731] I [resource(worker /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing SSH connection between master and slave...
>> >> >>>>>>>>> [2019-06-04 11:52:09.308923] D [repce(worker /home/sas/gluster/data/code-misc):196:push] RepceClient: call 89724:139652759443264:1559649129.31 __repce_version__() ...
>> >> >>>>>>>>> [2019-06-04 11:52:09.602792] E [syncdutils(worker /home/sas/gluster/data/code-misc):311:log_raise_exception] : connection to peer is broken
>> >> >>>>>>>>> [2019-06-04 11:52:09.603312] E [syncdutils(worker /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas at 192.168.185.107::code-misc --master-node 192.168.185.106 --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node-id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 --slave-log-level DEBUG --slave-gluster-log-level INFO --slave-gluster-command-dir /usr/sbin error=1
>> >> >>>>>>>>> [2019-06-04 11:52:09.614996] I [repce(agent /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating on reaching EOF.
>> >> >>>>>>>>> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor: worker(/home/sas/gluster/data/code-misc) connected
>> >> >>>>>>>>> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor: worker died in startup phase brick=/home/sas/gluster/data/code-misc
>> >> >>>>>>>>> [2019-06-04 11:52:09.619391] I [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status Change status=Faulty
>> >> >>>>>>>>
>> >> >>>>>>>> [...]

From sunkumar at redhat.com Thu Jun 6 12:52:47 2019
From: sunkumar at redhat.com (Sunny Kumar)
Date: Thu, 6 Jun 2019 18:22:47 +0530
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: References: Message-ID:

You should not have used this one:

> > gluster-mountbroker remove --volume code-misc --user sas

This one is to remove a volume/user from the mount broker.

Please try setting up the mount broker once again.
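A minimal sketch of what redoing that setup could look like on the slave cluster, reusing the user 'sas', volume 'code-misc' and mount root from the steps quoted earlier (treat this as an outline to check against the geo-replication docs, not an authoritative procedure; on this setup 'sas' is both the user and its group):

    gluster-mountbroker setup /var/mountbroker-root sas
    gluster-mountbroker add code-misc sas
    # restart glusterd on every slave node so the new options are loaded
    systemctl restart glusterd
    gluster-mountbroker status   # should now list the mount root and the sas/code-misc entry

If the setup took effect, 'option mountbroker-root /var/mountbroker-root' should appear in the slave nodes' glusterd vol file, which is exactly the option the error messages above report as missing.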
-Sunny On Thu, Jun 6, 2019 at 5:28 PM deepu srinivasan wrote: > > Hi Sunny > Please find the logs attached >> >> The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 13 times between [2019-06-06 11:51:43.986788] and [2019-06-06 11:52:32.764546] >> >> The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 13 times between [2019-06-06 11:51:43.986798] and [2019-06-06 11:52:32.764548] >> >> The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-06 11:53:07.064332] and [2019-06-06 11:53:07.303978] >> >> [2019-06-06 11:55:35.624320] I [MSGID: 106495] [glusterd-handler.c:3137:__glusterd_handle_getwd] 0-glusterd: Received getwd req >> >> [2019-06-06 11:55:35.884345] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: quotad already stopped >> >> [2019-06-06 11:55:35.884373] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: quotad service is stopped >> >> [2019-06-06 11:55:35.884459] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: bitd already stopped >> >> [2019-06-06 11:55:35.884473] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: bitd service is stopped >> >> [2019-06-06 11:55:35.884554] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: scrub already stopped >> >> [2019-06-06 11:55:35.884567] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: scrub service is stopped >> >> [2019-06-06 11:55:35.893823] I [run.c:242:runner_log] (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) [0x7f7380d60e1a] -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) [0x7f738cbc5df5] ) 0-management: Ran script: /var/lib/glusterd/hooks/1/set/post/S30samba-set.sh --volname=code-misc -o features.read-only=on --gd-workdir=/var/lib/glusterd >> >> [2019-06-06 11:55:35.900465] I [run.c:242:runner_log] (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) [0x7f7380d60e1a] -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) [0x7f738cbc5df5] ) 0-management: Ran script: /var/lib/glusterd/hooks/1/set/post/S32gluster_enable_shared_storage.sh --volname=code-misc -o features.read-only=on --gd-workdir=/var/lib/glusterd >> >> [2019-06-06 11:55:43.485284] I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req >> >> The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-06 11:55:43.485284] and [2019-06-06 11:55:43.512321] >> >> [2019-06-06 11:55:44.055419] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req >> >> [2019-06-06 11:55:44.055473] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file >> >> [2019-06-06 11:55:44.055483] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory] >> >> [2019-06-06 11:55:44.056695] I [MSGID: 106496] 
[glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>>
>> [2019-06-06 11:55:44.056725] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>>
>> [2019-06-06 11:55:44.056734] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>>
>> [2019-06-06 11:55:44.057522] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>>
>> [2019-06-06 11:55:44.057552] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>>
>> [2019-06-06 11:55:44.057562] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>>
>> [2019-06-06 11:55:54.655681] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>>
>> [2019-06-06 11:55:54.655741] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>>
>> [2019-06-06 11:55:54.655752] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]

> On Thu, Jun 6, 2019 at 5:09 PM Sunny Kumar wrote:
>>
>> What's the current traceback? Please share.
>>
>> -Sunny
>>
>> [...]

From emayoral at arsys.es Thu Jun 6 16:48:02 2019
From: emayoral at arsys.es (Eduardo Mayoral)
Date: Thu, 6 Jun 2019 18:48:02 +0200
Subject: [Gluster-users] Advice for setup: SW RAID 6
 vs JBOD
In-Reply-To: References: Message-ID:

Your comment actually helps me more than you think; one of the main
doubts I have is whether I go for JBOD with replica 3 or SW RAID 6 with
replica 2 + arbiter. Before reading your email I was leaning more
towards JBOD, as reconstruction of a moderately big RAID 6 with mdadm
can be painful too. Now I see a reconstruct is going to be painful
either way...

For the record, the workload I am going to migrate is currently
18,314,445 MB and 34,752,784 inodes (which is not exactly the same as
files, but let's use that for a rough estimate), for an average file
size of about 539 KB per file.

Thanks a lot for your time and insights!

On 6/6/19 8:53, Hu Bert wrote:
> Good morning,
>
> my comment won't help you directly, but i thought i'd send it anyway...
>
> Our first glusterfs setup had 3 servers with 4 disks=bricks (10TB,
> JBOD) each. Was running fine in the beginning, but then 1 disk failed.
> The following heal took ~1 month, with bad performance (quite high
> IO). Shortly after the heal had finished another disk failed -> same
> problems again. Not funny.
>
> For our new system we decided to use 3 servers with 10 disks (10 TB)
> each, but now the 10 disks in a SW RAID 10 (well, we split the 10
> disks into 2 SW RAID 10, each of them is a brick, we have 2 gluster
> volumes). A lot of disk space "wasted", with this type of SW RAID and
> a replicate 3 setup, but we wanted to avoid the "healing takes a long
> time with bad performance" problems. Now mdadm takes care of
> replicating data; glusterfs should always see "good" bricks.
>
> And the decision may depend on what kind of data you have. Many small
> files, like tens of millions? Or not that much, but bigger files? I
> once watched a video (i think it was this one:
> https://www.youtube.com/watch?v=61HDVwttNYI). Recommendation there:
> RAID 6 or 10 for small files, for big files... well, already 2 years
> "old" ;-)
>
> As i said, this won't help you directly. You have to identify what's
> most important for your scenario; as you said, high performance is not
> an issue - if this is true even when you have slight performance
> issues after a disk fail, then ok. My experience so far: the bigger and
> slower the disks are and the more data you have -> healing will hurt
> -> try to avoid this. If the disks are small and fast (SSDs), healing
> will be faster -> JBOD is an option.
>
> hth,
> Hubert
>
> On Wed, Jun 5, 2019 at 11:33 AM Eduardo Mayoral <emayoral at arsys.es> wrote:
>>
>> Hi,
>>
>> I am looking into a new gluster deployment to replace an ancient one.
>>
>> For this deployment I will be using some repurposed servers I
>> already have in stock. The disk specs are 12 * 3 TB SATA disks. No HW
>> RAID controller. They also have some SSD which would be nice to leverage
>> as cache or similar to improve performance, since it is already there.
>> Advice on how to leverage the SSDs would be greatly appreciated.
>>
>> One of the design choices I have to make is using 3 nodes for a
>> replica-3 with JBOD, or using 2 nodes with a replica-2 and using SW RAID
>> 6 for the disks, maybe adding a 3rd node with a smaller amount of disk
>> as metadata node for the replica set. I would love to hear advice on the
>> pros and cons of each setup from the gluster experts.
>>
>> The data will be accessed from 4 to 6 systems with native gluster,
>> not sure if that makes any difference.
>>
>> The amount of data I have to store there is currently 20 TB, with
>> moderate growth. iops are quite low so high performance is not an issue.
>> The data will fit in any of the two setups.
>>
>> Thanks in advance for your advice!
>>
>> --
>> Eduardo Mayoral Jimeno
>> Systems engineer, platform department. Arsys Internet.
>> emayoral at arsys.es - +34 941 620 105 - ext 2153

--
Eduardo Mayoral Jimeno
Systems engineer, platform department. Arsys Internet.
emayoral at arsys.es - +34 941 620 105 - ext 2153

From vincent at epicenergy.ca Thu Jun 6 17:07:19 2019
From: vincent at epicenergy.ca (Vincent Royer)
Date: Thu, 6 Jun 2019 10:07:19 -0700
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
In-Reply-To: References: Message-ID:

What if you have two fast 2TB SSDs per server in hardware RAID 1, 3 hosts
in replica 3. Dual 10gb enterprise nics. This would end up being a single
2TB volume, correct? Seems like that would offer great speed and have
pretty decent survivability.

On Wed, Jun 5, 2019 at 11:54 PM Hu Bert wrote:
> [...]
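For reference, yes: a replica 3 volume with one brick per host presents the capacity of a single brick, so the arithmetic above holds. A minimal sketch of such a create, with made-up host and brick names rather than anything from this thread:

    gluster volume create gv_ssd replica 3 \
        host1:/bricks/ssd/brick host2:/bricks/ssd/brick host3:/bricks/ssd/brick
    gluster volume start gv_ssd
    # usable capacity = one brick (~2 TB here); every write lands on all three hosts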
From emayoral at arsys.es Thu Jun 6 17:20:51 2019
From: emayoral at arsys.es (Eduardo Mayoral)
Date: Thu, 6 Jun 2019 19:20:51 +0200
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
In-Reply-To: References: Message-ID: <0115a686-2cf5-96d4-7a53-e9725f49da49@arsys.es>

Yes to the 10 Gb NICs (they are already on the servers).

Nice idea with the SSDs, but I do not have a HW RAID card on these servers
or the possibility to get / install one. What I do have is an extra SSD
disk per server which I plan to use as LVM cache for the bricks (maybe
just 1 disk, maybe 2 with SW RAID 1). I still need to test how LVM /
gluster are going to handle the failure of the cache disk.

Thanks!

On 6/6/19 19:07, Vincent Royer wrote:
> What if you have two fast 2TB SSDs per server in hardware RAID 1, 3
> hosts in replica 3. Dual 10gb enterprise nics. This would end up
> being a single 2TB volume, correct? Seems like that would offer great
> speed and have pretty decent survivability.
>
> On Wed, Jun 5, 2019 at 11:54 PM Hu Bert wrote:
> > [...]
well, already 2 years > "old" ;-) > > As i said, this won't help you directly. You have to identify what's > most important for your scenario; as you said, high performance is not > an issue - if this is true even when you have slight performance > issues after a disk fail then ok. My experience so far: the bigger and > slower the disks are and the more data you have -> healing will hurt > -> try to avoid this. If the disks are small and fast (SSDs), healing > will be faster -> JBOD is an option. > > > hth, > Hubert > > Am Mi., 5. Juni 2019 um 11:33 Uhr schrieb Eduardo Mayoral > >: > > > > Hi, > > > >? ? ?I am looking into a new gluster deployment to replace an > ancient one. > > > >? ? ?For this deployment I will be using some repurposed servers I > > already have in stock. The disk specs are 12 * 3 TB SATA disks. > No HW > > RAID controller. They also have some SSD which would be nice to > leverage > > as cache or similar to improve performance, since it is already > there. > > Advice on how to leverage the SSDs would be greatly appreciated. > > > >? ? ?One of the design choices I have to make is using 3 nodes for a > > replica-3 with JBOD, or using 2 nodes with a replica-2 and using > SW RAID > > 6 for the disks, maybe adding a 3rd node with a smaller amount > of disk > > as metadata node for the replica set. I would love to hear > advice on the > > pros and cons of each setup from the gluster experts. > > > >? ? ?The data will be accessed from 4 to 6 systems with native > gluster, > > not sure if that makes any difference. > > > >? ? ?The amount of data I have to store there is currently 20 TB, > with > > moderate growth. iops are quite low so high performance is not > an issue. > > The data will fit in any of the two setups. > > > >? ? ?Thanks in advance for your advice! > > > > -- > > Eduardo Mayoral Jimeno > > Systems engineer, platform department. Arsys Internet. > > emayoral at arsys.es - +34 941 620 105 - > ext 2153 > > > > > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -- Eduardo Mayoral Jimeno Systems engineer, platform department. Arsys Internet. emayoral at arsys.es - +34 941 620 105 - ext 2153 -------------- next part -------------- An HTML attachment was scrubbed... URL: From mail at michael-metz.de Thu Jun 6 18:46:11 2019 From: mail at michael-metz.de (Michael Metz-Martini) Date: Thu, 6 Jun 2019 20:46:11 +0200 Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD In-Reply-To: References: Message-ID: Hi Am 06.06.19 um 18:48 schrieb Eduardo Mayoral: > Your comment actually helps me more than you think, one of the main > doubts I have is whether I go for JOBD with replica 3 or SW RAID 6 with > replica2 + arbitrer. Before reading your email I was leaning more > towards JOBD, as reconstruction of a moderately big RAID 6 with mdadm > can be painful too. Now I see a reconstruct is going to be painful > either way... > > For the record, the workload I am going to migrate is currently > 18,314,445 MB and 34,752,784 inodes (which is not exactly the same as > files, but let's use that for a rough estimate), for an average file > size of about 539 KB per file. > > Thanks a lot for your time and insights! 
Currently we're hosting ~200 TB split into about 3.500.000.000 files on a Distributed-Replicate-2 gluster volume with each brick running on a hw-raid6 of 8 x 8 TB disks. As we never had a failed drive 'till now I can't tell you anything about recovery times, but rebalance is damn slow with such a high number of small files (and so, presumably, is recovery on JBOD bricks). I think raid-recovery from local disks will be much faster.

As our files are nearly 100% readonly and split-brain issues could be resolved more or less "easily", we decided against replica 3 in favor of hardware raid6 redundancy.

--
Kind regards
Michael Metz-Martini

From Jim.Shelton at ibm.com  Thu Jun  6 19:17:03 2019
From: Jim.Shelton at ibm.com (Jim Shelton)
Date: Thu, 6 Jun 2019 14:17:03 -0500
Subject: [Gluster-users] geo-replication session information
Message-ID:

I need help cleaning up a faulty geo-replication session. I tried deleting all related directories and files, but I am currently in a state such that when I try to recreate the session via

gluster> volume geo-replication icp_kube-system_nfs-pvc_69753a58-819f-11e9-b3a0-005056b694b5 root at rmtwrk1::icp_kube-system_nfs-pvc_6ef8d56c-70f6-11e9-b497-005056b667db create ssh-port 2222 push-pem

I get

Session between icp_kube-system_nfs-pvc_69753a58-819f-11e9-b3a0-005056b694b5 and rmtwrk1:icp_kube-system_nfs-pvc_6ef8d56c-70f6-11e9-b497-005056b667db is already created! Cannot create with new slave:rmtwrk1 again!
geo-replication command failed

but if I try to delete it via

gluster> volume geo-replication icp_kube-system_nfs-pvc_69753a58-819f-11e9-b3a0-005056b694b5 root at rmtwrk1::icp_kube-system_nfs-pvc_6ef8d56c-70f6-11e9-b497-005056b667db delete

I get

Geo-replication session between icp_kube-system_nfs-pvc_69753a58-819f-11e9-b3a0-005056b694b5 and rmtwrk1::icp_kube-system_nfs-pvc_6ef8d56c-70f6-11e9-b497-005056b667db does not exist.
geo-replication command failed

Is there any way to clean this up?

Jim Shelton
IT Architect
IBM
Jim.Shelton at ibm.com
281 910 7914

From hunter86_bg at yahoo.com  Thu Jun  6 19:55:50 2019
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Thu, 6 Jun 2019 19:55:50 +0000 (UTC)
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
In-Reply-To: <0115a686-2cf5-96d4-7a53-e9725f49da49@arsys.es>
References: <0115a686-2cf5-96d4-7a53-e9725f49da49@arsys.es>
Message-ID: <1185322881.1006889.1559850950810@mail.yahoo.com>

>What I do have is an extra SSD disk per server which I plan to use as LVM cache for the bricks (Maybe just 1 disk, maybe 2 >with SW RAID 1). I still need to test how LVM / gluster are going to handle the failure of the cache disk.

Are you planning to use LVM cache for reads only (writethrough) or for both reads + writes (writeback)?

If you pick writeback - which means that you first write to the SSDs and only then push the data to the slow HDDs - then you need a raid1 for the SSD-based LVM cache, or a pure replica3, because if you lose 1 SSD the whole brick is down and you need to sync from scratch - and with the size of data you have, this will not be so nice.
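If it helps, a minimal sketch of attaching such a cache with lvmcache - the VG/LV names and sizes are examples only, adapt them to your layout:

    # the SSD (or the md raid1 built from the 2 SSDs) joins the brick's VG
    vgextend vg_bricks /dev/md_ssd
    lvcreate -n brick1_cache -L 400G vg_bricks /dev/md_ssd
    lvcreate -n brick1_cmeta -L 1G vg_bricks /dev/md_ssd
    lvconvert --type cache-pool --poolmetadata vg_bricks/brick1_cmeta vg_bricks/brick1_cache
    # writethrough only caches reads; writeback is where the raid1
    # underneath the cache becomes mandatory
    lvconvert --type cache --cachepool vg_bricks/brick1_cache --cachemode writethrough vg_bricks/brick1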
Note: Don't forget that thin LVM is the only way that gluster can use snapshots.

Best Regards,
Strahil Nikolov

From jim.kinney at gmail.com  Thu Jun  6 20:06:00 2019
From: jim.kinney at gmail.com (Jim Kinney)
Date: Thu, 06 Jun 2019 16:06:00 -0400
Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD
In-Reply-To: References: Message-ID: <1e41d8a788686fc73fcce38195d0baca36b34537.camel@gmail.com>

I have about 200TB in a gluster replicate-only 3-node setup. We stopped using hardware RAID6 after the third drive failed on one array at the same time we replaced the other two and before recovery could complete. 200TB is a mess to resync.

So now each hard drive is a single entity. We add 1 drive to each node as its own PV in gluster (with LUKS encryption). Each brick is mounted into the final tree on the client end. This way our recovery is usually just a single drive to sync.
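(Per drive that is roughly the following - the device names, key handling and sizing below are simplified examples, not our exact tooling:

    cryptsetup luksFormat /dev/sdX
    cryptsetup luksOpen /dev/sdX brick_sdX
    pvcreate /dev/mapper/brick_sdX
    vgcreate vg_sdX /dev/mapper/brick_sdX
    lvcreate -n brick -l 100%FREE vg_sdX
    mkfs.xfs -i size=512 /dev/vg_sdX/brick

and the new brick then joins the volume with a gluster volume add-brick ... replica 3 across all three nodes.)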
Do I miss to read some important piece of documentation? Please point me to some reference. Here's some command detail: #gluster volume info elastic-volume Volume Name: elastic-volume Type: Disperse Volume ID: 96773fef-c443-465b-a518-6630bcf83397 Status: Started Snapshot Count: 0 Number of Bricks: 1 x (4 + 2) = 6 Transport-type: tcp Bricks: Brick1: dev-netflow01.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick2: dev-netflow02.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick3: dev-netflow03.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick4: dev-netflow04.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick5: dev-netflow05.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick6: dev-netflow06.fineco.it:/data/gfs/lv_elastic/brick1/brick Options Reconfigured: performance.io-cache: off performance.io-thread-count: 64 performance.write-behind-window-size: 100MB performance.cache-size: 1GB nfs.disable: on transport.address-family: inet # gluster volume heal elastic-volume info Brick dev01:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev02:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log Status: Connected Number of entries: 12 Brick dev03:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev04:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev05:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick 
dev06:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log Status: Connected Number of entries: 12 # gluster volume heal elastic-volume info split-brain Volume elastic-volume is not of type replicate Any advice? Best regards Luca From abhishpaliwal at gmail.com Fri Jun 7 02:43:03 2019 From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL) Date: Fri, 7 Jun 2019 08:13:03 +0530 Subject: [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Nithya, We are having the setup where copying the file to and deleting it from gluster mount point to update the latest file. We noticed due to this having some memory increase in glusterfsd process. To find the memory leak we are using valgrind but didn't get any help. That's why contacted to glusterfs community. Regards, Abhishek On Thu, Jun 6, 2019, 16:08 Nithya Balachandran wrote: > Hi Abhishek, > > I am still not clear as to the purpose of the tests. Can you clarify why > you are using valgrind and why you think there is a memory leak? > > Regards, > Nithya > > On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL > wrote: > >> Hi Nithya, >> >> Here is the Setup details and test which we are doing as below: >> >> >> One client, two gluster Server. >> The client is writing and deleting one file each 15 minutes by script >> test_v4.15.sh. >> >> IP >> Server side: >> 128.224.98.157 /gluster/gv0/ >> 128.224.98.159 /gluster/gv0/ >> >> Client side: >> 128.224.98.160 /gluster_mount/ >> >> Server side: >> gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ >> 128.224.98.159:/gluster/gv0/ force >> gluster volume start gv0 >> >> root at 128:/tmp/brick/gv0# gluster volume info >> >> Volume Name: gv0 >> Type: Replicate >> Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 >> Status: Started >> Snapshot Count: 0 >> Number of Bricks: 1 x 2 = 2 >> Transport-type: tcp >> Bricks: >> Brick1: 128.224.98.157:/gluster/gv0 >> Brick2: 128.224.98.159:/gluster/gv0 >> Options Reconfigured: >> transport.address-family: inet >> nfs.disable: on >> performance.client-io-threads: off >> >> exec script: ./ps_mem.py -p 605 -w 61 > log >> root at 128:/# ./ps_mem.py -p 605 >> Private + Shared = RAM used Program >> 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd >> --------------------------------- >> 24856.0 KiB >> ================================= >> >> >> Client side: >> mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 >> /gluster_mount >> >> >> We are using the below script write and delete the file. >> >> *test_v4.15.sh * >> >> Also the below script to see the memory increase whihle the script is >> above script is running in background. >> >> *ps_mem.py* >> >> I am attaching the script files as well as the result got after testing >> the scenario. >> >> On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran >> wrote: >> >>> Hi, >>> >>> Writing to a volume should not affect glusterd. The stack you have shown >>> in the valgrind looks like the memory used to initialise the structures >>> glusterd uses and will free only when it is stopped. >>> >>> Can you provide more details to what it is you are trying to test? 
>>> >>> Regards, >>> Nithya >>> >>> >>> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >>> wrote: >>> >>>> Hi Team, >>>> >>>> Please respond on the issue which I raised. >>>> >>>> Regards, >>>> Abhishek >>>> >>>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>>> abhishpaliwal at gmail.com> wrote: >>>> >>>>> Anyone please reply.... >>>>> >>>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>>> wrote: >>>>> >>>>>> Hi Team, >>>>>> >>>>>> I upload some valgrind logs from my gluster 5.4 setup. This is >>>>>> writing to the volume every 15 minutes. I stopped glusterd and then copy >>>>>> away the logs. The test was running for some simulated days. They are >>>>>> zipped in valgrind-54.zip. >>>>>> >>>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>>> glusterfs and even some definitely lost bytes. >>>>>> >>>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>>>>> 391 of 391 >>>>>> ==2737== at 0x4C29C25: calloc (in >>>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>>> ==2737== by 0xA22485E: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA217C94: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21D9F8: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21DED9: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21E685: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA1B9D8C: init (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in >>>>>> /usr/sbin/glusterfsd) >>>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>>>> ==2737== >>>>>> ==2737== LEAK SUMMARY: >>>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>>> >>>>>> -- >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> Regards >>>>>> Abhishek Paliwal >>>>>> >>>>> >>>> >>>> -- >>>> >>>> >>>> >>>> >>>> Regards >>>> Abhishek Paliwal >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Fri Jun 7 03:09:03 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Fri, 7 Jun 2019 08:39:03 +0530 Subject: [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Abhishek, Please use statedumps taken at intervals to determine where the memory is increasing. See [1] for details. Regards, Nithya [1] https://docs.gluster.org/en/latest/Troubleshooting/statedump/ On Fri, 7 Jun 2019 at 08:13, ABHISHEK PALIWAL wrote: > Hi Nithya, > > We are having the setup where copying the file to and deleting it from > gluster mount point to update the latest file. We noticed due to this > having some memory increase in glusterfsd process. 
> > To find the memory leak we are using valgrind but didn't get any help. > > That's why contacted to glusterfs community. > > Regards, > Abhishek > > On Thu, Jun 6, 2019, 16:08 Nithya Balachandran > wrote: > >> Hi Abhishek, >> >> I am still not clear as to the purpose of the tests. Can you clarify why >> you are using valgrind and why you think there is a memory leak? >> >> Regards, >> Nithya >> >> On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL >> wrote: >> >>> Hi Nithya, >>> >>> Here is the Setup details and test which we are doing as below: >>> >>> >>> One client, two gluster Server. >>> The client is writing and deleting one file each 15 minutes by script >>> test_v4.15.sh. >>> >>> IP >>> Server side: >>> 128.224.98.157 /gluster/gv0/ >>> 128.224.98.159 /gluster/gv0/ >>> >>> Client side: >>> 128.224.98.160 /gluster_mount/ >>> >>> Server side: >>> gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ >>> 128.224.98.159:/gluster/gv0/ force >>> gluster volume start gv0 >>> >>> root at 128:/tmp/brick/gv0# gluster volume info >>> >>> Volume Name: gv0 >>> Type: Replicate >>> Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 >>> Status: Started >>> Snapshot Count: 0 >>> Number of Bricks: 1 x 2 = 2 >>> Transport-type: tcp >>> Bricks: >>> Brick1: 128.224.98.157:/gluster/gv0 >>> Brick2: 128.224.98.159:/gluster/gv0 >>> Options Reconfigured: >>> transport.address-family: inet >>> nfs.disable: on >>> performance.client-io-threads: off >>> >>> exec script: ./ps_mem.py -p 605 -w 61 > log >>> root at 128:/# ./ps_mem.py -p 605 >>> Private + Shared = RAM used Program >>> 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd >>> --------------------------------- >>> 24856.0 KiB >>> ================================= >>> >>> >>> Client side: >>> mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 >>> /gluster_mount >>> >>> >>> We are using the below script write and delete the file. >>> >>> *test_v4.15.sh * >>> >>> Also the below script to see the memory increase whihle the script is >>> above script is running in background. >>> >>> *ps_mem.py* >>> >>> I am attaching the script files as well as the result got after testing >>> the scenario. >>> >>> On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran >>> wrote: >>> >>>> Hi, >>>> >>>> Writing to a volume should not affect glusterd. The stack you have >>>> shown in the valgrind looks like the memory used to initialise the >>>> structures glusterd uses and will free only when it is stopped. >>>> >>>> Can you provide more details to what it is you are trying to test? >>>> >>>> Regards, >>>> Nithya >>>> >>>> >>>> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >>>> wrote: >>>> >>>>> Hi Team, >>>>> >>>>> Please respond on the issue which I raised. >>>>> >>>>> Regards, >>>>> Abhishek >>>>> >>>>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>>>> abhishpaliwal at gmail.com> wrote: >>>>> >>>>>> Anyone please reply.... >>>>>> >>>>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>>>> wrote: >>>>>> >>>>>>> Hi Team, >>>>>>> >>>>>>> I upload some valgrind logs from my gluster 5.4 setup. This is >>>>>>> writing to the volume every 15 minutes. I stopped glusterd and then copy >>>>>>> away the logs. The test was running for some simulated days. They are >>>>>>> zipped in valgrind-54.zip. >>>>>>> >>>>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>>>> glusterfs and even some definitely lost bytes. 
>>>>>>> >>>>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss >>>>>>> record 391 of 391 >>>>>>> ==2737== at 0x4C29C25: calloc (in >>>>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>>>> ==2737== by 0xA22485E: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA217C94: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21D9F8: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21DED9: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21E685: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA1B9D8C: init (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in >>>>>>> /usr/sbin/glusterfsd) >>>>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in >>>>>>> /usr/sbin/glusterfsd) >>>>>>> ==2737== >>>>>>> ==2737== LEAK SUMMARY: >>>>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>>>> >>>>>>> -- >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> Regards >>>>>>> Abhishek Paliwal >>>>>>> >>>>>> >>>>> >>>>> -- >>>>> >>>>> >>>>> >>>>> >>>>> Regards >>>>> Abhishek Paliwal >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>> >>> -- >>> >>> >>> >>> >>> Regards >>> Abhishek Paliwal >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From revirii at googlemail.com Fri Jun 7 05:38:04 2019 From: revirii at googlemail.com (Hu Bert) Date: Fri, 7 Jun 2019 07:38:04 +0200 Subject: [Gluster-users] Advice for setup: SW RAID 6 vs JBOD In-Reply-To: References: Message-ID: If i remember correctly: in the video they suggested not to make a RAID 10 too big (i.e. too many (big) disks), because the RAID resync then could take a long time. They didn't mention a limit; on my 3 servers with 2 RAID 10 (1x4 disks, 1x6 disks), no disk failed so far, but there were automatic periodic redundancy checks (mdadm checkarray) which ran for a couple of days, increasing load on the servers and responsiveness of glusterfs on the clients. Almost no one even noticed that mdadm checks were running :-) But if i compare it with our old JBOD setup: after the disk change the heal took about a month, resulting in really poor performance on the client side. As we didn't want to experience that period again -> throw hardware at the problem. Maybe a different setup (10 disks -> 5 RAID 1, building a distribute replicate) would've been even better, but so far we're happy with the current setup. Am Do., 6. Juni 2019 um 18:48 Uhr schrieb Eduardo Mayoral : > > Your comment actually helps me more than you think, one of the main > doubts I have is whether I go for JOBD with replica 3 or SW RAID 6 with > replica2 + arbitrer. 
Before reading your email I was leaning more > towards JOBD, as reconstruction of a moderately big RAID 6 with mdadm > can be painful too. Now I see a reconstruct is going to be painful > either way... > > For the record, the workload I am going to migrate is currently > 18,314,445 MB and 34,752,784 inodes (which is not exactly the same as > files, but let's use that for a rough estimate), for an average file > size of about 539 KB per file. > > Thanks a lot for your time and insights! > > On 6/6/19 8:53, Hu Bert wrote: > > Good morning, > > > > my comment won't help you directly, but i thought i'd send it anyway... > > > > Our first glusterfs setup had 3 servers withs 4 disks=bricks (10TB, > > JBOD) each. Was running fine in the beginning, but then 1 disk failed. > > The following heal took ~1 month, with a bad performance (quite high > > IO). Shortly after the heal hat finished another disk failed -> same > > problems again. Not funny. > > > > For our new system we decided to use 3 servers with 10 disks (10 TB) > > each, but now the 10 disks in a SW RAID 10 (well, we split the 10 > > disks into 2 SW RAID 10, each of them is a brick, we have 2 gluster > > volumes). A lot of disk space "wasted", with this type of SW RAID and > > a replicate 3 setup, but we wanted to avoid the "healing takes a long > > time with bad performance" problems. Now mdadm takes care of > > replicating data, glusterfs should always see "good" bricks. > > > > And the decision may depend on what kind of data you have. Many small > > files, like tens of millions? Or not that much, but bigger files? I > > once watched a video (i think it was this one: > > https://www.youtube.com/watch?v=61HDVwttNYI). Recommendation there: > > RAID 6 or 10 for small files, for big files... well, already 2 years > > "old" ;-) > > > > As i said, this won't help you directly. You have to identify what's > > most important for your scenario; as you said, high performance is not > > an issue - if this is true even when you have slight performance > > issues after a disk fail then ok. My experience so far: the bigger and > > slower the disks are and the more data you have -> healing will hurt > > -> try to avoid this. If the disks are small and fast (SSDs), healing > > will be faster -> JBOD is an option. > > > > > > hth, > > Hubert > > > > Am Mi., 5. Juni 2019 um 11:33 Uhr schrieb Eduardo Mayoral : > >> Hi, > >> > >> I am looking into a new gluster deployment to replace an ancient one. > >> > >> For this deployment I will be using some repurposed servers I > >> already have in stock. The disk specs are 12 * 3 TB SATA disks. No HW > >> RAID controller. They also have some SSD which would be nice to leverage > >> as cache or similar to improve performance, since it is already there. > >> Advice on how to leverage the SSDs would be greatly appreciated. > >> > >> One of the design choices I have to make is using 3 nodes for a > >> replica-3 with JBOD, or using 2 nodes with a replica-2 and using SW RAID > >> 6 for the disks, maybe adding a 3rd node with a smaller amount of disk > >> as metadata node for the replica set. I would love to hear advice on the > >> pros and cons of each setup from the gluster experts. > >> > >> The data will be accessed from 4 to 6 systems with native gluster, > >> not sure if that makes any difference. > >> > >> The amount of data I have to store there is currently 20 TB, with > >> moderate growth. iops are quite low so high performance is not an issue. > >> The data will fit in any of the two setups. 
> >> > >> Thanks in advance for your advice! > >> > >> -- > >> Eduardo Mayoral Jimeno > >> Systems engineer, platform department. Arsys Internet. > >> emayoral at arsys.es - +34 941 620 105 - ext 2153 > >> > >> > >> _______________________________________________ > >> Gluster-users mailing list > >> Gluster-users at gluster.org > >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- > Eduardo Mayoral Jimeno > Systems engineer, platform department. Arsys Internet. > emayoral at arsys.es - +34 941 620 105 - ext 2153 > From spisla80 at gmail.com Fri Jun 7 06:24:12 2019 From: spisla80 at gmail.com (David Spisla) Date: Fri, 7 Jun 2019 08:24:12 +0200 Subject: [Gluster-users] [Gluster-devel] Improve stability between SMB/CTDB and Gluster (together with Samba Core Developer) In-Reply-To: References: Message-ID: Hello Guenther, thank you for the update. On next monday there will be a public holiday in germany. So tuesday to friday would be fine . The time suggestions, as mentioned in the last mails, should be suitable. Regards David Am Do., 6. Juni 2019 um 16:58 Uhr schrieb G?nther Deschner < gdeschne at redhat.com>: > Hello, > > just a quick heads-up, during this week pretty much all Samba engineers > are busy attending the SambaXP conference in Germany, in addition there > was a public holiday in India also this week. Not sure about the general > availability tomorrow, I would propose to look for a date maybe next week. > > Thanks, > Guenther > > On 31/05/2019 14:32, David Spisla wrote: > > Hello together, > > > > inorder not to lose the focus for the topic, I make new date suggestions > > for next week > > > > June 03th ? 07th at 12:30 - 14:30 IST or (9:00 - 11:00 CEST) > > > > June 03th ? 06th at 16:30 - 18:30 IST or (13:00 - 15:00 CEST) > > > > > > Regards > > > > David Spisla > > > > > > Am Di., 21. Mai 2019 um 11:24 Uhr schrieb David Spisla > > >: > > > > Hello together, > > > > we are still seeking a day and time to talk about interesting Samba > > / Glusterfs issues. Here is a new list of possible dates and time. > > > > May 22th ? 24th at 12:30 - 14:30 IST or (9:00 - 11:00 CEST) > > > > May 27th ? 29th and 31th at 12:30 - 14:30 IST (9:00 - 11:00 CEST) > > > > > > On May 30th there is a holiday here in germany. > > > > @Poornima Gurusiddaiah If there is any > > problem finding a date please contanct me. I will look for > alternatives > > > > > > Regards > > > > David Spisla > > > > > > > > Am Do., 16. Mai 2019 um 12:42 Uhr schrieb David Spisla > > >: > > > > Hello Amar, > > > > thank you for the information. Of course, we should wait for > > Poornima because of her knowledge. > > > > Regards > > David Spisla > > > > Am Do., 16. Mai 2019 um 12:23 Uhr schrieb Amar Tumballi > > Suryanarayan >: > > > > David, Poornima is on leave from today till 21st May. So > > having it after she comes back is better. She has more > > experience in SMB integration than many of us. > > > > -Amar > > > > On Thu, May 16, 2019 at 1:09 PM David Spisla > > > wrote: > > > > Hello everyone, > > > > if there is any problem in finding a date and time, > > please contact me. It would be fine to have a meeting > soon. > > > > Regards > > David Spisla > > > > Am Mo., 13. Mai 2019 um 12:38 Uhr schrieb David Spisla > > > >: > > > > Hi Poornima,____ > > > > __ __ > > > > thats fine. I would suggest this dates and times:____ > > > > __ __ > > > > May 15th ? 17th at 12:30, 13:30, 14:30 IST (9:00, > > 10:00, 11:00 CEST) ____ > > > > May 20th ? 
24th at 12:30, 13:30, 14:30 IST (9:00, > > 10:00, 11:00 CEST)____ > > > > __ __ > > > > I add Volker Lendecke from Sernet to the mail. He is > > the Samba Expert.____ > > > > Can someone of you provide a host via bluejeans.com > > ? If not, I will try it with > > GoToMeeting (https://www.gotomeeting.com).____ > > > > __ __ > > > > @all Please write your prefered dates and times. For > > me, all oft the above dates and times are fine____ > > > > __ __ > > > > Regards____ > > > > David____ > > > > __ __ > > > > > > > > > > *Von:* Poornima Gurusiddaiah > > > > *Gesendet:* Montag, 13. Mai 2019 07:22 > > *An:* David Spisla > >; Anoop C S > > >; > > Gunther Deschner > > > > *Cc:* Gluster Devel > >; > > gluster-users at gluster.org > > List > > > > > > *Betreff:* Re: [Gluster-devel] Improve stability > > between SMB/CTDB and Gluster (together with Samba > > Core Developer)____ > > > > __ __ > > > > Hi,____ > > > > __ __ > > > > We would be definitely interested in this. Thank you > > for contacting us. For the starter we can have an > > online conference. Please suggest few possible date > > and times for the week(preferably between IST 7.00AM > > - 9.PM )?____ > > > > Adding Anoop and Gunther who are also the main > > contributors to the Gluster-Samba integration.____ > > > > __ __ > > > > Thanks,____ > > > > Poornima____ > > > > __ __ > > > > __ __ > > > > __ __ > > > > On Thu, May 9, 2019 at 7:43 PM David Spisla > > > > > wrote:____ > > > > Dear Gluster Community,____ > > > > at the moment we are improving the stability of > > SMB/CTDB and Gluster. For this purpose we are > > working together with an advanced SAMBA Core > > Developer. He did some debugging but needs more > > information about Gluster Core Behaviour. ____ > > > > __ __ > > > > *Would any of the Gluster Developer wants to > > have a online conference with him and me?* ____ > > > > __ __ > > > > I would organize everything. In my opinion this > > is a good chance to improve stability of > > Glusterfs and this is at the moment one of the > > major issues in the Community.____ > > > > __ __ > > > > Regards ____ > > > > David Spisla____ > > > > _______________________________________________ > > > > Community Meeting Calendar: > > > > APAC Schedule - > > Every 2nd and 4th Tuesday at 11:30 AM IST > > Bridge: https://bluejeans.com/836554017 > > > > NA/EMEA Schedule - > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > Bridge: https://bluejeans.com/486278655 > > > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > > > > https://lists.gluster.org/mailman/listinfo/gluster-devel____ > > > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org Gluster-users at gluster.org> > > https://lists.gluster.org/mailman/listinfo/gluster-users > > > > > > > > -- > > Amar Tumballi (amarts) > > > > > -- > G?nther Deschner GPG-ID: 8EE11688 > Red Hat gdeschner at redhat.com > Samba Team gd at samba.org > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL:

From aspandey at redhat.com  Fri Jun  7 14:18:50 2019
From: aspandey at redhat.com (Ashish Pandey)
Date: Fri, 7 Jun 2019 10:18:50 -0400 (EDT)
Subject: [Gluster-users] healing of disperse volume
In-Reply-To: <297722f5-9257-98c3-b4c0-3caad0cff5e1@gmail.com>
References: <297722f5-9257-98c3-b4c0-3caad0cff5e1@gmail.com>
Message-ID: <1883917003.21586447.1559917130491.JavaMail.zimbra@redhat.com>

Hi,

First of all, the following command is not for disperse volumes -

gluster volume heal elastic-volume info split-brain

It is applicable to replicate volumes only.

Could you please let us know what exactly you want to test? If you want to test a disperse volume against failure of bricks or servers, you can kill some of the brick processes - at most as many brick processes as the redundancy count. In 4+2, 2 is the redundancy count. After killing two brick processes with the kill command, you can write some data on the volume and then do a force start of the volume:

gluster volume start elastic-volume force

This will also start the killed brick processes. At the end you should see that the heal is done by the self-heal daemon and the volume becomes healthy again.

---
Ashish

----- Original Message -----
From: "fusillator"
To: gluster-users at gluster.org
Sent: Friday, June 7, 2019 2:09:01 AM
Subject: [Gluster-users] healing of disperse volume

Hi all, I'm pretty new to glusterfs. I managed to set up a dispersed volume (4+2) using release 6.1 from the CentOS repository. Is it a stable release?

Then I forced the volume to stop while the applications were writing on the mount point, deliberately getting an inconsistent state. I'm wondering what the best practices are to solve these kinds of situations... I just found a detailed explanation about how to solve the split-brain state of replicated volumes at
https://docs.gluster.org/en/latest/Troubleshooting/resolving-splitbrain/
but it seems not to be applicable to the disperse volume type. Did I miss some important piece of documentation? Please point me to some reference.
Here's some command detail: #gluster volume info elastic-volume Volume Name: elastic-volume Type: Disperse Volume ID: 96773fef-c443-465b-a518-6630bcf83397 Status: Started Snapshot Count: 0 Number of Bricks: 1 x (4 + 2) = 6 Transport-type: tcp Bricks: Brick1: dev-netflow01.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick2: dev-netflow02.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick3: dev-netflow03.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick4: dev-netflow04.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick5: dev-netflow05.fineco.it:/data/gfs/lv_elastic/brick1/brick Brick6: dev-netflow06.fineco.it:/data/gfs/lv_elastic/brick1/brick Options Reconfigured: performance.io-cache: off performance.io-thread-count: 64 performance.write-behind-window-size: 100MB performance.cache-size: 1GB nfs.disable: on transport.address-family: inet # gluster volume heal elastic-volume info Brick dev01:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev02:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log Status: Connected Number of entries: 12 Brick dev03:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev04:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev05:/data/gfs/lv_elastic/brick1/brick /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log Status: Connected Number of entries: 12 Brick dev06:/data/gfs/lv_elastic/brick1/brick 
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log /data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log /data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log Status: Connected Number of entries: 12 # gluster volume heal elastic-volume info split-brain Volume elastic-volume is not of type replicate Any advice? Best regards Luca _______________________________________________ Gluster-users mailing list Gluster-users at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From edward.clay at uk2group.com Fri Jun 7 18:50:03 2019 From: edward.clay at uk2group.com (Edward Clay) Date: Fri, 7 Jun 2019 18:50:03 +0000 Subject: [Gluster-users] Gluster quorum lost Message-ID: Hello, I have a replica 3 volume that has lost quorum twice this week causing us much pain. What seems to happen is one of the sans thinks one of the other two peers has disconnected. Then a few seconds later another disconnects causing quorum to be lost. This causes us pain since we have 7 ovirt host that are connected to this gluster volume and they never seem to reattach. I was able to unmount the brick manually on the ovirt host and then run the commands to mount them again and that seemed to get things working again. We have 3 sans running glusterfs 3.12.14-1 and nothing else. # gluster volume info gv1 Volume Name: gv1 Type: Replicate Volume ID: ea12f72d-a228-43ba-a360-4477cada292a Status: Started Snapshot Count: 0 Number of Bricks: 1 x 3 = 3 Transport-type: tcp Bricks: Brick1: 10.4.16.19:/glusterfs/data1/gv1 Brick2: 10.4.16.11:/glusterfs/data1/gv1 Brick3: 10.4.16.12:/glusterfs/data1/gv1 Options Reconfigured: nfs.register-with-portmap: on diagnostics.count-fop-hits: on diagnostics.latency-measurement: on cluster.self-heal-daemon: enable cluster.server-quorum-type: server cluster.quorum-type: auto network.remote-dio: enable cluster.eager-lock: enable performance.stat-prefetch: off performance.io-cache: off performance.read-ahead: off performance.quick-read: off auth.allow: 10.4.16.* nfs.rpc-auth-allow: 10.4.16.* nfs.disable: off server.allow-insecure: on storage.owner-gid: 36 storage.owner-uid: 36 nfs.addr-namelookup: off nfs.export-volumes: on network.ping-timeout: 50 cluster.server-quorum-ratio: 51% They produced the following logs this morning. and the first entry is the first entry for 2019-06-07. san3 seems to have an issue first: [2019-06-07 14:23:20.670561] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.12> (), in state , has disconnected from glusterd. [2019-06-07 14:23:20.774127] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.11> (<0f3090ee-080b-4a6b-9964-0ca86d801469>), in state , has disconnected from glusterd. [2019-06-07 14:23:20.774413] C [MSGID: 106002] [glusterd-server-quorum.c:360:glusterd_do_volume_quorum_action] 0-management: Server quorum lost for volume gv1. Stopping local bricks. san1 follows: [2019-06-07 14:23:22.137405] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.12> (), in state , has disconnected from glusterd. 
[2019-06-07 14:23:22.229343] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.19> (<238af98a-d2f1-491d-a1f1-64ace4eb6d3d>), in state , has disconnected from glusterd.
[2019-06-07 14:23:22.229618] C [MSGID: 106002] [glusterd-server-quorum.c:360:glusterd_do_volume_quorum_action] 0-management: Server quorum lost for volume gv1. Stopping local bricks.

san2 seems to be the last one standing but quorum gets lost:

[2019-06-07 14:23:26.611435] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.11> (<0f3090ee-080b-4a6b-9964-0ca86d801469>), in state , has disconnected from glusterd.
[2019-06-07 14:23:26.714137] I [MSGID: 106004] [glusterd-handler.c:6317:__glusterd_peer_rpc_notify] 0-management: Peer <10.4.16.19> (<238af98a-d2f1-491d-a1f1-64ace4eb6d3d>), in state , has disconnected from glusterd.
[2019-06-07 14:23:26.714405] C [MSGID: 106002] [glusterd-server-quorum.c:360:glusterd_do_volume_quorum_action] 0-management: Server quorum lost for volume gv1. Stopping local bricks.

On the ovirt hosts I see the following types of entries for the mounted gluster brick in /var/log/glusterfs/rhev-data-center-mnt-glusterSD-10.4.16.11:gv1.log. They are all pretty much the same entries on all 7 hosts. hv6 seems to be the first host to complain:

[2019-06-07 14:23:22.190493] I [glusterfsd-mgmt.c:2424:mgmt_rpc_notify] 0-glusterfsd-mgmt: disconnected from remote-host: 10.4.16.11
[2019-06-07 14:23:22.190540] I [glusterfsd-mgmt.c:2464:mgmt_rpc_notify] 0-glusterfsd-mgmt: connecting to next volfile server 10.4.16.19
[2019-06-07 14:23:32.618071] I [glusterfsd-mgmt.c:2005:mgmt_getspec_cbk] 0-glusterfs: No change in volfile,continuing
[2019-06-07 14:23:33.651755] W [socket.c:719:__socket_rwv] 0-gv1-client-4: readv on 10.4.16.12:49152 failed (No data available)
[2019-06-07 14:23:33.651806] I [MSGID: 114018] [client.c:2288:client_rpc_notify] 0-gv1-client-4: disconnected from gv1-client-4. Client process will keep trying to connect to glusterd until brick's port is available

One thing I should point out here that is probably important. We are running glusterfs 3.12.14-1 on the sans but the ovirt hosts have been upgraded to 5.6-1. We stopped updating the sans' gluster version after a previous version had a memory leak causing the sans to go down randomly. Version 3.12.14-1 seems to have stopped this from happening. What I haven't been able to find out is whether there is an incompatibility between these versions that could cause this. Are there any other steps I can take or logs I can collect to better identify what's causing this to happen?
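In case it helps, here is a minimal sketch of what I plan to capture on each san the next time quorum drops (the odir/file values are just examples, and the op-version queries assume a CLI recent enough to support them):

    # op-version view of the mixed 3.12 / 5.x cluster
    gluster volume get all cluster.op-version
    gluster volume get all cluster.max-op-version
    # peer and brick view from each san at the time of the disconnects
    gluster peer status
    gluster volume status gv1
    # dump glusterd state for the record
    gluster get-state glusterd odir /var/tmp file gluster-state-san1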
Edward Clay
Systems Administrator
The Hut Group
Email: edward.clay at uk2group.com

From alan.orth at gmail.com  Fri Jun  7 19:58:17 2019
From: alan.orth at gmail.com (Alan Orth)
Date: Fri, 7 Jun 2019 22:58:17 +0300
Subject: [Gluster-users] Does replace-brick migrate data?
In-Reply-To: References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com>
Message-ID:

Dear Ravi,

In the last week I have completed a fix-layout and a full INDEX heal on this volume. Now I've started a rebalance and I see a few terabytes of data going around on different bricks since yesterday, which I'm sure is good.

While I wait for the rebalance to finish, I'm wondering if you know what would cause directories to be missing from the FUSE mount point? If I list the directories explicitly I can see their contents, but they do not appear in their parent directories' listing. In the case of duplicated files it is always because the files are not on the correct bricks (according to the Dynamo/Elastic Hash algorithm), and I can fix it by copying the file to the correct brick(s) and removing it from the others (along with their .glusterfs hard links). So what could cause directories to be missing?

Thank you,

On Wed, Jun 5, 2019 at 1:08 AM Alan Orth wrote:

> Hi Ravi,
>
> You're right that I had mentioned using rsync to copy the brick content to
> a new host, but in the end I actually decided not to bring it up on a new
> brick. Instead I added the original brick back into the volume. So the
> xattrs and symlinks to .glusterfs on the original brick are fine. I think
> the problem probably lies with a remove-brick that got interrupted. A few
> weeks ago during the maintenance I had tried to remove a brick and then
> after twenty minutes and no obvious progress I stopped it; after that the
> bricks were still part of the volume.
>
> In the last few days I have run a fix-layout that took 26 hours and
> finished successfully. Then I started a full index heal and it has healed
> about 3.3 million files in a few days and I see a clear increase of network
> traffic from old brick host to new brick host over that time. Once the full
> index heal completes I will try to do a rebalance.
>
> Thank you,
>
> On Mon, Jun 3, 2019 at 7:40 PM Ravishankar N
> wrote:
>
>>
>> On 01/06/19 9:37 PM, Alan Orth wrote:
>>
>> Dear Ravi,
>>
>> The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I
>> could verify them for six bricks and millions of files, though...
:\ >> >> Hi Alan, >> >> The reason I asked this is because you had mentioned in one of your >> earlier emails that when you moved content from the old brick to the new >> one, you had skipped the .glusterfs directory. So I was assuming that when >> you added back this new brick to the cluster, it might have been missing >> the .glusterfs entries. If that is the cae, one way to verify could be to >> check using a script if all files on the brick have a link-count of at >> least 2 and all dirs have valid symlinks inside .glusterfs pointing to >> themselves. >> >> >> I had a small success in fixing some issues with duplicated files on the >> FUSE mount point yesterday. I read quite a bit about the elastic hashing >> algorithm that determines which files get placed on which bricks based on >> the hash of their filename and the trusted.glusterfs.dht xattr on brick >> directories (thanks to Joe Julian's blog post and Python script for showing >> how it works?). With that knowledge I looked closer at one of the files >> that was appearing as duplicated on the FUSE mount and found that it was >> also duplicated on more than `replica 2` bricks. For this particular file I >> found two "real" files and several zero-size files with >> trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were on >> the correct brick as far as the DHT layout is concerned, so I copied one of >> them to the correct brick, deleted the others and their hard links, and did >> a `stat` on the file from the FUSE mount point and it fixed itself. Yay! >> >> Could this have been caused by a replace-brick that got interrupted and >> didn't finish re-labeling the xattrs? >> >> No, replace-brick only initiates AFR self-heal, which just copies the >> contents from the other brick(s) of the *same* replica pair into the >> replaced brick. The link-to files are created by DHT when you rename a >> file from the client. If the new name hashes to a different brick, DHT >> does not move the entire file there. It instead creates the link-to file >> (the one with the dht.linkto xattrs) on the hashed subvol. The value of >> this xattr points to the brick where the actual data is there (`getfattr -e >> text` to see it for yourself). Perhaps you had attempted a rebalance or >> remove-brick earlier and interrupted that? >> >> Should I be thinking of some heuristics to identify and fix these issues >> with a script (incorrect brick placement), or is this something a fix >> layout or repeated volume heals can fix? I've already completed a whole >> heal on this particular volume this week and it did heal about 1,000,000 >> files (mostly data and metadata, but about 20,000 entry heals as well). >> >> Maybe you should let the AFR self-heals complete first and then attempt a >> full rebalance to take care of the dht link-to files. But if the files are >> in millions, it could take quite some time to complete. >> Regards, >> Ravi >> >> Thanks for your support, >> >> ? https://joejulian.name/post/dht-misses-are-expensive/ >> >> On Fri, May 31, 2019 at 7:57 AM Ravishankar N >> wrote: >> >>> >>> On 31/05/19 3:20 AM, Alan Orth wrote: >>> >>> Dear Ravi, >>> >>> I spent a bit of time inspecting the xattrs on some files and >>> directories on a few bricks for this volume and it looks a bit messy. Even >>> if I could make sense of it for a few and potentially heal them manually, >>> there are millions of files and directories in total so that's definitely >>> not a scalable solution. After a few missteps with `replace-brick ... 
>>> commit force` in the last week (one of which on a brick that was
>>> dead/offline) as well as some premature `remove-brick` commands, I'm unsure
>>> how to proceed and I'm getting demotivated. It's scary how quickly
>>> things get out of hand in distributed systems...
>>>
>>> Hi Alan,
>>> The one good thing about gluster is that the data is always available
>>> directly on the backend bricks even if your volume has inconsistencies at
>>> the gluster level. So theoretically, if your cluster is FUBAR, you could
>>> just create a new volume and copy all data onto it via its mount from the
>>> old volume's bricks.
>>>
>>> I had hoped that bringing the old brick back up would help, but by the
>>> time I added it again a few days had passed and all the brick-ids had
>>> changed due to the replace/remove brick commands, not to mention that the
>>> trusted.afr.$volume-client-xx values were now probably pointing to the
>>> wrong bricks (?).
>>>
>>> Anyways, a few hours ago I started a full heal on the volume and I see
>>> that there is a sustained 100MiB/sec of network traffic going from the old
>>> brick's host to the new one. The completed heals reported in the logs look
>>> promising too:
>>>
>>> Old brick host:
>>>
>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E
>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c
>>> 281614 Completed data selfheal
>>> 84 Completed entry selfheal
>>> 299648 Completed metadata selfheal
>>>
>>> New brick host:
>>>
>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E
>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c
>>> 198256 Completed data selfheal
>>> 16829 Completed entry selfheal
>>> 229664 Completed metadata selfheal
>>>
>>> So that's good I guess, though I have no idea how long it will take or if
>>> it will fix the "missing files" issue on the FUSE mount. I've increased
>>> cluster.shd-max-threads to 8 to hopefully speed up the heal process.
>>>
>>> The afr xattrs should not cause files to disappear from mount. If the
>>> xattr names do not match what each AFR subvol expects (for eg. in a replica
>>> 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd
>>> subvol and so on - ) for its children then it won't heal the data, that is
>>> all. But in your case I see some inconsistencies like one brick having the
>>> actual file (licenseserver.cfg) and the other having a linkto file (the
>>> one with the dht.linkto xattr) *in the same replica pair*.
>>>
>>> I'd be happy for any advice or pointers,
>>>
>>> Did you check if the .glusterfs hardlinks/symlinks exist and are in order
>>> for all bricks?
>>>
>>> -Ravi
>>>
>>> On Wed, May 29, 2019 at 5:20 PM Alan Orth wrote:
>>>
>>>> Dear Ravi,
>>>>
>>>> Thank you for the link to the blog post series; it is very informative
>>>> and current! If I understand your blog post correctly then I think the
>>>> answer to your previous question about pending AFRs is: no, there are no
>>>> pending AFRs. I have identified one file that is a good test case to try to
>>>> understand what happened after I issued the `gluster volume replace-brick
>>>> ... commit force` a few days ago and then added the same original brick
>>>> back to the volume later.
This is the current state of the replica 2 >>>> distribute/replicate volume: >>>> >>>> [root at wingu0 ~]# gluster volume info apps >>>> >>>> Volume Name: apps >>>> Type: Distributed-Replicate >>>> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >>>> Status: Started >>>> Snapshot Count: 0 >>>> Number of Bricks: 3 x 2 = 6 >>>> Transport-type: tcp >>>> Bricks: >>>> Brick1: wingu3:/mnt/gluster/apps >>>> Brick2: wingu4:/mnt/gluster/apps >>>> Brick3: wingu05:/data/glusterfs/sdb/apps >>>> Brick4: wingu06:/data/glusterfs/sdb/apps >>>> Brick5: wingu0:/mnt/gluster/apps >>>> Brick6: wingu05:/data/glusterfs/sdc/apps >>>> Options Reconfigured: >>>> diagnostics.client-log-level: DEBUG >>>> storage.health-check-interval: 10 >>>> nfs.disable: on >>>> >>>> I checked the xattrs of one file that is missing from the volume's FUSE >>>> mount (though I can read it if I access its full path explicitly), but is >>>> present in several of the volume's bricks (some with full size, others >>>> empty): >>>> >>>> [root at wingu0 ~]# getfattr -d -m. -e hex >>>> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> >>>> getfattr: Removing leading '/' from absolute path names >>>> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>> trusted.afr.apps-client-3=0x000000000000000000000000 >>>> trusted.afr.apps-client-5=0x000000000000000000000000 >>>> trusted.afr.dirty=0x000000000000000000000000 >>>> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>> >>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> getfattr: Removing leading '/' from absolute path names >>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>> >>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> getfattr: Removing leading '/' from absolute path names >>>> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>> >>>> [root at wingu06 ~]# getfattr -d -m. 
-e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> getfattr: Removing leading '/' from absolute path names >>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>> >>>> According to the trusted.afr.apps-client-xx xattrs this particular >>>> file should be on bricks with id "apps-client-3" and "apps-client-5". It >>>> took me a few hours to realize that the brick-id values are recorded in the >>>> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >>>> those brick-id values with a volfile backup from before the replace-brick, >>>> I realized that the files are simply on the wrong brick now as far as >>>> Gluster is concerned. This particular file is now on the brick for >>>> "apps-client-4". As an experiment I copied this one file to the two >>>> bricks listed in the xattrs and I was then able to see the file from the >>>> FUSE mount (yay!). >>>> >>>> Other than replacing the brick, removing it, and then adding the old >>>> brick on the original server back, there has been no change in the data >>>> this entire time. Can I change the brick IDs in the volfiles so they >>>> reflect where the data actually is? Or perhaps script something to reset >>>> all the xattrs on the files/directories to point to the correct bricks? >>>> >>>> Thank you for any help or pointers, >>>> >>>> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >>>> wrote: >>>> >>>>> >>>>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>>>> >>>>> >>>>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>>> >>>>> Dear Ravishankar, >>>>> >>>>> I'm not sure if Brick4 had pending AFRs because I don't know what that >>>>> means and it's been a few days so I am not sure I would be able to find >>>>> that information. >>>>> >>>>> When you find some time, have a look at a blog >>>>> series I wrote about AFR- I've tried to explain what one needs to know to >>>>> debug replication related issues in it. >>>>> >>>>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>>>> >>>>> -Ravi >>>>> >>>>> >>>>> Anyways, after wasting a few days rsyncing the old brick to a new host >>>>> I decided to just try to add the old brick back into the volume instead of >>>>> bringing it up on the new host. I created a new brick directory on the old >>>>> host, moved the old brick's contents into that new directory (minus the >>>>> .glusterfs directory), added the new brick to the volume, and then did >>>>> Vlad's find/stat trick? from the brick to the FUSE mount point. >>>>> >>>>> The interesting problem I have now is that some files don't appear in >>>>> the FUSE mount's directory listings, but I can actually list them directly >>>>> and even read them. What could cause that? >>>>> >>>>> Not sure, too many variables in the hacks that you did to take a >>>>> guess. You can check if the contents of the .glusterfs folder are in order >>>>> on the new brick (example hardlink for files and symlinks for directories >>>>> are present etc.) . >>>>> Regards, >>>>> Ravi >>>>> >>>>> >>>>> Thanks, >>>>> >>>>> ? 
>>>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>>> >>>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N >>>>> wrote: >>>>> >>>>>> >>>>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>>> >>>>>> Dear list, >>>>>> >>>>>> I seem to have gotten into a tricky situation. Today I brought up a >>>>>> shiny new server with new disk arrays and attempted to replace one brick of >>>>>> a replica 2 distribute/replicate volume on an older server using the >>>>>> `replace-brick` command: >>>>>> >>>>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>>> >>>>>> The command was successful and I see the new brick in the output of >>>>>> `gluster volume info`. The problem is that Gluster doesn't seem to be >>>>>> migrating the data, >>>>>> >>>>>> `replace-brick` definitely must heal (not migrate) the data. In your >>>>>> case, data must have been healed from Brick-4 to the replaced Brick-3. Are >>>>>> there any errors in the self-heal daemon logs of Brick-4's node? Does >>>>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>>>> date. replace-brick command internally does all the setfattr steps that are >>>>>> mentioned in the doc. >>>>>> >>>>>> -Ravi >>>>>> >>>>>> >>>>>> and now the original brick that I replaced is no longer part of the >>>>>> volume (and a few terabytes of data are just sitting on the old brick): >>>>>> >>>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>>> Brick1: wingu4:/mnt/gluster/homes >>>>>> Brick2: wingu3:/mnt/gluster/homes >>>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>>> >>>>>> I see the Gluster docs have a more complicated procedure for >>>>>> replacing bricks that involves getfattr/setfattr?. How can I tell Gluster >>>>>> about the old brick? I see that I have a backup of the old volfile thanks >>>>>> to yum's rpmsave function if that helps. >>>>>> >>>>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you >>>>>> can give. >>>>>> >>>>>> ? >>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>>> >>>>>> -- >>>>>> Alan Orth >>>>>> alan.orth at gmail.com >>>>>> https://picturingjordan.com >>>>>> https://englishbulgaria.net >>>>>> https://mjanja.ch >>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>> Nietzsche >>>>>> >>>>>> _______________________________________________ >>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>> >>>>>> >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>> Nietzsche >>>>> >>>>> >>>>> _______________________________________________ >>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>>> >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." 
--Friedrich Nietzsche
>>>>
>>> --
>>> Alan Orth
>>> alan.orth at gmail.com
>>> https://picturingjordan.com
>>> https://englishbulgaria.net
>>> https://mjanja.ch
>>> "In heaven all the interesting people are missing." --Friedrich Nietzsche
>>>
>> --
>> Alan Orth
>> alan.orth at gmail.com
>> https://picturingjordan.com
>> https://englishbulgaria.net
>> https://mjanja.ch
>> "In heaven all the interesting people are missing." --Friedrich Nietzsche
>>
> --
> Alan Orth
> alan.orth at gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
> "In heaven all the interesting people are missing." --Friedrich Nietzsche

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." --Friedrich Nietzsche
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From nbalacha at redhat.com  Sat Jun  8 02:27:51 2019
From: nbalacha at redhat.com (Nithya Balachandran)
Date: Sat, 8 Jun 2019 07:57:51 +0530
Subject: [Gluster-users] Does replace-brick migrate data?
In-Reply-To:
References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com>
Message-ID:

On Sat, 8 Jun 2019 at 01:29, Alan Orth wrote:

> Dear Ravi,
>
> In the last week I have completed a fix-layout and a full INDEX heal on
> this volume. Now I've started a rebalance and I see a few terabytes of data
> going around on different bricks since yesterday, which I'm sure is good.
>
> While I wait for the rebalance to finish, I'm wondering if you know what
> would cause directories to be missing from the FUSE mount point? If I list
> the directories explicitly I can see their contents, but they do not appear
> in their parent directories' listing. In the case of duplicated files it is
> always because the files are not on the correct bricks (according to the
> Dynamo/Elastic Hash algorithm), and I can fix it by copying the file to the
> correct brick(s) and removing it from the others (along with their
> .glusterfs hard links). So what could cause directories to be missing?
>
Hi Alan,

The directories that don't show up in the parent directory listing probably
do not exist on the hashed subvol. Please check the backend bricks to see
if they are missing on any of them.
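For example, something like this quick check run from any one node (a rough
sketch only; it assumes passwordless ssh between the nodes, takes the brick
roots from `gluster volume info homes`, and MISSING_DIR is a placeholder for
the directory's path relative to the volume root):

    for host in wingu0 wingu3 wingu4 wingu05 wingu06; do
        echo "### $host"
        # not every host has every brick path; errors are silenced
        ssh "$host" 'ls -ld /mnt/gluster/homes/MISSING_DIR \
            /data/glusterfs/sdb/homes/MISSING_DIR \
            /data/glusterfs/sdc/homes/MISSING_DIR 2>/dev/null'
    done

A brick on which nothing is printed is one where the directory does not
exist.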
Regards,
Nithya

> Thank you,
>
> On Wed, Jun 5, 2019 at 1:08 AM Alan Orth wrote:
>
>> Hi Ravi,
>>
>> You're right that I had mentioned using rsync to copy the brick content to
>> a new host, but in the end I actually decided not to bring it up on a new
>> brick. Instead I added the original brick back into the volume. So the
>> xattrs and symlinks to .glusterfs on the original brick are fine. I think
>> the problem probably lies with a remove-brick that got interrupted. A few
>> weeks ago during the maintenance I had tried to remove a brick and then
>> after twenty minutes and no obvious progress I stopped it; after that the
>> bricks were still part of the volume.
>>
>> In the last few days I have run a fix-layout that took 26 hours and
>> finished successfully. Then I started a full index heal and it has healed
>> about 3.3 million files in a few days and I see a clear increase of network
>> traffic from old brick host to new brick host over that time. Once the full
>> index heal completes I will try to do a rebalance.
>>
>> Thank you,
>>
>> On Mon, Jun 3, 2019 at 7:40 PM Ravishankar N wrote:
>>
>>> On 01/06/19 9:37 PM, Alan Orth wrote:
>>>
>>> Dear Ravi,
>>>
>>> The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I
>>> could verify them for six bricks and millions of files, though... :\
>>>
>>> Hi Alan,
>>>
>>> The reason I asked this is because you had mentioned in one of your
>>> earlier emails that when you moved content from the old brick to the new
>>> one, you had skipped the .glusterfs directory. So I was assuming that when
>>> you added back this new brick to the cluster, it might have been missing
>>> the .glusterfs entries. If that is the case, one way to verify could be to
>>> check using a script if all files on the brick have a link-count of at
>>> least 2 and all dirs have valid symlinks inside .glusterfs pointing to
>>> themselves.
>>>
>>> I had a small success in fixing some issues with duplicated files on the
>>> FUSE mount point yesterday. I read quite a bit about the elastic hashing
>>> algorithm that determines which files get placed on which bricks based on
>>> the hash of their filename and the trusted.glusterfs.dht xattr on brick
>>> directories (thanks to Joe Julian's blog post and Python script for showing
>>> how it works¹). With that knowledge I looked closer at one of the files
>>> that was appearing as duplicated on the FUSE mount and found that it was
>>> also duplicated on more than `replica 2` bricks. For this particular file I
>>> found two "real" files and several zero-size files with
>>> trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were on
>>> the correct brick as far as the DHT layout is concerned, so I copied one of
>>> them to the correct brick, deleted the others and their hard links, and did
>>> a `stat` on the file from the FUSE mount point and it fixed itself. Yay!
>>>
>>> Could this have been caused by a replace-brick that got interrupted and
>>> didn't finish re-labeling the xattrs?
>>>
>>> No, replace-brick only initiates AFR self-heal, which just copies the
>>> contents from the other brick(s) of the *same* replica pair into the
>>> replaced brick. The link-to files are created by DHT when you rename a
>>> file from the client. If the new name hashes to a different brick, DHT
>>> does not move the entire file there. It instead creates the link-to file
>>> (the one with the dht.linkto xattrs) on the hashed subvol. The value of
>>> this xattr points to the brick where the actual data is (`getfattr -e
>>> text` to see it for yourself). Perhaps you had attempted a rebalance or
>>> remove-brick earlier and interrupted that?
>>>
>>> Should I be thinking of some heuristics to identify and fix these issues
>>> with a script (incorrect brick placement), or is this something a fix
>>> layout or repeated volume heals can fix? I've already completed a whole
>>> heal on this particular volume this week and it did heal about 1,000,000
>>> files (mostly data and metadata, but about 20,000 entry heals as well).
>>>
>>> Maybe you should let the AFR self-heals complete first and then attempt
>>> a full rebalance to take care of the dht link-to files. But if the files
>>> are in millions, it could take quite some time to complete.
>>> Regards,
>>> Ravi
>>>
>>> Thanks for your support,
>>>
>>> ¹
https://joejulian.name/post/dht-misses-are-expensive/
>>>
>>> On Fri, May 31, 2019 at 7:57 AM Ravishankar N wrote:
>>>
>>>> On 31/05/19 3:20 AM, Alan Orth wrote:
>>>>
>>>> Dear Ravi,
>>>>
>>>> I spent a bit of time inspecting the xattrs on some files and
>>>> directories on a few bricks for this volume and it looks a bit messy. Even
>>>> if I could make sense of it for a few and potentially heal them manually,
>>>> there are millions of files and directories in total so that's definitely
>>>> not a scalable solution. After a few missteps with `replace-brick ...
>>>> commit force` in the last week (one of which on a brick that was
>>>> dead/offline) as well as some premature `remove-brick` commands, I'm unsure
>>>> how to proceed and I'm getting demotivated. It's scary how quickly
>>>> things get out of hand in distributed systems...
>>>>
>>>> Hi Alan,
>>>> The one good thing about gluster is that the data is always
>>>> available directly on the backend bricks even if your volume has
>>>> inconsistencies at the gluster level. So theoretically, if your cluster is
>>>> FUBAR, you could just create a new volume and copy all data onto it via its
>>>> mount from the old volume's bricks.
>>>>
>>>> I had hoped that bringing the old brick back up would help, but by the
>>>> time I added it again a few days had passed and all the brick-ids had
>>>> changed due to the replace/remove brick commands, not to mention that the
>>>> trusted.afr.$volume-client-xx values were now probably pointing to the
>>>> wrong bricks (?).
>>>>
>>>> Anyways, a few hours ago I started a full heal on the volume and I see
>>>> that there is a sustained 100MiB/sec of network traffic going from the old
>>>> brick's host to the new one. The completed heals reported in the logs look
>>>> promising too:
>>>>
>>>> Old brick host:
>>>>
>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E
>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c
>>>> 281614 Completed data selfheal
>>>> 84 Completed entry selfheal
>>>> 299648 Completed metadata selfheal
>>>>
>>>> New brick host:
>>>>
>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E
>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c
>>>> 198256 Completed data selfheal
>>>> 16829 Completed entry selfheal
>>>> 229664 Completed metadata selfheal
>>>>
>>>> So that's good I guess, though I have no idea how long it will take or
>>>> if it will fix the "missing files" issue on the FUSE mount. I've increased
>>>> cluster.shd-max-threads to 8 to hopefully speed up the heal process.
>>>>
>>>> The afr xattrs should not cause files to disappear from mount. If the
>>>> xattr names do not match what each AFR subvol expects (for eg. in a replica
>>>> 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd
>>>> subvol and so on - ) for its children then it won't heal the data, that is
>>>> all. But in your case I see some inconsistencies like one brick having the
>>>> actual file (licenseserver.cfg) and the other having a linkto file
>>>> (the one with the dht.linkto xattr) *in the same replica pair*.
>>>>
>>>> I'd be happy for any advice or pointers,
>>>>
>>>> Did you check if the .glusterfs hardlinks/symlinks exist and are in
>>>> order for all bricks?
>>>>
>>>> -Ravi
>>>>
>>>> On Wed, May 29, 2019 at 5:20 PM Alan Orth wrote:
>>>>
>>>>> Dear Ravi,
>>>>>
>>>>> Thank you for the link to the blog post series; it is very informative
>>>>> and current!
If I understand your blog post correctly then I think the >>>>> answer to your previous question about pending AFRs is: no, there are no >>>>> pending AFRs. I have identified one file that is a good test case to try to >>>>> understand what happened after I issued the `gluster volume replace-brick >>>>> ... commit force` a few days ago and then added the same original brick >>>>> back to the volume later. This is the current state of the replica 2 >>>>> distribute/replicate volume: >>>>> >>>>> [root at wingu0 ~]# gluster volume info apps >>>>> >>>>> Volume Name: apps >>>>> Type: Distributed-Replicate >>>>> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >>>>> Status: Started >>>>> Snapshot Count: 0 >>>>> Number of Bricks: 3 x 2 = 6 >>>>> Transport-type: tcp >>>>> Bricks: >>>>> Brick1: wingu3:/mnt/gluster/apps >>>>> Brick2: wingu4:/mnt/gluster/apps >>>>> Brick3: wingu05:/data/glusterfs/sdb/apps >>>>> Brick4: wingu06:/data/glusterfs/sdb/apps >>>>> Brick5: wingu0:/mnt/gluster/apps >>>>> Brick6: wingu05:/data/glusterfs/sdc/apps >>>>> Options Reconfigured: >>>>> diagnostics.client-log-level: DEBUG >>>>> storage.health-check-interval: 10 >>>>> nfs.disable: on >>>>> >>>>> I checked the xattrs of one file that is missing from the volume's >>>>> FUSE mount (though I can read it if I access its full path explicitly), but >>>>> is present in several of the volume's bricks (some with full size, others >>>>> empty): >>>>> >>>>> [root at wingu0 ~]# getfattr -d -m. -e hex >>>>> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> >>>>> getfattr: Removing leading '/' from absolute path names >>>>> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>> trusted.afr.apps-client-3=0x000000000000000000000000 >>>>> trusted.afr.apps-client-5=0x000000000000000000000000 >>>>> trusted.afr.dirty=0x000000000000000000000000 >>>>> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>> >>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> getfattr: Removing leading '/' from absolute path names >>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>> >>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> getfattr: Removing leading '/' from absolute path names >>>>> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>> >>>>> [root at wingu06 ~]# getfattr -d -m. 
-e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> getfattr: Removing leading '/' from absolute path names >>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>> >>>>> According to the trusted.afr.apps-client-xx xattrs this particular >>>>> file should be on bricks with id "apps-client-3" and "apps-client-5". It >>>>> took me a few hours to realize that the brick-id values are recorded in the >>>>> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >>>>> those brick-id values with a volfile backup from before the replace-brick, >>>>> I realized that the files are simply on the wrong brick now as far as >>>>> Gluster is concerned. This particular file is now on the brick for >>>>> "apps-client-4". As an experiment I copied this one file to the two >>>>> bricks listed in the xattrs and I was then able to see the file from the >>>>> FUSE mount (yay!). >>>>> >>>>> Other than replacing the brick, removing it, and then adding the old >>>>> brick on the original server back, there has been no change in the data >>>>> this entire time. Can I change the brick IDs in the volfiles so they >>>>> reflect where the data actually is? Or perhaps script something to reset >>>>> all the xattrs on the files/directories to point to the correct bricks? >>>>> >>>>> Thank you for any help or pointers, >>>>> >>>>> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >>>>> wrote: >>>>> >>>>>> >>>>>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>>>>> >>>>>> >>>>>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>>>> >>>>>> Dear Ravishankar, >>>>>> >>>>>> I'm not sure if Brick4 had pending AFRs because I don't know what >>>>>> that means and it's been a few days so I am not sure I would be able to >>>>>> find that information. >>>>>> >>>>>> When you find some time, have a look at a blog >>>>>> series I wrote about AFR- I've tried to >>>>>> explain what one needs to know to debug replication related issues in it. >>>>>> >>>>>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>>>>> >>>>>> -Ravi >>>>>> >>>>>> >>>>>> Anyways, after wasting a few days rsyncing the old brick to a new >>>>>> host I decided to just try to add the old brick back into the volume >>>>>> instead of bringing it up on the new host. I created a new brick directory >>>>>> on the old host, moved the old brick's contents into that new directory >>>>>> (minus the .glusterfs directory), added the new brick to the volume, and >>>>>> then did Vlad's find/stat trick? from the brick to the FUSE mount point. >>>>>> >>>>>> The interesting problem I have now is that some files don't appear in >>>>>> the FUSE mount's directory listings, but I can actually list them directly >>>>>> and even read them. What could cause that? >>>>>> >>>>>> Not sure, too many variables in the hacks that you did to take a >>>>>> guess. You can check if the contents of the .glusterfs folder are in order >>>>>> on the new brick (example hardlink for files and symlinks for directories >>>>>> are present etc.) . >>>>>> Regards, >>>>>> Ravi >>>>>> >>>>>> >>>>>> Thanks, >>>>>> >>>>>> ? 
>>>>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>>>> >>>>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N >>>>>> wrote: >>>>>> >>>>>>> >>>>>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>>>> >>>>>>> Dear list, >>>>>>> >>>>>>> I seem to have gotten into a tricky situation. Today I brought up a >>>>>>> shiny new server with new disk arrays and attempted to replace one brick of >>>>>>> a replica 2 distribute/replicate volume on an older server using the >>>>>>> `replace-brick` command: >>>>>>> >>>>>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>>>> >>>>>>> The command was successful and I see the new brick in the output of >>>>>>> `gluster volume info`. The problem is that Gluster doesn't seem to be >>>>>>> migrating the data, >>>>>>> >>>>>>> `replace-brick` definitely must heal (not migrate) the data. In your >>>>>>> case, data must have been healed from Brick-4 to the replaced Brick-3. Are >>>>>>> there any errors in the self-heal daemon logs of Brick-4's node? Does >>>>>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>>>>> date. replace-brick command internally does all the setfattr steps that are >>>>>>> mentioned in the doc. >>>>>>> >>>>>>> -Ravi >>>>>>> >>>>>>> >>>>>>> and now the original brick that I replaced is no longer part of the >>>>>>> volume (and a few terabytes of data are just sitting on the old brick): >>>>>>> >>>>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>>>> Brick1: wingu4:/mnt/gluster/homes >>>>>>> Brick2: wingu3:/mnt/gluster/homes >>>>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>>>> >>>>>>> I see the Gluster docs have a more complicated procedure for >>>>>>> replacing bricks that involves getfattr/setfattr?. How can I tell Gluster >>>>>>> about the old brick? I see that I have a backup of the old volfile thanks >>>>>>> to yum's rpmsave function if that helps. >>>>>>> >>>>>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you >>>>>>> can give. >>>>>>> >>>>>>> ? >>>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>>>> >>>>>>> -- >>>>>>> Alan Orth >>>>>>> alan.orth at gmail.com >>>>>>> https://picturingjordan.com >>>>>>> https://englishbulgaria.net >>>>>>> https://mjanja.ch >>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>> Nietzsche >>>>>>> >>>>>>> _______________________________________________ >>>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>> >>>>>>> >>>>>> >>>>>> -- >>>>>> Alan Orth >>>>>> alan.orth at gmail.com >>>>>> https://picturingjordan.com >>>>>> https://englishbulgaria.net >>>>>> https://mjanja.ch >>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>> Nietzsche >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>> >>>>>> >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are missing." 
--Friedrich Nietzsche
>>>>>
>>>> --
>>>> Alan Orth
>>>> alan.orth at gmail.com
>>>> https://picturingjordan.com
>>>> https://englishbulgaria.net
>>>> https://mjanja.ch
>>>> "In heaven all the interesting people are missing." --Friedrich Nietzsche
>>>>
>>> --
>>> Alan Orth
>>> alan.orth at gmail.com
>>> https://picturingjordan.com
>>> https://englishbulgaria.net
>>> https://mjanja.ch
>>> "In heaven all the interesting people are missing." --Friedrich Nietzsche
>>>
>> --
>> Alan Orth
>> alan.orth at gmail.com
>> https://picturingjordan.com
>> https://englishbulgaria.net
>> https://mjanja.ch
>> "In heaven all the interesting people are missing." --Friedrich Nietzsche
>>
> --
> Alan Orth
> alan.orth at gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
> "In heaven all the interesting people are missing." --Friedrich Nietzsche
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." --Friedrich Nietzsche
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From alan.orth at gmail.com  Sat Jun  8 08:25:12 2019
From: alan.orth at gmail.com (Alan Orth)
Date: Sat, 8 Jun 2019 11:25:12 +0300
Subject: [Gluster-users] Does replace-brick migrate data?
In-Reply-To:
References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com>
Message-ID:

Thank you, Nithya.

The "missing" directory is indeed present on all bricks. I enabled
client-log-level DEBUG on the volume and then noticed the following in the
FUSE mount log when doing a `stat` on the "missing" directory on the FUSE
mount:

[2019-06-08 08:03:30.240738] D [MSGID: 0] [dht-common.c:3454:dht_do_fresh_lookup] 0-homes-dht: Calling fresh lookup for /aorth/data on homes-replicate-2
[2019-06-08 08:03:30.241138] D [MSGID: 0] [dht-common.c:3013:dht_lookup_cbk] 0-homes-dht: fresh_lookup returned for /aorth/data with op_ret 0
[2019-06-08 08:03:30.241610] D [MSGID: 0] [dht-common.c:1354:dht_lookup_dir_cbk] 0-homes-dht: Internal xattr trusted.glusterfs.dht.mds is not present on path /aorth/data gfid is fb87699f-ebf3-4098-977d-85c3a70b849c
[2019-06-08 08:06:18.880961] D [MSGID: 0] [dht-common.c:1559:dht_revalidate_cbk] 0-homes-dht: revalidate lookup of /aorth/data returned with op_ret 0
[2019-06-08 08:06:18.880963] D [MSGID: 0] [dht-common.c:1651:dht_revalidate_cbk] 0-homes-dht: internal xattr trusted.glusterfs.dht.mds is not present on path /aorth/data gfid is fb87699f-ebf3-4098-977d-85c3a70b849c
[2019-06-08 08:06:18.880996] D [MSGID: 0] [dht-common.c:914:dht_common_mark_mdsxattr] 0-homes-dht: internal xattr trusted.glusterfs.dht.mds is present on subvol on path /aorth/data gfid is fb87699f-ebf3-4098-977d-85c3a70b849c

One message says the trusted.glusterfs.dht.mds xattr is not present, then
the next says it is present. Is that relevant? I looked at the xattrs of
that directory on all the bricks and it does seem to be inconsistent (also
the modification times on the directory are different):

[root at wingu0 ~]# getfattr -d -m.
-e hex /mnt/gluster/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: mnt/gluster/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.afr.dirty=0x000000000000000000000000 trusted.afr.homes-client-3=0x000000000000000200000002 trusted.afr.homes-client-5=0x000000000000000000000000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff200000000b6dd59efffffffff [root at wingu3 ~]# getfattr -d -m. -e hex /mnt/gluster/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: mnt/gluster/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.afr.homes-client-0=0x000000000000000000000000 trusted.afr.homes-client-1=0x000000000000000000000000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff2000000000000000049251e2d trusted.glusterfs.dht.mds=0x00000000 [root at wingu4 ~]# getfattr -d -m. -e hex /mnt/gluster/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: mnt/gluster/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.afr.homes-client-0=0x000000000000000000000000 trusted.afr.homes-client-1=0x000000000000000000000000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff2000000000000000049251e2d trusted.glusterfs.dht.mds=0x00000000 [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: data/glusterfs/sdb/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.afr.homes-client-2=0x000000000000000000000000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff20000000049251e2eb6dd59ee [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: data/glusterfs/sdc/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff200000000b6dd59efffffffff [root at wingu06 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/homes/aorth/data getfattr: Removing leading '/' from absolute path names # file: data/glusterfs/sdb/homes/aorth/data security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 trusted.gfid=0xfb87699febf34098977d85c3a70b849c trusted.glusterfs.dht=0xe7c11ff20000000049251e2eb6dd59ee This is a replica 2 volume on Gluster 5.6. Thank you, On Sat, Jun 8, 2019 at 5:28 AM Nithya Balachandran wrote: > > > On Sat, 8 Jun 2019 at 01:29, Alan Orth wrote: > >> Dear Ravi, >> >> In the last week I have completed a fix-layout and a full INDEX heal on >> this volume. Now I've started a rebalance and I see a few terabytes of data >> going around on different bricks since yesterday, which I'm sure is good. >> >> While I wait for the rebalance to finish, I'm wondering if you know what >> would cause directories to be missing from the FUSE mount point? If I list >> the directories explicitly I can see their contents, but they do not appear >> in their parent directories' listing. 
In the case of duplicated files it is
>> always because the files are not on the correct bricks (according to the
>> Dynamo/Elastic Hash algorithm), and I can fix it by copying the file to the
>> correct brick(s) and removing it from the others (along with their
>> .glusterfs hard links). So what could cause directories to be missing?
>>
> Hi Alan,
>
> The directories that don't show up in the parent directory listing probably
> do not exist on the hashed subvol. Please check the backend bricks to see
> if they are missing on any of them.
>
> Regards,
> Nithya
>
>> Thank you,
>>
>> On Wed, Jun 5, 2019 at 1:08 AM Alan Orth wrote:
>>
>>> Hi Ravi,
>>>
>>> You're right that I had mentioned using rsync to copy the brick content
>>> to a new host, but in the end I actually decided not to bring it up on a
>>> new brick. Instead I added the original brick back into the volume. So the
>>> xattrs and symlinks to .glusterfs on the original brick are fine. I think
>>> the problem probably lies with a remove-brick that got interrupted. A few
>>> weeks ago during the maintenance I had tried to remove a brick and then
>>> after twenty minutes and no obvious progress I stopped it; after that the
>>> bricks were still part of the volume.
>>>
>>> In the last few days I have run a fix-layout that took 26 hours and
>>> finished successfully. Then I started a full index heal and it has healed
>>> about 3.3 million files in a few days and I see a clear increase of network
>>> traffic from old brick host to new brick host over that time. Once the full
>>> index heal completes I will try to do a rebalance.
>>>
>>> Thank you,
>>>
>>> On Mon, Jun 3, 2019 at 7:40 PM Ravishankar N wrote:
>>>
>>>> On 01/06/19 9:37 PM, Alan Orth wrote:
>>>>
>>>> Dear Ravi,
>>>>
>>>> The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I
>>>> could verify them for six bricks and millions of files, though... :\
>>>>
>>>> Hi Alan,
>>>>
>>>> The reason I asked this is because you had mentioned in one of your
>>>> earlier emails that when you moved content from the old brick to the new
>>>> one, you had skipped the .glusterfs directory. So I was assuming that when
>>>> you added back this new brick to the cluster, it might have been missing
>>>> the .glusterfs entries. If that is the case, one way to verify could be to
>>>> check using a script if all files on the brick have a link-count of at
>>>> least 2 and all dirs have valid symlinks inside .glusterfs pointing to
>>>> themselves.
>>>>
>>>> I had a small success in fixing some issues with duplicated files on
>>>> the FUSE mount point yesterday. I read quite a bit about the elastic
>>>> hashing algorithm that determines which files get placed on which bricks
>>>> based on the hash of their filename and the trusted.glusterfs.dht xattr on
>>>> brick directories (thanks to Joe Julian's blog post and Python script for
>>>> showing how it works¹). With that knowledge I looked closer at one of the
>>>> files that was appearing as duplicated on the FUSE mount and found that it
>>>> was also duplicated on more than `replica 2` bricks. For this particular
>>>> file I found two "real" files and several zero-size files with
>>>> trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were on
>>>> the correct brick as far as the DHT layout is concerned, so I copied one of
>>>> them to the correct brick, deleted the others and their hard links, and did
>>>> a `stat` on the file from the FUSE mount point and it fixed itself. Yay!
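(Incidentally, this is roughly how I've been reading the trusted.glusterfs.dht
layout xattrs while checking these; if I understand Joe's post correctly, the
last two 32-bit words of the value are the start and end of that brick's hash
range. A rough, untested sketch:)

    # print one directory's DHT hash range on the local brick
    v=$(getfattr -n trusted.glusterfs.dht -e hex /mnt/gluster/homes/aorth/data \
        | sed -n 's/^trusted.glusterfs.dht=0x//p')
    printf 'start=0x%s end=0x%s\n' "${v:16:8}" "${v:24:8}"

(The wingu0 value earlier in this mail decodes to start=0xb6dd59ef
end=0xffffffff; across the three subvolumes the ranges should tile
0x00000000-0xffffffff with no gaps or overlaps.)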
>>>> Could this have been caused by a replace-brick that got interrupted and
>>>> didn't finish re-labeling the xattrs?
>>>>
>>>> No, replace-brick only initiates AFR self-heal, which just copies the
>>>> contents from the other brick(s) of the *same* replica pair into the
>>>> replaced brick. The link-to files are created by DHT when you rename a
>>>> file from the client. If the new name hashes to a different brick, DHT
>>>> does not move the entire file there. It instead creates the link-to file
>>>> (the one with the dht.linkto xattrs) on the hashed subvol. The value of
>>>> this xattr points to the brick where the actual data is (`getfattr -e
>>>> text` to see it for yourself). Perhaps you had attempted a rebalance or
>>>> remove-brick earlier and interrupted that?
>>>>
>>>> Should I be thinking of some heuristics to identify and fix these
>>>> issues with a script (incorrect brick placement), or is this something a
>>>> fix layout or repeated volume heals can fix? I've already completed a whole
>>>> heal on this particular volume this week and it did heal about 1,000,000
>>>> files (mostly data and metadata, but about 20,000 entry heals as well).
>>>>
>>>> Maybe you should let the AFR self-heals complete first and then attempt
>>>> a full rebalance to take care of the dht link-to files. But if the files
>>>> are in millions, it could take quite some time to complete.
>>>> Regards,
>>>> Ravi
>>>>
>>>> Thanks for your support,
>>>>
>>>> ¹ https://joejulian.name/post/dht-misses-are-expensive/
>>>>
>>>> On Fri, May 31, 2019 at 7:57 AM Ravishankar N wrote:
>>>>
>>>>> On 31/05/19 3:20 AM, Alan Orth wrote:
>>>>>
>>>>> Dear Ravi,
>>>>>
>>>>> I spent a bit of time inspecting the xattrs on some files and
>>>>> directories on a few bricks for this volume and it looks a bit messy. Even
>>>>> if I could make sense of it for a few and potentially heal them manually,
>>>>> there are millions of files and directories in total so that's definitely
>>>>> not a scalable solution. After a few missteps with `replace-brick ...
>>>>> commit force` in the last week (one of which on a brick that was
>>>>> dead/offline) as well as some premature `remove-brick` commands, I'm unsure
>>>>> how to proceed and I'm getting demotivated. It's scary how quickly
>>>>> things get out of hand in distributed systems...
>>>>>
>>>>> Hi Alan,
>>>>> The one good thing about gluster is that the data is always
>>>>> available directly on the backend bricks even if your volume has
>>>>> inconsistencies at the gluster level. So theoretically, if your cluster is
>>>>> FUBAR, you could just create a new volume and copy all data onto it via its
>>>>> mount from the old volume's bricks.
>>>>>
>>>>> I had hoped that bringing the old brick back up would help, but by the
>>>>> time I added it again a few days had passed and all the brick-ids had
>>>>> changed due to the replace/remove brick commands, not to mention that the
>>>>> trusted.afr.$volume-client-xx values were now probably pointing to the
>>>>> wrong bricks (?).
>>>>>
>>>>> Anyways, a few hours ago I started a full heal on the volume and I see
>>>>> that there is a sustained 100MiB/sec of network traffic going from the old
>>>>> brick's host to the new one.
The completed heals reported in the logs look >>>>> promising too: >>>>> >>>>> Old brick host: >>>>> >>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >>>>> 281614 Completed data selfheal >>>>> 84 Completed entry selfheal >>>>> 299648 Completed metadata selfheal >>>>> >>>>> New brick host: >>>>> >>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >>>>> 198256 Completed data selfheal >>>>> 16829 Completed entry selfheal >>>>> 229664 Completed metadata selfheal >>>>> >>>>> So that's good I guess, though I have no idea how long it will take or >>>>> if it will fix the "missing files" issue on the FUSE mount. I've increased >>>>> cluster.shd-max-threads to 8 to hopefully speed up the heal process. >>>>> >>>>> The afr xattrs should not cause files to disappear from mount. If the >>>>> xattr names do not match what each AFR subvol expects (for eg. in a replica >>>>> 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd >>>>> subvol and so on - ) for its children then it won't heal the data, that is >>>>> all. But in your case I see some inconsistencies like one brick having the >>>>> actual file (licenseserver.cfg) and the other having a linkto file >>>>> (the one with the dht.linkto xattr) *in the same replica pair*. >>>>> >>>>> >>>>> I'd be happy for any advice or pointers, >>>>> >>>>> Did you check if the .glusterfs hardlinks/symlinks exist and are in >>>>> order for all bricks? >>>>> >>>>> -Ravi >>>>> >>>>> >>>>> On Wed, May 29, 2019 at 5:20 PM Alan Orth wrote: >>>>> >>>>>> Dear Ravi, >>>>>> >>>>>> Thank you for the link to the blog post series?it is very informative >>>>>> and current! If I understand your blog post correctly then I think the >>>>>> answer to your previous question about pending AFRs is: no, there are no >>>>>> pending AFRs. I have identified one file that is a good test case to try to >>>>>> understand what happened after I issued the `gluster volume replace-brick >>>>>> ... commit force` a few days ago and then added the same original brick >>>>>> back to the volume later. This is the current state of the replica 2 >>>>>> distribute/replicate volume: >>>>>> >>>>>> [root at wingu0 ~]# gluster volume info apps >>>>>> >>>>>> Volume Name: apps >>>>>> Type: Distributed-Replicate >>>>>> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >>>>>> Status: Started >>>>>> Snapshot Count: 0 >>>>>> Number of Bricks: 3 x 2 = 6 >>>>>> Transport-type: tcp >>>>>> Bricks: >>>>>> Brick1: wingu3:/mnt/gluster/apps >>>>>> Brick2: wingu4:/mnt/gluster/apps >>>>>> Brick3: wingu05:/data/glusterfs/sdb/apps >>>>>> Brick4: wingu06:/data/glusterfs/sdb/apps >>>>>> Brick5: wingu0:/mnt/gluster/apps >>>>>> Brick6: wingu05:/data/glusterfs/sdc/apps >>>>>> Options Reconfigured: >>>>>> diagnostics.client-log-level: DEBUG >>>>>> storage.health-check-interval: 10 >>>>>> nfs.disable: on >>>>>> >>>>>> I checked the xattrs of one file that is missing from the volume's >>>>>> FUSE mount (though I can read it if I access its full path explicitly), but >>>>>> is present in several of the volume's bricks (some with full size, others >>>>>> empty): >>>>>> >>>>>> [root at wingu0 ~]# getfattr -d -m. 
-e hex >>>>>> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> >>>>>> getfattr: Removing leading '/' from absolute path names >>>>>> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>> trusted.afr.apps-client-3=0x000000000000000000000000 >>>>>> trusted.afr.apps-client-5=0x000000000000000000000000 >>>>>> trusted.afr.dirty=0x000000000000000000000000 >>>>>> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>> >>>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> getfattr: Removing leading '/' from absolute path names >>>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>>> >>>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> getfattr: Removing leading '/' from absolute path names >>>>>> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>> >>>>>> [root at wingu06 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> getfattr: Removing leading '/' from absolute path names >>>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>>> >>>>>> According to the trusted.afr.apps-client-xx xattrs this particular >>>>>> file should be on bricks with id "apps-client-3" and "apps-client-5". It >>>>>> took me a few hours to realize that the brick-id values are recorded in the >>>>>> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >>>>>> those brick-id values with a volfile backup from before the replace-brick, >>>>>> I realized that the files are simply on the wrong brick now as far as >>>>>> Gluster is concerned. This particular file is now on the brick for >>>>>> "apps-client-4". As an experiment I copied this one file to the two >>>>>> bricks listed in the xattrs and I was then able to see the file from the >>>>>> FUSE mount (yay!). >>>>>> >>>>>> Other than replacing the brick, removing it, and then adding the old >>>>>> brick on the original server back, there has been no change in the data >>>>>> this entire time. Can I change the brick IDs in the volfiles so they >>>>>> reflect where the data actually is? 
Or perhaps script something to reset >>>>>> all the xattrs on the files/directories to point to the correct bricks? >>>>>> >>>>>> Thank you for any help or pointers, >>>>>> >>>>>> On Wed, May 29, 2019 at 7:24 AM Ravishankar N >>>>>> wrote: >>>>>> >>>>>>> >>>>>>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>>>>>> >>>>>>> >>>>>>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>>>>> >>>>>>> Dear Ravishankar, >>>>>>> >>>>>>> I'm not sure if Brick4 had pending AFRs because I don't know what >>>>>>> that means and it's been a few days so I am not sure I would be able to >>>>>>> find that information. >>>>>>> >>>>>>> When you find some time, have a look at a blog >>>>>>> series I wrote about AFR- I've tried to >>>>>>> explain what one needs to know to debug replication related issues in it. >>>>>>> >>>>>>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>>>>>> >>>>>>> -Ravi >>>>>>> >>>>>>> >>>>>>> Anyways, after wasting a few days rsyncing the old brick to a new >>>>>>> host I decided to just try to add the old brick back into the volume >>>>>>> instead of bringing it up on the new host. I created a new brick directory >>>>>>> on the old host, moved the old brick's contents into that new directory >>>>>>> (minus the .glusterfs directory), added the new brick to the volume, and >>>>>>> then did Vlad's find/stat trick? from the brick to the FUSE mount point. >>>>>>> >>>>>>> The interesting problem I have now is that some files don't appear >>>>>>> in the FUSE mount's directory listings, but I can actually list them >>>>>>> directly and even read them. What could cause that? >>>>>>> >>>>>>> Not sure, too many variables in the hacks that you did to take a >>>>>>> guess. You can check if the contents of the .glusterfs folder are in order >>>>>>> on the new brick (example hardlink for files and symlinks for directories >>>>>>> are present etc.) . >>>>>>> Regards, >>>>>>> Ravi >>>>>>> >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>> ? >>>>>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>>>>> >>>>>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N < >>>>>>> ravishankar at redhat.com> wrote: >>>>>>> >>>>>>>> >>>>>>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>>>>> >>>>>>>> Dear list, >>>>>>>> >>>>>>>> I seem to have gotten into a tricky situation. Today I brought up a >>>>>>>> shiny new server with new disk arrays and attempted to replace one brick of >>>>>>>> a replica 2 distribute/replicate volume on an older server using the >>>>>>>> `replace-brick` command: >>>>>>>> >>>>>>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>>>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>>>>> >>>>>>>> The command was successful and I see the new brick in the output of >>>>>>>> `gluster volume info`. The problem is that Gluster doesn't seem to be >>>>>>>> migrating the data, >>>>>>>> >>>>>>>> `replace-brick` definitely must heal (not migrate) the data. In >>>>>>>> your case, data must have been healed from Brick-4 to the replaced Brick-3. >>>>>>>> Are there any errors in the self-heal daemon logs of Brick-4's node? Does >>>>>>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>>>>>> date. replace-brick command internally does all the setfattr steps that are >>>>>>>> mentioned in the doc. 
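(As an aside: if I understand Ravi correctly, the pending AFR xattrs he
mentions can be checked directly on a brick with getfattr; the path below is
illustrative. Nonzero counters in a trusted.afr.* value mean heals are
pending that blame the brick named in the xattr:)

    # dump only the AFR changelog xattrs of one file on a brick; each value
    # packs three 32-bit counters (data, metadata, entry); all zeros means
    # nothing is pending
    getfattr -d -m trusted.afr -e hex /data/glusterfs/sdb/homes/aorth/file.dat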
>>>>>>>> >>>>>>>> -Ravi >>>>>>>> >>>>>>>> >>>>>>>> and now the original brick that I replaced is no longer part of the >>>>>>>> volume (and a few terabytes of data are just sitting on the old brick): >>>>>>>> >>>>>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>>>>> Brick1: wingu4:/mnt/gluster/homes >>>>>>>> Brick2: wingu3:/mnt/gluster/homes >>>>>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>>>>> >>>>>>>> I see the Gluster docs have a more complicated procedure for >>>>>>>> replacing bricks that involves getfattr/setfattr?. How can I tell Gluster >>>>>>>> about the old brick? I see that I have a backup of the old volfile thanks >>>>>>>> to yum's rpmsave function if that helps. >>>>>>>> >>>>>>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you >>>>>>>> can give. >>>>>>>> >>>>>>>> ? >>>>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>>>>> >>>>>>>> -- >>>>>>>> Alan Orth >>>>>>>> alan.orth at gmail.com >>>>>>>> https://picturingjordan.com >>>>>>>> https://englishbulgaria.net >>>>>>>> https://mjanja.ch >>>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>>> Nietzsche >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>> >>>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Alan Orth >>>>>>> alan.orth at gmail.com >>>>>>> https://picturingjordan.com >>>>>>> https://englishbulgaria.net >>>>>>> https://mjanja.ch >>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>> Nietzsche >>>>>>> >>>>>>> >>>>>>> _______________________________________________ >>>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>> >>>>>>> >>>>>> >>>>>> -- >>>>>> Alan Orth >>>>>> alan.orth at gmail.com >>>>>> https://picturingjordan.com >>>>>> https://englishbulgaria.net >>>>>> https://mjanja.ch >>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>> Nietzsche >>>>>> >>>>> >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>> Nietzsche >>>>> >>>>> >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>>> >>>> >>> >>> -- >>> Alan Orth >>> alan.orth at gmail.com >>> https://picturingjordan.com >>> https://englishbulgaria.net >>> https://mjanja.ch >>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>> >> >> >> -- >> Alan Orth >> alan.orth at gmail.com >> https://picturingjordan.com >> https://englishbulgaria.net >> https://mjanja.ch >> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- Alan Orth alan.orth at gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch "In heaven all the interesting people are missing." 
?Friedrich Nietzsche -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.orth at gmail.com Sun Jun 9 19:46:51 2019 From: alan.orth at gmail.com (Alan Orth) Date: Sun, 9 Jun 2019 22:46:51 +0300 Subject: [Gluster-users] Does replace-brick migrate data? In-Reply-To: References: <32e26faf-e5c0-b944-2a32-c9eae408b146@redhat.com> <0ab0c28a-48a1-92c0-a106-f4fa94cb620f@redhat.com> <39dcc6a5-1610-93e1-aaff-7fef9b6c1faa@redhat.com> <0aa881db-a724-13be-ff63-6c346d7f55d8@redhat.com> Message-ID: Dear Nithya, A small update: shortly after I sent the message above with xattrs of one "missing" directory, the directory and several others magically appeared on the FUSE mount point (woohoo!). This was several days into the rebalance progress (and about 9 million files scanned!). Now I'm hopeful that Gluster has done the right thing and fixed some more of these issues. I'll wait until the rebalance is done and then assess its work. I will let you know if I have any more questions. Regards, On Sat, Jun 8, 2019 at 11:25 AM Alan Orth wrote: > Thank you, Nithya. > > The "missing" directory is indeed present on all bricks. I enabled > client-log-level DEBUG on the volume and then noticed the following in the > FUSE mount log when doing a `stat` on the "missing" directory on the FUSE > mount: > > [2019-06-08 08:03:30.240738] D [MSGID: 0] > [dht-common.c:3454:dht_do_fresh_lookup] 0-homes-dht: Calling fresh lookup > for /aorth/data on homes-replicate-2 > [2019-06-08 08:03:30.241138] D [MSGID: 0] > [dht-common.c:3013:dht_lookup_cbk] 0-homes-dht: fresh_lookup returned for > /aorth/data with op_ret 0 > [2019-06-08 08:03:30.241610] D [MSGID: 0] > [dht-common.c:1354:dht_lookup_dir_cbk] 0-homes-dht: Internal xattr > trusted.glusterfs.dht.mds is not present on path /aorth/data gfid is > fb87699f-ebf3-4098-977d-85c3a70b849c > [2019-06-08 08:06:18.880961] D [MSGID: 0] > [dht-common.c:1559:dht_revalidate_cbk] 0-homes-dht: revalidate lookup of > /aorth/data returned with op_ret 0 > [2019-06-08 08:06:18.880963] D [MSGID: 0] > [dht-common.c:1651:dht_revalidate_cbk] 0-homes-dht: internal xattr > trusted.glusterfs.dht.mds is not present on path /aorth/data gfid is > fb87699f-ebf3-4098-977d-85c3a70b849c > [2019-06-08 08:06:18.880996] D [MSGID: 0] > [dht-common.c:914:dht_common_mark_mdsxattr] 0-homes-dht: internal xattr > trusted.glusterfs.dht.mds is present on subvolon path /aorth/data gfid is > fb87699f-ebf3-4098-977d-85c3a70b849c > > One message says the trusted.glusterfs.dht.mds xattr is not present, then > the next says it is present. Is that relevant? I looked at the xattrs of > that directory on all the bricks and it does seem to be inconsistent (also > the modification times on the directory are different): > > [root at wingu0 ~]# getfattr -d -m. -e hex /mnt/gluster/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: mnt/gluster/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.afr.dirty=0x000000000000000000000000 > trusted.afr.homes-client-3=0x000000000000000200000002 > trusted.afr.homes-client-5=0x000000000000000000000000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff200000000b6dd59efffffffff > > [root at wingu3 ~]# getfattr -d -m. 
-e hex /mnt/gluster/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: mnt/gluster/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.afr.homes-client-0=0x000000000000000000000000 > trusted.afr.homes-client-1=0x000000000000000000000000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff2000000000000000049251e2d > trusted.glusterfs.dht.mds=0x00000000 > > [root at wingu4 ~]# getfattr -d -m. -e hex /mnt/gluster/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: mnt/gluster/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.afr.homes-client-0=0x000000000000000000000000 > trusted.afr.homes-client-1=0x000000000000000000000000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff2000000000000000049251e2d > trusted.glusterfs.dht.mds=0x00000000 > > [root at wingu05 ~]# getfattr -d -m. -e hex > /data/glusterfs/sdb/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: data/glusterfs/sdb/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.afr.homes-client-2=0x000000000000000000000000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff20000000049251e2eb6dd59ee > > [root at wingu05 ~]# getfattr -d -m. -e hex > /data/glusterfs/sdc/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: data/glusterfs/sdc/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff200000000b6dd59efffffffff > > [root at wingu06 ~]# getfattr -d -m. -e hex > /data/glusterfs/sdb/homes/aorth/data > getfattr: Removing leading '/' from absolute path names > # file: data/glusterfs/sdb/homes/aorth/data > > security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 > trusted.gfid=0xfb87699febf34098977d85c3a70b849c > trusted.glusterfs.dht=0xe7c11ff20000000049251e2eb6dd59ee > > This is a replica 2 volume on Gluster 5.6. > > Thank you, > > On Sat, Jun 8, 2019 at 5:28 AM Nithya Balachandran > wrote: > >> >> >> On Sat, 8 Jun 2019 at 01:29, Alan Orth wrote: >> >>> Dear Ravi, >>> >>> In the last week I have completed a fix-layout and a full INDEX heal on >>> this volume. Now I've started a rebalance and I see a few terabytes of data >>> going around on different bricks since yesterday, which I'm sure is good. >>> >>> While I wait for the rebalance to finish, I'm wondering if you know what >>> would cause directories to be missing from the FUSE mount point? If I list >>> the directories explicitly I can see their contents, but they do not appear >>> in their parent directories' listing. In the case of duplicated files it is >>> always because the files are not on the correct bricks (according to the >>> Dynamo/Elastic Hash algorithm), and I can fix it by copying the file to the >>> correct brick(s) and removing it from the others (along with their >>> .glusterfs hard links). So what could cause directories to be missing? >>> >>> Hi Alan, >> >> The directories that don't show up in the parent directory listing is >> probably because they do not exist on the hashed subvol. Please check the >> backend bricks to see if they are missing on any of them. 
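A loop like the following is one way to run the check Nithya suggests across all bricks at once. This is a sketch only: the hostnames and brick paths are taken from the xattr listings above, and passwordless ssh between the nodes is assumed.

for b in wingu0:/mnt/gluster/homes wingu3:/mnt/gluster/homes \
         wingu4:/mnt/gluster/homes wingu05:/data/glusterfs/sdb/homes \
         wingu05:/data/glusterfs/sdc/homes wingu06:/data/glusterfs/sdb/homes; do
    echo "== $b =="
    # stat fails if the directory is absent on that brick
    ssh "${b%%:*}" "stat -c '%n %F %y' ${b#*:}/aorth/data" || echo "MISSING on $b"
done

A directory that is absent from its hashed subvolume is exactly the case that would make it disappear from the parent listing on the mount.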
>> >> Regards, >> Nithya >> >> Thank you, >>> >>> Thank you, >>> >>> On Wed, Jun 5, 2019 at 1:08 AM Alan Orth wrote: >>> >>>> Hi Ravi, >>>> >>>> You're right that I had mentioned using rsync to copy the brick content >>>> to a new host, but in the end I actually decided not to bring it up on a >>>> new brick. Instead I added the original brick back into the volume. So the >>>> xattrs and symlinks to .glusterfs on the original brick are fine. I think >>>> the problem probably lies with a remove-brick that got interrupted. A few >>>> weeks ago during the maintenance I had tried to remove a brick and then >>>> after twenty minutes and no obvious progress I stopped it?after that the >>>> bricks were still part of the volume. >>>> >>>> In the last few days I have run a fix-layout that took 26 hours and >>>> finished successfully. Then I started a full index heal and it has healed >>>> about 3.3 million files in a few days and I see a clear increase of network >>>> traffic from old brick host to new brick host over that time. Once the full >>>> index heal completes I will try to do a rebalance. >>>> >>>> Thank you, >>>> >>>> >>>> On Mon, Jun 3, 2019 at 7:40 PM Ravishankar N >>>> wrote: >>>> >>>>> >>>>> On 01/06/19 9:37 PM, Alan Orth wrote: >>>>> >>>>> Dear Ravi, >>>>> >>>>> The .glusterfs hardlinks/symlinks should be fine. I'm not sure how I >>>>> could verify them for six bricks and millions of files, though... :\ >>>>> >>>>> Hi Alan, >>>>> >>>>> The reason I asked this is because you had mentioned in one of your >>>>> earlier emails that when you moved content from the old brick to the new >>>>> one, you had skipped the .glusterfs directory. So I was assuming that when >>>>> you added back this new brick to the cluster, it might have been missing >>>>> the .glusterfs entries. If that is the cae, one way to verify could be to >>>>> check using a script if all files on the brick have a link-count of at >>>>> least 2 and all dirs have valid symlinks inside .glusterfs pointing to >>>>> themselves. >>>>> >>>>> >>>>> I had a small success in fixing some issues with duplicated files on >>>>> the FUSE mount point yesterday. I read quite a bit about the elastic >>>>> hashing algorithm that determines which files get placed on which bricks >>>>> based on the hash of their filename and the trusted.glusterfs.dht xattr on >>>>> brick directories (thanks to Joe Julian's blog post and Python script for >>>>> showing how it works?). With that knowledge I looked closer at one of the >>>>> files that was appearing as duplicated on the FUSE mount and found that it >>>>> was also duplicated on more than `replica 2` bricks. For this particular >>>>> file I found two "real" files and several zero-size files with >>>>> trusted.glusterfs.dht.linkto xattrs. Neither of the "real" files were on >>>>> the correct brick as far as the DHT layout is concerned, so I copied one of >>>>> them to the correct brick, deleted the others and their hard links, and did >>>>> a `stat` on the file from the FUSE mount point and it fixed itself. Yay! >>>>> >>>>> Could this have been caused by a replace-brick that got interrupted >>>>> and didn't finish re-labeling the xattrs? >>>>> >>>>> No, replace-brick only initiates AFR self-heal, which just copies the >>>>> contents from the other brick(s) of the *same* replica pair into the >>>>> replaced brick. The link-to files are created by DHT when you rename a >>>>> file from the client. If the new name hashes to a different brick, DHT >>>>> does not move the entire file there. 
It instead creates the link-to file >>>>> (the one with the dht.linkto xattrs) on the hashed subvol. The value of >>>>> this xattr points to the brick where the actual data is there (`getfattr -e >>>>> text` to see it for yourself). Perhaps you had attempted a rebalance or >>>>> remove-brick earlier and interrupted that? >>>>> >>>>> Should I be thinking of some heuristics to identify and fix these >>>>> issues with a script (incorrect brick placement), or is this something a >>>>> fix layout or repeated volume heals can fix? I've already completed a whole >>>>> heal on this particular volume this week and it did heal about 1,000,000 >>>>> files (mostly data and metadata, but about 20,000 entry heals as well). >>>>> >>>>> Maybe you should let the AFR self-heals complete first and then >>>>> attempt a full rebalance to take care of the dht link-to files. But if the >>>>> files are in millions, it could take quite some time to complete. >>>>> Regards, >>>>> Ravi >>>>> >>>>> Thanks for your support, >>>>> >>>>> ? https://joejulian.name/post/dht-misses-are-expensive/ >>>>> >>>>> On Fri, May 31, 2019 at 7:57 AM Ravishankar N >>>>> wrote: >>>>> >>>>>> >>>>>> On 31/05/19 3:20 AM, Alan Orth wrote: >>>>>> >>>>>> Dear Ravi, >>>>>> >>>>>> I spent a bit of time inspecting the xattrs on some files and >>>>>> directories on a few bricks for this volume and it looks a bit messy. Even >>>>>> if I could make sense of it for a few and potentially heal them manually, >>>>>> there are millions of files and directories in total so that's definitely >>>>>> not a scalable solution. After a few missteps with `replace-brick ... >>>>>> commit force` in the last week?one of which on a brick that was >>>>>> dead/offline?as well as some premature `remove-brick` commands, I'm unsure >>>>>> how how to proceed and I'm getting demotivated. It's scary how quickly >>>>>> things get out of hand in distributed systems... >>>>>> >>>>>> Hi Alan, >>>>>> The one good thing about gluster is it that the data is always >>>>>> available directly on the backed bricks even if your volume has >>>>>> inconsistencies at the gluster level. So theoretically, if your cluster is >>>>>> FUBAR, you could just create a new volume and copy all data onto it via its >>>>>> mount from the old volume's bricks. >>>>>> >>>>>> >>>>>> I had hoped that bringing the old brick back up would help, but by >>>>>> the time I added it again a few days had passed and all the brick-id's had >>>>>> changed due to the replace/remove brick commands, not to mention that the >>>>>> trusted.afr.$volume-client-xx values were now probably pointing to the >>>>>> wrong bricks (?). >>>>>> >>>>>> Anyways, a few hours ago I started a full heal on the volume and I >>>>>> see that there is a sustained 100MiB/sec of network traffic going from the >>>>>> old brick's host to the new one. 
The completed heals reported in the logs >>>>>> look promising too: >>>>>> >>>>>> Old brick host: >>>>>> >>>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >>>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >>>>>> 281614 Completed data selfheal >>>>>> 84 Completed entry selfheal >>>>>> 299648 Completed metadata selfheal >>>>>> >>>>>> New brick host: >>>>>> >>>>>> # grep '2019-05-30' /var/log/glusterfs/glustershd.log | grep -o -E >>>>>> 'Completed (data|metadata|entry) selfheal' | sort | uniq -c >>>>>> 198256 Completed data selfheal >>>>>> 16829 Completed entry selfheal >>>>>> 229664 Completed metadata selfheal >>>>>> >>>>>> So that's good I guess, though I have no idea how long it will take >>>>>> or if it will fix the "missing files" issue on the FUSE mount. I've >>>>>> increased cluster.shd-max-threads to 8 to hopefully speed up the heal >>>>>> process. >>>>>> >>>>>> The afr xattrs should not cause files to disappear from mount. If the >>>>>> xattr names do not match what each AFR subvol expects (for eg. in a replica >>>>>> 2 volume, trusted.afr.*-client-{0,1} for 1st subvol, client-{2,3} for 2nd >>>>>> subvol and so on - ) for its children then it won't heal the data, that is >>>>>> all. But in your case I see some inconsistencies like one brick having the >>>>>> actual file (licenseserver.cfg) and the other having a linkto file >>>>>> (the one with the dht.linkto xattr) *in the same replica pair*. >>>>>> >>>>>> >>>>>> I'd be happy for any advice or pointers, >>>>>> >>>>>> Did you check if the .glusterfs hardlinks/symlinks exist and are in >>>>>> order for all bricks? >>>>>> >>>>>> -Ravi >>>>>> >>>>>> >>>>>> On Wed, May 29, 2019 at 5:20 PM Alan Orth >>>>>> wrote: >>>>>> >>>>>>> Dear Ravi, >>>>>>> >>>>>>> Thank you for the link to the blog post series?it is very >>>>>>> informative and current! If I understand your blog post correctly then I >>>>>>> think the answer to your previous question about pending AFRs is: no, there >>>>>>> are no pending AFRs. I have identified one file that is a good test case to >>>>>>> try to understand what happened after I issued the `gluster volume >>>>>>> replace-brick ... commit force` a few days ago and then added the same >>>>>>> original brick back to the volume later. This is the current state of the >>>>>>> replica 2 distribute/replicate volume: >>>>>>> >>>>>>> [root at wingu0 ~]# gluster volume info apps >>>>>>> >>>>>>> Volume Name: apps >>>>>>> Type: Distributed-Replicate >>>>>>> Volume ID: f118d2da-79df-4ee1-919d-53884cd34eda >>>>>>> Status: Started >>>>>>> Snapshot Count: 0 >>>>>>> Number of Bricks: 3 x 2 = 6 >>>>>>> Transport-type: tcp >>>>>>> Bricks: >>>>>>> Brick1: wingu3:/mnt/gluster/apps >>>>>>> Brick2: wingu4:/mnt/gluster/apps >>>>>>> Brick3: wingu05:/data/glusterfs/sdb/apps >>>>>>> Brick4: wingu06:/data/glusterfs/sdb/apps >>>>>>> Brick5: wingu0:/mnt/gluster/apps >>>>>>> Brick6: wingu05:/data/glusterfs/sdc/apps >>>>>>> Options Reconfigured: >>>>>>> diagnostics.client-log-level: DEBUG >>>>>>> storage.health-check-interval: 10 >>>>>>> nfs.disable: on >>>>>>> >>>>>>> I checked the xattrs of one file that is missing from the volume's >>>>>>> FUSE mount (though I can read it if I access its full path explicitly), but >>>>>>> is present in several of the volume's bricks (some with full size, others >>>>>>> empty): >>>>>>> >>>>>>> [root at wingu0 ~]# getfattr -d -m. 
-e hex >>>>>>> /mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> >>>>>>> getfattr: Removing leading '/' from absolute path names >>>>>>> # file: mnt/gluster/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>>> trusted.afr.apps-client-3=0x000000000000000000000000 >>>>>>> trusted.afr.apps-client-5=0x000000000000000000000000 >>>>>>> trusted.afr.dirty=0x000000000000000000000000 >>>>>>> trusted.bit-rot.version=0x0200000000000000585a396f00046e15 >>>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>>> >>>>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> getfattr: Removing leading '/' from absolute path names >>>>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>>>> >>>>>>> [root at wingu05 ~]# getfattr -d -m. -e hex /data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> getfattr: Removing leading '/' from absolute path names >>>>>>> # file: data/glusterfs/sdc/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>>> >>>>>>> [root at wingu06 ~]# getfattr -d -m. -e hex /data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> getfattr: Removing leading '/' from absolute path names >>>>>>> # file: data/glusterfs/sdb/apps/clcgenomics/clclicsrv/licenseserver.cfg >>>>>>> security.selinux=0x73797374656d5f753a6f626a6563745f723a756e6c6162656c65645f743a733000 >>>>>>> trusted.gfid=0x878003a2fb5243b6a0d14d2f8b4306bd >>>>>>> trusted.gfid2path.82586deefbc539c3=0x34666437323861612d356462392d343836382d616232662d6564393031636566333561392f6c6963656e73657365727665722e636667 >>>>>>> trusted.glusterfs.dht.linkto=0x617070732d7265706c69636174652d3200 >>>>>>> >>>>>>> According to the trusted.afr.apps-client-xx xattrs this particular >>>>>>> file should be on bricks with id "apps-client-3" and "apps-client-5". It >>>>>>> took me a few hours to realize that the brick-id values are recorded in the >>>>>>> volume's volfiles in /var/lib/glusterd/vols/apps/bricks. After comparing >>>>>>> those brick-id values with a volfile backup from before the replace-brick, >>>>>>> I realized that the files are simply on the wrong brick now as far as >>>>>>> Gluster is concerned. This particular file is now on the brick for >>>>>>> "apps-client-4". As an experiment I copied this one file to the two >>>>>>> bricks listed in the xattrs and I was then able to see the file from the >>>>>>> FUSE mount (yay!). >>>>>>> >>>>>>> Other than replacing the brick, removing it, and then adding the old >>>>>>> brick on the original server back, there has been no change in the data >>>>>>> this entire time. Can I change the brick IDs in the volfiles so they >>>>>>> reflect where the data actually is? 
Or perhaps script something to reset >>>>>>> all the xattrs on the files/directories to point to the correct bricks? >>>>>>> >>>>>>> Thank you for any help or pointers, >>>>>>> >>>>>>> On Wed, May 29, 2019 at 7:24 AM Ravishankar N < >>>>>>> ravishankar at redhat.com> wrote: >>>>>>> >>>>>>>> >>>>>>>> On 29/05/19 9:50 AM, Ravishankar N wrote: >>>>>>>> >>>>>>>> >>>>>>>> On 29/05/19 3:59 AM, Alan Orth wrote: >>>>>>>> >>>>>>>> Dear Ravishankar, >>>>>>>> >>>>>>>> I'm not sure if Brick4 had pending AFRs because I don't know what >>>>>>>> that means and it's been a few days so I am not sure I would be able to >>>>>>>> find that information. >>>>>>>> >>>>>>>> When you find some time, have a look at a blog >>>>>>>> series I wrote about AFR- I've tried to >>>>>>>> explain what one needs to know to debug replication related issues in it. >>>>>>>> >>>>>>>> Made a typo error. The URL for the blog is https://wp.me/peiBB-6b >>>>>>>> >>>>>>>> -Ravi >>>>>>>> >>>>>>>> >>>>>>>> Anyways, after wasting a few days rsyncing the old brick to a new >>>>>>>> host I decided to just try to add the old brick back into the volume >>>>>>>> instead of bringing it up on the new host. I created a new brick directory >>>>>>>> on the old host, moved the old brick's contents into that new directory >>>>>>>> (minus the .glusterfs directory), added the new brick to the volume, and >>>>>>>> then did Vlad's find/stat trick? from the brick to the FUSE mount point. >>>>>>>> >>>>>>>> The interesting problem I have now is that some files don't appear >>>>>>>> in the FUSE mount's directory listings, but I can actually list them >>>>>>>> directly and even read them. What could cause that? >>>>>>>> >>>>>>>> Not sure, too many variables in the hacks that you did to take a >>>>>>>> guess. You can check if the contents of the .glusterfs folder are in order >>>>>>>> on the new brick (example hardlink for files and symlinks for directories >>>>>>>> are present etc.) . >>>>>>>> Regards, >>>>>>>> Ravi >>>>>>>> >>>>>>>> >>>>>>>> Thanks, >>>>>>>> >>>>>>>> ? >>>>>>>> https://lists.gluster.org/pipermail/gluster-users/2018-February/033584.html >>>>>>>> >>>>>>>> On Fri, May 24, 2019 at 4:59 PM Ravishankar N < >>>>>>>> ravishankar at redhat.com> wrote: >>>>>>>> >>>>>>>>> >>>>>>>>> On 23/05/19 2:40 AM, Alan Orth wrote: >>>>>>>>> >>>>>>>>> Dear list, >>>>>>>>> >>>>>>>>> I seem to have gotten into a tricky situation. Today I brought up >>>>>>>>> a shiny new server with new disk arrays and attempted to replace one brick >>>>>>>>> of a replica 2 distribute/replicate volume on an older server using the >>>>>>>>> `replace-brick` command: >>>>>>>>> >>>>>>>>> # gluster volume replace-brick homes wingu0:/mnt/gluster/homes >>>>>>>>> wingu06:/data/glusterfs/sdb/homes commit force >>>>>>>>> >>>>>>>>> The command was successful and I see the new brick in the output >>>>>>>>> of `gluster volume info`. The problem is that Gluster doesn't seem to be >>>>>>>>> migrating the data, >>>>>>>>> >>>>>>>>> `replace-brick` definitely must heal (not migrate) the data. In >>>>>>>>> your case, data must have been healed from Brick-4 to the replaced Brick-3. >>>>>>>>> Are there any errors in the self-heal daemon logs of Brick-4's node? Does >>>>>>>>> Brick-4 have pending AFR xattrs blaming Brick-3? The doc is a bit out of >>>>>>>>> date. replace-brick command internally does all the setfattr steps that are >>>>>>>>> mentioned in the doc. 
>>>>>>>>> >>>>>>>>> -Ravi >>>>>>>>> >>>>>>>>> >>>>>>>>> and now the original brick that I replaced is no longer part of >>>>>>>>> the volume (and a few terabytes of data are just sitting on the old brick): >>>>>>>>> >>>>>>>>> # gluster volume info homes | grep -E "Brick[0-9]:" >>>>>>>>> Brick1: wingu4:/mnt/gluster/homes >>>>>>>>> Brick2: wingu3:/mnt/gluster/homes >>>>>>>>> Brick3: wingu06:/data/glusterfs/sdb/homes >>>>>>>>> Brick4: wingu05:/data/glusterfs/sdb/homes >>>>>>>>> Brick5: wingu05:/data/glusterfs/sdc/homes >>>>>>>>> Brick6: wingu06:/data/glusterfs/sdc/homes >>>>>>>>> >>>>>>>>> I see the Gluster docs have a more complicated procedure for >>>>>>>>> replacing bricks that involves getfattr/setfattr?. How can I tell Gluster >>>>>>>>> about the old brick? I see that I have a backup of the old volfile thanks >>>>>>>>> to yum's rpmsave function if that helps. >>>>>>>>> >>>>>>>>> We are using Gluster 5.6 on CentOS 7. Thank you for any advice you >>>>>>>>> can give. >>>>>>>>> >>>>>>>>> ? >>>>>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick >>>>>>>>> >>>>>>>>> -- >>>>>>>>> Alan Orth >>>>>>>>> alan.orth at gmail.com >>>>>>>>> https://picturingjordan.com >>>>>>>>> https://englishbulgaria.net >>>>>>>>> https://mjanja.ch >>>>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>>>> Nietzsche >>>>>>>>> >>>>>>>>> _______________________________________________ >>>>>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Alan Orth >>>>>>>> alan.orth at gmail.com >>>>>>>> https://picturingjordan.com >>>>>>>> https://englishbulgaria.net >>>>>>>> https://mjanja.ch >>>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>>> Nietzsche >>>>>>>> >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>> >>>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Alan Orth >>>>>>> alan.orth at gmail.com >>>>>>> https://picturingjordan.com >>>>>>> https://englishbulgaria.net >>>>>>> https://mjanja.ch >>>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>>> Nietzsche >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Alan Orth >>>>>> alan.orth at gmail.com >>>>>> https://picturingjordan.com >>>>>> https://englishbulgaria.net >>>>>> https://mjanja.ch >>>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>>> Nietzsche >>>>>> >>>>>> >>>>> >>>>> -- >>>>> Alan Orth >>>>> alan.orth at gmail.com >>>>> https://picturingjordan.com >>>>> https://englishbulgaria.net >>>>> https://mjanja.ch >>>>> "In heaven all the interesting people are missing." ?Friedrich >>>>> Nietzsche >>>>> >>>>> >>>> >>>> -- >>>> Alan Orth >>>> alan.orth at gmail.com >>>> https://picturingjordan.com >>>> https://englishbulgaria.net >>>> https://mjanja.ch >>>> "In heaven all the interesting people are missing." ?Friedrich Nietzsche >>>> >>> >>> >>> -- >>> Alan Orth >>> alan.orth at gmail.com >>> https://picturingjordan.com >>> https://englishbulgaria.net >>> https://mjanja.ch >>> "In heaven all the interesting people are missing." 
--Friedrich Nietzsche
>>> _______________________________________________
>>> Gluster-users mailing list
>>> Gluster-users at gluster.org
>>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>
>
> --
> Alan Orth
> alan.orth at gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
> "In heaven all the interesting people are missing." --Friedrich Nietzsche

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." --Friedrich Nietzsche

From dcunningham at voisonics.com  Mon Jun 10 03:50:30 2019
From: dcunningham at voisonics.com (David Cunningham)
Date: Mon, 10 Jun 2019 15:50:30 +1200
Subject: [Gluster-users] Transport endpoint is not connected
In-Reply-To: <863936144.3309002.1559648882741@mail.yahoo.com>
References: <20r8rlguxb86gpnxjwe3wpqw.1559189511842@email.android.com>
 <863936144.3309002.1559648882741@mail.yahoo.com>
Message-ID: 

Thank you Strahil.

On Tue, 4 Jun 2019 at 23:48, Strahil Nikolov wrote:

> Hi David,
>
> You can ensure that 49152-49160 are opened in advance...
> You never know when you will need to deploy another Gluster Volume.
>
> Best Regards,
> Strahil Nikolov
>
> On Monday, 3 June 2019 at 18:16:00 GMT-4, David Cunningham <
> dcunningham at voisonics.com> wrote:
>
>
> Hello all,
>
> We confirmed that the network provider blocking port 49152 was the issue.
> Thanks for all the help.
>
>
> On Thu, 30 May 2019 at 16:11, Strahil wrote:
>
> You can try to run a ncat from gfs3:
>
> ncat -z -v gfs1 49152
> ncat -z -v gfs2 49152
>
> If ncat fails to connect -> it's definitely a firewall.
>
> Best Regards,
> Strahil Nikolov
> On May 30, 2019 01:33, David Cunningham wrote:
>
> Hi Ravi,
>
> I think it probably is a firewall issue with the network provider. I was
> hoping to see a specific connection failure message we could send to them,
> but will take it up with them anyway.
>
> Thanks for your help.
>
>
> On Wed, 29 May 2019 at 23:10, Ravishankar N wrote:
>
> I don't see a "Connected to gvol0-client-1" in the log. Perhaps a
> firewall issue like the last time? Even in the earlier add-brick log from
> the other email thread, connection to the 2nd brick was not established.
>
> -Ravi
> On 29/05/19 2:26 PM, David Cunningham wrote:
>
> Hi Ravi and Joe,
>
> The command "gluster volume status gvol0" shows all 3 nodes as being
> online, even on gfs3 as below. I've attached the glfsheal-gvol0.log, in
> which I can't see anything like a connection error. Would you have any
> further suggestions? Thank you.
> > [root at gfs3 glusterfs]# gluster volume status gvol0
> > Status of volume: gvol0
> > Gluster process                             TCP Port  RDMA Port  Online  Pid
> > ------------------------------------------------------------------------------
> > Brick gfs1:/nodirectwritedata/gluster/gvol0 49152     0          Y       7706
> > Brick gfs2:/nodirectwritedata/gluster/gvol0 49152     0          Y       7625
> > Brick gfs3:/nodirectwritedata/gluster/gvol0 49152     0          Y       7307
> > Self-heal Daemon on localhost               N/A       N/A        Y       7316
> > Self-heal Daemon on gfs1                    N/A       N/A        Y       40591
> > Self-heal Daemon on gfs2                    N/A       N/A        Y       7634
> >
> > Task Status of Volume gvol0
> > ------------------------------------------------------------------------------
> > There are no active volume tasks
> >
> >
> > On Wed, 29 May 2019 at 16:26, Ravishankar N wrote:
> >
> >
> > On 29/05/19 6:21 AM, David Cunningham wrote:
> >
> >
>
> --
> David Cunningham, Voisonics Limited
> http://voisonics.com/
> USA: +1 213 221 1092
> New Zealand: +64 (0)28 2558 3782

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782

From hgowtham at redhat.com  Mon Jun 10 13:28:10 2019
From: hgowtham at redhat.com (hgowtham at redhat.com)
Date: Mon, 10 Jun 2019 13:28:10 +0000
Subject: [Gluster-users] Invitation: Gluster Community Meeting (APAC
 friendly hours) @ Tue Jun 11, 2019 11:30am - 12:30pm (IST)
 (gluster-users@gluster.org)
Message-ID: <000000000000c30d9c058af8268f@google.com>

You have been invited to the following event.

Title: Gluster Community Meeting (APAC friendly hours)
Bridge: https://bluejeans.com/836554017
Meeting minutes: https://hackmd.io/A07qMrezSOyeUUGxPhBHqQ?both
Previous Meeting notes: http://github.com/gluster/community
When: Tue Jun 11, 2019 11:30am - 12:30pm India Standard Time - Kolkata
Where: https://bluejeans.com/836554017
Calendar: gluster-users at gluster.org
Who:
    * hgowtham at redhat.com - organizer
    * gluster-users at gluster.org
    * gluster-devel at gluster.org
From snowmailer at gmail.com  Mon Jun 10 13:50:08 2019
From: snowmailer at gmail.com (snowmailer)
Date: Mon, 10 Jun 2019 15:50:08 +0200
Subject: [Gluster-users] No healing on peer disconnect - is it correct?
In-Reply-To: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>
References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>
Message-ID: <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com>

Can someone advise on this, please?

BR!

On 3 Jun 2019 at 18:58, Martin wrote:

> Hi all,
>
> I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have a simple Replica 3 - Number of Bricks: 1 x 3 = 3.
>
> When one of my hypervisors is disconnected as a peer, i.e. the gluster process is down but the bricks are running, the other two healthy nodes start signalling that they lost one peer. This is correct.
> Next, I restart the gluster process on the node where it failed, and I thought it should trigger healing of files on the failed node, but nothing is happening.
>
> I run VM disks on this gluster volume. No healing is triggered after the gluster restart, the remaining two nodes get the peer back after the restart, and everything is running without downtime.
> Even VMs that are running on the 'failed' node where the gluster process was down (bricks were up) are running without downtime.
>
> Is this behaviour correct? I mean that no healing is triggered after the peer is reconnected back and the VMs keep running.
>
> Thanks for the explanation.
>
> BR!
> Martin

From hgowtham at redhat.com  Mon Jun 10 14:07:24 2019
From: hgowtham at redhat.com (Hari Gowtham)
Date: Mon, 10 Jun 2019 19:37:24 +0530
Subject: [Gluster-users] No healing on peer disconnect - is it correct?
In-Reply-To: <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com>
References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>
 <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com>
Message-ID: 

On Mon, Jun 10, 2019 at 7:21 PM snowmailer wrote:
>
> Can someone advise on this, please?
>
> BR!
>
> On 3 Jun 2019 at 18:58, Martin wrote:
>
> > Hi all,
> >
> > I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have a simple Replica 3 - Number of Bricks: 1 x 3 = 3.
> >
> > When one of my hypervisors is disconnected as a peer, i.e. the gluster process is down but the bricks are running, the other two healthy nodes start signalling that they lost one peer. This is correct.
> > Next, I restart the gluster process on the node where it failed, and I thought it should trigger healing of files on the failed node, but nothing is happening.
> >
> > I run VM disks on this gluster volume. No healing is triggered after the gluster restart, the remaining two nodes get the peer back after the restart, and everything is running without downtime.
> > Even VMs that are running on the 'failed' node where the gluster process was down (bricks were up) are running without downtime.

I assume your VMs use gluster as the storage. In that case, the
gluster volume might be mounted on all the hypervisors.
The mount/client is smart enough to give the correct data from the
other two machines which were always up.
This is the reason things are working fine.

Gluster should heal the brick.
Adding people who can help you better with the heal part.
@Karthik Subrahmanya @Ravishankar N do take a look and answer this part.

> > Is this behaviour correct? I mean that no healing is triggered after the peer is reconnected back and the VMs keep running.
> >
> > Thanks for the explanation.
> >
> > BR!
> > Martin
> >
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

--
Regards,
Hari Gowtham.

From snowmailer at gmail.com  Mon Jun 10 14:23:46 2019
From: snowmailer at gmail.com (Martin)
Date: Mon, 10 Jun 2019 16:23:46 +0200
Subject: [Gluster-users] No healing on peer disconnect - is it correct?
In-Reply-To: 
References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>
 <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com>
Message-ID: 

My VMs use Gluster as storage through libgfapi support in Qemu, but I don't see any healing of the reconnected brick.

Thanks Karthik / Ravishankar in advance!

> On 10 Jun 2019, at 16:07, Hari Gowtham wrote:
>
> On Mon, Jun 10, 2019 at 7:21 PM snowmailer wrote:
>>
>> Can someone advise on this, please?
>>
>> BR!
>>
>> On 3 Jun 2019 at 18:58, Martin wrote:
>>
>>> Hi all,
>>>
>>> I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have a simple Replica 3 - Number of Bricks: 1 x 3 = 3.
>>>
>>> When one of my hypervisors is disconnected as a peer, i.e. the gluster process is down but the bricks are running, the other two healthy nodes start signalling that they lost one peer. This is correct.
>>> Next, I restart the gluster process on the node where it failed, and I thought it should trigger healing of files on the failed node, but nothing is happening.
>>>
>>> I run VM disks on this gluster volume. No healing is triggered after the gluster restart, the remaining two nodes get the peer back after the restart, and everything is running without downtime.
>>> Even VMs that are running on the 'failed' node where the gluster process was down (bricks were up) are running without downtime.
>
> I assume your VMs use gluster as the storage. In that case, the
> gluster volume might be mounted on all the hypervisors.
> The mount/client is smart enough to give the correct data from the
> other two machines which were always up.
> This is the reason things are working fine.
>
> Gluster should heal the brick.
> Adding people who can help you better with the heal part.
> @Karthik Subrahmanya @Ravishankar N do take a look and answer this part.
>
>>> Is this behaviour correct? I mean that no healing is triggered after the peer is reconnected back and the VMs keep running.
>>>
>>> Thanks for the explanation.
>>>
>>> BR!
>>> Martin
>>>
>>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-users
>
>
> --
> Regards,
> Hari Gowtham.

From dcunningham at voisonics.com  Tue Jun 11 04:25:40 2019
From: dcunningham at voisonics.com (David Cunningham)
Date: Tue, 11 Jun 2019 16:25:40 +1200
Subject: [Gluster-users] Thin-arbiter questions
In-Reply-To: <645227359.16980056.1557131647054.JavaMail.zimbra@redhat.com>
References: <757816852.16925254.1557123682731.JavaMail.zimbra@redhat.com>
 <645227359.16980056.1557131647054.JavaMail.zimbra@redhat.com>
Message-ID: 

Hi Ashish and Amar,

Is there any news on when thin-arbiter might be in the regular GlusterFS,
and the CentOS packages please? Thanks for your help.
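For context, the thin-arbiter documentation that this thread refers to sketches a create syntax along the following lines. The hostnames are placeholders, and as discussed below the option is not recognized by the 5.x CLI, so this only works where thin-arbiter support has actually landed:

gluster volume create tavol replica 2 thin-arbiter 1 \
    host1:/bricks/brick1 host2:/bricks/brick1 tahost:/bricks/ta

The third path stores only a small replica-id file per volume rather than a full copy of the data, which is what distinguishes a thin arbiter from a regular arbiter brick.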
On Mon, 6 May 2019 at 20:34, Ashish Pandey wrote:

>
> ------------------------------
> *From: *"David Cunningham"
> *To: *"Ashish Pandey"
> *Cc: *"gluster-users"
> *Sent: *Monday, May 6, 2019 1:40:30 PM
> *Subject: *Re: [Gluster-users] Thin-arbiter questions
>
> Hi Ashish,
>
> Thank you for the update. Does that mean they're now in the regular
> Glusterfs? Any idea how long it typically takes the Ubuntu and CentOS
> packages to be updated with the latest code?
>
> No, for regular glusterd, work is still in progress. It will be done soon.
> I don't have an answer for the next question. Maybe Amar has information
> regarding this. Adding him in CC.
>
>
> On Mon, 6 May 2019 at 18:21, Ashish Pandey wrote:
>
>> Hi,
>>
>> I can see that Amar has already committed the changes and those are
>> visible on
>> https://docs.gluster.org/en/latest/Administrator%20Guide/Thin-Arbiter-Volumes/
>>
>> ---
>> Ashish
>>
>> ------------------------------
>> *From: *"Strahil"
>> *To: *"Ashish" , "David"
>> *Cc: *"gluster-users"
>> *Sent: *Saturday, May 4, 2019 12:10:01 AM
>> *Subject: *Re: [Gluster-users] Thin-arbiter questions
>>
>> Hi Ashish,
>>
>> Can someone commit the doc change I have already proposed?
>> At least, the doc will clarify that fact.
>>
>> Best Regards,
>> Strahil Nikolov
>> On May 3, 2019 05:30, Ashish Pandey wrote:
>>
>> Hi David,
>>
>> Creation of thin-arbiter volume is currently supported by GD2 only. The
>> command "glustercli" is available when glusterd2 is running.
>> We are also working on providing thin-arbiter support on glusterd;
>> however, it is not available right now.
>> https://review.gluster.org/#/c/glusterfs/+/22612/
>>
>> ---
>> Ashish
>>
>> ------------------------------
>> *From: *"David Cunningham"
>> *To: *gluster-users at gluster.org
>> *Sent: *Friday, May 3, 2019 7:40:03 AM
>> *Subject: *[Gluster-users] Thin-arbiter questions
>>
>> Hello,
>>
>> We are setting up a thin-arbiter and hope someone can help with some
>> questions. We've been following the documentation from
>> https://docs.gluster.org/en/latest/Administrator%20Guide/Thin-Arbiter-Volumes/
>>
>> 1. What release of 5.x supports thin-arbiter? We tried a "gluster volume
>> create" with the --thin-arbiter option on 5.5 and got an "unrecognized
>> option --thin-arbiter" error.
>>
>> 2. The instruction to create a new volume with a thin-arbiter is clear.
>> How do you add a thin-arbiter to an already existing volume though?
>>
>> 3. The documentation suggests running glusterfsd manually to start the
>> thin-arbiter. Is there a service that can do this instead? I found a
>> mention of one in https://bugzilla.redhat.com/show_bug.cgi?id=1579786
>> but it's not really documented.
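Until a packaged service exists, one way to address question 3 is to wrap the glusterfsd invocation from the thin-arbiter guide in a small systemd unit. This is only a sketch: the glusterfsd flags follow the guide linked above, while the unit name and volfile path are placeholders rather than anything shipped by the packages.

cat > /etc/systemd/system/gluster-ta.service <<'EOF'
[Unit]
Description=GlusterFS thin-arbiter process
After=network.target

[Service]
# -N keeps glusterfsd in the foreground so systemd can supervise it
ExecStart=/usr/sbin/glusterfsd -N --volfile-id ta-vol \
  -f /var/lib/glusterd/vols/thin-arbiter.vol --brick-port 24007 \
  --xlator-option ta-vol-server.transport.socket.listen-port=24007
Restart=on-failure

[Install]
WantedBy=multi-user.target
EOF
systemctl daemon-reload && systemctl enable --now gluster-ta.service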
>> Thanks in advance for your help,
>>
>> --
>> David Cunningham, Voisonics Limited
>> http://voisonics.com/
>> USA: +1 213 221 1092
>> New Zealand: +64 (0)28 2558 3782
>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>
>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>
>
> --
> David Cunningham, Voisonics Limited
> http://voisonics.com/
> USA: +1 213 221 1092
> New Zealand: +64 (0)28 2558 3782
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782

From ravishankar at redhat.com  Tue Jun 11 04:50:10 2019
From: ravishankar at redhat.com (Ravishankar N)
Date: Tue, 11 Jun 2019 10:20:10 +0530
Subject: [Gluster-users] No healing on peer disconnect - is it correct?
In-Reply-To: 
References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com>
 <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com>
Message-ID: <28417fb7-5081-cc8e-7ffc-625f9905f9c2@redhat.com>

There will be pending heals only when the brick process goes down or
there is a disconnect between the client and that brick. When you say
"gluster process is down but bricks running", I'm guessing you killed
only glusterd and not the glusterfsd brick process. That won't cause any
pending heals. If there is something to be healed, `gluster volume heal
$volname info` will display the list of files.

Hope that helps,
Ravi

On 10/06/19 7:53 PM, Martin wrote:
> My VMs use Gluster as storage through libgfapi support in Qemu, but I
> don't see any healing of the reconnected brick.
>
> Thanks Karthik / Ravishankar in advance!
>
>> On 10 Jun 2019, at 16:07, Hari Gowtham wrote:
>>
>> On Mon, Jun 10, 2019 at 7:21 PM snowmailer wrote:
>>>
>>> Can someone advise on this, please?
>>>
>>> BR!
>>>
>>> On 3 Jun 2019 at 18:58, Martin wrote:
>>>
>>>> Hi all,
>>>>
>>>> I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have a simple Replica 3 - Number of Bricks: 1 x 3 = 3.
>>>>
>>>> When one of my hypervisors is disconnected as a peer, i.e. the gluster process is down but the bricks are running, the other two healthy nodes start signalling that they lost one peer. This is correct.
>>>> Next, I restart the gluster process on the node where it failed, and I thought it should trigger healing of files on the failed node, but nothing is happening.
>>>>
>>>> I run VM disks on this gluster volume. No healing is triggered after the gluster restart, the remaining two nodes get the peer back after the restart, and everything is running without downtime.
>>>> Even VMs that are running on the 'failed' node where the gluster process was down (bricks were up) are running without downtime.
>>
>> I assume your VMs use gluster as the storage. In that case, the
>> gluster volume might be mounted on all the hypervisors.
>> The mount/client is smart enough to give the correct data from the
>> other two machines which were always up.
>> This is the reason things are working fine.
>>
>> Gluster should heal the brick.
>> Adding people who can help you better with the heal part.
>> @Karthik Subrahmanya @Ravishankar N do take a look and answer this part.
>>
>>>> Is this behaviour correct? I mean that no healing is triggered after the peer is reconnected back and the VMs keep running.
>>>>
>>>> Thanks for the explanation.
>>>>
>>>> BR!
>>>> Martin
>>>>
>>>>
>>> _______________________________________________
>>> Gluster-users mailing list
>>> Gluster-users at gluster.org
>>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>
>>
>> --
>> Regards,
>> Hari Gowtham.

From aspandey at redhat.com  Tue Jun 11 05:15:22 2019
From: aspandey at redhat.com (Ashish Pandey)
Date: Tue, 11 Jun 2019 01:15:22 -0400 (EDT)
Subject: [Gluster-users] Thin-arbiter questions
In-Reply-To: 
References: <757816852.16925254.1557123682731.JavaMail.zimbra@redhat.com>
 <645227359.16980056.1557131647054.JavaMail.zimbra@redhat.com>
Message-ID: <2006209001.22227657.1560230122663.JavaMail.zimbra@redhat.com>

Hi David,

It should be any time soon as we are in the last phase of patch reviews.
You can follow this patch - https://review.gluster.org/#/c/glusterfs/+/22612/

---
Ashish

----- Original Message -----

From: "David Cunningham"
To: "Ashish Pandey"
Cc: "gluster-users"
Sent: Tuesday, June 11, 2019 9:55:40 AM
Subject: Re: [Gluster-users] Thin-arbiter questions

Hi Ashish and Amar,

Is there any news on when thin-arbiter might be in the regular GlusterFS,
and the CentOS packages please? Thanks for your help.

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782
From hgowtham at redhat.com  Tue Jun 11 11:56:51 2019
From: hgowtham at redhat.com (Hari Gowtham)
Date: Tue, 11 Jun 2019 17:26:51 +0530
Subject: [Gluster-users] Announcing Gluster release 4.1.9
Message-ID: 

The Gluster community is pleased to announce the release of Gluster
4.1.9 (packages available at [1]).

Release notes for the release can be found at [2].

Major changes, features and limitations addressed in this release:
None

Thanks,
Gluster community

[1] Packages for 4.1.9:
https://download.gluster.org/pub/gluster/glusterfs/4/4.1.9/

[2] Release notes for 4.1.9:
https://docs.gluster.org/en/latest/release-notes/4.1.9/

--
Regards,
Hari Gowtham.

From alan.orth at gmail.com  Tue Jun 11 15:41:43 2019
From: alan.orth at gmail.com (Alan Orth)
Date: Tue, 11 Jun 2019 18:41:43 +0300
Subject: [Gluster-users] Proper command for replace-brick on
 distribute-replicate?
Message-ID: 

Dear list,

In a recent discussion on this list Ravi suggested that the documentation
for replace-brick[1] was out of date. For a distribute-replicate volume the
documentation currently says that we need to kill the old brick's PID,
create a temporary empty directory on the FUSE mount, check the xattrs,
replace-brick with commit force.

Is all this still necessary? I'm running Gluster 5.6 on CentOS 7 with a
distribute-replicate volume.

Thank you,

[1] https://docs.gluster.org/en/latest/Administrator Guide/Managing Volumes/

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." --Friedrich Nietzsche

From ravishankar at redhat.com  Tue Jun 11 16:32:23 2019
From: ravishankar at redhat.com (Ravishankar N)
Date: Tue, 11 Jun 2019 22:02:23 +0530
Subject: [Gluster-users] Proper command for replace-brick on
 distribute-replicate?
In-Reply-To: 
References: 
Message-ID: <15442218-f033-002a-72a5-ee21bbde00d7@redhat.com>

On 11/06/19 9:11 PM, Alan Orth wrote:
> Dear list,
>
> In a recent discussion on this list Ravi suggested that the
> documentation for replace-brick[1] was out of date. For a
> distribute-replicate volume the documentation currently says that we
> need to kill the old brick's PID, create a temporary empty directory
> on the FUSE mount, check the xattrs, replace-brick with commit force.
>
> Is all this still necessary? I'm running Gluster 5.6 on CentOS 7 with
> a distribute-replicate volume.

No, all these very steps are 'codified' into the `replace-brick commit
force` command via https://review.gluster.org/#/c/glusterfs/+/10076/ and
https://review.gluster.org/#/c/glusterfs/+/10448/. You can see the
commit messages of these 2 patches for more details.

You can play around with most of these commands in a 1 node setup if you
want to convince yourself that they work. There is no need to form a
cluster.
[root at tuxpad glusterfs]# gluster v create testvol replica 3 127.0.0.2:/home/ravi/bricks/brick{1..3} force [root at tuxpad glusterfs]# gluster v start testvol [root at tuxpad glusterfs]# mount -t glusterfs 127.0.0.2:testvol /mnt/fuse_mnt/ [root at tuxpad glusterfs]# touch /mnt/fuse_mnt/FILE [root at tuxpad glusterfs]# ll /home/ravi/bricks/brick*/FILE -rw-r--r--. 2 root root 0 Jun 11 21:55 /home/ravi/bricks/brick1/FILE -rw-r--r--. 2 root root 0 Jun 11 21:55 /home/ravi/bricks/brick2/FILE -rw-r--r--. 2 root root 0 Jun 11 21:55 /home/ravi/bricks/brick3/FILE [root at tuxpad glusterfs]# gluster v replace-brick testvol 127.0.0.2:/home/ravi/bricks/brick3 127.0.0.2:/home/ravi/bricks/brick3_new commit force volume replace-brick: success: replace-brick commit force operation successful [root at tuxpad glusterfs]# ll /home/ravi/bricks/brick3_new/FILE -rw-r--r--. 2 root root 0 Jun 11 21:55 /home/ravi/bricks/brick3_new/FILE Why don't you send a patch to update the doc for replace-brick? I'd be happy to review it. ;-) HTH, Ravi > > Thank you, > > ? https://docs.gluster.org/en/latest/Administrator Guide/Managing Volumes/ > -- > Alan Orth > alan.orth at gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > "In heaven all the interesting people are missing." ?Friedrich Nietzsche > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan.orth at gmail.com Wed Jun 12 08:08:53 2019 From: alan.orth at gmail.com (Alan Orth) Date: Wed, 12 Jun 2019 11:08:53 +0300 Subject: [Gluster-users] =?utf-8?q?Proper_command_for_replace-brick_on_dis?= =?utf-8?q?tribute=E2=80=93replicate=3F?= In-Reply-To: <15442218-f033-002a-72a5-ee21bbde00d7@redhat.com> References: <15442218-f033-002a-72a5-ee21bbde00d7@redhat.com> Message-ID: Dear Ravi, Thanks for the confirmation?I replaced a brick in a volume last night and by the morning I see that Gluster has replicated data there, though I don't have any indication of its progress. The `gluster v heal volume info` and `gluster v heal volume info split-brain` are all looking good so I guess that's enough of an indication. One question, though. Immediately after I replaced the brick I checked `gluster v status volume` and I saw the following: Task Status of Volume volume ------------------------------------------------------------------------------ Task : Rebalance ID : a890e99c-5715-4bc1-80ee-c28490612135 Status : not started I did not initiate a rebalance, so it's strange to see it there. Is Gluster hinting that I should start a rebalance? If a rebalance is "not started" shouldn't Gluster just not show it at all? Regarding the patch to the documentation: absolutely! Let me just get my Gluster back in order after my confusing upgrade last month. :P Thanks, On Tue, Jun 11, 2019 at 7:32 PM Ravishankar N wrote: > > On 11/06/19 9:11 PM, Alan Orth wrote: > > Dear list, > > In a recent discussion on this list Ravi suggested that the documentation > for replace-brick? was out of date. For a distribute?replicate volume the > documentation currently says that we need to kill the old brick's PID, > create a temporary empty directory on the FUSE mount, check the xattrs, > replace-brick with commit force. > > Is all this still necessary? I'm running Gluster 5.6 on CentOS 7 with a > distribute?replicate volume. 
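For reference: after a brick has been replaced this way, the self-heal daemon still has to copy the data onto the new brick. A minimal way to watch that happen - a sketch assuming the same testvol volume from the transcript above (any volume name works); both subcommands are standard gluster CLI:

# gluster volume heal testvol info                     # entries still pending heal, per brick
# gluster volume heal testvol statistics heal-count    # just the per-brick counts
# watch -n 60 'gluster volume heal testvol statistics heal-count'

Heal is done when the counts drop to zero on all bricks.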
From alan.orth at gmail.com Wed Jun 12 08:08:53 2019
From: alan.orth at gmail.com (Alan Orth)
Date: Wed, 12 Jun 2019 11:08:53 +0300
Subject: [Gluster-users] Proper command for replace-brick on distribute–replicate?
Message-ID:

Dear Ravi,

Thanks for the confirmation. I replaced a brick in a volume last night and by the morning I see that Gluster has replicated data there, though I don't have any indication of its progress. `gluster v heal volume info` and `gluster v heal volume info split-brain` are both looking good, so I guess that's enough of an indication.

One question, though. Immediately after I replaced the brick I checked `gluster v status volume` and I saw the following:

Task Status of Volume volume
------------------------------------------------------------------------------
Task                 : Rebalance
ID                   : a890e99c-5715-4bc1-80ee-c28490612135
Status               : not started

I did not initiate a rebalance, so it's strange to see it there. Is Gluster hinting that I should start a rebalance? If a rebalance is "not started" shouldn't Gluster just not show it at all?

Regarding the patch to the documentation: absolutely! Let me just get my Gluster back in order after my confusing upgrade last month. :P

Thanks,

--
Alan Orth
alan.orth at gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
"In heaven all the interesting people are missing." –Friedrich Nietzsche

From ravishankar at redhat.com Wed Jun 12 11:00:14 2019
From: ravishankar at redhat.com (Ravishankar N)
Date: Wed, 12 Jun 2019 16:30:14 +0530
Subject: [Gluster-users] Proper command for replace-brick on distribute–replicate?
Message-ID:

On 12/06/19 1:38 PM, Alan Orth wrote:
> The `gluster v heal volume info` and `gluster v heal volume info
> split-brain` are all looking good so I guess that's enough of an
> indication.

Yes, right now, heal info showing no files is the indication. A new command for pending heal time estimation is something that is being worked upon. See https://github.com/gluster/glusterfs/issues/643

> I did not initiate a rebalance, so it's strange to see it there. If a
> rebalance is "not started" shouldn't Gluster just not show it at all?

`replace-brick` should not show rebalance status. Not sure why you're seeing it. Adding Nithya for help.

> Regarding the patch to the documentation: absolutely!

Great. Please send the PR for the https://github.com/gluster/glusterdocs/ project. I think docs/Administrator Guide/Managing Volumes.md is the file that needs to be updated.

-Ravi
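For a stray task like the one above, the rebalance state can also be cross-checked directly; a sketch using Alan's volume name (volume):

# gluster volume rebalance volume status

If no rebalance was ever run, this should fail with a "rebalance not started" type of error rather than show a stale task, which would support the point that the task entry in `gluster v status` is the anomaly.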
From sandeepkumar at nus.edu.sg Thu Jun 13 16:15:20 2019
From: sandeepkumar at nus.edu.sg (Kumar Sandeep)
Date: Thu, 13 Jun 2019 16:15:20 +0000
Subject: [Gluster-users] Gluster performance on NVMe SSDs
Message-ID:

Hi,

I have set up oVirt hyperconverged with 3x DL325, with 3 NVMe SSDs on each server for the Gluster volumes: engine, vmstore and data, in replica 3. I am getting almost a 4x performance difference when I benchmark the gluster disk with dd.

Below output is from one of the gluster hosts:

time dd if=/dev/zero of=/gluster/vmstore/testfile bs=1M count=1000 oflag=direct
1000+0 records in
1000+0 records out
1048576000 bytes (1.0 GB) copied, 0.546909 s, 1.9 GB/s

real 0m0.552s
user 0m0.001s
sys 0m0.165s

Below output is from a VM:

# time dd if=/dev/zero of=/root/testfile bs=1M count=1000 oflag=direct
1000+0 records in
1000+0 records out
1048576000 bytes (1.0 GB) copied, 1.89253 s, 554 MB/s

real 0m1.896s
user 0m0.002s
sys 0m0.247s

Earlier I was getting around ~270MB/s in that VM; it improved after setting engine-config -s LibgfApiSupported=true --cver=4.3

I am using the default tuned profile (virtual-host). The backend network is 10GbE, tested with iPerf, and it gives above 9.40 Gbits/sec bandwidth.

# gluster v info vmstore

Volume Name: vmstore
Type: Replicate
Volume ID: b1f0c1fb-8b8b-479a-9117-61d45692047a
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x 3 = 3
Transport-type: tcp
Bricks:
Brick1: daal-gluster.mbi.nus.edu.sg:/gluster/vmstore/brick1
Brick2: naan-gluster.mbi.nus.edu.sg:/gluster/vmstore/brick1
Brick3: dosa-gluster.mbi.nus.edu.sg:/gluster/vmstore/brick1
Options Reconfigured:
server.event-threads: 4
client.event-threads: 4
performance.client-io-threads: off
nfs.disable: on
transport.address-family: inet
performance.quick-read: off
performance.read-ahead: off
performance.io-cache: on
performance.low-prio-threads: 32
network.remote-dio: enable
cluster.eager-lock: enable
cluster.quorum-type: auto
cluster.server-quorum-type: server
cluster.data-self-heal-algorithm: full
cluster.locking-scheme: granular
cluster.shd-max-threads: 8
cluster.shd-wait-qlength: 10000
features.shard: on
user.cifs: off
storage.owner-uid: 36
storage.owner-gid: 36
features.shard-block-size: 512MB
performance.cache-size: 1GB
performance.io-thread-count: 32
performance.write-behind-window-size: 4MB

Is there any way I can improve my gluster disk's performance?

Thanks for helping out.

Regards,
Sandeep

________________________________
Important: This email is confidential and may be privileged. If you are not the intended recipient, please delete it and notify us immediately; you should not copy or use it for any purpose, nor disclose its contents to any other person. Thank you.
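To see where the gap between brick-local dd and in-VM dd is spent, gluster's built-in profiler is a reasonable first step; a sketch against the vmstore volume from the mail above (standard gluster CLI commands):

# gluster volume profile vmstore start
# (run the dd test inside the VM)
# gluster volume profile vmstore info
# gluster volume profile vmstore stop

The per-FOP latency table in the info output shows whether time is going into the WRITE FOPs themselves or into the lookups and locking around them.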
It seems to be that after a restart glusterd couldn't found the pid files for the freshly created brick processes and create new brick processes. One can see in the brick logs that for all the volumes that two brick processes were created just one after another. Result: Two brick processes for each of the volumes volume1, volume2 and test. "gluster vo status" shows that the pid number was mapped to the wrong port number for hydmedia and impax But beside of that the volume was working correctly. I resolve that issue with a workaround. Kill all brick processes and restart glusterd. After that everything is fine. Is this a bug in glusterd? You can find all relevant informations attached below Regards David Spisla -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: duplicated-bricks.tar.gz Type: application/x-gzip Size: 8796 bytes Desc: not available URL: From khiremat at redhat.com Fri Jun 14 08:43:53 2019 From: khiremat at redhat.com (Kotresh Hiremath Ravishankar) Date: Fri, 14 Jun 2019 14:13:53 +0530 Subject: [Gluster-users] Geo Replication Stop even after migratingto 5.6 In-Reply-To: References: Message-ID: It's about complete re-sync. The idea is to set the stime xattr which marks the sync time to 0 on all the bricks. If lot of the data is not synced to slave, this is not very useful. You can as well delete the geo-rep session with 'reset-sync-time' option and re-setup. I prefer the second way. Thanks, Kotresh HR On Fri, Jun 14, 2019 at 12:48 PM deepu srinivasan wrote: > Hi Guys > Yes, I will try the root geo-rep setup and update you back. > Meanwhile is there any procedure for the below-quoted info in the docs? > >> Synchronization is not complete >> >> *Description*: GlusterFS geo-replication did not synchronize the data >> completely but the geo-replication status displayed is OK. >> >> *Solution*: You can enforce a full sync of the data by erasing the index >> and restarting GlusterFS geo-replication. After restarting, GlusterFS >> geo-replication begins synchronizing all the data. All files are compared >> using checksum, which can be a lengthy and high resource utilization >> operation on large data sets. >> >> > On Fri, Jun 14, 2019 at 12:30 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> Could you please try root geo-rep setup and update back? >> >> On Fri, Jun 14, 2019 at 12:28 PM deepu srinivasan >> wrote: >> >>> Hi Any updates on this >>> >>> >>> On Thu, Jun 13, 2019 at 5:43 PM deepu srinivasan >>> wrote: >>> >>>> Hi Guys >>>> Hope you remember the issue I reported for geo replication hang status >>>> on History Crawl. >>>> So you advised me to update the gluster version. previously I was using >>>> 4.1 now I upgraded to 5.6/Still after deleting the previous geo-rep session >>>> and creating a new one the geo-rep session hangs. Is there any other way >>>> that I could solve the issue. >>>> I heard that I could redo the whole geo-replication again. How could I >>>> do that? >>>> Please help. >>>> >>> >> >> -- >> Thanks and Regards, >> Kotresh H R >> > -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... 
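The "second way" Kotresh prefers looks roughly like this; a sketch with placeholder names (mastervol, slavehost and slavevol are hypothetical), following the documented geo-replication CLI:

# gluster volume geo-replication mastervol slavehost::slavevol stop
# gluster volume geo-replication mastervol slavehost::slavevol delete reset-sync-time
# gluster volume geo-replication mastervol slavehost::slavevol create push-pem force
# gluster volume geo-replication mastervol slavehost::slavevol start

The reset-sync-time option clears the stored sync-time marker, so the new session syncs from the beginning rather than from where the old one claimed to be.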
From amukherj at redhat.com Fri Jun 14 17:03:05 2019
From: amukherj at redhat.com (Atin Mukherjee)
Date: Fri, 14 Jun 2019 22:33:05 +0530
Subject: [Gluster-users] Duplicated brick processes after restart of glusterd
Message-ID:

Please see https://bugzilla.redhat.com/show_bug.cgi?id=1696147 which is fixed in 5.6. It's a race, and I believe you're hitting it. The title of the bug makes it look like an shd + brick multiplexing combination, but it's applicable to plain bricks too.

On Fri, Jun 14, 2019 at 2:07 PM David Spisla wrote:
> It seems that after a restart glusterd couldn't find the pid files for the
> freshly created brick processes and created new brick processes.
> Is this a bug in glusterd?

From kahlil.talledo at webqem.com Mon Jun 17 01:39:37 2019
From: kahlil.talledo at webqem.com (Kahlil Talledo)
Date: Mon, 17 Jun 2019 01:39:37 +0000
Subject: [Gluster-users] Bricks has different disk usage
Message-ID: <1560735577387.74924@webqem.com>

Hello,

I currently have a gluster (glusterfs 3.7.15) volume (distributed-replicated) with 4 bricks (2 x 2 = 4), and the bricks themselves seem to have different usage. I had the impression that they should all be equal?

All bricks have a 250GB disk. One set of bricks (2 bricks) is at 91% usage, while the other set (2 bricks) is at 16% usage.

Originally, the volume (250GB total) started out with just one set of 2 bricks. Then 2 more bricks were added to expand the volume to 500GB. A rebalance was done after adding the new bricks, as well as a fix-layout.

Really not sure what to make of this. Details below:

Volume Name: glustervol0
Type: Distributed-Replicate
Volume ID: 243e0652-5b95-4f63-bcf6-f7c60a75ff83
Status: Started
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: dc1-x-smb-clust-01-1:/vol1/brick1
Brick2: dc1-x-smb-clust-01-2:/vol1/brick1
Brick3: dc1-x-smb-clust-01-1:/vol2/brick2
Brick4: dc1-x-smb-clust-01-2:/vol2/brick2

/dev/xvdb1  250G  226G   25G  91%  /vol1
/dev/xvdc1  250G   40G  211G  16%  /vol2

Cheers,

--------------------------------------------------
Kahlil Talledo
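Uneven usage across DHT subvolumes is often just a handful of large files hashing to the same subvolume, since DHT places whole files rather than striping them. A sketch of how to check, using the paths from the mail above (somedir is a placeholder; reading trusted xattrs requires root on the brick host):

# du -sh /vol1/brick1/* | sort -h                                 # find what is skewing the full subvolume
# gluster volume rebalance glustervol0 status                     # confirm the rebalance actually completed
# getfattr -n trusted.glusterfs.dht -e hex /vol1/brick1/somedir   # hash range assigned to this brick's copy of the dir

If a few very large files dominate one replica pair, that is expected DHT behaviour rather than a broken layout.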
From spisla80 at gmail.com Mon Jun 17 06:50:27 2019
From: spisla80 at gmail.com (David Spisla)
Date: Mon, 17 Jun 2019 08:50:27 +0200
Subject: [Gluster-users] Duplicated brick processes after restart of glusterd
Message-ID:

Hello Atin,

thank you for the clarification.

On Fri, 14 Jun 2019 at 19:03, Atin Mukherjee wrote:
> Please see https://bugzilla.redhat.com/show_bug.cgi?id=1696147 which is
> fixed in 5.6. It's a race, and I believe you're hitting it.

From spisla80 at gmail.com Mon Jun 17 10:15:15 2019
From: spisla80 at gmail.com (David Spisla)
Date: Mon, 17 Jun 2019 12:15:15 +0200
Subject: [Gluster-users] Pending heal status when deleting files which are marked as to be healed
Message-ID:

Hello Gluster Community,

my newest observation concerns the self-heal daemon.

Scenario: 2-node Gluster v5.5 cluster with a Replica 2 volume, just one brick per node, access via SMB client from a Win10 machine.

How to reproduce: I created a small folder with a lot of small files and copied that folder recursively into itself a few times. Additionally I copied three big folders with a lot of content into the root of the volume. Note: no node was down, nor any brick, so the whole volume was accessible. Because of the recursive copy action, all of the copied files were listed as to be healed (via gluster heal info). Now I set some of the affected files read-only (they get WORMed because worm-file-level is enabled). After this I tried to delete the parent folder of those files.

Expected: all files should be healed.
Actually: the files which are read-only are not healed. heal info permanently shows that these files have to be healed. The glustershd log throws errors, and the brick log (with level DEBUG) permanently throws a lot of messages which I don't understand. See the attached file, which contains all information (also heal info and volume info) besides the logs.

Maybe some of you know what's going on there? Since we can reproduce this scenario, we can give more debug information if needed.

Regards
David Spisla
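When entries are stuck in heal info like this, the AFR changelog xattrs on the bricks usually say why; a sketch, with the volume name, brick path and file path as placeholders:

# gluster volume heal volume1 info
# getfattr -d -m trusted.afr -e hex /gluster/brick/path/to/stuck/file

Non-zero trusted.afr.*-client-* values show which copy is marked dirty. If shd cannot modify the file to heal it (for example, if a WORM translator rejects the write), those values would never get cleared - a hypothesis consistent with the symptom above, not a confirmed diagnosis.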
From dcunningham at voisonics.com Mon Jun 17 22:34:31 2019
From: dcunningham at voisonics.com (David Cunningham)
Date: Tue, 18 Jun 2019 10:34:31 +1200
Subject: [Gluster-users] Thin-arbiter questions
In-Reply-To: <2006209001.22227657.1560230122663.JavaMail.zimbra@redhat.com>
Message-ID:

Hi Ashish,

Thanks for that. I guess it's not your responsibility, but do you know how long it typically takes for new versions to reach the CentOS package system after being released?

On Tue, 11 Jun 2019 at 17:15, Ashish Pandey wrote:
> It should be any time soon as we are in the last phase of patch reviews.
> You can follow this patch - https://review.gluster.org/#/c/glusterfs/+/22612/

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782

From hgowtham at redhat.com Tue Jun 18 05:24:19 2019
From: hgowtham at redhat.com (Hari Gowtham)
Date: Tue, 18 Jun 2019 10:54:19 +0530
Subject: [Gluster-users] Thin-arbiter questions
Message-ID:

Hi David,

Once a feature is added to the master branch, we have to backport it to the release 5, 6 and other such active branches. These release branches are tagged every month around the 10th. If a feature has been backported to the particular release branch before tagging, it will be part of that tag, and the tag is what the packages are created from. This is the procedure for CentOS, Fedora and Debian.

Regards,
Hari.

On Tue, 18 Jun, 2019, 4:06 AM David Cunningham, wrote:
> Do you know how long it typically takes for new versions to reach the
> CentOS package system after being released?
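In plain git terms, the backport step Hari describes looks roughly like this (illustrative only; actual gluster backports go through review on review.gluster.org rather than a direct push, and the branch and sha below are placeholders):

$ git checkout release-6
$ git cherry-pick -x abc1234    # -x records the original master commit id in the message
$ # post the cherry-picked change for review on the release-6 branch

Only a change merged on the release branch before the monthly tagging makes it into that month's packages.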
From dcunningham at voisonics.com Tue Jun 18 22:53:47 2019
From: dcunningham at voisonics.com (David Cunningham)
Date: Wed, 19 Jun 2019 10:53:47 +1200
Subject: [Gluster-users] Thin-arbiter questions
Message-ID:

Hi Hari,

Thanks for that information. So if I understand correctly, if thin-arbiter is committed to the master branch by the 10th of July, then it should be in the CentOS package fairly soon afterwards?

I have a customer asking when we can use it, hence the questions. Thank you.

On Tue, 18 Jun 2019 at 17:24, Hari Gowtham wrote:
> Once a feature is added to the master branch, we have to backport it to
> the release 5, 6 and other such active branches. And these release
> branches will be tagged every month around the 10th.

--
David Cunningham, Voisonics Limited
http://voisonics.com/
USA: +1 213 221 1092
New Zealand: +64 (0)28 2558 3782
From wkmail at bneit.com Wed Jun 19 00:25:38 2019
From: wkmail at bneit.com (wkmail)
Date: Tue, 18 Jun 2019 17:25:38 -0700
Subject: [Gluster-users] Thin arbiter daemon on non-thin setup?
Message-ID:

On a brand new Ubuntu 18 / Gluster 6.2 replicate 3 arbiter 1 (normal arbiter) setup.

glusterfs-server/bionic,now 6.2-ubuntu1~bionic1 amd64 [installed]
  clustered file-system (server package)

Systemd is degraded and I see this in the systemctl listing:

gluster-ta-volume.service    loaded failed failed    GlusterFS, Thin-arbiter process to maintain quorum for replica volume

systemctl status shows this:

gluster-ta-volume.service - GlusterFS, Thin-arbiter process to maintain quorum for replica volume
   Loaded: loaded (/lib/systemd/system/gluster-ta-volume.service; enabled; vendor preset: enabled)
   Active: failed (Result: exit-code) since Sun 2019-06-16 12:36:15 PDT; 2 days ago
  Process: 13020 ExecStart=/usr/sbin/glusterfsd -N --volfile-id ta-vol -f /var/lib/glusterd/thin-arbiter/thin-arbiter.vol --brick-port 24007 --xlator-option ta-vol-server.transport.socket.listen-port=24007 (code=exited, status=255)
 Main PID: 13020 (code=exited, status=255)

Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: gluster-ta-volume.service: Service hold-off time over, scheduling restart.
Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: gluster-ta-volume.service: Scheduled restart job, restart counter is at 5.
Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: Stopped GlusterFS, Thin-arbiter process to maintain quorum for replica volume.
Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: gluster-ta-volume.service: Start request repeated too quickly.
Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: gluster-ta-volume.service: Failed with result 'exit-code'.
Jun 16 12:36:15 onetest3.pixelgate.net systemd[1]: Failed to start GlusterFS, Thin-arbiter process to maintain quorum for replica volume.

Since I am not using Thin Arbiter, I am a little confused. The Gluster setup itself seems fine and seems to work normally.

root at onetest2:/var/log/libvirt/qemu# gluster peer status
Number of Peers: 2

Hostname: onetest1.gluster
Uuid: 79dc67df-c606-42f8-bbee-f7e73c730eb8
State: Peer in Cluster (Connected)

Hostname: onetest3.gluster
Uuid: d4e3330b-eaac-4a54-ad2e-a0da1114ec09
State: Peer in Cluster (Connected)

root at onetest2:/var/log/libvirt/qemu# gluster volume info

Volume Name: gv0
Type: Replicate
Volume ID: 1a80b833-0850-4ddb-83fa-f36da2b7a8fc
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x (2 + 1) = 3
Transport-type: tcp
Bricks:
Brick1: onetest2.gluster:/GLUSTER/gv0
Brick2: onetest3.gluster:/GLUSTER/gv0
Brick3: onetest1.gluster:/GLUSTER/gv0 (arbiter)

Thoughts? Can I just disable or remove that service?

Sincerely,
W Kern

From aspandey at redhat.com Wed Jun 19 02:44:10 2019
From: aspandey at redhat.com (Ashish Pandey)
Date: Tue, 18 Jun 2019 22:44:10 -0400 (EDT)
Subject: [Gluster-users] Thin arbiter daemon on non-thin setup?
Message-ID: <2014326970.23604883.1560912250773.JavaMail.zimbra@redhat.com>

Hi,

Yes, you can stop/disable gluster-ta-volume.service using the systemctl command. I will also check and see why it is even trying to load thin-arbiter for a non thin-arbiter volume, but for now you can just disable it.

---
Ashish

----- Original Message -----
From: "wkmail"
Sent: Wednesday, June 19, 2019 5:55:38 AM
Subject: [Gluster-users] Thin arbiter daemon on non-thin setup?

Since I am not using Thin Arbiter, I am a little confused. The Gluster setup itself seems fine and seems to work normally. Thoughts? Can I just disable or remove that service?
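Spelled out, the disable-plus-cleanup Ashish describes, including clearing the "degraded" state systemd reports (standard systemd commands):

# journalctl -u gluster-ta-volume.service -b    # optional: see why glusterfsd exited with status 255
# systemctl stop gluster-ta-volume.service
# systemctl disable gluster-ta-volume.service
# systemctl reset-failed gluster-ta-volume.service

reset-failed is what removes the unit from the failed list, so `systemctl status` stops showing the system as degraded.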
>>>> You can follow this patch - >>>> https://review.gluster.org/#/c/glusterfs/+/22612/ >>>> >>>> --- >>>> Ashish >>>> >>>> ------------------------------ >>>> *From: *"David Cunningham" >>>> *To: *"Ashish Pandey" >>>> *Cc: *"gluster-users" >>>> *Sent: *Tuesday, June 11, 2019 9:55:40 AM >>>> *Subject: *Re: [Gluster-users] Thin-arbiter questions >>>> >>>> Hi Ashish and Amar, >>>> >>>> Is there any news on when thin-arbiter might be in the regular >>>> GlusterFS, and the CentOS packages please? >>>> >>>> Thanks for your help. >>>> >>>> >>>> On Mon, 6 May 2019 at 20:34, Ashish Pandey wrote: >>>> >>>>> >>>>> >>>>> ------------------------------ >>>>> *From: *"David Cunningham" >>>>> *To: *"Ashish Pandey" >>>>> *Cc: *"gluster-users" >>>>> *Sent: *Monday, May 6, 2019 1:40:30 PM >>>>> *Subject: *Re: [Gluster-users] Thin-arbiter questions >>>>> >>>>> Hi Ashish, >>>>> >>>>> Thank you for the update. Does that mean they're now in the regular >>>>> Glusterfs? Any idea how long it typically takes the Ubuntu and CentOS >>>>> packages to be updated with the latest code? >>>>> >>>>> No, for regular glusterd, work is still in progress. It will be done >>>>> soon. >>>>> I don't have answer for the next question. May be Amar have >>>>> information regarding this. Adding him in CC. >>>>> >>>>> >>>>> On Mon, 6 May 2019 at 18:21, Ashish Pandey >>>>> wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> I can see that Amar has already committed the changes and those are >>>>>> visible on >>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Thin-Arbiter-Volumes/ >>>>>> >>>>>> --- >>>>>> Ashish >>>>>> >>>>>> >>>>>> >>>>>> ------------------------------ >>>>>> *From: *"Strahil" >>>>>> *To: *"Ashish" , "David" < >>>>>> dcunningham at voisonics.com> >>>>>> *Cc: *"gluster-users" >>>>>> *Sent: *Saturday, May 4, 2019 12:10:01 AM >>>>>> *Subject: *Re: [Gluster-users] Thin-arbiter questions >>>>>> >>>>>> Hi Ashish, >>>>>> >>>>>> Can someone commit the doc change I have already proposed ? >>>>>> At least, the doc will clarify that fact . >>>>>> >>>>>> Best Regards, >>>>>> Strahil Nikolov >>>>>> On May 3, 2019 05:30, Ashish Pandey wrote: >>>>>> >>>>>> Hi David, >>>>>> >>>>>> Creation of thin-arbiter volume is currently supported by GD2 only. >>>>>> The command "glustercli" is available when glusterd2 is running. >>>>>> We are also working on providing thin-arbiter support on glusted >>>>>> however, it is not available right now. >>>>>> https://review.gluster.org/#/c/glusterfs/+/22612/ >>>>>> >>>>>> --- >>>>>> Ashish >>>>>> >>>>>> ------------------------------ >>>>>> *From: *"David Cunningham" >>>>>> *To: *gluster-users at gluster.org >>>>>> *Sent: *Friday, May 3, 2019 7:40:03 AM >>>>>> *Subject: *[Gluster-users] Thin-arbiter questions >>>>>> >>>>>> Hello, >>>>>> >>>>>> We are setting up a thin-arbiter and hope someone can help with some >>>>>> questions. We've been following the documentation from >>>>>> https://docs.gluster.org/en/latest/Administrator%20Guide/Thin-Arbiter-Volumes/ >>>>>> . >>>>>> >>>>>> 1. What release of 5.x supports thin-arbiter? We tried a "gluster >>>>>> volume create" with the --thin-arbiter option on 5.5 and got an >>>>>> "unrecognized option --thin-arbiter" error. >>>>>> >>>>>> 2. The instruction to create a new volume with a thin-arbiter is >>>>>> clear. How do you add a thin-arbiter to an already existing volume though? >>>>>> >>>>>> 3. The documentation suggests running glusterfsd manually to start >>>>>> the thin-arbiter. Is there a service that can do this instead? 
I found a >>>>>> mention of one in https://bugzilla.redhat.com/show_bug.cgi?id=1579786 >>>>>> but it's not really documented. >>>>>> >>>>>> Thanks in advance for your help, >>>>>> >>>>>> -- >>>>>> David Cunningham, Voisonics Limited >>>>>> http://voisonics.com/ >>>>>> USA: +1 213 221 1092 >>>>>> New Zealand: +64 (0)28 2558 3782 >>>>>> >>>>>> _______________________________________________ >>>>>> Gluster-users mailing list >>>>>> Gluster-users at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>> >>>>>> >>>>>> _______________________________________________ >>>>>> Gluster-users mailing list >>>>>> Gluster-users at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>> >>>>>> >>>>> >>>>> -- >>>>> David Cunningham, Voisonics Limited >>>>> http://voisonics.com/ >>>>> USA: +1 213 221 1092 >>>>> New Zealand: +64 (0)28 2558 3782 >>>>> >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>>> >>>> >>>> -- >>>> David Cunningham, Voisonics Limited >>>> http://voisonics.com/ >>>> USA: +1 213 221 1092 >>>> New Zealand: +64 (0)28 2558 3782 >>>> >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>> >>> -- >>> David Cunningham, Voisonics Limited >>> http://voisonics.com/ >>> USA: +1 213 221 1092 >>> New Zealand: +64 (0)28 2558 3782 >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> > > -- > David Cunningham, Voisonics Limited > http://voisonics.com/ > USA: +1 213 221 1092 > New Zealand: +64 (0)28 2558 3782 > -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.buitelaar at gmail.com Wed Jun 19 12:21:29 2019 From: olaf.buitelaar at gmail.com (Olaf Buitelaar) Date: Wed, 19 Jun 2019 14:21:29 +0200 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op Message-ID: Dear All, Has anybody seen this error on gluster 5.6; [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op checking the code; https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 doesn't seem to reveal much on what could causing this. It's the second time this occurs. Attached the full stack. Thanks Olaf -------------- next part -------------- An HTML attachment was scrubbed... 
URL: -------------- next part -------------- [2019-06-19 07:25:03.289265] E [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op [2019-06-19 07:25:03.289410] E [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op [2019-06-19 07:25:03.290103] E [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op [2019-06-19 07:25:03.291407] E [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op [2019-06-19 07:25:03.299574] E [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) [0x7fbfac50db7a] -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op pending frames: frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : 
type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) 
op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) frame : type(0) op(0) patchset: git://git.gluster.org/glusterfs.git signal received: 11 time of crash: 2019-06-19 07:25:03 configuration details: argp 1 backtrace 1 dlfcn 1 libpthread 1 llistxattr 1 setfsid 1 spinlock 1 epoll.h 1 xattr.h 1 st_atim.tv_nsec 1 package-string: glusterfs 5.6 /lib64/libglusterfs.so.0(+0x26610)[0x7fbfb7a35610] /lib64/libglusterfs.so.0(gf_print_trace+0x334)[0x7fbfb7a3fbc4] /lib64/libc.so.6(+0x36280)[0x7fbfb6098280] /lib64/libglusterfs.so.0(+0x197a8)[0x7fbfb7a287a8] /lib64/libglusterfs.so.0(+0x1ce18)[0x7fbfb7a2be18] /lib64/libglusterfs.so.0(dict_get_strn+0x67)[0x7fbfb7a2d887] /usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x63ef8)[0x7fbfac4f7ef8] /usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x775dc)[0x7fbfac50b5dc] /usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a)[0x7fbfac50db7a] /lib64/libgfrpc.so.0(+0xec60)[0x7fbfb7801c60] /lib64/libgfrpc.so.0(+0xf033)[0x7fbfb7802033] /lib64/libgfrpc.so.0(rpc_transport_notify+0x23)[0x7fbfb77fdf23] /usr/lib64/glusterfs/5.6/rpc-transport/socket.so(+0xa026)[0x7fbfa94e4026] /lib64/libglusterfs.so.0(+0x8aa79)[0x7fbfb7a99a79] /lib64/libpthread.so.0(+0x7dd5)[0x7fbfb6898dd5] /lib64/libc.so.6(clone+0x6d)[0x7fbfb615fead] --------- From amukherj at redhat.com Wed Jun 19 15:57:14 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 19 Jun 2019 21:27:14 +0530 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar wrote: > Dear All, > > Has anybody seen this error on gluster 5.6; > [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] > (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] > 
-->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) > [0x7fbfac50db7a] > -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) > [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op > > checking the code; > https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 > > doesn't seem to reveal much on what could be causing this. > > It's the second time this occurs. > > Attached the full stack. > > Thanks Olaf > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From ravishankar at redhat.com Wed Jun 19 16:06:10 2019 From: ravishankar at redhat.com (Ravishankar N) Date: Wed, 19 Jun 2019 21:36:10 +0530 Subject: [Gluster-users] Pending heal status when deleting files which are marked as to be healed In-Reply-To: References: Message-ID: <165ac2cb-4e12-81bf-bb47-ef800adf6652@redhat.com> On 17/06/19 3:45 PM, David Spisla wrote: > Hello Gluster Community, > > my newest observation concerns the self heal daemon: > Scenario: 2 Node Gluster v5.5 Cluster with Replica 2 Volume. Just one > brick per node. Access via SMB Client from a Win10 machine > > How to reproduce: > I have created a small folder with a lot of small files and I copied > that folder recursively into itself for a few times. Additionally I > copied three big folders with a lot of content into the root of the > volume. > Note: There was no node down or something else like brick down, etc.. > So the whole volume was accessible. > > Because of the recursive copy action all these copied files were > listed as to be healed (via gluster heal info). This is odd. How did you conclude that writing to the volume (i.e. recursive copy) was the reason for the files to be needing heal? Did you check if there were any gluster messages about disconnects in the smb client logs? > Now I set some of the affected files ReadOnly (they get WORMed because > worm-file-level is enabled). After this I tried to delete the parent > folder of those files. > > Expected: All files should be healed > Actually: All files which are Read-Only are not healed. heal info > shows permanently that these files have to be healed. Does disabling read-only let the files be healed? > > glustershd log throws errors and the brick log (with level DEBUG) > permanently throws a lot of messages which I don't understand. See the > attached file which contains all information, also heal info and > volume info, beside the logs > > Maybe some of you know what's going on there? Since we can reproduce > this scenario, we can give more debug information if needed. Is it possible to script the list of steps to reproduce this issue? Regards, Ravi > > Regards > David Spisla > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.buitelaar at gmail.com Wed Jun 19 17:00:25 2019 From: olaf.buitelaar at gmail.com (Olaf Buitelaar) Date: Wed, 19 Jun 2019 19:00:25 +0200 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Hi Atin, Thank you for pointing out this bug report, however no rebalancing task was running during this event.
So maybe something else is causing this? According to the report this should be fixed in gluster 6; unfortunately oVirt doesn't seem to officially support that version, so I'm stuck on the 5 branch for now. Any chance this will be backported? Thanks Olaf Op wo 19 jun. 2019 om 17:57 schreef Atin Mukherjee : > Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 > > > > On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar > wrote: > >> Dear All, >> >> Has anybody seen this error on gluster 5.6; >> [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] >> (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] >> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) >> [0x7fbfac50db7a] >> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) >> [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op >> >> checking the code; >> https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 >> >> doesn't seem to reveal much on what could be causing this. >> >> It's the second time this occurs. >> >> Attached the full stack. >> >> Thanks Olaf >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From srakonde at redhat.com Thu Jun 20 06:34:51 2019 From: srakonde at redhat.com (Sanju Rakonde) Date: Thu, 20 Jun 2019 12:04:51 +0530 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Olaf, Can you please paste the complete backtrace from the core file, so that we can analyse what is wrong here. On Wed, Jun 19, 2019 at 10:31 PM Olaf Buitelaar wrote: > Hi Atin, > > Thank you for pointing out this bug report, however no rebalancing task > was running during this event. So maybe something else is causing this? > According to the report this should be fixed in gluster 6; unfortunately > oVirt doesn't seem to officially support that version, so I'm stuck on the 5 > branch for now. > Any chance this will be backported? > > Thanks Olaf > > > Op wo 19 jun. 2019 om 17:57 schreef Atin Mukherjee : > >> Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 >> >> >> >> On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar >> wrote: >> >>> Dear All, >>> >>> Has anybody seen this error on gluster 5.6; >>> [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] >>> (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] >>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) >>> [0x7fbfac50db7a] >>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) >>> [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op >>> >>> checking the code; >>> https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 >>> >>> doesn't seem to reveal much on what could be causing this. >>> >>> It's the second time this occurs. >>> >>> Attached the full stack.
>>> Thanks Olaf >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From cristian.delcarlo at targetsolutions.it Thu Jun 20 10:12:34 2019 From: cristian.delcarlo at targetsolutions.it (Cristian Del Carlo) Date: Thu, 20 Jun 2019 12:12:34 +0200 Subject: [Gluster-users] General questions Message-ID: Hi, I'm testing glusterfs before using it in production; it should be used to store VMs for nodes with libvirtd. In production I will have 4 nodes connected with a dedicated 20gbit/s network. Which version should I use in production on CentOS 7.x? Should I use Gluster version 6? To make the volume available to libvirtd, is the best method to use FUSE? I see that striped is deprecated. Is it reasonable to use the volume with 3 replicas on 4 nodes and sharding enabled? Is there an advantage to using a sharded volume in this context? I think it could positively impact read performance or rebalancing. Is that true? In the vm configuration I use the virtio disk. How is it better to set the disk cache to get the best performance: none, default or writeback? Thanks in advance for your patience and answers. Thanks, *Cristian Del Carlo* -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jun 20 10:26:18 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Thu, 20 Jun 2019 13:26:18 +0300 Subject: [Gluster-users] General questions Message-ID: Hi, Are you planning to use oVirt or plain KVM or openstack? I would recommend you to use gluster v6.1 as it is the latest stable version and will have longer support than the older versions. Fuse vs libgfapi - use the latter as it has better performance and less overhead on the host. oVirt supports both libgfapi and fuse. Also, use replica 3 because you will have better read performance compared to replica 2 arbiter 1. Sharding is a tradeoff between CPU (when there is no sharding, gluster shd must calculate the offset of the VM disk) and bandwidth (the whole shard is replicated even if only 512 bytes need to be synced). If you will do live migration - you do not want to cache, in order to avoid corruption. Thus oVirt is using direct I/O. Still, you can check the gluster settings mentioned in Red Hat documentation for Virt/openStack. Best Regards, Strahil Nikolov On Jun 20, 2019 13:12, Cristian Del Carlo wrote: > > Hi, > > I'm testing glusterfs before using it in production; it should be used to store VMs for nodes with libvirtd. > > In production I will have 4 nodes connected with a dedicated 20gbit/s network. > > Which version should I use in production on CentOS 7.x? Should I use Gluster version 6? > > To make the volume available to libvirtd, is the best method to use FUSE? > > I see that striped is deprecated. Is it reasonable to use the volume with 3 replicas on 4 nodes and sharding enabled? > Is there an advantage to using a sharded volume in this context? I think it could positively impact read performance or rebalancing. Is that true? > > In the vm configuration I use the virtio disk. How is it better to set the disk cache to get the best performance: none, default or writeback? > > Thanks in advance for your patience and answers.
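To follow up Strahil's pointer at the Red Hat Virt/openStack settings: glusterfs ships a canned option group for VM image stores that applies those recommendations in one step. A sketch; "vmstore" is a placeholder volume name, and the group file is worth reviewing before applying it:

# list what the packaged profile would set
cat /var/lib/glusterd/groups/virt

# apply the whole option group to the volume
gluster volume set vmstore group virt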
> > Thanks, > > > Cristian Del Carlo -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.buitelaar at gmail.com Thu Jun 20 12:52:11 2019 From: olaf.buitelaar at gmail.com (Olaf Buitelaar) Date: Thu, 20 Jun 2019 14:52:11 +0200 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Hi Sanju, you can download the coredump here; http://edgecastcdn.net/0004FA/files/core_dump.zip (around 20MB) Thanks Olaf Op do 20 jun. 2019 om 08:35 schreef Sanju Rakonde : > Olaf, > > Can you please paste the complete backtrace from the core file, so that we can > analyse what is wrong here. > > On Wed, Jun 19, 2019 at 10:31 PM Olaf Buitelaar > wrote: > >> Hi Atin, >> >> Thank you for pointing out this bug report, however no rebalancing task >> was running during this event. So maybe something else is causing this? >> According to the report this should be fixed in gluster 6; unfortunately oVirt >> doesn't seem to officially support that version, so I'm stuck on the 5 >> branch for now. >> Any chance this will be backported? >> >> Thanks Olaf >> >> >> Op wo 19 jun. 2019 om 17:57 schreef Atin Mukherjee : >> >>> Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 >>> >>> >>> >>> On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar >>> wrote: >>> >>>> Dear All, >>>> >>>> Has anybody seen this error on gluster 5.6; >>>> [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] >>>> (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] >>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) >>>> [0x7fbfac50db7a] >>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) >>>> [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op >>>> >>>> checking the code; >>>> https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 >>>> >>>> doesn't seem to reveal much on what could be causing this. >>>> >>>> It's the second time this occurs. >>>> >>>> Attached the full stack. >>>> >>>> Thanks Olaf >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > Thanks, > Sanju > -------------- next part -------------- An HTML attachment was scrubbed... URL: From olaf.buitelaar at gmail.com Thu Jun 20 14:00:07 2019 From: olaf.buitelaar at gmail.com (Olaf Buitelaar) Date: Thu, 20 Jun 2019 16:00:07 +0200 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Hi Sanju, going through the stacks I noticed that this function was in between: glusterd_volume_rebalance_use_rsp_dict So it might after all have something to do with the rebalancing logic. I've checked cmd_history.log, and exactly at the time of the crash this command was executed: [2019-06-19 07:25:03.108360] : volume rebalance ovirt-data status : SUCCESS preceded by a couple of other rebalance status checks. The complete batch of the 2 minutes before all reported success. These commands are executed by oVirt about every 2 minutes, to poll for the status of gluster.
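As a sketch of the check Olaf describes, the command history glusterd keeps can be grepped around the crash timestamp (the log path is the glusterd default; adjust if your installation logs elsewhere):

# every management operation glusterd executes is recorded with a timestamp
grep 'volume rebalance' /var/log/glusterfs/cmd_history.log | tail -n 20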
I'm sure no actual rebalancing tasks were running; I also checked the last one, which ran at 2019-06-08 21:13:02 and completed successfully. Hopefully this is additional useful info. Thanks Olaf Op do 20 jun. 2019 om 14:52 schreef Olaf Buitelaar : > Hi Sanju, > > you can download the coredump here; > http://edgecastcdn.net/0004FA/files/core_dump.zip (around 20MB) > > Thanks Olaf > > Op do 20 jun. 2019 om 08:35 schreef Sanju Rakonde : > >> Olaf, >> >> Can you please paste the complete backtrace from the core file, so that we >> can analyse what is wrong here. >> >> On Wed, Jun 19, 2019 at 10:31 PM Olaf Buitelaar >> wrote: >> >>> Hi Atin, >>> >>> Thank you for pointing out this bug report, however no rebalancing task >>> was running during this event. So maybe something else is causing this? >>> According to the report this should be fixed in gluster 6; unfortunately >>> oVirt doesn't seem to officially support that version, so I'm stuck on the >>> 5 branch for now. >>> Any chance this will be backported? >>> >>> Thanks Olaf >>> >>> >>> Op wo 19 jun. 2019 om 17:57 schreef Atin Mukherjee >>> >: >>> >>>> Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 >>>> >>>> >>>> >>>> On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar < >>>> olaf.buitelaar at gmail.com> wrote: >>>> >>>>> Dear All, >>>>> >>>>> Has anybody seen this error on gluster 5.6; >>>>> [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] >>>>> (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] >>>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) >>>>> [0x7fbfac50db7a] >>>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) >>>>> [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op >>>>> >>>>> checking the code; >>>>> https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 >>>>> >>>>> doesn't seem to reveal much on what could be causing this. >>>>> >>>>> It's the second time this occurs. >>>>> >>>>> Attached the full stack. >>>>> >>>>> Thanks Olaf >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> >> -- >> Thanks, >> Sanju >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cristian.delcarlo at targetsolutions.it Thu Jun 20 15:31:48 2019 From: cristian.delcarlo at targetsolutions.it (Cristian Del Carlo) Date: Thu, 20 Jun 2019 17:31:48 +0200 Subject: [Gluster-users] General questions In-Reply-To: References: Message-ID: Hi, thanks for your help. I am planning to use libvirtd with plain KVM. OK, I will use libgfapi. I'm confused about the use of sharding: is it useful in this configuration? Doesn't sharding help limit the bandwidth in the event of a rebalancing? So in the VM settings I need to use directsync to avoid corruption. Thanks again, Il giorno gio 20 giu 2019 alle ore 12:25 Strahil ha scritto: > Hi, > > Are you planning to use oVirt or plain KVM or openstack? > > I would recommend you to use gluster v6.1 as it is the latest stable > version and will have longer support than the older versions. > > Fuse vs libgfapi - use the latter as it has better performance and less > overhead on the host. oVirt supports both libgfapi and fuse.
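To make the libgfapi point concrete for plain KVM: QEMU can talk to the volume directly over gluster:// URIs instead of going through a FUSE mount, assuming QEMU is built with gluster support. A sketch with placeholder host and volume names; cache=none keeps the I/O direct, matching the no-cache advice quoted below:

# create the guest image directly on the volume via libgfapi
qemu-img create -f qcow2 gluster://server1/vmstore/vm1.qcow2 50G

# boot a guest from it with caching disabled
qemu-system-x86_64 -m 2048 -drive file=gluster://server1/vmstore/vm1.qcow2,format=qcow2,if=virtio,cache=none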
> > Also, use replica 3 because you will have better read performance compared > to replica 2 arbiter 1. > > Sharding is a tradeoff between CPU (when there is no sharding , gluster > shd must calculate the offset of the VM disk) and bandwidth (whole shard > is being replicated despite even 512 need to be synced). > > If you will do live migration - you do not want to cache in order to > avoid corruption. > Thus oVirt is using direct I/O. > Still, you can check the gluster settings mentioned in Red Hat > documentation for Virt/openStack . > > Best Regards, > Strahil Nikolov > On Jun 20, 2019 13:12, Cristian Del Carlo < > cristian.delcarlo at targetsolutions.it> wrote: > > Hi, > > I'm testing glusterfs before using it in production, it should be used to > store vm for nodes with libvirtd. > > In production I will have 4 nodes connected with a dedicated 20gbit/s > network. > > Which version to use in production on a centos 7.x? Should I use Gluster > version 6? > > To make the volume available to libvirtd the best method is to use FUSE? > > I see that stripped is deprecated. Is it reasonable to use the volume with > 3 replicas on 4 nodes and sharding enabled? > Is there convenience to use sharding volume in this context? I think could > positive inpact in read performance or rebalance. Is it true? > > In the vm configuration I use the virtio disk. How is it better to set the > disk cache to get the best performances none, default or writeback? > > Thanks in advance for your patience and answers. > > Thanks, > > > *Cristian Del Carlo* > > -- *Cristian Del Carlo* *Target Solutions s.r.l.* *T* +39 0583 1905621 *F* +39 0583 1905675 *@* cristian.delcarlo at targetsolutions.it http://www.targetsolutions.it P.IVA e C.Fiscale: 01815270465 Reg. Imp. di Lucca Capitale Sociale: ?11.000,00 iv - REA n? 173227 Il testo e gli eventuali documenti trasmessi contengono informazioni riservate al destinatario indicato. La seguente e-mail e' confidenziale e la sua riservatezza e' tutelata legalmente dal Decreto Legislativo 196 del 30/06/2003 (Codice di tutela della privacy). La lettura, copia o altro uso non autorizzato o qualsiasi altra azione derivante dalla conoscenza di queste informazioni sono rigorosamente vietate. Qualora abbiate ricevuto questo documento per errore siete cortesemente pregati di darne immediata comunicazione al mittente e di provvedere, immediatamente, alla sua distruzione. -------------- next part -------------- An HTML attachment was scrubbed... URL: From nico at furyweb.fr Thu Jun 20 17:01:40 2019 From: nico at furyweb.fr (nico at furyweb.fr) Date: Thu, 20 Jun 2019 19:01:40 +0200 (CEST) Subject: [Gluster-users] What is TCP/4007 for ? Message-ID: <850101223.444.1561050100860.JavaMail.zimbra@furyweb.fr> I have several Gluster clients behind firewalls and opened 24007:24008,49152:49241 to gluster servers. I recently found that TCP/4007 is blocked from clients to servers but found this port reference nowhere on google search. On server side there's no process listening on TCP/4007 so I would like to disable it on client side, maybe a volume setting should be initialized. Gluster servers are 5.1 Volumes are replica 3 with arbiter Clients are 3.12.15 (Wheezy) or 4.1.5 (Jessie/Stretch) I thank anyone able to give me some information. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From hunter86_bg at yahoo.com Thu Jun 20 20:13:37 2019 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 20 Jun 2019 20:13:37 +0000 (UTC) Subject: [Gluster-users] General questions In-Reply-To: References: Message-ID: <1827082585.3381556.1561061617490@mail.yahoo.com> Sharding is complex. It helps to heal faster - as only the shards that got changed will be replicated - but imagine a 1GB shard that got only 512k updated: in such a case you will copy the whole shard to the other replicas. RHV & oVirt use a default shard size of 4M, which is the exact size of the default PE in LVM. On the other side, it speeds things up, as gluster can balance the shards properly on the replicas and thus you can evenly distribute the load on the cluster. It is not a coincidence that RHV and oVirt use sharding by default. Just a warning. NEVER, EVER, DISABLE SHARDING!!! ONCE ENABLED - STAYS ENABLED! Don't ask how I learnt that :) Best Regards, Strahil Nikolov On Thursday, 20 June 2019 at 18:32:00 GMT+3, Cristian Del Carlo wrote: Hi, thanks for your help. I am planning to use libvirtd with plain KVM. OK, I will use libgfapi. I'm confused about the use of sharding: is it useful in this configuration? Doesn't sharding help limit the bandwidth in the event of a rebalancing? So in the VM settings I need to use directsync to avoid corruption. Thanks again, Il giorno gio 20 giu 2019 alle ore 12:25 Strahil ha scritto: Hi, Are you planning to use oVirt or plain KVM or openstack? I would recommend you to use gluster v6.1 as it is the latest stable version and will have longer support than the older versions. Fuse vs libgfapi - use the latter as it has better performance and less overhead on the host. oVirt supports both libgfapi and fuse. Also, use replica 3 because you will have better read performance compared to replica 2 arbiter 1. Sharding is a tradeoff between CPU (when there is no sharding, gluster shd must calculate the offset of the VM disk) and bandwidth (the whole shard is replicated even if only 512 bytes need to be synced). If you will do live migration - you do not want to cache, in order to avoid corruption. Thus oVirt is using direct I/O. Still, you can check the gluster settings mentioned in Red Hat documentation for Virt/openStack. Best Regards, Strahil Nikolov On Jun 20, 2019 13:12, Cristian Del Carlo wrote: Hi, I'm testing glusterfs before using it in production; it should be used to store VMs for nodes with libvirtd. In production I will have 4 nodes connected with a dedicated 20gbit/s network. Which version should I use in production on CentOS 7.x? Should I use Gluster version 6? To make the volume available to libvirtd, is the best method to use FUSE? I see that striped is deprecated. Is it reasonable to use the volume with 3 replicas on 4 nodes and sharding enabled? Is there an advantage to using a sharded volume in this context? I think it could positively impact read performance or rebalancing. Is that true? In the vm configuration I use the virtio disk. How is it better to set the disk cache to get the best performance: none, default or writeback? Thanks in advance for your patience and answers. Thanks, Cristian Del Carlo -- Cristian Del Carlo Target Solutions s.r.l. T +39 0583 1905621 F +39 0583 1905675 @ cristian.delcarlo at targetsolutions.it http://www.targetsolutions.it P.IVA e C.Fiscale: 01815270465 Reg. Imp. di Lucca Capitale Sociale: €11.000,00 iv - REA n° 173227 Il testo e gli eventuali documenti trasmessi contengono informazioni riservate al destinatario indicato.
La seguente e-mail e' confidenziale e la sua riservatezza e' tutelata legalmente dal Decreto Legislativo 196 del 30/06/2003 (Codice di tutela della privacy). La lettura, copia o altro uso non autorizzato o qualsiasi altra azione derivante dalla conoscenza di queste informazioni sono rigorosamente vietate. Qualora abbiate ricevuto questo documento per errore siete cortesemente pregati di darne immediata comunicazione al mittente e di provvedere, immediatamente, alla sua distruzione. -------------- next part -------------- An HTML attachment was scrubbed... URL: From srakonde at redhat.com Fri Jun 21 05:54:39 2019 From: srakonde at redhat.com (Sanju Rakonde) Date: Fri, 21 Jun 2019 11:24:39 +0530 Subject: [Gluster-users] glusterd crashes on Assertion failed: rsp.op == txn_op_info.op In-Reply-To: References: Message-ID: Thanks for the update Olaf. You are hitting the bug, which is mentioned by Atin in this mail thread. I'm not sure whether we can backport the fix to release-5 branch. I will update you regarding this in early next week. On Thu, Jun 20, 2019 at 7:30 PM Olaf Buitelaar wrote: > Hi Sanju, > > going through the stacks i noticed that this function was in > between; glusterd_volume_rebalance_use_rsp_dict > So it might after all have todo something with the rebalancing logic. > I've checked the cmd_history.log and exactly on the time of crash time > command was executed; > [2019-06-19 07:25:03.108360] : volume rebalance ovirt-data status : > SUCCESS preceding a couple of other status checks of rebalancing. The > complete batch of 2 mins before, all reported success. > These commands are executed by ovirt about every 2 minutes, to pull for > the status of gluster. > I'm sure no actual rebalancing tasks were running, also checked the last > time that was @2019-06-08 21:13:02 and was completed successfully > Hopefully this is additional useful info. > > Thanks Olaf > > Op do 20 jun. 2019 om 14:52 schreef Olaf Buitelaar < > olaf.buitelaar at gmail.com>: > >> Hi Sanju, >> >> you can download the coredump here; >> http://edgecastcdn.net/0004FA/files/core_dump.zip (around 20MB) >> >> Thanks Olaf >> >> Op do 20 jun. 2019 om 08:35 schreef Sanju Rakonde : >> >>> Olaf, >>> >>> Can you please paste complete backtrace from the core file, so that we >>> can analyse what is wrong here. >>> >>> On Wed, Jun 19, 2019 at 10:31 PM Olaf Buitelaar < >>> olaf.buitelaar at gmail.com> wrote: >>> >>>> Hi Atin, >>>> >>>> Thank you for pointing out this bug report, however no rebalancing task >>>> was running during this event. So maybe something else is causing this? >>>> According the report this should be fixed in gluster 6, unfortunate >>>> ovirt doesn't seem to officially support that version, so i'm stuck on the >>>> 5 branch for now. >>>> Any chance this will be back ported? >>>> >>>> Thanks Olaf >>>> >>>> >>>> Op wo 19 jun. 
2019 om 17:57 schreef Atin Mukherjee >>>> >: >>>>> Please see - https://bugzilla.redhat.com/show_bug.cgi?id=1655827 >>>>> >>>>> >>>>> >>>>> On Wed, Jun 19, 2019 at 5:52 PM Olaf Buitelaar < >>>>> olaf.buitelaar at gmail.com> wrote: >>>>> >>>>>> Dear All, >>>>>> >>>>>> Has anybody seen this error on gluster 5.6; >>>>>> [glusterd-rpc-ops.c:1388:__glusterd_commit_op_cbk] >>>>>> (-->/lib64/libgfrpc.so.0(+0xec60) [0x7fbfb7801c60] >>>>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x79b7a) >>>>>> [0x7fbfac50db7a] >>>>>> -->/usr/lib64/glusterfs/5.6/xlator/mgmt/glusterd.so(+0x77393) >>>>>> [0x7fbfac50b393] ) 0-: Assertion failed: rsp.op == txn_op_info.op >>>>>> >>>>>> checking the code; >>>>>> https://github.com/gluster/glusterfs/blob/6fd8281ac9af58609979f660ece58c2ed1100e72/xlators/mgmt/glusterd/src/glusterd-rpc-ops.c#L1388 >>>>>> >>>>>> doesn't seem to reveal much on what could be causing this. >>>>>> >>>>>> It's the second time this occurs. >>>>>> >>>>>> Attached the full stack. >>>>>> >>>>>> Thanks Olaf >>>>>> _______________________________________________ >>>>>> Gluster-users mailing list >>>>>> Gluster-users at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >>> -- >>> Thanks, >>> Sanju >>> >> -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From cristian.delcarlo at targetsolutions.it Fri Jun 21 07:12:26 2019 From: cristian.delcarlo at targetsolutions.it (Cristian Del Carlo) Date: Fri, 21 Jun 2019 09:12:26 +0200 Subject: [Gluster-users] General questions In-Reply-To: <1827082585.3381556.1561061617490@mail.yahoo.com> References: <1827082585.3381556.1561061617490@mail.yahoo.com> Message-ID: Thanks Strahil, in this link https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html/administration_guide/sect-creating_replicated_volumes i see: *Sharding has one supported use case: in the context of providing Red Hat Gluster Storage as a storage domain for Red Hat Enterprise Virtualization, to provide storage for live virtual machine images. Note that sharding is also a requirement for this use case, as it provides significant performance improvements over previous implementations. * The default settings in GlusterFS 6.1 appear to be: features.shard-block-size 64MB features.shard-lru-limit 16384 features.shard-deletion-rate 100 Bricks in my case are over an xfs filesystem. I'll try different block sizes, but if I understand correctly, small block sizes are preferable to big block sizes, and if I have doubts I will put 4M. Many thanks for the warning, message received! :-) Best Regards, Cristian Il giorno gio 20 giu 2019 alle ore 22:13 Strahil Nikolov < hunter86_bg at yahoo.com> ha scritto: > Sharding is complex. It helps to heal faster - as only the shards that got > changed will be replicated, but imagine a 1GB shard that got only 512k > updated - in such case you will copy the whole shard to the other replicas. > RHV & oVirt use a default shard size of 4M which is the exact size of the > default PE in LVM. > > On the other side, it speeds stuff as gluster can balance the shards > properly on the replicas and thus you can evenly distribute the load on the > cluster. > It is not a coincidence that RHV and oVirt use sharding by default. > > Just a warning. > NEVER, EVER, DISABLE SHARDING!!!
ONCE ENABLED - STAYS ENABLED! > Don't ask how I learnt that :) > > Best Regards, > Strahil Nikolov > > > > On Thursday, 20 June 2019 at 18:32:00 GMT+3, Cristian Del Carlo < > cristian.delcarlo at targetsolutions.it> wrote: > > > Hi, > > thanks for your help. > > I am planning to use libvirtd with plain KVM. > > OK, I will use libgfapi. > > I'm confused about the use of sharding: is it useful in this configuration? > Doesn't sharding help limit the bandwidth in the event of a rebalancing? > > So in the VM settings I need to use directsync to avoid corruption. > > Thanks again, > > Il giorno gio 20 giu 2019 alle ore 12:25 Strahil > ha scritto: > > Hi, > > Are you planning to use oVirt or plain KVM or openstack? > > I would recommend you to use gluster v6.1 as it is the latest stable > version and will have longer support than the older versions. > > Fuse vs libgfapi - use the latter as it has better performance and less > overhead on the host. oVirt supports both libgfapi and fuse. > > Also, use replica 3 because you will have better read performance compared > to replica 2 arbiter 1. > > Sharding is a tradeoff between CPU (when there is no sharding, gluster > shd must calculate the offset of the VM disk) and bandwidth (the whole shard > is replicated even if only 512 bytes need to be synced). > > If you will do live migration - you do not want to cache, in order to > avoid corruption. > Thus oVirt is using direct I/O. > Still, you can check the gluster settings mentioned in Red Hat > documentation for Virt/openStack. > > Best Regards, > Strahil Nikolov > On Jun 20, 2019 13:12, Cristian Del Carlo < > cristian.delcarlo at targetsolutions.it> wrote: > > Hi, > > I'm testing glusterfs before using it in production; it should be used to > store VMs for nodes with libvirtd. > > In production I will have 4 nodes connected with a dedicated 20gbit/s > network. > > Which version should I use in production on CentOS 7.x? Should I use Gluster > version 6? > > To make the volume available to libvirtd, is the best method to use FUSE? > > I see that striped is deprecated. Is it reasonable to use the volume with > 3 replicas on 4 nodes and sharding enabled? > Is there an advantage to using a sharded volume in this context? I think it > could positively impact read performance or rebalancing. Is that true? > > In the vm configuration I use the virtio disk. How is it better to set the > disk cache to get the best performance: none, default or writeback? > > Thanks in advance for your patience and answers. > > Thanks, > > > *Cristian Del Carlo* > > > > -- > > > *Cristian Del Carlo* > > *Target Solutions s.r.l.* > > *T* +39 0583 1905621 > *F* +39 0583 1905675 > *@* cristian.delcarlo at targetsolutions.it > > http://www.targetsolutions.it > P.IVA e C.Fiscale: 01815270465 Reg. Imp. di Lucca > Capitale Sociale: €11.000,00 iv - REA n° 173227 > > Il testo e gli eventuali documenti trasmessi contengono informazioni > riservate al destinatario indicato. La seguente e-mail e' confidenziale e > la sua riservatezza e' tutelata legalmente dal Decreto Legislativo 196 del > 30/06/2003 (Codice di tutela della privacy). La lettura, copia o altro uso > non autorizzato o qualsiasi altra azione derivante dalla conoscenza di > queste informazioni sono rigorosamente vietate. Qualora abbiate ricevuto > questo documento per errore siete cortesemente pregati di darne immediata > comunicazione al mittente e di provvedere, immediatamente, alla sua > distruzione. > -------------- next part -------------- An HTML attachment was scrubbed...
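A quick way to confirm the defaults Cristian quotes on a live system; the volume name below is a placeholder:

# show the sharding-related options ('volume get' lists defaults too)
gluster volume get vmstore all | grep -i shard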
> -------------- next part -------------- An HTML attachment was scrubbed... URL: From kdhananj at redhat.com Fri Jun 21 07:40:30 2019 From: kdhananj at redhat.com (Krutika Dhananjay) Date: Fri, 21 Jun 2019 13:10:30 +0530 Subject: [Gluster-users] General questions In-Reply-To: References: <1827082585.3381556.1561061617490@mail.yahoo.com> Message-ID: Adding (back) gluster-users. -Krutika On Fri, Jun 21, 2019 at 1:09 PM Krutika Dhananjay wrote: > > > On Fri, Jun 21, 2019 at 12:43 PM Cristian Del Carlo < > cristian.delcarlo at targetsolutions.it> wrote: > >> Thanks Strahil, >> >> in this link >> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html/administration_guide/sect-creating_replicated_volumes >> i see: >> >> >> *Sharding has one supported use case: in the context of providing Red Hat >> Gluster Storage as a storage domain for Red Hat Enterprise Virtualization, >> to provide storage for live virtual machine images. Note that sharding is >> also a requirement for this use case, as it provides significant >> performance improvements over previous implementations. * >> >> The deafult setting in GusterFS 6.1 appears to be: >> >> features.shard-block-size 64MB >> >> features.shard-lru-limit 16384 >> >> features.shard-deletion-rate 100 >> > > That's right. Based on the tests we'd conducted internally, we'd found > 64MB to be a good number both in terms of self-heal and IO performance. 4MB > is a little on the lower side in that sense. The benefits of some features > like eager-locking are lost if the shard size is too small. You can perhaps > run some tests with 64MB shard-block-size to begin with, and tune it if it > doesn't fit your needs. > > -Krutika > > >> Bricks in my case are over an xfs filesystem. I'll try different >> block-size but if I understand correctly, small block sizes are preferable >> to big block sizes and If i have doubt I will put 4M. >> >> Very thanks for the warning, message received! :-) >> >> Best Regards, >> >> Cristian >> >> >> Il giorno gio 20 giu 2019 alle ore 22:13 Strahil Nikolov < >> hunter86_bg at yahoo.com> ha scritto: >> >>> Sharding is complex. It helps to heal faster -as only the shards that >>> got changed will be replicated, but imagine a 1GB shard that got only 512k >>> updated - in such case you will copy the whole shard to the other replicas. >>> RHV & oVirt use a default shard size of 4M which is the exact size of >>> the default PE in LVM. >>> >>> On the other side, it speeds stuff as gluster can balance the shards >>> properly on the replicas and thus you can evenly distribute the load on the >>> cluster. >>> It is not a coincidence that RHV and oVirt use sharding by default. >>> >>> Just a warning. >>> NEVER, EVER, DISABLE SHARDING!!! ONCE ENABLED - STAYS ENABLED! >>> Don't ask how I learnGrazie dell'avviso, messaggio ricevuto!t that :) >>> >>> Best Regards, >>> Strahil Nikolov >>> >>> >>> >>> ? ?????????, 20 ??? 2019 ?., 18:32:00 ?. ???????+3, Cristian Del Carlo < >>> cristian.delcarlo at targetsolutions.it> ??????: >>> >>> >>> Hi, >>> >>> thanks for your help. >>> >>> I am planing to use libvirtd with plain KVM. >>> >>> Ok i will use libgfapi. >>> >>> I'm confused about the use of sharding is it useful in this >>> configuration? Doesn't sharding help limit the bandwidth in the event of a >>> rebalancing? >>> >>> In the vm setting so i need to use directsync to avoid corruption. 
>>> >>> Thanks again, >>> >>> Il giorno gio 20 giu 2019 alle ore 12:25 Strahil >>> ha scritto: >>> >>> Hi, >>> >>> Are you planing to use oVirt or plain KVM or openstack? >>> >>> I would recommend you to use gluster v6.1 as it is the latest stable >>> version and will have longer support than the older versions. >>> >>> Fuse vs libgfapi - use the latter as it has better performance and less >>> overhead on the host.oVirt does supports both libgfapi and fuse. >>> >>> Also, use replica 3 because you will have better read performance >>> compared to replica 2 arbiter 1. >>> >>> Sharding is a tradeoff between CPU (when there is no sharding , gluster >>> shd must calculate the offset of the VM disk) and bandwidth (whole shard >>> is being replicated despite even 512 need to be synced). >>> >>> If you will do live migration - you do not want to cache in order to >>> avoid corruption. >>> Thus oVirt is using direct I/O. >>> Still, you can check the gluster settings mentioned in Red Hat >>> documentation for Virt/openStack . >>> >>> Best Regards, >>> Strahil Nikolov >>> On Jun 20, 2019 13:12, Cristian Del Carlo < >>> cristian.delcarlo at targetsolutions.it> wrote: >>> >>> Hi, >>> >>> I'm testing glusterfs before using it in production, it should be used >>> to store vm for nodes with libvirtd. >>> >>> In production I will have 4 nodes connected with a dedicated 20gbit/s >>> network. >>> >>> Which version to use in production on a centos 7.x? Should I use Gluster >>> version 6? >>> >>> To make the volume available to libvirtd the best method is to use FUSE? >>> >>> I see that stripped is deprecated. Is it reasonable to use the volume >>> with 3 replicas on 4 nodes and sharding enabled? >>> Is there convenience to use sharding volume in this context? I think >>> could positive inpact in read performance or rebalance. Is it true? >>> >>> In the vm configuration I use the virtio disk. How is it better to set >>> the disk cache to get the best performances none, default or writeback? >>> >>> Thanks in advance for your patience and answers. >>> >>> Thanks, >>> >>> >>> *Cristian Del Carlo* >>> >>> >>> >>> -- >>> >>> >>> *Cristian Del Carlo* >>> >>> *Target Solutions s.r.l.* >>> >>> *T* +39 0583 1905621 >>> *F* +39 0583 1905675 >>> *@* cristian.delcarlo at targetsolutions.it >>> >>> http://www.targetsolutions.it >>> P.IVA e C.Fiscale: 01815270465 Reg. Imp. di Lucca >>> Capitale Sociale: ?11.000,00 iv - REA n? 173227 >>> >>> Il testo e gli eventuali documenti trasmessi contengono informazioni >>> riservate al destinatario indicato. La seguente e-mail e' confidenziale e >>> la sua riservatezza e' tutelata legalmente dal Decreto Legislativo 196 del >>> 30/06/2003 (Codice di tutela della privacy). La lettura, copia o altro uso >>> non autorizzato o qualsiasi altra azione derivante dalla conoscenza di >>> queste informazioni sono rigorosamente vietate. Qualora abbiate ricevuto >>> questo documento per errore siete cortesemente pregati di darne immediata >>> comunicazione al mittente e di provvedere, immediatamente, alla sua >>> distruzione. >>> >> >> >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... 
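Turning Krutika's recommendation into commands: enabling sharding on a fresh VM-store volume and keeping the 64MB default block size. The volume name is a placeholder, and per Strahil's warning above this is a one-way switch, so set it before any VM images are written:

# enable sharding; do this before the first VM image lands on the volume
gluster volume set vmstore features.shard on

# 64MB is already the default; setting it explicitly just documents the choice
gluster volume set vmstore features.shard-block-size 64MB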
URL: From nico at furyweb.fr Fri Jun 21 07:48:47 2019 From: nico at furyweb.fr (nico at furyweb.fr) Date: Fri, 21 Jun 2019 09:48:47 +0200 (CEST) Subject: [Gluster-users] Parallel process hang on gluster volume Message-ID: <1119765369.1998.1561103327489.JavaMail.zimbra@furyweb.fr> I encounterd an issue on production servers using GlusterFS servers 5.1 and clients 4.1.5 when several process write at the same time on a gluster volume. With more than 48 process writes on the volume at the same time, they are blocked in D state (uninterruptible sleep), I guess some volume settings have to be tuned but can't figure out which. The client is using op-version 40100 on this volume Below are volume info, volume settings and ps output on blocked processes. root at glusterVM1:~# gluster volume info logsscripts Volume Name: logsscripts Type: Replicate Volume ID: cb49af70-d197-43c1-852d-0bcf8dc9f6fa Status: Started Snapshot Count: 0 Number of Bricks: 1 x (2 + 1) = 3 Transport-type: tcp Bricks: Brick1: glusterVM1:/bricks/logsscripts/brick1/data Brick2: glusterVM2:/bricks/logsscripts/brick1/data Brick3: glusterVM3:/bricks/logsscripts/brick1/data (arbiter) Options Reconfigured: server.tcp-user-timeout: 42 cluster.data-self-heal-algorithm: full features.trash: off diagnostics.client-log-level: ERROR ssl.cipher-list: HIGH:!SSLv2 server.ssl: on client.ssl: on transport.address-family: inet nfs.disable: on performance.client-io-threads: off root at glusterVM1:~# gluster volume get logsscripts all Option Value ------ ----- cluster.lookup-unhashed on cluster.lookup-optimize on cluster.min-free-disk 10% cluster.min-free-inodes 5% cluster.rebalance-stats off cluster.subvols-per-directory (null) cluster.readdir-optimize off cluster.rsync-hash-regex (null) cluster.extra-hash-regex (null) cluster.dht-xattr-name trusted.glusterfs.dht cluster.randomize-hash-range-by-gfid off cluster.rebal-throttle normal cluster.lock-migration off cluster.force-migration off cluster.local-volume-name (null) cluster.weighted-rebalance on cluster.switch-pattern (null) cluster.entry-change-log on cluster.read-subvolume (null) cluster.read-subvolume-index -1 cluster.read-hash-mode 1 cluster.background-self-heal-count 8 cluster.metadata-self-heal on cluster.data-self-heal on cluster.entry-self-heal on cluster.self-heal-daemon on cluster.heal-timeout 600 cluster.self-heal-window-size 1 cluster.data-change-log on cluster.metadata-change-log on cluster.data-self-heal-algorithm full cluster.eager-lock on disperse.eager-lock on disperse.other-eager-lock on disperse.eager-lock-timeout 1 disperse.other-eager-lock-timeout 1 cluster.quorum-type auto cluster.quorum-count (null) cluster.choose-local true cluster.self-heal-readdir-size 1KB cluster.post-op-delay-secs 1 cluster.ensure-durability on cluster.consistent-metadata no cluster.heal-wait-queue-length 128 cluster.favorite-child-policy none cluster.full-lock yes cluster.stripe-block-size 128KB cluster.stripe-coalesce true diagnostics.latency-measurement off diagnostics.dump-fd-stats off diagnostics.count-fop-hits off diagnostics.brick-log-level INFO diagnostics.client-log-level ERROR diagnostics.brick-sys-log-level CRITICAL diagnostics.client-sys-log-level CRITICAL diagnostics.brick-logger (null) diagnostics.client-logger (null) diagnostics.brick-log-format (null) diagnostics.client-log-format (null) diagnostics.brick-log-buf-size 5 diagnostics.client-log-buf-size 5 diagnostics.brick-log-flush-timeout 120 diagnostics.client-log-flush-timeout 120 diagnostics.stats-dump-interval 0 
diagnostics.fop-sample-interval 0 diagnostics.stats-dump-format json diagnostics.fop-sample-buf-size 65535 diagnostics.stats-dnscache-ttl-sec 86400 performance.cache-max-file-size 0 performance.cache-min-file-size 0 performance.cache-refresh-timeout 1 performance.cache-priority performance.cache-size 32MB performance.io-thread-count 16 performance.high-prio-threads 16 performance.normal-prio-threads 16 performance.low-prio-threads 16 performance.least-prio-threads 1 performance.enable-least-priority on performance.iot-watchdog-secs (null) performance.iot-cleanup-disconnected-reqsoff performance.iot-pass-through false performance.io-cache-pass-through false performance.cache-size 128MB performance.qr-cache-timeout 1 performance.cache-invalidation false performance.ctime-invalidation false performance.flush-behind on performance.nfs.flush-behind on performance.write-behind-window-size 1MB performance.resync-failed-syncs-after-fsyncoff performance.nfs.write-behind-window-size1MB performance.strict-o-direct off performance.nfs.strict-o-direct off performance.strict-write-ordering off performance.nfs.strict-write-ordering off performance.write-behind-trickling-writeson performance.aggregate-size 128KB performance.nfs.write-behind-trickling-writeson performance.lazy-open yes performance.read-after-open yes performance.open-behind-pass-through false performance.read-ahead-page-count 4 performance.read-ahead-pass-through false performance.readdir-ahead-pass-through false performance.md-cache-pass-through false performance.md-cache-timeout 1 performance.cache-swift-metadata true performance.cache-samba-metadata false performance.cache-capability-xattrs true performance.cache-ima-xattrs true performance.md-cache-statfs off performance.xattr-cache-list performance.nl-cache-pass-through false features.encryption off encryption.master-key (null) encryption.data-key-size 256 encryption.block-size 4096 network.frame-timeout 1800 network.ping-timeout 42 network.tcp-window-size (null) client.ssl on network.remote-dio disable client.event-threads 2 client.tcp-user-timeout 0 client.keepalive-time 20 client.keepalive-interval 2 client.keepalive-count 9 network.tcp-window-size (null) network.inode-lru-limit 16384 auth.allow * auth.reject (null) transport.keepalive 1 server.allow-insecure on server.root-squash off server.anonuid 65534 server.anongid 65534 server.statedump-path /var/run/gluster server.outstanding-rpc-limit 64 server.ssl on auth.ssl-allow * server.manage-gids off server.dynamic-auth on client.send-gids on server.gid-timeout 300 server.own-thread (null) server.event-threads 1 server.tcp-user-timeout 42 server.keepalive-time 20 server.keepalive-interval 2 server.keepalive-count 9 transport.listen-backlog 1024 ssl.own-cert (null) ssl.private-key (null) ssl.ca-list (null) ssl.crl-path (null) ssl.certificate-depth (null) ssl.cipher-list HIGH:!SSLv2 ssl.dh-param (null) ssl.ec-curve (null) transport.address-family inet performance.write-behind on performance.read-ahead on performance.readdir-ahead on performance.io-cache on performance.quick-read on performance.open-behind on performance.nl-cache off performance.stat-prefetch on performance.client-io-threads off performance.nfs.write-behind on performance.nfs.read-ahead off performance.nfs.io-cache off performance.nfs.quick-read off performance.nfs.stat-prefetch off performance.nfs.io-threads off performance.force-readdirp true performance.cache-invalidation false features.uss off features.snapshot-directory .snaps features.show-snapshot-directory off 
features.tag-namespaces off network.compression off network.compression.window-size -15 network.compression.mem-level 8 network.compression.min-size 0 network.compression.compression-level -1 network.compression.debug false features.default-soft-limit 80% features.soft-timeout 60 features.hard-timeout 5 features.alert-time 86400 features.quota-deem-statfs off geo-replication.indexing off geo-replication.indexing off geo-replication.ignore-pid-check off geo-replication.ignore-pid-check off features.quota off features.inode-quota off features.bitrot disable debug.trace off debug.log-history no debug.log-file no debug.exclude-ops (null) debug.include-ops (null) debug.error-gen off debug.error-failure (null) debug.error-number (null) debug.random-failure off debug.error-fops (null) nfs.enable-ino32 no nfs.mem-factor 15 nfs.export-dirs on nfs.export-volumes on nfs.addr-namelookup off nfs.dynamic-volumes off nfs.register-with-portmap on nfs.outstanding-rpc-limit 16 nfs.port 2049 nfs.rpc-auth-unix on nfs.rpc-auth-null on nfs.rpc-auth-allow all nfs.rpc-auth-reject none nfs.ports-insecure off nfs.trusted-sync off nfs.trusted-write off nfs.volume-access read-write nfs.export-dir nfs.disable on nfs.nlm on nfs.acl on nfs.mount-udp off nfs.mount-rmtab /var/lib/glusterd/nfs/rmtab nfs.rpc-statd /sbin/rpc.statd nfs.server-aux-gids off nfs.drc off nfs.drc-size 0x20000 nfs.read-size (1 * 1048576ULL) nfs.write-size (1 * 1048576ULL) nfs.readdir-size (1 * 1048576ULL) nfs.rdirplus on nfs.event-threads 1 nfs.exports-auth-enable (null) nfs.auth-refresh-interval-sec (null) nfs.auth-cache-ttl-sec (null) features.read-only off features.worm off features.worm-file-level off features.worm-files-deletable on features.default-retention-period 120 features.retention-mode relax features.auto-commit-period 180 storage.linux-aio off storage.batch-fsync-mode reverse-fsync storage.batch-fsync-delay-usec 0 storage.owner-uid -1 storage.owner-gid -1 storage.node-uuid-pathinfo off storage.health-check-interval 30 storage.build-pgfid off storage.gfid2path on storage.gfid2path-separator : storage.reserve 1 storage.health-check-timeout 10 storage.fips-mode-rchecksum off storage.force-create-mode 0000 storage.force-directory-mode 0000 storage.create-mask 0777 storage.create-directory-mask 0777 storage.max-hardlinks 100 storage.ctime off storage.bd-aio off config.gfproxyd off cluster.server-quorum-type off cluster.server-quorum-ratio 0 changelog.changelog off changelog.changelog-dir {{ brick.path }}/.glusterfs/changelogs changelog.encoding ascii changelog.rollover-time 15 changelog.fsync-interval 5 changelog.changelog-barrier-timeout 120 changelog.capture-del-path off features.barrier disable features.barrier-timeout 120 features.trash off features.trash-dir .trashcan features.trash-eliminate-path (null) features.trash-max-filesize 5MB features.trash-internal-op off cluster.enable-shared-storage disable cluster.write-freq-threshold 0 cluster.read-freq-threshold 0 cluster.tier-pause off cluster.tier-promote-frequency 120 cluster.tier-demote-frequency 3600 cluster.watermark-hi 90 cluster.watermark-low 75 cluster.tier-mode cache cluster.tier-max-promote-file-size 0 cluster.tier-max-mb 4000 cluster.tier-max-files 10000 cluster.tier-query-limit 100 cluster.tier-compact on cluster.tier-hot-compact-frequency 604800 cluster.tier-cold-compact-frequency 604800 features.ctr-enabled off features.record-counters off features.ctr-record-metadata-heat off features.ctr_link_consistency off features.ctr_lookupheal_link_timeout 300 
features.ctr_lookupheal_inode_timeout 300 features.ctr-sql-db-cachesize 12500 features.ctr-sql-db-wal-autocheckpoint 25000 features.selinux on locks.trace off locks.mandatory-locking off cluster.disperse-self-heal-daemon enable cluster.quorum-reads no client.bind-insecure (null) features.timeout 45 features.failover-hosts (null) features.shard off features.shard-block-size 64MB features.shard-lru-limit 16384 features.shard-deletion-rate 100 features.scrub-throttle lazy features.scrub-freq biweekly features.scrub false features.expiry-time 120 features.cache-invalidation off features.cache-invalidation-timeout 60 features.leases off features.lease-lock-recall-timeout 60 disperse.background-heals 8 disperse.heal-wait-qlength 128 cluster.heal-timeout 600 dht.force-readdirp on disperse.read-policy gfid-hash cluster.shd-max-threads 1 cluster.shd-wait-qlength 1024 cluster.locking-scheme full cluster.granular-entry-heal no features.locks-revocation-secs 0 features.locks-revocation-clear-all false features.locks-revocation-max-blocked 0 features.locks-monkey-unlocking false features.locks-notify-contention no features.locks-notify-contention-delay 5 disperse.shd-max-threads 1 disperse.shd-wait-qlength 1024 disperse.cpu-extensions auto disperse.self-heal-window-size 1 cluster.use-compound-fops off performance.parallel-readdir off performance.rda-request-size 131072 performance.rda-low-wmark 4096 performance.rda-high-wmark 128KB performance.rda-cache-limit 10MB performance.nl-cache-positive-entry false performance.nl-cache-limit 10MB performance.nl-cache-timeout 60 cluster.brick-multiplex off cluster.max-bricks-per-process 0 disperse.optimistic-change-log on disperse.stripe-cache 4 cluster.halo-enabled False cluster.halo-shd-max-latency 99999 cluster.halo-nfsd-max-latency 5 cluster.halo-max-latency 5 cluster.halo-max-replicas 99999 cluster.halo-min-replicas 2 cluster.daemon-log-level INFO debug.delay-gen off delay-gen.delay-percentage 10% delay-gen.delay-duration 100000 delay-gen.enable disperse.parallel-writes on features.sdfs on features.cloudsync off features.utime off ctime.noatime on traitVM2:~# ps fax | grep '[_]save' 7305 ? D 2:57 [remote_save] 7801 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7802 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7803 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7804 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7805 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7806 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7807 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7808 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7809 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7810 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7811 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7812 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7813 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7814 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7815 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7816 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7817 ? 
D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7818 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7819 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7820 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7821 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7822 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7823 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7824 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7825 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7826 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7827 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7828 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7829 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7830 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7831 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7832 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7833 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7834 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7835 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7836 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7837 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7838 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7839 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7840 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7841 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7842 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7843 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7844 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7845 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7846 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7847 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7848 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7849 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7850 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7851 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7852 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7853 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7854 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7855 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7856 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7857 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7858 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7859 ? 
D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7860 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7861 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7862 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7863 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7864 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7865 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7866 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7867 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7868 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7869 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7870 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7871 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7872 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7873 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7874 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7875 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7876 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7877 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7878 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7879 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7880 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7881 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7882 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7883 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7884 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 7885 ? D 0:00 \_ /usr/bin/perl /home/prod/current/app/scripts/remote_save.pl 30000 From laurentfdumont at gmail.com Sat Jun 22 23:16:43 2019 From: laurentfdumont at gmail.com (Laurent Dumont) Date: Sat, 22 Jun 2019 19:16:43 -0400 Subject: [Gluster-users] Gluster CLI - No output/no info displayed. Message-ID: Hi, I am facing a strange issue with the Gluster CLI. No matter what command is used, the CLI doesn't output anything. It's a gluster with a single node. The volumes themselves are working without any issues. coldadmin at gluster01:~$ sudo dpkg -l | grep -i gluster > ii glusterfs-client 6.3-1 amd64 > clustered file-system (client package) > ii glusterfs-common 6.3-1 amd64 > GlusterFS common libraries and translator modules > ii glusterfs-server 6.0-1 amd64 > clustered file-system (server package) > coldadmin at gluster01:~$ sudo gluster --version > glusterfs 6.0 > Repository revision: git://git.gluster.org/glusterfs.git > Copyright (c) 2006-2016 Red Hat, Inc. > GlusterFS comes with ABSOLUTELY NO WARRANTY. > It is licensed to you under your choice of the GNU Lesser > General Public License, version 3 or any later version (LGPLv3 > or later), or the GNU General Public License, version 2 (GPLv2), > in all cases as published by the Free Software Foundation. 
coldadmin at gluster01:~$ sudo gluster volume info all > coldadmin at gluster01:~$ The cli.log is filled with : root at gluster01:/var/log/glusterfs# tail cli.log > [2019-06-22 23:10:31.397945] I [cli.c:834:main] 0-cli: Started running > gluster with version 6.0 > [2019-06-22 23:10:31.398325] E [mem-pool.c:868:mem_get] > (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fc937069705] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fc93706963c] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) > [0x7fc9370a12f8] ) 0-mem-pool: invalid argument [Invalid argument] > [2019-06-22 23:13:20.895606] I [cli.c:834:main] 0-cli: Started running > gluster with version 6.0 > [2019-06-22 23:13:20.895887] E [mem-pool.c:868:mem_get] > (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f65fc2e0705] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f65fc2e063c] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) > [0x7f65fc3182f8] ) 0-mem-pool: invalid argument [Invalid argument] > [2019-06-22 23:13:22.603313] I [cli.c:834:main] 0-cli: Started running > gluster with version 6.0 > [2019-06-22 23:13:22.603581] E [mem-pool.c:868:mem_get] > (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fe7cd672705] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fe7cd67263c] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) > [0x7fe7cd6aa2f8] ) 0-mem-pool: invalid argument [Invalid argument] > [2019-06-22 23:13:47.945239] I [cli.c:834:main] 0-cli: Started running > gluster with version 6.0 > [2019-06-22 23:13:47.945481] E [mem-pool.c:868:mem_get] > (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fd31bb2c705] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fd31bb2c63c] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) > [0x7fd31bb642f8] ) 0-mem-pool: invalid argument [Invalid argument] > [2019-06-22 23:14:09.015151] I [cli.c:834:main] 0-cli: Started running > gluster with version 6.0 > [2019-06-22 23:14:09.015461] E [mem-pool.c:868:mem_get] > (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f471963b705] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f471963b63c] > -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) > [0x7f47196732f8] ) 0-mem-pool: invalid argument [Invalid argument] The strange part is that cmd_history.log is not logging my commands anymore :( root at gluster01:/var/log/glusterfs# tail cmd_history.log > [2019-05-19 21:45:27.814905] : volume start media : FAILED : Volume media > already started > [2019-05-19 21:45:32.630507] : volume status all : SUCCESS > [2019-05-19 21:45:32.632032] : volume status all : SUCCESS > [2019-05-19 21:45:32.644639] : volume status all : SUCCESS > [2019-05-19 21:46:21.691147] : volume status all : SUCCESS > [2019-05-19 21:46:21.692664] : volume status all : SUCCESS > [2019-05-19 21:46:21.706471] : volume status all : SUCCESS > [2019-05-19 21:48:15.418905] : volume status all : SUCCESS > [2019-05-19 21:48:15.420487] : volume status all : SUCCESS > [2019-05-19 21:48:15.422784] : volume status all : SUCCESS I have this old bug from 2015, but the issue seems to be purely cosmetic. https://bugzilla.redhat.com/show_bug.cgi?id=1243753 -------------- next part -------------- An HTML attachment was scrubbed... 
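One detail worth noting in the dpkg output above: glusterfs-server is at
6.0-1 while glusterfs-client and glusterfs-common are at 6.3-1. Whether or
not that mismatch is the cause here, a quick sanity check for it - a rough
sketch assuming a Debian-style system with dpkg-query available - is:

    # Count the distinct versions across installed gluster packages;
    # anything greater than 1 means the packages are out of sync.
    dpkg-query -W -f='${Version}\n' 'glusterfs*' | sort -u | wc -l

If it prints more than 1, aligning all gluster packages to the same version
is a sensible first step before digging further into the CLI logs.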
URL: 

From mabi at protonmail.ch  Sun Jun 23 15:04:23 2019
From: mabi at protonmail.ch (mabi)
Date: Sun, 23 Jun 2019 15:04:23 +0000
Subject: [Gluster-users] GlusterFS 4.1.9 Debian stretch packages missing
Message-ID: 

Hello,

I would like to upgrade my GlusterFS 4.1.8 cluster to 4.1.9 on my Debian
stretch nodes. Unfortunately the packages are missing, as you can see here:

https://download.gluster.org/pub/gluster/glusterfs/4.1/4.1.9/Debian/stretch/amd64/apt/

As far as I know GlusterFS 4.1 is not yet EOL, so I don't understand why
the packages are missing... Maybe an error? Could someone please check?

Thank you very much in advance.

Best,
M.

From hgowtham at redhat.com  Mon Jun 24 09:34:28 2019
From: hgowtham at redhat.com (hgowtham at redhat.com)
Date: Mon, 24 Jun 2019 09:34:28 +0000
Subject: [Gluster-users] Invitation: Gluster Community Meeting (APAC friendly hours) @ Tue Jun 25, 2019 11:30am - 12:30pm (IST) (gluster-users@gluster.org)
Message-ID: <000000000000c555a7058c0e84de@google.com>

You have been invited to the following event.

Title: Gluster Community Meeting (APAC friendly hours)

Hi all, This is the biweekly Gluster community meeting that is hosted to
collaborate and make the community better. Please do join the discussion.

Bridge: https://bluejeans.com/836554017
Minutes meeting: https://hackmd.io/PEnYhQziQsyBwhMksbRWUw
Previous Meeting notes: https://github.com/gluster/community

Regards, Hari.

When: Tue Jun 25, 2019 11:30am - 12:30pm India Standard Time - Kolkata
Calendar: gluster-users at gluster.org
Who:
* hgowtham at redhat.com - organizer
* gluster-users
* gluster-devel

Event details: https://www.google.com/calendar/event?action=VIEW&eid=N3IzZ3FtanYyaHIwNWhqaDhuaW5nN3ZuMHEgZ2x1c3Rlci11c2Vyc0BnbHVzdGVyLm9yZw&tok=MTkjaGdvd3RoYW1AcmVkaGF0LmNvbTQ5MDdlZTI5YmEyYTFhYjE0N2ExZDgxODZiZDMwOTAyMjcyODRiMTc&ctz=Asia%2FKolkata&hl=en&es=0

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/calendar
Size: 1842 bytes
Desc: not available
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: invite.ics
Type: application/ics
Size: 1880 bytes
Desc: not available
URL: 

From spisla80 at gmail.com  Mon Jun 24 12:25:54 2019
From: spisla80 at gmail.com (David Spisla)
Date: Mon, 24 Jun 2019 14:25:54 +0200
Subject: [Gluster-users] Fwd: Pending heal status when deleting files which are marked as to be healed
In-Reply-To: 
References: <165ac2cb-4e12-81bf-bb47-ef800adf6652@redhat.com>
Message-ID: 

---------- Forwarded message ---------
From: David Spisla
Date: Fri, Jun 21, 2019 at 10:02
Subject: Re: [Gluster-users] Pending heal status when deleting files which are marked as to be healed
To: Ravishankar N

Hello Ravi,
On Wed, Jun 19, 2019 at 18:06, Ravishankar N <ravishankar at redhat.com> wrote:

>
> On 17/06/19 3:45 PM, David Spisla wrote:
>
> Hello Gluster Community,
>
> my latest observation concerns the self heal daemon:
> Scenario: 2 Node Gluster v5.5 Cluster with Replica 2 Volume. Just one
> brick per node. Access via SMB Client from a Win10 machine
>
> How to reproduce:
> I have created a small folder with a lot of small files and I copied that
> folder recursively into itself a few times. Additionally I copied three
> big folders with a lot of content into the root of the volume.
> Note: There was no node down or anything else like a brick down, etc., so
> the whole volume was accessible.
>
> Because of the recursive copy action all these copied files were listed
> as to be healed (via gluster heal info).
>
> This is odd. How did you conclude that writing to the volume (i.e.
> recursive copy) was the reason for the files needing heal? Did you
> check if there were any gluster messages about disconnects in the smb
> client logs?
>
There was no disconnection, I am sure. But overall I am not really sure
what the cause of this problem is.

> Now I set some of the affected files ReadOnly (they get WORMed because
> worm-file-level is enabled). After this I tried to delete the parent folder
> of those files.
>
> Expected: All files should be healed
> Actually: All files which are Read-Only are not healed. heal info
> permanently shows that these files have to be healed.
>
> Does disabling read-only let the files be healed?
>
I have to try this.

> glustershd log throws errors and brick log (with level DEBUG) permanently
> throws a lot of messages which I don't understand. See the attached file
> which contains all the information, also heal info and volume info, besides
> the logs
>
> Maybe some of you know what's going on there? Since we can reproduce this
> scenario, we can give more debug information if needed.
>
> Is it possible to script the list of steps to reproduce this issue?
>
I will do that and post it here. Although I will collect more data when it
happens

Regards
David

> Regards,
>
> Ravi
>
> Regards
> David Spisla
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From spisla80 at gmail.com  Mon Jun 24 13:33:45 2019
From: spisla80 at gmail.com (David Spisla)
Date: Mon, 24 Jun 2019 15:33:45 +0200
Subject: [Gluster-users] Pending heal status when deleting files which are marked as to be healed
In-Reply-To: 
References: <165ac2cb-4e12-81bf-bb47-ef800adf6652@redhat.com>
Message-ID: 

Hello Ravi and Gluster Community,

On Mon, Jun 24, 2019 at 14:25, David Spisla wrote:

>
> ---------- Forwarded message ---------
> From: David Spisla
> Date: Fri, Jun 21, 2019 at 10:02
> Subject: Re: [Gluster-users] Pending heal status when deleting files
> which are marked as to be healed
> To: Ravishankar N
>
> Hello Ravi,
>
> On Wed, Jun 19, 2019 at 18:06, Ravishankar N <ravishankar at redhat.com>
> wrote:
>
>>
>> On 17/06/19 3:45 PM, David Spisla wrote:
>>
>> Hello Gluster Community,
>>
>> my latest observation concerns the self heal daemon:
>> Scenario: 2 Node Gluster v5.5 Cluster with Replica 2 Volume. Just one
>> brick per node.
Access via SMB Client from a Win10 machine
>>
>> How to reproduce:
>> I have created a small folder with a lot of small files and I copied that
>> folder recursively into itself a few times. Additionally I copied three
>> big folders with a lot of content into the root of the volume.
>> Note: There was no node down or anything else like a brick down, etc., so
>> the whole volume was accessible.
>>
>> Because of the recursive copy action all these copied files were listed
>> as to be healed (via gluster heal info).
>>
>> This is odd. How did you conclude that writing to the volume (i.e.
>> recursive copy) was the reason for the files needing heal? Did you
>> check if there were any gluster messages about disconnects in the smb
>> client logs?
>>
> There was no disconnection, I am sure. But overall I am not really sure
> what the cause of this problem is.
>
I reproduced it. Now I don't think that recursive copy is the reason. I
copied several small files into the volume (capacity 1GB) until it was full
(see steps to reproduce below). I didn't set RO on the files. There was
never a disconnection.

>> Now I set some of the affected files ReadOnly (they get WORMed because
>> worm-file-level is enabled). After this I tried to delete the parent folder
>> of those files.
>>
>> Expected: All files should be healed
>> Actually: All files which are Read-Only are not healed. heal info
>> permanently shows that these files have to be healed.
>>
>> Does disabling read-only let the files be healed?
>>
> I have to try this.
>
I tried it out and it had no effect.

>> glustershd log throws errors and brick log (with level DEBUG) permanently
>> throws a lot of messages which I don't understand. See the attached file
>> which contains all the information, also heal info and volume info, besides
>> the logs
>>
>> Maybe some of you know what's going on there? Since we can reproduce this
>> scenario, we can give more debug information if needed.
>>
>> Is it possible to script the list of steps to reproduce this issue?
>>
> I will do that and post it here. Although I will collect more data when
> it happens
>
Steps to reproduce:

1. Copy several small files into a volume (here: 1GB capacity)
2. Copy until the volume is nearly full (70-80% or more)
3. Now self-heal is listing files to be healed
4. Move or delete all of these files, or just a part.
5. The files won't be healed and stay in the heal info list.

In my case I copied until the volume was 100% full (storage.reserve was
1%). I deleted some of the files to get to a level of 98%. I waited for a
while but nothing happened. After this I stopped and started the volume.
Files are now healed.
Attached is the glustershd.log where you can see that performing
entry.self-heal (2019-06-24 10:04:02.007328) could not be finished for
pgfid:7e4fa649-434a-4bb7-a1c2-258818d76076 until the volume was stopped and
started again. After starting again, entry.self-heal could be finished for
that pgfid (at 2019-06-24 12:38:38.689632).
The pgfid refers to the files which were listed to be healed:

fs-davids-c2-n1:~ # gluster vo heal archive1 info
Brick fs-davids-c2-n1:/gluster/brick1/glusterbrick
/archive1/data/fff/gg - Kopie.txt
/archive1/data/fff
/archive1/data/fff/gg - Kopie - Kopie.txt
/archive1/data/fff/gg - Kopie - Kopie (2).txt
Status: Connected
Number of entries: 4

All of these files have the same pgfid:

fs-davids-c2-n1:~ # getfattr -e hex -d -m "" '/gluster/brick1/glusterbrick/archive1/data/fff/'* | grep trusted.pgfid
getfattr: Removing leading '/' from absolute path names
trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001
trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001
trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001

Summary: The pending heal problem seems to occur if a volume is nearly full
or completely full.

Regards
David Spisla

> Regards
> David
>
>> Regards,
>>
>> Ravi
>>
>> Regards
>> David Spisla
>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: glustershd.log
Type: application/octet-stream
Size: 51236 bytes
Desc: not available
URL: 

From spisla80 at gmail.com  Mon Jun 24 13:45:29 2019
From: spisla80 at gmail.com (David Spisla)
Date: Mon, 24 Jun 2019 15:45:29 +0200
Subject: [Gluster-users] Pending heal status when deleting files which are marked as to be healed
In-Reply-To: 
References: <165ac2cb-4e12-81bf-bb47-ef800adf6652@redhat.com>
Message-ID: 

Additional information: after the volume was 100% full, I deleted some of
the files, but not the files which are listed in heal info. When it was at
98%, I deleted the folder which was marked as to be healed:
/archive1/data/fff. After stopping and starting the volume, the files in
/archive1/data/fff were still there.

Regards
David Spisla

On Mon, Jun 24, 2019 at 15:33, David Spisla wrote:

> Hello Ravi and Gluster Community,
>
> On Mon, Jun 24, 2019 at 14:25, David Spisla <
> spisla80 at gmail.com> wrote:
>
>>
>> ---------- Forwarded message ---------
>> From: David Spisla
>> Date: Fri, Jun 21, 2019 at 10:02
>> Subject: Re: [Gluster-users] Pending heal status when deleting files
>> which are marked as to be healed
>> To: Ravishankar N
>>
>> Hello Ravi,
>>
>> On Wed, Jun 19, 2019 at 18:06, Ravishankar N <
>> ravishankar at redhat.com> wrote:
>>
>>>
>>> On 17/06/19 3:45 PM, David Spisla wrote:
>>>
>>> Hello Gluster Community,
>>>
>>> my latest observation concerns the self heal daemon:
>>> Scenario: 2 Node Gluster v5.5 Cluster with Replica 2 Volume. Just one
>>> brick per node. Access via SMB Client from a Win10 machine
>>>
>>> How to reproduce:
>>> I have created a small folder with a lot of small files and I copied
>>> that folder recursively into itself a few times. Additionally I copied
>>> three big folders with a lot of content into the root of the volume.
>>> Note: There was no node down or anything else like a brick down, etc., so
>>> the whole volume was accessible.
>>>
>>> Because of the recursive copy action all these copied files were listed
>>> as to be healed (via gluster heal info).
>>>
>>> This is odd. How did you conclude that writing to the volume (i.e.
>>> recursive copy) was the reason for the files needing heal?
Did you
>>> check if there were any gluster messages about disconnects in the smb
>>> client logs?
>>>
>> There was no disconnection, I am sure. But overall I am not really sure
>> what the cause of this problem is.
>>
> I reproduced it. Now I don't think that recursive copy is the reason. I
> copied several small files into the volume (capacity 1GB) until it was full
> (see steps to reproduce below). I didn't set RO on the files. There was
> never a disconnection.
>
>>> Now I set some of the affected files ReadOnly (they get WORMed because
>>> worm-file-level is enabled). After this I tried to delete the parent folder
>>> of those files.
>>>
>>> Expected: All files should be healed
>>> Actually: All files which are Read-Only are not healed. heal info
>>> permanently shows that these files have to be healed.
>>>
>>> Does disabling read-only let the files be healed?
>>>
>> I have to try this.
>>
> I tried it out and it had no effect.
>
>>> glustershd log throws errors and brick log (with level DEBUG) permanently
>>> throws a lot of messages which I don't understand. See the attached file
>>> which contains all the information, also heal info and volume info, besides
>>> the logs
>>>
>>> Maybe some of you know what's going on there? Since we can reproduce this
>>> scenario, we can give more debug information if needed.
>>>
>>> Is it possible to script the list of steps to reproduce this issue?
>>>
>> I will do that and post it here. Although I will collect more data when
>> it happens
>>
> Steps to reproduce:
>
> 1. Copy several small files into a volume (here: 1GB capacity)
> 2. Copy until the volume is nearly full (70-80% or more)
> 3. Now self-heal is listing files to be healed
> 4. Move or delete all of these files, or just a part.
> 5. The files won't be healed and stay in the heal info list.
>
> In my case I copied until the volume was 100% full (storage.reserve was
> 1%). I deleted some of the files to get to a level of 98%. I waited for a
> while but nothing happened. After this I stopped and started the volume.
> Files are now healed.
> Attached is the glustershd.log where you can see that performing
> entry.self-heal (2019-06-24 10:04:02.007328) could not be finished for
> pgfid:7e4fa649-434a-4bb7-a1c2-258818d76076 until the volume was stopped and
> started again. After starting again, entry.self-heal could be finished for
> that pgfid (at 2019-06-24 12:38:38.689632). The pgfid refers to the files
> which were listed to be healed:
>
> fs-davids-c2-n1:~ # gluster vo heal archive1 info
> Brick fs-davids-c2-n1:/gluster/brick1/glusterbrick
> /archive1/data/fff/gg - Kopie.txt
> /archive1/data/fff
> /archive1/data/fff/gg - Kopie - Kopie.txt
> /archive1/data/fff/gg - Kopie - Kopie (2).txt
> Status: Connected
> Number of entries: 4
>
> All of these files have the same pgfid:
>
> fs-davids-c2-n1:~ # getfattr -e hex -d -m ""
> '/gluster/brick1/glusterbrick/archive1/data/fff/'* | grep trusted.pgfid
> getfattr: Removing leading '/' from absolute path names
> trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001
> trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001
> trusted.pgfid.7e4fa649-434a-4bb7-a1c2-258818d76076=0x00000001
>
> Summary: The pending heal problem seems to occur if a volume is nearly
> full or completely full.
>
> Regards
> David Spisla
>
>> Regards
>> David
>>
>>> Regards,
>>>
>>> Ravi
>>>
>>> Regards
>>> David Spisla
>>>
>>> _______________________________________________
>>> Gluster-users mailing list
>>> Gluster-users at gluster.org
>>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dave at sherohman.org  Tue Jun 25 09:13:30 2019
From: dave at sherohman.org (Dave Sherohman)
Date: Tue, 25 Jun 2019 04:13:30 -0500
Subject: [Gluster-users] Removing subvolume from dist/rep volume
Message-ID: <20190625091330.GU19805@sherohman.org>

I have a 9-brick, replica 2+A cluster and plan to (permanently) remove
one of the three subvolumes. I think I've worked out how to do it, but
want to verify first that I've got it right, since downtime or data loss
would be Bad Things.

The current configuration has six data bricks across six hosts (B
through G), and all three arbiter bricks on the same host (A), such as
one might create with

# gluster volume create myvol replica 3 arbiter 1 B:/data C:/data A:/arb1 D:/data E:/data A:/arb2 F:/data G:/data A:/arb3

My objective is to remove nodes B and C entirely.

First up is to pull their bricks from the volume:

# gluster volume remove-brick myvol B:/data C:/data A:/arb1 start
(wait for data to be migrated)
# gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit

And then remove the nodes with:

# gluster peer detach B
# gluster peer detach C

Is this correct, or did I forget any steps and/or mangle the syntax on
any commands?

Also, for the remove-brick command, is there any way to throttle the
amount of bandwidth which will be used for the data migration?
Unfortunately, I was not able to provision a dedicated VLAN for the
gluster servers to communicate among themselves, so I don't want it
hogging all available capacity if that can be avoided.

If it makes a difference, my gluster version is 3.12.15-1, running on
Debian and installed from the debs at

deb https://download.gluster.org/pub/gluster/glusterfs/3.12/LATEST/Debian/9/amd64/apt stretch main

-- 
Dave Sherohman

From filonov at hkl.hms.harvard.edu  Wed Jun 26 12:41:36 2019
From: filonov at hkl.hms.harvard.edu (Dmitry Filonov)
Date: Wed, 26 Jun 2019 08:41:36 -0400
Subject: [Gluster-users] snapshots questions
Message-ID: 

Hi,
I am really new to Gluster and have a couple of questions that I hope will
be really easy to answer. I just couldn't find anything on that myself.

I did set up a replica 3 gluster over 3 nodes with a 2TB SSD in each node.
To have snapshot functionality I have created a thin pool the size of the
VG (1.82TB) and then a 1.75TB thin LV inside it on each of the bricks.
It worked just fine until I scheduled creating hourly and daily snapshots
on that gluster volume. In less than 2 days my thin volume got full and
crashed. It didn't refuse to create new snapshots; it just died, as LVM
couldn't perform any operations there anymore.
So my first question is how to prevent this from happening. I could create
a smaller thin LV, but I still have no control over how much space I would
need for snapshots. I was hoping to see some warnings and errors while
creating snapshots, not a failed LVM/Gluster.

The second question is related but not that important. Is there a way to
schedule snapshot removal in cron? gluster snapshot delete requires
interactive confirmation and I don't see any flag to auto-confirm snapshot
removal.
Thank you,

Fil

-- 
Dmitry Filonov
Linux Administrator
SBGrid Core | Harvard Medical School
250 Longwood Ave, SGM-114
Boston, MA 02115

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dm at belkam.com  Wed Jun 26 13:12:14 2019
From: dm at belkam.com (Dmitry Melekhov)
Date: Wed, 26 Jun 2019 17:12:14 +0400
Subject: [Gluster-users] snapshots questions
In-Reply-To: 
References: 
Message-ID: <9e6ca6a8-39c9-ee8b-4777-77ed7500c8e8@belkam.com>

26.06.2019 16:41, Dmitry Filonov wrote:
> Hi,
> I am really new to Gluster and have a couple of questions that I hope
> will be really easy to answer. I just couldn't find anything on that myself.
>
> I did set up a replica 3 gluster over 3 nodes with a 2TB SSD in each node.
> To have snapshot functionality I have created a thin pool the size of
> the VG (1.82TB) and then a 1.75TB thin LV inside it on each of the bricks.
> It worked just fine until I scheduled creating hourly and daily
> snapshots on that gluster volume. In less than 2 days my thin volume
> got full and crashed.
> It didn't refuse to create new snapshots; it just died, as LVM couldn't
> perform any operations there anymore.
> So my first question is how to prevent this from happening. I could
> create a smaller thin LV, but I still have no control over how much space
> I would need for snapshots. I was hoping to see some warnings and errors
> while creating snapshots, not a failed LVM/Gluster.
>
> The second question is related but not that important. Is there a way
> to schedule snapshot removal in cron? gluster snapshot delete requires
> interactive confirmation and I don't see any flag to auto-confirm
> snapshot removal.
>

--mode=script

> Thank you,
>
> Fil
>
> --
> Dmitry Filonov
> Linux Administrator
> SBGrid Core | Harvard Medical School
> 250 Longwood Ave, SGM-114
> Boston, MA 02115
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From vpapnoi at redhat.com  Wed Jun 26 13:35:25 2019
From: vpapnoi at redhat.com (Vinayak Papnoi)
Date: Wed, 26 Jun 2019 19:05:25 +0530
Subject: [Gluster-users] snapshots questions
In-Reply-To: 
References: 
Message-ID: 

Comments inline.

Regards,
Vinayak Papnoi
Associate Quality Engineer
Red Hat
vpapnoi at redhat.com    M: 91-9702904495    IM: vpapnoi

On Wed, Jun 26, 2019 at 6:12 PM Dmitry Filonov wrote:

> Hi,
> I am really new to Gluster and have a couple of questions that I hope
> will be really easy to answer. I just couldn't find anything on that myself.
>
> I did set up a replica 3 gluster over 3 nodes with a 2TB SSD in each node.
> To have snapshot functionality I have created a thin pool the size of the
> VG (1.82TB) and then a 1.75TB thin LV inside it on each of the bricks.
> It worked just fine until I scheduled creating hourly and daily snapshots
> on that gluster volume. In less than 2 days my thin volume got full and
> crashed.
> It didn't refuse to create new snapshots; it just died, as LVM couldn't
> perform any operations there anymore.
> So my first question is how to prevent this from happening. I could create
> a smaller thin LV, but I still have no control over how much space I would
> need for snapshots. I was hoping to see some warnings and errors while
> creating snapshots, not a failed LVM/Gluster.
>

Newly created snapshots will occupy some data and metadata space in LVM.
So the more snapshots you have, the more space will be utilized.
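A rough way to keep an eye on that space - assuming a hypothetical volume
group vg_gluster with a thin pool named thinpool - is to watch the thin
pool's data and metadata usage with lvs, e.g. from cron:

    # Data% and Meta% show how full the thin pool is; snapshots and the
    # origin LV share this space.
    lvs -o lv_name,lv_size,data_percent,metadata_percent vg_gluster

    # Warn when the pool crosses 80% - a minimal guard, not a full monitor.
    used=$(lvs --noheadings -o data_percent vg_gluster/thinpool | tr -d ' ' | cut -d. -f1)
    [ "$used" -ge 80 ] && echo "thin pool ${used}% full on $(hostname)"

The threshold and the notification mechanism are placeholders; the point is
that the thin pool, not Gluster, is where the space for snapshots actually
runs out.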
> The second question is related but not that important. Is there a way to
> schedule snapshot removal in cron? gluster snapshot delete requires
> interactive confirmation and I don't see any flag to auto-confirm snapshot
> removal.
>

As of now, no, there isn't any option to schedule removal of a snapshot.
As for auto-confirmation of any command in gluster, '--mode=script' at the
end of the command should work.
However, there is a snapshot config option "*auto-delete*" which, when
enabled, will *delete* the oldest snapshot *after crossing the
snap-max-soft-limit* (which is a set percentage of the snap-max-hard-limit).

# gluster snapshot config
Snapshot System Configuration:
snap-max-hard-limit : 256
snap-max-soft-limit : 90%
auto-delete : enable
activate-on-create : disable

Snapshot Volume Configuration:

Volume :
snap-max-hard-limit : 256
Effective snap-max-hard-limit : 256
Effective snap-max-soft-limit : 230 (90%)

Usage: snapshot config [volname] ([snap-max-hard-limit ] [snap-max-soft-limit ]) | ([auto-delete ])| ([activate-on-create ])

> Thank you,
>
> Fil
>
> --
> Dmitry Filonov
> Linux Administrator
> SBGrid Core | Harvard Medical School
> 250 Longwood Ave, SGM-114
> Boston, MA 02115
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From filonov at hkl.hms.harvard.edu  Wed Jun 26 13:48:52 2019
From: filonov at hkl.hms.harvard.edu (Dmitry Filonov)
Date: Wed, 26 Jun 2019 09:48:52 -0400
Subject: [Gluster-users] snapshots questions
In-Reply-To: 
References: 
Message-ID: 

Newly created snapshots will occupy some data and metadata space in LVM.
So the more snapshots you have, the more space will be utilized.

Yes, I understand that. The question is how to get snapshots to error out
instead of breaking LVM completely. Is there any way to tell snapshots to
fail if, after creating that snapshot, the pool won't have a certain amount
of free space?

> As of now, no, there isn't any option to schedule removal of a
> snapshot. As for auto-confirmation of any command in gluster,
> '--mode=script' at the end of the command should work.
> However, there is a snapshot config option "*auto-delete*" which, when
> enabled, will *delete* the oldest snapshot *after crossing the
> snap-max-soft-limit* (which is a set percentage of the
> snap-max-hard-limit).
>

Auto-delete is not suited for proper snapshot handling (i.e. keeping a
certain number of hourly/daily/monthly snapshots) but --mode=script would
do.

Thank you!

Fil

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From srakonde at redhat.com  Thu Jun 27 04:13:52 2019
From: srakonde at redhat.com (Sanju Rakonde)
Date: Thu, 27 Jun 2019 09:43:52 +0530
Subject: [Gluster-users] Gluster CLI - No output/no info displayed.
In-Reply-To: 
References: 
Message-ID: 

On Sun, Jun 23, 2019 at 4:54 AM Laurent Dumont wrote:

> Hi,
>
> I am facing a strange issue with the Gluster CLI. No matter what command
> is used, the CLI doesn't output anything. It's a gluster with a single
> node. The volumes themselves are working without any issues.
> > coldadmin at gluster01:~$ sudo dpkg -l | grep -i gluster >> ii glusterfs-client 6.3-1 amd64 >> clustered file-system (client package) >> ii glusterfs-common 6.3-1 amd64 >> GlusterFS common libraries and translator modules >> ii glusterfs-server 6.0-1 amd64 >> clustered file-system (server package) > > Looks like, you've missed installing glusterfs-cli package. This is why you are unable to execute any commands using cli. Thanks, Sanju > >> coldadmin at gluster01:~$ sudo gluster --version >> glusterfs 6.0 >> Repository revision: git://git.gluster.org/glusterfs.git >> Copyright (c) 2006-2016 Red Hat, Inc. >> GlusterFS comes with ABSOLUTELY NO WARRANTY. >> It is licensed to you under your choice of the GNU Lesser >> General Public License, version 3 or any later version (LGPLv3 >> or later), or the GNU General Public License, version 2 (GPLv2), >> in all cases as published by the Free Software Foundation. > > > coldadmin at gluster01:~$ sudo gluster volume info all >> coldadmin at gluster01:~$ > > > The cli.log is filled with : > > root at gluster01:/var/log/glusterfs# tail cli.log >> [2019-06-22 23:10:31.397945] I [cli.c:834:main] 0-cli: Started running >> gluster with version 6.0 >> [2019-06-22 23:10:31.398325] E [mem-pool.c:868:mem_get] >> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fc937069705] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fc93706963c] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >> [0x7fc9370a12f8] ) 0-mem-pool: invalid argument [Invalid argument] >> [2019-06-22 23:13:20.895606] I [cli.c:834:main] 0-cli: Started running >> gluster with version 6.0 >> [2019-06-22 23:13:20.895887] E [mem-pool.c:868:mem_get] >> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f65fc2e0705] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f65fc2e063c] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >> [0x7f65fc3182f8] ) 0-mem-pool: invalid argument [Invalid argument] >> [2019-06-22 23:13:22.603313] I [cli.c:834:main] 0-cli: Started running >> gluster with version 6.0 >> [2019-06-22 23:13:22.603581] E [mem-pool.c:868:mem_get] >> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fe7cd672705] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fe7cd67263c] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >> [0x7fe7cd6aa2f8] ) 0-mem-pool: invalid argument [Invalid argument] >> [2019-06-22 23:13:47.945239] I [cli.c:834:main] 0-cli: Started running >> gluster with version 6.0 >> [2019-06-22 23:13:47.945481] E [mem-pool.c:868:mem_get] >> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fd31bb2c705] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fd31bb2c63c] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >> [0x7fd31bb642f8] ) 0-mem-pool: invalid argument [Invalid argument] >> [2019-06-22 23:14:09.015151] I [cli.c:834:main] 0-cli: Started running >> gluster with version 6.0 >> [2019-06-22 23:14:09.015461] E [mem-pool.c:868:mem_get] >> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f471963b705] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f471963b63c] >> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >> [0x7f47196732f8] ) 0-mem-pool: invalid argument [Invalid argument] > > > The strange part is that cmd_history.log is not logging my commands > anymore :( > > root at gluster01:/var/log/glusterfs# tail cmd_history.log >> [2019-05-19 21:45:27.814905] : volume start media : 
FAILED : Volume
>> media already started
>> [2019-05-19 21:45:32.630507] : volume status all : SUCCESS
>> [2019-05-19 21:45:32.632032] : volume status all : SUCCESS
>> [2019-05-19 21:45:32.644639] : volume status all : SUCCESS
>> [2019-05-19 21:46:21.691147] : volume status all : SUCCESS
>> [2019-05-19 21:46:21.692664] : volume status all : SUCCESS
>> [2019-05-19 21:46:21.706471] : volume status all : SUCCESS
>> [2019-05-19 21:48:15.418905] : volume status all : SUCCESS
>> [2019-05-19 21:48:15.420487] : volume status all : SUCCESS
>> [2019-05-19 21:48:15.422784] : volume status all : SUCCESS
>
> I have this old bug from 2015, but the issue seems to be purely cosmetic.
>
> https://bugzilla.redhat.com/show_bug.cgi?id=1243753
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

-- 
Thanks,
Sanju

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From hunter86_bg at yahoo.com  Thu Jun 27 05:53:22 2019
From: hunter86_bg at yahoo.com (Strahil)
Date: Thu, 27 Jun 2019 08:53:22 +0300
Subject: [Gluster-users] snapshots questions
Message-ID: 

If it expects a single word like 'y' or 'yes', then you can try:
echo 'yes' | gluster snapshot delete $(/my/script/to/find/oldest/snapshot)

Of course, you should put some logic in order to find the oldest snapshot,
but that won't be hard, as date & time of creation should be in the name.

About the situation with the LVM, it is expected that the user takes care
of that, as thin LVs can be overcommitted.

For example my arbiter has a 20GB thin LV pool and I have 4 20GB LVs inside
that pool.
As long as I don't exhaust the pool's storage - I'm fine.

You shouldn't expect that LVM will play the monitoring role here - either
put some kind of monitoring in place, or create your own solution to
monitor that fact.

Best Regards,
Strahil Nikolov

On Jun 26, 2019 15:41, Dmitry Filonov wrote:
>
> Hi,
> I am really new to Gluster and have a couple of questions that I hope
> will be really easy to answer. I just couldn't find anything on that myself.
>
> I did set up a replica 3 gluster over 3 nodes with a 2TB SSD in each node.
> To have snapshot functionality I have created a thin pool the size of the
> VG (1.82TB) and then a 1.75TB thin LV inside it on each of the bricks.
> It worked just fine until I scheduled creating hourly and daily snapshots
> on that gluster volume. In less than 2 days my thin volume got full and
> crashed.
> It didn't refuse to create new snapshots; it just died, as LVM couldn't
> perform any operations there anymore.
> So my first question is how to prevent this from happening. I could create
> a smaller thin LV, but I still have no control over how much space I would
> need for snapshots. I was hoping to see some warnings and errors while
> creating snapshots, not a failed LVM/Gluster.
>
> The second question is related but not that important. Is there a way to
> schedule snapshot removal in cron? gluster snapshot delete requires
> interactive confirmation and I don't see any flag to auto-confirm snapshot
> removal.
>
> Thank you,
>
> Fil
>
> --
> Dmitry Filonov
> Linux Administrator
> SBGrid Core | Harvard Medical School
> 250 Longwood Ave, SGM-114
> Boston, MA 02115

-------------- next part --------------
An HTML attachment was scrubbed...
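As a sketch of the rotation logic Strahil describes - with VOL and KEEP as
placeholders, and assuming the snapshot names embed their creation time so
that a plain sort orders them oldest-first - a cron job could look like:

    #!/bin/sh
    # Keep the newest KEEP snapshots of VOL, delete the rest.
    VOL=myvol
    KEEP=24
    gluster snapshot list "$VOL" | sort | head -n -"$KEEP" |
    while read -r snap; do
        # --mode=script suppresses the interactive y/n confirmation
        gluster snapshot delete "$snap" --mode=script
    done

head -n -"$KEEP" (print all but the last KEEP lines) is a GNU coreutils
feature, so this particular form is Linux-specific.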
URL: From nbalacha at redhat.com Thu Jun 27 06:47:10 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Thu, 27 Jun 2019 12:17:10 +0530 Subject: [Gluster-users] Removing subvolume from dist/rep volume In-Reply-To: <20190625091330.GU19805@sherohman.org> References: <20190625091330.GU19805@sherohman.org> Message-ID: Hi, On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote: > I have a 9-brick, replica 2+A cluster and plan to (permanently) remove > one of the three subvolumes. I think I've worked out how to do it, but > want to verify first that I've got it right, since downtime or data loss > would be Bad Things. > > The current configuration has six data bricks across six hosts (B > through G), and all three arbiter bricks on the same host (A), such as > one might create with > > # gluster volume create myvol replica 3 arbiter 1 B:/data C:/data A:/arb1 > D:/data E:/data A:/arb2 F:/data G:/data A:/arb3 > > > My objective is to remove nodes B and C entirely. > > First up is to pull their bricks from the volume: > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start > (wait for data to be migrated) > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit > > There are some edge cases that may prevent a file from being migrated during a remove-brick. Please do the following after this: 1. Check the remove-brick status for any failures. If there are any, check the rebalance log file for errors. 2. Even if there are no failures, check the removed bricks to see if any files have not been migrated. If there are any, please check that they are valid files on the brick and copy them to the volume from the brick to the mount point. The rest of the steps look good. Regards, Nithya > And then remove the nodes with: > > # gluster peer detach B > # gluster peer detach C > > > Is this correct, or did I forget any steps and/or mangle the syntax on > any commands? > > Also, for the remove-brick command, is there any way to throttle the > amount of bandwidth which will be used for the data migration? > Unfortunately, I was not able to provision a dedicated VLAN for the > gluster servers to communicate among themselves, so I don't want it > hogging all available capacity if that can be avoided. > > > If it makes a difference, my gluster version is 3.12.15-1, running on > Debian and installed from the debs at > > deb > https://download.gluster.org/pub/gluster/glusterfs/3.12/LATEST/Debian/9/amd64/apt > stretch main > > -- > Dave Sherohman > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Thu Jun 27 06:49:18 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Thu, 27 Jun 2019 12:19:18 +0530 Subject: [Gluster-users] Removing subvolume from dist/rep volume In-Reply-To: References: <20190625091330.GU19805@sherohman.org> Message-ID: On Thu, 27 Jun 2019 at 12:17, Nithya Balachandran wrote: > Hi, > > > On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote: > >> I have a 9-brick, replica 2+A cluster and plan to (permanently) remove >> one of the three subvolumes. I think I've worked out how to do it, but >> want to verify first that I've got it right, since downtime or data loss >> would be Bad Things. 
>> The current configuration has six data bricks across six hosts (B
>> through G), and all three arbiter bricks on the same host (A), such as
>> one might create with
>>
>> # gluster volume create myvol replica 3 arbiter 1 B:/data C:/data A:/arb1
>> D:/data E:/data A:/arb2 F:/data G:/data A:/arb3
>>
>> My objective is to remove nodes B and C entirely.
>>
>> First up is to pull their bricks from the volume:
>>
>> # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start
>> (wait for data to be migrated)
>> # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit
>>
> There are some edge cases that may prevent a file from being migrated
> during a remove-brick. Please do the following after this:
>
> 1. Check the remove-brick status for any failures. If there are any,
> check the rebalance log file for errors.
> 2. Even if there are no failures, check the removed bricks to see if
> any files have not been migrated. If there are any, please check that they
> are valid files on the brick and that they match on both bricks (files are
> not in split-brain), and copy them from the brick to the volume via the
> mount point.
>
You can run the following at the root of the brick to find any files that
have not been migrated:

find . -not \( -path ./.glusterfs -prune \) -type f -not -perm 01000

> The rest of the steps look good.
>
> Regards,
> Nithya
>
>> And then remove the nodes with:
>>
>> # gluster peer detach B
>> # gluster peer detach C
>>
>> Is this correct, or did I forget any steps and/or mangle the syntax on
>> any commands?
>>
>> Also, for the remove-brick command, is there any way to throttle the
>> amount of bandwidth which will be used for the data migration?
>> Unfortunately, I was not able to provision a dedicated VLAN for the
>> gluster servers to communicate among themselves, so I don't want it
>> hogging all available capacity if that can be avoided.
>>
>> If it makes a difference, my gluster version is 3.12.15-1, running on
>> Debian and installed from the debs at
>>
>> deb https://download.gluster.org/pub/gluster/glusterfs/3.12/LATEST/Debian/9/amd64/apt stretch main
>>
>> --
>> Dave Sherohman
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-users
>>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From hunter86_bg at yahoo.com  Thu Jun 27 07:19:39 2019
From: hunter86_bg at yahoo.com (Strahil)
Date: Thu, 27 Jun 2019 10:19:39 +0300
Subject: [Gluster-users] Regular gluster meetings
Message-ID: 

Hello All,

Sadly I got an invite to the Asian session of the gluster meetings.
As it was too early for me (travelling to work) - can we move it a little
bit later?
Do we have an EU/US session, or are there not enough people for such a
session?

Best Regards,
Strahil Nikolov

-------------- next part --------------
An HTML attachment was scrubbed...
You can find out the details here - https://github.com/gluster/community --- Ashish ----- Original Message ----- From: "Strahil" To: "gluster-users" Sent: Thursday, June 27, 2019 12:49:39 PM Subject: [Gluster-users] Regular gluster meetings Hello All, Sadly I got an invite to the Asian session of the gluster meetings. As it was too early for me (travelling to work) - can we move it a little bit later. Do we have EU/US session, or there are not enough people for such session ? Best Regards, Strahil Nikolov _______________________________________________ Gluster-users mailing list Gluster-users at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From mailinglists at lucassen.org Thu Jun 27 08:33:46 2019 From: mailinglists at lucassen.org (richard lucassen) Date: Thu, 27 Jun 2019 10:33:46 +0200 Subject: [Gluster-users] very poor performance on Debian Buster Message-ID: <20190627103346.c70621217f5f3e7cce4ddb3f@lucassen.org> I run glusterfs server on a sys-V version of Debian Gluster. The machine is an 8-core/256GB/SSD server and I want to copy 400GB to a mounted gluster device. The copy now runs for more than 3 days and it has only copied 243GB. The network activity is around 4 to 8 Mbit. Is this a known issue of version 5.5-3? I did not touch the defaults. R. -- richard lucassen http://contact.xaq.nl/ From hunter86_bg at yahoo.com Thu Jun 27 09:52:52 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Thu, 27 Jun 2019 12:52:52 +0300 Subject: [Gluster-users] very poor performance on Debian Buster Message-ID: Hi Richard, Let's try to get some info. 1. gluster volume info will provide valuable information 2. Based on previous (step 1) info, you can check the bricks via 'findmnt /path/to/brick/mountpoint' which actually is a mount point for your storage and will hold 1 or more bricks. Ex: /gluster_bricks/volume1/volume1 is my brick and my LV is mounted on /gluster_bricks/volume1 Findmnt will show filesystem, mount point and mount options. 3. Next get some info with iostat to show what is going on the brick (I guess its close to idle, otherwise you won't be here) 4. Check network usage. I prefer iftop, but you can use other stuff 5. Previous steps can show if a brick (or multiple bricks in distributed cluster) is actually a bottleneck of your performance 6. You can get a gluster profile for analysis. It consists of starting, get info after some time and stop For details check: https://docs.gluster.org/en/latest/Administrator%20Guide/Monitoring%20Workload/ 7. What kind of workload are you uploading ? Is it miliions of small files, the depth of the directories (dirA/dirB/dirC/dirN ... etc) 8. What is the tuned profile on the gluster nodes ? Use 'tuned-adm active'. 9. What kind of connection do you use - FUSE, libgfapi, built-in nfs/cifs , nfs ganesha ? Best Regards, Strahil NikolovOn Jun 27, 2019 11:33, richard lucassen wrote: > > I run glusterfs server on a sys-V version of Debian Gluster. The > machine is an 8-core/256GB/SSD server and I want to copy 400GB to a > mounted gluster device. The copy now runs for more than 3 days and it > has only copied 243GB. The network activity is around 4 to 8 Mbit. > > Is this a known issue of version 5.5-3? I did not touch the defaults. > > R. 
> > -- > richard lucassen > http://contact.xaq.nl/ > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From filonov at hkl.hms.harvard.edu Thu Jun 27 10:02:32 2019 From: filonov at hkl.hms.harvard.edu (Dmitry Filonov) Date: Thu, 27 Jun 2019 06:02:32 -0400 Subject: [Gluster-users] snapshots questions In-Reply-To: References: Message-ID: Thank you, Strahil - I was pointed to --mode=script option that works perfectly for me. As for snapshots - am spoiled with ZFS that has much better reporting and tools to work with snapshots. Will do some internal monitoring and checks around snapshots. Was hoping am just missing something. Thanks, Fil On Thu, Jun 27, 2019, 1:53 AM Strahil wrote: > If it expects a single word like 'y' or 'yes' , then you can try: > echo 'yes' | gluster snapshot delete $(/my/script/to/find/oldest/snapshot) > > Of course, you should put some logic in order to find the oldest snapshot, > bit that won't be hard as date & time of creation should be in the name. > > About the situation with the LVM, it is expected that the user takes care > of that, as thin LVs can be overcommitted. > > For example my arbiter has 20 GB thin LV pool and I have 4 20GB LVs inside > that pool. > As long as I don't exhaust the pool's storage - I'm fine. > > You shouldb't expect that LVM will play the monitoring role here - either > put some kind of monitoring, or create your own solution to monitor that > fact. > > Best Regards, > Strahil Nikolov > On Jun 26, 2019 15:41, Dmitry Filonov wrote: > > Hi, > am really new to gluster and have couple question that I hope will be > really easy to answer. Just couldn't find anything on that myself. > > I did set up replica 3 gluster over 3 nodes with 2TB SSD in each node. > To have snapshot functionality I have created thin pool of the size of VG > (1.82TB) and then 1.75TB thin LVM inside on each of the bricks. > It worked just fine until I scheduled creating hourly and daily snapshots > on that gluster volume. In less than 2 days my thin volume got full and > crashed. > Not refused creating new snapshots, but just died as LVM couldn't perform > any operations there anymore. > So my first question is how to prevent this from happening. I could create > smaller thin LVM, but I still have no control how much space I would need > for snapshots. I was hoping to see some warnings and errors while creeating > snapshots, but not failed LVM/Gluster. > > The second question is related but not that important. Is there a way to > schedule snapshot removal in cron? gluster snapshot delete requires > interactive confirmation and I don't see any flag to auto-confirm snapshot > removal. > > Thank you, > > Fil > > -- > Dmitry Filonov > Linux Administrator > SBGrid Core | Harvard Medical School > 250 Longwood Ave, SGM-114 > Boston, MA 02115 > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jun 27 10:06:48 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Thu, 27 Jun 2019 13:06:48 +0300 Subject: [Gluster-users] very poor performance on Debian Buster Message-ID: <2h1nisnfpybdfnbr5sti7d8n.1561630008584@email.android.com> I forgot to ask what kind of storage do you have on your gluster machines. Is it rotational SATA, SAS or NVMe ? Best Regards, Strahil NikolovOn Jun 27, 2019 11:33, richard lucassen wrote: > > I run glusterfs server on a sys-V version of Debian Gluster. 
The > machine is an 8-core/256GB/SSD server and I want to copy 400GB to a > mounted gluster device. The copy now runs for more than 3 days and it > has only copied 243GB. The network activity is around 4 to 8 Mbit. > > Is this a known issue of version 5.5-3? I did not touch the defaults. > > R. > > -- > richard lucassen > http://contact.xaq.nl/ > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From hunter86_bg at yahoo.com Thu Jun 27 10:11:36 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Thu, 27 Jun 2019 13:11:36 +0300 Subject: [Gluster-users] snapshots questions Message-ID: Don't invest too much time. With Stratis, I expect better reporting/warning to be available. Yet, that's only an expectation. Best Regards, Strahil NikolovOn Jun 27, 2019 13:02, Dmitry Filonov wrote: > > Thank you, Strahil - > ?I was pointed to --mode=script option that works perfectly for me. > > As for snapshots - am spoiled with ZFS that has much better reporting and tools to work with snapshots. Will do some internal monitoring and checks around snapshots. Was hoping am just missing something. > > Thanks, > > Fil > > > On Thu, Jun 27, 2019, 1:53 AM Strahil wrote: >> >> If it expects a single word like 'y' or 'yes' , then you can try: >> echo 'yes' | gluster snapshot delete? $(/my/script/to/find/oldest/snapshot) >> >> Of course, you should put some logic in order to find the oldest snapshot, bit that won't be hard as date & time of creation should be in the name. >> >> About the situation with the LVM, it is expected that the user takes care of that, as thin LVs can be overcommitted. >> >> For example my arbiter has 20 GB thin LV pool and I have 4 20GB LVs inside that pool. >> As long as I don't exhaust the pool's storage - I'm fine. >> >> You shouldb't expect that LVM will play the monitoring role here - either put some kind of monitoring, or create your own solution to monitor that fact. >> >> Best Regards, >> Strahil Nikolov >> >> On Jun 26, 2019 15:41, Dmitry Filonov wrote: >>> >>> Hi, >>> ?am really new to gluster and have couple question that I hope will be really easy to answer. Just couldn't find anything on that myself. >>> >>> I did set up replica 3 gluster over 3 nodes with 2TB SSD in each node. >>> To have snapshot functionality I have created thin pool of the size of VG (1.82TB) and then 1.75TB thin LVM inside on each of the bricks. >>> It worked just fine until I scheduled creating hourly and daily snapshots on that gluster volume. In less than 2 days my thin volume got full and crashed. >>> Not refused creating new snapshots, but just died as LVM couldn't perform any operations there anymore. >>> So my first question is how to prevent this from happening. I could create smaller thin LVM, but I still have no control how much space I would need for snapshots. I was hoping to see some warnings and errors while creeating snapshots, but not failed LVM/Gluster. >>> >>> The second question is related but not that important. Is there a way to schedule snapshot removal in cron? gluster snapshot delete requires interactive confirmation and I don't see any flag to auto-confirm snapshot removal. >>> >>> Thank you, >>> >>> Fil >>> >>> -- >>> Dmitry Filonov >>> Linux Administrator >>> SBGrid Core |?Harvard Medical School >>> 250 Longwood Ave, SGM-114 >>> Boston, MA 02115 -------------- next part -------------- An HTML attachment was scrubbed... 
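For anyone putting this cleanup in cron, the two pieces above (a delete-the-oldest policy plus non-interactive mode) combine into a short script. A minimal sketch, assuming GNU coreutils and snapshot names that sort chronologically (e.g. an embedded timestamp); the volume name and retention count are placeholders:

#!/bin/bash
# Trim a volume's snapshots down to the newest $KEEP.
VOL=myvol
KEEP=24

# "head -n -N" (GNU) drops the last N lines, leaving only the oldest extras
gluster snapshot list "$VOL" | sort | head -n -"$KEEP" |
while read -r SNAP; do
    # --mode=script answers the interactive y/n confirmation automatically
    gluster --mode=script snapshot delete "$SNAP"
done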
URL: From laurentfdumont at gmail.com Thu Jun 27 13:48:59 2019 From: laurentfdumont at gmail.com (Laurent Dumont) Date: Thu, 27 Jun 2019 09:48:59 -0400 Subject: [Gluster-users] Gluster CLI - No output/no info displayed. In-Reply-To: References: Message-ID: There doesn't seem to be a glusterfs-cli package for Debian. I don't remember installing one. To rule-out anything weird, I've upgraded from 6.0.1 to 6.3.1 from the official packages. I can now use the CLI. I feel like it might have been a mismatch between 6.0.1 (glusterfs-server) and 6.3.1 for the common libraries. coldadmin at gluster01:~$ sudo dpkg -l | grep -i gluster ii glusterfs-client 6.3-1 amd64 clustered file-system (client package) ii glusterfs-common 6.3-1 amd64 GlusterFS common libraries and translator modules ii glusterfs-server 6.3-1 amd64 clustered file-system (server package) coldadmin at gluster01:~$ sudo gluster volume list media proxmox-vol1 On Thu, Jun 27, 2019 at 12:14 AM Sanju Rakonde wrote: > > > On Sun, Jun 23, 2019 at 4:54 AM Laurent Dumont > wrote: > >> Hi, >> >> I am facing a strange issue with the Gluster CLI. No matter what command >> is used, the CLI doesn't output anything. It's a gluster with a single >> node. The volumes themselves are working without any issues. >> >> coldadmin at gluster01:~$ sudo dpkg -l | grep -i gluster >>> ii glusterfs-client 6.3-1 amd64 >>> clustered file-system (client package) >>> ii glusterfs-common 6.3-1 amd64 >>> GlusterFS common libraries and translator modules >>> ii glusterfs-server 6.0-1 amd64 >>> clustered file-system (server package) >> >> > Looks like, you've missed installing glusterfs-cli package. This is why > you are unable to execute any commands using cli. > > Thanks, > Sanju > >> >>> coldadmin at gluster01:~$ sudo gluster --version >>> glusterfs 6.0 >>> Repository revision: git://git.gluster.org/glusterfs.git >>> Copyright (c) 2006-2016 Red Hat, Inc. >>> GlusterFS comes with ABSOLUTELY NO WARRANTY. >>> It is licensed to you under your choice of the GNU Lesser >>> General Public License, version 3 or any later version (LGPLv3 >>> or later), or the GNU General Public License, version 2 (GPLv2), >>> in all cases as published by the Free Software Foundation. 
>> >> >> coldadmin at gluster01:~$ sudo gluster volume info all >>> coldadmin at gluster01:~$ >> >> >> The cli.log is filled with : >> >> root at gluster01:/var/log/glusterfs# tail cli.log >>> [2019-06-22 23:10:31.397945] I [cli.c:834:main] 0-cli: Started running >>> gluster with version 6.0 >>> [2019-06-22 23:10:31.398325] E [mem-pool.c:868:mem_get] >>> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fc937069705] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fc93706963c] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >>> [0x7fc9370a12f8] ) 0-mem-pool: invalid argument [Invalid argument] >>> [2019-06-22 23:13:20.895606] I [cli.c:834:main] 0-cli: Started running >>> gluster with version 6.0 >>> [2019-06-22 23:13:20.895887] E [mem-pool.c:868:mem_get] >>> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f65fc2e0705] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f65fc2e063c] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >>> [0x7f65fc3182f8] ) 0-mem-pool: invalid argument [Invalid argument] >>> [2019-06-22 23:13:22.603313] I [cli.c:834:main] 0-cli: Started running >>> gluster with version 6.0 >>> [2019-06-22 23:13:22.603581] E [mem-pool.c:868:mem_get] >>> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fe7cd672705] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fe7cd67263c] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >>> [0x7fe7cd6aa2f8] ) 0-mem-pool: invalid argument [Invalid argument] >>> [2019-06-22 23:13:47.945239] I [cli.c:834:main] 0-cli: Started running >>> gluster with version 6.0 >>> [2019-06-22 23:13:47.945481] E [mem-pool.c:868:mem_get] >>> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7fd31bb2c705] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7fd31bb2c63c] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >>> [0x7fd31bb642f8] ) 0-mem-pool: invalid argument [Invalid argument] >>> [2019-06-22 23:14:09.015151] I [cli.c:834:main] 0-cli: Started running >>> gluster with version 6.0 >>> [2019-06-22 23:14:09.015461] E [mem-pool.c:868:mem_get] >>> (-->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x18705) [0x7f471963b705] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x1863c) [0x7f471963b63c] >>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(mem_get+0x98) >>> [0x7f47196732f8] ) 0-mem-pool: invalid argument [Invalid argument] >> >> >> The strange part is that cmd_history.log is not logging my commands >> anymore :( >> >> root at gluster01:/var/log/glusterfs# tail cmd_history.log >>> [2019-05-19 21:45:27.814905] : volume start media : FAILED : Volume >>> media already started >>> [2019-05-19 21:45:32.630507] : volume status all : SUCCESS >>> [2019-05-19 21:45:32.632032] : volume status all : SUCCESS >>> [2019-05-19 21:45:32.644639] : volume status all : SUCCESS >>> [2019-05-19 21:46:21.691147] : volume status all : SUCCESS >>> [2019-05-19 21:46:21.692664] : volume status all : SUCCESS >>> [2019-05-19 21:46:21.706471] : volume status all : SUCCESS >>> [2019-05-19 21:48:15.418905] : volume status all : SUCCESS >>> [2019-05-19 21:48:15.420487] : volume status all : SUCCESS >>> [2019-05-19 21:48:15.422784] : volume status all : SUCCESS >> >> >> I have this old bug from 2015, but the issue seems to be purely cosmetic. 
>> >> https://bugzilla.redhat.com/show_bug.cgi?id=1243753 >> >> >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > Thanks, > Sanju > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mailinglists at lucassen.org Thu Jun 27 20:57:40 2019 From: mailinglists at lucassen.org (richard lucassen) Date: Thu, 27 Jun 2019 22:57:40 +0200 Subject: [Gluster-users] very poor performance on Debian Buster In-Reply-To: <2h1nisnfpybdfnbr5sti7d8n.1561630008584@email.android.com> References: <2h1nisnfpybdfnbr5sti7d8n.1561630008584@email.android.com> Message-ID: <20190627225740.b74f7768a595ba861568c065@lucassen.org> On Thu, 27 Jun 2019 13:06:48 +0300 Strahil wrote: > I forgot to ask what kind of storage do you have on your gluster > machines. Is it rotational SATA, SAS or NVMe ? I'm a bit busy now, I'll have a look at all your suggestions when I have some time. Hope I'll have some time during the weekend. BTW, it is all hardware raid1 with 3.4GB SSD's. The server is a beast, a Dell R630 R. -- richard lucassen http://contact.xaq.nl/ From dave at sherohman.org Fri Jun 28 09:03:40 2019 From: dave at sherohman.org (Dave Sherohman) Date: Fri, 28 Jun 2019 04:03:40 -0500 Subject: [Gluster-users] Removing subvolume from dist/rep volume In-Reply-To: References: <20190625091330.GU19805@sherohman.org> Message-ID: <20190628090339.GV19805@sherohman.org> On Thu, Jun 27, 2019 at 12:17:10PM +0530, Nithya Balachandran wrote: > On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote: > > My objective is to remove nodes B and C entirely. > > > > First up is to pull their bricks from the volume: > > > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start > > (wait for data to be migrated) > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit > > > > > There are some edge cases that may prevent a file from being migrated > during a remove-brick. Please do the following after this: > > 1. Check the remove-brick status for any failures. If there are any, > check the rebalance log file for errors. > 2. Even if there are no failures, check the removed bricks to see if any > files have not been migrated. If there are any, please check that they are > valid files on the brick and copy them to the volume from the brick to the > mount point. > > The rest of the steps look good. Apparently, they weren't quite right. I tried it and it just gives me the usage notes in return. Transcript of the commands and output is below. Any insight on how I got the syntax wrong? 
--- cut here --- root at merlin:/# gluster volume status Status of volume: palantir Gluster process TCP Port RDMA Port Online Pid ------------------------------------------------------------------------------ Brick saruman:/var/local/brick0/data 49153 0 Y 17995 Brick gandalf:/var/local/brick0/data 49153 0 Y 9415 Brick merlin:/var/local/arbiter1/data 49170 0 Y 35034 Brick azathoth:/var/local/brick0/data 49153 0 Y 25312 Brick yog-sothoth:/var/local/brick0/data 49152 0 Y 10671 Brick merlin:/var/local/arbiter2/data 49171 0 Y 35043 Brick cthulhu:/var/local/brick0/data 49153 0 Y 21925 Brick mordiggian:/var/local/brick0/data 49152 0 Y 12368 Brick merlin:/var/local/arbiter3/data 49172 0 Y 35050 Self-heal Daemon on localhost N/A N/A Y 1209 Self-heal Daemon on saruman.lub.lu.se N/A N/A Y 23253 Self-heal Daemon on gandalf.lub.lu.se N/A N/A Y 9542 Self-heal Daemon on mordiggian.lub.lu.se N/A N/A Y 11016 Self-heal Daemon on yog-sothoth.lub.lu.se N/A N/A Y 8126 Self-heal Daemon on cthulhu.lub.lu.se N/A N/A Y 30998 Self-heal Daemon on azathoth.lub.lu.se N/A N/A Y 34399 Task Status of Volume palantir ------------------------------------------------------------------------------ Task : Rebalance ID : e58bc091-5809-4364-af83-2b89bc5c7106 Status : completed root at merlin:/# gluster volume remove-brick palantir saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data Usage: volume remove-brick [replica ] ... root at merlin:/# gluster volume remove-brick palantir replica 3 arbiter 1 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data Usage: volume remove-brick [replica ] ... root at merlin:/# gluster volume remove-brick palantir replica 3 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data Usage: volume remove-brick [replica ] ... --- cut here --- -- Dave Sherohman From nbalacha at redhat.com Fri Jun 28 09:56:00 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Fri, 28 Jun 2019 15:26:00 +0530 Subject: [Gluster-users] Removing subvolume from dist/rep volume In-Reply-To: <20190628090339.GV19805@sherohman.org> References: <20190625091330.GU19805@sherohman.org> <20190628090339.GV19805@sherohman.org> Message-ID: On Fri, 28 Jun 2019 at 14:34, Dave Sherohman wrote: > On Thu, Jun 27, 2019 at 12:17:10PM +0530, Nithya Balachandran wrote: > > On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote: > > > My objective is to remove nodes B and C entirely. > > > > > > First up is to pull their bricks from the volume: > > > > > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start > > > (wait for data to be migrated) > > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit > > > > > > > > There are some edge cases that may prevent a file from being migrated > > during a remove-brick. Please do the following after this: > > > > 1. Check the remove-brick status for any failures. If there are any, > > check the rebalance log file for errors. > > 2. Even if there are no failures, check the removed bricks to see if > any > > files have not been migrated. If there are any, please check that > they are > > valid files on the brick and copy them to the volume from the brick > to the > > mount point. > > > > The rest of the steps look good. > > Apparently, they weren't quite right. I tried it and it just gives me > the usage notes in return. Transcript of the commands and output is below. > > Any insight on how I got the syntax wrong? 
> > --- cut here --- > root at merlin:/# gluster volume status > Status of volume: palantir > Gluster process TCP Port RDMA Port Online > Pid > > ------------------------------------------------------------------------------ > Brick saruman:/var/local/brick0/data 49153 0 Y > 17995 > Brick gandalf:/var/local/brick0/data 49153 0 Y > 9415 > Brick merlin:/var/local/arbiter1/data 49170 0 Y > 35034 > Brick azathoth:/var/local/brick0/data 49153 0 Y > 25312 > Brick yog-sothoth:/var/local/brick0/data 49152 0 Y > 10671 > Brick merlin:/var/local/arbiter2/data 49171 0 Y > 35043 > Brick cthulhu:/var/local/brick0/data 49153 0 Y > 21925 > Brick mordiggian:/var/local/brick0/data 49152 0 Y > 12368 > Brick merlin:/var/local/arbiter3/data 49172 0 Y > 35050 > Self-heal Daemon on localhost N/A N/A Y > 1209 > Self-heal Daemon on saruman.lub.lu.se N/A N/A Y > 23253 > Self-heal Daemon on gandalf.lub.lu.se N/A N/A Y > 9542 > Self-heal Daemon on mordiggian.lub.lu.se N/A N/A Y > 11016 > Self-heal Daemon on yog-sothoth.lub.lu.se N/A N/A Y > 8126 > Self-heal Daemon on cthulhu.lub.lu.se N/A N/A Y > 30998 > Self-heal Daemon on azathoth.lub.lu.se N/A N/A Y > 34399 > > Task Status of Volume palantir > > ------------------------------------------------------------------------------ > Task : Rebalance > ID : e58bc091-5809-4364-af83-2b89bc5c7106 > Status : completed > > root at merlin:/# gluster volume remove-brick palantir > saruman:/var/local/brick0/data gandalf:/var/local/brick0/data > merlin:/var/local/arbiter1/data > > You had it right in the first email. gluster volume remove-brick palantir replica 3 arbiter 1 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data *start* Usage: > volume remove-brick [replica ] ... > > > root at merlin:/# gluster volume remove-brick palantir replica 3 arbiter 1 > saruman:/var/local/brick0/data gandalf:/var/local/brick0/data > merlin:/var/local/arbiter1/data > > Usage: > volume remove-brick [replica ] ... > > > root at merlin:/# gluster volume remove-brick palantir replica 3 > saruman:/var/local/brick0/data gandalf:/var/local/brick0/data > merlin:/var/local/arbiter1/data > > Usage: > volume remove-brick [replica ] ... > > --- cut here --- > > -- > Dave Sherohman > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dave at sherohman.org Fri Jun 28 11:06:14 2019 From: dave at sherohman.org (Dave Sherohman) Date: Fri, 28 Jun 2019 06:06:14 -0500 Subject: [Gluster-users] Removing subvolume from dist/rep volume In-Reply-To: <20190628090339.GV19805@sherohman.org> References: <20190625091330.GU19805@sherohman.org> <20190628090339.GV19805@sherohman.org> Message-ID: <20190628110614.GW19805@sherohman.org> OK, I'm just careless. Forgot to include "start" after the list of bricks... On Fri, Jun 28, 2019 at 04:03:40AM -0500, Dave Sherohman wrote: > On Thu, Jun 27, 2019 at 12:17:10PM +0530, Nithya Balachandran wrote: > > On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote: > > > My objective is to remove nodes B and C entirely. 
> > > > > > First up is to pull their bricks from the volume: > > > > > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start > > > (wait for data to be migrated) > > > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit > > > > > > > > There are some edge cases that may prevent a file from being migrated > > during a remove-brick. Please do the following after this: > > > > 1. Check the remove-brick status for any failures. If there are any, > > check the rebalance log file for errors. > > 2. Even if there are no failures, check the removed bricks to see if any > > files have not been migrated. If there are any, please check that they are > > valid files on the brick and copy them to the volume from the brick to the > > mount point. > > > > The rest of the steps look good. > > Apparently, they weren't quite right. I tried it and it just gives me > the usage notes in return. Transcript of the commands and output is below. > > Any insight on how I got the syntax wrong? > > --- cut here --- > root at merlin:/# gluster volume status > Status of volume: palantir > Gluster process TCP Port RDMA Port Online Pid > ------------------------------------------------------------------------------ > Brick saruman:/var/local/brick0/data 49153 0 Y 17995 > Brick gandalf:/var/local/brick0/data 49153 0 Y 9415 > Brick merlin:/var/local/arbiter1/data 49170 0 Y 35034 > Brick azathoth:/var/local/brick0/data 49153 0 Y 25312 > Brick yog-sothoth:/var/local/brick0/data 49152 0 Y 10671 > Brick merlin:/var/local/arbiter2/data 49171 0 Y 35043 > Brick cthulhu:/var/local/brick0/data 49153 0 Y 21925 > Brick mordiggian:/var/local/brick0/data 49152 0 Y 12368 > Brick merlin:/var/local/arbiter3/data 49172 0 Y 35050 > Self-heal Daemon on localhost N/A N/A Y 1209 > Self-heal Daemon on saruman.lub.lu.se N/A N/A Y 23253 > Self-heal Daemon on gandalf.lub.lu.se N/A N/A Y 9542 > Self-heal Daemon on mordiggian.lub.lu.se N/A N/A Y 11016 > Self-heal Daemon on yog-sothoth.lub.lu.se N/A N/A Y 8126 > Self-heal Daemon on cthulhu.lub.lu.se N/A N/A Y 30998 > Self-heal Daemon on azathoth.lub.lu.se N/A N/A Y 34399 > > Task Status of Volume palantir > ------------------------------------------------------------------------------ > Task : Rebalance > ID : e58bc091-5809-4364-af83-2b89bc5c7106 > Status : completed > > root at merlin:/# gluster volume remove-brick palantir saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data > > Usage: > volume remove-brick [replica ] ... > > root at merlin:/# gluster volume remove-brick palantir replica 3 arbiter 1 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data > > Usage: > volume remove-brick [replica ] ... > > root at merlin:/# gluster volume remove-brick palantir replica 3 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data > > Usage: > volume remove-brick [replica ] ... 
> --- cut here ---
>
> --
> Dave Sherohman
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users

--
Dave Sherohman

From hunter86_bg at yahoo.com  Fri Jun 28 14:17:07 2019
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Fri, 28 Jun 2019 14:17:07 +0000 (UTC)
Subject: [Gluster-users] Removing subvolume from dist/rep volume
In-Reply-To: <20190628090339.GV19805@sherohman.org>
References: <20190625091330.GU19805@sherohman.org>
 <20190628090339.GV19805@sherohman.org>
Message-ID: <1252218199.161596.1561731427601@mail.yahoo.com>

I think it should be like:

gluster volume remove-brick myvol A:/data B:/data C:/data start
gluster volume remove-brick myvol A:/data B:/data C:/data commit (force)

Best Regards,
Strahil Nikolov

On Friday, June 28, 2019, 5:03:48 AM GMT-4, Dave Sherohman wrote:

On Thu, Jun 27, 2019 at 12:17:10PM +0530, Nithya Balachandran wrote:
> On Tue, 25 Jun 2019 at 15:26, Dave Sherohman wrote:
> > My objective is to remove nodes B and C entirely.
> >
> > First up is to pull their bricks from the volume:
> >
> > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 start
> > (wait for data to be migrated)
> > # gluster volume remove-brick myvol B:/data C:/data A:/arb1 commit
> >
>
> There are some edge cases that may prevent a file from being migrated
> during a remove-brick. Please do the following after this:
>
>    1. Check the remove-brick status for any failures. If there are any,
>    check the rebalance log file for errors.
>    2. Even if there are no failures, check the removed bricks to see if any
>    files have not been migrated. If there are any, please check that they are
>    valid files on the brick and copy them to the volume from the brick to the
>    mount point.
>
> The rest of the steps look good.

Apparently, they weren't quite right. I tried it and it just gives me
the usage notes in return. Transcript of the commands and output is below.

Any insight on how I got the syntax wrong?

--- cut here ---
root at merlin:/# gluster volume status
Status of volume: palantir
Gluster process                             TCP Port  RDMA Port  Online  Pid
------------------------------------------------------------------------------
Brick saruman:/var/local/brick0/data        49153     0          Y       17995
Brick gandalf:/var/local/brick0/data        49153     0          Y       9415
Brick merlin:/var/local/arbiter1/data       49170     0          Y       35034
Brick azathoth:/var/local/brick0/data       49153     0          Y       25312
Brick yog-sothoth:/var/local/brick0/data    49152     0          Y       10671
Brick merlin:/var/local/arbiter2/data       49171     0          Y       35043
Brick cthulhu:/var/local/brick0/data        49153     0          Y       21925
Brick mordiggian:/var/local/brick0/data     49152     0          Y       12368
Brick merlin:/var/local/arbiter3/data       49172     0          Y       35050
Self-heal Daemon on localhost               N/A       N/A        Y       1209
Self-heal Daemon on saruman.lub.lu.se       N/A       N/A        Y       23253
Self-heal Daemon on gandalf.lub.lu.se       N/A       N/A        Y       9542
Self-heal Daemon on mordiggian.lub.lu.se    N/A       N/A        Y       11016
Self-heal Daemon on yog-sothoth.lub.lu.se   N/A       N/A        Y       8126
Self-heal Daemon on cthulhu.lub.lu.se       N/A       N/A        Y       30998
Self-heal Daemon on azathoth.lub.lu.se      N/A       N/A        Y       34399

Task Status of Volume palantir
------------------------------------------------------------------------------
Task                 : Rebalance
ID                   : e58bc091-5809-4364-af83-2b89bc5c7106
Status               : completed

root at merlin:/# gluster volume remove-brick palantir saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data

Usage:
volume remove-brick <VOLNAME> [replica <COUNT>] <BRICK> ... <start|stop|status|commit|force>

root at merlin:/# gluster volume remove-brick palantir replica 3 arbiter 1 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data

Usage:
volume remove-brick <VOLNAME> [replica <COUNT>] <BRICK> ... <start|stop|status|commit|force>

root at merlin:/# gluster volume remove-brick palantir replica 3 saruman:/var/local/brick0/data gandalf:/var/local/brick0/data merlin:/var/local/arbiter1/data

Usage:
volume remove-brick <VOLNAME> [replica <COUNT>] <BRICK> ... <start|stop|status|commit|force>

--- cut here ---

--
Dave Sherohman
_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dave at sherohman.org  Fri Jun 28 14:24:54 2019
From: dave at sherohman.org (Dave Sherohman)
Date: Fri, 28 Jun 2019 09:24:54 -0500
Subject: [Gluster-users] Removing subvolume from dist/rep volume
In-Reply-To: 
References: <20190625091330.GU19805@sherohman.org>
Message-ID: <20190628142454.GX19805@sherohman.org>

On Thu, Jun 27, 2019 at 12:17:10PM +0530, Nithya Balachandran wrote:
> There are some edge cases that may prevent a file from being migrated
> during a remove-brick. Please do the following after this:
>
>    1. Check the remove-brick status for any failures. If there are any,
>    check the rebalance log file for errors.
>    2. Even if there are no failures, check the removed bricks to see if any
>    files have not been migrated. If there are any, please check that they are
>    valid files on the brick and copy them to the volume from the brick to the
>    mount point.

Well, looks like I hit one of those edge cases. Probably because of some
issues around a reboot last September which left a handful of files in a
state where self-heal identified them as needing to be healed, but
incapable of actually healing them. (Check the list archives for
"Kicking a stuck heal", posted on Sept 4, if you want more details.)

So I'm getting 9 failures on the arbiter (merlin), 8 on one data brick
(gandalf), and 3 on the other (saruman). Looking in
/var/log/gluster/palantir-rebalance.log, I see those numbers of

migrate file failed: /.shard/291e9749-2d1b-47af-ad53-3a09ad4e64c6.229:
failed to lock file on palantir-replicate-1 [Stale file handle]

errors. Also, merlin has four errors, and gandalf has one, of the form:

Gfid mismatch detected for /0f500288-ff62-4f0b-9574-53f510b4159f.2898>,
9f00c0fe-58c3-457e-a2e6-f6a006d1cfc6 on palantir-client-7 and
08bb7cdc-172b-4c21-916a-2a244c095a3e on palantir-client-1.

There are no gfid mismatches recorded on saruman. All of the gfid
mismatches are for and (on saruman) appear to correspond to 0-byte files
(e.g., .shard/0f500288-ff62-4f0b-9574-53f510b4159f.2898, in the case of
the gfid mismatch quoted above).

For both types of errors, all affected files are in .shard/ and have
UUID-style names, so I have no idea which actual files they belong to.
File sizes are generally either 0 bytes or 4M (exactly), although one of
them has a size slightly larger than 3M.
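On the question of which actual files the shards belong to: a shard's name is the GFID of its base file plus a block index, and for regular files .glusterfs/<aa>/<bb>/<gfid> on a brick is a hardlink to that base file. A minimal sketch, assuming GNU find and the brick paths used above (worth running on each data brick, since the base file may live on a different subvolume than its shards):

# Hypothetical example: map shards named 0f500288-...-53f510b4159f.NNN
# back to their base file via the shared inode of the .glusterfs hardlink
BRICK=/var/local/brick0/data
GFID=0f500288-ff62-4f0b-9574-53f510b4159f

find "$BRICK" -path "$BRICK/.glusterfs" -prune -o \
    -samefile "$BRICK/.glusterfs/${GFID:0:2}/${GFID:2:2}/$GFID" -print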
So I'm assuming they're chunks of larger files (which would be almost all the files on the volume - it's primarily holding disk image files for kvm servers). Web searches generally seem to consider gfid mismatches to be a form of split-brain, but `gluster volume heal palantir info split-brain` shows "Number of entries in split-brain: 0" for all bricks, including those bricks which are reporting gfid mismatches. Given all that, how do I proceed with cleaning up the stale handle issues? I would guess that this will involve somehow converting the shard filename to a "real" filename, then shutting down the corresponding VM and maybe doing some additional cleanup. And then there's the gfid mismatches. Since they're for 0-byte files, is it safe to just ignore them on the assumption that they only hold metadata? Or do I need to do some kind of split-brain resolution on them (even though gluster says no files are in split-brain)? Finally, a listing of /var/local/brick0/data/.shard on saruman, in case any of the information it contains (like file sizes/permissions) might provide clues to resolving the errors: --- cut here --- root at saruman:/var/local/brick0/data/.shard# ls -l total 63996 -rw-rw---- 2 root libvirt-qemu 0 Sep 17 2018 0f500288-ff62-4f0b-9574-53f510b4159f.2864 -rw-rw---- 2 root libvirt-qemu 0 Sep 17 2018 0f500288-ff62-4f0b-9574-53f510b4159f.2868 -rw-rw---- 2 root libvirt-qemu 0 Sep 17 2018 0f500288-ff62-4f0b-9574-53f510b4159f.2879 -rw-rw---- 2 root libvirt-qemu 0 Sep 17 2018 0f500288-ff62-4f0b-9574-53f510b4159f.2898 -rw------- 2 root libvirt-qemu 4194304 May 17 14:42 291e9749-2d1b-47af-ad53-3a09ad4e64c6.229 -rw------- 2 root libvirt-qemu 4194304 Jun 24 09:10 291e9749-2d1b-47af-ad53-3a09ad4e64c6.925 -rw-rw-r-- 2 root libvirt-qemu 4194304 Jun 26 12:54 2df12cb0-6cf4-44ae-8b0a-4a554791187e.266 -rw-rw-r-- 2 root libvirt-qemu 4194304 Jun 26 16:30 2df12cb0-6cf4-44ae-8b0a-4a554791187e.820 -rw-r--r-- 2 root libvirt-qemu 4194304 Jun 17 20:22 323186b1-6296-4cbe-8275-b940cc9d65cf.27466 -rw-r--r-- 2 root libvirt-qemu 4194304 Jun 27 05:01 323186b1-6296-4cbe-8275-b940cc9d65cf.32575 -rw-r--r-- 2 root libvirt-qemu 3145728 Jun 11 13:23 323186b1-6296-4cbe-8275-b940cc9d65cf.3448 ---------T 2 root libvirt-qemu 0 Jun 28 14:26 4cd094f4-0344-4660-98b0-83249d5bd659.22998 -rw------- 2 root libvirt-qemu 4194304 Mar 13 2018 6cdd2e5c-f49e-492b-8039-239e71577836.1302 ---------T 2 root libvirt-qemu 0 Jun 28 13:22 7530a2d1-d6ec-4a04-95a2-da1f337ac1ad.47131 ---------T 2 root libvirt-qemu 0 Jun 28 13:22 7530a2d1-d6ec-4a04-95a2-da1f337ac1ad.52615 -rw-rw-r-- 2 root libvirt-qemu 4194304 Jun 27 08:56 8fefae99-ed2a-4a8f-ab87-aa94c6bb6e68.100 -rw-rw-r-- 2 root libvirt-qemu 4194304 Jun 27 11:29 8fefae99-ed2a-4a8f-ab87-aa94c6bb6e68.106 -rw-rw-r-- 2 root libvirt-qemu 4194304 Jun 28 02:35 8fefae99-ed2a-4a8f-ab87-aa94c6bb6e68.137 -rw-rw-r-- 2 root libvirt-qemu 4194304 Nov 4 2018 9544617c-901c-4613-a94b-ccfad4e38af1.165 -rw-rw-r-- 2 root libvirt-qemu 4194304 Nov 4 2018 9544617c-901c-4613-a94b-ccfad4e38af1.168 -rw-rw-r-- 2 root libvirt-qemu 4194304 Nov 5 2018 9544617c-901c-4613-a94b-ccfad4e38af1.193 -rw-rw-r-- 2 root libvirt-qemu 4194304 Nov 6 2018 9544617c-901c-4613-a94b-ccfad4e38af1.3800 ---------T 2 root libvirt-qemu 0 Jun 28 15:02 b48a5934-5e5b-4918-8193-6ff36f685f70.46559 -rw-rw---- 2 root libvirt-qemu 0 Oct 12 2018 c5bde2f2-3361-4d1a-9c88-28751ef74ce6.3568 -rw-r--r-- 2 root libvirt-qemu 4194304 Apr 13 2018 c953c676-152d-4826-80ff-bd307fa7f6e5.10724 -rw-r--r-- 2 root libvirt-qemu 4194304 Apr 11 2018 
c953c676-152d-4826-80ff-bd307fa7f6e5.3101
--- cut here ---

--
Dave Sherohman

From lists at localguru.de  Fri Jun 28 14:43:36 2019
From: lists at localguru.de (Marcus Schopen)
Date: Fri, 28 Jun 2019 16:43:36 +0200
Subject: [Gluster-users] gluster and qcow2 images
Message-ID: <8a1718af6a0c270c55b34955a380ef58962a6ae9.camel@localguru.de>

Hi,

does anyone have experience with gluster in KVM environments? I would
like to hold qcow2 images of a KVM host with a second KVM host in sync.
Unfortunately, shared storage is not available to me, only the two KVM
hosts. In principle, it would be sufficient for me - in case of a
failure of the first KVM host - to start the guests on the second host
by hand without restoring the images from the nightly backup first. The
question is, is glusterfs a sensible solution here or should one better
use other approaches e.g. DRBD. I have read contradictory statements
about this, many advise against using gluster for qcow2 images, some
report no problems at all.

Cheers
Marcus

From hunter86_bg at yahoo.com  Fri Jun 28 15:06:48 2019
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Fri, 28 Jun 2019 15:06:48 +0000 (UTC)
Subject: [Gluster-users] gluster and qcow2 images
In-Reply-To: <8a1718af6a0c270c55b34955a380ef58962a6ae9.camel@localguru.de>
References: <8a1718af6a0c270c55b34955a380ef58962a6ae9.camel@localguru.de>
Message-ID: <336988738.187029.1561734408544@mail.yahoo.com>

Hi Marcus,

this is one of the popular approaches in oVirt. Currently I'm running
oVirt (KVM with management layer) over a shared storage - gluster v6.3.
It's doable and if properly configured - you can rely on it without any
issues.

Have you considered oVirt as a solution instead of pure KVM?

Best Regards,
Strahil Nikolov

On Friday, June 28, 2019, 10:54:49 AM GMT-4, Marcus Schopen wrote:

Hi,

does anyone have experience with gluster in KVM environments? I would
like to hold qcow2 images of a KVM host with a second KVM host in sync.
Unfortunately, shared storage is not available to me, only the two KVM
hosts. In principle, it would be sufficient for me - in case of a
failure of the first KVM host - to start the guests on the second host
by hand without restoring the images from the nightly backup first. The
question is, is glusterfs a sensible solution here or should one better
use other approaches e.g. DRBD. I have read contradictory statements
about this, many advise against using gluster for qcow2 images, some
report no problems at all.

Cheers
Marcus

_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From matthewb at uvic.ca  Fri Jun 28 18:19:15 2019
From: matthewb at uvic.ca (Matthew Benstead)
Date: Fri, 28 Jun 2019 11:19:15 -0700
Subject: [Gluster-users] Geo-Replication Changelog Error - is a directory
Message-ID: <6ffbd4a4-0755-4a4b-9150-8726007bf382@uvic.ca>

Hello,

I'm having some issues with successfully establishing a geo-replication
session between a 7-server distribute cluster as the primary volume, and
a 2-server distribute cluster as the secondary volume. Both are running
the same version of gluster on CentOS 7: glusterfs-5.3-2.el7.x86_64.

I was able to set up the replication keys, user, groups, etc and
establish the session, but it goes faulty quickly after starting.
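When a session cycles into Faulty like this, the per-session status and log are the usual places to start. A sketch, assuming the default log layout (the session directory name follows the <master>_<slavehost>_<slavevol> pattern, so the exact path may differ):

gluster volume geo-replication storage 10.0.231.81::pcic-backup status detail
tail -f /var/log/glusterfs/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.log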
The error from the gsyncd.log is: Changelog register failed error=[Errno 21] Is a directory We made an attempt about 2 years ago to configure geo-replication but abandoned it, now with a new cluster I wanted to get it setup, but it looks like changelogs have been accumulating since then: [root at gluster07 .glusterfs]# ls -lh changelogs > /var/tmp/changelogs.txt [root at gluster07 ~]# head /var/tmp/changelogs.txt total 11G -rw-r--r--. 1 root root 130 Jun 27 13:48 CHANGELOG -rw-r--r--. 1 root root 2.6K Jun 19 2017 CHANGELOG.1497891971 -rw-r--r--. 1 root root 470 Jun 19 2017 CHANGELOG.1497892055 -rw-r--r--. 1 root root 186 Jun 19 2017 CHANGELOG.1497892195 -rw-r--r--. 1 root root 458 Jun 19 2017 CHANGELOG.1497892308 -rw-r--r--. 1 root root 188 Jun 19 2017 CHANGELOG.1497892491 -rw-r--r--. 1 root root 862 Jun 19 2017 CHANGELOG.1497892828 -rw-r--r--. 1 root root 11K Jun 19 2017 CHANGELOG.1497892927 -rw-r--r--. 1 root root 4.4K Jun 19 2017 CHANGELOG.1497892941 [root at gluster07 ~]# tail /var/tmp/changelogs.txt -rw-r--r--. 1 root root 130 Jun 27 13:47 CHANGELOG.1561668463 -rw-r--r--. 1 root root 130 Jun 27 13:47 CHANGELOG.1561668477 -rw-r--r--. 1 root root 130 Jun 27 13:48 CHANGELOG.1561668491 -rw-r--r--. 1 root root 130 Jun 27 13:48 CHANGELOG.1561668506 -rw-r--r--. 1 root root 130 Jun 27 13:48 CHANGELOG.1561668521 -rw-r--r--. 1 root root 130 Jun 27 13:48 CHANGELOG.1561668536 -rw-r--r--. 1 root root 130 Jun 27 13:49 CHANGELOG.1561668550 -rw-r--r--. 1 root root 130 Jun 27 13:49 CHANGELOG.1561668565 drw-------. 2 root root 10 Jun 19 2017 csnap drw-------. 2 root root 37 Jun 19 2017 htime Could this be related? When deleting the replication session I made sure to try the 'delete reset-sync-time' option, but it failed with: gsyncd failed to delete session info for storage and 10.0.231.81::pcic-backup peers geo-replication command failed Here is the volume info: [root at gluster07 ~]# gluster volume info storage Volume Name: storage Type: Distribute Volume ID: 6f95525a-94d7-4174-bac4-e1a18fe010a2 Status: Started Snapshot Count: 0 Number of Bricks: 7 Transport-type: tcp Bricks: Brick1: 10.0.231.50:/mnt/raid6-storage/storage Brick2: 10.0.231.51:/mnt/raid6-storage/storage Brick3: 10.0.231.52:/mnt/raid6-storage/storage Brick4: 10.0.231.53:/mnt/raid6-storage/storage Brick5: 10.0.231.54:/mnt/raid6-storage/storage Brick6: 10.0.231.55:/mnt/raid6-storage/storage Brick7: 10.0.231.56:/mnt/raid6-storage/storage Options Reconfigured: features.quota-deem-statfs: on features.read-only: off features.inode-quota: on features.quota: on performance.readdir-ahead: on nfs.disable: on geo-replication.indexing: on geo-replication.ignore-pid-check: on changelog.changelog: on transport.address-family: inet Any ideas? Thanks, -Matthew From rightkicktech at gmail.com Sat Jun 29 04:05:24 2019 From: rightkicktech at gmail.com (Alex K) Date: Sat, 29 Jun 2019 07:05:24 +0300 Subject: [Gluster-users] gluster and qcow2 images In-Reply-To: <8a1718af6a0c270c55b34955a380ef58962a6ae9.camel@localguru.de> References: <8a1718af6a0c270c55b34955a380ef58962a6ae9.camel@localguru.de> Message-ID: Hi On Fri, Jun 28, 2019, 17:54 Marcus Schopen wrote: > Hi, > > does anyone have experience with gluster in KVM environments? I would > like to hold qcow2 images of a KVM host with a second KVM host in sync. > Unfortunately, shared storage is not available to me, only the > two KVM hosts. 
In principle, it would be sufficient for me - in case of > a failure of the first KVM host - to start the guests on the second > host by hand without restoring the images from the nightly backup > first. The question is, is glusterfs a sensible solution here or > should one better use other approaches e.g. DRBD. I have read > contradictory statements about this, many advise against using gluster > for qcow2 images, some report no problems at all. > Redhat uses gluster in its RHEV solution. Ovirt is the open source one. Thus gluster can be used with good results. You will need a 10G network for the gluster storage for higher performance and enable sharding on the shared volume. Two node setups are prone to split brain issues which may cause headaches. I am running such setups for years and encountered few splits which i was able to recover from. You need some fencing solution inplace to minimize such issues. I would expect higher performance from DRBD, though I am not aware of any GUI solution that simplifies its management. > > Cheers > Marcus > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From lists at localguru.de Sat Jun 29 11:26:29 2019 From: lists at localguru.de (Marcus Schopen) Date: Sat, 29 Jun 2019 13:26:29 +0200 Subject: [Gluster-users] keep qcow2 images in sync Message-ID: <9ba4a4704029292f6f0725886ea09823343f2d13.camel@localguru.de> Hi, i am looking for a solution to keep qcow2 images of two KVM hosts sync. (about 20 guests, images size max 100 GB). Unfortunately, shared storage is not available to me. My first look was on Gluster, but after what I've read so far, there might be performance issues here. I don't need real HA. If the first host fails, it would be sufficient for me to manually start the KVM guests on the second host. Ultimately, I just want to avoid having to restore the images from the nightly backup and have a maximum "data loss" of 24 hours. Is drdb a alternative here? And what should I pay particular attention to when setting up, split brain etc.? Cheers Marcus From guy.boisvert at ingtegration.com Sat Jun 29 15:17:01 2019 From: guy.boisvert at ingtegration.com (Guy Boisvert) Date: Sat, 29 Jun 2019 11:17:01 -0400 Subject: [Gluster-users] keep qcow2 images in sync In-Reply-To: <9ba4a4704029292f6f0725886ea09823343f2d13.camel@localguru.de> References: <9ba4a4704029292f6f0725886ea09823343f2d13.camel@localguru.de> Message-ID: <2aaced84-4464-8338-d80f-710023868e5f@ingtegration.com> On 2019-06-29 7:26 a.m., Marcus Schopen wrote: > Hi, > > i am looking for a solution to keep qcow2 images of two KVM hosts sync. > (about 20 guests, images size max 100 GB). Unfortunately, shared > storage is not available to me. My first look was on Gluster, but after > what I've read so far, there might be performance issues here. I don't > need real HA. If the first host fails, it would be sufficient for me to > manually start the KVM guests on the second host. Ultimately, I just > want to avoid having to restore the images from the nightly backup and > have a maximum "data loss" of 24 hours. Is drdb a alternative here? And > what should I pay particular attention to when setting up, split brain > etc.? > > Cheers > Marcus Hi Marcus, ??? I use GlusterFS for VM image storage.? Most of them are RAW files and QCOW2 is fine too.? There are things to watch for performance.? 
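Sharding (as Alex notes) plus a few direct-I/O-friendly options are the usual starting point for VM-image volumes. A minimal sketch against an assumed volume name of vmstore; these are common starting values, not measured tuning:

gluster volume set vmstore features.shard on
gluster volume set vmstore features.shard-block-size 64MB
gluster volume set vmstore network.remote-dio enable
gluster volume set vmstore performance.read-ahead off

# Where a "virt" group file ships under /var/lib/glusterd/groups, a
# similar VM-oriented option set can be applied in one step:
gluster volume set vmstore group virt

Note that sharding only affects files created after it is turned on, so it is best set before any images are copied in.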
In a setup we have for a client, the Windows File server (8TB + 5TB) is now screaming compared to the old setup: 120 MBps!? And the overtaxed mail server (CentOS 7 Linux VM running CommuniGate) on the same cluster is doing fine too (low IOwait considering the 1TB+ Public Folders, 140 Outlook users and a total of 2.5 TB of emails). ??? This setup is configured in three way replication.? The 3 servers are Supermicro using LSI RAID 6 arrays.? GlusterFS bricks are ontop thin provisioned LVM logical volumes.? Watch out for lvm / filesystem optimal alignment VS your physical RAID arrays. ??? Just keep in mind that with 2 servers, you are exposed to split brain issues.? The other way of doing it in the above setup would be 2+1 replication with an arbiter: It would saves space and still enforce quorum.? But many things i read are pointing to 3 way replication for simplicity and stability. Here are the infos about this Gluster cluster: 3 x Supermicro 3U servers with each: 1 x Xeon E5-2609 CPU 32 Gig RAM Integrated LSI SAS 2208 32 TB Array consisting of 8 x HGST NAS 7200 RPMs drives in RAID 6 2 x Integrated Intel X540-AT2 10 Gbps ethernet We are considering of adding SSD caching, still looking for the best way of doing it... (LVM Cache? / Gluster Tiering / etc) There are 2 KVM Servers running VMs with dual E5-2630v2 CPU, 96 Gigs RAM and dual 10 Gbps ether The next part will be to add OVirt. Be sure to simulate what you want to achieve before going in prod.? Test disconnection of a node, etc.? I like Gluster but there are situations that you must know what you're doing.? I paid a premium for the Red Hat Gluster Video self training and it is not a good course IMHO (more of an intro...), i didn't learn what was of interest to me (highly technical) and i taught i could ask questions: They direct you to tech support and you must have a contract.... They seem to want to direct you to their "design" team. I didn't find any "best practices", not real good guidelines and explanations... A real important thing to consider: The network part.? Look for bonding + dual attach switch and for ECM Routing + OSPF on the servers / network.? I'm still searching for the best network setup to get performance and full redundancy. Hope this helps. Guy -- Guy Boisvert, ing. IngTegration inc. http://www.ingtegration.com https://www.linkedin.com/pub/guy-boisvert/7/48/899/fr AVIS DE CONFIDENTIALITE : ce message peut contenir des renseignements confidentiels appartenant exclusivement a IngTegration Inc. ou a ses filiales. Si vous n'etes pas le destinataire indique ou prevu dans ce message (ou responsable de livrer ce message a la personne indiquee ou prevue) ou si vous pensez que ce message vous a ete adresse par erreur, vous ne pouvez pas utiliser ou reproduire ce message, ni le livrer a quelqu'un d'autre. Dans ce cas, vous devez le detruire et vous etes prie d'avertir l'expediteur en repondant au courriel. CONFIDENTIALITY NOTICE : Proprietary/Confidential Information belonging to IngTegration Inc. and its affiliates may be contained in this message. If you are not a recipient indicated or intended in this message (or responsible for delivery of this message to such person), or you think for any reason that this message may have been addressed to you in error, you may not use or copy or deliver this message to anyone else. In such case, you should destroy this message and are asked to notify the sender by reply email. 
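As a concrete form of the 2+1 arbiter layout Guy mentions: the arbiter brick stores only file names and metadata, so it stays small while still providing quorum. A minimal sketch with placeholder host names and brick paths:

gluster volume create vmstore replica 3 arbiter 1 \
    server1:/bricks/vmstore/brick \
    server2:/bricks/vmstore/brick \
    arbiter1:/bricks/arb/brick
gluster volume start vmstore

Placing the arbiter on a third host avoids the two-node split-brain exposure described earlier in the thread.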
From mailinglists at lucassen.org Sun Jun 30 11:59:51 2019 From: mailinglists at lucassen.org (richard lucassen) Date: Sun, 30 Jun 2019 13:59:51 +0200 Subject: [Gluster-users] very poor performance on Debian Buster In-Reply-To: <20190627225740.b74f7768a595ba861568c065@lucassen.org> References: <2h1nisnfpybdfnbr5sti7d8n.1561630008584@email.android.com> <20190627225740.b74f7768a595ba861568c065@lucassen.org> Message-ID: <20190630135951.66b39be769d0e4a2849bec1a@lucassen.org> On Thu, 27 Jun 2019 22:57:40 +0200 richard lucassen wrote: > On Thu, 27 Jun 2019 13:06:48 +0300 > Strahil wrote: > > > I forgot to ask what kind of storage do you have on your gluster > > machines. Is it rotational SATA, SAS or NVMe ? > > I'm a bit busy now, I'll have a look at all your suggestions when I > have some time. Hope I'll have some time during the weekend. > > BTW, it is all hardware raid1 with 3.4GB SSD's. The server is a > beast, a Dell R630 I'll come back to the issue, I'm too busy at the moment. R. -- richard lucassen http://contact.xaq.nl/ From sdeepugd at gmail.com Mon Jun 3 21:51:48 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Mon, 03 Jun 2019 21:51:48 -0000 Subject: [Gluster-users] Glusterd2 for production Message-ID: Hi Users Is it safe to use glusterd2 for production? -------------- next part -------------- An HTML attachment was scrubbed... URL: From sdeepugd at gmail.com Tue Jun 4 11:40:28 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Tue, 04 Jun 2019 11:40:28 -0000 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Hi As discussed I have upgraded gluster from 4.1 to 6.2 version. But the Geo replication failed to start. Stays in faulty state On Fri, May 31, 2019, 5:32 PM deepu srinivasan wrote: > Checked the data. It remains in 2708. No progress. > > On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> That means it could be working and the defunct process might be some old >> zombie one. Could you check, that data progress ? >> >> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan >> wrote: >> >>> Hi >>> When i change the rsync option the rsync process doesnt seem to start . >>> Only a defunt process is listed in ps aux. Only when i set rsync option to >>> " " and restart all the process the rsync process is listed in ps aux. >>> >>> >>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar < >>> khiremat at redhat.com> wrote: >>> >>>> Yes, rsync config option should have fixed this issue. >>>> >>>> Could you share the output of the following? >>>> >>>> 1. gluster volume geo-replication :: >>>> config rsync-options >>>> 2. ps -ef | grep rsync >>>> >>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan >>>> wrote: >>>> >>>>> Done. >>>>> We got the following result . >>>>> >>>>>> 1559298781.338234 write(2, "rsync: link_stat >>>>>> \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" >>>>>> failed: No such file or directory (2)", 128 >>>>> >>>>> seems like a file is missing ? >>>>> >>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath Ravishankar < >>>>> khiremat at redhat.com> wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> Could you take the strace with with more string size? The argument >>>>>> strings are truncated. >>>>>> >>>>>> strace -s 500 -ttt -T -p >>>>>> >>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan >>>>>> wrote: >>>>>> >>>>>>> Hi Kotresh >>>>>>> The above-mentioned work around did not work properly. 
>>>>>>> >>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan >>>>>>> wrote: >>>>>>> >>>>>>>> Hi Kotresh >>>>>>>> We have tried the above-mentioned rsync option and we are planning >>>>>>>> to have the version upgrade to 6.0. >>>>>>>> >>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh Hiremath Ravishankar < >>>>>>>> khiremat at redhat.com> wrote: >>>>>>>> >>>>>>>>> Hi, >>>>>>>>> >>>>>>>>> This looks like the hang because stderr buffer filled up with >>>>>>>>> errors messages and no one reading it. >>>>>>>>> I think this issue is fixed in latest releases. As a workaround, >>>>>>>>> you can do following and check if it works. >>>>>>>>> >>>>>>>>> Prerequisite: >>>>>>>>> rsync version should be > 3.1.0 >>>>>>>>> >>>>>>>>> Workaround: >>>>>>>>> gluster volume geo-replication :: >>>>>>>>> config rsync-options "--ignore-missing-args" >>>>>>>>> >>>>>>>>> Thanks, >>>>>>>>> Kotresh HR >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu srinivasan < >>>>>>>>> sdeepugd at gmail.com> wrote: >>>>>>>>> >>>>>>>>>> Hi >>>>>>>>>> We were evaluating Gluster geo Replication between two DCs one is >>>>>>>>>> in US west and one is in US east. We took multiple trials for different >>>>>>>>>> file size. >>>>>>>>>> The Geo Replication tends to stop replicating but while checking >>>>>>>>>> the status it appears to be in Active state. But the slave volume did not >>>>>>>>>> increase in size. >>>>>>>>>> So we have restarted the geo-replication session and checked the >>>>>>>>>> status. The status was in an active state and it was in History Crawl for a >>>>>>>>>> long time. We have enabled the DEBUG mode in logging and checked for any >>>>>>>>>> error. >>>>>>>>>> There was around 2000 file appeared for syncing candidate. The >>>>>>>>>> Rsync process starts but the rsync did not happen in the slave volume. >>>>>>>>>> Every time the rsync process appears in the "ps auxxx" list but the >>>>>>>>>> replication did not happen in the slave end. What would be the cause of >>>>>>>>>> this problem? Is there anyway to debug it? >>>>>>>>>> >>>>>>>>>> We have also checked the strace of the rync program. >>>>>>>>>> it displays something like this >>>>>>>>>> >>>>>>>>>> "write(2, "rsync: link_stat \"/tmp/gsyncd-au"..., 128" >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> We are using the below specs >>>>>>>>>> >>>>>>>>>> Gluster version - 4.1.7 >>>>>>>>>> Sync mode - rsync >>>>>>>>>> Volume - 1x3 in each end (master and slave) >>>>>>>>>> Intranet Bandwidth - 10 Gig >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> Thanks and Regards, >>>>>>>>> Kotresh H R >>>>>>>>> >>>>>>>> >>>>>> >>>>>> -- >>>>>> Thanks and Regards, >>>>>> Kotresh H R >>>>>> >>>>> >>>> >>>> -- >>>> Thanks and Regards, >>>> Kotresh H R >>>> >>> >> >> -- >> Thanks and Regards, >> Kotresh H R >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sdeepugd at gmail.com Tue Jun 4 11:54:03 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Tue, 04 Jun 2019 11:54:03 -0000 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Hi Kortesh Please find the logs of the above error *Master log snippet* > [2019-06-04 11:52:09.254731] I [resource(worker > /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing > SSH connection between master and slave... > [2019-06-04 11:52:09.308923] D [repce(worker > /home/sas/gluster/data/code-misc):196:push] RepceClient: call > 89724:139652759443264:1559649129.31 __repce_version__() ... 
From sdeepugd at gmail.com  Tue Jun 4 11:54:03 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Tue, 04 Jun 2019 11:54:03 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Hi Kotresh,
Please find the logs for the above error.

*Master log snippet*

> [2019-06-04 11:52:09.254731] I [resource(worker
> /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing
> SSH connection between master and slave...
> [2019-06-04 11:52:09.308923] D [repce(worker
> /home/sas/gluster/data/code-misc):196:push] RepceClient: call
> 89724:139652759443264:1559649129.31 __repce_version__() ...
> [2019-06-04 11:52:09.602792] E [syncdutils(worker
> /home/sas/gluster/data/code-misc):311:log_raise_exception] : connection
> to peer is broken
> [2019-06-04 11:52:09.603312] E [syncdutils(worker
> /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned
> error cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i
> /var/lib/glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto
> -S /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock
> sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc
> sas at 192.168.185.107::code-misc --master-node 192.168.185.106
> --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick
> /home/sas/gluster/data/code-misc --local-node 192.168.185.122
> --local-node-id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120
> --slave-log-level DEBUG --slave-gluster-log-level INFO
> --slave-gluster-command-dir /usr/sbin error=1
> [2019-06-04 11:52:09.614996] I [repce(agent
> /home/sas/gluster/data/code-misc):97:service_loop] RepceServer:
> terminating on reaching EOF.
> [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor:
> worker(/home/sas/gluster/data/code-misc) connected
> [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor:
> worker died in startup phase brick=/home/sas/gluster/data/code-misc
> [2019-06-04 11:52:09.619391] I
> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker
> Status Change status=Faulty

*Slave log snippet*

> [2019-06-04 11:50:09.782668] E [syncdutils(slave
> 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen:
> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory)
> [2019-06-04 11:50:11.188167] W [gsyncd(slave
> 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : Session
> config file not exists, using the default config
> path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf
> [2019-06-04 11:50:11.201070] I [resource(slave
> 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] GLUSTER:
> Mounting gluster volume locally...
> [2019-06-04 11:50:11.271231] E [resource(slave
> 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter]
> MountbrokerMounter: glusterd answered mnt=
> [2019-06-04 11:50:11.271998] E [syncdutils(slave
> 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen:
> command returned error cmd=/usr/sbin/gluster --remote-host=localhost
> system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO
> log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log
> volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1
> [2019-06-04 11:50:11.272113] E [syncdutils(slave
> 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen:
> /usr/sbin/gluster> 2 : failed with this errno (No such file or directory)

On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan wrote:

> Hi,
> As discussed, I have upgraded Gluster from version 4.1 to 6.2, but
> geo-replication fails to start and stays in the Faulty state.
> [...]
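The slave log above fails to run /usr/sbin/gluster, while the worker is
started with --slave-gluster-command-dir /usr/sbin. A quick sanity check,
sketched under the assumption that the session options match the logged
invocation (volume and host names from this thread):

    # on the slave node: confirm the gluster CLI really lives in /usr/sbin
    which gluster
    ls -l /usr/sbin/gluster

    # on the master: inspect the command dir the session will use
    gluster volume geo-replication code-misc sas@192.168.185.107::code-misc \
        config slave-gluster-command-dir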
From sdeepugd at gmail.com  Tue Jun 4 12:16:20 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Tue, 04 Jun 2019 12:16:20 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

I have already added the path in .bashrc. Still in the Faulty state.

On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar <
khiremat at redhat.com> wrote:

> Could you please try adding /usr/sbin to $PATH for user 'sas'? If it's
> bash, add 'export PATH=/usr/sbin:$PATH' in /home/sas/.bashrc
> [...]
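One possible reason the .bashrc change shows no effect: on many
distributions ~/.bashrc begins with a guard that returns early for
non-interactive shells, and the geo-replication worker reaches the slave
over non-interactive SSH, so an export placed below the guard is never
read. A quick check from the master (user and host as used in this
thread):

    # PATH as seen by a non-interactive session on the slave
    ssh sas@192.168.185.107 'echo $PATH; which gluster'

If gluster is not found there, move the export above the guard or into a
file the non-interactive shell does read.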
From sdeepugd at gmail.com  Tue Jun 4 19:36:14 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Tue, 04 Jun 2019 19:36:14 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Thank you, Kotresh.

On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath Ravishankar <
khiremat at redhat.com> wrote:

> Ccing Sunny, who was investigating a similar issue.
> [...]
From sdeepugd at gmail.com  Wed Jun 5 08:58:28 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Wed, 05 Jun 2019 08:58:28 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Hi Kotresh, Sunny,
Found this log on the slave machine:

> [2019-06-05 08:49:10.632583] I [MSGID: 106488]
> [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management:
> Received get vol req
> The message "I [MSGID: 106488]
> [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management:
> Received get vol req" repeated 2 times between [2019-06-05
> 08:49:10.632583] and [2019-06-05 08:49:10.670863]
> The message "I [MSGID: 106496]
> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received
> mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and
> [2019-06-05 08:50:37.254063]
> The message "E [MSGID: 106061]
> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option
> mountbroker-root' missing in glusterd vol file" repeated 34 times
> between [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079]
> The message "W [MSGID: 106176]
> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management:
> unsuccessful mount request [No such file or directory]" repeated 34
> times between [2019-06-05 08:48:41.005444] and [2019-06-05
> 08:50:37.254080]
> [2019-06-05 08:50:46.361347] I [MSGID: 106496]
> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received
> mount req
> [2019-06-05 08:50:46.361384] E [MSGID: 106061]
> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option
> mountbroker-root' missing in glusterd vol file
> [2019-06-05 08:50:46.361419] W [MSGID: 106176]
> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management:
> unsuccessful mount request [No such file or directory]
> The message "I [MSGID: 106496]
> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received
> mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and
> [2019-06-05 08:52:34.019741]
> The message "E [MSGID: 106061]
> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option
> mountbroker-root' missing in glusterd vol file" repeated 33 times
> between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
> The message "W [MSGID: 106176]
> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management:
> unsuccessful mount request [No such file or directory]" repeated 33
> times between [2019-06-05 08:50:46.361419] and [2019-06-05
> 08:52:34.019758]
> [2019-06-05 08:52:44.426839] I [MSGID: 106496]
> [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received
> mount req
> [2019-06-05 08:52:44.426886] E [MSGID: 106061]
> [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option
> mountbroker-root' missing in glusterd vol file
> [2019-06-05 08:52:44.426896] W [MSGID: 106176]
> [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management:
> unsuccessful mount request [No such file or directory]

On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan wrote:

> Thank you, Kotresh.
> [...]
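The repeated "'option mountbroker-root' missing in glusterd vol file"
message means glusterd on the slave never picked up the mountbroker
configuration. After a successful gluster-mountbroker setup,
/etc/glusterfs/glusterd.vol on each slave node should carry entries
roughly like the sketch below; user sas and volume code-misc are from
this thread, while the group name and the exact option set are
assumptions based on the geo-replication documentation, so check the
guides linked later in the thread:

    volume management
        type mgmt/glusterd
        option working-directory /var/lib/glusterd
        option mountbroker-root /var/mountbroker-root
        option mountbroker-geo-replication.sas code-misc
        option geo-replication-log-group geogroup
        option rpc-auth-allow-insecure on
    end-volume

glusterd must be restarted on the slave nodes after these options are in
place.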
From sdeepugd at gmail.com  Thu Jun 6 04:54:18 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Thu, 06 Jun 2019 04:54:18 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To: 
References: 
Message-ID: 

Hi Kotresh, Sunny,
I have mailed the logs I found on one of the slave machines. Is there
anything to do with permissions? Please help.

On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan wrote:

> Hi Kotresh, Sunny,
> Found this log on the slave machine:
> [...]
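On the permission question above: the geo-replication guides expect the
mountbroker root directory to exist on every slave node, owned by root
and not writable by group or others (0711 is the mode the documentation
uses). A hedged check, with the path taken from the setup command quoted
later in this thread:

    # on each slave node
    stat -c '%U %a %n' /var/mountbroker-root   # expect: root 711 /var/mountbroker-root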
From sdeepugd at gmail.com  Thu Jun  6 10:30:09 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Thu, 06 Jun 2019 10:30:09 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To:
References:
Message-ID:

Hi
I have followed the steps below to create the geo-replication, but the status seems to be in a Faulty state.

Steps:

- Installed gluster version 5.6 on six nodes in total.

>     glusterfs 5.6
>     Repository revision: git://git.gluster.org/glusterfs.git
>     Copyright (c) 2006-2016 Red Hat, Inc.
>     GlusterFS comes with ABSOLUTELY NO WARRANTY.
>     It is licensed to you under your choice of the GNU Lesser
>     General Public License, version 3 or any later version (LGPLv3
>     or later), or the GNU General Public License, version 2 (GPLv2),
>     in all cases as published by the Free Software Foundation

- peer-probed the first three nodes and the second three nodes.

[image: Screen Shot 2019-06-06 at 12.21.41 PM.png] [image: Screen Shot 2019-06-06 at 12.21.19 PM.png]

- Added a new volume in both clusters.

[image: Screen Shot 2019-06-06 at 12.24.29 PM.png] [image: Screen Shot 2019-06-06 at 12.24.18 PM.png]

- Executed the gluster-mountbroker commands and restarted glusterd.

>     gluster-mountbroker setup /var/mountbroker-root sas
>     gluster-mountbroker remove --volume code-misc --user sas

- Configured passwordless ssh from master to slave.

>     ssh-keygen; ssh-copy-id sas at 192.168.185.107

- Created a common pem pub file.

>     gluster system:: execute gsec_create

- Created the geo-replication session.

>     gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc create push-pem

- Executed the following command on the slave.

>     /usr/libexec/glusterfs/set_geo_rep_pem_keys.sh sas code-misc code-misc

- Started the geo-replication.

>     gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc start

- At this point the geo-replication worked fine. Tested with 2000 files; all of them synced.
- Then I updated all the nodes to version 6.2, using rpms built from the source code in a docker container on my personal machine.

>     gluster --version
>     glusterfs 6.2
>     Repository revision: git://git.gluster.org/glusterfs.git
>     Copyright (c) 2006-2016 Red Hat, Inc.
>     GlusterFS comes with ABSOLUTELY NO WARRANTY.
>     It is licensed to you under your choice of the GNU Lesser
>     General Public License, version 3 or any later version (LGPLv3
>     or later), or the GNU General Public License, version 2 (GPLv2),
>     in all cases as published by the Free Software Foundation.

- I stopped the glusterd daemons on all the nodes, along with the volume and the geo-replication.
- When I started the daemons, the volume and the geo-replication session again, the status appears Faulty.
- Also note that the "gluster-mountbroker status" command now always ends in a Python exception like this:

>     Traceback (most recent call last):
>       File "/usr/sbin/gluster-mountbroker", line 396, in <module>
>         runcli()
>       File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 225, in runcli
>         cls.run(args)
>       File "/usr/sbin/gluster-mountbroker", line 275, in run
>         out = execute_in_peers("node-status")
>       File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 127, in execute_in_peers
>         raise GlusterCmdException((rc, out, err, " ".join(cmd)))
>     gluster.cliutils.cliutils.GlusterCmdException: (1, '', 'Unable to end. Error : Success\n', 'gluster system:: execute mountbroker.py node-status')

Is it just me, or does everyone get an error from the gluster-mountbroker command on gluster versions greater than 6.0? Please help.

Thank you
Deepak

On Thu, Jun 6, 2019 at 10:35 AM Sunny Kumar wrote:

> Hi,
>
> Updated link for documentation:
> https://docs.gluster.org/en/latest/Administrator%20Guide/Geo%20Replication/
>
> You can use this tool as well:
> http://aravindavk.in/blog/gluster-georep-tools/
>
> -Sunny

On Thu, Jun 6, 2019 at 10:29 AM Kotresh Hiremath Ravishankar wrote:

> Hi,
>
> I think the steps to set up non-root geo-rep were not followed properly. The following entry is missing in the glusterd vol file, and it is required:
>
>     The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
>
> Could you please follow the steps from the link below?
>
> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave
>
> And let us know if you still face the issue.

On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Hi Kotresh, Sunny
> I have mailed the logs I found in one of the slave machines. Is there anything to do with permissions? Please help.

On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Hi Kotresh, Sunny
> Found this log in the slave machine.
>     [2019-06-05 08:49:10.632583] I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req
>     The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-05 08:49:10.632583] and [2019-06-05 08:49:10.670863]
>     The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and [2019-06-05 08:50:37.254063]
>     The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 34 times between [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079]
>     The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 34 times between [2019-06-05 08:48:41.005444] and [2019-06-05 08:50:37.254080]
>     [2019-06-05 08:50:46.361347] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-05 08:50:46.361384] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-05 08:50:46.361419] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>     The message "I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and [2019-06-05 08:52:34.019741]
>     The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
>     The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 33 times between [2019-06-05 08:50:46.361419] and [2019-06-05 08:52:34.019758]
>     [2019-06-05 08:52:44.426839] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-05 08:52:44.426886] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-05 08:52:44.426896] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]

On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Thank you Kotresh

On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath Ravishankar <khiremat at redhat.com> wrote:

> Ccing Sunny, who was investigating a similar issue.

On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Have already added the path in bashrc. Still in Faulty state.

On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar <khiremat at redhat.com> wrote:

> Could you please try adding /usr/sbin to $PATH for user 'sas'? If it's bash, add 'export PATH=/usr/sbin:$PATH' in /home/sas/.bashrc

On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Hi Kotresh
> Please find the logs of the above error.
>
> Master log snippet:
>
>     [2019-06-04 11:52:09.254731] I [resource(worker /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing SSH connection between master and slave...
>     [2019-06-04 11:52:09.308923] D [repce(worker /home/sas/gluster/data/code-misc):196:push] RepceClient: call 89724:139652759443264:1559649129.31 __repce_version__() ...
>     [2019-06-04 11:52:09.602792] E [syncdutils(worker /home/sas/gluster/data/code-misc):311:log_raise_exception] : connection to peer is broken
>     [2019-06-04 11:52:09.603312] E [syncdutils(worker /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i /var/lib/glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas at 192.168.185.107::code-misc --master-node 192.168.185.106 --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick /home/sas/gluster/data/code-misc --local-node 192.168.185.122 --local-node-id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 --slave-log-level DEBUG --slave-gluster-log-level INFO --slave-gluster-command-dir /usr/sbin error=1
>     [2019-06-04 11:52:09.614996] I [repce(agent /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating on reaching EOF.
>     [2019-06-04 11:52:09.615545] D [monitor(monitor):271:monitor] Monitor: worker(/home/sas/gluster/data/code-misc) connected
>     [2019-06-04 11:52:09.616528] I [monitor(monitor):278:monitor] Monitor: worker died in startup phase brick=/home/sas/gluster/data/code-misc
>     [2019-06-04 11:52:09.619391] I [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status Change status=Faulty
>
> Slave log snippet:
>
>     [2019-06-04 11:50:09.782668] E [syncdutils(slave 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or directory)
>     [2019-06-04 11:50:11.188167] W [gsyncd(slave 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : Session config file not exists, using the default config path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf
>     [2019-06-04 11:50:11.201070] I [resource(slave 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] GLUSTER: Mounting gluster volume locally...
>     [2019-06-04 11:50:11.271231] E [resource(slave 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] MountbrokerMounter: glusterd answered mnt=
>     [2019-06-04 11:50:11.271998] E [syncdutils(slave 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen: command returned error cmd=/usr/sbin/gluster --remote-host=localhost system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1
>     [2019-06-04 11:50:11.272113] E [syncdutils(slave 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen: /usr/sbin/gluster> 2 : failed with this errno (No such file or directory)

On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Hi
> As discussed, I have upgraded gluster from 4.1 to 6.2. But the geo-replication fails to start and stays in a Faulty state.

On Fri, May 31, 2019, 5:32 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Checked the data. It remains at 2708. No progress.

On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath Ravishankar <khiremat at redhat.com> wrote:

> That means it could be working, and the defunct process might be some old zombie one. Could you check whether the data progresses?

On Fri, May 31, 2019 at 4:29 PM deepu srinivasan <sdeepugd at gmail.com> wrote:

> Hi
> When I change the rsync option, the rsync process doesn't seem to start. Only a defunct process is listed in ps aux. Only when I set the rsync option to " " and restart all the processes is the rsync process listed in ps aux.

On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath Ravishankar <khiremat at redhat.com> wrote:

> Yes, the rsync config option should have fixed this issue.
>
> Could you share the output of the following?
>
> 1. gluster volume geo-replication <mastervol> <slavehost>::<slavevol> config rsync-options
> 2. ps -ef | grep rsync
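A sketch of those two checks with the names used elsewhere in this thread filled in (code-misc, user sas and slave 192.168.185.107 are this thread's values; substitute your own):

    # 1. confirm the workaround option is actually set on the session;
    #    expected output once set: --ignore-missing-args
    gluster volume geo-replication code-misc sas@192.168.185.107::code-misc config rsync-options

    # 2. confirm an rsync worker is alive rather than defunct;
    #    the [r] keeps grep itself out of the listing, and a zombie
    #    worker shows up with "<defunct>" in its command column
    ps -ef | grep '[r]sync'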
From sdeepugd at gmail.com  Thu Jun  6 11:23:04 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Thu, 06 Jun 2019 11:23:04 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To:
References:
Message-ID:

Hi Sunny
I have changed the file /usr/libexec/glusterfs/peer_mountbroker.py as mentioned in the patch. Now the "gluster-mountbroker status" command is working fine, but the geo-replication still seems to be in the Faulty state.

[image: Screen Shot 2019-06-06 at 4.50.30 PM.png] [image: Screen Shot 2019-06-06 at 4.51.55 PM.png]

Thank you
Deepak

On Thu, Jun 6, 2019 at 4:10 PM Sunny Kumar wrote:

> The above error can be tracked here:
>
> https://bugzilla.redhat.com/show_bug.cgi?id=1709248
>
> and patch link:
>
> https://review.gluster.org/#/c/glusterfs/+/22716/
>
> You can apply the patch and test it; however, it is waiting on regression to pass and merge.
>
> -Sunny
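When the wrapper dies with that GlusterCmdException, the traceback itself names the command it runs on each peer, so a rough way to see the underlying failure (both commands below are taken from the traceback and this thread; run them on each node) is:

    # the command gluster-mountbroker wraps; running it directly usually
    # shows the real per-node error that the wrapper swallows
    gluster system:: execute mountbroker.py node-status

    # then re-check the tool itself once the 22716 fix is applied
    gluster-mountbroker status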
From sdeepugd at gmail.com  Thu Jun  6 11:57:52 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Thu, 06 Jun 2019 11:57:52 -0000
Subject: [Gluster-users] Geo Replication stops replicating
In-Reply-To:
References:
Message-ID:

Hi Sunny
Please find the logs attached.

>     The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 13 times between [2019-06-06 11:51:43.986788] and [2019-06-06 11:52:32.764546]
>     The message "W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]" repeated 13 times between [2019-06-06 11:51:43.986798] and [2019-06-06 11:52:32.764548]
>     The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-06 11:53:07.064332] and [2019-06-06 11:53:07.303978]
>     [2019-06-06 11:55:35.624320] I [MSGID: 106495] [glusterd-handler.c:3137:__glusterd_handle_getwd] 0-glusterd: Received getwd req
>     [2019-06-06 11:55:35.884345] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: quotad already stopped
>     [2019-06-06 11:55:35.884373] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: quotad service is stopped
>     [2019-06-06 11:55:35.884459] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: bitd already stopped
>     [2019-06-06 11:55:35.884473] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: bitd service is stopped
>     [2019-06-06 11:55:35.884554] I [MSGID: 106131] [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: scrub already stopped
>     [2019-06-06 11:55:35.884567] I [MSGID: 106568] [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: scrub service is stopped
>     [2019-06-06 11:55:35.893823] I [run.c:242:runner_log] (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) [0x7f7380d60e1a] -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) [0x7f738cbc5df5] ) 0-management: Ran script: /var/lib/glusterd/hooks/1/set/post/S30samba-set.sh --volname=code-misc -o features.read-only=on --gd-workdir=/var/lib/glusterd
>     [2019-06-06 11:55:35.900465] I [run.c:242:runner_log] (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) [0x7f7380d60e1a] -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) [0x7f738cbc5df5] ) 0-management: Ran script: /var/lib/glusterd/hooks/1/set/post/S32gluster_enable_shared_storage.sh --volname=code-misc -o features.read-only=on --gd-workdir=/var/lib/glusterd
>     [2019-06-06 11:55:43.485284] I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req
>     The message "I [MSGID: 106488] [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: Received get vol req" repeated 2 times between [2019-06-06 11:55:43.485284] and [2019-06-06 11:55:43.512321]
>     [2019-06-06 11:55:44.055419] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-06 11:55:44.055473] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-06 11:55:44.055483] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>     [2019-06-06 11:55:44.056695] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-06 11:55:44.056725] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-06 11:55:44.056734] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>     [2019-06-06 11:55:44.057522] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-06 11:55:44.057552] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-06 11:55:44.057562] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]
>     [2019-06-06 11:55:54.655681] I [MSGID: 106496] [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received mount req
>     [2019-06-06 11:55:54.655741] E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file
>     [2019-06-06 11:55:54.655752] W [MSGID: 106176] [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful mount request [No such file or directory]

On Thu, Jun 6, 2019 at 5:09 PM Sunny Kumar wrote:

> What's the current traceback? Please share.
>
> -Sunny
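Since these glusterd logs keep repeating 'option mountbroker-root' missing, a sketch of what to verify on each slave node. The option names and values below are what the gluster-mountbroker setup quoted earlier in this thread is expected to write for user sas and volume code-misc; treat them as assumptions and compare against your own glusterd.vol:

    # check whether the mountbroker options ever landed in glusterd.vol
    grep -E 'mountbroker|geo-replication-log-group|rpc-auth-allow-insecure' /etc/glusterfs/glusterd.vol

    # roughly what should be present after a non-root setup:
    #   option mountbroker-root /var/mountbroker-root
    #   option mountbroker-geo-replication.sas code-misc
    #   option geo-replication-log-group sas
    #   option rpc-auth-allow-insecure on

    # if the entries are missing, redo the setup and restart glusterd
    gluster-mountbroker setup /var/mountbroker-root sas
    gluster-mountbroker add code-misc sas
    systemctl restart glusterd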
URL: From sdeepugd at gmail.com Thu Jun 6 13:37:15 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Thu, 06 Jun 2019 13:37:15 -0000 Subject: [Gluster-users] Geo Replication stops replicating In-Reply-To: References: Message-ID: Hi Sunny Sorry, that was a typo. I used the following command. > gluster-mountbroker add code-misc sas > On Thu, Jun 6, 2019 at 6:23 PM Sunny Kumar wrote: > You should not have used this one: > > > > gluster-mountbroker remove --volume code-misc --user sas > > -- This one is to remove volume/user from mount broker. > > Please try setting up mount broker once again. > > -Sunny > > On Thu, Jun 6, 2019 at 5:28 PM deepu srinivasan > wrote: > > > > Hi Sunny > > Please find the logs attached > >> > >> The message "E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file" repeated 13 times between > [2019-06-06 11:51:43.986788] and [2019-06-06 11:52:32.764546] > >> > >> The message "W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory]" repeated 13 times between > [2019-06-06 11:51:43.986798] and [2019-06-06 11:52:32.764548] > >> > >> The message "I [MSGID: 106488] > [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: > Received get vol req" repeated 2 times between [2019-06-06 11:53:07.064332] > and [2019-06-06 11:53:07.303978] > >> > >> [2019-06-06 11:55:35.624320] I [MSGID: 106495] > [glusterd-handler.c:3137:__glusterd_handle_getwd] 0-glusterd: Received > getwd req > >> > >> [2019-06-06 11:55:35.884345] I [MSGID: 106131] > [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: quotad already > stopped > >> > >> [2019-06-06 11:55:35.884373] I [MSGID: 106568] > [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: quotad service is > stopped > >> > >> [2019-06-06 11:55:35.884459] I [MSGID: 106131] > [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: bitd already > stopped > >> > >> [2019-06-06 11:55:35.884473] I [MSGID: 106568] > [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: bitd service is > stopped > >> > >> [2019-06-06 11:55:35.884554] I [MSGID: 106131] > [glusterd-proc-mgmt.c:86:glusterd_proc_stop] 0-management: scrub already > stopped > >> > >> [2019-06-06 11:55:35.884567] I [MSGID: 106568] > [glusterd-svc-mgmt.c:253:glusterd_svc_stop] 0-management: scrub service is > stopped > >> > >> [2019-06-06 11:55:35.893823] I [run.c:242:runner_log] > (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) > [0x7f7380d60e1a] > -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) > [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) > [0x7f738cbc5df5] ) 0-management: Ran script: > /var/lib/glusterd/hooks/1/set/post/S30samba-set.sh --volname=code-misc -o > features.read-only=on --gd-workdir=/var/lib/glusterd > >> > >> [2019-06-06 11:55:35.900465] I [run.c:242:runner_log] > (-->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe8e1a) > [0x7f7380d60e1a] > -->/usr/lib64/glusterfs/6.2/xlator/mgmt/glusterd.so(+0xe88e5) > [0x7f7380d608e5] -->/lib64/libglusterfs.so.0(runner_log+0x115) > [0x7f738cbc5df5] ) 0-management: Ran script: > /var/lib/glusterd/hooks/1/set/post/S32gluster_enable_shared_storage.sh > --volname=code-misc -o features.read-only=on --gd-workdir=/var/lib/glusterd > >> > >> [2019-06-06 11:55:43.485284] I [MSGID: 106488] > [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: > Received get vol req > >> > 
>> The message "I [MSGID: 106488] > [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: > Received get vol req" repeated 2 times between [2019-06-06 11:55:43.485284] > and [2019-06-06 11:55:43.512321] > >> > >> [2019-06-06 11:55:44.055419] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> > >> [2019-06-06 11:55:44.055473] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> > >> [2019-06-06 11:55:44.055483] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > >> > >> [2019-06-06 11:55:44.056695] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> > >> [2019-06-06 11:55:44.056725] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> > >> [2019-06-06 11:55:44.056734] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > >> > >> [2019-06-06 11:55:44.057522] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> > >> [2019-06-06 11:55:44.057552] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> > >> [2019-06-06 11:55:44.057562] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > >> > >> [2019-06-06 11:55:54.655681] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> > >> [2019-06-06 11:55:54.655741] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> > >> [2019-06-06 11:55:54.655752] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > > > > > > On Thu, Jun 6, 2019 at 5:09 PM Sunny Kumar wrote: > >> > >> Whats current trackback please share. > >> > >> -Sunny > >> > >> > >> On Thu, Jun 6, 2019 at 4:53 PM deepu srinivasan > wrote: > >> > > >> > Hi Sunny > >> > I have changed the file in /usr/libexec/glusterfs/peer_mountbroker.py > as mentioned in the patch. > >> > Now the "gluster-mountbroker status" command is working fine. But the > geo-replication seems to be in the faulty state still. > >> > > >> > > >> > Thankyou > >> > Deepak > >> > > >> > On Thu, Jun 6, 2019 at 4:10 PM Sunny Kumar > wrote: > >> >> > >> >> Above error can be tracked here: > >> >> > >> >> https://bugzilla.redhat.com/show_bug.cgi?id=1709248 > >> >> > >> >> and patch link: > >> >> https://review.gluster.org/#/c/glusterfs/+/22716/ > >> >> > >> >> You can apply patch and test it however its waiting on regression to > >> >> pass and merge. > >> >> > >> >> -Sunny > >> >> > >> >> > >> >> On Thu, Jun 6, 2019 at 4:00 PM deepu srinivasan > wrote: > >> >> > > >> >> > Hi > >> >> > I have followed the following steps to create the geo-replication > but the status seems to be in a faulty state. > >> >> > > >> >> > Steps : > >> >> > > >> >> > Installed cluster version 5.6 in totally six nodes. 
> >> >> >>
> >> >> >> glusterfs 5.6
> >> >> >> Repository revision: git://git.gluster.org/glusterfs.git
> >> >> >> Copyright (c) 2006-2016 Red Hat, Inc.
> >> >> >> GlusterFS comes with ABSOLUTELY NO WARRANTY.
> >> >> >> It is licensed to you under your choice of the GNU Lesser
> >> >> >> General Public License, version 3 or any later version (LGPLv3
> >> >> >> or later), or the GNU General Public License, version 2 (GPLv2),
> >> >> >> in all cases as published by the Free Software Foundation
> >> >> >
> >> >> > peer_probed the first three nodes and the second three nodes.
> >> >> >
> >> >> > Added a new volume in both clusters.
> >> >> >
> >> >> > Executed the gluster-mountbroker commands and restarted glusterd.
> >> >> >>
> >> >> >> gluster-mountbroker setup /var/mountbroker-root sas
> >> >> >> gluster-mountbroker remove --volume code-misc --user sas
> >> >> >
> >> >> > Configured passwordless ssh from master to slave:
> >> >> >>
> >> >> >> ssh-keygen; ssh-copy-id sas at 192.168.185.107
> >> >> >
> >> >> > Created a common pem pub file:
> >> >> >>
> >> >> >> gluster system:: execute gsec_create
> >> >> >
> >> >> > Created the geo-replication session:
> >> >> >>
> >> >> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc create push-pem
> >> >> >
> >> >> > Executed the following command on the slave:
> >> >> >>
> >> >> >> /usr/libexec/glusterfs/set_geo_rep_pem_keys.sh sas code-misc code-misc
> >> >> >
> >> >> > Started the gluster geo-replication:
> >> >> >>
> >> >> >> gluster volume geo-replication code-misc sas at 192.168.185.107::code-misc start
> >> >> >
> >> >> > At that point the geo-replication worked fine.
> >> >> > Tested with 2000 files; all seemed to sync fine.
> >> >> >
> >> >> > Then I updated all the nodes to version 6.2, using rpms built from the source code in a docker container on my personal machine.
> >> >> >
> >> >> >> gluster --version
> >> >> >>
> >> >> >> glusterfs 6.2
> >> >> >> Repository revision: git://git.gluster.org/glusterfs.git
> >> >> >> Copyright (c) 2006-2016 Red Hat, Inc.
> >> >> >> GlusterFS comes with ABSOLUTELY NO WARRANTY.
> >> >> >> It is licensed to you under your choice of the GNU Lesser
> >> >> >> General Public License, version 3 or any later version (LGPLv3
> >> >> >> or later), or the GNU General Public License, version 2 (GPLv2),
> >> >> >> in all cases as published by the Free Software Foundation.
> >> >> >
> >> >> > I stopped the glusterd daemons on all the nodes, along with the volume and the geo-replication session.
> >> >> > Then I started the daemons, the volume and the geo-replication session again; the status seems to be faulty.
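For reference: on the slave nodes, a successful mountbroker setup records its settings in the glusterd volfile, which is exactly what the "'option mountbroker-root' missing in glusterd vol file" errors in the logs are complaining about. A minimal sketch of what /etc/glusterfs/glusterd.vol should contain after the gluster-mountbroker setup/add commands above (assuming the 'sas' user and group and the 'code-misc' volume from this thread; other default options omitted), with glusterd restarted on every slave node afterwards:

volume management
    type mgmt/glusterd
    option working-directory /var/lib/glusterd
    # entries written by gluster-mountbroker setup/add (values here
    # assume the sas user/group and code-misc volume of this thread):
    option mountbroker-root /var/mountbroker-root
    option mountbroker-geo-replication.sas code-misc
    option geo-replication-log-group sas
    option rpc-auth-allow-insecure on
end-volume

If these options are absent after running gluster-mountbroker, the slave-side mount requests from gsyncd will keep failing as shown in the log snippets in this thread.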
> >> >> > Also noted that the result of the "gluster-mountbroker status" command
> always ends in a Python exception like this:
> >> >> >>
> >> >> >> Traceback (most recent call last):
> >> >> >>   File "/usr/sbin/gluster-mountbroker", line 396, in
> >> >> >>     runcli()
> >> >> >>   File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 225, in runcli
> >> >> >>     cls.run(args)
> >> >> >>   File "/usr/sbin/gluster-mountbroker", line 275, in run
> >> >> >>     out = execute_in_peers("node-status")
> >> >> >>   File "/usr/lib/python2.7/site-packages/gluster/cliutils/cliutils.py", line 127, in execute_in_peers
> >> >> >>     raise GlusterCmdException((rc, out, err, " ".join(cmd)))
> >> >> >> gluster.cliutils.cliutils.GlusterCmdException: (1, '', 'Unable to end. Error : Success\n', 'gluster system:: execute mountbroker.py node-status')
> >> >> >
> >> >> > Is it just me, or does everyone get an error from the gluster-mountbroker command for gluster versions greater than 6.0? Please help.
> >> >> >
> >> >> > Thank you
> >> >> > Deepak
> >> >> >
> >> >> > On Thu, Jun 6, 2019 at 10:35 AM Sunny Kumar > wrote:
> >> >> >>
> >> >> >> Hi,
> >> >> >>
> >> >> >> Updated link for documentation:
> >> >> >>
> >> >> >> -- https://docs.gluster.org/en/latest/Administrator%20Guide/Geo%20Replication/
> >> >> >>
> >> >> >> You can use this tool as well:
> >> >> >> http://aravindavk.in/blog/gluster-georep-tools/
> >> >> >>
> >> >> >> -Sunny
> >> >> >>
> >> >> >> On Thu, Jun 6, 2019 at 10:29 AM Kotresh Hiremath Ravishankar
> >> >> >> wrote:
> >> >> >> >
> >> >> >> > Hi,
> >> >> >> >
> >> >> >> > I think the steps to set up non-root geo-rep were not followed properly. The following entry, which is required, is missing in the glusterd vol file.
> >> >> >> >
> >> >> >> > The message "E [MSGID: 106061] [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option mountbroker-root' missing in glusterd vol file" repeated 33 times between [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757]
> >> >> >> >
> >> >> >> > Could you please follow the steps from the link below?
> >> >> >> >
> >> >> >> > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/html-single/administration_guide/index#Setting_Up_the_Environment_for_a_Secure_Geo-replication_Slave
> >> >> >> >
> >> >> >> > And let us know if you still face the issue.
> >> >> >> >
> >> >> >> > On Thu, Jun 6, 2019 at 10:24 AM deepu srinivasan < sdeepugd at gmail.com> wrote:
> >> >> >> >>
> >> >> >> >> Hi Kotresh, Sunny
> >> >> >> >> I have mailed the logs I found in one of the slave machines. Could this have anything to do with permissions? Please help.
> >> >> >> >>
> >> >> >> >> On Wed, Jun 5, 2019 at 2:28 PM deepu srinivasan < sdeepugd at gmail.com> wrote:
> >> >> >> >>>
> >> >> >> >>> Hi Kotresh, Sunny
> >> >> >> >>> Found this log in the slave machine.
> >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:49:10.632583] I [MSGID: 106488] > [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: > Received get vol req > >> >> >> >>>> > >> >> >> >>>> The message "I [MSGID: 106488] > [glusterd-handler.c:1559:__glusterd_handle_cli_get_volume] 0-management: > Received get vol req" repeated 2 times between [2019-06-05 08:49:10.632583] > and [2019-06-05 08:49:10.670863] > >> >> >> >>>> > >> >> >> >>>> The message "I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req" repeated 34 times between [2019-06-05 08:48:41.005398] and > [2019-06-05 08:50:37.254063] > >> >> >> >>>> > >> >> >> >>>> The message "E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file" repeated 34 times between > [2019-06-05 08:48:41.005434] and [2019-06-05 08:50:37.254079] > >> >> >> >>>> > >> >> >> >>>> The message "W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory]" repeated 34 times between > [2019-06-05 08:48:41.005444] and [2019-06-05 08:50:37.254080] > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:50:46.361347] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:50:46.361384] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:50:46.361419] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > >> >> >> >>>> > >> >> >> >>>> The message "I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req" repeated 33 times between [2019-06-05 08:50:46.361347] and > [2019-06-05 08:52:34.019741] > >> >> >> >>>> > >> >> >> >>>> The message "E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file" repeated 33 times between > [2019-06-05 08:50:46.361384] and [2019-06-05 08:52:34.019757] > >> >> >> >>>> > >> >> >> >>>> The message "W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory]" repeated 33 times between > [2019-06-05 08:50:46.361419] and [2019-06-05 08:52:34.019758] > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:52:44.426839] I [MSGID: 106496] > [glusterd-handler.c:3187:__glusterd_handle_mount] 0-glusterd: Received > mount req > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:52:44.426886] E [MSGID: 106061] > [glusterd-mountbroker.c:555:glusterd_do_mount] 0-management: 'option > mountbroker-root' missing in glusterd vol file > >> >> >> >>>> > >> >> >> >>>> [2019-06-05 08:52:44.426896] W [MSGID: 106176] > [glusterd-mountbroker.c:719:glusterd_do_mount] 0-management: unsuccessful > mount request [No such file or directory] > >> >> >> >>> > >> >> >> >>> > >> >> >> >>> On Wed, Jun 5, 2019 at 1:06 AM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>> > >> >> >> >>>> Thankyou Kotresh > >> >> >> >>>> > >> >> >> >>>> On Tue, Jun 4, 2019, 11:20 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> >> >> >>>>> > >> >> >> >>>>> Ccing Sunny, who was investing similar issue. 
> >> >> >> >>>>> > >> >> >> >>>>> On Tue, Jun 4, 2019 at 5:46 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>> > >> >> >> >>>>>> Have already added the path in bashrc . Still in faulty > state > >> >> >> >>>>>> > >> >> >> >>>>>> On Tue, Jun 4, 2019, 5:27 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> >> >> >>>>>>> > >> >> >> >>>>>>> could you please try adding /usr/sbin to $PATH for user > 'sas'? If it's bash, add 'export PATH=/usr/sbin:$PATH' in > >> >> >> >>>>>>> /home/sas/.bashrc > >> >> >> >>>>>>> > >> >> >> >>>>>>> On Tue, Jun 4, 2019 at 5:24 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>> > >> >> >> >>>>>>>> Hi Kortesh > >> >> >> >>>>>>>> Please find the logs of the above error > >> >> >> >>>>>>>> Master log snippet > >> >> >> >>>>>>>>> > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.254731] I [resource(worker > /home/sas/gluster/data/code-misc):1379:connect_remote] SSH: Initializing > SSH connection between master and slave... > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.308923] D [repce(worker > /home/sas/gluster/data/code-misc):196:push] RepceClient: call > 89724:139652759443264:1559649129.31 __repce_version__() ... > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.602792] E [syncdutils(worker > /home/sas/gluster/data/code-misc):311:log_raise_exception] : > connection to peer is broken > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.603312] E [syncdutils(worker > /home/sas/gluster/data/code-misc):805:errlog] Popen: command returned > error cmd=ssh -oPasswordAuthentication=no -oStrictHostKeyChecking=no -i > /var/lib/ glusterd/geo-replication/secret.pem -p 22 -oControlMaster=auto -S > /tmp/gsyncd-aux-ssh-4aL2tc/d893f66e0addc32f7d0080bb503f5185.sock > sas at 192.168.185.107 /usr/libexec/glusterfs/gsyncd slave code-misc sas@ > 192.168.185.107::code-misc --master-node 192.168.185.106 > --master-node-id 851b64d0-d885-4ae9-9b38-ab5b15db0fec --master-brick > /home/sas/gluster/data/code-misc --local-node 192.168.185.122 > --local-node- id bcaa7af6-c3a1-4411-8e99-4ebecb32eb6a --slave-timeout 120 > --slave-log-level DEBUG --slave-gluster-log-level INFO > --slave-gluster-command-dir /usr/sbin error=1 > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.614996] I [repce(agent > /home/sas/gluster/data/code-misc):97:service_loop] RepceServer: terminating > on reaching EOF. 
> >> >> >> >>>>>>>>> [2019-06-04 11:52:09.615545] D > [monitor(monitor):271:monitor] Monitor: > worker(/home/sas/gluster/data/code-misc) connected > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.616528] I > [monitor(monitor):278:monitor] Monitor: worker died in startup phase > brick=/home/sas/gluster/data/code-misc > >> >> >> >>>>>>>>> [2019-06-04 11:52:09.619391] I > [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status > Change status=Faulty > >> >> >> >>>>>>>> > >> >> >> >>>>>>>> > >> >> >> >>>>>>>> Slave log snippet > >> >> >> >>>>>>>>> > >> >> >> >>>>>>>>> [2019-06-04 11:50:09.782668] E [syncdutils(slave > 192.168.185.106/home/sas/gluster/data/code-misc):809:logerr] Popen: > /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) > >> >> >> >>>>>>>>> [2019-06-04 11:50:11.188167] W [gsyncd(slave > 192.168.185.125/home/sas/gluster/data/code-misc):305:main] : Session > config file not exists, using the default config > path=/var/lib/glusterd/geo-replication/code-misc_192.168.185.107_code-misc/gsyncd.conf > >> >> >> >>>>>>>>> [2019-06-04 11:50:11.201070] I [resource(slave > 192.168.185.125/home/sas/gluster/data/code-misc):1098:connect] GLUSTER: > Mounting gluster volume locally... > >> >> >> >>>>>>>>> [2019-06-04 11:50:11.271231] E [resource(slave > 192.168.185.125/home/sas/gluster/data/code-misc):1006:handle_mounter] > MountbrokerMounter: glusterd answered mnt= > >> >> >> >>>>>>>>> [2019-06-04 11:50:11.271998] E [syncdutils(slave > 192.168.185.125/home/sas/gluster/data/code-misc):805:errlog] Popen: > command returned error cmd=/usr/sbin/gluster --remote-host=localhost > system:: mount sas user-map-root=sas aux-gfid-mount acl log-level=INFO > log-file=/var/log/glusterfs/geo-replication-slaves/code-misc_192.168.185.107_code-misc/mnt-192.168.185.125-home-sas-gluster-data-code-misc.log > volfile-server=localhost volfile-id=code-misc client-pid=-1 error=1 > >> >> >> >>>>>>>>> [2019-06-04 11:50:11.272113] E [syncdutils(slave > 192.168.185.125/home/sas/gluster/data/code-misc):809:logerr] Popen: > /usr/sbin/gluster> 2 : failed with this errno (No such file or directory) > >> >> >> >>>>>>>> > >> >> >> >>>>>>>> > >> >> >> >>>>>>>> On Tue, Jun 4, 2019 at 5:10 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>>> > >> >> >> >>>>>>>>> Hi > >> >> >> >>>>>>>>> As discussed I have upgraded gluster from 4.1 to 6.2 > version. But the Geo replication failed to start. > >> >> >> >>>>>>>>> Stays in faulty state > >> >> >> >>>>>>>>> > >> >> >> >>>>>>>>> On Fri, May 31, 2019, 5:32 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>>>> > >> >> >> >>>>>>>>>> Checked the data. It remains in 2708. No progress. > >> >> >> >>>>>>>>>> > >> >> >> >>>>>>>>>> On Fri, May 31, 2019 at 4:36 PM Kotresh Hiremath > Ravishankar wrote: > >> >> >> >>>>>>>>>>> > >> >> >> >>>>>>>>>>> That means it could be working and the defunct > process might be some old zombie one. Could you check, that data progress ? > >> >> >> >>>>>>>>>>> > >> >> >> >>>>>>>>>>> On Fri, May 31, 2019 at 4:29 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>>>>>> > >> >> >> >>>>>>>>>>>> Hi > >> >> >> >>>>>>>>>>>> When i change the rsync option the rsync process > doesnt seem to start . Only a defunt process is listed in ps aux. Only when > i set rsync option to " " and restart all the process the rsync process is > listed in ps aux. 
> >> >> >> >>>>>>>>>>>> > >> >> >> >>>>>>>>>>>> > >> >> >> >>>>>>>>>>>> On Fri, May 31, 2019 at 4:23 PM Kotresh Hiremath > Ravishankar wrote: > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> Yes, rsync config option should have fixed this > issue. > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> Could you share the output of the following? > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> 1. gluster volume geo-replication > :: config rsync-options > >> >> >> >>>>>>>>>>>>> 2. ps -ef | grep rsync > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> On Fri, May 31, 2019 at 4:11 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>> Done. > >> >> >> >>>>>>>>>>>>>> We got the following result . > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> 1559298781.338234 write(2, "rsync: link_stat > \"/tmp/gsyncd-aux-mount-EEJ_sY/.gfid/3fa6aed8-802e-4efe-9903-8bc171176d88\" > failed: No such file or directory (2)", 128 > >> >> >> >>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>> seems like a file is missing ? > >> >> >> >>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:25 PM Kotresh Hiremath > Ravishankar wrote: > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> Hi, > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> Could you take the strace with with more string > size? The argument strings are truncated. > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> strace -s 500 -ttt -T -p > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:17 PM deepu srinivasan < > sdeepugd at gmail.com> wrote: > >> >> >> >>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>> Hi Kotresh > >> >> >> >>>>>>>>>>>>>>>> The above-mentioned work around did not work > properly. > >> >> >> >>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>> On Fri, May 31, 2019 at 3:16 PM deepu srinivasan > wrote: > >> >> >> >>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>> Hi Kotresh > >> >> >> >>>>>>>>>>>>>>>>> We have tried the above-mentioned rsync option > and we are planning to have the version upgrade to 6.0. > >> >> >> >>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>> On Fri, May 31, 2019 at 11:04 AM Kotresh > Hiremath Ravishankar wrote: > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> Hi, > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> This looks like the hang because stderr buffer > filled up with errors messages and no one reading it. > >> >> >> >>>>>>>>>>>>>>>>>> I think this issue is fixed in latest > releases. As a workaround, you can do following and check if it works. > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> Prerequisite: > >> >> >> >>>>>>>>>>>>>>>>>> rsync version should be > 3.1.0 > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> Workaround: > >> >> >> >>>>>>>>>>>>>>>>>> gluster volume geo-replication > :: config rsync-options "--ignore-missing-args" > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> Thanks, > >> >> >> >>>>>>>>>>>>>>>>>> Kotresh HR > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> On Thu, May 30, 2019 at 5:39 PM deepu > srinivasan wrote: > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> Hi > >> >> >> >>>>>>>>>>>>>>>>>>> We were evaluating Gluster geo Replication > between two DCs one is in US west and one is in US east. We took multiple > trials for different file size. 
> >> >> >> >>>>>>>>>>>>>>>>>>> The Geo Replication tends to stop replicating > but while checking the status it appears to be in Active state. But the > slave volume did not increase in size. > >> >> >> >>>>>>>>>>>>>>>>>>> So we have restarted the geo-replication > session and checked the status. The status was in an active state and it > was in History Crawl for a long time. We have enabled the DEBUG mode in > logging and checked for any error. > >> >> >> >>>>>>>>>>>>>>>>>>> There was around 2000 file appeared for > syncing candidate. The Rsync process starts but the rsync did not happen in > the slave volume. Every time the rsync process appears in the "ps auxxx" > list but the replication did not happen in the slave end. What would be the > cause of this problem? Is there anyway to debug it? > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> We have also checked the strace of the rync > program. > >> >> >> >>>>>>>>>>>>>>>>>>> it displays something like this > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> "write(2, "rsync: link_stat > \"/tmp/gsyncd-au"..., 128" > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> We are using the below specs > >> >> >> >>>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>>> Gluster version - 4.1.7 > >> >> >> >>>>>>>>>>>>>>>>>>> Sync mode - rsync > >> >> >> >>>>>>>>>>>>>>>>>>> Volume - 1x3 in each end (master and slave) > >> >> >> >>>>>>>>>>>>>>>>>>> Intranet Bandwidth - 10 Gig > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>>>>> -- > >> >> >> >>>>>>>>>>>>>>>>>> Thanks and Regards, > >> >> >> >>>>>>>>>>>>>>>>>> Kotresh H R > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>>>> -- > >> >> >> >>>>>>>>>>>>>>> Thanks and Regards, > >> >> >> >>>>>>>>>>>>>>> Kotresh H R > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> > >> >> >> >>>>>>>>>>>>> -- > >> >> >> >>>>>>>>>>>>> Thanks and Regards, > >> >> >> >>>>>>>>>>>>> Kotresh H R > >> >> >> >>>>>>>>>>> > >> >> >> >>>>>>>>>>> > >> >> >> >>>>>>>>>>> > >> >> >> >>>>>>>>>>> -- > >> >> >> >>>>>>>>>>> Thanks and Regards, > >> >> >> >>>>>>>>>>> Kotresh H R > >> >> >> >>>>>>> > >> >> >> >>>>>>> > >> >> >> >>>>>>> > >> >> >> >>>>>>> -- > >> >> >> >>>>>>> Thanks and Regards, > >> >> >> >>>>>>> Kotresh H R > >> >> >> >>>>> > >> >> >> >>>>> > >> >> >> >>>>> > >> >> >> >>>>> -- > >> >> >> >>>>> Thanks and Regards, > >> >> >> >>>>> Kotresh H R > >> >> >> > > >> >> >> > > >> >> >> > > >> >> >> > -- > >> >> >> > Thanks and Regards, > >> >> >> > Kotresh H R > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gdeschne at redhat.com Thu Jun 6 14:58:43 2019 From: gdeschne at redhat.com (=?UTF-8?Q?G=c3=bcnther_Deschner?=) Date: Thu, 06 Jun 2019 14:58:43 -0000 Subject: [Gluster-users] [Gluster-devel] Improve stability between SMB/CTDB and Gluster (together with Samba Core Developer) In-Reply-To: References: Message-ID: Hello, just a quick heads-up, during this week pretty much all Samba engineers are busy attending the SambaXP conference in Germany, in addition there was a public holiday in India also this week. Not sure about the general availability tomorrow, I would propose to look for a date maybe next week. Thanks, Guenther On 31/05/2019 14:32, David Spisla wrote: > Hello together, > > inorder not to lose the focus for the topic, I make new date suggestions > for next week > > June 03th ? 
07th at 12:30 - 14:30 IST (9:00 - 11:00 CEST), or
June 3rd - 6th at 16:30 - 18:30 IST (13:00 - 15:00 CEST)

Regards
David Spisla

On Tue, May 21, 2019 at 11:24 AM David Spisla wrote:

> Hello all,
>
> we are still seeking a day and time to talk about interesting Samba /
> Glusterfs issues. Here is a new list of possible dates and times.
>
> May 22nd - 24th at 12:30 - 14:30 IST (9:00 - 11:00 CEST)
>
> May 27th - 29th and 31st at 12:30 - 14:30 IST (9:00 - 11:00 CEST)
>
> On May 30th there is a holiday here in Germany.
>
> @Poornima Gurusiddaiah If there is any problem finding a date, please
> contact me. I will look for alternatives.
>
> Regards
> David Spisla
>
> On Thu, May 16, 2019 at 12:42 PM David Spisla wrote:
>
> Hello Amar,
>
> thank you for the information. Of course, we should wait for Poornima
> because of her knowledge.
>
> Regards
> David Spisla
>
> On Thu, May 16, 2019 at 12:23 PM Amar Tumballi Suryanarayan wrote:
>
> David, Poornima is on leave from today till 21st May, so having it after
> she comes back is better. She has more experience in SMB integration
> than many of us.
>
> -Amar
>
> On Thu, May 16, 2019 at 1:09 PM David Spisla wrote:
>
> Hello everyone,
>
> if there is any problem in finding a date and time, please contact me.
> It would be fine to have a meeting soon.
>
> Regards
> David Spisla
>
> On Mon, May 13, 2019 at 12:38 PM David Spisla wrote:
>
> Hi Poornima,
>
> that's fine. I would suggest these dates and times:
>
> May 15th - 17th at 12:30, 13:30, 14:30 IST (9:00, 10:00, 11:00 CEST)
> May 20th - 24th at 12:30, 13:30, 14:30 IST (9:00, 10:00, 11:00 CEST)
>
> I have added Volker Lendecke from Sernet to the mail. He is the Samba
> expert. Can one of you provide a host via bluejeans.com? If not, I will
> try it with GoToMeeting (https://www.gotomeeting.com).
>
> @all Please write your preferred dates and times. For me, all of the
> above dates and times are fine.
>
> Regards
> David
>
> *From:* Poornima Gurusiddaiah
> *Sent:* Monday, May 13, 2019 07:22
> *To:* David Spisla; Anoop C S; Gunther Deschner
> *Cc:* Gluster Devel; gluster-users at gluster.org List
> *Subject:* Re: [Gluster-devel] Improve stability between SMB/CTDB and
> Gluster (together with Samba Core Developer)
>
> Hi,
>
> We would definitely be interested in this. Thank you for contacting us.
> To start, we can have an online conference. Please suggest a few
> possible dates and times for the week (preferably between 7:00 AM and
> 9:00 PM IST).
>
> Adding Anoop and Gunther, who are also the main contributors to the
> Gluster-Samba integration.
>
> Thanks,
> Poornima
>
> On Thu, May 9, 2019 at 7:43 PM David Spisla wrote:
>
> Dear Gluster Community,
>
> at the moment we are improving the stability of SMB/CTDB and Gluster.
> For this purpose we are working together with an advanced Samba core
> developer. He did some debugging but needs more information about
> Gluster core behaviour.
>
> *Would any of the Gluster developers want to have an online conference
> with him and me?*
>
> I would organize everything.
In my opinion this is a good chance to improve the stability of Glusterfs,
and this is at the moment one of the major issues in the community.

Regards
David Spisla

_______________________________________________

Community Meeting Calendar:

APAC Schedule -
Every 2nd and 4th Tuesday at 11:30 AM IST
Bridge: https://bluejeans.com/836554017

NA/EMEA Schedule -
Every 1st and 3rd Tuesday at 01:00 PM EDT
Bridge: https://bluejeans.com/486278655

Gluster-devel mailing list
Gluster-devel at gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-devel

_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

--
Amar Tumballi (amarts)

--
Günther Deschner                    GPG-ID: 8EE11688
Red Hat                         gdeschner at redhat.com
Samba Team                              gd at samba.org

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 195 bytes
Desc: OpenPGP digital signature
URL: 

From fusillator at gmail.com  Thu Jun  6 15:19:56 2019
From: fusillator at gmail.com (Luca Cazzaniga)
Date: Thu, 06 Jun 2019 15:19:56 -0000
Subject: [Gluster-users] healing of a volume of type disperse
Message-ID: 

Hi all, I'm pretty new to glusterfs. I managed to set up a dispersed
volume (4+2) following the manual, using release 6.1 from the CentOS
repository. Is it a stable release? Then I forced the volume to stop while
the application was writing on the mount point, deliberately producing an
inconsistent (pending-heal) state, and I'm wondering what the best
practices are for resolving this kind of situation. I found a detailed
explanation of how to resolve split-brain states of replicated volumes at
https://docs.gluster.org/en/latest/Troubleshooting/resolving-splitbrain/
but it does not seem to be applicable to the disperse type. Am I missing
some important piece of documentation? Please point me to some reference.
Here's some command detail:

# gluster volume info elastic-volume

Volume Name: elastic-volume
Type: Disperse
Volume ID: 96773fef-c443-465b-a518-6630bcf83397
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x (4 + 2) = 6
Transport-type: tcp
Bricks:
Brick1: dev-netflow01.fineco.it:/data/gfs/lv_elastic/brick1/brick
Brick2: dev-netflow02.fineco.it:/data/gfs/lv_elastic/brick1/brick
Brick3: dev-netflow03.fineco.it:/data/gfs/lv_elastic/brick1/brick
Brick4: dev-netflow04.fineco.it:/data/gfs/lv_elastic/brick1/brick
Brick5: dev-netflow05.fineco.it:/data/gfs/lv_elastic/brick1/brick
Brick6: dev-netflow06.fineco.it:/data/gfs/lv_elastic/brick1/brick
Options Reconfigured:
performance.io-cache: off
performance.io-thread-count: 64
performance.write-behind-window-size: 100MB
performance.cache-size: 1GB
nfs.disable: on
transport.address-family: inet

# gluster volume heal elastic-volume info

Brick dev01:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
Status: Connected
Number of entries: 12

Brick dev02:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
Status: Connected
Number of entries: 12

Brick dev03:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
Status: Connected
Number of entries: 12

Brick dev04:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
Status: Connected
Number of entries: 12

Brick dev05:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
Status: Connected
Number of entries: 12

Brick dev06:/data/gfs/lv_elastic/brick1/brick
/data/logs/20190606/ns-coreiol-iol-app-chart.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-managers.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-news.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-trkd.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-listini.2019060615.log
/data/logs/20190606/ns-coreiol-iol-app-fns.2019060615.log
/data/logs/20190606/ns-coreiol-iol-lib-httpwrapper.2019060615.log
Status: Connected
Number of entries: 12

# gluster volume heal elastic-volume info split-brain
Volume elastic-volume is not of type replicate

Any advice?

Best regards

Luca

From sudsingh at cs.stonybrook.edu  Fri Jun  7 04:29:51 2019
From: sudsingh at cs.stonybrook.edu (Sudheer Singh)
Date: Fri, 07 Jun 2019 04:29:51 -0000
Subject: [Gluster-users] Fuse vs NFS
Message-ID: 

Hi, I was doing performance testing and found the FUSE mount to be much
slower than the NFS mount. I am curious to know what the community
recommends: mounting volumes via FUSE or NFS?

--
Thanks,
Sudheer
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From peljasz at yahoo.co.uk  Mon Jun 10 11:13:44 2019
From: peljasz at yahoo.co.uk (lejeczek)
Date: Mon, 10 Jun 2019 11:13:44 -0000
Subject: [Gluster-users] add interface(s) for gluster to listen to - how?
Message-ID: <0389a241-aae4-cdee-8097-3feb9054b7ec@yahoo.co.uk>

hi guys,

is it possible to add an interface, either globally or per volume, that
gluster would be available through? And if yes, then how?

many thanks, L.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pEpkey.asc
Type: application/pgp-keys
Size: 1757 bytes
Desc: not available
URL: 

From sdeepugd at gmail.com  Thu Jun 13 13:29:02 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Thu, 13 Jun 2019 13:29:02 -0000
Subject: [Gluster-users] How to resync completely?
Message-ID: 

Hi Guys
I found the quoted info below in the docs, but is there any procedure for
how to do this? The doc does not seem to convey it.

> Synchronization is not complete
>
> *Description*: GlusterFS geo-replication did not synchronize the data
> completely but the geo-replication status displayed is OK.
>
> *Solution*: You can enforce a full sync of the data by erasing the index
> and restarting GlusterFS geo-replication. After restarting, GlusterFS
> geo-replication begins synchronizing all the data. All files are compared
> using checksum, which can be a lengthy and high resource utilization
> operation on large data sets.
>
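For reference, a minimal sketch of how that "erase the index and restart"
step is usually driven from the CLI, assuming a release that supports the
documented reset-sync-time option of the delete command (the volume, user
and host names below are placeholders, not values from this thread):

# gluster volume geo-replication <MASTERVOL> <SLAVEUSER>@<SLAVEHOST>::<SLAVEVOL> stop
# gluster volume geo-replication <MASTERVOL> <SLAVEUSER>@<SLAVEHOST>::<SLAVEVOL> delete reset-sync-time
# gluster volume geo-replication <MASTERVOL> <SLAVEUSER>@<SLAVEHOST>::<SLAVEVOL> create push-pem force
# gluster volume geo-replication <MASTERVOL> <SLAVEUSER>@<SLAVEHOST>::<SLAVEVOL> start

Deleting the session with reset-sync-time clears the stored per-brick sync
times (stime), so the recreated session cannot resume from the changelog
and instead starts over with a full crawl, comparing files by checksum as
described above; expect this first sync to be slow and resource-hungry on
large data sets.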
URL: From sdeepugd at gmail.com Fri Jun 14 06:57:32 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Fri, 14 Jun 2019 06:57:32 -0000 Subject: [Gluster-users] How to resync completely? In-Reply-To: References: Message-ID: Any Updates on this ? On Thu, Jun 13, 2019 at 6:58 PM deepu srinivasan wrote: > Hi Guys > Found the quotes info in the docs. But Is there any procedure as to how to > do this? the doc seems to not convey it. > >> Synchronization is not complete >> >> *Description*: GlusterFS geo-replication did not synchronize the data >> completely but the geo-replication status displayed is OK. >> >> *Solution*: You can enforce a full sync of the data by erasing the index >> and restarting GlusterFS geo-replication. After restarting, GlusterFS >> geo-replication begins synchronizing all the data. All files are compared >> using checksum, which can be a lengthy and high resource utilization >> operation on large data sets. >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From sdeepugd at gmail.com Fri Jun 14 06:57:55 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Fri, 14 Jun 2019 06:57:55 -0000 Subject: [Gluster-users] Geo Replication Stop even after migratingto 5.6 In-Reply-To: References: Message-ID: Hi Any updates on this On Thu, Jun 13, 2019 at 6:59 PM deepu srinivasan wrote: > > > ---------- Forwarded message --------- > From: deepu srinivasan > Date: Thu, Jun 13, 2019 at 5:43 PM > Subject: Geo Replication Stop even after migratingto 5.6 > To: , Kotresh Hiremath Ravishankar < > khiremat at redhat.com>, > > > Hi Guys > Hope you remember the issue I reported for geo replication hang status on > History Crawl. > So you advised me to update the gluster version. previously I was using > 4.1 now I upgraded to 5.6/Still after deleting the previous geo-rep session > and creating a new one the geo-rep session hangs. Is there any other way > that I could solve the issue. > I heard that I could redo the whole geo-replication again. How could I do > that? > Please help. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sdeepugd at gmail.com Fri Jun 14 07:18:36 2019 From: sdeepugd at gmail.com (deepu srinivasan) Date: Fri, 14 Jun 2019 07:18:36 -0000 Subject: [Gluster-users] Geo Replication Stop even after migratingto 5.6 In-Reply-To: References: Message-ID: Hi Guys Yes, I will try the root geo-rep setup and update you back. Meanwhile is there any procedure for the below-quoted info in the docs? > Synchronization is not complete > > *Description*: GlusterFS geo-replication did not synchronize the data > completely but the geo-replication status displayed is OK. > > *Solution*: You can enforce a full sync of the data by erasing the index > and restarting GlusterFS geo-replication. After restarting, GlusterFS > geo-replication begins synchronizing all the data. All files are compared > using checksum, which can be a lengthy and high resource utilization > operation on large data sets. > > On Fri, Jun 14, 2019 at 12:30 PM Kotresh Hiremath Ravishankar < khiremat at redhat.com> wrote: > Could you please try root geo-rep setup and update back? > > On Fri, Jun 14, 2019 at 12:28 PM deepu srinivasan > wrote: > >> Hi Any updates on this >> >> >> On Thu, Jun 13, 2019 at 5:43 PM deepu srinivasan >> wrote: >> >>> Hi Guys >>> Hope you remember the issue I reported for geo replication hang status >>> on History Crawl. >>> So you advised me to update the gluster version. 
previously I was using >>> 4.1 now I upgraded to 5.6/Still after deleting the previous geo-rep session >>> and creating a new one the geo-rep session hangs. Is there any other way >>> that I could solve the issue. >>> I heard that I could redo the whole geo-replication again. How could I >>> do that? >>> Please help. >>> >> > > -- > Thanks and Regards, > Kotresh H R > -------------- next part -------------- An HTML attachment was scrubbed... URL: From david.spisla at iternity.com Mon Jun 17 16:56:59 2019 From: david.spisla at iternity.com (David Spisla) Date: Mon, 17 Jun 2019 16:56:59 -0000 Subject: [Gluster-users] Duplicated brick processes after restart of glusterd In-Reply-To: References: , Message-ID: Thank you for the clarification Regards David Spisla Outlook f?r Android herunterladen ________________________________ David Spisla Software Engineer david.spisla at iternity.com +49 761 59034852 iTernity GmbH Heinrich-von-Stephan-Str. 21 79100 Freiburg Deutschland Website Newsletter Support Portal iTernity GmbH. Gesch?ftsf?hrer: Ralf Steinemann. ?Eingetragen beim Amtsgericht Freiburg: HRB-Nr. 701332. ?USt.Id DE242664311. [v01.023] From: Atin Mukherjee Sent: Friday, June 14, 2019 7:03:05 PM To: David Spisla Cc: gluster-users at gluster.org List Subject: Re: [Gluster-users] Duplicated brick processes after restart of glusterd Please see https://bugzilla.redhat.com/show_bug.cgi?id=1696147 which is fixed in 5.6 . Although a race, I believe you're hitting this. Although the title of the bug reflects it to be shd + brick multiplexing combo, but it's applicable for bricks too. On Fri, Jun 14, 2019 at 2:07 PM David Spisla > wrote: Dear Gluster Community, this morning I had an interesting observation. On my 2 Node Gluster v5.5 System with 3 Replica1 volumes (volume1, volume2, test) I had duplicated brick processes (See output of ps aux in attached file duplicate_bricks.txt) for each of the volumes. Additionally there is a fs-ss volume which I use instead of gluster_shared_storage but this volume was not effected. After doing some research I found a hint in glusterd.log . It seems to be that after a restart glusterd couldn't found the pid files for the freshly created brick processes and create new brick processes. One can see in the brick logs that for all the volumes that two brick processes were created just one after another. Result: Two brick processes for each of the volumes volume1, volume2 and test. "gluster vo status" shows that the pid number was mapped to the wrong port number for hydmedia and impax But beside of that the volume was working correctly. I resolve that issue with a workaround. Kill all brick processes and restart glusterd. After that everything is fine. Is this a bug in glusterd? You can find all relevant informations attached below Regards David Spisla _______________________________________________ Gluster-users mailing list Gluster-users at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image639775.png Type: image/png Size: 382 bytes Desc: image639775.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image717143.png Type: image/png Size: 412 bytes Desc: image717143.png URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: image965886.png Type: image/png Size: 6545 bytes Desc: image965886.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image986774.png Type: image/png Size: 8191 bytes Desc: image986774.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image804683.png Type: image/png Size: 522 bytes Desc: image804683.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image955587.png Type: image/png Size: 591 bytes Desc: image955587.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image082390.png Type: image/png Size: 775 bytes Desc: image082390.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image294497.png Type: image/png Size: 508 bytes Desc: image294497.png URL: From jaco at uls.co.za Tue Jun 18 13:51:09 2019 From: jaco at uls.co.za (Jaco Kroon) Date: Tue, 18 Jun 2019 13:51:09 -0000 Subject: [Gluster-users] lingering xattrop (heal) files Message-ID: <331cff19-5cb8-4efc-fdc1-4e7630a5b26a@uls.co.za> Hi All, We're using "gluster volume heal ${volname} statistics heal-count" to monitor our systems w.r.t. healing not happening. The reason we're using statistics heal-count and not info is because it's extremely fast in comparison with info. After upgrading to glusterfs 6.1 (from 4.1) we noticed that in many cases heal-count would report >0 values, and then upon running info, this just goes away. Upon closer investigation I've noticed that indices/xattrop there are a few gfid linked files which correlates with the counts given by heal-count, for example: # gluster volume heal mail statistics heal-count Gathering count of entries to be healed on volume mail has been successful Brick host_a:/mnt/gluster/mail Number of entries: 0 Brick host_b:/mnt/gluster/mail Number of entries: 3 And then: host_b /mnt/gluster/mail/.glusterfs/indices/xattrop # for i in [a-f0-9]*; do if stat ../../${i:0:2}/${i:2:2}/${i} &>/dev/null; then echo $i exists; else echo $i does not; fi ; done 12427a88-4a42-4cc1-bbd3-13e4cb8d7e6a does not 1a1e0425-acdb-4ed1-9c62-bb866f34b0c7 does not ed2cefe8-3854-49e5-9433-7198f53ffec5 does not Which to me is indicative that upon file removal these xattrop files are left behind. I'm not sure if this is by design, or a bug, or more likely due to a misunderstanding of how these actually function. Since gluster volume heal ... info can potentially take a long time under the kind of conditions that we're mindful of we'd prefer to use heal-count so that we can receive our alerts in a more timely manner. Kind Regards, Jaco From matthewb at uvic.ca Wed Jun 19 17:20:59 2019 From: matthewb at uvic.ca (Matthew Benstead) Date: Wed, 19 Jun 2019 17:20:59 -0000 Subject: [Gluster-users] GeoReplication Error - Changelog register failed error - Is a directory Message-ID: Hello - I am having a problem with geo-replication on gluster that I hope someone can help me with. I have a 7-server distribute cluster as the primary volume, and a 2 server distribute cluster as the secondary volume. Both are running the same version of gluster on CentOS 7: glusterfs-5.3-2.el7.x86_64 I was able to setup the replication keys, user, groups, etc and establish the session, but it goes faulty quickly after initializing. 
From matthewb at uvic.ca  Wed Jun 19 17:20:59 2019
From: matthewb at uvic.ca (Matthew Benstead)
Date: Wed, 19 Jun 2019 17:20:59 -0000
Subject: [Gluster-users] GeoReplication Error - Changelog register failed error - Is a directory
Message-ID:

Hello -

I am having a problem with geo-replication on gluster that I hope someone
can help me with. I have a 7-server distribute cluster as the primary
volume, and a 2-server distribute cluster as the secondary volume. Both
are running the same version of gluster on CentOS 7:
glusterfs-5.3-2.el7.x86_64

I was able to set up the replication keys, user, groups, etc. and
establish the session, but it goes faulty quickly after initializing.

I ran into the missing libgfchangelog.so error and fixed it with a symlink:

[root at pcic-backup01 ~]# ln -s /usr/lib64/libgfchangelog.so.0 /usr/lib64/libgfchangelog.so
[root at pcic-backup01 ~]# ls -lh /usr/lib64/libgfchangelog.so*
lrwxrwxrwx. 1 root root  30 May 16 13:16 /usr/lib64/libgfchangelog.so -> /usr/lib64/libgfchangelog.so.0
lrwxrwxrwx. 1 root root  23 May 16 08:58 /usr/lib64/libgfchangelog.so.0 -> libgfchangelog.so.0.0.1
-rwxr-xr-x. 1 root root 62K Feb 25 04:02 /usr/lib64/libgfchangelog.so.0.0.1

But right now, when trying to start replication it goes faulty:

[root at gluster01 ~]# gluster volume geo-replication storage geoaccount at 10.0.231.81::pcic-backup start
Starting geo-replication session between storage & geoaccount at 10.0.231.81::pcic-backup has been successful

[root at gluster01 ~]# gluster volume geo-replication status

MASTER NODE    MASTER VOL    MASTER BRICK                  SLAVE USER    SLAVE                                        SLAVE NODE    STATUS    CRAWL STATUS    LAST_SYNCED
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------
10.0.231.50    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.54    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.56    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.55    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.53    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.51    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A
10.0.231.52    storage       /mnt/raid6-storage/storage    geoaccount    ssh://geoaccount at 10.0.231.81::pcic-backup    N/A           Faulty    N/A             N/A

[root at gluster01 ~]# gluster volume geo-replication storage geoaccount at 10.0.231.81::pcic-backup stop
Stopping geo-replication session between storage & geoaccount at 10.0.231.81::pcic-backup has been successful

And the log file /var/log/glusterfs/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.log contains the error:

GLUSTER: Changelog register failed error=[Errno 21] Is a directory

[root at gluster01 ~]# cat /var/log/glusterfs/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.log
[2019-05-23 17:07:23.500781] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:23.629298] I [gsyncd(status):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:31.354005] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:31.483582] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:31.863888] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:31.994895] I [gsyncd(monitor):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:33.133888] I [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status Change status=Initializing...
[2019-05-23 17:07:33.134301] I [monitor(monitor):157:monitor] Monitor: starting gsyncd worker brick=/mnt/raid6-storage/storage slave_node=10.0.231.81
[2019-05-23 17:07:33.214462] I [gsyncd(agent /mnt/raid6-storage/storage):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:33.216737] I [changelogagent(agent /mnt/raid6-storage/storage):72:__init__] ChangelogAgent: Agent listining...
[2019-05-23 17:07:33.228072] I [gsyncd(worker /mnt/raid6-storage/storage):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:33.247236] I [resource(worker /mnt/raid6-storage/storage):1366:connect_remote] SSH: Initializing SSH connection between master and slave...
[2019-05-23 17:07:34.948796] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:35.73339] I [gsyncd(status):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:35.232405] I [resource(worker /mnt/raid6-storage/storage):1413:connect_remote] SSH: SSH connection between master and slave established. duration=1.9849
[2019-05-23 17:07:35.232748] I [resource(worker /mnt/raid6-storage/storage):1085:connect] GLUSTER: Mounting gluster volume locally...
[2019-05-23 17:07:36.359250] I [resource(worker /mnt/raid6-storage/storage):1108:connect] GLUSTER: Mounted gluster volume duration=1.1262
[2019-05-23 17:07:36.359639] I [subcmds(worker /mnt/raid6-storage/storage):80:subcmd_worker] : Worker spawn successful. Acknowledging back to monitor
[2019-05-23 17:07:36.380975] E [repce(agent /mnt/raid6-storage/storage):122:worker] : call failed:
Traceback (most recent call last):
  File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 118, in worker
    res = getattr(self.obj, rmeth)(*in_data[2:])
  File "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", line 40, in register
    return Changes.cl_register(cl_brick, cl_dir, cl_log, cl_level, retries)
  File "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line 45, in cl_register
    cls.raise_changelog_err()
  File "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line 29, in raise_changelog_err
    raise ChangelogException(errn, os.strerror(errn))
ChangelogException: [Errno 21] Is a directory
[2019-05-23 17:07:36.382556] E [repce(worker /mnt/raid6-storage/storage):214:__call__] RepceClient: call failed call=27412:140659114579776:1558631256.38 method=register error=ChangelogException
[2019-05-23 17:07:36.382833] E [resource(worker /mnt/raid6-storage/storage):1266:service_loop] GLUSTER: Changelog register failed error=[Errno 21] Is a directory
[2019-05-23 17:07:36.404313] I [repce(agent /mnt/raid6-storage/storage):97:service_loop] RepceServer: terminating on reaching EOF.
[2019-05-23 17:07:37.361396] I [monitor(monitor):278:monitor] Monitor: worker died in startup phase brick=/mnt/raid6-storage/storage
[2019-05-23 17:07:37.370690] I [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker Status Change status=Faulty
[2019-05-23 17:07:41.526408] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:41.643923] I [gsyncd(status):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:45.722193] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:45.817210] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:46.188499] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:46.258817] I [gsyncd(config-get):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:47.350276] I [gsyncd(monitor-status):308:main] : Using session config file path=/var/lib/glusterd/geo-replication/storage_10.0.231.81_pcic-backup/gsyncd.conf
[2019-05-23 17:07:47.364751] I [subcmds(monitor-status):29:subcmd_monitor_status] : Monitor Status Change status=Stopped

I'm not really sure where to go from here...

[root at gluster01 ~]# gluster volume geo-replication storage geoaccount at 10.0.231.81::pcic-backup config | grep -i changelog
change_detector:changelog
changelog_archive_format:%Y%m
changelog_batch_size:727040
changelog_log_file:/var/log/glusterfs/geo-replication/storage_10.0.231.81_pcic-backup/changes-${local_id}.log
changelog_log_level:INFO

Thanks,
-Matthew
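The failing register call can be reproduced in isolation through the same
module the traceback goes through. A rough sketch, using only names taken
from the traceback above; the scratch directory, log file, log level and
retry count are made-up placeholders, not a supported debugging interface:

# Run on a master node carrying the brick from the status output.
python - <<'EOF'
import sys
# syncdaemon path as shown in the traceback above.
sys.path.insert(0, "/usr/libexec/glusterfs/python/syncdaemon")
from libgfchangelog import Changes

brick = "/mnt/raid6-storage/storage"  # brick from the status output
scratch = "/tmp/changelog-scratch"    # hypothetical scratch directory
log = "/tmp/changelog-repro.log"      # hypothetical log file

# Argument order mirrors cl_register(cl_brick, cl_dir, cl_log, cl_level,
# retries) from the traceback; the numeric log level 7 and 5 retries are
# assumptions. If the underlying problem persists, this should raise
# ChangelogException: [Errno 21] Is a directory.
Changes.cl_register(brick, scratch, log, 7, 5)
print("register succeeded")
EOF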
From sdeepugd at gmail.com  Mon Jun 24 07:06:12 2019
From: sdeepugd at gmail.com (deepu srinivasan)
Date: Mon, 24 Jun 2019 07:06:12 -0000
Subject: [Gluster-users] Any other method for full resync
Message-ID:

Hi Guys

We deleted the geo-replication session with "reset-sync-time" and ran the
sync completely from the beginning, and the sync was successful (hope you
remember that we got stuck in the geo-replication session's history crawl
and did not get past it). It took a long time to sync the data. Is there
any other way to completely sync from the beginning that takes much less
time than this?
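For reference, the delete/recreate cycle being described looks roughly
like this, with placeholder names (this is the slow full-resync path the
question hopes to avoid):

gluster volume geo-replication <master-vol> geoaccount@<slave-host>::<slave-vol> stop
# "delete reset-sync-time" discards the synced-time markers, so the next
# start crawls and re-syncs everything from the beginning:
gluster volume geo-replication <master-vol> geoaccount@<slave-host>::<slave-vol> delete reset-sync-time
gluster volume geo-replication <master-vol> geoaccount@<slave-host>::<slave-vol> create push-pem
gluster volume geo-replication <master-vol> geoaccount@<slave-host>::<slave-vol> start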
From mailinglists at lucassen.org  Tue Jun 25 09:17:35 2019
From: mailinglists at lucassen.org (richard lucassen)
Date: Tue, 25 Jun 2019 09:17:35 -0000
Subject: [Gluster-users] very poor performance on Debian Buster
Message-ID: <20190625110732.7c6decfd73a00577a21040e5@lucassen.org>

I run a glusterfs server on a sys-V version of Debian Buster. The machine
is an 8-core/256GB/SSD server and I want to copy 400GB to a mounted
gluster device.

The copy now runs for more than a day and it has only copied 79GB. The
network activity is around 4 to 8 Mbit/s.

Is this a known issue of version 5.5-3? I did not touch the defaults.

R.

--
richard lucassen
http://contact.xaq.nl/
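One quick way to tell raw streaming throughput apart from per-file
overhead on such a mount (paths are placeholders; many small files are
typically where FUSE round-trips dominate, and would explain a copy
crawling at a few Mbit/s):

# Large sequential write to the gluster mount:
dd if=/dev/zero of=/mnt/gluster/ddtest bs=1M count=1024 conv=fsync

# Many small files, for comparison:
time cp -r /usr/share/doc /mnt/gluster/doc-test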