From peljasz at yahoo.co.uk Wed Jul 1 14:46:11 2020 From: peljasz at yahoo.co.uk (lejeczek) Date: Wed, 1 Jul 2020 15:46:11 +0100 Subject: [Gluster-users] volume process does not start - glusterfs is happy with it? In-Reply-To: References: Message-ID: <13c2171b-ea34-1e8b-c060-7ebed1e016ee@yahoo.co.uk> On 30/06/2020 11:31, Barak Sason Rofman wrote: > Greetings, > > I'm not sure if that's directly related to your problem, > but on a general level, AFAIK, replica-2 vols are not > recommended due to split brain possibility: > https://docs.gluster.org/en/latest/Administrator%20Guide/Split%20brain%20and%20ways%20to%20deal%20with%20it/ > > It's recommended to either use replica-3 or arbiter Arbiter. > > Regards, > > On Tue, Jun 30, 2020 at 1:14 PM lejeczek > > wrote: > > Hi everybody. > > I have two peers in the cluster and a 2-replica volume > which seems okey if it was not for one weird bit - > when a peer reboots then on that peer after a reboot I > see: > > $ gluster volume status USERs > Status of volume: USERs > Gluster process???????????????????????????? TCP Port? > RDMA Port? Online? Pid > ------------------------------------------------------------------------------ > Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U > SERs??????????????????????????????????????? N/A?????? > N/A??????? N?????? N/A? > Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- > USERs?????????????????????????????????????? 49152???? > 0????????? Y?????? 57338 > Self-heal Daemon on localhost?????????????? N/A?????? > N/A??????? Y?????? 4302 > Self-heal Daemon on dzien.direct??????????? N/A?????? > N/A??????? Y?????? 57359 > ? > Task Status of Volume USERs > ------------------------------------------------------------------------------ > There are no active volume tasks > > I do not suppose it's expected. > On such rebooted node I see: > $ systemctl status -l glusterd > ? glusterd.service - GlusterFS, a clustered > file-system server > ?? Loaded: loaded > (/usr/lib/systemd/system/glusterd.service; enabled; > vendor preset: enabled) > ? Drop-In: /etc/systemd/system/glusterd.service.d > ?????????? ??override.conf > ?? Active: active (running) since Mon 2020-06-29 > 21:37:36 BST; 13h ago > ???? Docs: man:glusterd(8) > ? Process: 4071 ExecStart=/usr/sbin/glusterd -p > /var/run/glusterd.pid --log-level $LOG_LEVEL > $GLUSTERD_OPTIONS (code=exited, status> > ?Main PID: 4086 (glusterd) > ??? Tasks: 20 (limit: 101792) > ?? Memory: 28.9M > ?? CGroup: /system.slice/glusterd.service > ?????????? ??4086 /usr/sbin/glusterd -p > /var/run/glusterd.pid --log-level INFO > ?????????? ??4302 /usr/sbin/glusterfs -s localhost > --volfile-id shd/USERs -p > /var/run/gluster/shd/USERs/USERs-shd.pid -l /var/log/g> > > Jun 29 21:37:36 swir.private.pawel systemd[1]: > Starting GlusterFS, a clustered file-system server... > Jun 29 21:37:36 swir.private.pawel systemd[1]: Started > GlusterFS, a clustered file-system server. > > And I do not see any other apparent problems nor errors. > On that node I manually: > $ systemctl restart glusterd.service > and... > > $ gluster volume status USERs > Status of volume: USERs > Gluster process???????????????????????????? TCP Port? > RDMA Port? Online? Pid > ------------------------------------------------------------------------------ > Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U > SERs??????????????????????????????????????? 49152???? > 0????????? Y?????? 103225 > Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- > USERs?????????????????????????????????????? 49152???? > 0????????? Y?????? 
57338 > Self-heal Daemon on localhost?????????????? N/A?????? > N/A??????? Y?????? 103270 > Self-heal Daemon on dzien.direct??????????? N/A?????? > N/A??????? Y?????? 57359 > > Is not a puzzle??? I'm on glusterfs-7.6-1.el8.x86_64 > I hope somebody can share some thoughts. > many thanks, L. > That cannot be it!? If the root cause of this problem is 2-replica volume then it would be a massive cock-up! Then 2-volume replica should be banned and forbidden. I hope some can suggest a way to troubleshoot it. ps. we all, I presume all, know problems of 2-replica volumes. many thanks, L. > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > *Barak Sason Rofman* > > Gluster Storage?Development > > Red Hat?Israel > > 34 Jerusalem rd. Ra'anana, 43501 > > bsasonro at redhat.com ? > ??T:?_+972-9-7692304_ > M:?_+972-52-4326355_ > > @RedHat ???Red Hat > ??Red Hat > > > From felix.koelzow at gmx.de Wed Jul 1 17:57:22 2020 From: felix.koelzow at gmx.de (=?UTF-8?Q?Felix_K=c3=b6lzow?=) Date: Wed, 1 Jul 2020 19:57:22 +0200 Subject: [Gluster-users] volume process does not start - glusterfs is happy with it? In-Reply-To: <13c2171b-ea34-1e8b-c060-7ebed1e016ee@yahoo.co.uk> References: <13c2171b-ea34-1e8b-c060-7ebed1e016ee@yahoo.co.uk> Message-ID: <05676387-1bf2-b4c9-bf60-0106c94be047@gmx.de> Hey, what about the device mapper? Everything was mount properly during reboot? It happens to me if the lvm device mapper got a timeout during the reboot process while mounting the brick itself. Regards, Felix On 01/07/2020 16:46, lejeczek wrote: > > On 30/06/2020 11:31, Barak Sason Rofman wrote: >> Greetings, >> >> I'm not sure if that's directly related to your problem, >> but on a general level, AFAIK, replica-2 vols are not >> recommended due to split brain possibility: >> https://docs.gluster.org/en/latest/Administrator%20Guide/Split%20brain%20and%20ways%20to%20deal%20with%20it/ >> >> It's recommended to either use replica-3 or arbiter Arbiter. >> >> Regards, >> >> On Tue, Jun 30, 2020 at 1:14 PM lejeczek >> > wrote: >> >> Hi everybody. >> >> I have two peers in the cluster and a 2-replica volume >> which seems okey if it was not for one weird bit - >> when a peer reboots then on that peer after a reboot I >> see: >> >> $ gluster volume status USERs >> Status of volume: USERs >> Gluster process???????????????????????????? TCP Port >> RDMA Port? Online? Pid >> ------------------------------------------------------------------------------ >> Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U >> SERs??????????????????????????????????????? N/A >> N/A??????? N?????? N/A >> Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- >> USERs?????????????????????????????????????? 49152 >> 0????????? Y?????? 57338 >> Self-heal Daemon on localhost?????????????? N/A >> N/A??????? Y?????? 4302 >> Self-heal Daemon on dzien.direct??????????? N/A >> N/A??????? Y?????? 57359 >> >> Task Status of Volume USERs >> ------------------------------------------------------------------------------ >> There are no active volume tasks >> >> I do not suppose it's expected. >> On such rebooted node I see: >> $ systemctl status -l glusterd >> ? glusterd.service - GlusterFS, a clustered >> file-system server >> ?? Loaded: loaded >> (/usr/lib/systemd/system/glusterd.service; enabled; >> vendor preset: enabled) >> ? 
Drop-In: /etc/systemd/system/glusterd.service.d >> ?????????? ??override.conf >> ?? Active: active (running) since Mon 2020-06-29 >> 21:37:36 BST; 13h ago >> ???? Docs: man:glusterd(8) >> ? Process: 4071 ExecStart=/usr/sbin/glusterd -p >> /var/run/glusterd.pid --log-level $LOG_LEVEL >> $GLUSTERD_OPTIONS (code=exited, status> >> ?Main PID: 4086 (glusterd) >> ??? Tasks: 20 (limit: 101792) >> ?? Memory: 28.9M >> ?? CGroup: /system.slice/glusterd.service >> ?????????? ??4086 /usr/sbin/glusterd -p >> /var/run/glusterd.pid --log-level INFO >> ?????????? ??4302 /usr/sbin/glusterfs -s localhost >> --volfile-id shd/USERs -p >> /var/run/gluster/shd/USERs/USERs-shd.pid -l /var/log/g> >> >> Jun 29 21:37:36 swir.private.pawel systemd[1]: >> Starting GlusterFS, a clustered file-system server... >> Jun 29 21:37:36 swir.private.pawel systemd[1]: Started >> GlusterFS, a clustered file-system server. >> >> And I do not see any other apparent problems nor errors. >> On that node I manually: >> $ systemctl restart glusterd.service >> and... >> >> $ gluster volume status USERs >> Status of volume: USERs >> Gluster process???????????????????????????? TCP Port >> RDMA Port? Online? Pid >> ------------------------------------------------------------------------------ >> Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U >> SERs??????????????????????????????????????? 49152 >> 0????????? Y?????? 103225 >> Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- >> USERs?????????????????????????????????????? 49152 >> 0????????? Y?????? 57338 >> Self-heal Daemon on localhost?????????????? N/A >> N/A??????? Y?????? 103270 >> Self-heal Daemon on dzien.direct??????????? N/A >> N/A??????? Y?????? 57359 >> >> Is not a puzzle??? I'm on glusterfs-7.6-1.el8.x86_64 >> I hope somebody can share some thoughts. >> many thanks, L. >> > That cannot be it!? If the root cause of this problem is > 2-replica volume then it would be a massive cock-up! Then > 2-volume replica should be banned and forbidden. > > I hope some can suggest a way to troubleshoot it. > > ps. we all, I presume all, know problems of 2-replica volumes. > > many thanks, L. > > >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> >> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> >> -- >> *Barak Sason Rofman* >> >> Gluster Storage?Development >> >> Red Hat?Israel >> >> 34 Jerusalem rd. Ra'anana, 43501 >> >> bsasonro at redhat.com >> ??T:?_+972-9-7692304_ >> M:?_+972-52-4326355_ >> >> @RedHat ???Red Hat >> ??Red Hat >> >> >> > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From hunter86_bg at yahoo.com Wed Jul 1 18:33:27 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Wed, 01 Jul 2020 21:33:27 +0300 Subject: [Gluster-users] volume process does not start - glusterfs is happy with it? In-Reply-To: <05676387-1bf2-b4c9-bf60-0106c94be047@gmx.de> References: <13c2171b-ea34-1e8b-c060-7ebed1e016ee@yahoo.co.uk> <05676387-1bf2-b4c9-bf60-0106c94be047@gmx.de> Message-ID: Sometimes the brick comes up slower than glusterd service (which starts the brick processes). 
The problem is that if you leave glusterd to depend on both bricks and a brick fails (for example FS problem) then the other brick will not come up too. After a system crash, the VDO service replay log was taking too much and the glusterd failed (as bricks were not ready yet), so I just created an override like this one: # /etc/systemd/system/glusterd.service.d/01-dependencies.conf [Unit] [root at ovirt1 ~]# cat /etc/systemd/system/glusterd.service.d/01-dependencies.conf [Unit] Description=GlusterFS, a clustered file-system server After=network.target rpcbind.service gluster_bricks-engine.mount gluster_bricks-data.mount gluster_bricks-fast1.mount gluster_bricks-fast2.mount gluster_bricks-fast3.mount gluster_bricks-fast4.mount Before=network-online.target I have created systemd mount units, due to VDO , but most probably the local-fs.target will generate the mount units for you from the fstab. Best Regards, Strahil Nikolov ?? 1 ??? 2020 ?. 20:57:22 GMT+03:00, "Felix K?lzow" ??????: >Hey, > > >what about the device mapper? Everything was mount properly during >reboot? > >It happens to me if the lvm device mapper got a timeout during the >reboot > >process while mounting the brick itself. > > >Regards, > >Felix > >On 01/07/2020 16:46, lejeczek wrote: >> >> On 30/06/2020 11:31, Barak Sason Rofman wrote: >>> Greetings, >>> >>> I'm not sure if that's directly related to your problem, >>> but on a general level, AFAIK, replica-2 vols are not >>> recommended due to split brain possibility: >>> >https://docs.gluster.org/en/latest/Administrator%20Guide/Split%20brain%20and%20ways%20to%20deal%20with%20it/ >>> >>> It's recommended to either use replica-3 or arbiter Arbiter. >>> >>> Regards, >>> >>> On Tue, Jun 30, 2020 at 1:14 PM lejeczek >>> > wrote: >>> >>> Hi everybody. >>> >>> I have two peers in the cluster and a 2-replica volume >>> which seems okey if it was not for one weird bit - >>> when a peer reboots then on that peer after a reboot I >>> see: >>> >>> $ gluster volume status USERs >>> Status of volume: USERs >>> Gluster process???????????????????????????? TCP Port >>> RDMA Port? Online? Pid >>> >------------------------------------------------------------------------------ >>> Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U >>> SERs??????????????????????????????????????? N/A >>> N/A??????? N?????? N/A >>> Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- >>> USERs?????????????????????????????????????? 49152 >>> 0????????? Y?????? 57338 >>> Self-heal Daemon on localhost?????????????? N/A >>> N/A??????? Y?????? 4302 >>> Self-heal Daemon on dzien.direct??????????? N/A >>> N/A??????? Y?????? 57359 >>> >>> Task Status of Volume USERs >>> >------------------------------------------------------------------------------ >>> There are no active volume tasks >>> >>> I do not suppose it's expected. >>> On such rebooted node I see: >>> $ systemctl status -l glusterd >>> ? glusterd.service - GlusterFS, a clustered >>> file-system server >>> ?? Loaded: loaded >>> (/usr/lib/systemd/system/glusterd.service; enabled; >>> vendor preset: enabled) >>> ? Drop-In: /etc/systemd/system/glusterd.service.d >>> ?????????? ??override.conf >>> ?? Active: active (running) since Mon 2020-06-29 >>> 21:37:36 BST; 13h ago >>> ???? Docs: man:glusterd(8) >>> ? Process: 4071 ExecStart=/usr/sbin/glusterd -p >>> /var/run/glusterd.pid --log-level $LOG_LEVEL >>> $GLUSTERD_OPTIONS (code=exited, status> >>> ?Main PID: 4086 (glusterd) >>> ??? Tasks: 20 (limit: 101792) >>> ?? Memory: 28.9M >>> ?? 
CGroup: /system.slice/glusterd.service >>> ?????????? ??4086 /usr/sbin/glusterd -p >>> /var/run/glusterd.pid --log-level INFO >>> ?????????? ??4302 /usr/sbin/glusterfs -s localhost >>> --volfile-id shd/USERs -p >>> /var/run/gluster/shd/USERs/USERs-shd.pid -l /var/log/g> >>> >>> Jun 29 21:37:36 swir.private.pawel systemd[1]: >>> Starting GlusterFS, a clustered file-system server... >>> Jun 29 21:37:36 swir.private.pawel systemd[1]: Started >>> GlusterFS, a clustered file-system server. >>> >>> And I do not see any other apparent problems nor errors. >>> On that node I manually: >>> $ systemctl restart glusterd.service >>> and... >>> >>> $ gluster volume status USERs >>> Status of volume: USERs >>> Gluster process???????????????????????????? TCP Port >>> RDMA Port? Online? Pid >>> >------------------------------------------------------------------------------ >>> Brick swir.direct:/00.STORAGE/2/0-GLUSTER-U >>> SERs??????????????????????????????????????? 49152 >>> 0????????? Y?????? 103225 >>> Brick dzien.direct:/00.STORAGE/2/0-GLUSTER- >>> USERs?????????????????????????????????????? 49152 >>> 0????????? Y?????? 57338 >>> Self-heal Daemon on localhost?????????????? N/A >>> N/A??????? Y?????? 103270 >>> Self-heal Daemon on dzien.direct??????????? N/A >>> N/A??????? Y?????? 57359 >>> >>> Is not a puzzle??? I'm on glusterfs-7.6-1.el8.x86_64 >>> I hope somebody can share some thoughts. >>> many thanks, L. >>> >> That cannot be it!? If the root cause of this problem is >> 2-replica volume then it would be a massive cock-up! Then >> 2-volume replica should be banned and forbidden. >> >> I hope some can suggest a way to troubleshoot it. >> >> ps. we all, I presume all, know problems of 2-replica volumes. >> >> many thanks, L. >> >> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >>> -- >>> *Barak Sason Rofman* >>> >>> Gluster Storage?Development >>> >>> Red Hat?Israel >>> >>> 34 Jerusalem rd. Ra'anana, 43501 >>> >>> bsasonro at redhat.com >>> ??T:?_+972-9-7692304_ >>> M:?_+972-52-4326355_ >>> >>> @RedHat ???Red Hat >>> ??Red Hat >>> >>> >>> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >________ > > > >Community Meeting Calendar: > >Schedule - >Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >Bridge: https://bluejeans.com/441850968 > >Gluster-users mailing list >Gluster-users at gluster.org >https://lists.gluster.org/mailman/listinfo/gluster-users From peljasz at yahoo.co.uk Thu Jul 2 13:32:21 2020 From: peljasz at yahoo.co.uk (lejeczek) Date: Thu, 2 Jul 2020 14:32:21 +0100 Subject: [Gluster-users] gluster cmd output - how to format References: <87c430eb-56b6-57d2-ef2a-f974b69cc5fe.ref@yahoo.co.uk> Message-ID: <87c430eb-56b6-57d2-ef2a-f974b69cc5fe@yahoo.co.uk> hi guys Would you know if it's possible to format gluster cmd output? What frustrates me personally is "forced" line wrapping, as example of: $ gluster volume status many thanks, L. 
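The plain table from `gluster volume status` uses fixed column widths, so long brick paths will always wrap; there is, however, an XML output mode that avoids the forced wrapping entirely. A minimal sketch, assuming a gluster CLI build with the --xml flag (the 7.x series in this thread has it) and xmllint installed; the USERs volume name is taken from the output quoted above:

$ gluster volume status USERs --xml | xmllint --format -
# quick plain-text view of the interesting fields, one tag per line
$ gluster volume status USERs --xml | xmllint --format - | grep -E '<(hostname|path|status|port|pid)>'

The XML carries one <node> element per brick and per self-heal daemon, so brick paths are never split across rows the way the fixed-width table splits them, and the output is also easier to feed into scripts.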
From evilmf at gmail.com Thu Jul 2 13:33:51 2020 From: evilmf at gmail.com (Marco Fais) Date: Thu, 2 Jul 2020 14:33:51 +0100 Subject: [Gluster-users] Problems with qemu and disperse volumes (live merge) In-Reply-To: <93D3EE3B-B5B3-4689-BF66-C1442A03971E@yahoo.com> References: <93D3EE3B-B5B3-4689-BF66-C1442A03971E@yahoo.com> Message-ID: Hi Strahil, WARNING: As you enabled sharding - NEVER DISABLE SHARDING, EVER ! > Thanks -- good to be reminded :) > >When you say they will not be optimal are you referring mainly to > >performance considerations? We did plenty of testing, and in terms of > >performance didn't have issues even with I/O intensive workloads (using > >SSDs, I had issues with spinning disks). > > Yes, the client side has to connect to 6 bricks (4+2) at a time and > calculate the data in order to obtain the necessary information.Same is > valid for writing. > If you need to conserve space, you can test VDO without compression (of > even with it). > Understood -- will explore VDO. Storage usage efficiency is less important than fault tolerance or performance for us -- disperse volumes seemed to tick all the boxes so we looked at them primarily. But clearly I had missed that they are not used as mainstream VM storage for oVirt (I did know they weren't supported, but as explained thought was more on the management side). > > Also with replica volumes, you can use 'choose-local' /in case you > have faster than the network storage (like NVMe)/ and increase the read > speed. Of course this feature is useful for Hyperconverged setup (gluster > + ovirt on the same node). > Will explore this option as well, thanks for the suggestion. > If you were using ovirt 4.3 , I would recommend you to focus on > gluster. Yet, you use oVirt 4.4 which is quite newer and it needs some > polishing. > Ovirt 4.3.9 (using the older Centos 7 qemu/libvirt) unfortunately had similar issues with the disperse volumes. Not sure if exactly the same, as never looked deeper into it, but the results were similar. Ovirt 4.4.0 has some issues with snapshot deletion that are independent from Gluster (I have raised the issue here, https://bugzilla.redhat.com/show_bug.cgi?id=1840414, should be sorted with 4.4.2 I guess), so at the moment it only works with the "testing" AV repo. > Check ovirt engine logs (on the HostedEngine VM or your standalone > engine) , vdsm logs on the host that was running the VM and next - check > the brick logs. > Will do. Thanks, Marco -------------- next part -------------- An HTML attachment was scrubbed... URL: From shreyansh.shah at alpha-grep.com Thu Jul 2 14:39:25 2020 From: shreyansh.shah at alpha-grep.com (Shreyansh Shah) Date: Thu, 2 Jul 2020 20:09:25 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance Message-ID: Hi All, *We are facing "Mismatching layouts for ,gfid = " errors.* We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to the existing setup. Post that we did a rebalance fix-layout and then a rebalance (which is currently still in progress). The status shows "failed" on certain bricks but "in progress" for others. Adding output for gluster rebalance status below. The glusterfs client logs are flooded with "Mismatching layouts for ,gfid = " The performance too seems to have degraded due to this, even basic commands like `cd` and `ls` are taking more than a minute compared to sub-second number before brick addition. 
Apart from that we also experienced many binaries and files giving error stale file handle error even though the files were present. *gluster rebalance status :* Node Rebalanced-files size scanned failures skipped status run time in h:m:s --------- ----------- ----------- ----------- ----------- ----------- ------------ -------------- localhost 176 3.5GB 12790 0 8552 in progress 21:36:01 10.132.0.72 8232 394.8GB 19995 21 26 failed 14:50:30 10.132.0.44 12625 1.0TB 50023 1 10202 in progress 21:36:00 10.132.0.3 21982 956.8GB 79145 1 34571 in progress 21:36:00 10.132.0.9 7975 355.8GB 20157 6 1522 failed 14:51:45 10.132.0.73 6293 394.5GB 26414 151 8085 failed 14:51:45 10.132.0.70 6480 477.1GB 21058 27 1787 failed 14:50:32 Estimated time left for rebalance to complete : 130:56:28 *Logs from one of the clients below:* [2020-07-02 12:30:14.971916] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk layout - 2761060380 - 3067813815 - 4159036738 [2020-07-02 12:30:14.971935] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc [2020-07-02 12:30:15.032013] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk layout - 3681390552 - 3988143987 - 4159036738 [2020-07-02 12:30:15.032059] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc [2020-07-02 12:30:15.032107] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 [2020-07-02 12:30:15.032153] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc [2020-07-02 12:30:15.093329] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk layout - 2454306944 - 2761060379 - 4159036738 [2020-07-02 12:30:15.093373] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 [2020-07-02 12:30:15.093460] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk layout - 2761060380 - 3067813815 - 4159036738 [2020-07-02 12:30:15.093515] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 [2020-07-02 12:30:15.151063] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk layout - 3681390552 - 3988143987 - 4159036738 [2020-07-02 12:30:15.151108] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 [2020-07-02 12:30:15.151149] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk layout - 3374637116 - 
3681390551 - 4159036738 [2020-07-02 12:30:15.151162] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 [2020-07-02 12:30:15.424321] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk layout - 920400036 - 1227153471 - 4159036738 [2020-07-02 12:30:15.424380] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 [2020-07-02 12:30:15.424456] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk layout - 1840730208 - 2147483643 - 4159036738 [2020-07-02 12:30:15.424484] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 [2020-07-02 12:30:15.424525] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk layout - 1533976772 - 1840730207 - 4159036738 [2020-07-02 12:30:15.424542] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 [2020-07-02 12:30:15.424596] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk layout - 613646600 - 920400035 - 4159036738 [2020-07-02 12:30:15.424607] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 [2020-07-02 12:30:16.004482] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on data-client-7 (hashed subvol is data-client-17) [2020-07-02 12:30:16.005523] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_DATA.dat [2020-07-02 12:30:16.531047] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat on data-client-9 (hashed subvol is data-client-19) [2020-07-02 12:30:16.532086] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_METADATA.dat [2020-07-02 12:30:18.733229] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on data-client-17 (hashed subvol is data-client-9) [2020-07-02 12:30:18.734421] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_DATA.dat [2020-07-02 12:30:19.171930] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat on data-client-9 (hashed subvol is data-client-18) [2020-07-02 12:30:19.172901] I [MSGID: 109069] 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_METADATA.dat [2020-07-02 12:30:21.028495] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on data-client-6 (hashed subvol is data-client-15) [2020-07-02 12:30:21.029836] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_DATA.dat [2020-07-02 12:30:21.127648] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat on data-client-11 (hashed subvol is data-client-3) [2020-07-02 12:30:21.128713] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_METADATA.dat [2020-07-02 12:30:21.201126] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on data-client-15 (hashed subvol is data-client-7) [2020-07-02 12:30:21.201928] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_DATA.dat [2020-07-02 12:30:21.566158] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat on data-client-7 (hashed subvol is data-client-16) [2020-07-02 12:30:21.567123] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_METADATA.dat [2020-07-02 12:30:21.649357] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on data-client-2 (hashed subvol is data-client-11) [2020-07-02 12:30:21.661381] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_DATA.dat [2020-07-02 12:30:21.748937] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat on data-client-15 (hashed subvol is data-client-7) [2020-07-02 12:30:21.749481] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_METADATA.dat [2020-07-02 12:30:21.898593] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on data-client-14 (hashed subvol is data-client-7) [2020-07-02 12:30:21.899442] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_DATA.dat [2020-07-02 12:30:22.039337] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat on data-client-10 (hashed subvol is data-client-2) [2020-07-02 
12:30:22.040086] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_METADATA.dat [2020-07-02 12:30:22.501877] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat on data-client-15 (hashed subvol is data-client-8) [2020-07-02 12:30:22.502712] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_DATA.dat [2020-07-02 12:30:22.782577] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 (hashed subvol is data-client-6) [2020-07-02 12:30:22.783777] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_METADATA.dat [2020-07-02 12:30:23.146847] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on data-client-17 (hashed subvol is data-client-9) [2020-07-02 12:30:23.148009] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_DATA.dat [2020-07-02 12:30:23.229290] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed subvol is data-client-6) [2020-07-02 12:30:23.230151] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_METADATA.dat [2020-07-02 12:30:23.889520] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on data-client-2 (hashed subvol is data-client-11) [2020-07-02 12:30:23.896618] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_DATA.dat [2020-07-02 12:30:24.093017] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed subvol is data-client-15) [2020-07-02 12:30:24.094117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_METADATA.dat [2020-07-02 12:30:24.345257] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on data-client-17 (hashed subvol is data-client-10) [2020-07-02 12:30:24.346234] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_DATA.dat [2020-07-02 12:30:24.425835] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile 
/processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed subvol is data-client-15) [2020-07-02 12:30:24.426880] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_METADATA.dat [2020-07-02 12:30:25.158718] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat on data-client-9 (hashed subvol is data-client-19) [2020-07-02 12:30:25.159619] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_DATA.dat [2020-07-02 12:30:25.531479] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed subvol is data-client-10) [2020-07-02 12:30:25.540569] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_METADATA.dat [2020-07-02 12:30:25.771692] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat on data-client-11 (hashed subvol is data-client-3) [2020-07-02 12:30:25.772610] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_DATA.dat [2020-07-02 12:30:25.866118] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 (hashed subvol is data-client-8) [2020-07-02 12:30:25.866917] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_METADATA.dat [2020-07-02 12:30:26.424386] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat on data-client-9 (hashed subvol is data-client-18) [2020-07-02 12:30:26.425309] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_DATA.dat [2020-07-02 12:30:26.818852] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 (hashed subvol is data-client-2) [2020-07-02 12:30:26.819890] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_METADATA.dat [2020-07-02 12:30:27.352405] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat on data-client-10 (hashed subvol is data-client-2) [2020-07-02 12:30:27.352914] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_DATA.dat [2020-07-02 12:30:27.521286] I [MSGID: 109045] 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed subvol is data-client-18) [2020-07-02 12:30:27.522325] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_METADATA.dat [2020-07-02 12:30:28.566634] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat on data-client-2 (hashed subvol is data-client-11) [2020-07-02 12:30:28.579295] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO5_DATA.dat [2020-07-02 12:30:28.958028] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat on data-client-7 (hashed subvol is data-client-16) [2020-07-02 12:30:28.959102] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_DATA.dat [2020-07-02 12:30:29.012429] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed subvol is data-client-15) [2020-07-02 12:30:29.013416] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_METADATA.dat [2020-07-02 12:30:29.396716] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on data-client-17 (hashed subvol is data-client-10) [2020-07-02 12:30:29.397740] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSDATA.dat [2020-07-02 12:30:29.556312] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed subvol is data-client-18) [2020-07-02 12:30:29.557197] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat [2020-07-02 12:30:30.605354] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 (hashed subvol is data-client-19) [2020-07-02 12:30:30.606117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat [2020-07-02 12:30:31.559206] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - 613576736 - 920330171 - 4159036738 [2020-07-02 12:30:31.559255] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes, gfid = 
21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 [2020-07-02 12:30:31.569025] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - 920330172 - 1227083607 - 4159036738 [2020-07-02 12:30:31.569067] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 [2020-07-02 12:30:31.701849] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - 3374637116 - 3681390551 - 4159036738 [2020-07-02 12:30:31.701895] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 [2020-07-02 12:30:31.738464] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - 3681390552 - 3988143987 - 4159036738 [2020-07-02 12:30:31.738507] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 [2020-07-02 12:30:31.857102] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk layout - 3067883680 - 3374637115 - 4159036738 [2020-07-02 12:30:31.857147] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 [2020-07-02 12:30:31.857180] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 [2020-07-02 12:30:31.857197] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 [2020-07-02 12:30:31.917705] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 - 306753435 - 4159036738 [2020-07-02 12:30:31.917781] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 [2020-07-02 12:30:31.917855] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk layout - 3988213852 - 4294967295 - 4159036738 [2020-07-02 12:30:31.917874] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 [2020-07-02 12:30:32.390945] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - 3681460416 - 3988213851 - 4159036738 [2020-07-02 12:30:32.390998] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 [2020-07-02 12:30:32.391056] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 
4-data-dht: subvol: data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - 3988213852 - 4294967295 - 4159036738 [2020-07-02 12:30:32.391075] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 [2020-07-02 12:33:50.915279] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 (2cb54500-814d-4e85-83e7-e33d9440b18d) (hash=data-client-6/cache=data-client-18) => /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) (hash=data-client-6/cache=) [2020-07-02 12:34:09.799586] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k (99938ee6-6986-4123-9d72-ec09e2310b4f) (hash=data-client-17/cache=data-client-18) => /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) (hash=data-client-17/cache=) .... Please look into this at top-priority if possible. Let me know if anything else is required. -- Regards, Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 2 14:45:38 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 02 Jul 2020 17:45:38 +0300 Subject: [Gluster-users] Problems with qemu and disperse volumes (live merge) In-Reply-To: References: <93D3EE3B-B5B3-4689-BF66-C1442A03971E@yahoo.com> Message-ID: ?? 2 ??? 2020 ?. 16:33:51 GMT+03:00, Marco Fais ??????: >Hi Strahil, > >WARNING: As you enabled sharding - NEVER DISABLE SHARDING, EVER ! >> > >Thanks -- good to be reminded :) > > >> >When you say they will not be optimal are you referring mainly to >> >performance considerations? We did plenty of testing, and in terms >of >> >performance didn't have issues even with I/O intensive workloads >(using >> >SSDs, I had issues with spinning disks). >> >> Yes, the client side has to connect to 6 bricks (4+2) at a time and >> calculate the data in order to obtain the necessary information.Same >is >> valid for writing. >> If you need to conserve space, you can test VDO without compression >(of >> even with it). >> > >Understood -- will explore VDO. Storage usage efficiency is less >important >than fault tolerance or performance for us -- disperse volumes seemed >to >tick all the boxes so we looked at them primarily. >But clearly I had missed that they are not used as mainstream VM >storage >for oVirt (I did know they weren't supported, but as explained thought >was >more on the management side). > > >> >> Also with replica volumes, you can use 'choose-local' /in case >you >> have faster than the network storage (like NVMe)/ and increase the >read >> speed. Of course this feature is useful for Hyperconverged setup >(gluster >> + ovirt on the same node). >> > >Will explore this option as well, thanks for the suggestion. > > >> If you were using ovirt 4.3 , I would recommend you to focus on >> gluster. Yet, you use oVirt 4.4 which is quite newer and it needs > some >> polishing. >> > >Ovirt 4.3.9 (using the older Centos 7 qemu/libvirt) unfortunately had >similar issues with the disperse volumes. Not sure if exactly the same, >as >never looked deeper into it, but the results were similar. 
>Ovirt 4.4.0 has some issues with snapshot deletion that are independent >from Gluster (I have raised the issue here, >https://bugzilla.redhat.com/show_bug.cgi?id=1840414, should be sorted >with >4.4.2 I guess), so at the moment it only works with the "testing" AV >repo. In such case I can recommend you to: 1. Ensure you have enough space on all bricks for the logs (/var/log/gluster). Several gigs should be OK 2. Enable all logs to 'TRACE' . Red Hat's documentation on the topic is quite good: https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3/html/administration_guide/configuring_the_log_level 3. Reproduce the issue on a fresh VM (never done snapshot deletion) 4. Disable (switch to info) all logs as per the link in point 2 The logs will be spread among all nodes. If you have remote logging available, you can also use it for analysis of the logs. Most probably the brick logs can provide useful information. > >> Check ovirt engine logs (on the HostedEngine VM or your standalone >> engine) , vdsm logs on the host that was running the VM and next - >check >> the brick logs. >> > >Will do. > >Thanks, >Marco About VDO - it might require some tuning and even afterwards it won't be very performant, so it depends on your needs. Best Regards, Strahil Nikolov From hunter86_bg at yahoo.com Thu Jul 2 20:21:32 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 02 Jul 2020 23:21:32 +0300 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Hi Shreyansh, have you checked the gluster logs on the failed nodes for any clues -> for example 10.132.0.9 ? Sadly I don't have much experience with pure distributed volumes, but I do think that everything will be back to normal when the rebalance is complete. Yet, based on the output - it won't be soon. Best Regards, Strahil Nikolov ?? 2 ??? 2020 ?. 17:39:25 GMT+03:00, Shreyansh Shah ??????: >Hi All, > >*We are facing "Mismatching layouts for ,gfid = " >errors.* > >We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB >each) >on each node, 7 nodes in total. We added new bricks yesterday to the >existing setup. >Post that we did a rebalance fix-layout and then a rebalance (which is >currently still in progress). The status shows "failed" on certain >bricks >but "in progress" for others. Adding output for gluster rebalance >status >below. > >The glusterfs client logs are flooded with "Mismatching layouts for >,gfid = " >The performance too seems to have degraded due to this, even basic >commands >like `cd` and `ls` are taking more than a minute compared to sub-second >number before brick addition. >Apart from that we also experienced many binaries and files giving >error >stale file handle error even though the files were present. 
> > >*gluster rebalance status :* > >Node Rebalanced-files size scanned failures >skipped status run time in h:m:s >--------- ----------- ----------- ----------- ----------- >----------- ------------ -------------- >localhost 176 3.5GB 12790 0 > 8552 in progress 21:36:01 >10.132.0.72 8232 394.8GB 19995 21 > 26 failed 14:50:30 >10.132.0.44 12625 1.0TB 50023 1 > 10202 in progress 21:36:00 >10.132.0.3 21982 956.8GB 79145 1 > 34571 in progress 21:36:00 >10.132.0.9 7975 355.8GB 20157 6 > 1522 failed 14:51:45 >10.132.0.73 6293 394.5GB 26414 151 > 8085 failed 14:51:45 >10.132.0.70 6480 477.1GB 21058 27 > 1787 failed 14:50:32 >Estimated time left for rebalance to complete : 130:56:28 > > >*Logs from one of the clients below:* > >[2020-07-02 12:30:14.971916] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; >disk >layout - 2761060380 - 3067813815 - 4159036738 >[2020-07-02 12:30:14.971935] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >[2020-07-02 12:30:15.032013] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; >disk >layout - 3681390552 - 3988143987 - 4159036738 >[2020-07-02 12:30:15.032059] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >[2020-07-02 12:30:15.032107] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; >disk >layout - 3374637116 - 3681390551 - 4159036738 >[2020-07-02 12:30:15.032153] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >[2020-07-02 12:30:15.093329] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; >disk >layout - 2454306944 - 2761060379 - 4159036738 >[2020-07-02 12:30:15.093373] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >[2020-07-02 12:30:15.093460] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; >disk >layout - 2761060380 - 3067813815 - 4159036738 >[2020-07-02 12:30:15.093515] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >[2020-07-02 12:30:15.151063] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; >disk >layout - 3681390552 - 3988143987 - 4159036738 >[2020-07-02 12:30:15.151108] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >[2020-07-02 12:30:15.151149] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; >disk >layout - 3374637116 - 3681390551 - 4159036738 
>[2020-07-02 12:30:15.151162] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >[2020-07-02 12:30:15.424321] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; >disk >layout - 920400036 - 1227153471 - 4159036738 >[2020-07-02 12:30:15.424380] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >[2020-07-02 12:30:15.424456] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; >disk >layout - 1840730208 - 2147483643 - 4159036738 >[2020-07-02 12:30:15.424484] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >[2020-07-02 12:30:15.424525] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; >disk >layout - 1533976772 - 1840730207 - 4159036738 >[2020-07-02 12:30:15.424542] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >[2020-07-02 12:30:15.424596] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >layout - 613646600 - 920400035 - 4159036738 >[2020-07-02 12:30:15.424607] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >[2020-07-02 12:30:16.004482] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat >on >data-client-7 (hashed subvol is data-client-17) >[2020-07-02 12:30:16.005523] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_CDS_1_DATA.dat >[2020-07-02 12:30:16.531047] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/BSE_CDS_1_METADATA.dat >on data-client-9 (hashed subvol is data-client-19) >[2020-07-02 12:30:16.532086] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_CDS_1_METADATA.dat >[2020-07-02 12:30:18.733229] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat >on >data-client-17 (hashed subvol is data-client-9) >[2020-07-02 12:30:18.734421] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_CDS_2_DATA.dat >[2020-07-02 12:30:19.171930] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/BSE_CDS_2_METADATA.dat >on data-client-9 (hashed subvol is data-client-18) >[2020-07-02 
12:30:19.172901] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_CDS_2_METADATA.dat >[2020-07-02 12:30:21.028495] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat >on >data-client-6 (hashed subvol is data-client-15) >[2020-07-02 12:30:21.029836] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_2_DATA.dat >[2020-07-02 12:30:21.127648] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/BSE_EQ_2_METADATA.dat >on data-client-11 (hashed subvol is data-client-3) >[2020-07-02 12:30:21.128713] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_2_METADATA.dat >[2020-07-02 12:30:21.201126] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat >on >data-client-15 (hashed subvol is data-client-7) >[2020-07-02 12:30:21.201928] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_3_DATA.dat >[2020-07-02 12:30:21.566158] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/BSE_EQ_3_METADATA.dat >on data-client-7 (hashed subvol is data-client-16) >[2020-07-02 12:30:21.567123] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_3_METADATA.dat >[2020-07-02 12:30:21.649357] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat >on >data-client-2 (hashed subvol is data-client-11) >[2020-07-02 12:30:21.661381] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_4_DATA.dat >[2020-07-02 12:30:21.748937] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/BSE_EQ_4_METADATA.dat >on data-client-15 (hashed subvol is data-client-7) >[2020-07-02 12:30:21.749481] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_4_METADATA.dat >[2020-07-02 12:30:21.898593] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat >on >data-client-14 (hashed subvol is data-client-7) >[2020-07-02 12:30:21.899442] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_6_DATA.dat >[2020-07-02 12:30:22.039337] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile 
>/processed_data/20200630/BSE_EQ_6_METADATA.dat >on data-client-10 (hashed subvol is data-client-2) >[2020-07-02 12:30:22.040086] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/BSE_EQ_6_METADATA.dat >[2020-07-02 12:30:22.501877] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECDS1_DATA.dat >on data-client-15 (hashed subvol is data-client-8) >[2020-07-02 12:30:22.502712] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECDS1_DATA.dat >[2020-07-02 12:30:22.782577] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >(hashed subvol is data-client-6) >[2020-07-02 12:30:22.783777] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECDS1_METADATA.dat >[2020-07-02 12:30:23.146847] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM1_DATA.dat on >data-client-17 (hashed subvol is data-client-9) >[2020-07-02 12:30:23.148009] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM1_DATA.dat >[2020-07-02 12:30:23.229290] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 >(hashed >subvol is data-client-6) >[2020-07-02 12:30:23.230151] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM1_METADATA.dat >[2020-07-02 12:30:23.889520] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM2_DATA.dat on >data-client-2 (hashed subvol is data-client-11) >[2020-07-02 12:30:23.896618] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM2_DATA.dat >[2020-07-02 12:30:24.093017] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 >(hashed >subvol is data-client-15) >[2020-07-02 12:30:24.094117] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM2_METADATA.dat >[2020-07-02 12:30:24.345257] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM3_DATA.dat on >data-client-17 (hashed subvol is data-client-10) >[2020-07-02 12:30:24.346234] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM3_DATA.dat 
>[2020-07-02 12:30:24.425835] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 >(hashed >subvol is data-client-15) >[2020-07-02 12:30:24.426880] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSECM3_METADATA.dat >[2020-07-02 12:30:25.158718] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO1_DATA.dat >on data-client-9 (hashed subvol is data-client-19) >[2020-07-02 12:30:25.159619] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO1_DATA.dat >[2020-07-02 12:30:25.531479] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 >(hashed >subvol is data-client-10) >[2020-07-02 12:30:25.540569] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO1_METADATA.dat >[2020-07-02 12:30:25.771692] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO2_DATA.dat >on data-client-11 (hashed subvol is data-client-3) >[2020-07-02 12:30:25.772610] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO2_DATA.dat >[2020-07-02 12:30:25.866118] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >(hashed subvol is data-client-8) >[2020-07-02 12:30:25.866917] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO2_METADATA.dat >[2020-07-02 12:30:26.424386] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO3_DATA.dat >on data-client-9 (hashed subvol is data-client-18) >[2020-07-02 12:30:26.425309] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO3_DATA.dat >[2020-07-02 12:30:26.818852] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >(hashed subvol is data-client-2) >[2020-07-02 12:30:26.819890] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO3_METADATA.dat >[2020-07-02 12:30:27.352405] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO4_DATA.dat >on data-client-10 (hashed subvol is data-client-2) >[2020-07-02 12:30:27.352914] I [MSGID: 109069] 
>[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO4_DATA.dat >[2020-07-02 12:30:27.521286] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 >(hashed >subvol is data-client-18) >[2020-07-02 12:30:27.522325] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO4_METADATA.dat >[2020-07-02 12:30:28.566634] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO5_DATA.dat >on data-client-2 (hashed subvol is data-client-11) >[2020-07-02 12:30:28.579295] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO5_DATA.dat >[2020-07-02 12:30:28.958028] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO6_DATA.dat >on data-client-7 (hashed subvol is data-client-16) >[2020-07-02 12:30:28.959102] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO6_DATA.dat >[2020-07-02 12:30:29.012429] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 >(hashed >subvol is data-client-15) >[2020-07-02 12:30:29.013416] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/MCASTNSEFNO6_METADATA.dat >[2020-07-02 12:30:29.396716] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/NSEFO_BSE_TSDATA.dat on >data-client-17 (hashed subvol is data-client-10) >[2020-07-02 12:30:29.397740] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/NSEFO_BSE_TSDATA.dat >[2020-07-02 12:30:29.556312] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 >(hashed >subvol is data-client-18) >[2020-07-02 12:30:29.557197] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >[2020-07-02 12:30:30.605354] I [MSGID: 109045] >[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >deletion of stale linkfile >/processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on >data-client-9 >(hashed subvol is data-client-19) >[2020-07-02 12:30:30.606117] I [MSGID: 109069] >[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >returned with op_ret -> 0 and op-errno -> 0 for >/processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >[2020-07-02 12:30:31.559206] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: 
>data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >613576736 - 920330171 - 4159036738 >[2020-07-02 12:30:31.559255] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >[2020-07-02 12:30:31.569025] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout >- >920330172 - 1227083607 - 4159036738 >[2020-07-02 12:30:31.569067] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >[2020-07-02 12:30:31.701849] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout >- >3374637116 - 3681390551 - 4159036738 >[2020-07-02 12:30:31.701895] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX, gfid = >fff324f2-f855-4881-b77c-81e856522373 >[2020-07-02 12:30:31.738464] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout >- >3681390552 - 3988143987 - 4159036738 >[2020-07-02 12:30:31.738507] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX, gfid = >fff324f2-f855-4881-b77c-81e856522373 >[2020-07-02 12:30:31.857102] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; >disk >layout - 3067883680 - 3374637115 - 4159036738 >[2020-07-02 12:30:31.857147] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >f8447150-4801-4188-add9-ea295bb88729 >[2020-07-02 12:30:31.857180] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; >disk >layout - 3374637116 - 3681390551 - 4159036738 >[2020-07-02 12:30:31.857197] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >f8447150-4801-4188-add9-ea295bb88729 >[2020-07-02 12:30:31.917705] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout >- 0 >- 306753435 - 4159036738 >[2020-07-02 12:30:31.917781] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >f8447150-4801-4188-add9-ea295bb88729 >[2020-07-02 12:30:31.917855] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; >disk >layout - 3988213852 - 4294967295 - 4159036738 >[2020-07-02 12:30:31.917874] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >f8447150-4801-4188-add9-ea295bb88729 >[2020-07-02 12:30:32.390945] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout >- 
>3681460416 - 3988213851 - 4159036738 >[2020-07-02 12:30:32.390998] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/NIFTY, gfid = >b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >[2020-07-02 12:30:32.391056] I [MSGID: 109064] >[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout >- >3988213852 - 4294967295 - 4159036738 >[2020-07-02 12:30:32.391075] I [MSGID: 109018] >[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts >for >/processed_data/Indexes/NSEINDEX/NIFTY, gfid = >b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >[2020-07-02 12:33:50.915279] I [MSGID: 109066] >[dht-rename.c:1922:dht_rename] 4-data-dht: renaming >/raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >(2cb54500-814d-4e85-83e7-e33d9440b18d) >(hash=data-client-6/cache=data-client-18) => >/raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >(hash=data-client-6/cache=) >[2020-07-02 12:34:09.799586] I [MSGID: 109066] >[dht-rename.c:1922:dht_rename] 4-data-dht: renaming >/raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >(99938ee6-6986-4123-9d72-ec09e2310b4f) >(hash=data-client-17/cache=data-client-18) => >/raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >(hash=data-client-17/cache=) >.... > > >Please look into this at top-priority if possible. >Let me know if anything else is required. From felix.koelzow at gmx.de Fri Jul 3 08:16:30 2020 From: felix.koelzow at gmx.de (=?UTF-8?Q?Felix_K=c3=b6lzow?=) Date: Fri, 3 Jul 2020 10:16:30 +0200 Subject: [Gluster-users] Geo-replication completely broken In-Reply-To: References: <31c26aca-2dbd-e798-27f0-e8c33afe7f21@gmx.de> Message-ID: <3c8c2b2b-b539-f8f7-0597-d96d2be1fd74@gmx.de> Dear Users, the geo-replication is still broken. This is not really a comfortable situation. Does any user has had the same experience and is able to share a possible workaround? We are actually running gluster v6.0 Regards, Felix On 25/06/2020 10:04, Shwetha Acharya wrote: > Hi Rob and Felix, > > Please share the *-changes.log files and brick logs, which will help > in analysis of the issue. > > Regards, > Shwetha > > On Thu, Jun 25, 2020 at 1:26 PM Felix K?lzow > wrote: > > Hey Rob, > > > same issue for our third volume. Have a look at the logs just from > right now (below). > > Question: You removed the htime files and the old changelogs. Just > rm the files or is there something to pay more attention > > before removing the changelog files and the htime file. > > Regards, > > Felix > > [2020-06-25 07:51:53.795430] I [resource(worker > /gluster/vg00/dispersed_fuse1024/brick):1435:connect_remote] SSH: > SSH connection between master and slave established.??? > duration=1.2341 > [2020-06-25 07:51:53.795639] I [resource(worker > /gluster/vg00/dispersed_fuse1024/brick):1105:connect] GLUSTER: > Mounting gluster volume locally... > [2020-06-25 07:51:54.520601] I [monitor(monitor):280:monitor] > Monitor: worker died in startup phase > brick=/gluster/vg01/dispersed_fuse1024/brick > [2020-06-25 07:51:54.535809] I > [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker > Status Change??? status=Faulty > [2020-06-25 07:51:54.882143] I [resource(worker > /gluster/vg00/dispersed_fuse1024/brick):1128:connect] GLUSTER: > Mounted gluster volume??? duration=1.0864 > [2020-06-25 07:51:54.882388] I [subcmds(worker > /gluster/vg00/dispersed_fuse1024/brick):84:subcmd_worker] : > Worker spawn successful. 
Acknowledging back to monitor > [2020-06-25 07:51:56.911412] E [repce(agent > /gluster/vg00/dispersed_fuse1024/brick):121:worker] : call > failed: > Traceback (most recent call last): > ? File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line > 117, in worker > ??? res = getattr(self.obj, rmeth)(*in_data[2:]) > ? File > "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", line > 40, in register > ??? return Changes.cl_register(cl_brick, cl_dir, cl_log, cl_level, > retries) > ? File > "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line > 46, in cl_register > ??? cls.raise_changelog_err() > ? File > "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", line > 30, in raise_changelog_err > ??? raise ChangelogException(errn, os.strerror(errn)) > ChangelogException: [Errno 2] No such file or directory > [2020-06-25 07:51:56.912056] E [repce(worker > /gluster/vg00/dispersed_fuse1024/brick):213:__call__] RepceClient: > call failed call=75086:140098349655872:1593071514.91 > method=register??? error=ChangelogException > [2020-06-25 07:51:56.912396] E [resource(worker > /gluster/vg00/dispersed_fuse1024/brick):1286:service_loop] > GLUSTER: Changelog register failed??? error=[Errno 2] No such file > or directory > [2020-06-25 07:51:56.928031] I [repce(agent > /gluster/vg00/dispersed_fuse1024/brick):96:service_loop] > RepceServer: terminating on reaching EOF. > [2020-06-25 07:51:57.886126] I [monitor(monitor):280:monitor] > Monitor: worker died in startup phase > brick=/gluster/vg00/dispersed_fuse1024/brick > [2020-06-25 07:51:57.895920] I > [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: Worker > Status Change??? status=Faulty > [2020-06-25 07:51:58.607405] I [gsyncdstatus(worker > /gluster/vg00/dispersed_fuse1024/brick):287:set_passive] > GeorepStatus: Worker Status Change??? status=Passive > [2020-06-25 07:51:58.607768] I [gsyncdstatus(worker > /gluster/vg01/dispersed_fuse1024/brick):287:set_passive] > GeorepStatus: Worker Status Change??? status=Passive > [2020-06-25 07:51:58.608004] I [gsyncdstatus(worker > /gluster/vg00/dispersed_fuse1024/brick):281:set_active] > GeorepStatus: Worker Status Change??? status=Active > > > On 25/06/2020 09:15, Rob.Quagliozzi at rabobank.com > wrote: >> >> Hi All, >> >> We?ve got two six node RHEL 7.8 clusters and geo-replication >> would appear to be completely broken between them. I?ve deleted >> the session, removed & recreated pem files, old changlogs/htime >> (after removing relevant options from volume) and completely set >> up geo-rep from scratch, but the new session comes up as >> Initializing, then goes faulty, and starts looping. Volume (on >> both sides) is a 4 x 2 disperse, running Gluster v6 (RH latest).? >> Gsyncd reports: >> >> [2020-06-25 07:07:14.701423] I >> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: >> Worker Status Change status=Initializing... >> >> [2020-06-25 07:07:14.701744] I [monitor(monitor):159:monitor] >> Monitor: starting gsyncd worker?? brick=/rhgs/brick20/brick >> slave_node=bxts470194.eu.rabonet.com >> >> >> [2020-06-25 07:07:14.707997] D [monitor(monitor):230:monitor] >> Monitor: Worker would mount volume privately >> >> [2020-06-25 07:07:14.757181] I [gsyncd(agent >> /rhgs/brick20/brick):318:main] : Using session config file >> path=/var/lib/glusterd/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/gsyncd.conf >> >> [2020-06-25 07:07:14.758126] D [subcmds(agent >> /rhgs/brick20/brick):107:subcmd_agent] : RPC FD????? 
>> rpc_fd='5,12,11,10' >> >> [2020-06-25 07:07:14.758627] I [changelogagent(agent >> /rhgs/brick20/brick):72:__init__] ChangelogAgent: Agent listining... >> >> [2020-06-25 07:07:14.764234] I [gsyncd(worker >> /rhgs/brick20/brick):318:main] : Using session config file >> path=/var/lib/glusterd/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/gsyncd.conf >> >> [2020-06-25 07:07:14.779409] I [resource(worker >> /rhgs/brick20/brick):1386:connect_remote] SSH: Initializing SSH >> connection between master and slave... >> >> [2020-06-25 07:07:14.841793] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068834.84 __repce_version__() ... >> >> [2020-06-25 07:07:16.148725] D [repce(worker >> /rhgs/brick20/brick):215:__call__] RepceClient: call >> 6799:140380783982400:1593068834.84 __repce_version__ -> 1.0 >> >> [2020-06-25 07:07:16.148911] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068836.15 version() ... >> >> [2020-06-25 07:07:16.149574] D [repce(worker >> /rhgs/brick20/brick):215:__call__] RepceClient: call >> 6799:140380783982400:1593068836.15 version -> 1.0 >> >> [2020-06-25 07:07:16.149735] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068836.15 pid() ... >> >> [2020-06-25 07:07:16.150588] D [repce(worker >> /rhgs/brick20/brick):215:__call__] RepceClient: call >> 6799:140380783982400:1593068836.15 pid -> 30703 >> >> [2020-06-25 07:07:16.150747] I [resource(worker >> /rhgs/brick20/brick):1435:connect_remote] SSH: SSH connection >> between master and slave established. duration=1.3712 >> >> [2020-06-25 07:07:16.150819] I [resource(worker >> /rhgs/brick20/brick):1105:connect] GLUSTER: Mounting gluster >> volume locally... >> >> [2020-06-25 07:07:16.265860] D [resource(worker >> /rhgs/brick20/brick):879:inhibit] DirectMounter: auxiliary >> glusterfs mount in place >> >> [2020-06-25 07:07:17.272511] D [resource(worker >> /rhgs/brick20/brick):953:inhibit] DirectMounter: auxiliary >> glusterfs mount prepared >> >> [2020-06-25 07:07:17.272708] I [resource(worker >> /rhgs/brick20/brick):1128:connect] GLUSTER: Mounted gluster >> volume????? duration=1.1218 >> >> [2020-06-25 07:07:17.272794] I [subcmds(worker >> /rhgs/brick20/brick):84:subcmd_worker] : Worker spawn >> successful. Acknowledging back to monitor >> >> [2020-06-25 07:07:17.272973] D [master(worker >> /rhgs/brick20/brick):104:gmaster_builder] : setting up >> change detection mode mode=xsync >> >> [2020-06-25 07:07:17.273063] D [monitor(monitor):273:monitor] >> Monitor: worker(/rhgs/brick20/brick) connected >> >> [2020-06-25 07:07:17.273678] D [master(worker >> /rhgs/brick20/brick):104:gmaster_builder] : setting up >> change detection mode mode=changelog >> >> [2020-06-25 07:07:17.274224] D [master(worker >> /rhgs/brick20/brick):104:gmaster_builder] : setting up >> change detection mode mode=changeloghistory >> >> [2020-06-25 07:07:17.276484] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068837.28 version() ... 
>> >> [2020-06-25 07:07:17.276916] D [repce(worker >> /rhgs/brick20/brick):215:__call__] RepceClient: call >> 6799:140380783982400:1593068837.28 version -> 1.0 >> >> [2020-06-25 07:07:17.277009] D [master(worker >> /rhgs/brick20/brick):777:setup_working_dir] _GMaster: changelog >> working dir >> /var/lib/misc/gluster/gsyncd/prd_mx_intvol_bxts470190_prd_mx_intvol/rhgs-brick20-brick >> >> [2020-06-25 07:07:17.277098] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068837.28 init() ... >> >> [2020-06-25 07:07:17.292944] D [repce(worker >> /rhgs/brick20/brick):215:__call__] RepceClient: call >> 6799:140380783982400:1593068837.28 init -> None >> >> [2020-06-25 07:07:17.293097] D [repce(worker >> /rhgs/brick20/brick):195:push] RepceClient: call >> 6799:140380783982400:1593068837.29 >> register('/rhgs/brick20/brick', >> '/var/lib/misc/gluster/gsyncd/prd_mx_intvol_bxts470190_prd_mx_intvol/rhgs-brick20-brick', >> '/var/log/glusterfs/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/changes-rhgs-brick20-brick.log', >> 8, 5) ... >> >> [2020-06-25 07:07:19.296294] E [repce(agent >> /rhgs/brick20/brick):121:worker] : call failed: >> >> Traceback (most recent call last): >> >> ? File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line >> 117, in worker >> >> ??? res = getattr(self.obj, rmeth)(*in_data[2:]) >> >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", >> line 40, in register >> >> ??? return Changes.cl_register(cl_brick, cl_dir, cl_log, >> cl_level, retries) >> >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >> line 46, in cl_register >> >> ??? cls.raise_changelog_err() >> >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >> line 30, in raise_changelog_err >> >> ??? raise ChangelogException(errn, os.strerror(errn)) >> >> ChangelogException: [Errno 2] No such file or directory >> >> [2020-06-25 07:07:19.297161] E [repce(worker >> /rhgs/brick20/brick):213:__call__] RepceClient: call failed >> call=6799:140380783982400:1593068837.29 method=register >> error=ChangelogException >> >> [2020-06-25 07:07:19.297338] E [resource(worker >> /rhgs/brick20/brick):1286:service_loop] GLUSTER: Changelog >> register failed????? error=[Errno 2] No such file or directory >> >> [2020-06-25 07:07:19.315074] I [repce(agent >> /rhgs/brick20/brick):96:service_loop] RepceServer: terminating on >> reaching EOF. >> >> [2020-06-25 07:07:20.275701] I [monitor(monitor):280:monitor] >> Monitor: worker died in startup phase???? brick=/rhgs/brick20/brick >> >> [2020-06-25 07:07:20.277383] I >> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: >> Worker Status Change status=Faulty >> >> We?ve done everything we can think of, including an ?strace ?f? >> on the pid, and we can?t really find anything. I?m about to lose >> the last of my hair over this, so does anyone have any ideas at >> all? We?ve even removed the entire slave vol and rebuilt it. >> >> Thanks >> >> Rob >> >> *Rob Quagliozzi* >> >> *Specialised Application Support* >> >> >> >> ------------------------------------------------------------------------ >> This email (including any attachments to it) is confidential, >> legally privileged, subject to copyright and is sent for the >> personal attention of the intended recipient only. If you have >> received this email in error, please advise us immediately and >> delete it. 
You are notified that disclosing, copying,
>> distributing or taking any action in reliance on the contents of
>> this information is strictly prohibited. Although we have taken
>> reasonable precautions to ensure no viruses are present in this
>> email, we cannot accept responsibility for any loss or damage
>> arising from the viruses in this email or attachments. We exclude
>> any liability for the content of this email, or for the
>> consequences of any actions taken on the basis of the information
>> provided in this email or its attachments, unless that
>> information is subsequently confirmed in writing.
>> ------------------------------------------------------------------------

From peljasz at yahoo.co.uk Fri Jul 3 10:46:33 2020
From: peljasz at yahoo.co.uk (lejeczek)
Date: Fri, 3 Jul 2020 11:46:33 +0100
Subject: [Gluster-users] Official Bugzilla?
References:
Message-ID:

hi guys,

where those of us who run gluster from (via) EPEL repo should go to
report bugs?

many thanks, L.

From hunter86_bg at yahoo.com Fri Jul 3 12:54:52 2020
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Fri, 03 Jul 2020 15:54:52 +0300
Subject: [Gluster-users] Geo-replication completely broken
In-Reply-To: <3c8c2b2b-b539-f8f7-0597-d96d2be1fd74@gmx.de>
References: <31c26aca-2dbd-e798-27f0-e8c33afe7f21@gmx.de>
 <3c8c2b2b-b539-f8f7-0597-d96d2be1fd74@gmx.de>
Message-ID:

Hi Felix,

It seems I missed your reply with the change log that Shwetha requested.

Best Regards,
Strahil Nikolov

On 3 July 2020 11:16:30 GMT+03:00, "Felix Kölzow" wrote:
>Dear Users,
>the geo-replication is still broken. This is not really a comfortable
>situation.
>Does any user has had the same experience and is able to share a
>possible workaround?
>We are actually running gluster v6.0
>Regards,
>
>Felix
>
>
>On 25/06/2020 10:04, Shwetha Acharya wrote:
>> Hi Rob and Felix,
>>
>> Please share the *-changes.log files and brick logs, which will help
>> in analysis of the issue.
>>
>> Regards,
>> Shwetha
>>
>> On Thu, Jun 25, 2020 at 1:26 PM Felix Kölzow wrote:
>>
>> Hey Rob,
>>
>> same issue for our third volume. Have a look at the logs just
>> from right now (below).
>>
>> Question: You removed the htime files and the old changelogs. Just
>> rm the files or is there something to pay more attention
>> before removing the changelog files and the htime file.
>>
>> Regards,
>>
>> Felix
>>
>> [2020-06-25 07:51:53.795430] I [resource(worker
>> /gluster/vg00/dispersed_fuse1024/brick):1435:connect_remote] SSH:
>> SSH connection between master and slave established.
>> duration=1.2341 >> [2020-06-25 07:51:53.795639] I [resource(worker >> /gluster/vg00/dispersed_fuse1024/brick):1105:connect] GLUSTER: >> Mounting gluster volume locally... >> [2020-06-25 07:51:54.520601] I [monitor(monitor):280:monitor] >> Monitor: worker died in startup phase >> brick=/gluster/vg01/dispersed_fuse1024/brick >> [2020-06-25 07:51:54.535809] I >> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: >Worker >> Status Change??? status=Faulty >> [2020-06-25 07:51:54.882143] I [resource(worker >> /gluster/vg00/dispersed_fuse1024/brick):1128:connect] GLUSTER: >> Mounted gluster volume??? duration=1.0864 >> [2020-06-25 07:51:54.882388] I [subcmds(worker >> /gluster/vg00/dispersed_fuse1024/brick):84:subcmd_worker] : >> Worker spawn successful. Acknowledging back to monitor >> [2020-06-25 07:51:56.911412] E [repce(agent >> /gluster/vg00/dispersed_fuse1024/brick):121:worker] : call >> failed: >> Traceback (most recent call last): >> ? File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line >> 117, in worker >> ??? res = getattr(self.obj, rmeth)(*in_data[2:]) >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", >line >> 40, in register >> ??? return Changes.cl_register(cl_brick, cl_dir, cl_log, >cl_level, >> retries) >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >line >> 46, in cl_register >> ??? cls.raise_changelog_err() >> ? File >> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >line >> 30, in raise_changelog_err >> ??? raise ChangelogException(errn, os.strerror(errn)) >> ChangelogException: [Errno 2] No such file or directory >> [2020-06-25 07:51:56.912056] E [repce(worker >> /gluster/vg00/dispersed_fuse1024/brick):213:__call__] >RepceClient: >> call failed call=75086:140098349655872:1593071514.91 >> method=register??? error=ChangelogException >> [2020-06-25 07:51:56.912396] E [resource(worker >> /gluster/vg00/dispersed_fuse1024/brick):1286:service_loop] >> GLUSTER: Changelog register failed??? error=[Errno 2] No such >file >> or directory >> [2020-06-25 07:51:56.928031] I [repce(agent >> /gluster/vg00/dispersed_fuse1024/brick):96:service_loop] >> RepceServer: terminating on reaching EOF. >> [2020-06-25 07:51:57.886126] I [monitor(monitor):280:monitor] >> Monitor: worker died in startup phase >> brick=/gluster/vg00/dispersed_fuse1024/brick >> [2020-06-25 07:51:57.895920] I >> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: >Worker >> Status Change??? status=Faulty >> [2020-06-25 07:51:58.607405] I [gsyncdstatus(worker >> /gluster/vg00/dispersed_fuse1024/brick):287:set_passive] >> GeorepStatus: Worker Status Change??? status=Passive >> [2020-06-25 07:51:58.607768] I [gsyncdstatus(worker >> /gluster/vg01/dispersed_fuse1024/brick):287:set_passive] >> GeorepStatus: Worker Status Change??? status=Passive >> [2020-06-25 07:51:58.608004] I [gsyncdstatus(worker >> /gluster/vg00/dispersed_fuse1024/brick):281:set_active] >> GeorepStatus: Worker Status Change??? status=Active >> >> >> On 25/06/2020 09:15, Rob.Quagliozzi at rabobank.com >> wrote: >>> >>> Hi All, >>> >>> We?ve got two six node RHEL 7.8 clusters and geo-replication >>> would appear to be completely broken between them. I?ve deleted >>> the session, removed & recreated pem files, old changlogs/htime >>> (after removing relevant options from volume) and completely set >>> up geo-rep from scratch, but the new session comes up as >>> Initializing, then goes faulty, and starts looping. 
Volume (on >>> both sides) is a 4 x 2 disperse, running Gluster v6 (RH >latest).? >>> Gsyncd reports: >>> >>> [2020-06-25 07:07:14.701423] I >>> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus: >>> Worker Status Change status=Initializing... >>> >>> [2020-06-25 07:07:14.701744] I [monitor(monitor):159:monitor] >>> Monitor: starting gsyncd worker?? brick=/rhgs/brick20/brick >>> slave_node=bxts470194.eu.rabonet.com >>> >>> >>> [2020-06-25 07:07:14.707997] D [monitor(monitor):230:monitor] >>> Monitor: Worker would mount volume privately >>> >>> [2020-06-25 07:07:14.757181] I [gsyncd(agent >>> /rhgs/brick20/brick):318:main] : Using session config file >>> >path=/var/lib/glusterd/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/gsyncd.conf >>> >>> [2020-06-25 07:07:14.758126] D [subcmds(agent >>> /rhgs/brick20/brick):107:subcmd_agent] : RPC FD????? >>> rpc_fd='5,12,11,10' >>> >>> [2020-06-25 07:07:14.758627] I [changelogagent(agent >>> /rhgs/brick20/brick):72:__init__] ChangelogAgent: Agent >listining... >>> >>> [2020-06-25 07:07:14.764234] I [gsyncd(worker >>> /rhgs/brick20/brick):318:main] : Using session config file >>> >path=/var/lib/glusterd/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/gsyncd.conf >>> >>> [2020-06-25 07:07:14.779409] I [resource(worker >>> /rhgs/brick20/brick):1386:connect_remote] SSH: Initializing SSH >>> connection between master and slave... >>> >>> [2020-06-25 07:07:14.841793] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068834.84 __repce_version__() ... >>> >>> [2020-06-25 07:07:16.148725] D [repce(worker >>> /rhgs/brick20/brick):215:__call__] RepceClient: call >>> 6799:140380783982400:1593068834.84 __repce_version__ -> 1.0 >>> >>> [2020-06-25 07:07:16.148911] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068836.15 version() ... >>> >>> [2020-06-25 07:07:16.149574] D [repce(worker >>> /rhgs/brick20/brick):215:__call__] RepceClient: call >>> 6799:140380783982400:1593068836.15 version -> 1.0 >>> >>> [2020-06-25 07:07:16.149735] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068836.15 pid() ... >>> >>> [2020-06-25 07:07:16.150588] D [repce(worker >>> /rhgs/brick20/brick):215:__call__] RepceClient: call >>> 6799:140380783982400:1593068836.15 pid -> 30703 >>> >>> [2020-06-25 07:07:16.150747] I [resource(worker >>> /rhgs/brick20/brick):1435:connect_remote] SSH: SSH connection >>> between master and slave established. duration=1.3712 >>> >>> [2020-06-25 07:07:16.150819] I [resource(worker >>> /rhgs/brick20/brick):1105:connect] GLUSTER: Mounting gluster >>> volume locally... >>> >>> [2020-06-25 07:07:16.265860] D [resource(worker >>> /rhgs/brick20/brick):879:inhibit] DirectMounter: auxiliary >>> glusterfs mount in place >>> >>> [2020-06-25 07:07:17.272511] D [resource(worker >>> /rhgs/brick20/brick):953:inhibit] DirectMounter: auxiliary >>> glusterfs mount prepared >>> >>> [2020-06-25 07:07:17.272708] I [resource(worker >>> /rhgs/brick20/brick):1128:connect] GLUSTER: Mounted gluster >>> volume????? duration=1.1218 >>> >>> [2020-06-25 07:07:17.272794] I [subcmds(worker >>> /rhgs/brick20/brick):84:subcmd_worker] : Worker spawn >>> successful. 
Acknowledging back to monitor >>> >>> [2020-06-25 07:07:17.272973] D [master(worker >>> /rhgs/brick20/brick):104:gmaster_builder] : setting up >>> change detection mode mode=xsync >>> >>> [2020-06-25 07:07:17.273063] D [monitor(monitor):273:monitor] >>> Monitor: worker(/rhgs/brick20/brick) connected >>> >>> [2020-06-25 07:07:17.273678] D [master(worker >>> /rhgs/brick20/brick):104:gmaster_builder] : setting up >>> change detection mode mode=changelog >>> >>> [2020-06-25 07:07:17.274224] D [master(worker >>> /rhgs/brick20/brick):104:gmaster_builder] : setting up >>> change detection mode mode=changeloghistory >>> >>> [2020-06-25 07:07:17.276484] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068837.28 version() ... >>> >>> [2020-06-25 07:07:17.276916] D [repce(worker >>> /rhgs/brick20/brick):215:__call__] RepceClient: call >>> 6799:140380783982400:1593068837.28 version -> 1.0 >>> >>> [2020-06-25 07:07:17.277009] D [master(worker >>> /rhgs/brick20/brick):777:setup_working_dir] _GMaster: changelog >>> working dir >>> >/var/lib/misc/gluster/gsyncd/prd_mx_intvol_bxts470190_prd_mx_intvol/rhgs-brick20-brick >>> >>> [2020-06-25 07:07:17.277098] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068837.28 init() ... >>> >>> [2020-06-25 07:07:17.292944] D [repce(worker >>> /rhgs/brick20/brick):215:__call__] RepceClient: call >>> 6799:140380783982400:1593068837.28 init -> None >>> >>> [2020-06-25 07:07:17.293097] D [repce(worker >>> /rhgs/brick20/brick):195:push] RepceClient: call >>> 6799:140380783982400:1593068837.29 >>> register('/rhgs/brick20/brick', >>> >'/var/lib/misc/gluster/gsyncd/prd_mx_intvol_bxts470190_prd_mx_intvol/rhgs-brick20-brick', >>> >'/var/log/glusterfs/geo-replication/prd_mx_intvol_bxts470190_prd_mx_intvol/changes-rhgs-brick20-brick.log', >>> 8, 5) ... >>> >>> [2020-06-25 07:07:19.296294] E [repce(agent >>> /rhgs/brick20/brick):121:worker] : call failed: >>> >>> Traceback (most recent call last): >>> >>> ? File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line >>> 117, in worker >>> >>> ??? res = getattr(self.obj, rmeth)(*in_data[2:]) >>> >>> ? File >>> "/usr/libexec/glusterfs/python/syncdaemon/changelogagent.py", >>> line 40, in register >>> >>> ??? return Changes.cl_register(cl_brick, cl_dir, cl_log, >>> cl_level, retries) >>> >>> ? File >>> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >>> line 46, in cl_register >>> >>> ??? cls.raise_changelog_err() >>> >>> ? File >>> "/usr/libexec/glusterfs/python/syncdaemon/libgfchangelog.py", >>> line 30, in raise_changelog_err >>> >>> ??? raise ChangelogException(errn, os.strerror(errn)) >>> >>> ChangelogException: [Errno 2] No such file or directory >>> >>> [2020-06-25 07:07:19.297161] E [repce(worker >>> /rhgs/brick20/brick):213:__call__] RepceClient: call failed >>> call=6799:140380783982400:1593068837.29 method=register >>> error=ChangelogException >>> >>> [2020-06-25 07:07:19.297338] E [resource(worker >>> /rhgs/brick20/brick):1286:service_loop] GLUSTER: Changelog >>> register failed????? error=[Errno 2] No such file or directory >>> >>> [2020-06-25 07:07:19.315074] I [repce(agent >>> /rhgs/brick20/brick):96:service_loop] RepceServer: terminating >on >>> reaching EOF. >>> >>> [2020-06-25 07:07:20.275701] I [monitor(monitor):280:monitor] >>> Monitor: worker died in startup phase???? 
>brick=/rhgs/brick20/brick
>>>
>>> [2020-06-25 07:07:20.277383] I
>>> [gsyncdstatus(monitor):248:set_worker_status] GeorepStatus:
>>> Worker Status Change status=Faulty
>>>
>>> We've done everything we can think of, including an 'strace -f'
>>> on the pid, and we can't really find anything. I'm about to lose
>>> the last of my hair over this, so does anyone have any ideas at
>>> all? We've even removed the entire slave vol and rebuilt it.
>>>
>>> Thanks
>>>
>>> Rob
>>>
>>> *Rob Quagliozzi*
>>>
>>> *Specialised Application Support*

From kkeithle at redhat.com Fri Jul 3 13:03:45 2020
From: kkeithle at redhat.com (Kaleb Keithley)
Date: Fri, 3 Jul 2020 09:03:45 -0400
Subject: [Gluster-users] Official Bugzilla?
In-Reply-To:
References:
Message-ID:

On Fri, Jul 3, 2020 at 6:46 AM lejeczek wrote:

> hi guys,
>
> where those of us who run gluster from (via) EPEL repo
> > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shreyansh.shah at alpha-grep.com Mon Jul 6 08:41:04 2020 From: shreyansh.shah at alpha-grep.com (Shreyansh Shah) Date: Mon, 6 Jul 2020 14:11:04 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Hi, Did anyone get a chance to look into this? On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah wrote: > Hi All, > > *We are facing "Mismatching layouts for ,gfid = " errors.* > > We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) > on each node, 7 nodes in total. We added new bricks yesterday to the > existing setup. > Post that we did a rebalance fix-layout and then a rebalance (which is > currently still in progress). The status shows "failed" on certain bricks > but "in progress" for others. Adding output for gluster rebalance status > below. > > The glusterfs client logs are flooded with "Mismatching layouts for > ,gfid = " > The performance too seems to have degraded due to this, even basic > commands like `cd` and `ls` are taking more than a minute compared to > sub-second number before brick addition. > Apart from that we also experienced many binaries and files giving error > stale file handle error even though the files were present. > > > *gluster rebalance status :* > > Node Rebalanced-files size scanned failures > skipped status run time in h:m:s > --------- ----------- ----------- ----------- ----------- > ----------- ------------ -------------- > localhost 176 3.5GB 12790 0 > 8552 in progress 21:36:01 > 10.132.0.72 8232 394.8GB 19995 21 > 26 failed 14:50:30 > 10.132.0.44 12625 1.0TB 50023 1 > 10202 in progress 21:36:00 > 10.132.0.3 21982 956.8GB 79145 1 > 34571 in progress 21:36:00 > 10.132.0.9 7975 355.8GB 20157 6 > 1522 failed 14:51:45 > 10.132.0.73 6293 394.5GB 26414 151 > 8085 failed 14:51:45 > 10.132.0.70 6480 477.1GB 21058 27 > 1787 failed 14:50:32 > Estimated time left for rebalance to complete : 130:56:28 > > > *Logs from one of the clients below:* > > [2020-07-02 12:30:14.971916] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk > layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:14.971935] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032013] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk > layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.032059] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032107] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk > layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.032153] I [MSGID: 109018] > 
[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.093329] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk > layout - 2454306944 - 2761060379 - 4159036738 > [2020-07-02 12:30:15.093373] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.093460] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk > layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:15.093515] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151063] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk > layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.151108] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151149] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk > layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.151162] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.424321] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk > layout - 920400036 - 1227153471 - 4159036738 > [2020-07-02 12:30:15.424380] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424456] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk > layout - 1840730208 - 2147483643 - 4159036738 > [2020-07-02 12:30:15.424484] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424525] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk > layout - 1533976772 - 1840730207 - 4159036738 > [2020-07-02 12:30:15.424542] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424596] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk > layout - 613646600 - 920400035 - 4159036738 > [2020-07-02 12:30:15.424607] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts 
for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:16.004482] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on > data-client-7 (hashed subvol is data-client-17) > [2020-07-02 12:30:16.005523] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_1_DATA.dat > [2020-07-02 12:30:16.531047] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat > on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:16.532086] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_1_METADATA.dat > [2020-07-02 12:30:18.733229] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on > data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:18.734421] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_2_DATA.dat > [2020-07-02 12:30:19.171930] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat > on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:19.172901] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_2_METADATA.dat > [2020-07-02 12:30:21.028495] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on > data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:21.029836] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_2_DATA.dat > [2020-07-02 12:30:21.127648] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat > on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:21.128713] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_2_METADATA.dat > [2020-07-02 12:30:21.201126] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on > data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.201928] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_3_DATA.dat > [2020-07-02 12:30:21.566158] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat > on data-client-7 (hashed subvol is 
data-client-16) > [2020-07-02 12:30:21.567123] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_3_METADATA.dat > [2020-07-02 12:30:21.649357] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on > data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:21.661381] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_4_DATA.dat > [2020-07-02 12:30:21.748937] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat > on data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.749481] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_4_METADATA.dat > [2020-07-02 12:30:21.898593] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on > data-client-14 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.899442] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_6_DATA.dat > [2020-07-02 12:30:22.039337] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat > on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:22.040086] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_6_METADATA.dat > [2020-07-02 12:30:22.501877] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat > on data-client-15 (hashed subvol is data-client-8) > [2020-07-02 12:30:22.502712] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECDS1_DATA.dat > [2020-07-02 12:30:22.782577] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 > (hashed subvol is data-client-6) > [2020-07-02 12:30:22.783777] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECDS1_METADATA.dat > [2020-07-02 12:30:23.146847] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on > data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:23.148009] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM1_DATA.dat > [2020-07-02 12:30:23.229290] I [MSGID: 109045] > 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed > subvol is data-client-6) > [2020-07-02 12:30:23.230151] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM1_METADATA.dat > [2020-07-02 12:30:23.889520] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on > data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:23.896618] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM2_DATA.dat > [2020-07-02 12:30:24.093017] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:24.094117] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM2_METADATA.dat > [2020-07-02 12:30:24.345257] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on > data-client-17 (hashed subvol is data-client-10) > [2020-07-02 12:30:24.346234] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM3_DATA.dat > [2020-07-02 12:30:24.425835] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:24.426880] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM3_METADATA.dat > [2020-07-02 12:30:25.158718] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat > on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:25.159619] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO1_DATA.dat > [2020-07-02 12:30:25.531479] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed > subvol is data-client-10) > [2020-07-02 12:30:25.540569] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO1_METADATA.dat > [2020-07-02 12:30:25.771692] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat > on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:25.772610] I [MSGID: 109069] > 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO2_DATA.dat > [2020-07-02 12:30:25.866118] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 > (hashed subvol is data-client-8) > [2020-07-02 12:30:25.866917] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO2_METADATA.dat > [2020-07-02 12:30:26.424386] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat > on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:26.425309] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO3_DATA.dat > [2020-07-02 12:30:26.818852] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 > (hashed subvol is data-client-2) > [2020-07-02 12:30:26.819890] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO3_METADATA.dat > [2020-07-02 12:30:27.352405] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat > on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:27.352914] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO4_DATA.dat > [2020-07-02 12:30:27.521286] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed > subvol is data-client-18) > [2020-07-02 12:30:27.522325] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO4_METADATA.dat > [2020-07-02 12:30:28.566634] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat > on data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:28.579295] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO5_DATA.dat > [2020-07-02 12:30:28.958028] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat > on data-client-7 (hashed subvol is data-client-16) > [2020-07-02 12:30:28.959102] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO6_DATA.dat > [2020-07-02 12:30:29.012429] I [MSGID: 109045] > 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:29.013416] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO6_METADATA.dat > [2020-07-02 12:30:29.396716] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on > data-client-17 (hashed subvol is data-client-10) > [2020-07-02 12:30:29.397740] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSEFO_BSE_TSDATA.dat > [2020-07-02 12:30:29.556312] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed > subvol is data-client-18) > [2020-07-02 12:30:29.557197] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat > [2020-07-02 12:30:30.605354] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 > (hashed subvol is data-client-19) > [2020-07-02 12:30:30.606117] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat > [2020-07-02 12:30:31.559206] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - > 613576736 - 920330171 - 4159036738 > [2020-07-02 12:30:31.559255] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.569025] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - > 920330172 - 1227083607 - 4159036738 > [2020-07-02 12:30:31.569067] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.701849] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - > 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.701895] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX, gfid = > fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.738464] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - > 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:31.738507] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX, gfid = > 
fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.857102] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk > layout - 3067883680 - 3374637115 - 4159036738 > [2020-07-02 12:30:31.857147] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.857180] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk > layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.857197] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917705] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 > - 306753435 - 4159036738 > [2020-07-02 12:30:31.917781] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917855] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk > layout - 3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:31.917874] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:32.390945] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - > 3681460416 - 3988213851 - 4159036738 > [2020-07-02 12:30:32.390998] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/NIFTY, gfid = > b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:30:32.391056] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - > 3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:32.391075] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/NIFTY, gfid = > b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:33:50.915279] I [MSGID: 109066] > [dht-rename.c:1922:dht_rename] 4-data-dht: renaming > /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 > (2cb54500-814d-4e85-83e7-e33d9440b18d) > (hash=data-client-6/cache=data-client-18) => > /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) > (hash=data-client-6/cache=) > [2020-07-02 12:34:09.799586] I [MSGID: 109066] > [dht-rename.c:1922:dht_rename] 4-data-dht: renaming > /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k > (99938ee6-6986-4123-9d72-ec09e2310b4f) > (hash=data-client-17/cache=data-client-18) => > /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) > (hash=data-client-17/cache=) > .... > > > Please look into this at top-priority if possible. > Let me know if anything else is required. 
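A note on the "inode layout"/"disk layout" pairs in the messages above: these are DHT hash ranges, the copy the client has cached for a directory versus the trusted.glusterfs.dht extended attribute currently stored on that directory on each brick. To compare the on-disk ranges directly, a quick check on a brick node looks roughly like the following -- the brick path here is only an example, substitute a real brick root from the volume status output, and note that trusted.* xattrs are only readable as root:

  # hex-encoded DHT layout of one directory, read straight from one brick
  getfattr -n trusted.glusterfs.dht -e hex /path/to/brick/raw_data/BSE_EOBI/20200630
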
> > > -- > Regards, > Shreyansh Shah > -- Regards, Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsasonro at redhat.com Mon Jul 6 17:13:34 2020 From: bsasonro at redhat.com (Barak Sason Rofman) Date: Mon, 6 Jul 2020 20:13:34 +0300 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Greetings Shreyansh, Off-hand I can't come up with a reason for these failures. In order to start looking into this, access to the full rebalance logs is required (possibly brick logs as well). Can you provide those? My regards, On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < shreyansh.shah at alpha-grep.com> wrote: > Hi, > Did anyone get a chance to look into this? > > On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < > shreyansh.shah at alpha-grep.com> wrote: > >> Hi All, >> >> *We are facing "Mismatching layouts for ,gfid = " errors.* >> >> We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) >> on each node, 7 nodes in total. We added new bricks yesterday to the >> existing setup. >> Post that we did a rebalance fix-layout and then a rebalance (which is >> currently still in progress). The status shows "failed" on certain bricks >> but "in progress" for others. Adding output for gluster rebalance status >> below. >> >> The glusterfs client logs are flooded with "Mismatching layouts for >> ,gfid = " >> The performance too seems to have degraded due to this, even basic >> commands like `cd` and `ls` are taking more than a minute compared to >> sub-second number before brick addition. >> Apart from that we also experienced many binaries and files giving error >> stale file handle error even though the files were present. 
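For reference, the brick-addition and rebalance sequence described in the quoted message maps to roughly this CLI workflow (the volume name "data" is inferred from the "4-data-dht" prefix in the client logs; the host and brick paths below are placeholders, not the real ones):

  gluster volume add-brick data newhost:/bricks/brick1/data newhost:/bricks/brick2/data
  gluster volume rebalance data fix-layout start
  gluster volume rebalance data start
  gluster volume rebalance data status

The rebalance logs requested earlier in this message are written by default to /var/log/glusterfs/<VOLNAME>-rebalance.log on every node that runs a rebalance process, and the per-brick logs live under /var/log/glusterfs/bricks/.
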
>> >> >> *gluster rebalance status :* >> >> Node Rebalanced-files size scanned failures >> skipped status run time in h:m:s >> --------- ----------- ----------- ----------- ----------- >> ----------- ------------ -------------- >> localhost 176 3.5GB 12790 0 >> 8552 in progress 21:36:01 >> 10.132.0.72 8232 394.8GB 19995 21 >> 26 failed 14:50:30 >> 10.132.0.44 12625 1.0TB 50023 1 >> 10202 in progress 21:36:00 >> 10.132.0.3 21982 956.8GB 79145 1 >> 34571 in progress 21:36:00 >> 10.132.0.9 7975 355.8GB 20157 6 >> 1522 failed 14:51:45 >> 10.132.0.73 6293 394.5GB 26414 151 >> 8085 failed 14:51:45 >> 10.132.0.70 6480 477.1GB 21058 27 >> 1787 failed 14:50:32 >> Estimated time left for rebalance to complete : 130:56:28 >> >> >> *Logs from one of the clients below:* >> >> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >> layout - 2761060380 - 3067813815 - 4159036738 >> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >> layout - 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >> layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >> layout - 2454306944 - 2761060379 - 4159036738 >> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >> layout - 2761060380 - 3067813815 - 4159036738 >> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >> layout - 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; 
inode layout - 3374637116 - 3681390551 - 3997647794; disk >> layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >> layout - 920400036 - 1227153471 - 4159036738 >> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >> layout - 1840730208 - 2147483643 - 4159036738 >> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >> layout - 1533976772 - 1840730207 - 4159036738 >> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >> layout - 613646600 - 920400035 - 4159036738 >> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >> data-client-7 (hashed subvol is data-client-17) >> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_CDS_1_DATA.dat >> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat >> on data-client-9 (hashed subvol is data-client-19) >> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_CDS_1_METADATA.dat >> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >> data-client-17 (hashed subvol is data-client-9) >> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_CDS_2_DATA.dat >> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >> 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >> on data-client-9 (hashed subvol is data-client-18) >> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_CDS_2_METADATA.dat >> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >> data-client-6 (hashed subvol is data-client-15) >> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_2_DATA.dat >> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >> on data-client-11 (hashed subvol is data-client-3) >> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_2_METADATA.dat >> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >> data-client-15 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_3_DATA.dat >> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >> on data-client-7 (hashed subvol is data-client-16) >> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_3_METADATA.dat >> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >> data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_4_DATA.dat >> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >> on data-client-15 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_4_METADATA.dat >> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >> data-client-14 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >> 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_6_DATA.dat >> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat >> on data-client-10 (hashed subvol is data-client-2) >> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/BSE_EQ_6_METADATA.dat >> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >> on data-client-15 (hashed subvol is data-client-8) >> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECDS1_DATA.dat >> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >> (hashed subvol is data-client-6) >> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on >> data-client-17 (hashed subvol is data-client-9) >> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM1_DATA.dat >> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >> subvol is data-client-6) >> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM1_METADATA.dat >> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >> data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM2_DATA.dat >> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >> subvol is data-client-15) >> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM2_METADATA.dat >> [2020-07-02 12:30:24.345257] I 
[MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >> data-client-17 (hashed subvol is data-client-10) >> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM3_DATA.dat >> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >> subvol is data-client-15) >> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSECM3_METADATA.dat >> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >> on data-client-9 (hashed subvol is data-client-19) >> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >> subvol is data-client-10) >> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >> on data-client-11 (hashed subvol is data-client-3) >> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >> (hashed subvol is data-client-8) >> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >> on data-client-9 (hashed subvol is data-client-18) >> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >> 
(hashed subvol is data-client-2) >> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >> on data-client-10 (hashed subvol is data-client-2) >> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >> subvol is data-client-18) >> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >> on data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >> on data-client-7 (hashed subvol is data-client-16) >> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >> subvol is data-client-15) >> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >> data-client-17 (hashed subvol is data-client-10) >> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >> subvol is data-client-18) >> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 
0 for >> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >> deletion of stale linkfile >> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >> (hashed subvol is data-client-19) >> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >> returned with op_ret -> 0 and op-errno -> 0 for >> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >> 613576736 - 920330171 - 4159036738 >> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - >> 920330172 - 1227083607 - 4159036738 >> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >> 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX, gfid = >> fff324f2-f855-4881-b77c-81e856522373 >> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >> 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX, gfid = >> fff324f2-f855-4881-b77c-81e856522373 >> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >> layout - 3067883680 - 3374637115 - 4159036738 >> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >> f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >> layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >> f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >> - 306753435 - 4159036738 >> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 
4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >> f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >> layout - 3988213852 - 4294967295 - 4159036738 >> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >> f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >> 3681460416 - 3988213851 - 4159036738 >> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >> 3988213852 - 4294967295 - 4159036738 >> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >> (2cb54500-814d-4e85-83e7-e33d9440b18d) >> (hash=data-client-6/cache=data-client-18) => >> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >> (hash=data-client-6/cache=) >> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >> (99938ee6-6986-4123-9d72-ec09e2310b4f) >> (hash=data-client-17/cache=data-client-18) => >> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >> (hash=data-client-17/cache=) >> .... >> >> >> Please look into this at top-priority if possible. >> Let me know if anything else is required. >> >> >> -- >> Regards, >> Shreyansh Shah >> > > > -- > Regards, > Shreyansh Shah > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -- *Barak Sason Rofman* Gluster Storage Development Red Hat Israel 34 Jerusalem rd. Ra'anana, 43501 bsasonro at redhat.com T: *+972-9-7692304* M: *+972-52-4326355* @RedHat Red Hat Red Hat -------------- next part -------------- An HTML attachment was scrubbed... URL: From bsasonro at redhat.com Mon Jul 6 17:38:16 2020 From: bsasonro at redhat.com (Barak Sason Rofman) Date: Mon, 6 Jul 2020 20:38:16 +0300 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: I think it would be best. As I can't say at this point where the problem is originating from, brick logs might also be necessary (I assume I would have a better picture once I have the rebalance logs). Cheers, On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah wrote: > Hi Barak, > Can provide the rebalance logs. 
Do you require all the brick logs (14 in > total)? > > On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman > wrote: > >> Greetings Shreyansh, >> >> Off-hand I can't come up with a reason for these failures. >> In order to start looking into this, access to the full rebalance logs is >> required (possibly brick logs as well). >> Can you provide those? >> >> My regards, >> >> >> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >> shreyansh.shah at alpha-grep.com> wrote: >> >>> Hi, >>> Did anyone get a chance to look into this? >>> >>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>> shreyansh.shah at alpha-grep.com> wrote: >>> >>>> Hi All, >>>> >>>> *We are facing "Mismatching layouts for ,gfid = " >>>> errors.* >>>> >>>> We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB >>>> each) on each node, 7 nodes in total. We added new bricks yesterday to the >>>> existing setup. >>>> Post that we did a rebalance fix-layout and then a rebalance (which is >>>> currently still in progress). The status shows "failed" on certain bricks >>>> but "in progress" for others. Adding output for gluster rebalance status >>>> below. >>>> >>>> The glusterfs client logs are flooded with "Mismatching layouts for >>>> ,gfid = " >>>> The performance too seems to have degraded due to this, even basic >>>> commands like `cd` and `ls` are taking more than a minute compared to >>>> sub-second number before brick addition. >>>> Apart from that we also experienced many binaries and files giving >>>> error stale file handle error even though the files were present. >>>> >>>> >>>> *gluster rebalance status :* >>>> >>>> Node Rebalanced-files size scanned failures >>>> skipped status run time in h:m:s >>>> --------- ----------- ----------- ----------- ----------- >>>> ----------- ------------ -------------- >>>> localhost 176 3.5GB 12790 0 >>>> 8552 in progress 21:36:01 >>>> 10.132.0.72 8232 394.8GB 19995 21 >>>> 26 failed 14:50:30 >>>> 10.132.0.44 12625 1.0TB 50023 1 >>>> 10202 in progress 21:36:00 >>>> 10.132.0.3 21982 956.8GB 79145 1 >>>> 34571 in progress 21:36:00 >>>> 10.132.0.9 7975 355.8GB 20157 6 >>>> 1522 failed 14:51:45 >>>> 10.132.0.73 6293 394.5GB 26414 151 >>>> 8085 failed 14:51:45 >>>> 10.132.0.70 6480 477.1GB 21058 27 >>>> 1787 failed 14:50:32 >>>> Estimated time left for rebalance to complete : 130:56:28 >>>> >>>> >>>> *Logs from one of the clients below:* >>>> >>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >>>> layout - 2761060380 - 3067813815 - 4159036738 >>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >>>> layout - 3681390552 - 3988143987 - 4159036738 >>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>> layout - 3374637116 - 3681390551 - 4159036738 
>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >>>> layout - 2454306944 - 2761060379 - 4159036738 >>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >>>> layout - 2761060380 - 3067813815 - 4159036738 >>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >>>> layout - 3681390552 - 3988143987 - 4159036738 >>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk >>>> layout - 3374637116 - 3681390551 - 4159036738 >>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >>>> layout - 920400036 - 1227153471 - 4159036738 >>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >>>> layout - 1840730208 - 2147483643 - 4159036738 >>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >>>> layout - 1533976772 - 1840730207 - 4159036738 >>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-10; inode 
layout - 613646600 - 920400035 - 3997647794; disk >>>> layout - 613646600 - 920400035 - 4159036738 >>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >>>> data-client-7 (hashed subvol is data-client-17) >>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>> on data-client-9 (hashed subvol is data-client-19) >>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >>>> data-client-17 (hashed subvol is data-client-9) >>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>> on data-client-9 (hashed subvol is data-client-18) >>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >>>> data-client-6 (hashed subvol is data-client-15) >>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>> on data-client-11 (hashed subvol is data-client-3) >>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >>>> data-client-15 (hashed subvol is data-client-7) >>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>> 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>> on data-client-7 (hashed subvol is data-client-16) >>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >>>> data-client-2 (hashed subvol is data-client-11) >>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>> on data-client-15 (hashed subvol is data-client-7) >>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >>>> data-client-14 (hashed subvol is data-client-7) >>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>> on data-client-10 (hashed subvol is data-client-2) >>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>> on data-client-15 (hashed subvol is data-client-8) >>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >>>> (hashed subvol is data-client-6) >>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> 
/processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on >>>> data-client-17 (hashed subvol is data-client-9) >>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >>>> subvol is data-client-6) >>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >>>> data-client-2 (hashed subvol is data-client-11) >>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >>>> subvol is data-client-15) >>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >>>> data-client-17 (hashed subvol is data-client-10) >>>> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >>>> subvol is data-client-15) >>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>> on data-client-9 (hashed subvol is data-client-19) >>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>> [2020-07-02 12:30:25.531479] I 
[MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >>>> subvol is data-client-10) >>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>> on data-client-11 (hashed subvol is data-client-3) >>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >>>> (hashed subvol is data-client-8) >>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>> on data-client-9 (hashed subvol is data-client-18) >>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >>>> (hashed subvol is data-client-2) >>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>> on data-client-10 (hashed subvol is data-client-2) >>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >>>> subvol is data-client-18) >>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>> 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>> on data-client-2 (hashed subvol is data-client-11) >>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>> on data-client-7 (hashed subvol is data-client-16) >>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >>>> subvol is data-client-15) >>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>> data-client-17 (hashed subvol is data-client-10) >>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >>>> subvol is data-client-18) >>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>> deletion of stale linkfile >>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >>>> (hashed subvol is data-client-19) >>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>> returned with op_ret -> 0 and op-errno -> 0 for >>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >>>> 613576736 - 920330171 - 4159036738 >>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 920330172 - 
1227083607 - 1; disk layout - >>>> 920330172 - 1227083607 - 4159036738 >>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >>>> 3374637116 - 3681390551 - 4159036738 >>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX, gfid = >>>> fff324f2-f855-4881-b77c-81e856522373 >>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >>>> 3681390552 - 3988143987 - 4159036738 >>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX, gfid = >>>> fff324f2-f855-4881-b77c-81e856522373 >>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >>>> layout - 3067883680 - 3374637115 - 4159036738 >>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>> f8447150-4801-4188-add9-ea295bb88729 >>>> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>> layout - 3374637116 - 3681390551 - 4159036738 >>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>> f8447150-4801-4188-add9-ea295bb88729 >>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >>>> - 306753435 - 4159036738 >>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>> f8447150-4801-4188-add9-ea295bb88729 >>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >>>> layout - 3988213852 - 4294967295 - 4159036738 >>>> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>> f8447150-4801-4188-add9-ea295bb88729 >>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >>>> 3681460416 - 3988213851 - 4159036738 >>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>> 
b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >>>> 3988213852 - 4294967295 - 4159036738 >>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>> (hash=data-client-6/cache=data-client-18) => >>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>> (hash=data-client-6/cache=) >>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>> (hash=data-client-17/cache=data-client-18) => >>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>> (hash=data-client-17/cache=) >>>> .... >>>> >>>> >>>> Please look into this at top-priority if possible. >>>> Let me know if anything else is required. >>>> >>>> >>>> -- >>>> Regards, >>>> Shreyansh Shah >>>> >>> >>> >>> -- >>> Regards, >>> Shreyansh Shah >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >> >> >> -- >> *Barak Sason Rofman* >> >> Gluster Storage Development >> >> Red Hat Israel >> >> 34 Jerusalem rd. Ra'anana, 43501 >> >> bsasonro at redhat.com T: *+972-9-7692304* >> M: *+972-52-4326355* >> @RedHat Red Hat >> Red Hat >> >> >> > > > -- > Regards, > Shreyansh Shah > -- *Barak Sason Rofman* Gluster Storage Development Red Hat Israel 34 Jerusalem rd. Ra'anana, 43501 bsasonro at redhat.com T: *+972-9-7692304* M: *+972-52-4326355* @RedHat Red Hat Red Hat -------------- next part -------------- An HTML attachment was scrubbed... URL: From shanondink at gmail.com Mon Jul 6 20:32:28 2020 From: shanondink at gmail.com (Shanon Swafford) Date: Mon, 6 Jul 2020 15:32:28 -0500 Subject: [Gluster-users] Restore a replica after failed hardware Message-ID: <008b01d653d4$91285500$b378ff00$@gmail.com> Hi guys, I lost a brick in a 2x replicated system. The volume is 17TB with 9TB used ( small files ). 3 drives failed in 2 hours in a raid-5 array. Gluster version: 3.8.15 So "reset-brick" isn't available on this version. I've googled all weekend and I'm overwhelmed so I'd like to verify before I muck everything up. Is this the correct procedure to restore the failed brick? # Replace drive # Use parted to create /dev/sdb1 # Make xfs filesystem on /dev/sdb1 # Mount /var/glusterfs/sdb1 # gluster volume replace-brick myvol d-es2-nfs-a:/var/glusterfs/sdb1/myvol d-es2-nfs-a:/var/glusterfs/sdb1/myvol commit force I read about using different brick names but again, I'm overwhelmed with all the info on google. I also saw something as simple as remove failed and re-add as new but.. Now I just read about xfsdump | xfsrestore to preload, but how would that work with healing? Thanks a ton in advance. 
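Spelled out as commands, the sequence above would look roughly like the sketch below. This is only an illustration: the volume name and brick path are taken from the volume info further down, the mkfs options are generic placeholders, and whether re-using the exact same brick path (rather than a fresh directory name) is accepted on 3.8 is precisely the open question.

# after the failed drives are replaced and the array rebuilt
parted -s /dev/sdb mklabel gpt
parted -s /dev/sdb mkpart primary xfs 0% 100%

# recreate the brick filesystem and mount it (fstab already carries the entry)
mkfs.xfs -i size=512 /dev/sdb1
mount /var/glusterfs/sdb1
mkdir -p /var/glusterfs/sdb1/myvol

# re-introduce the brick under the same name; on a replica volume self-heal
# then repopulates it from the surviving copy
gluster volume replace-brick myvol \
    d-es2-nfs-a:/var/glusterfs/sdb1/myvol \
    d-es2-nfs-a:/var/glusterfs/sdb1/myvol commit force

# watch the heal queue drain
gluster volume heal myvol info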
Shanon

[root at es2-nfs-a ~]# parted /dev/sdb print
Error: /dev/sdb: unrecognised disk label
Model: DELL PERC H700 (scsi)
Disk /dev/sdb: 18.2TB
Sector size (logical/physical): 512B/512B
Partition Table: unknown
Disk Flags:

[root at es2-nfs-a ~]# grep sdb /etc/fstab
/dev/sdb1 /var/glusterfs/sdb1 xfs inode64 0 0

[root at es2-nfs-a ~]# gluster volume info

Volume Name: myvol
Type: Replicate
Volume ID: 49fd2a63-f887-4478-9242-69030a7a565d
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x 2 = 2
Transport-type: tcp
Bricks:
Brick1: d-es2-nfs-a:/var/glusterfs/sdb1/myvol
Brick2: d-es2-nfs-b:/var/glusterfs/sdb1/myvol
Options Reconfigured:
nfs.disable: on
performance.readdir-ahead: on
transport.address-family: inet
performance.cache-size: 1GB

[root at es2-nfs-a ~]# gluster volume status
Status of volume: myvol
Gluster process                              TCP Port  RDMA Port  Online  Pid
------------------------------------------------------------------------------
Brick d-es2-nfs-a:/var/glusterfs/sdb1/myvol
Brick d-es2-nfs-b:/var/glusterfs/sdb1/myvol
Self-heal Daemon on localhost                N/A       N/A        Y       2475
Self-heal Daemon on d-es2-nfs-b              N/A       N/A        Y       8663

Task Status of Volume myvol
------------------------------------------------------------------------------
There are no active volume tasks

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From hunter86_bg at yahoo.com Tue Jul 7 04:02:29 2020
From: hunter86_bg at yahoo.com (Strahil Nikolov)
Date: Tue, 07 Jul 2020 07:02:29 +0300
Subject: [Gluster-users] Restore a replica after failed hardware
In-Reply-To: <008b01d653d4$91285500$b378ff00$@gmail.com>
References: <008b01d653d4$91285500$b378ff00$@gmail.com>
Message-ID:

It looks OK. Usually the docs show 'replace-brick source destination start', which stops the brick process, and I do it this way.

Also, you should consider:
- adding 'noatime' to the mount options
- checking what your stripe width is (stride multiplied by data disks) and then creating the xfs filesystem with the necessary options to align it properly.

I have never used xfsdump to recover a brick. Just ensure the gluster brick process is not running on the node during the restore.

Best Regards,
Strahil Nikolov

On 6 July 2020 23:32:28 GMT+03:00, Shanon Swafford wrote:
>Hi guys,
>
>
>
>I lost a brick in a 2x replicated system.  The volume is 17TB with 9TB
>used
>( small files ).  3 drives failed in 2 hours in a raid-5 array.
>
>
>
>Gluster version: 3.8.15
>
>
>
>So "reset-brick" isn't available on this version.
>
>
>
>I've googled all weekend and I'm overwhelmed so I'd like to verify
>before I
>muck everything up.
>
>
>
>Is this the correct procedure to restore the failed brick?
>
>
>
># Replace drive
>
># Use parted to create /dev/sdb1
>
># Make xfs filesystem on /dev/sdb1
>
># Mount /var/glusterfs/sdb1
>
># gluster volume replace-brick myvol
>d-es2-nfs-a:/var/glusterfs/sdb1/myvol
>d-es2-nfs-a:/var/glusterfs/sdb1/myvol commit force
>
>
>
>I read about using different brick names but again, I'm overwhelmed
>with all
>the info on google.
>
>
>
>I also saw something as simple as remove failed and re-add as new but..
>
>
>
>Now I just read about xfsdump | xfsrestore to preload, but how would
>that
>work with healing?
>
>
>
>Thanks a ton in advance.
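To make the two suggestions in the reply above concrete, a minimal example follows. It assumes a hypothetical six-disk RAID-5 set (five data disks) with a 256 KiB per-disk stripe unit; the real numbers have to be read from the PERC controller before running mkfs.

# su = per-disk stripe unit, sw = number of data disks (stripe width = su x sw)
mkfs.xfs -i size=512 -d su=256k,sw=5 /dev/sdb1

# mount with noatime as well, and carry the same options into /etc/fstab
mount -o inode64,noatime /dev/sdb1 /var/glusterfs/sdb1

# confirm the stripe alignment the filesystem actually picked up
xfs_info /var/glusterfs/sdb1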
> > > >Shanon > > > > > > > >[root at es2-nfs-a ~]# parted /dev/sdb print > >Error: /dev/sdb: unrecognised disk label > >Model: DELL PERC H700 (scsi) > >Disk /dev/sdb: 18.2TB > >Sector size (logical/physical): 512B/512B > >Partition Table: unknown > >Disk Flags: > > > > > >[root at es2-nfs-a ~]# grep sdb /etc/fstab > >/dev/sdb1 /var/glusterfs/sdb1 xfs inode64 >0 0 > > > > > >[root at es2-nfs-a ~]# gluster volume info > > > >Volume Name: myvol > >Type: Replicate > >Volume ID: 49fd2a63-f887-4478-9242-69030a7a565d > >Status: Started > >Snapshot Count: 0 > >Number of Bricks: 1 x 2 = 2 > >Transport-type: tcp > >Bricks: > >Brick1: d-es2-nfs-a:/var/glusterfs/sdb1/myvol > >Brick2: d-es2-nfs-b:/var/glusterfs/sdb1/myvol > >Options Reconfigured: > >nfs.disable: on > >performance.readdir-ahead: on > >transport.address-family: inet > >performance.cache-size: 1GB > > > > > >[root at es2-nfs-a ~]# gluster volume status > >Status of volume: myvol > >Gluster process TCP Port RDMA Port Online > Pid > >---------------------------------------------------------------------------- >-- > >Brick d-es2-nfs-a:/var/glusterfs/sdb1/myvol > >Brick d-es2-nfs-b:/var/glusterfs/sdb1/myvol > >Self-heal Daemon on localhost N/A N/A Y >2475 > >Self-heal Daemon on d-es2-nfs-b N/A N/A Y >8663 > > > >Task Status of Volume myvol > >---------------------------------------------------------------------------- >-- > >There are no active volume tasks > > > > From bsasonro at redhat.com Tue Jul 7 08:42:36 2020 From: bsasonro at redhat.com (Barak Sason Rofman) Date: Tue, 7 Jul 2020 11:42:36 +0300 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Thanks Shreyansh, I'll look into it, however I'll likely need some help from more senior team members to perform RCA. I'll update once I have new insights. My regards, On Tue, Jul 7, 2020 at 11:40 AM Shreyansh Shah < shreyansh.shah at alpha-grep.com> wrote: > Hi Barak, > Thanks for looking into this and helping me out, > The fix-layout was successful, and I ran a rebalance after completion of > fix-layout. > The rebalance status though did show failure for 3 nodes. > > On Tue, Jul 7, 2020 at 2:07 PM Barak Sason Rofman > wrote: > >> Greetings again Shreyansh, >> >> I'm indeed seeing a lot of errors in the log file - still unsure about >> the RC. >> You mentioned that prior to running rebalance you ran fix-layout, was the >> fix-layout successful? >> Another question - did you wait until fix-layout was completed before >> running rebalance? >> >> My thanks, >> >> On Mon, Jul 6, 2020 at 9:33 PM Shreyansh Shah < >> shreyansh.shah at alpha-grep.com> wrote: >> >>> Hi, >>> Attaching rebalance logs >>> FYI, we ran "gluster rebalance fix-layout" followed by "gluster >>> rebalance" on 20200701 and today we again ran "gluster rebalance fix-layout" >>> >>> >>> PFA >>> >>> On Mon, Jul 6, 2020 at 11:08 PM Barak Sason Rofman >>> wrote: >>> >>>> I think it would be best. >>>> As I can't say at this point where the problem is originating from, >>>> brick logs might also be necessary (I assume I would have a better picture >>>> once I have the rebalance logs). >>>> >>>> Cheers, >>>> >>>> On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah < >>>> shreyansh.shah at alpha-grep.com> wrote: >>>> >>>>> Hi Barak, >>>>> Can provide the rebalance logs. Do you require all the brick logs (14 >>>>> in total)? 
>>>>> >>>>> On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman < >>>>> bsasonro at redhat.com> wrote: >>>>> >>>>>> Greetings Shreyansh, >>>>>> >>>>>> Off-hand I can't come up with a reason for these failures. >>>>>> In order to start looking into this, access to the full rebalance >>>>>> logs is required (possibly brick logs as well). >>>>>> Can you provide those? >>>>>> >>>>>> My regards, >>>>>> >>>>>> >>>>>> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>> >>>>>>> Hi, >>>>>>> Did anyone get a chance to look into this? >>>>>>> >>>>>>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>> >>>>>>>> Hi All, >>>>>>>> >>>>>>>> *We are facing "Mismatching layouts for ,gfid = " >>>>>>>> errors.* >>>>>>>> >>>>>>>> We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB >>>>>>>> each) on each node, 7 nodes in total. We added new bricks yesterday to the >>>>>>>> existing setup. >>>>>>>> Post that we did a rebalance fix-layout and then a rebalance (which >>>>>>>> is currently still in progress). The status shows "failed" on certain >>>>>>>> bricks but "in progress" for others. Adding output for gluster rebalance >>>>>>>> status below. >>>>>>>> >>>>>>>> The glusterfs client logs are flooded with "Mismatching layouts for >>>>>>>> ,gfid = " >>>>>>>> The performance too seems to have degraded due to this, even basic >>>>>>>> commands like `cd` and `ls` are taking more than a minute compared to >>>>>>>> sub-second number before brick addition. >>>>>>>> Apart from that we also experienced many binaries and files giving >>>>>>>> error stale file handle error even though the files were present. >>>>>>>> >>>>>>>> >>>>>>>> *gluster rebalance status :* >>>>>>>> >>>>>>>> Node Rebalanced-files size scanned failures >>>>>>>> skipped status run time in h:m:s >>>>>>>> --------- ----------- ----------- ----------- >>>>>>>> ----------- ----------- ------------ -------------- >>>>>>>> localhost 176 3.5GB 12790 >>>>>>>> 0 8552 in progress 21:36:01 >>>>>>>> 10.132.0.72 8232 394.8GB 19995 >>>>>>>> 21 26 failed 14:50:30 >>>>>>>> 10.132.0.44 12625 1.0TB 50023 >>>>>>>> 1 10202 in progress 21:36:00 >>>>>>>> 10.132.0.3 21982 956.8GB 79145 >>>>>>>> 1 34571 in progress 21:36:00 >>>>>>>> 10.132.0.9 7975 355.8GB 20157 >>>>>>>> 6 1522 failed 14:51:45 >>>>>>>> 10.132.0.73 6293 394.5GB 26414 >>>>>>>> 151 8085 failed 14:51:45 >>>>>>>> 10.132.0.70 6480 477.1GB 21058 >>>>>>>> 27 1787 failed 14:50:32 >>>>>>>> Estimated time left for rebalance to complete : 130:56:28 >>>>>>>> >>>>>>>> >>>>>>>> *Logs from one of the clients below:* >>>>>>>> >>>>>>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> 
/raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >>>>>>>> layout - 2454306944 - 2761060379 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk >>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >>>>>>>> layout - 920400036 - 1227153471 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >>>>>>>> layout - 1840730208 - 2147483643 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >>>>>>>> 
[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >>>>>>>> layout - 1533976772 - 1840730207 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >>>>>>>> layout - 613646600 - 920400035 - 4159036738 >>>>>>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >>>>>>>> data-client-7 (hashed subvol is data-client-17) >>>>>>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>>>>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>>>>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >>>>>>>> data-client-6 (hashed subvol is data-client-15) >>>>>>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>>>>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] 
>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >>>>>>>> data-client-15 (hashed subvol is data-client-7) >>>>>>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>>>>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>>>>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>> on data-client-15 (hashed subvol is data-client-7) >>>>>>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >>>>>>>> data-client-14 (hashed subvol is data-client-7) >>>>>>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>>>>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> 
/processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>> on data-client-15 (hashed subvol is data-client-8) >>>>>>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >>>>>>>> (hashed subvol is data-client-6) >>>>>>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>>>>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on >>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>>>>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >>>>>>>> subvol is data-client-6) >>>>>>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>>>>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>>>>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >>>>>>>> subvol is data-client-15) >>>>>>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>>>>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>> [2020-07-02 12:30:24.346234] I [MSGID: 
109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>>>>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >>>>>>>> subvol is data-client-15) >>>>>>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>>>>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >>>>>>>> subvol is data-client-10) >>>>>>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>>>>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >>>>>>>> (hashed subvol is data-client-8) >>>>>>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>>>>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile 
>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >>>>>>>> (hashed subvol is data-client-2) >>>>>>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>>>>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >>>>>>>> subvol is data-client-18) >>>>>>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>>>>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>> on data-client-2 (hashed subvol is data-client-11) >>>>>>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >>>>>>>> subvol is data-client-15) >>>>>>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>>>>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>>>>>> 
[2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >>>>>>>> subvol is data-client-18) >>>>>>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>>>>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>> deletion of stale linkfile >>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >>>>>>>> (hashed subvol is data-client-19) >>>>>>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>>>>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >>>>>>>> 613576736 - 920330171 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - >>>>>>>> 920330172 - 1227083607 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >>>>>>>> 3374637116 - 3681390551 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >>>>>>>> 3681390552 - 3988143987 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >>>>>>>> layout - 3067883680 - 3374637115 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>> [2020-07-02 
12:30:31.857180] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >>>>>>>> - 306753435 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >>>>>>>> layout - 3988213852 - 4294967295 - 4159036738 >>>>>>>> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >>>>>>>> 3681460416 - 3988213851 - 4159036738 >>>>>>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >>>>>>>> 3988213852 - 4294967295 - 4159036738 >>>>>>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>>>>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>>>>>> (hash=data-client-6/cache=data-client-18) => >>>>>>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>>>>>> (hash=data-client-6/cache=) >>>>>>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>>>>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>>>>>> (hash=data-client-17/cache=data-client-18) => >>>>>>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>>>>>> (hash=data-client-17/cache=) >>>>>>>> .... >>>>>>>> >>>>>>>> >>>>>>>> Please look into this at top-priority if possible. >>>>>>>> Let me know if anything else is required. 
>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Regards, >>>>>>>> Shreyansh Shah >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Regards, >>>>>>> Shreyansh Shah >>>>>>> ________ >>>>>>> >>>>>>> >>>>>>> >>>>>>> Community Meeting Calendar: >>>>>>> >>>>>>> Schedule - >>>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>>> Bridge: https://bluejeans.com/441850968 >>>>>>> >>>>>>> Gluster-users mailing list >>>>>>> Gluster-users at gluster.org >>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> *Barak Sason Rofman* >>>>>> >>>>>> Gluster Storage Development >>>>>> >>>>>> Red Hat Israel >>>>>> >>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>> >>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>> M: *+972-52-4326355* >>>>>> @RedHat Red Hat >>>>>> Red Hat >>>>>> >>>>>> >>>>>> >>>>> >>>>> >>>>> -- >>>>> Regards, >>>>> Shreyansh Shah >>>>> >>>> >>>> >>>> -- >>>> *Barak Sason Rofman* >>>> >>>> Gluster Storage Development >>>> >>>> Red Hat Israel >>>> >>>> 34 Jerusalem rd. Ra'anana, 43501 >>>> >>>> bsasonro at redhat.com T: *+972-9-7692304* >>>> M: *+972-52-4326355* >>>> @RedHat Red Hat >>>> Red Hat >>>> >>>> >>>> >>> >>> >>> -- >>> Regards, >>> Shreyansh Shah >>> >> >> >> -- >> *Barak Sason Rofman* >> >> Gluster Storage Development >> >> Red Hat Israel >> >> 34 Jerusalem rd. Ra'anana, 43501 >> >> bsasonro at redhat.com T: *+972-9-7692304* >> M: *+972-52-4326355* >> @RedHat Red Hat >> Red Hat >> >> >> > > > -- > Regards, > Shreyansh Shah > -- *Barak Sason Rofman* Gluster Storage Development Red Hat Israel 34 Jerusalem rd. Ra'anana, 43501 bsasonro at redhat.com T: *+972-9-7692304* M: *+972-52-4326355* @RedHat Red Hat Red Hat -------------- next part -------------- An HTML attachment was scrubbed... URL: From shreyansh.shah at alpha-grep.com Tue Jul 7 08:46:52 2020 From: shreyansh.shah at alpha-grep.com (Shreyansh Shah) Date: Tue, 7 Jul 2020 14:16:52 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Sounds good, thank you. On Tue, Jul 7, 2020 at 2:12 PM Barak Sason Rofman wrote: > Thanks Shreyansh, > > I'll look into it, however I'll likely need some help from more senior > team members to perform RCA. > I'll update once I have new insights. > > My regards, > > On Tue, Jul 7, 2020 at 11:40 AM Shreyansh Shah < > shreyansh.shah at alpha-grep.com> wrote: > >> Hi Barak, >> Thanks for looking into this and helping me out, >> The fix-layout was successful, and I ran a rebalance after completion of >> fix-layout. >> The rebalance status though did show failure for 3 nodes. >> >> On Tue, Jul 7, 2020 at 2:07 PM Barak Sason Rofman >> wrote: >> >>> Greetings again Shreyansh, >>> >>> I'm indeed seeing a lot of errors in the log file - still unsure about >>> the RC. >>> You mentioned that prior to running rebalance you ran fix-layout, was >>> the fix-layout successful? >>> Another question - did you wait until fix-layout was completed before >>> running rebalance? >>> >>> My thanks, >>> >>> On Mon, Jul 6, 2020 at 9:33 PM Shreyansh Shah < >>> shreyansh.shah at alpha-grep.com> wrote: >>> >>>> Hi, >>>> Attaching rebalance logs >>>> FYI, we ran "gluster rebalance fix-layout" followed by "gluster >>>> rebalance" on 20200701 and today we again ran "gluster rebalance fix-layout" >>>> >>>> >>>> PFA >>>> >>>> On Mon, Jul 6, 2020 at 11:08 PM Barak Sason Rofman >>>> wrote: >>>> >>>>> I think it would be best. 
>>>>> As I can't say at this point where the problem is originating from, >>>>> brick logs might also be necessary (I assume I would have a better picture >>>>> once I have the rebalance logs). >>>>> >>>>> Cheers, >>>>> >>>>> On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah < >>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>> >>>>>> Hi Barak, >>>>>> Can provide the rebalance logs. Do you require all the brick logs (14 >>>>>> in total)? >>>>>> >>>>>> On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman < >>>>>> bsasonro at redhat.com> wrote: >>>>>> >>>>>>> Greetings Shreyansh, >>>>>>> >>>>>>> Off-hand I can't come up with a reason for these failures. >>>>>>> In order to start looking into this, access to the full rebalance >>>>>>> logs is required (possibly brick logs as well). >>>>>>> Can you provide those? >>>>>>> >>>>>>> My regards, >>>>>>> >>>>>>> >>>>>>> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>> >>>>>>>> Hi, >>>>>>>> Did anyone get a chance to look into this? >>>>>>>> >>>>>>>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>> >>>>>>>>> Hi All, >>>>>>>>> >>>>>>>>> *We are facing "Mismatching layouts for ,gfid = " >>>>>>>>> errors.* >>>>>>>>> >>>>>>>>> We have a distributed glusterfs 5.10, no replication, 2 bricks >>>>>>>>> (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to >>>>>>>>> the existing setup. >>>>>>>>> Post that we did a rebalance fix-layout and then a rebalance >>>>>>>>> (which is currently still in progress). The status shows "failed" on >>>>>>>>> certain bricks but "in progress" for others. Adding output for gluster >>>>>>>>> rebalance status below. >>>>>>>>> >>>>>>>>> The glusterfs client logs are flooded with "Mismatching layouts >>>>>>>>> for ,gfid = " >>>>>>>>> The performance too seems to have degraded due to this, even basic >>>>>>>>> commands like `cd` and `ls` are taking more than a minute compared to >>>>>>>>> sub-second number before brick addition. >>>>>>>>> Apart from that we also experienced many binaries and files giving >>>>>>>>> error stale file handle error even though the files were present. 
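
For reference, the sequence being described in the quoted report (add new bricks, run fix-layout, then a full rebalance) corresponds roughly to the commands below. This is only a sketch: the volume name "data" is inferred from the "4-data-dht" prefix in the client logs, and the brick host and paths are placeholders, not the ones actually used here.

$ gluster volume add-brick data newnode:/bricks/b1 newnode:/bricks/b2
$ gluster volume rebalance data fix-layout start
$ gluster volume rebalance data status    # wait until every node reports completed
$ gluster volume rebalance data start     # only then migrate the data itself
$ gluster volume rebalance data status

The ordering is the point Barak asks about earlier in the thread: the data rebalance should only be started once fix-layout has finished on all nodes.
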
>>>>>>>>> >>>>>>>>> >>>>>>>>> *gluster rebalance status :* >>>>>>>>> >>>>>>>>> Node Rebalanced-files size scanned failures >>>>>>>>> skipped status run time in h:m:s >>>>>>>>> --------- ----------- ----------- ----------- >>>>>>>>> ----------- ----------- ------------ -------------- >>>>>>>>> localhost 176 3.5GB 12790 >>>>>>>>> 0 8552 in progress 21:36:01 >>>>>>>>> 10.132.0.72 8232 394.8GB 19995 >>>>>>>>> 21 26 failed 14:50:30 >>>>>>>>> 10.132.0.44 12625 1.0TB 50023 >>>>>>>>> 1 10202 in progress 21:36:00 >>>>>>>>> 10.132.0.3 21982 956.8GB 79145 >>>>>>>>> 1 34571 in progress 21:36:00 >>>>>>>>> 10.132.0.9 7975 355.8GB 20157 >>>>>>>>> 6 1522 failed 14:51:45 >>>>>>>>> 10.132.0.73 6293 394.5GB 26414 >>>>>>>>> 151 8085 failed 14:51:45 >>>>>>>>> 10.132.0.70 6480 477.1GB 21058 >>>>>>>>> 27 1787 failed 14:50:32 >>>>>>>>> Estimated time left for rebalance to complete : 130:56:28 >>>>>>>>> >>>>>>>>> >>>>>>>>> *Logs from one of the clients below:* >>>>>>>>> >>>>>>>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >>>>>>>>> layout - 2454306944 - 2761060379 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> 
data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk >>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >>>>>>>>> layout - 920400036 - 1227153471 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >>>>>>>>> layout - 1840730208 - 2147483643 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >>>>>>>>> layout - 1533976772 - 1840730207 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >>>>>>>>> layout - 613646600 - 920400035 - 4159036738 >>>>>>>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >>>>>>>>> data-client-7 (hashed subvol is data-client-17) >>>>>>>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>>>>>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat 
>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>>>>>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >>>>>>>>> data-client-6 (hashed subvol is data-client-15) >>>>>>>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>>>>>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >>>>>>>>> data-client-15 (hashed subvol is data-client-7) >>>>>>>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>>>>>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>>>>>>> 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>>>>>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>> on data-client-15 (hashed subvol is data-client-7) >>>>>>>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >>>>>>>>> data-client-14 (hashed subvol is data-client-7) >>>>>>>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>>>>>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>> on data-client-15 (hashed subvol is data-client-8) >>>>>>>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >>>>>>>>> (hashed subvol is data-client-6) >>>>>>>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>>>>>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on >>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned 
with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>>>>>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >>>>>>>>> subvol is data-client-6) >>>>>>>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>>>>>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>>>>>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >>>>>>>>> subvol is data-client-15) >>>>>>>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>>>>>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>>>>>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >>>>>>>>> subvol is data-client-15) >>>>>>>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>>>>>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> 
/processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >>>>>>>>> subvol is data-client-10) >>>>>>>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>>>>>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >>>>>>>>> (hashed subvol is data-client-8) >>>>>>>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>>>>>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >>>>>>>>> (hashed subvol is data-client-2) >>>>>>>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>>>>>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >>>>>>>>> subvol is data-client-18) >>>>>>>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> 
/processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>>>>>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>> on data-client-2 (hashed subvol is data-client-11) >>>>>>>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >>>>>>>>> subvol is data-client-15) >>>>>>>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>>>>>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>>>>>>> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >>>>>>>>> subvol is data-client-18) >>>>>>>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>>>>>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>> deletion of stale linkfile >>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >>>>>>>>> (hashed subvol is data-client-19) >>>>>>>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>>>>>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >>>>>>>>> 613576736 - 920330171 - 4159036738 
>>>>>>>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - >>>>>>>>> 920330172 - 1227083607 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >>>>>>>>> 3374637116 - 3681390551 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >>>>>>>>> 3681390552 - 3988143987 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >>>>>>>>> layout - 3067883680 - 3374637115 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >>>>>>>>> - 306753435 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >>>>>>>>> layout - 3988213852 - 4294967295 - 4159036738 >>>>>>>>> [2020-07-02 12:30:31.917874] I 
[MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >>>>>>>>> 3681460416 - 3988213851 - 4159036738 >>>>>>>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >>>>>>>>> 3988213852 - 4294967295 - 4159036738 >>>>>>>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>>>>>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>>>>>>> (hash=data-client-6/cache=data-client-18) => >>>>>>>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>>>>>>> (hash=data-client-6/cache=) >>>>>>>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>>>>>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>>>>>>> (hash=data-client-17/cache=data-client-18) => >>>>>>>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>>>>>>> (hash=data-client-17/cache=) >>>>>>>>> .... >>>>>>>>> >>>>>>>>> >>>>>>>>> Please look into this at top-priority if possible. >>>>>>>>> Let me know if anything else is required. >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> Regards, >>>>>>>>> Shreyansh Shah >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Regards, >>>>>>>> Shreyansh Shah >>>>>>>> ________ >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> Community Meeting Calendar: >>>>>>>> >>>>>>>> Schedule - >>>>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>>>> Bridge: https://bluejeans.com/441850968 >>>>>>>> >>>>>>>> Gluster-users mailing list >>>>>>>> Gluster-users at gluster.org >>>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> *Barak Sason Rofman* >>>>>>> >>>>>>> Gluster Storage Development >>>>>>> >>>>>>> Red Hat Israel >>>>>>> >>>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>>> >>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>> M: *+972-52-4326355* >>>>>>> @RedHat Red Hat >>>>>>> Red Hat >>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Regards, >>>>>> Shreyansh Shah >>>>>> >>>>> >>>>> >>>>> -- >>>>> *Barak Sason Rofman* >>>>> >>>>> Gluster Storage Development >>>>> >>>>> Red Hat Israel >>>>> >>>>> 34 Jerusalem rd. 
Ra'anana, 43501 >>>>> >>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>> M: *+972-52-4326355* >>>>> @RedHat Red Hat >>>>> Red Hat >>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Regards, >>>> Shreyansh Shah >>>> >>> >>> >>> -- >>> *Barak Sason Rofman* >>> >>> Gluster Storage Development >>> >>> Red Hat Israel >>> >>> 34 Jerusalem rd. Ra'anana, 43501 >>> >>> bsasonro at redhat.com T: *+972-9-7692304* >>> M: *+972-52-4326355* >>> @RedHat Red Hat >>> Red Hat >>> >>> >>> >> >> >> -- >> Regards, >> Shreyansh Shah >> > > > -- > *Barak Sason Rofman* > > Gluster Storage Development > > Red Hat Israel > > 34 Jerusalem rd. Ra'anana, 43501 > > bsasonro at redhat.com T: *+972-9-7692304* > M: *+972-52-4326355* > @RedHat Red Hat > Red Hat > > > -- Regards, Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From evilmf at gmail.com Tue Jul 7 21:46:21 2020 From: evilmf at gmail.com (Marco Fais) Date: Tue, 7 Jul 2020 22:46:21 +0100 Subject: [Gluster-users] Problems with qemu and disperse volumes (live merge) In-Reply-To: References: <93D3EE3B-B5B3-4689-BF66-C1442A03971E@yahoo.com> Message-ID: Hi Strahil first of all thanks a million for your help -- really appreciate it. Thanks also for the pointers on the debug. I have tried it, and while I can't interpret the results I think I might have found something. There is a lot of information so hopefully this is relevant. During the snapshot creation and deletion, I can see the following errors in the client log: [2020-07-07 21:23:06.837381] W [MSGID: 122019] [ec-helpers.c:401:ec_loc_gfid_check] 0-SSD_Storage-disperse-0: Mismatching GFID's in loc [2020-07-07 21:23:06.837387] D [MSGID: 0] [defaults.c:1328:default_mknod_cbk] 0-stack-trace: stack-address: 0x7f0dc0001a78, SSD_Storage-disperse-0 returned -1 error: Input/output error [Input/output error] [2020-07-07 21:23:06.837392] W [MSGID: 109002] [dht-rename.c:1019:dht_rename_links_create_cbk] 0-SSD_Storage-dht: link/file /8d49207e-f6b9-41d1-8d35-f6e0fb121980/images/4802e66e-a7e3-42df-a570-7155135566ad/b51133ee-54e0-4001-ab4b-9f0dc1e5c6fc.meta on SSD_Storage-disperse-0 failed [Input/output error] [2020-07-07 21:23:06.837850] D [MSGID: 0] [stack.h:502:copy_frame] 0-stack: groups is null (ngrps: 0) [Invalid argument] [2020-07-07 21:23:06.839252] D [dict.c:1168:data_to_uint32] (-->/lib64/libglusterfs.so.0(dict_foreach_match+0x77) [0x7f0ddb1855e7] -->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x384cf) [0x7f0dd23c54cf] -->/lib64/libglusterfs.so.0(data_to_uint32+0x8e) [0x7f0ddb184f2e] ) 0-dict: key null, unsigned integer type asked, has integer type [Invalid argument] [2020-07-07 21:23:06.839272] D [MSGID: 0] [dht-common.c:6674:dht_readdirp_cbk] 0-SSD_Storage-dht: Processing entries from SSD_Storage-disperse-0 [2020-07-07 21:23:06.839281] D [MSGID: 0] [dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: SSD_Storage-disperse-0: entry = ., type = 4 [2020-07-07 21:23:06.839291] D [MSGID: 0] [dht-common.c:6813:dht_readdirp_cbk] 0-SSD_Storage-dht: SSD_Storage-disperse-0: Adding entry = . 
[2020-07-07 21:23:06.839297] D [MSGID: 0] [dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: SSD_Storage-disperse-0: entry = .., type = 4 [2020-07-07 21:23:06.839324] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-6 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839327] D [dict.c:1800:dict_get_int32] (-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x227d6) [0x7f0dd23af7d6] -->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x17661) [0x7f0dd23a4661] -->/lib64/libglusterfs.so.0(dict_get_int32+0x107) [0x7f0ddb186437] ) 0-dict: key glusterfs.inodelk-count, integer type asked, has unsigned integer type [Invalid argument] [2020-07-07 21:23:06.839361] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-11 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839395] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-15 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839419] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-9 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839473] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-18 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839471] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-10 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839491] D [dict.c:1800:dict_get_int32] (-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x256ad) [0x7f0dd23b26ad] -->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x17661) [0x7f0dd23a4661] -->/lib64/libglusterfs.so.0(dict_get_int32+0x107) [0x7f0ddb186437] ) 0-dict: key glusterfs.inodelk-count, integer type asked, has unsigned integer type [Invalid argument] [2020-07-07 21:23:06.839512] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-7 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839526] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-23 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839543] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-22 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839543] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-16 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839556] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-21 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839596] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-12 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839617] D 
[MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-14 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839631] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-13 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839636] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc00395a8, SSD_Storage-client-17 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839643] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0034598, SSD_Storage-client-8 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839656] D [MSGID: 0] [defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc007c428, SSD_Storage-disperse-2 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839665] D [MSGID: 0] [dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) on SSD_Storage-disperse-2 returned error [Stale file handle] [2020-07-07 21:23:06.839666] D [MSGID: 0] [defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc007c428, SSD_Storage-disperse-1 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839683] D [MSGID: 0] [dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) on SSD_Storage-disperse-1 returned error [Stale file handle] [2020-07-07 21:23:06.839686] D [dict.c:1168:data_to_uint32] (-->/lib64/libglusterfs.so.0(dict_foreach_match+0x77) [0x7f0ddb1855e7] -->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x384cf) [0x7f0dd23c54cf] -->/lib64/libglusterfs.so.0(data_to_uint32+0x8e) [0x7f0ddb184f2e] ) 0-dict: key null, unsigned integer type asked, has integer type [Invalid argument] [2020-07-07 21:23:06.839698] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-19 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839703] D [MSGID: 0] [dht-common.c:6674:dht_readdirp_cbk] 0-SSD_Storage-dht: Processing entries from SSD_Storage-disperse-0 [2020-07-07 21:23:06.839714] D [MSGID: 0] [dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: SSD_Storage-disperse-0: entry = .., type = 4 [2020-07-07 21:23:06.839716] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-30 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839724] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-34 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839720] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-35 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839755] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-31 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839759] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc009c108, SSD_Storage-client-20 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 
21:23:06.839774] D [MSGID: 0] [defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc007c428, SSD_Storage-disperse-3 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839775] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-32 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839783] D [MSGID: 0] [dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) on SSD_Storage-disperse-3 returned error [Stale file handle] [2020-07-07 21:23:06.839798] D [MSGID: 0] [dht-common.c:601:dht_discover_complete] 0-SSD_Storage-dht: key = trusted.glusterfs.quota.read-only not present in dict [2020-07-07 21:23:06.839807] D [MSGID: 0] [client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc0024b48, SSD_Storage-client-33 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839807] D [MSGID: 0] [dht-layout.c:789:dht_layout_preset] 0-SSD_Storage-dht: file = 00000000-0000-0000-0000-000000000000, subvol = SSD_Storage-disperse-4 [2020-07-07 21:23:06.839825] D [MSGID: 0] [defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: 0x7f0dc007c428, SSD_Storage-disperse-5 returned -1 error: Stale file handle [Stale file handle] [2020-07-07 21:23:06.839835] D [MSGID: 0] [dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) on SSD_Storage-disperse-5 returned error [Stale file handle] The above is logged just shortly before the qemu-kvm process crashes with the usual error: Unexpected error in raw_check_lock_bytes() at block/file-posix.c:811: 2020-07-07T21:23:06.847336Z qemu-kvm: Failed to get shared "write" lock I have looked also on the bricks logs, but there is too much information there and will need to know what to look for. Not sure if there is any benefit in looking into this any further? Thanks, Marco On Thu, 2 Jul 2020 at 15:45, Strahil Nikolov wrote: > > > ?? 2 ??? 2020 ?. 16:33:51 GMT+03:00, Marco Fais ??????: > >Hi Strahil, > > > >WARNING: As you enabled sharding - NEVER DISABLE SHARDING, EVER ! > >> > > > >Thanks -- good to be reminded :) > > > > > >> >When you say they will not be optimal are you referring mainly to > >> >performance considerations? We did plenty of testing, and in terms > >of > >> >performance didn't have issues even with I/O intensive workloads > >(using > >> >SSDs, I had issues with spinning disks). > >> > >> Yes, the client side has to connect to 6 bricks (4+2) at a time and > >> calculate the data in order to obtain the necessary information.Same > >is > >> valid for writing. > >> If you need to conserve space, you can test VDO without compression > >(of > >> even with it). > >> > > > >Understood -- will explore VDO. Storage usage efficiency is less > >important > >than fault tolerance or performance for us -- disperse volumes seemed > >to > >tick all the boxes so we looked at them primarily. > >But clearly I had missed that they are not used as mainstream VM > >storage > >for oVirt (I did know they weren't supported, but as explained thought > >was > >more on the management side). > > > > > >> > >> Also with replica volumes, you can use 'choose-local' /in case > >you > >> have faster than the network storage (like NVMe)/ and increase the > >read > >> speed. Of course this feature is useful for Hyperconverged setup > >(gluster > >> + ovirt on the same node). > >> > > > >Will explore this option as well, thanks for the suggestion. 
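
As a side note on the 'choose-local' suggestion just quoted: cluster.choose-local is an option of replicated (AFR) volumes, so it would apply to a replica volume rather than to the disperse volume being debugged in this thread. A minimal sketch, using a placeholder volume name, would be:

$ gluster volume set <replica-volume> cluster.choose-local on

It biases reads towards the local brick when the client and a brick share the same host, which is why it is mainly of interest in hyperconverged setups.
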
> > > > > >> If you were using ovirt 4.3 , I would recommend you to focus on > >> gluster. Yet, you use oVirt 4.4 which is quite newer and it needs > > some > >> polishing. > >> > > > >Ovirt 4.3.9 (using the older Centos 7 qemu/libvirt) unfortunately had > >similar issues with the disperse volumes. Not sure if exactly the same, > >as > >never looked deeper into it, but the results were similar. > >Ovirt 4.4.0 has some issues with snapshot deletion that are independent > >from Gluster (I have raised the issue here, > >https://bugzilla.redhat.com/show_bug.cgi?id=1840414, should be sorted > >with > >4.4.2 I guess), so at the moment it only works with the "testing" AV > >repo. > > > > In such case I can recommend you to: > 1. Ensure you have enough space on all bricks for the logs > (/var/log/gluster). Several gigs should be OK > 2. Enable all logs to 'TRACE' . Red Hat's documentation on the topic is > quite good: > > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3/html/administration_guide/configuring_the_log_level > 3. Reproduce the issue on a fresh VM (never done snapshot deletion) > 4. Disable (switch to info) all logs as per the link in point 2 > > The logs will be spread among all nodes. If you have remote logging > available, you can also use it for analysis of the logs. > > Most probably the brick logs can provide useful information. > > > > > >> Check ovirt engine logs (on the HostedEngine VM or your standalone > >> engine) , vdsm logs on the host that was running the VM and next - > >check > >> the brick logs. > >> > > > >Will do. > > > >Thanks, > >Marco > > > About VDO - it might require some tuning and even afterwards it won't be > very performant, so it depends on your needs. > > Best Regards, > Strahil Nikolov > -------------- next part -------------- An HTML attachment was scrubbed... URL: From archon810 at gmail.com Wed Jul 8 06:02:10 2020 From: archon810 at gmail.com (Artem Russakovskii) Date: Tue, 7 Jul 2020 23:02:10 -0700 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: I think it'd be extremely helpful if gluster had a feature to grab all the necessary logs/debug info (maybe a few variations depending on the bug) so that all the user would have to do is enter a simple command and have gluster generate the whole bug report, ready to be sent to to the gluster team. Sincerely, Artem -- Founder, Android Police , APK Mirror , Illogical Robot LLC beerpla.net | @ArtemR On Tue, Jul 7, 2020 at 1:47 AM Shreyansh Shah wrote: > Sounds good, thank you. > > On Tue, Jul 7, 2020 at 2:12 PM Barak Sason Rofman > wrote: > >> Thanks Shreyansh, >> >> I'll look into it, however I'll likely need some help from more senior >> team members to perform RCA. >> I'll update once I have new insights. >> >> My regards, >> >> On Tue, Jul 7, 2020 at 11:40 AM Shreyansh Shah < >> shreyansh.shah at alpha-grep.com> wrote: >> >>> Hi Barak, >>> Thanks for looking into this and helping me out, >>> The fix-layout was successful, and I ran a rebalance after completion of >>> fix-layout. >>> The rebalance status though did show failure for 3 nodes. >>> >>> On Tue, Jul 7, 2020 at 2:07 PM Barak Sason Rofman >>> wrote: >>> >>>> Greetings again Shreyansh, >>>> >>>> I'm indeed seeing a lot of errors in the log file - still unsure about >>>> the RC. >>>> You mentioned that prior to running rebalance you ran fix-layout, was >>>> the fix-layout successful? 
>>>> Another question - did you wait until fix-layout was completed before >>>> running rebalance? >>>> >>>> My thanks, >>>> >>>> On Mon, Jul 6, 2020 at 9:33 PM Shreyansh Shah < >>>> shreyansh.shah at alpha-grep.com> wrote: >>>> >>>>> Hi, >>>>> Attaching rebalance logs >>>>> FYI, we ran "gluster rebalance fix-layout" followed by "gluster >>>>> rebalance" on 20200701 and today we again ran "gluster rebalance fix-layout" >>>>> >>>>> >>>>> PFA >>>>> >>>>> On Mon, Jul 6, 2020 at 11:08 PM Barak Sason Rofman < >>>>> bsasonro at redhat.com> wrote: >>>>> >>>>>> I think it would be best. >>>>>> As I can't say at this point where the problem is originating from, >>>>>> brick logs might also be necessary (I assume I would have a better picture >>>>>> once I have the rebalance logs). >>>>>> >>>>>> Cheers, >>>>>> >>>>>> On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah < >>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>> >>>>>>> Hi Barak, >>>>>>> Can provide the rebalance logs. Do you require all the brick logs >>>>>>> (14 in total)? >>>>>>> >>>>>>> On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman < >>>>>>> bsasonro at redhat.com> wrote: >>>>>>> >>>>>>>> Greetings Shreyansh, >>>>>>>> >>>>>>>> Off-hand I can't come up with a reason for these failures. >>>>>>>> In order to start looking into this, access to the full rebalance >>>>>>>> logs is required (possibly brick logs as well). >>>>>>>> Can you provide those? >>>>>>>> >>>>>>>> My regards, >>>>>>>> >>>>>>>> >>>>>>>> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>> >>>>>>>>> Hi, >>>>>>>>> Did anyone get a chance to look into this? >>>>>>>>> >>>>>>>>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>>> >>>>>>>>>> Hi All, >>>>>>>>>> >>>>>>>>>> *We are facing "Mismatching layouts for ,gfid = " >>>>>>>>>> errors.* >>>>>>>>>> >>>>>>>>>> We have a distributed glusterfs 5.10, no replication, 2 bricks >>>>>>>>>> (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to >>>>>>>>>> the existing setup. >>>>>>>>>> Post that we did a rebalance fix-layout and then a rebalance >>>>>>>>>> (which is currently still in progress). The status shows "failed" on >>>>>>>>>> certain bricks but "in progress" for others. Adding output for gluster >>>>>>>>>> rebalance status below. >>>>>>>>>> >>>>>>>>>> The glusterfs client logs are flooded with "Mismatching layouts >>>>>>>>>> for ,gfid = " >>>>>>>>>> The performance too seems to have degraded due to this, even >>>>>>>>>> basic commands like `cd` and `ls` are taking more than a minute compared to >>>>>>>>>> sub-second number before brick addition. >>>>>>>>>> Apart from that we also experienced many binaries and files >>>>>>>>>> giving error stale file handle error even though the files were present. 
>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> *gluster rebalance status :* >>>>>>>>>> >>>>>>>>>> Node Rebalanced-files size scanned failures >>>>>>>>>> skipped status run time in h:m:s >>>>>>>>>> --------- ----------- ----------- ----------- >>>>>>>>>> ----------- ----------- ------------ -------------- >>>>>>>>>> localhost 176 3.5GB 12790 >>>>>>>>>> 0 8552 in progress 21:36:01 >>>>>>>>>> 10.132.0.72 8232 394.8GB 19995 >>>>>>>>>> 21 26 failed 14:50:30 >>>>>>>>>> 10.132.0.44 12625 1.0TB 50023 >>>>>>>>>> 1 10202 in progress 21:36:00 >>>>>>>>>> 10.132.0.3 21982 956.8GB 79145 >>>>>>>>>> 1 34571 in progress 21:36:00 >>>>>>>>>> 10.132.0.9 7975 355.8GB 20157 >>>>>>>>>> 6 1522 failed 14:51:45 >>>>>>>>>> 10.132.0.73 6293 394.5GB 26414 >>>>>>>>>> 151 8085 failed 14:51:45 >>>>>>>>>> 10.132.0.70 6480 477.1GB 21058 >>>>>>>>>> 27 1787 failed 14:50:32 >>>>>>>>>> Estimated time left for rebalance to complete : 130:56:28 >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> *Logs from one of the clients below:* >>>>>>>>>> >>>>>>>>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >>>>>>>>>> layout - 2454306944 - 2761060379 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>>>>>>>> 
[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk >>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >>>>>>>>>> layout - 920400036 - 1227153471 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >>>>>>>>>> layout - 1840730208 - 2147483643 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >>>>>>>>>> layout - 1533976772 - 1840730207 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >>>>>>>>>> layout - 613646600 - 920400035 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >>>>>>>>>> data-client-7 (hashed subvol is data-client-17) >>>>>>>>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>>>>>>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>>>>>>>> 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>>>>>>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >>>>>>>>>> data-client-6 (hashed subvol is data-client-15) >>>>>>>>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>>>>>>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >>>>>>>>>> data-client-15 (hashed subvol is data-client-7) >>>>>>>>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>>>>>>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: 
lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>>>>>>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>> on data-client-15 (hashed subvol is data-client-7) >>>>>>>>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >>>>>>>>>> data-client-14 (hashed subvol is data-client-7) >>>>>>>>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>>>>>>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>> on data-client-15 (hashed subvol is data-client-8) >>>>>>>>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >>>>>>>>>> (hashed subvol is data-client-6) >>>>>>>>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile 
/processed_data/20200630/MCASTNSECM1_DATA.dat on >>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>>>>>>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >>>>>>>>>> subvol is data-client-6) >>>>>>>>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>>>>>>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >>>>>>>>>> subvol is data-client-15) >>>>>>>>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>>>>>>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >>>>>>>>>> subvol is data-client-15) >>>>>>>>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with 
op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >>>>>>>>>> subvol is data-client-10) >>>>>>>>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >>>>>>>>>> (hashed subvol is data-client-8) >>>>>>>>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >>>>>>>>>> (hashed subvol is data-client-2) >>>>>>>>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale 
linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >>>>>>>>>> subvol is data-client-18) >>>>>>>>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>> on data-client-2 (hashed subvol is data-client-11) >>>>>>>>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >>>>>>>>>> subvol is data-client-15) >>>>>>>>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>>>>>>>> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >>>>>>>>>> subvol is data-client-18) >>>>>>>>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>>>>>>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>> deletion of stale linkfile >>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >>>>>>>>>> (hashed subvol is data-client-19) >>>>>>>>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: 
lookup_unlink >>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>>>>>>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >>>>>>>>>> 613576736 - 920330171 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - >>>>>>>>>> 920330172 - 1227083607 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >>>>>>>>>> 3374637116 - 3681390551 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >>>>>>>>>> 3681390552 - 3988143987 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >>>>>>>>>> layout - 3067883680 - 3374637115 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >>>>>>>>>> - 306753435 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 
4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >>>>>>>>>> layout - 3988213852 - 4294967295 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >>>>>>>>>> 3681460416 - 3988213851 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >>>>>>>>>> 3988213852 - 4294967295 - 4159036738 >>>>>>>>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>>>>>>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>>>>>>>> (hash=data-client-6/cache=data-client-18) => >>>>>>>>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>>>>>>>> (hash=data-client-6/cache=) >>>>>>>>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>>>>>>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>>>>>>>> (hash=data-client-17/cache=data-client-18) => >>>>>>>>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>>>>>>>> (hash=data-client-17/cache=) >>>>>>>>>> .... >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Please look into this at top-priority if possible. >>>>>>>>>> Let me know if anything else is required. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> Regards, >>>>>>>>>> Shreyansh Shah >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> Regards, >>>>>>>>> Shreyansh Shah >>>>>>>>> ________ >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> Community Meeting Calendar: >>>>>>>>> >>>>>>>>> Schedule - >>>>>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>>>>> Bridge: https://bluejeans.com/441850968 >>>>>>>>> >>>>>>>>> Gluster-users mailing list >>>>>>>>> Gluster-users at gluster.org >>>>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> *Barak Sason Rofman* >>>>>>>> >>>>>>>> Gluster Storage Development >>>>>>>> >>>>>>>> Red Hat Israel >>>>>>>> >>>>>>>> 34 Jerusalem rd. 
Ra'anana, 43501 >>>>>>>> >>>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>>> M: *+972-52-4326355* >>>>>>>> @RedHat Red Hat >>>>>>>> Red Hat >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Regards, >>>>>>> Shreyansh Shah >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> *Barak Sason Rofman* >>>>>> >>>>>> Gluster Storage Development >>>>>> >>>>>> Red Hat Israel >>>>>> >>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>> >>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>> M: *+972-52-4326355* >>>>>> @RedHat Red Hat >>>>>> Red Hat >>>>>> >>>>>> >>>>>> >>>>> >>>>> >>>>> -- >>>>> Regards, >>>>> Shreyansh Shah >>>>> >>>> >>>> >>>> -- >>>> *Barak Sason Rofman* >>>> >>>> Gluster Storage Development >>>> >>>> Red Hat Israel >>>> >>>> 34 Jerusalem rd. Ra'anana, 43501 >>>> >>>> bsasonro at redhat.com T: *+972-9-7692304* >>>> M: *+972-52-4326355* >>>> @RedHat Red Hat >>>> Red Hat >>>> >>>> >>>> >>> >>> >>> -- >>> Regards, >>> Shreyansh Shah >>> >> >> >> -- >> *Barak Sason Rofman* >> >> Gluster Storage Development >> >> Red Hat Israel >> >> 34 Jerusalem rd. Ra'anana, 43501 >> >> bsasonro at redhat.com T: *+972-9-7692304* >> M: *+972-52-4326355* >> @RedHat Red Hat >> Red Hat >> >> >> > > > -- > Regards, > Shreyansh Shah > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Wed Jul 8 11:27:49 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Wed, 08 Jul 2020 14:27:49 +0300 Subject: [Gluster-users] Problems with qemu and disperse volumes (live merge) In-Reply-To: References: <93D3EE3B-B5B3-4689-BF66-C1442A03971E@yahoo.com> Message-ID: <7C1FC636-CD69-46D1-8916-B583164B4AD5@yahoo.com> See my comments inline. ?? 8 ??? 2020 ?. 0:46:21 GMT+03:00, Marco Fais ??????: >Hi Strahil > >first of all thanks a million for your help -- really appreciate it. >Thanks also for the pointers on the debug. I have tried it, and while I >can't interpret the results I think I might have found something. > >There is a lot of information so hopefully this is relevant. During the >snapshot creation and deletion, I can see the following errors in the >client log: > >[2020-07-07 21:23:06.837381] W [MSGID: 122019] >[ec-helpers.c:401:ec_loc_gfid_check] 0-SSD_Storage-disperse-0: >Mismatching >GFID's in loc >[2020-07-07 21:23:06.837387] D [MSGID: 0] >[defaults.c:1328:default_mknod_cbk] 0-stack-trace: stack-address: >0x7f0dc0001a78, SSD_Storage-disperse-0 returned -1 error: Input/output >error [Input/output error] You have to check brick logs for the first brick in the volume list. >[2020-07-07 21:23:06.837392] W [MSGID: 109002] >[dht-rename.c:1019:dht_rename_links_create_cbk] 0-SSD_Storage-dht: >link/file >/8d49207e-f6b9-41d1-8d35-f6e0fb121980/images/4802e66e-a7e3-42df-a570-7155135566ad/b51133ee-54e0-4001-ab4b-9f0dc1e5c6fc.meta Check the meta file. There was a problem with Gluster where it healed it before the other replica has come up (in your case is a little bit different.Usually only the timestamp inside the file is changed, so you can force gluster to update it by changing the timestamp inside. >on SSD_Storage-disperse-0 failed [Input/output error] Already mentioned it. 
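(As a side note on the "check the brick logs for the first brick in the volume list" suggestion above: the brick order can be read straight from the volume definition, and each brick writes its own log on the node that hosts it. The path below is just the default GlusterFS naming scheme, not Marco's actual brick path.)

$ gluster volume info SSD_Storage     # bricks are listed in order; Brick1 is the first in the list
$ less /var/log/glusterfs/bricks/<brick-path-with-slashes-replaced-by-dashes>.log   # on the node hosting Brick1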
>[2020-07-07 21:23:06.837850] D [MSGID: 0] [stack.h:502:copy_frame] >0-stack: >groups is null (ngrps: 0) [Invalid argument] >[2020-07-07 21:23:06.839252] D [dict.c:1168:data_to_uint32] >(-->/lib64/libglusterfs.so.0(dict_foreach_match+0x77) [0x7f0ddb1855e7] >-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x384cf) >[0x7f0dd23c54cf] -->/lib64/libglusterfs.so.0(data_to_uint32+0x8e) >[0x7f0ddb184f2e] ) 0-dict: key null, unsigned integer type asked, has >integer type [Invalid argument] >[2020-07-07 21:23:06.839272] D [MSGID: 0] >[dht-common.c:6674:dht_readdirp_cbk] 0-SSD_Storage-dht: Processing >entries >from SSD_Storage-disperse-0 >[2020-07-07 21:23:06.839281] D [MSGID: 0] >[dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: >SSD_Storage-disperse-0: entry = ., type = 4 >[2020-07-07 21:23:06.839291] D [MSGID: 0] >[dht-common.c:6813:dht_readdirp_cbk] 0-SSD_Storage-dht: >SSD_Storage-disperse-0: Adding entry = . >[2020-07-07 21:23:06.839297] D [MSGID: 0] >[dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: >SSD_Storage-disperse-0: entry = .., type = 4 >[2020-07-07 21:23:06.839324] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-6 returned -1 error: >Stale file handle [Stale file handle] I see multiple of these, but as the message is not 'W' or 'E' , I assume it could happen and it's normal. >[2020-07-07 21:23:06.839327] D [dict.c:1800:dict_get_int32] >(-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x227d6) >[0x7f0dd23af7d6] >-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x17661) >[0x7f0dd23a4661] -->/lib64/libglusterfs.so.0(dict_get_int32+0x107) >[0x7f0ddb186437] ) 0-dict: key glusterfs.inodelk-count, integer type >asked, >has unsigned integer type [Invalid argument] >[2020-07-07 21:23:06.839361] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-11 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839395] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-15 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839419] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-9 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839473] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-18 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839471] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-10 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839491] D [dict.c:1800:dict_get_int32] >(-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x256ad) >[0x7f0dd23b26ad] >-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x17661) >[0x7f0dd23a4661] -->/lib64/libglusterfs.so.0(dict_get_int32+0x107) >[0x7f0ddb186437] ) 0-dict: key glusterfs.inodelk-count, integer type >asked, >has unsigned integer type [Invalid argument] >[2020-07-07 21:23:06.839512] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-7 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 
21:23:06.839526] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-23 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839543] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-22 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839543] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-16 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839556] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-21 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839596] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-12 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839617] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-14 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839631] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-13 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839636] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc00395a8, SSD_Storage-client-17 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839643] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0034598, SSD_Storage-client-8 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839656] D [MSGID: 0] >[defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: >0x7f0dc007c428, SSD_Storage-disperse-2 returned -1 error: Stale file >handle >[Stale file handle] >[2020-07-07 21:23:06.839665] D [MSGID: 0] >[dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) >on >SSD_Storage-disperse-2 returned error [Stale file handle] >[2020-07-07 21:23:06.839666] D [MSGID: 0] >[defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: >0x7f0dc007c428, SSD_Storage-disperse-1 returned -1 error: Stale file >handle >[Stale file handle] >[2020-07-07 21:23:06.839683] D [MSGID: 0] >[dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) >on >SSD_Storage-disperse-1 returned error [Stale file handle] >[2020-07-07 21:23:06.839686] D [dict.c:1168:data_to_uint32] >(-->/lib64/libglusterfs.so.0(dict_foreach_match+0x77) [0x7f0ddb1855e7] >-->/usr/lib64/glusterfs/7.5/xlator/cluster/disperse.so(+0x384cf) >[0x7f0dd23c54cf] -->/lib64/libglusterfs.so.0(data_to_uint32+0x8e) >[0x7f0ddb184f2e] ) 0-dict: key null, unsigned integer type asked, has >integer type [Invalid argument] >[2020-07-07 21:23:06.839698] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-19 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839703] D [MSGID: 0] >[dht-common.c:6674:dht_readdirp_cbk] 0-SSD_Storage-dht: Processing >entries >from SSD_Storage-disperse-0 >[2020-07-07 21:23:06.839714] D [MSGID: 0] 
>[dht-common.c:6681:dht_readdirp_cbk] 0-SSD_Storage-dht: >SSD_Storage-disperse-0: entry = .., type = 4 >[2020-07-07 21:23:06.839716] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-30 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839724] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-34 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839720] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-35 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839755] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-31 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839759] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc009c108, SSD_Storage-client-20 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839774] D [MSGID: 0] >[defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: >0x7f0dc007c428, SSD_Storage-disperse-3 returned -1 error: Stale file >handle >[Stale file handle] >[2020-07-07 21:23:06.839775] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-32 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839783] D [MSGID: 0] >[dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) >on >SSD_Storage-disperse-3 returned error [Stale file handle] >[2020-07-07 21:23:06.839798] D [MSGID: 0] >[dht-common.c:601:dht_discover_complete] 0-SSD_Storage-dht: key = >trusted.glusterfs.quota.read-only not present in dict >[2020-07-07 21:23:06.839807] D [MSGID: 0] >[client-rpc-fops_v2.c:2641:client4_0_lookup_cbk] 0-stack-trace: >stack-address: 0x7f0dc0024b48, SSD_Storage-client-33 returned -1 error: >Stale file handle [Stale file handle] >[2020-07-07 21:23:06.839807] D [MSGID: 0] >[dht-layout.c:789:dht_layout_preset] 0-SSD_Storage-dht: file = >00000000-0000-0000-0000-000000000000, subvol = SSD_Storage-disperse-4 >[2020-07-07 21:23:06.839825] D [MSGID: 0] >[defaults.c:1548:default_lookup_cbk] 0-stack-trace: stack-address: >0x7f0dc007c428, SSD_Storage-disperse-5 returned -1 error: Stale file >handle >[Stale file handle] >[2020-07-07 21:23:06.839835] D [MSGID: 0] >[dht-common.c:998:dht_discover_cbk] 0-SSD_Storage-dht: lookup of (null) >on >SSD_Storage-disperse-5 returned error [Stale file handle] > >The above is logged just shortly before the qemu-kvm process crashes >with >the usual error: > >Unexpected error in raw_check_lock_bytes() at block/file-posix.c:811: >2020-07-07T21:23:06.847336Z qemu-kvm: Failed to get shared "write" lock That's strange. Can you check the sanlock logs for anything reported there ? >I have looked also on the bricks logs, but there is too much >information >there and will need to know what to look for. > >Not sure if there is any benefit in looking into this any further? > >Thanks, >Marco > >On Thu, 2 Jul 2020 at 15:45, Strahil Nikolov >wrote: > >> >> >> ?? 2 ??? 2020 ?. 16:33:51 GMT+03:00, Marco Fais >??????: >> >Hi Strahil, >> > >> >WARNING: As you enabled sharding - NEVER DISABLE SHARDING, EVER ! 
>> >> >> > >> >Thanks -- good to be reminded :) >> > >> > >> >> >When you say they will not be optimal are you referring mainly to >> >> >performance considerations? We did plenty of testing, and in >terms >> >of >> >> >performance didn't have issues even with I/O intensive workloads >> >(using >> >> >SSDs, I had issues with spinning disks). >> >> >> >> Yes, the client side has to connect to 6 bricks (4+2) at a time >and >> >> calculate the data in order to obtain the necessary >information.Same >> >is >> >> valid for writing. >> >> If you need to conserve space, you can test VDO without >compression >> >(of >> >> even with it). >> >> >> > >> >Understood -- will explore VDO. Storage usage efficiency is less >> >important >> >than fault tolerance or performance for us -- disperse volumes >seemed >> >to >> >tick all the boxes so we looked at them primarily. >> >But clearly I had missed that they are not used as mainstream VM >> >storage >> >for oVirt (I did know they weren't supported, but as explained >thought >> >was >> >more on the management side). >> > >> > >> >> >> >> Also with replica volumes, you can use 'choose-local' /in case >> >you >> >> have faster than the network storage (like NVMe)/ and increase >the >> >read >> >> speed. Of course this feature is useful for Hyperconverged setup >> >(gluster >> >> + ovirt on the same node). >> >> >> > >> >Will explore this option as well, thanks for the suggestion. >> > >> > >> >> If you were using ovirt 4.3 , I would recommend you to focus >on >> >> gluster. Yet, you use oVirt 4.4 which is quite newer and it >needs >> > some >> >> polishing. >> >> >> > >> >Ovirt 4.3.9 (using the older Centos 7 qemu/libvirt) unfortunately >had >> >similar issues with the disperse volumes. Not sure if exactly the >same, >> >as >> >never looked deeper into it, but the results were similar. >> >Ovirt 4.4.0 has some issues with snapshot deletion that are >independent >> >from Gluster (I have raised the issue here, >> >https://bugzilla.redhat.com/show_bug.cgi?id=1840414, should be >sorted >> >with >> >4.4.2 I guess), so at the moment it only works with the "testing" AV >> >repo. >> >> >> >> In such case I can recommend you to: >> 1. Ensure you have enough space on all bricks for the logs >> (/var/log/gluster). Several gigs should be OK >> 2. Enable all logs to 'TRACE' . Red Hat's documentation on the topic >is >> quite good: >> >> >https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3/html/administration_guide/configuring_the_log_level >> 3. Reproduce the issue on a fresh VM (never done snapshot deletion) >> 4. Disable (switch to info) all logs as per the link in point 2 >> >> The logs will be spread among all nodes. If you have remote logging >> available, you can also use it for analysis of the logs. >> >> Most probably the brick logs can provide useful information. >> >> >> > >> >> Check ovirt engine logs (on the HostedEngine VM or your >standalone >> >> engine) , vdsm logs on the host that was running the VM and >next - >> >check >> >> the brick logs. >> >> >> > >> >Will do. >> > >> >Thanks, >> >Marco >> >> >> About VDO - it might require some tuning and even afterwards it won't >be >> very performant, so it depends on your needs. 
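As a concrete sketch of step 2 quoted above, using the volume name SSD_Storage that appears in the client log excerpts earlier in this message (the options are the standard diagnostics settings, so adjust the volume name as needed):

$ gluster volume set SSD_Storage diagnostics.brick-log-level TRACE
$ gluster volume set SSD_Storage diagnostics.client-log-level TRACE
  ... reproduce the snapshot creation/deletion on a fresh VM ...
$ gluster volume set SSD_Storage diagnostics.brick-log-level INFO
$ gluster volume set SSD_Storage diagnostics.client-log-level INFO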
>> >> Best Regards, >> Strahil Nikolov >> From hunter86_bg at yahoo.com Wed Jul 8 11:35:35 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Wed, 08 Jul 2020 14:35:35 +0300 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: <0EB17979-D921-492C-B3AE-852F2B4A2F8D@yahoo.com> At least for EL 7 ,there are 2 modules for sosreport: gluster & gluster_block Best Regards, Strahil Nikolov ?? 8 ??? 2020 ?. 9:02:10 GMT+03:00, Artem Russakovskii ??????: >I think it'd be extremely helpful if gluster had a feature to grab all >the >necessary logs/debug info (maybe a few variations depending on the bug) >so >that all the user would have to do is enter a simple command and have >gluster generate the whole bug report, ready to be sent to to the >gluster >team. > >Sincerely, >Artem > >-- >Founder, Android Police , APK Mirror >, Illogical Robot LLC >beerpla.net | @ArtemR > > >On Tue, Jul 7, 2020 at 1:47 AM Shreyansh Shah > >wrote: > >> Sounds good, thank you. >> >> On Tue, Jul 7, 2020 at 2:12 PM Barak Sason Rofman > >> wrote: >> >>> Thanks Shreyansh, >>> >>> I'll look into it, however I'll likely need some help from more >senior >>> team members to perform RCA. >>> I'll update once I have new insights. >>> >>> My regards, >>> >>> On Tue, Jul 7, 2020 at 11:40 AM Shreyansh Shah < >>> shreyansh.shah at alpha-grep.com> wrote: >>> >>>> Hi Barak, >>>> Thanks for looking into this and helping me out, >>>> The fix-layout was successful, and I ran a rebalance after >completion of >>>> fix-layout. >>>> The rebalance status though did show failure for 3 nodes. >>>> >>>> On Tue, Jul 7, 2020 at 2:07 PM Barak Sason Rofman > >>>> wrote: >>>> >>>>> Greetings again Shreyansh, >>>>> >>>>> I'm indeed seeing a lot of errors in the log file - still unsure >about >>>>> the RC. >>>>> You mentioned that prior to running rebalance you ran fix-layout, >was >>>>> the fix-layout successful? >>>>> Another question - did you wait until fix-layout was completed >before >>>>> running rebalance? >>>>> >>>>> My thanks, >>>>> >>>>> On Mon, Jul 6, 2020 at 9:33 PM Shreyansh Shah < >>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>> >>>>>> Hi, >>>>>> Attaching rebalance logs >>>>>> FYI, we ran "gluster rebalance fix-layout" followed by "gluster >>>>>> rebalance" on 20200701 and today we again ran "gluster rebalance >fix-layout" >>>>>> >>>>>> >>>>>> PFA >>>>>> >>>>>> On Mon, Jul 6, 2020 at 11:08 PM Barak Sason Rofman < >>>>>> bsasonro at redhat.com> wrote: >>>>>> >>>>>>> I think it would be best. >>>>>>> As I can't say at this point where the problem is originating >from, >>>>>>> brick logs might also be necessary (I assume I would have a >better picture >>>>>>> once I have the rebalance logs). >>>>>>> >>>>>>> Cheers, >>>>>>> >>>>>>> On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah < >>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>> >>>>>>>> Hi Barak, >>>>>>>> Can provide the rebalance logs. Do you require all the brick >logs >>>>>>>> (14 in total)? >>>>>>>> >>>>>>>> On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman < >>>>>>>> bsasonro at redhat.com> wrote: >>>>>>>> >>>>>>>>> Greetings Shreyansh, >>>>>>>>> >>>>>>>>> Off-hand I can't come up with a reason for these failures. >>>>>>>>> In order to start looking into this, access to the full >rebalance >>>>>>>>> logs is required (possibly brick logs as well). >>>>>>>>> Can you provide those? 
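For anyone gathering these later: with the default log layout the rebalance log sits at /var/log/glusterfs/data-rebalance.log on every node taking part in the rebalance (again assuming the volume is the "data" volume from the log excerpts), so collecting it per node is roughly:

$ ls -lh /var/log/glusterfs/data-rebalance.log
$ tar czf $(hostname)-rebalance-log.tgz /var/log/glusterfs/data-rebalance.log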
>>>>>>>>> >>>>>>>>> My regards, >>>>>>>>> >>>>>>>>> >>>>>>>>> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >>>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>>> >>>>>>>>>> Hi, >>>>>>>>>> Did anyone get a chance to look into this? >>>>>>>>>> >>>>>>>>>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>>>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi All, >>>>>>>>>>> >>>>>>>>>>> *We are facing "Mismatching layouts for ,gfid = >" >>>>>>>>>>> errors.* >>>>>>>>>>> >>>>>>>>>>> We have a distributed glusterfs 5.10, no replication, 2 >bricks >>>>>>>>>>> (4TB each) on each node, 7 nodes in total. We added new >bricks yesterday to >>>>>>>>>>> the existing setup. >>>>>>>>>>> Post that we did a rebalance fix-layout and then a rebalance >>>>>>>>>>> (which is currently still in progress). The status shows >"failed" on >>>>>>>>>>> certain bricks but "in progress" for others. Adding output >for gluster >>>>>>>>>>> rebalance status below. >>>>>>>>>>> >>>>>>>>>>> The glusterfs client logs are flooded with "Mismatching >layouts >>>>>>>>>>> for ,gfid = " >>>>>>>>>>> The performance too seems to have degraded due to this, even >>>>>>>>>>> basic commands like `cd` and `ls` are taking more than a >minute compared to >>>>>>>>>>> sub-second number before brick addition. >>>>>>>>>>> Apart from that we also experienced many binaries and files >>>>>>>>>>> giving error stale file handle error even though the files >were present. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> *gluster rebalance status :* >>>>>>>>>>> >>>>>>>>>>> Node Rebalanced-files size scanned >failures >>>>>>>>>>> skipped status run time in h:m:s >>>>>>>>>>> --------- ----------- ----------- ----------- >>>>>>>>>>> ----------- ----------- ------------ >-------------- >>>>>>>>>>> localhost 176 3.5GB 12790 >>>>>>>>>>> 0 8552 in progress 21:36:01 >>>>>>>>>>> 10.132.0.72 8232 394.8GB 19995 >>>>>>>>>>> 21 26 failed 14:50:30 >>>>>>>>>>> 10.132.0.44 12625 1.0TB 50023 >>>>>>>>>>> 1 10202 in progress 21:36:00 >>>>>>>>>>> 10.132.0.3 21982 956.8GB 79145 >>>>>>>>>>> 1 34571 in progress 21:36:00 >>>>>>>>>>> 10.132.0.9 7975 355.8GB 20157 >>>>>>>>>>> 6 1522 failed 14:51:45 >>>>>>>>>>> 10.132.0.73 6293 394.5GB 26414 >>>>>>>>>>> 151 8085 failed 14:51:45 >>>>>>>>>>> 10.132.0.70 6480 477.1GB 21058 >>>>>>>>>>> 27 1787 failed 14:50:32 >>>>>>>>>>> Estimated time left for rebalance to complete : >130:56:28 >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> *Logs from one of the clients below:* >>>>>>>>>>> >>>>>>>>>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - >3995747641; disk >>>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = >b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - >3995747641; disk >>>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = >b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] 
>>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - >3995747641; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = >b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-15; inode layout - 2454306944 - 2761060379 - >3997647794; disk >>>>>>>>>>> layout - 2454306944 - 2761060379 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - >3997647794; disk >>>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - >3997647794; disk >>>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - >3997647794; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = >42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-11; inode layout - 920400036 - 1227153471 - >3997647794; disk >>>>>>>>>>> layout - 920400036 - 1227153471 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = >1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-16; inode layout - 1840730208 - 2147483643 - >3997647794; disk >>>>>>>>>>> layout - 1840730208 - 2147483643 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = >1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424525] I 
[MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-15; inode layout - 1533976772 - 1840730207 - >3997647794; disk >>>>>>>>>>> layout - 1533976772 - 1840730207 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = >1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-10; inode layout - 613646600 - 920400035 - >3997647794; disk >>>>>>>>>>> layout - 613646600 - 920400035 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = >1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_CDS_1_DATA.dat on >>>>>>>>>>> data-client-7 (hashed subvol is data-client-17) >>>>>>>>>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_CDS_2_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_2_DATA.dat on >>>>>>>>>>> data-client-6 (hashed subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>>>>>>>>> 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_3_DATA.dat on >>>>>>>>>>> data-client-15 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_4_DATA.dat on >>>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>>> on data-client-15 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_6_DATA.dat on >>>>>>>>>>> data-client-14 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>>>>>>>>> 
[dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>>> on data-client-15 (hashed subvol is data-client-8) >>>>>>>>>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on >data-client-11 >>>>>>>>>>> (hashed subvol is data-client-6) >>>>>>>>>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSECM1_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on >data-client-14 (hashed >>>>>>>>>>> subvol is data-client-6) >>>>>>>>>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSECM2_DATA.dat on >>>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on >data-client-6 (hashed >>>>>>>>>>> subvol is 
data-client-15) >>>>>>>>>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSECM3_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on >data-client-6 (hashed >>>>>>>>>>> subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on >data-client-2 (hashed >>>>>>>>>>> subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on >data-client-15 >>>>>>>>>>> (hashed subvol is data-client-8) >>>>>>>>>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and 
op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on >data-client-10 >>>>>>>>>>> (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on >data-client-8 (hashed >>>>>>>>>>> subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>>> on data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: 
>attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on >data-client-6 (hashed >>>>>>>>>>> subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >/processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on >data-client-9 (hashed >>>>>>>>>>> subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>>>>>>>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: >attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on >data-client-9 >>>>>>>>>>> (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: >lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 613576736 - 920330171 - 1; >disk layout - >>>>>>>>>>> 613576736 - 920330171 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes, gfid = >21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 920330172 - 1227083607 - 1; >disk layout - >>>>>>>>>>> 920330172 - 1227083607 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes, gfid = >21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; >disk layout - >>>>>>>>>>> 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for 
>>>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; >disk layout - >>>>>>>>>>> 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-15; inode layout - 3067883680 - 3374637115 - >3995747641; disk >>>>>>>>>>> layout - 3067883680 - 3374637115 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-16; inode layout - 3374637116 - 3681390551 - >3995747641; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; >disk layout - 0 >>>>>>>>>>> - 306753435 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 3988213852 - 4294967295 - >3995747641; disk >>>>>>>>>>> layout - 3988213852 - 4294967295 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; >disk layout - >>>>>>>>>>> 3681460416 - 3988213851 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: >subvol: >>>>>>>>>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; >disk layout - >>>>>>>>>>> 3988213852 - 4294967295 - 4159036738 
>>>>>>>>>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: >Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>>>>>>>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>>>>>>>>> (hash=data-client-6/cache=data-client-18) => >>>>>>>>>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>>>>>>>>> (hash=data-client-6/cache=) >>>>>>>>>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>>>>>>>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>>>>>>>>> (hash=data-client-17/cache=data-client-18) => >>>>>>>>>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>>>>>>>>> (hash=data-client-17/cache=) >>>>>>>>>>> .... >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Please look into this at top-priority if possible. >>>>>>>>>>> Let me know if anything else is required. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> Regards, >>>>>>>>>>> Shreyansh Shah >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> Regards, >>>>>>>>>> Shreyansh Shah >>>>>>>>>> ________ >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Community Meeting Calendar: >>>>>>>>>> >>>>>>>>>> Schedule - >>>>>>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>>>>>> Bridge: https://bluejeans.com/441850968 >>>>>>>>>> >>>>>>>>>> Gluster-users mailing list >>>>>>>>>> Gluster-users at gluster.org >>>>>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> *Barak Sason Rofman* >>>>>>>>> >>>>>>>>> Gluster Storage Development >>>>>>>>> >>>>>>>>> Red Hat Israel >>>>>>>>> >>>>>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>>>>> >>>>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>>>> M: *+972-52-4326355* >>>>>>>>> @RedHat Red Hat >>>>>>>>> Red Hat >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Regards, >>>>>>>> Shreyansh Shah >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> *Barak Sason Rofman* >>>>>>> >>>>>>> Gluster Storage Development >>>>>>> >>>>>>> Red Hat Israel >>>>>>> >>>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>>> >>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>> M: *+972-52-4326355* >>>>>>> @RedHat Red Hat >>>>>>> Red Hat >>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Regards, >>>>>> Shreyansh Shah >>>>>> >>>>> >>>>> >>>>> -- >>>>> *Barak Sason Rofman* >>>>> >>>>> Gluster Storage Development >>>>> >>>>> Red Hat Israel >>>>> >>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>> >>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>> M: *+972-52-4326355* >>>>> @RedHat Red Hat >>>>> Red Hat >>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Regards, >>>> Shreyansh Shah >>>> >>> >>> >>> -- >>> *Barak Sason Rofman* >>> >>> Gluster Storage Development >>> >>> Red Hat Israel >>> >>> 34 Jerusalem rd. 
Ra'anana, 43501 >>> >>> bsasonro at redhat.com T: *+972-9-7692304* >>> M: *+972-52-4326355* >>> @RedHat Red Hat >>> Red Hat >>> >>> >>> >> >> >> -- >> Regards, >> Shreyansh Shah >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> From cjeppesen at datto.com Wed Jul 8 11:33:29 2020 From: cjeppesen at datto.com (Claus Jeppesen) Date: Wed, 8 Jul 2020 13:33:29 +0200 Subject: [Gluster-users] Sharding on 7.x - file sizes are wrong after a large copy. Message-ID: In April of this year I reported the problem using sharding on gluster 7.4: ==== We're using GlusterFS in a replicated brick setup with 2 bricks with sharding turned on (shardsize 128MB). There is something funny going on as we can see that if we copy large VM files to the volume we can end up with files that are a bit larger than the source files DEPENDING on the speed with which we copied the files - e.g.: dd if=SOURCE bs=1M | pv -L NNm | ssh gluster_server "dd of=/gluster/VOL_NAME/TARGET bs=1M" It seems that if NN is <= 25 (i.e. 25 MB/s) the size of SOURCE and TARGET will be the same. If we crank NN to, say, 50 we sometimes risk that a 25G file ends up having a slightly larger size, e.g. 26844413952 or 26844233728 - larger than the expected 26843545600. Unfortunately this is not an illusion ! If we dd the files out of Gluster we will receive the amount of data that 'ls' showed us. In the brick directory (incl .shard directory) we have the expected amount of shards for a 25G files (200) with size precisely equal to 128MB - but there is an additional 0 size shard file created. Has anyone else seen a phenomenon like this ? ==== After upgrade to 7.6 we're still seeing this problem - now, the extra bytes that are appearing can be removed using truncate in the mounted gluster volume, and md5sum can confirm that after truncate the content is identical to the source - however, it may point to an underlying issue. I hope someone can reproduce this behaviour, Thanx, Claus. -- *Claus Jeppesen* Manager, Network Services Datto, Inc. p +45 6170 5901 | Copenhagen Office www.datto.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From duc.to.ho at gmail.com Wed Jul 8 19:25:57 2020 From: duc.to.ho at gmail.com (Duc To Ho) Date: Thu, 9 Jul 2020 02:25:57 +0700 Subject: [Gluster-users] Setup GlusterFS geo-replication between Ubuntu server and CentOS server Message-ID: Hello all, Recently I need to setup GlusterFS geo-replication between Ubuntu server and CentOs server. I setup following the guideline and everything was fine, checking with "gluster volume geo-replication gvol0 dev-3-15:gvol0 status" command returns me the info with Active status. But then when I started copying data to the master node, I got Exception error in log and status has become Faulty. I tried to set up the same in both CentOs-CentOs and Ubuntu-Ubuntu and there is no such issue. I wonder if anyone has set up the same case CentOS-Ubuntu and feedback if it is OK or had a similar issue and hopefully, the solution for it? Thank you and I look forward to hearing from you, Duc -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: image_2020_07_06T03_10_34_652Z.png Type: image/png Size: 207811 bytes Desc: not available URL: From sacharya at redhat.com Thu Jul 9 11:29:58 2020 From: sacharya at redhat.com (Shwetha Acharya) Date: Thu, 9 Jul 2020 16:59:58 +0530 Subject: [Gluster-users] Reliable Geo-Replication In-Reply-To: References: <75c1490e-2a69-7bc2-9ddd-a26fa5a225f5@gmx.de> <93e7700c-c4df-dd3c-ba15-f7a815ae7e6a@gmx.de> <841137d6-768f-7560-24b3-c80fc8ceffa9@gmx.de> <3b29ee38-991d-6253-f3da-504f4414c723@gmx.de> <93624ed1-4c0a-100f-344c-1cb99b30f94b@mpa-ifw.tu-darmstadt.de> Message-ID: Hi Felix, Find my reply inline. Regards, Shwetha On Thu, Jun 25, 2020 at 12:25 PM Felix K?lzow wrote: > Dear Gluster-users, > > I deleted a further the geo-replication session with [reset-sync-time] > option. Afterwards, > I recreated the session, and as expected, the session starts in the > hybrid crawl. > I can see some sync jobs are running in the gsyncd.log file and after a > couple of hours, > there are no such entries anymore. > > I switched into the log_level DEBUG mode to see what's going on: > gluster volume masterVOlume geoRepHost:slaveVol config log_level DEBUG > > It seems to me that the xsync mode is in loop since the same files > appear over and over again in the log-file. Can you elaborate this? Where are the same files appearing? Are they getting synced Now we have two volume in this "loop"-state and the third volume also > still has a broken geo-replication. > Is the worker status changing from initializing to faulty or initializing to active/passive? Is any worker active? > So any help is appreciated how to fix this or which information is > required to find the root cause? > > As mentioned before, all these gathered information could be used to > improve the geo-replication trouble-shooting documentation. > > Thanks in advance. > > Regards, > Felix > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rkothiya at redhat.com Thu Jul 9 12:46:32 2020 From: rkothiya at redhat.com (Rinku Kothiya) Date: Thu, 9 Jul 2020 18:16:32 +0530 Subject: [Gluster-users] Announcing Gluster release 8 Message-ID: Hi, The Gluster community is pleased to announce the release of 8.0, our latest release. This is a major release that includes a range of code improvements and stability fixes along with a few features as noted below. A selection of the key features and bugs addressed are documented in this [1] page. *Announcements:* 1. Releases that receive maintenance updates post release 8 are 7 and 8 [2] 2. Release 8 will receive maintenance updates around the 10th of every month for the first 3 months post release (i.e Aug'20, Sep'20, Oct'20). Post the initial 3 months, it will receive maintenance updates every 2 months till EOL. 3. For upgrading to release 8 refer to the release 8 upgrade guide [3]. 
Make sure you are not using any of the following deprecated features : - Block device (bd) xlator - Decompounder feature - Crypt xlator - Symlink-cache xlator - Stripe feature - Tiering support (tier xlator and changetimerecorder) - Glupy *Highlights of this release are:* *Highlights:* - Several stability fixes addressing * coverity, clang-scan, address sanitizer and valgrind reported issues * removal of unused and hence, deprecated code and features - Performance Improvements *Features:* - Implemented seek file operation for open-behind - Now storage.reserve option will take size of disk as input instead of percentage - Added Functionality to enable log rotation for user serviceable snapshot?s logs - Mandatory locks enhancements in replicate subvolumes - To validate other memory allocation implementations instead of libc?s malloc added an option to build with tcmalloc library - Integrated Thin-arbiter with GD1 - Client Handling of Elastic Clusters *Major issues:* - None Bugs addressed are provided towards the end, in the release notes [1] Thank you, Gluster community References: [1] Release notes: https://docs.gluster.org/en/latest/release-notes/8.0/ [2] Release schedule: https://www.gluster.org/release-schedule/ [3] Upgrade guide to release-8: https://docs.gluster.org/en/latest/Upgrade-Guide/upgrade_to_8/ [4] Packages: Packages that will be available : https://github.com/gluster/glusterdocs/blob/master/docs/Install-Guide/Community_Packages.md Packages at : https://download.gluster.org/pub/gluster/glusterfs/8/8.0/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From sacharya at redhat.com Thu Jul 9 13:00:23 2020 From: sacharya at redhat.com (Shwetha Acharya) Date: Thu, 9 Jul 2020 18:30:23 +0530 Subject: [Gluster-users] Setup GlusterFS geo-replication between Ubuntu server and CentOS server In-Reply-To: References: Message-ID: Hi Duc, Can we confirm if the gluster version and python version are same across the machines you are using? Regards, Shwetha On Thu, Jul 9, 2020 at 12:56 AM Duc To Ho wrote: > Hello all, > > Recently I need to setup GlusterFS geo-replication between Ubuntu server > and CentOs server. I setup following the guideline and everything was fine, > checking with "gluster volume geo-replication gvol0 dev-3-15:gvol0 status" > command returns me the info with Active status. But then when I started > copying data to the master node, I got Exception error in log and status > has become Faulty. > > I tried to set up the same in both CentOs-CentOs and Ubuntu-Ubuntu and > there is no such issue. > > I wonder if anyone has set up the same case CentOS-Ubuntu and feedback if > it is OK or had a similar issue and hopefully, the solution for it? > > Thank you and I look forward to hearing from you, > Duc > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sasundar at redhat.com Sun Jul 12 02:44:03 2020 From: sasundar at redhat.com (Satheesaran Sundaramoorthi) Date: Sun, 12 Jul 2020 08:14:03 +0530 Subject: [Gluster-users] gluster geo-replications fails to sync with IPV6 hostnames Message-ID: Hello All, While testing with glusterfs -6.0-37, I found that the geo-rep session is getting created with IPv6 hostnames, but the sync is not happening. 
I see traceback with gsyncd.log. [2020-07-11 09:29:20.735708] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf [2020-07-11 09:29:21.346986] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf [2020-07-11 09:29:21.515802] I [gsyncd(monitor):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf [2020-07-11 09:29:21.953314] E [syncdutils(monitor):339:log_raise_exception] : FAIL: Traceback (most recent call last): File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 332, in main func(args) File "/usr/libexec/glusterfs/python/syncdaemon/subcmds.py", line 60, in subcmd_monitor return monitor.monitor(local, remote) File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 431, in monitor return Monitor().multiplex(*distribute(local, remote)) File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 392, in distribute sbricks = svol.bricks File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 515, in ff rv = f(self, *a, **kw) File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in bricks return [bparse(b) for b in self.get('brick')] File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in return [bparse(b) for b in self.get('brick')] File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 914, in bparse host, dirp = b.find("name").text.split(':', 2) ValueError: too many values to unpack (expected 2) I have raised the github issue[1] for the same. Any help wrt to this issue will help a lot. [1] - https://github.com/gluster/glusterfs/issues/1366 Thanks, Satheesaran S -------------- next part -------------- An HTML attachment was scrubbed... URL: From aravinda at kadalu.io Sun Jul 12 07:28:19 2020 From: aravinda at kadalu.io (Aravinda VK) Date: Sun, 12 Jul 2020 12:58:19 +0530 Subject: [Gluster-users] gluster geo-replications fails to sync with IPV6 hostnames In-Reply-To: References: Message-ID: Hi Satheesaran, Posted a patch to fix the parsing issue. Please check. https://review.gluster.org/c/glusterfs/+/24706 -- Aravinda Vishwanathapura https://kadalu.io > On 12-Jul-2020, at 8:14 AM, Satheesaran Sundaramoorthi wrote: > > Hello All, > > While testing with glusterfs -6.0-37, I found that the geo-rep session is getting created with IPv6 hostnames, > but the sync is not happening. I see traceback with gsyncd.log. 
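The ValueError in that traceback comes from unpacking the result of split(':', 2) into two names: a plain "host:/brick-path" string gives exactly two fields, but an IPv6 host contains colons of its own, so the split yields three fields and the unpack fails. A minimal reproduction in Python with a hypothetical brick string; the rsplit variant at the end is only a sketch of the idea behind the posted fix, not the patch itself:

# Reproduce the parsing failure seen in syncdutils.py bparse(),
# assuming brick strings of the form "<host>:<brick-path>".
ipv4_brick = "server1.example.com:/bricks/data"
ipv6_brick = "fe80::5054:ff:fe12:3456:/bricks/data"   # hypothetical IPv6 host

host, dirp = ipv4_brick.split(':', 2)       # fine: exactly two fields
try:
    host, dirp = ipv6_brick.split(':', 2)   # three fields -> ValueError
except ValueError as err:
    print(err)                               # too many values to unpack (expected 2)

# Splitting on the *last* colon keeps the address intact (sketch only).
host, dirp = ipv6_brick.rsplit(':', 1)
print(host, dirp)                            # fe80::5054:ff:fe12:3456 /bricks/data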
> > [2020-07-11 09:29:20.735708] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.346986] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.515802] I [gsyncd(monitor):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.953314] E [syncdutils(monitor):339:log_raise_exception] : FAIL: > Traceback (most recent call last): > File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 332, in main > func(args) > File "/usr/libexec/glusterfs/python/syncdaemon/subcmds.py", line 60, in subcmd_monitor > return monitor.monitor(local, remote) > File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 431, in monitor > return Monitor().multiplex(*distribute(local, remote)) > File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 392, in distribute > sbricks = svol.bricks > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 515, in ff > rv = f(self, *a, **kw) > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in bricks > return [bparse(b) for b in self.get('brick')] > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in > return [bparse(b) for b in self.get('brick')] > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 914, in bparse > host, dirp = b.find("name").text.split(':', 2) > ValueError: too many values to unpack (expected 2) > > > I have raised the github issue[1] for the same. Any help wrt to this issue will help a lot. > > [1] - https://github.com/gluster/glusterfs/issues/1366 > Thanks, > Satheesaran S > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From sasundar at redhat.com Mon Jul 13 03:41:10 2020 From: sasundar at redhat.com (Satheesaran Sundaramoorthi) Date: Mon, 13 Jul 2020 09:11:10 +0530 Subject: [Gluster-users] gluster geo-replications fails to sync with IPV6 hostnames In-Reply-To: References: Message-ID: Hello Aravinda, Thanks for that patch. I will check that out. -- Satheesaran S On Sun, Jul 12, 2020 at 12:58 PM Aravinda VK wrote: > Hi Satheesaran, > > Posted a patch to fix the parsing issue. Please check. > > https://review.gluster.org/c/glusterfs/+/24706 > > -- > Aravinda Vishwanathapura > https://kadalu.io > > On 12-Jul-2020, at 8:14 AM, Satheesaran Sundaramoorthi < > sasundar at redhat.com> wrote: > > Hello All, > > While testing with glusterfs -6.0-37, I found that the geo-rep session is > getting created with IPv6 hostnames, > but the sync is not happening. I see traceback with gsyncd.log. 
> > > [2020-07-11 09:29:20.735708] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.346986] I [gsyncd(config-get):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.515802] I [gsyncd(monitor):318:main] : Using session config file path=/var/lib/glusterd/geo-replication/testvol_slave.lab.eng.blr.redhat.com_svol/gsyncd.conf > [2020-07-11 09:29:21.953314] E [syncdutils(monitor):339:log_raise_exception] : FAIL: > Traceback (most recent call last): > File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 332, in main > func(args) > File "/usr/libexec/glusterfs/python/syncdaemon/subcmds.py", line 60, in subcmd_monitor > return monitor.monitor(local, remote) > File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 431, in monitor > return Monitor().multiplex(*distribute(local, remote)) > File "/usr/libexec/glusterfs/python/syncdaemon/monitor.py", line 392, in distribute > sbricks = svol.bricks > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 515, in ff > rv = f(self, *a, **kw) > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in bricks > return [bparse(b) for b in self.get('brick')] > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 916, in > return [bparse(b) for b in self.get('brick')] > File "/usr/libexec/glusterfs/python/syncdaemon/syncdutils.py", line 914, in bparse > host, dirp = b.find("name").text.split(':', 2) > ValueError: too many values to unpack (expected 2) > > > I have raised the github issue[1] for the same. Any help wrt to this issue will help a lot. > > [1] - https://github.com/gluster/glusterfs/issues/1366 > > Thanks, > > Satheesaran S > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jahernan at redhat.com Mon Jul 13 07:17:20 2020 From: jahernan at redhat.com (Xavi Hernandez) Date: Mon, 13 Jul 2020 09:17:20 +0200 Subject: [Gluster-users] gluster cmd output - how to format In-Reply-To: <87c430eb-56b6-57d2-ef2a-f974b69cc5fe@yahoo.co.uk> References: <87c430eb-56b6-57d2-ef2a-f974b69cc5fe.ref@yahoo.co.uk> <87c430eb-56b6-57d2-ef2a-f974b69cc5fe@yahoo.co.uk> Message-ID: Hi. If you need to parse input from a script, maybe it would be better to use xml output: # gluster --xml volume status This should be easily parsable by any standard xml library or tool. Xavi On Thu, Jul 2, 2020 at 3:32 PM lejeczek wrote: > hi guys > > Would you know if it's possible to format gluster cmd output? > What frustrates me personally is "forced" line wrapping, as > example of: > > $ gluster volume status > > many thanks, L. > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From shreyansh.shah at alpha-grep.com Mon Jul 13 07:44:50 2020 From: shreyansh.shah at alpha-grep.com (Shreyansh Shah) Date: Mon, 13 Jul 2020 13:14:50 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: Hi Barak, Hope you had a great weekend. Did you get a chance to look into the issue and figure out the problem? On Wed, Jul 8, 2020 at 11:32 AM Artem Russakovskii wrote: > I think it'd be extremely helpful if gluster had a feature to grab all the > necessary logs/debug info (maybe a few variations depending on the bug) so > that all the user would have to do is enter a simple command and have > gluster generate the whole bug report, ready to be sent to to the gluster > team. > > Sincerely, > Artem > > -- > Founder, Android Police , APK Mirror > , Illogical Robot LLC > beerpla.net | @ArtemR > > > On Tue, Jul 7, 2020 at 1:47 AM Shreyansh Shah < > shreyansh.shah at alpha-grep.com> wrote: > >> Sounds good, thank you. >> >> On Tue, Jul 7, 2020 at 2:12 PM Barak Sason Rofman >> wrote: >> >>> Thanks Shreyansh, >>> >>> I'll look into it, however I'll likely need some help from more senior >>> team members to perform RCA. >>> I'll update once I have new insights. >>> >>> My regards, >>> >>> On Tue, Jul 7, 2020 at 11:40 AM Shreyansh Shah < >>> shreyansh.shah at alpha-grep.com> wrote: >>> >>>> Hi Barak, >>>> Thanks for looking into this and helping me out, >>>> The fix-layout was successful, and I ran a rebalance after >>>> completion of fix-layout. >>>> The rebalance status though did show failure for 3 nodes. >>>> >>>> On Tue, Jul 7, 2020 at 2:07 PM Barak Sason Rofman >>>> wrote: >>>> >>>>> Greetings again Shreyansh, >>>>> >>>>> I'm indeed seeing a lot of errors in the log file - still unsure about >>>>> the RC. >>>>> You mentioned that prior to running rebalance you ran fix-layout, was >>>>> the fix-layout successful? >>>>> Another question - did you wait until fix-layout was completed before >>>>> running rebalance? >>>>> >>>>> My thanks, >>>>> >>>>> On Mon, Jul 6, 2020 at 9:33 PM Shreyansh Shah < >>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>> >>>>>> Hi, >>>>>> Attaching rebalance logs >>>>>> FYI, we ran "gluster rebalance fix-layout" followed by "gluster >>>>>> rebalance" on 20200701 and today we again ran "gluster rebalance fix-layout" >>>>>> >>>>>> >>>>>> PFA >>>>>> >>>>>> On Mon, Jul 6, 2020 at 11:08 PM Barak Sason Rofman < >>>>>> bsasonro at redhat.com> wrote: >>>>>> >>>>>>> I think it would be best. >>>>>>> As I can't say at this point where the problem is originating from, >>>>>>> brick logs might also be necessary (I assume I would have a better picture >>>>>>> once I have the rebalance logs). >>>>>>> >>>>>>> Cheers, >>>>>>> >>>>>>> On Mon, Jul 6, 2020 at 8:16 PM Shreyansh Shah < >>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>> >>>>>>>> Hi Barak, >>>>>>>> Can provide the rebalance logs. Do you require all the brick logs >>>>>>>> (14 in total)? >>>>>>>> >>>>>>>> On Mon, Jul 6, 2020 at 10:43 PM Barak Sason Rofman < >>>>>>>> bsasonro at redhat.com> wrote: >>>>>>>> >>>>>>>>> Greetings Shreyansh, >>>>>>>>> >>>>>>>>> Off-hand I can't come up with a reason for these failures. >>>>>>>>> In order to start looking into this, access to the full rebalance >>>>>>>>> logs is required (possibly brick logs as well). >>>>>>>>> Can you provide those? 
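One detail worth noting about the "Mismatching layouts" entries quoted in this thread: in the companion dht_layout_dir_mismatch lines the hash range itself is usually identical on both sides and only the last field differs, which looks like the layout commit hash, i.e. the client's cached layout is simply older than what fix-layout/rebalance wrote to disk. A quick way to check that on a sample line (the field interpretation is an assumption; the line is reassembled from the wrapped quote in this thread):

import re

# Sample dht_layout_dir_mismatch line from the client log in this thread.
line = ("[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: "
        "data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; "
        "disk layout - 2761060380 - 3067813815 - 4159036738")

m = re.search(r"inode layout - (\d+) - (\d+) - (\d+); "
              r"disk layout - (\d+) - (\d+) - (\d+)", line)
i_start, i_stop, i_hash, d_start, d_stop, d_hash = map(int, m.groups())

print("hash range matches: ", (i_start, i_stop) == (d_start, d_stop))  # True
print("commit field differs:", i_hash != d_hash)                        # True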
>>>>>>>>> >>>>>>>>> My regards, >>>>>>>>> >>>>>>>>> >>>>>>>>> On Mon, Jul 6, 2020 at 11:41 AM Shreyansh Shah < >>>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>>> >>>>>>>>>> Hi, >>>>>>>>>> Did anyone get a chance to look into this? >>>>>>>>>> >>>>>>>>>> On Thu, Jul 2, 2020 at 8:09 PM Shreyansh Shah < >>>>>>>>>> shreyansh.shah at alpha-grep.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi All, >>>>>>>>>>> >>>>>>>>>>> *We are facing "Mismatching layouts for ,gfid = " >>>>>>>>>>> errors.* >>>>>>>>>>> >>>>>>>>>>> We have a distributed glusterfs 5.10, no replication, 2 bricks >>>>>>>>>>> (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to >>>>>>>>>>> the existing setup. >>>>>>>>>>> Post that we did a rebalance fix-layout and then a rebalance >>>>>>>>>>> (which is currently still in progress). The status shows "failed" on >>>>>>>>>>> certain bricks but "in progress" for others. Adding output for gluster >>>>>>>>>>> rebalance status below. >>>>>>>>>>> >>>>>>>>>>> The glusterfs client logs are flooded with "Mismatching layouts >>>>>>>>>>> for ,gfid = " >>>>>>>>>>> The performance too seems to have degraded due to this, even >>>>>>>>>>> basic commands like `cd` and `ls` are taking more than a minute compared to >>>>>>>>>>> sub-second number before brick addition. >>>>>>>>>>> Apart from that we also experienced many binaries and files >>>>>>>>>>> giving error stale file handle error even though the files were present. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> *gluster rebalance status :* >>>>>>>>>>> >>>>>>>>>>> Node Rebalanced-files size scanned failures >>>>>>>>>>> skipped status run time in h:m:s >>>>>>>>>>> --------- ----------- ----------- ----------- >>>>>>>>>>> ----------- ----------- ------------ -------------- >>>>>>>>>>> localhost 176 3.5GB 12790 >>>>>>>>>>> 0 8552 in progress 21:36:01 >>>>>>>>>>> 10.132.0.72 8232 394.8GB 19995 >>>>>>>>>>> 21 26 failed 14:50:30 >>>>>>>>>>> 10.132.0.44 12625 1.0TB 50023 >>>>>>>>>>> 1 10202 in progress 21:36:00 >>>>>>>>>>> 10.132.0.3 21982 956.8GB 79145 >>>>>>>>>>> 1 34571 in progress 21:36:00 >>>>>>>>>>> 10.132.0.9 7975 355.8GB 20157 >>>>>>>>>>> 6 1522 failed 14:51:45 >>>>>>>>>>> 10.132.0.73 6293 394.5GB 26414 >>>>>>>>>>> 151 8085 failed 14:51:45 >>>>>>>>>>> 10.132.0.70 6480 477.1GB 21058 >>>>>>>>>>> 27 1787 failed 14:50:32 >>>>>>>>>>> Estimated time left for rebalance to complete : 130:56:28 >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> *Logs from one of the clients below:* >>>>>>>>>>> >>>>>>>>>>> [2020-07-02 12:30:14.971916] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk >>>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:14.971935] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.032013] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk >>>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.032059] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.032107] I [MSGID: 109064] >>>>>>>>>>> 
[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.032153] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >>>>>>>>>>> [2020-07-02 12:30:15.093329] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk >>>>>>>>>>> layout - 2454306944 - 2761060379 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.093373] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.093460] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk >>>>>>>>>>> layout - 2761060380 - 3067813815 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.093515] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.151063] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk >>>>>>>>>>> layout - 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.151108] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.151149] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.151162] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >>>>>>>>>>> [2020-07-02 12:30:15.424321] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk >>>>>>>>>>> layout - 920400036 - 1227153471 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424380] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424456] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk >>>>>>>>>>> layout - 1840730208 - 2147483643 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424484] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424525] I [MSGID: 109064] >>>>>>>>>>> 
[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk >>>>>>>>>>> layout - 1533976772 - 1840730207 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424542] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:15.424596] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk >>>>>>>>>>> layout - 613646600 - 920400035 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:15.424607] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >>>>>>>>>>> [2020-07-02 12:30:16.004482] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on >>>>>>>>>>> data-client-7 (hashed subvol is data-client-17) >>>>>>>>>>> [2020-07-02 12:30:16.005523] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:16.531047] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:16.532086] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:18.733229] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>>> [2020-07-02 12:30:18.734421] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:19.171930] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:19.172901] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_CDS_2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.028495] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on >>>>>>>>>>> data-client-6 (hashed subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:21.029836] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> 
returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.127648] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>>> [2020-07-02 12:30:21.128713] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.201126] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on >>>>>>>>>>> data-client-15 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.201928] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.566158] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>>> [2020-07-02 12:30:21.567123] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.649357] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on >>>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:21.661381] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_4_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.748937] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>>> on data-client-15 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.749481] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_4_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:21.898593] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on >>>>>>>>>>> data-client-14 (hashed subvol is data-client-7) >>>>>>>>>>> [2020-07-02 12:30:21.899442] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_6_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.039337] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile 
/processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:22.040086] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/BSE_EQ_6_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.501877] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>>> on data-client-15 (hashed subvol is data-client-8) >>>>>>>>>>> [2020-07-02 12:30:22.502712] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:22.782577] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 >>>>>>>>>>> (hashed subvol is data-client-6) >>>>>>>>>>> [2020-07-02 12:30:22.783777] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECDS1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.146847] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-9) >>>>>>>>>>> [2020-07-02 12:30:23.148009] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.229290] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed >>>>>>>>>>> subvol is data-client-6) >>>>>>>>>>> [2020-07-02 12:30:23.230151] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:23.889520] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on >>>>>>>>>>> data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:23.896618] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.093017] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed >>>>>>>>>>> subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:24.094117] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 
4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.345257] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:24.346234] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:24.425835] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed >>>>>>>>>>> subvol is data-client-15) >>>>>>>>>>> [2020-07-02 12:30:24.426880] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSECM3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.158718] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:25.159619] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.531479] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed >>>>>>>>>>> subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:25.540569] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.771692] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>>> on data-client-11 (hashed subvol is data-client-3) >>>>>>>>>>> [2020-07-02 12:30:25.772610] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:25.866118] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 >>>>>>>>>>> (hashed subvol is data-client-8) >>>>>>>>>>> [2020-07-02 12:30:25.866917] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:26.424386] I [MSGID: 109045] 
>>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>>> on data-client-9 (hashed subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:26.425309] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:26.818852] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 >>>>>>>>>>> (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:26.819890] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:27.352405] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>>> on data-client-10 (hashed subvol is data-client-2) >>>>>>>>>>> [2020-07-02 12:30:27.352914] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:27.521286] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed >>>>>>>>>>> subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:27.522325] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:28.566634] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>>> on data-client-2 (hashed subvol is data-client-11) >>>>>>>>>>> [2020-07-02 12:30:28.579295] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO5_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:28.958028] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>>> on data-client-7 (hashed subvol is data-client-16) >>>>>>>>>>> [2020-07-02 12:30:28.959102] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_DATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.012429] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed >>>>>>>>>>> subvol is 
data-client-15) >>>>>>>>>>> [2020-07-02 12:30:29.013416] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.396716] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on >>>>>>>>>>> data-client-17 (hashed subvol is data-client-10) >>>>>>>>>>> [2020-07-02 12:30:29.397740] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSDATA.dat >>>>>>>>>>> [2020-07-02 12:30:29.556312] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed >>>>>>>>>>> subvol is data-client-18) >>>>>>>>>>> [2020-07-02 12:30:29.557197] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >>>>>>>>>>> [2020-07-02 12:30:30.605354] I [MSGID: 109045] >>>>>>>>>>> [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting >>>>>>>>>>> deletion of stale linkfile >>>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 >>>>>>>>>>> (hashed subvol is data-client-19) >>>>>>>>>>> [2020-07-02 12:30:30.606117] I [MSGID: 109069] >>>>>>>>>>> [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink >>>>>>>>>>> returned with op_ret -> 0 and op-errno -> 0 for >>>>>>>>>>> /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >>>>>>>>>>> [2020-07-02 12:30:31.559206] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - >>>>>>>>>>> 613576736 - 920330171 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.559255] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>>> [2020-07-02 12:30:31.569025] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - >>>>>>>>>>> 920330172 - 1227083607 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.569067] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >>>>>>>>>>> [2020-07-02 12:30:31.701849] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - >>>>>>>>>>> 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.701895] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>>> [2020-07-02 12:30:31.738464] I [MSGID: 109064] >>>>>>>>>>> 
[dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - >>>>>>>>>>> 3681390552 - 3988143987 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.738507] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX, gfid = >>>>>>>>>>> fff324f2-f855-4881-b77c-81e856522373 >>>>>>>>>>> [2020-07-02 12:30:31.857102] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk >>>>>>>>>>> layout - 3067883680 - 3374637115 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.857147] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.857180] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk >>>>>>>>>>> layout - 3374637116 - 3681390551 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.857197] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.917705] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 >>>>>>>>>>> - 306753435 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.917781] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:31.917855] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk >>>>>>>>>>> layout - 3988213852 - 4294967295 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:31.917874] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = >>>>>>>>>>> f8447150-4801-4188-add9-ea295bb88729 >>>>>>>>>>> [2020-07-02 12:30:32.390945] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - >>>>>>>>>>> 3681460416 - 3988213851 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:32.390998] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> /processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>>> [2020-07-02 12:30:32.391056] I [MSGID: 109064] >>>>>>>>>>> [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: >>>>>>>>>>> data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - >>>>>>>>>>> 3988213852 - 4294967295 - 4159036738 >>>>>>>>>>> [2020-07-02 12:30:32.391075] I [MSGID: 109018] >>>>>>>>>>> [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for >>>>>>>>>>> 
/processed_data/Indexes/NSEINDEX/NIFTY, gfid = >>>>>>>>>>> b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >>>>>>>>>>> [2020-07-02 12:33:50.915279] I [MSGID: 109066] >>>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>>> /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 >>>>>>>>>>> (2cb54500-814d-4e85-83e7-e33d9440b18d) >>>>>>>>>>> (hash=data-client-6/cache=data-client-18) => >>>>>>>>>>> /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) >>>>>>>>>>> (hash=data-client-6/cache=) >>>>>>>>>>> [2020-07-02 12:34:09.799586] I [MSGID: 109066] >>>>>>>>>>> [dht-rename.c:1922:dht_rename] 4-data-dht: renaming >>>>>>>>>>> /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k >>>>>>>>>>> (99938ee6-6986-4123-9d72-ec09e2310b4f) >>>>>>>>>>> (hash=data-client-17/cache=data-client-18) => >>>>>>>>>>> /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) >>>>>>>>>>> (hash=data-client-17/cache=) >>>>>>>>>>> .... >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Please look into this at top-priority if possible. >>>>>>>>>>> Let me know if anything else is required. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> Regards, >>>>>>>>>>> Shreyansh Shah >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>>> Regards, >>>>>>>>>> Shreyansh Shah >>>>>>>>>> ________ >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Community Meeting Calendar: >>>>>>>>>> >>>>>>>>>> Schedule - >>>>>>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>>>>>> Bridge: https://bluejeans.com/441850968 >>>>>>>>>> >>>>>>>>>> Gluster-users mailing list >>>>>>>>>> Gluster-users at gluster.org >>>>>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> *Barak Sason Rofman* >>>>>>>>> >>>>>>>>> Gluster Storage Development >>>>>>>>> >>>>>>>>> Red Hat Israel >>>>>>>>> >>>>>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>>>>> >>>>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>>>> M: *+972-52-4326355* >>>>>>>>> @RedHat Red Hat >>>>>>>>> Red Hat >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Regards, >>>>>>>> Shreyansh Shah >>>>>>>> >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> *Barak Sason Rofman* >>>>>>> >>>>>>> Gluster Storage Development >>>>>>> >>>>>>> Red Hat Israel >>>>>>> >>>>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>>>> >>>>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>>>> M: *+972-52-4326355* >>>>>>> @RedHat Red Hat >>>>>>> Red Hat >>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Regards, >>>>>> Shreyansh Shah >>>>>> >>>>> >>>>> >>>>> -- >>>>> *Barak Sason Rofman* >>>>> >>>>> Gluster Storage Development >>>>> >>>>> Red Hat Israel >>>>> >>>>> 34 Jerusalem rd. Ra'anana, 43501 >>>>> >>>>> bsasonro at redhat.com T: *+972-9-7692304* >>>>> M: *+972-52-4326355* >>>>> @RedHat Red Hat >>>>> Red Hat >>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Regards, >>>> Shreyansh Shah >>>> >>> >>> >>> -- >>> *Barak Sason Rofman* >>> >>> Gluster Storage Development >>> >>> Red Hat Israel >>> >>> 34 Jerusalem rd. 
Ra'anana, 43501 >>> >>> bsasonro at redhat.com T: *+972-9-7692304* >>> M: *+972-52-4326355* >>> @RedHat Red Hat >>> Red Hat >>> >>> >>> >> >> >> -- >> Regards, >> Shreyansh Shah >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> > -- Regards, Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From spalai at redhat.com Mon Jul 13 08:05:26 2020 From: spalai at redhat.com (Susant Palai) Date: Mon, 13 Jul 2020 13:35:26 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: Message-ID: <2A780DA5-52D9-4113-9456-F08C42B31B78@redhat.com> The log messages are fine. Since you added a new brick, the client is responding to that by syncing its in-memory layout with latest server layout. The performance drop could be because of locks taken during this layout sync. > On 02-Jul-2020, at 20:09, Shreyansh Shah wrote: > > Hi All, > > We are facing "Mismatching layouts for ,gfid = " errors. > > We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to the existing setup. > Post that we did a rebalance fix-layout and then a rebalance (which is currently still in progress). The status shows "failed" on certain bricks but "in progress" for others. Adding output for gluster rebalance status below. > > The glusterfs client logs are flooded with "Mismatching layouts for ,gfid = " > The performance too seems to have degraded due to this, even basic commands like `cd` and `ls` are taking more than a minute compared to sub-second number before brick addition. > Apart from that we also experienced many binaries and files giving error stale file handle error even though the files were present. 
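(A hedged aside, not taken from the thread itself: the volume name "data" below is only inferred from the "data-dht" translator name in the logs, and the brick path is a placeholder. The "disk layout" values printed in these messages are read from the trusted.glusterfs.dht xattr of the directory on each brick, so the on-disk layout and the rebalance state can be checked directly on the servers:

$ gluster volume rebalance data status
$ grep -i error /var/log/glusterfs/data-rebalance.log
$ getfattr -n trusted.glusterfs.dht -e hex /path/to/brick/raw_data/BSE_EOBI

If the xattr ranges already agree across the bricks, the client-side message only indicates that the client's cached copy of the layout is older than what is on disk.)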
> > > gluster rebalance status : > > Node Rebalanced-files size scanned failures skipped status run time in h:m:s > --------- ----------- ----------- ----------- ----------- ----------- ------------ -------------- > localhost 176 3.5GB 12790 0 8552 in progress 21:36:01 > 10.132.0.72 8232 394.8GB 19995 21 26 failed 14:50:30 > 10.132.0.44 12625 1.0TB 50023 1 10202 in progress 21:36:00 > 10.132.0.3 21982 956.8GB 79145 1 34571 in progress 21:36:00 > 10.132.0.9 7975 355.8GB 20157 6 1522 failed 14:51:45 > 10.132.0.73 6293 394.5GB 26414 151 8085 failed 14:51:45 > 10.132.0.70 6480 477.1GB 21058 27 1787 failed 14:50:32 > Estimated time left for rebalance to complete : 130:56:28 > > > Logs from one of the clients below: > > [2020-07-02 12:30:14.971916] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:14.971935] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032013] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.032059] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032107] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.032153] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.093329] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk layout - 2454306944 - 2761060379 - 4159036738 > [2020-07-02 12:30:15.093373] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.093460] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:15.093515] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151063] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.151108] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151149] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.151162] I [MSGID: 109018] 
[dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.424321] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk layout - 920400036 - 1227153471 - 4159036738 > [2020-07-02 12:30:15.424380] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424456] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk layout - 1840730208 - 2147483643 - 4159036738 > [2020-07-02 12:30:15.424484] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424525] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk layout - 1533976772 - 1840730207 - 4159036738 > [2020-07-02 12:30:15.424542] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424596] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk layout - 613646600 - 920400035 - 4159036738 > [2020-07-02 12:30:15.424607] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:16.004482] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on data-client-7 (hashed subvol is data-client-17) > [2020-07-02 12:30:16.005523] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_DATA.dat > [2020-07-02 12:30:16.531047] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:16.532086] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_METADATA.dat > [2020-07-02 12:30:18.733229] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:18.734421] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_DATA.dat > [2020-07-02 12:30:19.171930] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:19.172901] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink 
returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_METADATA.dat > [2020-07-02 12:30:21.028495] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:21.029836] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_DATA.dat > [2020-07-02 12:30:21.127648] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:21.128713] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_METADATA.dat > [2020-07-02 12:30:21.201126] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.201928] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_DATA.dat > [2020-07-02 12:30:21.566158] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat on data-client-7 (hashed subvol is data-client-16) > [2020-07-02 12:30:21.567123] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_METADATA.dat > [2020-07-02 12:30:21.649357] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:21.661381] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_DATA.dat > [2020-07-02 12:30:21.748937] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat on data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.749481] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_METADATA.dat > [2020-07-02 12:30:21.898593] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on data-client-14 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.899442] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_DATA.dat > [2020-07-02 12:30:22.039337] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:22.040086] I [MSGID: 109069] 
[dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_METADATA.dat > [2020-07-02 12:30:22.501877] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat on data-client-15 (hashed subvol is data-client-8) > [2020-07-02 12:30:22.502712] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_DATA.dat > [2020-07-02 12:30:22.782577] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 (hashed subvol is data-client-6) > [2020-07-02 12:30:22.783777] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_METADATA.dat > [2020-07-02 12:30:23.146847] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:23.148009] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_DATA.dat > [2020-07-02 12:30:23.229290] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed subvol is data-client-6) > [2020-07-02 12:30:23.230151] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_METADATA.dat > [2020-07-02 12:30:23.889520] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:23.896618] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_DATA.dat > [2020-07-02 12:30:24.093017] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:24.094117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_METADATA.dat > [2020-07-02 12:30:24.345257] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on data-client-17 (hashed subvol is data-client-10) > [2020-07-02 12:30:24.346234] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_DATA.dat > [2020-07-02 12:30:24.425835] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile 
/processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:24.426880] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_METADATA.dat > [2020-07-02 12:30:25.158718] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:25.159619] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_DATA.dat > [2020-07-02 12:30:25.531479] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed subvol is data-client-10) > [2020-07-02 12:30:25.540569] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_METADATA.dat > [2020-07-02 12:30:25.771692] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:25.772610] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_DATA.dat > [2020-07-02 12:30:25.866118] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 (hashed subvol is data-client-8) > [2020-07-02 12:30:25.866917] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_METADATA.dat > [2020-07-02 12:30:26.424386] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:26.425309] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_DATA.dat > [2020-07-02 12:30:26.818852] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:26.819890] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_METADATA.dat > [2020-07-02 12:30:27.352405] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:27.352914] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_DATA.dat > [2020-07-02 
12:30:27.521286] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed subvol is data-client-18) > [2020-07-02 12:30:27.522325] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_METADATA.dat > [2020-07-02 12:30:28.566634] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat on data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:28.579295] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO5_DATA.dat > [2020-07-02 12:30:28.958028] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat on data-client-7 (hashed subvol is data-client-16) > [2020-07-02 12:30:28.959102] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_DATA.dat > [2020-07-02 12:30:29.012429] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:29.013416] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_METADATA.dat > [2020-07-02 12:30:29.396716] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on data-client-17 (hashed subvol is data-client-10) > [2020-07-02 12:30:29.397740] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSDATA.dat > [2020-07-02 12:30:29.556312] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:29.557197] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat > [2020-07-02 12:30:30.605354] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:30.606117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat > [2020-07-02 12:30:31.559206] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - 613576736 - 920330171 - 4159036738 > [2020-07-02 12:30:31.559255] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for 
/processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.569025] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - 920330172 - 1227083607 - 4159036738 > [2020-07-02 12:30:31.569067] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.701849] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.701895] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.738464] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:31.738507] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.857102] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk layout - 3067883680 - 3374637115 - 4159036738 > [2020-07-02 12:30:31.857147] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.857180] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.857197] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917705] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 - 306753435 - 4159036738 > [2020-07-02 12:30:31.917781] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917855] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk layout - 3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:31.917874] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:32.390945] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - 3681460416 - 3988213851 - 4159036738 > [2020-07-02 12:30:32.390998] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:30:32.391056] I 
[MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - 3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:32.391075] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:33:50.915279] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 (2cb54500-814d-4e85-83e7-e33d9440b18d) (hash=data-client-6/cache=data-client-18) => /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) (hash=data-client-6/cache=) > [2020-07-02 12:34:09.799586] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k (99938ee6-6986-4123-9d72-ec09e2310b4f) (hash=data-client-17/cache=data-client-18) => /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) (hash=data-client-17/cache=) > .... > > > Please look into this at top-priority if possible. > Let me know if anything else is required. > > > -- > Regards, > Shreyansh Shah > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From shreyansh.shah at alpha-grep.com Mon Jul 13 10:40:21 2020 From: shreyansh.shah at alpha-grep.com (Shreyansh Shah) Date: Mon, 13 Jul 2020 16:10:21 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: <2A780DA5-52D9-4113-9456-F08C42B31B78@redhat.com> References: <2A780DA5-52D9-4113-9456-F08C42B31B78@redhat.com> Message-ID: Hi Susant, Thank you for your response. We are experience performance impact till date, is this expected? Is there any way though which we can sync the complete new layout on the clients? On Mon, Jul 13, 2020 at 1:35 PM Susant Palai wrote: > The log messages are fine. Since you added a new brick, the client is > responding to that by syncing its in-memory layout with latest server > layout. The performance drop could be because of locks taken during this > layout sync. > > > On 02-Jul-2020, at 20:09, Shreyansh Shah > wrote: > > Hi All, > > *We are facing "Mismatching layouts for ,gfid = " errors.* > > We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) > on each node, 7 nodes in total. We added new bricks yesterday to the > existing setup. > Post that we did a rebalance fix-layout and then a rebalance (which is > currently still in progress). The status shows "failed" on certain bricks > but "in progress" for others. Adding output for gluster rebalance status > below. > > The glusterfs client logs are flooded with "Mismatching layouts for > ,gfid = " > The performance too seems to have degraded due to this, even basic > commands like `cd` and `ls` are taking more than a minute compared to > sub-second number before brick addition. > Apart from that we also experienced many binaries and files giving error > stale file handle error even though the files were present. 
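(A hedged sketch, with placeholder server name and mount point: the client refreshes its in-memory layout per directory when it revalidates that directory against the bricks, which is what the INFO-level "Mismatching layouts" lines record, so the messages taper off on their own once the directories have been revisited under the new layout. A fresh mount starts with no cached layout at all, so remounting the client after fix-layout has completed is one blunt way to drop every stale copy at once:

$ umount /mnt/data
$ mount -t glusterfs gfs-server1:/data /mnt/data

The bricks that show "failed" in the rebalance status are a separate problem; their /var/log/glusterfs/data-rebalance.log (volume name again assumed) is the place to look for the reason.)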
> > > *gluster rebalance status :* > > Node Rebalanced-files size scanned failures > skipped status run time in h:m:s > --------- ----------- ----------- ----------- ----------- > ----------- ------------ -------------- > localhost 176 3.5GB 12790 0 > 8552 in progress 21:36:01 > 10.132.0.72 8232 394.8GB 19995 21 > 26 failed 14:50:30 > 10.132.0.44 12625 1.0TB 50023 1 > 10202 in progress 21:36:00 > 10.132.0.3 21982 956.8GB 79145 1 > 34571 in progress 21:36:00 > 10.132.0.9 7975 355.8GB 20157 6 > 1522 failed 14:51:45 > 10.132.0.73 6293 394.5GB 26414 151 > 8085 failed 14:51:45 > 10.132.0.70 6480 477.1GB 21058 27 > 1787 failed 14:50:32 > Estimated time left for rebalance to complete : 130:56:28 > > > *Logs from one of the clients below:* > > [2020-07-02 12:30:14.971916] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk > layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:14.971935] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032013] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk > layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.032059] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.032107] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk > layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.032153] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc > [2020-07-02 12:30:15.093329] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk > layout - 2454306944 - 2761060379 - 4159036738 > [2020-07-02 12:30:15.093373] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.093460] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk > layout - 2761060380 - 3067813815 - 4159036738 > [2020-07-02 12:30:15.093515] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151063] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk > layout - 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:15.151108] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.151149] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk > layout - 
3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:15.151162] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 > [2020-07-02 12:30:15.424321] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk > layout - 920400036 - 1227153471 - 4159036738 > [2020-07-02 12:30:15.424380] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424456] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk > layout - 1840730208 - 2147483643 - 4159036738 > [2020-07-02 12:30:15.424484] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424525] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk > layout - 1533976772 - 1840730207 - 4159036738 > [2020-07-02 12:30:15.424542] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:15.424596] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk > layout - 613646600 - 920400035 - 4159036738 > [2020-07-02 12:30:15.424607] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 > [2020-07-02 12:30:16.004482] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on > data-client-7 (hashed subvol is data-client-17) > [2020-07-02 12:30:16.005523] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_1_DATA.dat > [2020-07-02 12:30:16.531047] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat > on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:16.532086] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_1_METADATA.dat > [2020-07-02 12:30:18.733229] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on > data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:18.734421] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_2_DATA.dat > [2020-07-02 12:30:19.171930] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile 
/processed_data/20200630/BSE_CDS_2_METADATA.dat > on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:19.172901] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_CDS_2_METADATA.dat > [2020-07-02 12:30:21.028495] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on > data-client-6 (hashed subvol is data-client-15) > [2020-07-02 12:30:21.029836] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_2_DATA.dat > [2020-07-02 12:30:21.127648] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat > on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:21.128713] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_2_METADATA.dat > [2020-07-02 12:30:21.201126] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on > data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.201928] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_3_DATA.dat > [2020-07-02 12:30:21.566158] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat > on data-client-7 (hashed subvol is data-client-16) > [2020-07-02 12:30:21.567123] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_3_METADATA.dat > [2020-07-02 12:30:21.649357] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on > data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:21.661381] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_4_DATA.dat > [2020-07-02 12:30:21.748937] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat > on data-client-15 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.749481] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_4_METADATA.dat > [2020-07-02 12:30:21.898593] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on > data-client-14 (hashed subvol is data-client-7) > [2020-07-02 12:30:21.899442] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > 
/processed_data/20200630/BSE_EQ_6_DATA.dat > [2020-07-02 12:30:22.039337] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat > on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:22.040086] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/BSE_EQ_6_METADATA.dat > [2020-07-02 12:30:22.501877] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat > on data-client-15 (hashed subvol is data-client-8) > [2020-07-02 12:30:22.502712] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECDS1_DATA.dat > [2020-07-02 12:30:22.782577] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 > (hashed subvol is data-client-6) > [2020-07-02 12:30:22.783777] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECDS1_METADATA.dat > [2020-07-02 12:30:23.146847] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on > data-client-17 (hashed subvol is data-client-9) > [2020-07-02 12:30:23.148009] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM1_DATA.dat > [2020-07-02 12:30:23.229290] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed > subvol is data-client-6) > [2020-07-02 12:30:23.230151] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM1_METADATA.dat > [2020-07-02 12:30:23.889520] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_DATA.dat on > data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:23.896618] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM2_DATA.dat > [2020-07-02 12:30:24.093017] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:24.094117] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM2_METADATA.dat > [2020-07-02 12:30:24.345257] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on > data-client-17 
(hashed subvol is data-client-10) > [2020-07-02 12:30:24.346234] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM3_DATA.dat > [2020-07-02 12:30:24.425835] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:24.426880] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSECM3_METADATA.dat > [2020-07-02 12:30:25.158718] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat > on data-client-9 (hashed subvol is data-client-19) > [2020-07-02 12:30:25.159619] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO1_DATA.dat > [2020-07-02 12:30:25.531479] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed > subvol is data-client-10) > [2020-07-02 12:30:25.540569] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO1_METADATA.dat > [2020-07-02 12:30:25.771692] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat > on data-client-11 (hashed subvol is data-client-3) > [2020-07-02 12:30:25.772610] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO2_DATA.dat > [2020-07-02 12:30:25.866118] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 > (hashed subvol is data-client-8) > [2020-07-02 12:30:25.866917] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO2_METADATA.dat > [2020-07-02 12:30:26.424386] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat > on data-client-9 (hashed subvol is data-client-18) > [2020-07-02 12:30:26.425309] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO3_DATA.dat > [2020-07-02 12:30:26.818852] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 > (hashed subvol is data-client-2) > [2020-07-02 12:30:26.819890] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > 
/processed_data/20200630/MCASTNSEFNO3_METADATA.dat > [2020-07-02 12:30:27.352405] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat > on data-client-10 (hashed subvol is data-client-2) > [2020-07-02 12:30:27.352914] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO4_DATA.dat > [2020-07-02 12:30:27.521286] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed > subvol is data-client-18) > [2020-07-02 12:30:27.522325] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO4_METADATA.dat > [2020-07-02 12:30:28.566634] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat > on data-client-2 (hashed subvol is data-client-11) > [2020-07-02 12:30:28.579295] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO5_DATA.dat > [2020-07-02 12:30:28.958028] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat > on data-client-7 (hashed subvol is data-client-16) > [2020-07-02 12:30:28.959102] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO6_DATA.dat > [2020-07-02 12:30:29.012429] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed > subvol is data-client-15) > [2020-07-02 12:30:29.013416] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/MCASTNSEFNO6_METADATA.dat > [2020-07-02 12:30:29.396716] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on > data-client-17 (hashed subvol is data-client-10) > [2020-07-02 12:30:29.397740] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSEFO_BSE_TSDATA.dat > [2020-07-02 12:30:29.556312] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed > subvol is data-client-18) > [2020-07-02 12:30:29.557197] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat > [2020-07-02 12:30:30.605354] I [MSGID: 109045] > [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting > deletion of stale linkfile > /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat 
on data-client-9 > (hashed subvol is data-client-19) > [2020-07-02 12:30:30.606117] I [MSGID: 109069] > [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink > returned with op_ret -> 0 and op-errno -> 0 for > /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat > [2020-07-02 12:30:31.559206] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - > 613576736 - 920330171 - 4159036738 > [2020-07-02 12:30:31.559255] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.569025] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - > 920330172 - 1227083607 - 4159036738 > [2020-07-02 12:30:31.569067] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 > [2020-07-02 12:30:31.701849] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - > 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.701895] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX, gfid = > fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.738464] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - > 3681390552 - 3988143987 - 4159036738 > [2020-07-02 12:30:31.738507] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX, gfid = > fff324f2-f855-4881-b77c-81e856522373 > [2020-07-02 12:30:31.857102] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk > layout - 3067883680 - 3374637115 - 4159036738 > [2020-07-02 12:30:31.857147] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.857180] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk > layout - 3374637116 - 3681390551 - 4159036738 > [2020-07-02 12:30:31.857197] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917705] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 > - 306753435 - 4159036738 > [2020-07-02 12:30:31.917781] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:31.917855] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk > layout - 
3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:31.917874] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = > f8447150-4801-4188-add9-ea295bb88729 > [2020-07-02 12:30:32.390945] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - > 3681460416 - 3988213851 - 4159036738 > [2020-07-02 12:30:32.390998] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/NIFTY, gfid = > b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:30:32.391056] I [MSGID: 109064] > [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: > data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - > 3988213852 - 4294967295 - 4159036738 > [2020-07-02 12:30:32.391075] I [MSGID: 109018] > [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for > /processed_data/Indexes/NSEINDEX/NIFTY, gfid = > b2d4deb7-c58c-4046-b6f2-7c7f44d71311 > [2020-07-02 12:33:50.915279] I [MSGID: 109066] > [dht-rename.c:1922:dht_rename] 4-data-dht: renaming > /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 > (2cb54500-814d-4e85-83e7-e33d9440b18d) > (hash=data-client-6/cache=data-client-18) => > /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) > (hash=data-client-6/cache=) > [2020-07-02 12:34:09.799586] I [MSGID: 109066] > [dht-rename.c:1922:dht_rename] 4-data-dht: renaming > /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k > (99938ee6-6986-4123-9d72-ec09e2310b4f) > (hash=data-client-17/cache=data-client-18) => > /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) > (hash=data-client-17/cache=) > .... > > > Please look into this at top-priority if possible. > Let me know if anything else is required. > > > -- > Regards, > Shreyansh Shah > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > > -- Regards, Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... URL: From spalai at redhat.com Mon Jul 13 10:49:58 2020 From: spalai at redhat.com (Susant Palai) Date: Mon, 13 Jul 2020 16:19:58 +0530 Subject: [Gluster-users] "Mismatching layouts" in glusterfs client logs after new brick addition and rebalance In-Reply-To: References: <2A780DA5-52D9-4113-9456-F08C42B31B78@redhat.com> Message-ID: <9D7DBD96-1623-497E-9079-207C433AE4FC@redhat.com> I missed the latency number in the report. You said it takes around a minute to do ?CD?. Is that still the case? If yes, could you get the follow these steps to get statedump. a - kill -USR2 (enable latency capture) b - Initiate the operation (Do once for "CD dir") c - take statedump (kill -USR1 or gluster commands) d - kill -USR2 (disablee latency capture) > On 13-Jul-2020, at 16:10, Shreyansh Shah wrote: > > Hi Susant, > Thank you for your response. We are experience performance impact till date, is this expected? > Is there any way though which we can sync the complete new layout on the clients? The client gets its layout updated with a lookup. Generally when you do a ls on directory, it will sync the layout. (You can trigger ls -lR once rebalance is complete. But you need not necessarily do this. 
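As a rough sketch of both suggestions above (the optional layout refresh just mentioned, and the statedump latency capture in steps a-d); the mount point, volume name and process-matching pattern are placeholders, so adjust them for the actual client:

$ ls -lR /mnt/data > /dev/null                            # optional: walk the tree once rebalance completes so lookups refresh the client's in-memory layout
$ CLIENT_PID=$(pgrep -f 'glusterfs.*volfile-id.*data')    # PID of the fuse client for the volume (the pattern is only an example)
$ kill -USR2 "$CLIENT_PID"                                # (a) enable latency capture
$ cd /mnt/data/path/to/slow/dir                           # (b) reproduce the slow 'cd' once
$ kill -USR1 "$CLIENT_PID"                                # (c) take a statedump, usually written under /var/run/gluster/
$ kill -USR2 "$CLIENT_PID"                                # (d) disable latency capture again

The resulting statedump file can then be shared on the list for analysis.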
The lookup gets triggered from kernel as part of a fop and the directory will be updated with the layout) > On Mon, Jul 13, 2020 at 1:35 PM Susant Palai > wrote: > The log messages are fine. Since you added a new brick, the client is responding to that by syncing its in-memory layout with latest server layout. The performance drop could be because of locks taken during this layout sync. > > >> On 02-Jul-2020, at 20:09, Shreyansh Shah > wrote: >> >> Hi All, >> >> We are facing "Mismatching layouts for ,gfid = " errors. >> >> We have a distributed glusterfs 5.10, no replication, 2 bricks (4TB each) on each node, 7 nodes in total. We added new bricks yesterday to the existing setup. >> Post that we did a rebalance fix-layout and then a rebalance (which is currently still in progress). The status shows "failed" on certain bricks but "in progress" for others. Adding output for gluster rebalance status below. >> >> The glusterfs client logs are flooded with "Mismatching layouts for ,gfid = " >> The performance too seems to have degraded due to this, even basic commands like `cd` and `ls` are taking more than a minute compared to sub-second number before brick addition. >> Apart from that we also experienced many binaries and files giving error stale file handle error even though the files were present. >> >> >> gluster rebalance status : >> >> Node Rebalanced-files size scanned failures skipped status run time in h:m:s >> --------- ----------- ----------- ----------- ----------- ----------- ------------ -------------- >> localhost 176 3.5GB 12790 0 8552 in progress 21:36:01 >> 10.132.0.72 8232 394.8GB 19995 21 26 failed 14:50:30 >> 10.132.0.44 12625 1.0TB 50023 1 10202 in progress 21:36:00 >> 10.132.0.3 21982 956.8GB 79145 1 34571 in progress 21:36:00 >> 10.132.0.9 7975 355.8GB 20157 6 1522 failed 14:51:45 >> 10.132.0.73 6293 394.5GB 26414 151 8085 failed 14:51:45 >> 10.132.0.70 6480 477.1GB 21058 27 1787 failed 14:50:32 >> Estimated time left for rebalance to complete : 130:56:28 >> >> >> Logs from one of the clients below: >> >> [2020-07-02 12:30:14.971916] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3995747641; disk layout - 2761060380 - 3067813815 - 4159036738 >> [2020-07-02 12:30:14.971935] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.032013] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3995747641; disk layout - 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:15.032059] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.032107] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:15.032153] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI, gfid = b40e4c58-67b3-4d9e-b708-1ebd23f50dcc >> [2020-07-02 12:30:15.093329] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 2454306944 - 2761060379 - 3997647794; disk layout - 2454306944 - 2761060379 - 
4159036738 >> [2020-07-02 12:30:15.093373] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.093460] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 2761060380 - 3067813815 - 3997647794; disk layout - 2761060380 - 3067813815 - 4159036738 >> [2020-07-02 12:30:15.093515] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.151063] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 3997647794; disk layout - 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:15.151108] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.151149] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 3997647794; disk layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:15.151162] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/BSE_EOBI/20200630, gfid = 42a506b3-7aff-4935-8ef7-ecb8877c8222 >> [2020-07-02 12:30:15.424321] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-11; inode layout - 920400036 - 1227153471 - 3997647794; disk layout - 920400036 - 1227153471 - 4159036738 >> [2020-07-02 12:30:15.424380] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424456] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 1840730208 - 2147483643 - 3997647794; disk layout - 1840730208 - 2147483643 - 4159036738 >> [2020-07-02 12:30:15.424484] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424525] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 1533976772 - 1840730207 - 3997647794; disk layout - 1533976772 - 1840730207 - 4159036738 >> [2020-07-02 12:30:15.424542] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:15.424596] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-10; inode layout - 613646600 - 920400035 - 3997647794; disk layout - 613646600 - 920400035 - 4159036738 >> [2020-07-02 12:30:15.424607] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /raw_data/NSE/20200630, gfid = 1a1c92db-503a-4126-911c-06d3a8ad9ea1 >> [2020-07-02 12:30:16.004482] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_DATA.dat on data-client-7 (hashed subvol is data-client-17) >> [2020-07-02 12:30:16.005523] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: 
lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_DATA.dat >> [2020-07-02 12:30:16.531047] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_1_METADATA.dat on data-client-9 (hashed subvol is data-client-19) >> [2020-07-02 12:30:16.532086] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_1_METADATA.dat >> [2020-07-02 12:30:18.733229] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_DATA.dat on data-client-17 (hashed subvol is data-client-9) >> [2020-07-02 12:30:18.734421] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_DATA.dat >> [2020-07-02 12:30:19.171930] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_CDS_2_METADATA.dat on data-client-9 (hashed subvol is data-client-18) >> [2020-07-02 12:30:19.172901] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_CDS_2_METADATA.dat >> [2020-07-02 12:30:21.028495] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_DATA.dat on data-client-6 (hashed subvol is data-client-15) >> [2020-07-02 12:30:21.029836] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_DATA.dat >> [2020-07-02 12:30:21.127648] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_2_METADATA.dat on data-client-11 (hashed subvol is data-client-3) >> [2020-07-02 12:30:21.128713] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_2_METADATA.dat >> [2020-07-02 12:30:21.201126] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_DATA.dat on data-client-15 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.201928] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_DATA.dat >> [2020-07-02 12:30:21.566158] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_3_METADATA.dat on data-client-7 (hashed subvol is data-client-16) >> [2020-07-02 12:30:21.567123] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_3_METADATA.dat >> [2020-07-02 12:30:21.649357] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_DATA.dat on data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 
12:30:21.661381] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_DATA.dat >> [2020-07-02 12:30:21.748937] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_4_METADATA.dat on data-client-15 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.749481] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_4_METADATA.dat >> [2020-07-02 12:30:21.898593] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_DATA.dat on data-client-14 (hashed subvol is data-client-7) >> [2020-07-02 12:30:21.899442] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_DATA.dat >> [2020-07-02 12:30:22.039337] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/BSE_EQ_6_METADATA.dat on data-client-10 (hashed subvol is data-client-2) >> [2020-07-02 12:30:22.040086] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/BSE_EQ_6_METADATA.dat >> [2020-07-02 12:30:22.501877] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_DATA.dat on data-client-15 (hashed subvol is data-client-8) >> [2020-07-02 12:30:22.502712] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_DATA.dat >> [2020-07-02 12:30:22.782577] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECDS1_METADATA.dat on data-client-11 (hashed subvol is data-client-6) >> [2020-07-02 12:30:22.783777] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECDS1_METADATA.dat >> [2020-07-02 12:30:23.146847] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_DATA.dat on data-client-17 (hashed subvol is data-client-9) >> [2020-07-02 12:30:23.148009] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_DATA.dat >> [2020-07-02 12:30:23.229290] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM1_METADATA.dat on data-client-14 (hashed subvol is data-client-6) >> [2020-07-02 12:30:23.230151] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM1_METADATA.dat >> [2020-07-02 12:30:23.889520] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile 
/processed_data/20200630/MCASTNSECM2_DATA.dat on data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 12:30:23.896618] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_DATA.dat >> [2020-07-02 12:30:24.093017] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM2_METADATA.dat on data-client-6 (hashed subvol is data-client-15) >> [2020-07-02 12:30:24.094117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM2_METADATA.dat >> [2020-07-02 12:30:24.345257] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_DATA.dat on data-client-17 (hashed subvol is data-client-10) >> [2020-07-02 12:30:24.346234] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_DATA.dat >> [2020-07-02 12:30:24.425835] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSECM3_METADATA.dat on data-client-6 (hashed subvol is data-client-15) >> [2020-07-02 12:30:24.426880] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSECM3_METADATA.dat >> [2020-07-02 12:30:25.158718] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_DATA.dat on data-client-9 (hashed subvol is data-client-19) >> [2020-07-02 12:30:25.159619] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_DATA.dat >> [2020-07-02 12:30:25.531479] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO1_METADATA.dat on data-client-2 (hashed subvol is data-client-10) >> [2020-07-02 12:30:25.540569] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO1_METADATA.dat >> [2020-07-02 12:30:25.771692] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_DATA.dat on data-client-11 (hashed subvol is data-client-3) >> [2020-07-02 12:30:25.772610] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_DATA.dat >> [2020-07-02 12:30:25.866118] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO2_METADATA.dat on data-client-15 (hashed subvol is data-client-8) >> [2020-07-02 12:30:25.866917] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO2_METADATA.dat >> [2020-07-02 
12:30:26.424386] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_DATA.dat on data-client-9 (hashed subvol is data-client-18) >> [2020-07-02 12:30:26.425309] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_DATA.dat >> [2020-07-02 12:30:26.818852] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO3_METADATA.dat on data-client-10 (hashed subvol is data-client-2) >> [2020-07-02 12:30:26.819890] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO3_METADATA.dat >> [2020-07-02 12:30:27.352405] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_DATA.dat on data-client-10 (hashed subvol is data-client-2) >> [2020-07-02 12:30:27.352914] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_DATA.dat >> [2020-07-02 12:30:27.521286] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO4_METADATA.dat on data-client-8 (hashed subvol is data-client-18) >> [2020-07-02 12:30:27.522325] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO4_METADATA.dat >> [2020-07-02 12:30:28.566634] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO5_DATA.dat on data-client-2 (hashed subvol is data-client-11) >> [2020-07-02 12:30:28.579295] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO5_DATA.dat >> [2020-07-02 12:30:28.958028] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_DATA.dat on data-client-7 (hashed subvol is data-client-16) >> [2020-07-02 12:30:28.959102] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_DATA.dat >> [2020-07-02 12:30:29.012429] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/MCASTNSEFNO6_METADATA.dat on data-client-6 (hashed subvol is data-client-15) >> [2020-07-02 12:30:29.013416] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/MCASTNSEFNO6_METADATA.dat >> [2020-07-02 12:30:29.396716] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSDATA.dat on data-client-17 (hashed subvol is data-client-10) >> [2020-07-02 12:30:29.397740] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 
4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSDATA.dat >> [2020-07-02 12:30:29.556312] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat on data-client-9 (hashed subvol is data-client-18) >> [2020-07-02 12:30:29.557197] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSEFO_BSE_TSMETADATA.dat >> [2020-07-02 12:30:30.605354] I [MSGID: 109045] [dht-common.c:2701:dht_lookup_everywhere_cbk] 4-data-dht: attempting deletion of stale linkfile /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat on data-client-9 (hashed subvol is data-client-19) >> [2020-07-02 12:30:30.606117] I [MSGID: 109069] [dht-common.c:1946:dht_lookup_unlink_cbk] 4-data-dht: lookup_unlink returned with op_ret -> 0 and op-errno -> 0 for /processed_data/20200630/NSETOBSEPUBLISHER_METADATA.dat >> [2020-07-02 12:30:31.559206] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 613576736 - 920330171 - 1; disk layout - 613576736 - 920330171 - 4159036738 >> [2020-07-02 12:30:31.559255] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >> [2020-07-02 12:30:31.569025] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 920330172 - 1227083607 - 1; disk layout - 920330172 - 1227083607 - 4159036738 >> [2020-07-02 12:30:31.569067] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes, gfid = 21f02cb8-f5d4-4a11-a5ce-a557f5e42e99 >> [2020-07-02 12:30:31.701849] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3374637116 - 3681390551 - 1; disk layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:31.701895] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 >> [2020-07-02 12:30:31.738464] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3681390552 - 3988143987 - 1; disk layout - 3681390552 - 3988143987 - 4159036738 >> [2020-07-02 12:30:31.738507] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX, gfid = fff324f2-f855-4881-b77c-81e856522373 >> [2020-07-02 12:30:31.857102] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-15; inode layout - 3067883680 - 3374637115 - 3995747641; disk layout - 3067883680 - 3374637115 - 4159036738 >> [2020-07-02 12:30:31.857147] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.857180] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-16; inode layout - 3374637116 - 3681390551 - 3995747641; disk layout - 3374637116 - 3681390551 - 4159036738 >> [2020-07-02 12:30:31.857197] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for 
/processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.917705] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 0 - 306753435 - 3995747641; disk layout - 0 - 306753435 - 4159036738 >> [2020-07-02 12:30:31.917781] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:31.917855] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3988213852 - 4294967295 - 3995747641; disk layout - 3988213852 - 4294967295 - 4159036738 >> [2020-07-02 12:30:31.917874] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/BANKNIFTY, gfid = f8447150-4801-4188-add9-ea295bb88729 >> [2020-07-02 12:30:32.390945] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-18; inode layout - 3681460416 - 3988213851 - 1; disk layout - 3681460416 - 3988213851 - 4159036738 >> [2020-07-02 12:30:32.390998] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >> [2020-07-02 12:30:32.391056] I [MSGID: 109064] [dht-layout.c:771:dht_layout_dir_mismatch] 4-data-dht: subvol: data-client-19; inode layout - 3988213852 - 4294967295 - 1; disk layout - 3988213852 - 4294967295 - 4159036738 >> [2020-07-02 12:30:32.391075] I [MSGID: 109018] [dht-common.c:1686:dht_revalidate_cbk] 4-data-dht: Mismatching layouts for /processed_data/Indexes/NSEINDEX/NIFTY, gfid = b2d4deb7-c58c-4046-b6f2-7c7f44d71311 >> [2020-07-02 12:33:50.915279] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INCREMENTAL.dat.gz.IwE7T2 (2cb54500-814d-4e85-83e7-e33d9440b18d) (hash=data-client-6/cache=data-client-18) => /raw_data/Brazil/20200414/260_INCREMENTAL.dat.gz ((null)) (hash=data-client-6/cache=) >> [2020-07-02 12:34:09.799586] I [MSGID: 109066] [dht-rename.c:1922:dht_rename] 4-data-dht: renaming /raw_data/Brazil/20200414/.260_INSTRUMENTS.dat.gz.1jUL1k (99938ee6-6986-4123-9d72-ec09e2310b4f) (hash=data-client-17/cache=data-client-18) => /raw_data/Brazil/20200414/260_INSTRUMENTS.dat.gz ((null)) (hash=data-client-17/cache=) >> .... >> >> >> Please look into this at top-priority if possible. >> Let me know if anything else is required. >> >> >> -- >> Regards, >> Shreyansh Shah >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > Regards, > Shreyansh Shah -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From felix.koelzow at gmx.de Mon Jul 13 14:08:42 2020 From: felix.koelzow at gmx.de (=?UTF-8?Q?Felix_K=c3=b6lzow?=) Date: Mon, 13 Jul 2020 16:08:42 +0200 Subject: [Gluster-users] Reliable Geo-Replication In-Reply-To: References: <75c1490e-2a69-7bc2-9ddd-a26fa5a225f5@gmx.de> <93e7700c-c4df-dd3c-ba15-f7a815ae7e6a@gmx.de> <841137d6-768f-7560-24b3-c80fc8ceffa9@gmx.de> <3b29ee38-991d-6253-f3da-504f4414c723@gmx.de> <93624ed1-4c0a-100f-344c-1cb99b30f94b@mpa-ifw.tu-darmstadt.de> Message-ID: <6fe5442b-6b6e-cce5-ffd8-d73732a1c28d@gmx.de> Dear Shwetha, > Can you elaborate this? Where are the same files appearing? Are they > getting synced I changed to geo-repl config log_level to DEBUG and observed the log-file. Already the same files are going to be synced again and again. Furthermore, some files were marked as synced candidate but nothing happens. There is no network traffic, the disk are almost idle and just nothing happens. No progress can be observed, even after 3 weeks of this state. Currently, it changes from from faulty to initializing to fauly and so forth. No worker is currently active or passive. Regards, Felix On 09/07/2020 13:29, Shwetha Acharya wrote: > Hi?Felix, > > Find my reply inline. > > Regards, > Shwetha > > On Thu, Jun 25, 2020 at 12:25 PM Felix K?lzow > wrote: > > Dear Gluster-users, > > I deleted a further the geo-replication session with [reset-sync-time] > option. Afterwards, > I recreated the session, and as expected, the session starts in the > hybrid crawl. > I can see some sync jobs are running in the gsyncd.log file and > after a > couple of hours, > there are no such entries anymore. > > I switched into the log_level DEBUG mode to see what's going on: > gluster volume masterVOlume geoRepHost:slaveVol config log_level DEBUG > > It seems to me that the xsync mode is in loop since the same files > appear over and over again in the log-file. > > Can you elaborate this? Where are the same files appearing? Are they > getting synced > > Now we have two volume in this "loop"-state and the third volume also > still has a broken geo-replication. > > > Is the worker status changing from initializing to faulty or > initializing to active/passive? Is any worker active? > > So any help is appreciated how to fix this or which information is > required to find the root cause? > > As mentioned before, all these gathered information could be used to > improve the geo-replication trouble-shooting documentation. > > Thanks in advance. > > Regards, > Felix > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sacharya at redhat.com Mon Jul 13 14:44:02 2020 From: sacharya at redhat.com (Shwetha Acharya) Date: Mon, 13 Jul 2020 20:14:02 +0530 Subject: [Gluster-users] Reliable Geo-Replication In-Reply-To: <6fe5442b-6b6e-cce5-ffd8-d73732a1c28d@gmx.de> References: <75c1490e-2a69-7bc2-9ddd-a26fa5a225f5@gmx.de> <93e7700c-c4df-dd3c-ba15-f7a815ae7e6a@gmx.de> <841137d6-768f-7560-24b3-c80fc8ceffa9@gmx.de> <3b29ee38-991d-6253-f3da-504f4414c723@gmx.de> <93624ed1-4c0a-100f-344c-1cb99b30f94b@mpa-ifw.tu-darmstadt.de> <6fe5442b-6b6e-cce5-ffd8-d73732a1c28d@gmx.de> Message-ID: Hi Felix, If the geo-rep status is always initializing to faulty, we will not see any syncing of files. We would need full logs to analyse the issue. Regards, Shwetha On Mon, Jul 13, 2020 at 7:39 PM Felix K?lzow wrote: > Dear Shwetha, > > Can you elaborate this? Where are the same files appearing? Are they > getting synced > > I changed to geo-repl config log_level to DEBUG and observed the log-file. > > Already the same files are going to be synced again and again. Furthermore, > > some files were marked as synced candidate but nothing happens. > > There is no network traffic, the disk are almost idle and just nothing > happens. > > No progress can be observed, even after 3 weeks of this state. > > > Currently, it changes from from faulty to initializing to fauly and so > forth. No worker is currently > > active or passive. > > > Regards, > > Felix > On 09/07/2020 13:29, Shwetha Acharya wrote: > > Hi Felix, > > Find my reply inline. > > Regards, > Shwetha > > On Thu, Jun 25, 2020 at 12:25 PM Felix K?lzow > wrote: > >> Dear Gluster-users, >> >> I deleted a further the geo-replication session with [reset-sync-time] >> option. Afterwards, >> I recreated the session, and as expected, the session starts in the >> hybrid crawl. >> I can see some sync jobs are running in the gsyncd.log file and after a >> couple of hours, >> there are no such entries anymore. >> >> I switched into the log_level DEBUG mode to see what's going on: >> gluster volume masterVOlume geoRepHost:slaveVol config log_level DEBUG >> >> It seems to me that the xsync mode is in loop since the same files >> appear over and over again in the log-file. > > Can you elaborate this? Where are the same files appearing? Are they > getting synced > > Now we have two volume in this "loop"-state and the third volume also >> still has a broken geo-replication. >> > > Is the worker status changing from initializing to faulty or initializing > to active/passive? Is any worker active? > >> So any help is appreciated how to fix this or which information is >> required to find the root cause? >> >> As mentioned before, all these gathered information could be used to >> improve the geo-replication trouble-shooting documentation. >> >> Thanks in advance. 
>> >> Regards, >> Felix >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kompastver at gmail.com Thu Jul 16 08:19:45 2020 From: kompastver at gmail.com (Pavel Znamensky) Date: Thu, 16 Jul 2020 11:19:45 +0300 Subject: [Gluster-users] One of cluster work super slow (v6.8) In-Reply-To: <280988A1-D5DA-4960-A3A8-7A7A8103F1A1@yahoo.com> References: <280988A1-D5DA-4960-A3A8-7A7A8103F1A1@yahoo.com> Message-ID: Sorry for the delay. Somehow Gmail decided to put almost all email from this list to spam. Anyway, yes, I checked the processes. Gluster processes are in 'R' state, the others in 'S' state. You can find 'top -H' output in the first message. We're running glusterfs 6.8 on CentOS 7.8. Linux kernel 4.19. Thanks. ??, 23 ???. 2020 ?. ? 21:49, Strahil Nikolov : > What is the OS and it's version ? > I have seen similar behaviour (different workload) on RHEL 7.6 (and > below). > > Have you checked what processes are in 'R' or 'D' state on st2a ? > > Best Regards, > Strahil Nikolov > > ?? 23 ??? 2020 ?. 19:31:12 GMT+03:00, Pavel Znamensky < > kompastver at gmail.com> ??????: > >Hi all, > >There's something strange with one of our clusters and glusterfs > >version > >6.8: it's quite slow and one node is overloaded. > >This is distributed cluster with four servers with the same > >specs/OS/versions: > > > >Volume Name: st2 > >Type: Distributed-Replicate > >Volume ID: 4755753b-37c4-403b-b1c8-93099bfc4c45 > >Status: Started > >Snapshot Count: 0 > >Number of Bricks: 2 x 2 = 4 > >Transport-type: tcp > >Bricks: > >Brick1: st2a:/vol3/st2 > >Brick2: st2b:/vol3/st2 > >Brick3: st2c:/vol3/st2 > >Brick4: st2d:/vol3/st2 > >Options Reconfigured: > >cluster.rebal-throttle: aggressive > >nfs.disable: on > >performance.readdir-ahead: off > >transport.address-family: inet6 > >performance.quick-read: off > >performance.cache-size: 1GB > >performance.io-cache: on > >performance.io-thread-count: 16 > >cluster.data-self-heal-algorithm: full > >network.ping-timeout: 20 > >server.event-threads: 2 > >client.event-threads: 2 > >cluster.readdir-optimize: on > >performance.read-ahead: off > >performance.parallel-readdir: on > >cluster.self-heal-daemon: enable > >storage.health-check-timeout: 20 > > > >op.version for this cluster remains 50400 > > > >st2a is a replica for the st2b and st2c is a replica for st2d. > >All our 50 clients mount this volume using FUSE and in contrast with > >other > >our cluster this one works terrible slow. > >Interesting thing here is that there are very low HDDs and network > >utilization from one hand and quite overloaded server from another > >hand. 
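
(For a load pattern like this - busy glfs_iotwr threads on one brick server while its disks and network stay mostly idle - the per-brick FOP profile is often more telling than top. A rough sketch, using the st2 volume from this thread:

  gluster volume profile st2 start
  # let it run for a few minutes under normal load
  gluster volume profile st2 info
  gluster volume profile st2 stop

The profile output lists call counts and latencies per brick, which shows whether st2a is really receiving a disproportionate share of FOPs or is just slower at serving the same load.)
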
> >Also, there are no files which should be healed according to `gluster > >volume heal st2 info`. > >Load average across servers: > >st2a: > >load average: 28,73, 26,39, 27,44 > >st2b: > >load average: 0,24, 0,46, 0,76 > >st2c: > >load average: 0,13, 0,20, 0,27 > >st2d: > >load average:2,93, 2,11, 1,50 > > > >If we stop glusterfs on st2a server the cluster will work as fast as we > >expected. > >Previously the cluster worked on a version 5.x and there were no such > >problems. > > > >Interestingly, that almost all CPU usage on st2a generates by a > >"system" > >load. > >The most CPU intensive process is glusterfsd. > >`top -H` for glusterfsd process shows this: > > > >PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ > >COMMAND > > > >13894 root 20 0 2172892 96488 9056 R 74,0 0,1 122:09.14 > >glfs_iotwr00a > >13888 root 20 0 2172892 96488 9056 R 73,7 0,1 121:38.26 > >glfs_iotwr004 > >13891 root 20 0 2172892 96488 9056 R 73,7 0,1 121:53.83 > >glfs_iotwr007 > >13920 root 20 0 2172892 96488 9056 R 73,0 0,1 122:11.27 > >glfs_iotwr00f > >13897 root 20 0 2172892 96488 9056 R 68,3 0,1 121:09.82 > >glfs_iotwr00d > >13896 root 20 0 2172892 96488 9056 R 68,0 0,1 122:03.99 > >glfs_iotwr00c > >13868 root 20 0 2172892 96488 9056 R 67,7 0,1 122:42.55 > >glfs_iotwr000 > >13889 root 20 0 2172892 96488 9056 R 67,3 0,1 122:17.02 > >glfs_iotwr005 > >13887 root 20 0 2172892 96488 9056 R 67,0 0,1 122:29.88 > >glfs_iotwr003 > >13885 root 20 0 2172892 96488 9056 R 65,0 0,1 122:04.85 > >glfs_iotwr001 > >13892 root 20 0 2172892 96488 9056 R 55,0 0,1 121:15.23 > >glfs_iotwr008 > >13890 root 20 0 2172892 96488 9056 R 54,7 0,1 121:27.88 > >glfs_iotwr006 > >13895 root 20 0 2172892 96488 9056 R 54,0 0,1 121:28.35 > >glfs_iotwr00b > >13893 root 20 0 2172892 96488 9056 R 53,0 0,1 122:23.12 > >glfs_iotwr009 > >13898 root 20 0 2172892 96488 9056 R 52,0 0,1 122:30.67 > >glfs_iotwr00e > >13886 root 20 0 2172892 96488 9056 R 41,3 0,1 121:26.97 > >glfs_iotwr002 > >13878 root 20 0 2172892 96488 9056 S 1,0 0,1 1:20.34 > >glfs_rpcrqhnd > >13840 root 20 0 2172892 96488 9056 S 0,7 0,1 0:51.54 > >glfs_epoll000 > >13841 root 20 0 2172892 96488 9056 S 0,7 0,1 0:51.14 > >glfs_epoll001 > >13877 root 20 0 2172892 96488 9056 S 0,3 0,1 1:20.02 > >glfs_rpcrqhnd > >13833 root 20 0 2172892 96488 9056 S 0,0 0,1 0:00.00 > >glusterfsd > >13834 root 20 0 2172892 96488 9056 S 0,0 0,1 0:00.14 > >glfs_timer > >13835 root 20 0 2172892 96488 9056 S 0,0 0,1 0:00.00 > >glfs_sigwait > >13836 root 20 0 2172892 96488 9056 S 0,0 0,1 0:00.16 > >glfs_memsweep > >13837 root 20 0 2172892 96488 9056 S 0,0 0,1 0:00.05 > >glfs_sproc0 > > > >Also I didn't find relevant messages in log files. > >Honestly, don't know what to do. Does someone know how to debug or fix > >this > >behaviour? > > > >Best regards, > >Pavel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nladha at redhat.com Fri Jul 17 10:59:50 2020 From: nladha at redhat.com (Nikhil Ladha) Date: Fri, 17 Jul 2020 16:29:50 +0530 Subject: [Gluster-users] Issues with replicated gluster volume Message-ID: Hi Ahemad, A few days back, you had some issue with the replicate volumes and you had a log like this ``[2020-06-16 07:19:27.418884] E [MSGID: 101097] [xlator.c:218:xlator_volopt_dynload] 0-xlator: dlsym(xlator_api) missing: /usr/lib64/glusterfs/7.5/rpc-transport/socket.so: undefined symbol: xlator_api`` present in the glusterd.log file. So, I just want to clarify which steps you followed that led to such a log, as it was not reproducible by me. 
On running a few common gluster commands like create, start, stop, status. If there is any other configuration that you made in the volume or performed any other operation, then please let me know. Regards Nikhil Ladha -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Fri Jul 17 13:49:27 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Fri, 17 Jul 2020 10:49:27 -0300 Subject: [Gluster-users] Recovery when 2 of 3 servers goes down! Message-ID: How there I have 3 servers with gluster 7 installed and setting up with replica 3 and arbiter 1. Here's the commands I used: - First create a simple volume with one server: gluster volume create VMS proxmox01:/DATA/vms - Then add the second one gluster peer probe proxmox02 gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms - And finally and the third: gluster peer probe proxmox03 gluster volume add-brick VMS replica 3 arbiter 1 proxmox03:/DATA/vms But then I decide to test the environment and bring proxmox02 and proxmox03 down and get Transport endpoint is not connected after a few seconds. Is there a way to keep one server up if 2 goes down? gluster vol info Volume Name: VMS Type: Replicate Volume ID: 64735da4-8671-4c5e-b832-d15f5c03e9f0 Status: Started Snapshot Count: 0 Number of Bricks: 1 x (2 + 1) = 3 Transport-type: tcp Bricks: Brick1: proxmox01:/DATA/vms Brick2: proxmox02:/DATA/vms Brick3: proxmox03:/DATA/vms (arbiter) Options Reconfigured: nfs.disable: on storage.fips-mode-rchecksum: on transport.address-family: inet performance.client-io-threads: off cluster.self-heal-daemon: enable cluster.quorum-reads: false cluster.quorum-count: 1 gluster vol status Status of volume: VMS Gluster process TCP Port RDMA Port Online Pid ------------------------------------------------------------------------------ Brick proxmox01:/DATA/vms 49152 0 Y 1526 Self-heal Daemon on localhost N/A N/A Y 1537 Task Status of Volume VMS ------------------------------------------------------------------------------ There are no active volume tasks Thanks a lot --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 -------------- next part -------------- An HTML attachment was scrubbed... URL: From archon810 at gmail.com Fri Jul 17 19:56:10 2020 From: archon810 at gmail.com (Artem Russakovskii) Date: Fri, 17 Jul 2020 12:56:10 -0700 Subject: [Gluster-users] Recovery when 2 of 3 servers goes down! In-Reply-To: References: Message-ID: I had the same requirements (except with 4 servers and no arbiter), and this was the solution: gluster v set VMS cluster.quorum-count 1 gluster v set VMS cluster.quorum-type fixed Sincerely, Artem -- Founder, Android Police , APK Mirror , Illogical Robot LLC beerpla.net | @ArtemR On Fri, Jul 17, 2020 at 6:50 AM Gilberto Nunes wrote: > How there > > I have 3 servers with gluster 7 installed and setting up with replica 3 > and arbiter 1. > Here's the commands I used: > - First create a simple volume with one server: > gluster volume create VMS proxmox01:/DATA/vms > - Then add the second one > gluster peer probe proxmox02 > gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms > - And finally and the third: > gluster peer probe proxmox03 > gluster volume add-brick VMS replica 3 arbiter 1 proxmox03:/DATA/vms > > But then I decide to test the environment and bring proxmox02 and > proxmox03 down and get Transport endpoint is not connected after a few > seconds. 
> Is there a way to keep one server up if 2 goes down? > gluster vol info > > Volume Name: VMS > Type: Replicate > Volume ID: 64735da4-8671-4c5e-b832-d15f5c03e9f0 > Status: Started > Snapshot Count: 0 > Number of Bricks: 1 x (2 + 1) = 3 > Transport-type: tcp > Bricks: > Brick1: proxmox01:/DATA/vms > Brick2: proxmox02:/DATA/vms > Brick3: proxmox03:/DATA/vms (arbiter) > Options Reconfigured: > nfs.disable: on > storage.fips-mode-rchecksum: on > transport.address-family: inet > performance.client-io-threads: off > cluster.self-heal-daemon: enable > cluster.quorum-reads: false > cluster.quorum-count: 1 > > gluster vol status > Status of volume: VMS > Gluster process TCP Port RDMA Port Online > Pid > ------------------------------------------------------------------------------ > > Brick proxmox01:/DATA/vms 49152 0 Y > 1526 > Self-heal Daemon on localhost N/A N/A Y > 1537 > > Task Status of Volume VMS > ------------------------------------------------------------------------------ > > There are no active volume tasks > > > Thanks a lot > > > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Fri Jul 17 21:38:23 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Fri, 17 Jul 2020 18:38:23 -0300 Subject: [Gluster-users] Recovery when 2 of 3 servers goes down! In-Reply-To: References: Message-ID: Yes Artem! That's it! I used the following commands and everything works as expected with 3 nodes: gluster volume create VMS proxmox01:/DATA/vms gluster vol start VMS gluster vol status VMS gluster peer probe proxmox02 gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms gluster vol status VMS gluster vol info VMS gluster peer probe proxmox03 gluster volume add-brick VMS replica 3 proxmox03:/DATA/vms gluster vol set VMS cluster.heal-timeout 60 gluster volume heal VMS enable gluster vol set VMS cluster.quorum-reads false gluster vol set VMS cluster.quorum-count 1 Thanks for you replay Cheers --- Gilberto Nunes Ferreira Em sex., 17 de jul. de 2020 ?s 16:56, Artem Russakovskii < archon810 at gmail.com> escreveu: > I had the same requirements (except with 4 servers and no arbiter), and > this was the solution: > > gluster v set VMS cluster.quorum-count 1 > > gluster v set VMS cluster.quorum-type fixed > > Sincerely, > Artem > > -- > Founder, Android Police , APK Mirror > , Illogical Robot LLC > beerpla.net | @ArtemR > > > On Fri, Jul 17, 2020 at 6:50 AM Gilberto Nunes > wrote: > >> How there >> >> I have 3 servers with gluster 7 installed and setting up with replica 3 >> and arbiter 1. >> Here's the commands I used: >> - First create a simple volume with one server: >> gluster volume create VMS proxmox01:/DATA/vms >> - Then add the second one >> gluster peer probe proxmox02 >> gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms >> - And finally and the third: >> gluster peer probe proxmox03 >> gluster volume add-brick VMS replica 3 arbiter 1 proxmox03:/DATA/vms >> >> But then I decide to test the environment and bring proxmox02 and >> proxmox03 down and get Transport endpoint is not connected after a few >> seconds. 
>> Is there a way to keep one server up if 2 goes down? >> gluster vol info >> >> Volume Name: VMS >> Type: Replicate >> Volume ID: 64735da4-8671-4c5e-b832-d15f5c03e9f0 >> Status: Started >> Snapshot Count: 0 >> Number of Bricks: 1 x (2 + 1) = 3 >> Transport-type: tcp >> Bricks: >> Brick1: proxmox01:/DATA/vms >> Brick2: proxmox02:/DATA/vms >> Brick3: proxmox03:/DATA/vms (arbiter) >> Options Reconfigured: >> nfs.disable: on >> storage.fips-mode-rchecksum: on >> transport.address-family: inet >> performance.client-io-threads: off >> cluster.self-heal-daemon: enable >> cluster.quorum-reads: false >> cluster.quorum-count: 1 >> >> gluster vol status >> Status of volume: VMS >> Gluster process TCP Port RDMA Port Online >> Pid >> ------------------------------------------------------------------------------ >> >> Brick proxmox01:/DATA/vms 49152 0 Y >> 1526 >> Self-heal Daemon on localhost N/A N/A Y >> 1537 >> >> Task Status of Volume VMS >> ------------------------------------------------------------------------------ >> >> There are no active volume tasks >> >> >> Thanks a lot >> >> >> --- >> Gilberto Nunes Ferreira >> >> (47) 3025-5907 >> (47) 99676-7530 - Whatsapp / Telegram >> >> Skype: gilberto.nunes36 >> >> >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From archon810 at gmail.com Fri Jul 17 23:16:39 2020 From: archon810 at gmail.com (Artem Russakovskii) Date: Fri, 17 Jul 2020 16:16:39 -0700 Subject: [Gluster-users] Recovery when 2 of 3 servers goes down! In-Reply-To: References: Message-ID: No problem. Oh I also had to set > gluster v set VMS network.ping-timeout 5 because in case a server went down and started timing out (full shutdown), the default value was so high (I think 60s) that it made all nodes straight up freeze for this long before serving the files. Sincerely, Artem -- Founder, Android Police , APK Mirror , Illogical Robot LLC beerpla.net | @ArtemR On Fri, Jul 17, 2020 at 2:39 PM Gilberto Nunes wrote: > Yes Artem! That's it! > I used the following commands and everything works as expected with 3 > nodes: > > gluster volume create VMS proxmox01:/DATA/vms > > gluster vol start VMS > gluster vol status VMS > > gluster peer probe proxmox02 > gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms > > gluster vol status VMS > gluster vol info VMS > > gluster peer probe proxmox03 > gluster volume add-brick VMS replica 3 proxmox03:/DATA/vms > > gluster vol set VMS cluster.heal-timeout 60 > gluster volume heal VMS enable > gluster vol set VMS cluster.quorum-reads false > gluster vol set VMS cluster.quorum-count 1 > > > Thanks for you replay > > Cheers > > > --- > Gilberto Nunes Ferreira > > > > Em sex., 17 de jul. 
de 2020 ?s 16:56, Artem Russakovskii < > archon810 at gmail.com> escreveu: > >> I had the same requirements (except with 4 servers and no arbiter), and >> this was the solution: >> >> gluster v set VMS cluster.quorum-count 1 >> >> gluster v set VMS cluster.quorum-type fixed >> >> Sincerely, >> Artem >> >> -- >> Founder, Android Police , APK Mirror >> , Illogical Robot LLC >> beerpla.net | @ArtemR >> >> >> On Fri, Jul 17, 2020 at 6:50 AM Gilberto Nunes < >> gilberto.nunes32 at gmail.com> wrote: >> >>> How there >>> >>> I have 3 servers with gluster 7 installed and setting up with replica 3 >>> and arbiter 1. >>> Here's the commands I used: >>> - First create a simple volume with one server: >>> gluster volume create VMS proxmox01:/DATA/vms >>> - Then add the second one >>> gluster peer probe proxmox02 >>> gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms >>> - And finally and the third: >>> gluster peer probe proxmox03 >>> gluster volume add-brick VMS replica 3 arbiter 1 proxmox03:/DATA/vms >>> >>> But then I decide to test the environment and bring proxmox02 and >>> proxmox03 down and get Transport endpoint is not connected after a few >>> seconds. >>> Is there a way to keep one server up if 2 goes down? >>> gluster vol info >>> >>> Volume Name: VMS >>> Type: Replicate >>> Volume ID: 64735da4-8671-4c5e-b832-d15f5c03e9f0 >>> Status: Started >>> Snapshot Count: 0 >>> Number of Bricks: 1 x (2 + 1) = 3 >>> Transport-type: tcp >>> Bricks: >>> Brick1: proxmox01:/DATA/vms >>> Brick2: proxmox02:/DATA/vms >>> Brick3: proxmox03:/DATA/vms (arbiter) >>> Options Reconfigured: >>> nfs.disable: on >>> storage.fips-mode-rchecksum: on >>> transport.address-family: inet >>> performance.client-io-threads: off >>> cluster.self-heal-daemon: enable >>> cluster.quorum-reads: false >>> cluster.quorum-count: 1 >>> >>> gluster vol status >>> Status of volume: VMS >>> Gluster process TCP Port RDMA Port Online >>> Pid >>> ------------------------------------------------------------------------------ >>> >>> Brick proxmox01:/DATA/vms 49152 0 Y >>> 1526 >>> Self-heal Daemon on localhost N/A N/A Y >>> 1537 >>> >>> Task Status of Volume VMS >>> ------------------------------------------------------------------------------ >>> >>> There are no active volume tasks >>> >>> >>> Thanks a lot >>> >>> >>> --- >>> Gilberto Nunes Ferreira >>> >>> (47) 3025-5907 >>> (47) 99676-7530 - Whatsapp / Telegram >>> >>> Skype: gilberto.nunes36 >>> >>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Sat Jul 18 02:42:12 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Fri, 17 Jul 2020 23:42:12 -0300 Subject: [Gluster-users] Recovery when 2 of 3 servers goes down! In-Reply-To: References: Message-ID: yeah... 600 secs in matter of fact! Don't know why this value is so high. --- Gilberto Nunes Ferreira Em sex., 17 de jul. de 2020 ?s 20:17, Artem Russakovskii < archon810 at gmail.com> escreveu: > No problem. 
> > Oh I also had to set > >> gluster v set VMS network.ping-timeout 5 > > because in case a server went down and started timing out (full shutdown), > the default value was so high (I think 60s) that it made all nodes straight > up freeze for this long before serving the files. > > Sincerely, > Artem > > -- > Founder, Android Police , APK Mirror > , Illogical Robot LLC > beerpla.net | @ArtemR > > > On Fri, Jul 17, 2020 at 2:39 PM Gilberto Nunes > wrote: > >> Yes Artem! That's it! >> I used the following commands and everything works as expected with 3 >> nodes: >> >> gluster volume create VMS proxmox01:/DATA/vms >> >> gluster vol start VMS >> gluster vol status VMS >> >> gluster peer probe proxmox02 >> gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms >> >> gluster vol status VMS >> gluster vol info VMS >> >> gluster peer probe proxmox03 >> gluster volume add-brick VMS replica 3 proxmox03:/DATA/vms >> >> gluster vol set VMS cluster.heal-timeout 60 >> gluster volume heal VMS enable >> gluster vol set VMS cluster.quorum-reads false >> gluster vol set VMS cluster.quorum-count 1 >> >> >> Thanks for you replay >> >> Cheers >> >> >> --- >> Gilberto Nunes Ferreira >> >> >> >> Em sex., 17 de jul. de 2020 ?s 16:56, Artem Russakovskii < >> archon810 at gmail.com> escreveu: >> >>> I had the same requirements (except with 4 servers and no arbiter), and >>> this was the solution: >>> >>> gluster v set VMS cluster.quorum-count 1 >>> >>> gluster v set VMS cluster.quorum-type fixed >>> >>> Sincerely, >>> Artem >>> >>> -- >>> Founder, Android Police , APK Mirror >>> , Illogical Robot LLC >>> beerpla.net | @ArtemR >>> >>> >>> On Fri, Jul 17, 2020 at 6:50 AM Gilberto Nunes < >>> gilberto.nunes32 at gmail.com> wrote: >>> >>>> How there >>>> >>>> I have 3 servers with gluster 7 installed and setting up with replica 3 >>>> and arbiter 1. >>>> Here's the commands I used: >>>> - First create a simple volume with one server: >>>> gluster volume create VMS proxmox01:/DATA/vms >>>> - Then add the second one >>>> gluster peer probe proxmox02 >>>> gluster volume add-brick VMS replica 2 proxmox02:/DATA/vms >>>> - And finally and the third: >>>> gluster peer probe proxmox03 >>>> gluster volume add-brick VMS replica 3 arbiter 1 proxmox03:/DATA/vms >>>> >>>> But then I decide to test the environment and bring proxmox02 and >>>> proxmox03 down and get Transport endpoint is not connected after a few >>>> seconds. >>>> Is there a way to keep one server up if 2 goes down? 
>>>> gluster vol info >>>> >>>> Volume Name: VMS >>>> Type: Replicate >>>> Volume ID: 64735da4-8671-4c5e-b832-d15f5c03e9f0 >>>> Status: Started >>>> Snapshot Count: 0 >>>> Number of Bricks: 1 x (2 + 1) = 3 >>>> Transport-type: tcp >>>> Bricks: >>>> Brick1: proxmox01:/DATA/vms >>>> Brick2: proxmox02:/DATA/vms >>>> Brick3: proxmox03:/DATA/vms (arbiter) >>>> Options Reconfigured: >>>> nfs.disable: on >>>> storage.fips-mode-rchecksum: on >>>> transport.address-family: inet >>>> performance.client-io-threads: off >>>> cluster.self-heal-daemon: enable >>>> cluster.quorum-reads: false >>>> cluster.quorum-count: 1 >>>> >>>> gluster vol status >>>> Status of volume: VMS >>>> Gluster process TCP Port RDMA Port Online >>>> Pid >>>> ------------------------------------------------------------------------------ >>>> >>>> Brick proxmox01:/DATA/vms 49152 0 Y >>>> 1526 >>>> Self-heal Daemon on localhost N/A N/A Y >>>> 1537 >>>> >>>> Task Status of Volume VMS >>>> ------------------------------------------------------------------------------ >>>> >>>> There are no active volume tasks >>>> >>>> >>>> Thanks a lot >>>> >>>> >>>> --- >>>> Gilberto Nunes Ferreira >>>> >>>> (47) 3025-5907 >>>> (47) 99676-7530 - Whatsapp / Telegram >>>> >>>> Skype: gilberto.nunes36 >>>> >>>> >>>> >>>> ________ >>>> >>>> >>>> >>>> Community Meeting Calendar: >>>> >>>> Schedule - >>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>> Bridge: https://bluejeans.com/441850968 >>>> >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>> -------------- next part -------------- An HTML attachment was scrubbed... URL: From s.danzi at hawai.it Tue Jul 21 12:45:14 2020 From: s.danzi at hawai.it (Stefano Danzi) Date: Tue, 21 Jul 2020 14:45:14 +0200 Subject: [Gluster-users] Replica scenario in low speed link Message-ID: <32558859-9d58-294a-a987-c619bb83dfb6@hawai.it> Hi! I have a strange request. I don't know if some gluster settings could help me. There are two buildings linked using a wifi bridge. The main building host the data centre.? In the other building there are an office that need to use a file server with high speed, so the wifi link is not optimal. My fantasy suggest me to have a server in this office running samba and gluster. Gluster have to read/write immediately from the local storage and leisurely replicate on remote storage (maybe a replica 3, or replica 2 + arbiter with locally one replica and the arbiter). In case of local storage/machine fault the system have to read/write using the remote brick over "slow" link and restart to use local storage when it return to be available and synced. Samba could be running only on this office or have two instances, in HA cluster, one in this office and other in the main building. Anyone have any suggestions or have they had a similar need? Thanks, bye. From qw at g.clemson.edu Tue Jul 21 18:30:03 2020 From: qw at g.clemson.edu (Qing Wang) Date: Tue, 21 Jul 2020 14:30:03 -0400 Subject: [Gluster-users] Gluster linear scale-out performance Message-ID: Hi, I am trying to test Gluster linear scale-out performance by adding more storage server/bricks, and measure the storage I/O performance. To vary the storage server number, I create several "stripe" volumes that contain 2 brick servers, 3 brick servers, 4 brick servers, and so on. 
On gluster client side, I used "dd if=/dev/zero of=/mnt/glusterfs/dns_test_data_26g bs=1M count=26000" to create 26G data (or larger size), and those data will be distributed to the corresponding gluster servers (each has gluster brick on it) and "dd" returns the final I/O throughput. The Internet is 40G infiniband, although I didn't do any specific configurations to use advanced features. What confuses me is that the storage I/O seems not to relate to the number of storage nodes, but Gluster documents said it should be linear scaling. For example, when "write-behind" is on, and when Infiniband "jumbo frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no matter I have 2 brick servers or 8 brick servers -- for 2 server case, each server can have ~400 MB/sec; for 4 server case, each server can have ~200MB/sec. That said, each server I/O does aggregate to the final storage I/O (800 MB/sec), but this is not "linear scale-out". Can somebody help me to understand why this is the case? I certainly can have some misunderstanding/misconfiguration here. Please correct me if I do, thanks! Best, Qing -------------- next part -------------- An HTML attachment was scrubbed... URL: From ykaul at redhat.com Tue Jul 21 18:37:44 2020 From: ykaul at redhat.com (Yaniv Kaul) Date: Tue, 21 Jul 2020 21:37:44 +0300 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: > Hi, > > I am trying to test Gluster linear scale-out performance by adding more > storage server/bricks, and measure the storage I/O performance. To vary the > storage server number, I create several "stripe" volumes that contain 2 > brick servers, 3 brick servers, 4 brick servers, and so on. On gluster > client side, I used "dd if=/dev/zero of=/mnt/glusterfs/dns_test_data_26g > bs=1M count=26000" to create 26G data (or larger size), and those data will > be distributed to the corresponding gluster servers (each has gluster brick > on it) and "dd" returns the final I/O throughput. The Internet is 40G > infiniband, although I didn't do any specific configurations to use > advanced features. > Your dd command is inaccurate, as it'll hit the client cache. It is also single threaded. I suggest switching to fio. Y. > What confuses me is that the storage I/O seems not to relate to the number > of storage nodes, but Gluster documents said it should be linear scaling. > For example, when "write-behind" is on, and when Infiniband "jumbo frame" > (connected mode) is on, I can get ~800 MB/sec reported by "dd", no matter I > have 2 brick servers or 8 brick servers -- for 2 server case, each server > can have ~400 MB/sec; for 4 server case, each server can have ~200MB/sec. > That said, each server I/O does aggregate to the final storage I/O (800 > MB/sec), but this is not "linear scale-out". > > Can somebody help me to understand why this is the case? I certainly can > have some misunderstanding/misconfiguration here. Please correct me if I > do, thanks! > > Best, > Qing > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... 
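
For reference, a minimal fio job along the lines suggested above; the mount point, file size and job count are placeholders to adapt, and direct=1 is what keeps the client page cache out of the measurement:

  fio --name=seqwrite --directory=/mnt/glusterfs --rw=write --bs=1M --size=8G \
      --numjobs=4 --ioengine=libaio --direct=1 --group_reporting

If dd has to be used instead, adding oflag=direct (or at least conv=fdatasync) gives a figure much closer to what actually reaches the bricks, though it remains single threaded.
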
URL: From qw at g.clemson.edu Tue Jul 21 18:43:07 2020 From: qw at g.clemson.edu (Qing Wang) Date: Tue, 21 Jul 2020 14:43:07 -0400 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: Hi Yaniv, Thanks for the quick response. I forget to mention I am testing the writing performance, not reading. In this case, would the client cache hit rate still be a big issue? I'll use fio to run my test once again, thanks for the suggestion. Thanks, Qing On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul wrote: > > > On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: > >> Hi, >> >> I am trying to test Gluster linear scale-out performance by adding more >> storage server/bricks, and measure the storage I/O performance. To vary the >> storage server number, I create several "stripe" volumes that contain 2 >> brick servers, 3 brick servers, 4 brick servers, and so on. On gluster >> client side, I used "dd if=/dev/zero of=/mnt/glusterfs/dns_test_data_26g >> bs=1M count=26000" to create 26G data (or larger size), and those data will >> be distributed to the corresponding gluster servers (each has gluster brick >> on it) and "dd" returns the final I/O throughput. The Internet is 40G >> infiniband, although I didn't do any specific configurations to use >> advanced features. >> > > Your dd command is inaccurate, as it'll hit the client cache. It is also > single threaded. I suggest switching to fio. > Y. > > >> What confuses me is that the storage I/O seems not to relate to the >> number of storage nodes, but Gluster documents said it should be linear >> scaling. For example, when "write-behind" is on, and when Infiniband "jumbo >> frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no >> matter I have 2 brick servers or 8 brick servers -- for 2 server case, each >> server can have ~400 MB/sec; for 4 server case, each server can have >> ~200MB/sec. That said, each server I/O does aggregate to the final storage >> I/O (800 MB/sec), but this is not "linear scale-out". >> >> Can somebody help me to understand why this is the case? I certainly can >> have some misunderstanding/misconfiguration here. Please correct me if I >> do, thanks! >> >> Best, >> Qing >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ykaul at redhat.com Tue Jul 21 18:53:29 2020 From: ykaul at redhat.com (Yaniv Kaul) Date: Tue, 21 Jul 2020 21:53:29 +0300 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: On Tue, 21 Jul 2020, 21:43 Qing Wang wrote: > Hi Yaniv, > > Thanks for the quick response. I forget to mention I am testing the > writing performance, not reading. In this case, would the client cache hit > rate still be a big issue? > It's not hitting the storage directly. Since it's also single threaded, it may also not saturate it. I highly recommend testing properly. Y. > I'll use fio to run my test once again, thanks for the suggestion. > > Thanks, > Qing > > On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul wrote: > >> >> >> On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: >> >>> Hi, >>> >>> I am trying to test Gluster linear scale-out performance by adding more >>> storage server/bricks, and measure the storage I/O performance. 
To vary the >>> storage server number, I create several "stripe" volumes that contain 2 >>> brick servers, 3 brick servers, 4 brick servers, and so on. On gluster >>> client side, I used "dd if=/dev/zero of=/mnt/glusterfs/dns_test_data_26g >>> bs=1M count=26000" to create 26G data (or larger size), and those data will >>> be distributed to the corresponding gluster servers (each has gluster brick >>> on it) and "dd" returns the final I/O throughput. The Internet is 40G >>> infiniband, although I didn't do any specific configurations to use >>> advanced features. >>> >> >> Your dd command is inaccurate, as it'll hit the client cache. It is also >> single threaded. I suggest switching to fio. >> Y. >> >> >>> What confuses me is that the storage I/O seems not to relate to the >>> number of storage nodes, but Gluster documents said it should be linear >>> scaling. For example, when "write-behind" is on, and when Infiniband "jumbo >>> frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no >>> matter I have 2 brick servers or 8 brick servers -- for 2 server case, each >>> server can have ~400 MB/sec; for 4 server case, each server can have >>> ~200MB/sec. That said, each server I/O does aggregate to the final storage >>> I/O (800 MB/sec), but this is not "linear scale-out". >>> >>> Can somebody help me to understand why this is the case? I certainly can >>> have some misunderstanding/misconfiguration here. Please correct me if I >>> do, thanks! >>> >>> Best, >>> Qing >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.danti at assyoma.it Tue Jul 21 20:22:27 2020 From: g.danti at assyoma.it (Gionatan Danti) Date: Tue, 21 Jul 2020 22:22:27 +0200 Subject: [Gluster-users] Replica scenario in low speed link In-Reply-To: <32558859-9d58-294a-a987-c619bb83dfb6@hawai.it> References: <32558859-9d58-294a-a987-c619bb83dfb6@hawai.it> Message-ID: <331b41002bf5e9d9e6631be7259042c2@assyoma.it> Il 2020-07-21 14:45 Stefano Danzi ha scritto: > Hi! > > I have a strange request. I don't know if some gluster settings could > help me. > > There are two buildings linked using a wifi bridge. > The main building host the data centre.? In the other building there > are an office > that need to use a file server with high speed, so the wifi link is not > optimal. > > My fantasy suggest me to have a server in this office running samba and > gluster. > Gluster have to read/write immediately from the local storage and > leisurely replicate on remote storage (maybe a replica 3, > or replica 2 + arbiter with locally one replica and the arbiter). > In case of local storage/machine fault the system have to read/write > using the remote brick over "slow" link > and restart to use local storage when it return to be available and > synced. > > Samba could be running only on this office or have two instances, in > HA cluster, one in this office and > other in the main building. > > Anyone have any suggestions or have they had a similar need? > > Thanks, bye. Hi, from what I remember, Gluster (and especially the AFR module) is not well suited for high latency, low bandwidth scenario: the risk of split brain and link congestion is simply too high. 
Full disclaimer: I tried a similar setup in the Gluster 3.3.x days, and it did not work well. But this was many years ago, maybe other can prove me wrong or give some other advice. Regards. -- Danti Gionatan Supporto Tecnico Assyoma S.r.l. - www.assyoma.it email: g.danti at assyoma.it - info at assyoma.it GPG public key ID: FF5F32A8 From qw at g.clemson.edu Tue Jul 21 23:29:43 2020 From: qw at g.clemson.edu (Qing Wang) Date: Tue, 21 Jul 2020 19:29:43 -0400 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: fio gives me the correct linear scale-out results, and you're right, the storage cache is the root cause that makes the dd measurement results not accurate at all. Thanks, Qing On Tue, Jul 21, 2020 at 2:53 PM Yaniv Kaul wrote: > > > On Tue, 21 Jul 2020, 21:43 Qing Wang wrote: > >> Hi Yaniv, >> >> Thanks for the quick response. I forget to mention I am testing the >> writing performance, not reading. In this case, would the client cache hit >> rate still be a big issue? >> > > It's not hitting the storage directly. Since it's also single threaded, it > may also not saturate it. I highly recommend testing properly. > Y. > > >> I'll use fio to run my test once again, thanks for the suggestion. >> >> Thanks, >> Qing >> >> On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul wrote: >> >>> >>> >>> On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: >>> >>>> Hi, >>>> >>>> I am trying to test Gluster linear scale-out performance by adding more >>>> storage server/bricks, and measure the storage I/O performance. To vary the >>>> storage server number, I create several "stripe" volumes that contain 2 >>>> brick servers, 3 brick servers, 4 brick servers, and so on. On gluster >>>> client side, I used "dd if=/dev/zero of=/mnt/glusterfs/dns_test_data_26g >>>> bs=1M count=26000" to create 26G data (or larger size), and those data will >>>> be distributed to the corresponding gluster servers (each has gluster brick >>>> on it) and "dd" returns the final I/O throughput. The Internet is 40G >>>> infiniband, although I didn't do any specific configurations to use >>>> advanced features. >>>> >>> >>> Your dd command is inaccurate, as it'll hit the client cache. It is also >>> single threaded. I suggest switching to fio. >>> Y. >>> >>> >>>> What confuses me is that the storage I/O seems not to relate to the >>>> number of storage nodes, but Gluster documents said it should be linear >>>> scaling. For example, when "write-behind" is on, and when Infiniband "jumbo >>>> frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no >>>> matter I have 2 brick servers or 8 brick servers -- for 2 server case, each >>>> server can have ~400 MB/sec; for 4 server case, each server can have >>>> ~200MB/sec. That said, each server I/O does aggregate to the final storage >>>> I/O (800 MB/sec), but this is not "linear scale-out". >>>> >>>> Can somebody help me to understand why this is the case? I certainly >>>> can have some misunderstanding/misconfiguration here. Please correct me if >>>> I do, thanks! >>>> >>>> Best, >>>> Qing >>>> ________ >>>> >>>> >>>> >>>> Community Meeting Calendar: >>>> >>>> Schedule - >>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>> Bridge: https://bluejeans.com/441850968 >>>> >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>> -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From rkothiya at redhat.com Wed Jul 22 16:27:01 2020 From: rkothiya at redhat.com (Rinku Kothiya) Date: Wed, 22 Jul 2020 21:57:01 +0530 Subject: [Gluster-users] [Gluster-devel] Announcing Gluster release 7.7 Message-ID: Hi, The Gluster community is pleased to announce the release of Gluster7.7 (packages available at [1]). Release notes for the release can be found at [2]. Major changes, features and limitations addressed in this release: None Please Note: Some of the packages are unavailable and we are working on it. We will release them soon. Thanks, Gluster community References: [1] Packages for 7.7: https://download.gluster.org/pub/gluster/glusterfs/7/7.7/ [2] Release notes for 7.7: https://docs.gluster.org/en/latest/release-notes/7.7/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From qw at g.clemson.edu Thu Jul 23 07:08:05 2020 From: qw at g.clemson.edu (Qing Wang) Date: Thu, 23 Jul 2020 03:08:05 -0400 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: Hi, I have one more question about the Gluster linear scale-out performance regarding the "write-behind off" case specifically -- when "write-behind" is off, and still the stripe volumes and other settings as early thread posted, the storage I/O seems not to relate to the number of storage nodes. In my experiment, no matter I have 2 brick server nodes or 8 brick server nodes, the aggregated gluster I/O performance is ~100MB/sec. And fio benchmark measurement gives the same result. If "write behind" is on, then the storage performance is linear scale-out along with the # of brick server nodes increasing. No matter the write behind option is on/off, I thought the gluster I/O performance should be pulled and aggregated together as a whole. If that is the case, why do I get a consistent gluster performance (~100MB/sec) when "write behind" is off? Please advise me if I misunderstood something. Thanks, Qing On Tue, Jul 21, 2020 at 7:29 PM Qing Wang wrote: > fio gives me the correct linear scale-out results, and you're right, the > storage cache is the root cause that makes the dd measurement results not > accurate at all. > > Thanks, > Qing > > > On Tue, Jul 21, 2020 at 2:53 PM Yaniv Kaul wrote: > >> >> >> On Tue, 21 Jul 2020, 21:43 Qing Wang wrote: >> >>> Hi Yaniv, >>> >>> Thanks for the quick response. I forget to mention I am testing the >>> writing performance, not reading. In this case, would the client cache hit >>> rate still be a big issue? >>> >> >> It's not hitting the storage directly. Since it's also single threaded, >> it may also not saturate it. I highly recommend testing properly. >> Y. >> >> >>> I'll use fio to run my test once again, thanks for the suggestion. >>> >>> Thanks, >>> Qing >>> >>> On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul wrote: >>> >>>> >>>> >>>> On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: >>>> >>>>> Hi, >>>>> >>>>> I am trying to test Gluster linear scale-out performance by adding >>>>> more storage server/bricks, and measure the storage I/O performance. To >>>>> vary the storage server number, I create several "stripe" volumes that >>>>> contain 2 brick servers, 3 brick servers, 4 brick servers, and so on. On >>>>> gluster client side, I used "dd if=/dev/zero >>>>> of=/mnt/glusterfs/dns_test_data_26g bs=1M count=26000" to create 26G data >>>>> (or larger size), and those data will be distributed to the corresponding >>>>> gluster servers (each has gluster brick on it) and "dd" returns the final >>>>> I/O throughput. 
The Internet is 40G infiniband, although I didn't do any >>>>> specific configurations to use advanced features. >>>>> >>>> >>>> Your dd command is inaccurate, as it'll hit the client cache. It is >>>> also single threaded. I suggest switching to fio. >>>> Y. >>>> >>>> >>>>> What confuses me is that the storage I/O seems not to relate to the >>>>> number of storage nodes, but Gluster documents said it should be linear >>>>> scaling. For example, when "write-behind" is on, and when Infiniband "jumbo >>>>> frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no >>>>> matter I have 2 brick servers or 8 brick servers -- for 2 server case, each >>>>> server can have ~400 MB/sec; for 4 server case, each server can have >>>>> ~200MB/sec. That said, each server I/O does aggregate to the final storage >>>>> I/O (800 MB/sec), but this is not "linear scale-out". >>>>> >>>>> Can somebody help me to understand why this is the case? I certainly >>>>> can have some misunderstanding/misconfiguration here. Please correct me if >>>>> I do, thanks! >>>>> >>>>> Best, >>>>> Qing >>>>> ________ >>>>> >>>>> >>>>> >>>>> Community Meeting Calendar: >>>>> >>>>> Schedule - >>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>> Bridge: https://bluejeans.com/441850968 >>>>> >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>> -------------- next part -------------- An HTML attachment was scrubbed... URL: From nico at furyweb.fr Fri Jul 24 06:49:53 2020 From: nico at furyweb.fr (nico at furyweb.fr) Date: Fri, 24 Jul 2020 08:49:53 +0200 (CEST) Subject: [Gluster-users] fuse client 7.6 crashing regularly Message-ID: <1966362365.24562.1595573393679.JavaMail.zimbra@furyweb.fr> We're using gluster in a production environement, 3 nodes (2 data + 1 arbiter). One of our VM gluster fuse client is regularly crashing on a particular volume, we recently upgraded all nodes and client to 7.6 but client is still crashing. All cluster nodes & client are Debian stretch (9.12), gluster was installed from our local gluster apt repository mirror and op-version is set to 70200. Volume contains a lot of files & directories but performance doesn't really matters, it seems to crash during this command : find logscli -mtime +1 -type f | tar c -T - -f - --remove-files | tar xpf - -C /drbd Volume was remonted this morning with DEBUG log level, waiting for next crash. 
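
A quick way to confirm what a given volume is actually running with, and to separate a per-client ceiling from a per-server one, is to read the options back and run the same fio job from two clients at once (VOLNAME is a placeholder):

  gluster volume get VOLNAME performance.write-behind
  gluster volume get VOLNAME performance.write-behind-window-size

If two independent clients each still reach about 100 MB/s against the same volume, that points to a per-client limit (FUSE overhead and per-request round trips) rather than to the bricks themselves.
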
Volume attributes are Option Value ------ ----- cluster.lookup-unhashed on cluster.lookup-optimize on cluster.min-free-disk 10% cluster.min-free-inodes 5% cluster.rebalance-stats off cluster.subvols-per-directory (null) cluster.readdir-optimize off cluster.rsync-hash-regex (null) cluster.extra-hash-regex (null) cluster.dht-xattr-name trusted.glusterfs.dht cluster.randomize-hash-range-by-gfid off cluster.rebal-throttle normal cluster.lock-migration off cluster.force-migration off cluster.local-volume-name (null) cluster.weighted-rebalance on cluster.switch-pattern (null) cluster.entry-change-log on cluster.read-subvolume (null) cluster.read-subvolume-index -1 cluster.read-hash-mode 1 cluster.background-self-heal-count 8 cluster.metadata-self-heal off cluster.data-self-heal off cluster.entry-self-heal off cluster.self-heal-daemon enable cluster.heal-timeout 60 cluster.self-heal-window-size 1 cluster.data-change-log on cluster.metadata-change-log on cluster.data-self-heal-algorithm full cluster.eager-lock on disperse.eager-lock on disperse.other-eager-lock on disperse.eager-lock-timeout 1 disperse.other-eager-lock-timeout 1 cluster.quorum-type fixed cluster.quorum-count 1 cluster.choose-local true cluster.self-heal-readdir-size 1KB cluster.post-op-delay-secs 1 cluster.ensure-durability on cluster.consistent-metadata no cluster.heal-wait-queue-length 128 cluster.favorite-child-policy none cluster.full-lock yes cluster.optimistic-change-log on diagnostics.latency-measurement off diagnostics.dump-fd-stats off diagnostics.count-fop-hits off diagnostics.brick-log-level INFO diagnostics.client-log-level ERROR diagnostics.brick-sys-log-level CRITICAL diagnostics.client-sys-log-level CRITICAL diagnostics.brick-logger (null) diagnostics.client-logger (null) diagnostics.brick-log-format (null) diagnostics.client-log-format (null) diagnostics.brick-log-buf-size 5 diagnostics.client-log-buf-size 5 diagnostics.brick-log-flush-timeout 120 diagnostics.client-log-flush-timeout 120 diagnostics.stats-dump-interval 0 diagnostics.fop-sample-interval 0 diagnostics.stats-dump-format json diagnostics.fop-sample-buf-size 65535 diagnostics.stats-dnscache-ttl-sec 86400 performance.cache-max-file-size 0 performance.cache-min-file-size 0 performance.cache-refresh-timeout 1 performance.cache-priority performance.cache-size 32MB performance.io-thread-count 16 performance.high-prio-threads 16 performance.normal-prio-threads 16 performance.low-prio-threads 16 performance.least-prio-threads 1 performance.enable-least-priority on performance.iot-watchdog-secs (null) performance.iot-cleanup-disconnected-reqsoff performance.iot-pass-through false performance.io-cache-pass-through false performance.cache-size 128MB performance.qr-cache-timeout 1 performance.cache-invalidation false performance.ctime-invalidation false performance.flush-behind on performance.nfs.flush-behind on performance.write-behind-window-size 1MB performance.resync-failed-syncs-after-fsyncoff performance.nfs.write-behind-window-size1MB performance.strict-o-direct off performance.nfs.strict-o-direct off performance.strict-write-ordering off performance.nfs.strict-write-ordering off performance.write-behind-trickling-writeson performance.aggregate-size 128KB performance.nfs.write-behind-trickling-writeson performance.lazy-open yes performance.read-after-open yes performance.open-behind-pass-through false performance.read-ahead-page-count 4 performance.read-ahead-pass-through false performance.readdir-ahead-pass-through false performance.md-cache-pass-through 
false performance.md-cache-timeout 1 performance.cache-swift-metadata true performance.cache-samba-metadata false performance.cache-capability-xattrs true performance.cache-ima-xattrs true performance.md-cache-statfs off performance.xattr-cache-list performance.nl-cache-pass-through false network.frame-timeout 1800 network.ping-timeout 5 network.tcp-window-size (null) client.ssl on network.remote-dio disable client.event-threads 2 client.tcp-user-timeout 0 client.keepalive-time 20 client.keepalive-interval 2 client.keepalive-count 9 network.tcp-window-size (null) network.inode-lru-limit 16384 auth.allow * auth.reject (null) transport.keepalive 1 server.allow-insecure on server.root-squash off server.all-squash off server.anonuid 65534 server.anongid 65534 server.statedump-path /var/run/gluster server.outstanding-rpc-limit 64 server.ssl on auth.ssl-allow * server.manage-gids off server.dynamic-auth on client.send-gids on server.gid-timeout 300 server.own-thread (null) server.event-threads 2 server.tcp-user-timeout 42 server.keepalive-time 20 server.keepalive-interval 2 server.keepalive-count 9 transport.listen-backlog 1024 ssl.cipher-list HIGH:!SSLv2 transport.address-family inet performance.write-behind on performance.read-ahead on performance.readdir-ahead on performance.io-cache on performance.open-behind on performance.quick-read on performance.nl-cache off performance.stat-prefetch on performance.client-io-threads off performance.nfs.write-behind on performance.nfs.read-ahead off performance.nfs.io-cache off performance.nfs.quick-read off performance.nfs.stat-prefetch off performance.nfs.io-threads off performance.force-readdirp true performance.cache-invalidation false performance.global-cache-invalidation true features.uss off features.snapshot-directory .snaps features.show-snapshot-directory off features.tag-namespaces off network.compression off network.compression.window-size -15 network.compression.mem-level 8 network.compression.min-size 0 network.compression.compression-level -1 network.compression.debug false features.default-soft-limit 80% features.soft-timeout 60 features.hard-timeout 5 features.alert-time 86400 features.quota-deem-statfs off geo-replication.indexing off geo-replication.indexing off geo-replication.ignore-pid-check off geo-replication.ignore-pid-check off features.quota off features.inode-quota off features.bitrot disable debug.trace off debug.log-history no debug.log-file no debug.exclude-ops (null) debug.include-ops (null) debug.error-gen off debug.error-failure (null) debug.error-number (null) debug.random-failure off debug.error-fops (null) nfs.disable on features.read-only off features.worm off features.worm-file-level off features.worm-files-deletable on features.default-retention-period 120 features.retention-mode relax features.auto-commit-period 180 storage.linux-aio off storage.batch-fsync-mode reverse-fsync storage.batch-fsync-delay-usec 0 storage.owner-uid -1 storage.owner-gid -1 storage.node-uuid-pathinfo off storage.health-check-interval 30 storage.build-pgfid off storage.gfid2path on storage.gfid2path-separator : storage.reserve 1 storage.reserve-size 0 storage.health-check-timeout 10 storage.fips-mode-rchecksum off storage.force-create-mode 0000 storage.force-directory-mode 0000 storage.create-mask 0777 storage.create-directory-mask 0777 storage.max-hardlinks 100 features.ctime off config.gfproxyd off cluster.server-quorum-type off cluster.server-quorum-ratio 51 changelog.changelog off changelog.changelog-dir {{ brick.path 
}}/.glusterfs/changelogs changelog.encoding ascii changelog.rollover-time 15 changelog.fsync-interval 5 changelog.changelog-barrier-timeout 120 changelog.capture-del-path off features.barrier disable features.barrier-timeout 120 features.trash off features.trash-dir .trashcan features.trash-eliminate-path (null) features.trash-max-filesize 5MB features.trash-internal-op off cluster.enable-shared-storage disable locks.trace off locks.mandatory-locking off cluster.disperse-self-heal-daemon enable cluster.quorum-reads false client.bind-insecure (null) features.timeout 45 features.failover-hosts (null) features.shard off features.shard-block-size 64MB features.shard-lru-limit 16384 features.shard-deletion-rate 100 features.scrub-throttle lazy features.scrub-freq biweekly features.scrub false features.expiry-time 120 features.cache-invalidation off features.cache-invalidation-timeout 60 features.leases off features.lease-lock-recall-timeout 60 disperse.background-heals 8 disperse.heal-wait-qlength 128 cluster.heal-timeout 60 dht.force-readdirp on disperse.read-policy gfid-hash cluster.shd-max-threads 1 cluster.shd-wait-qlength 1024 cluster.locking-scheme full cluster.granular-entry-heal no features.locks-revocation-secs 0 features.locks-revocation-clear-all false features.locks-revocation-max-blocked 0 features.locks-monkey-unlocking false features.locks-notify-contention no features.locks-notify-contention-delay 5 disperse.shd-max-threads 1 disperse.shd-wait-qlength 1024 disperse.cpu-extensions auto disperse.self-heal-window-size 1 cluster.use-compound-fops off performance.parallel-readdir off performance.rda-request-size 131072 performance.rda-low-wmark 4096 performance.rda-high-wmark 128KB performance.rda-cache-limit 10MB performance.nl-cache-positive-entry false performance.nl-cache-limit 10MB performance.nl-cache-timeout 60 cluster.brick-multiplex disable glusterd.vol_count_per_thread 100 cluster.max-bricks-per-process 250 disperse.optimistic-change-log on disperse.stripe-cache 4 cluster.halo-enabled False cluster.halo-shd-max-latency 99999 cluster.halo-nfsd-max-latency 5 cluster.halo-max-latency 5 cluster.halo-max-replicas 99999 cluster.halo-min-replicas 2 features.selinux on cluster.daemon-log-level INFO debug.delay-gen off delay-gen.delay-percentage 10% delay-gen.delay-duration 100000 delay-gen.enable disperse.parallel-writes on features.sdfs off features.cloudsync off features.ctime off ctime.noatime on features.cloudsync-storetype (null) features.enforce-mandatory-lock off config.global-threading off config.client-threads 16 config.brick-threads 16 features.cloudsync-remote-read off features.cloudsync-store-id (null) features.cloudsync-product-id (null) Crash log found in /var/log/glusterfs/partage-logscli.log 2020-07-23 02:34:36 configuration details: argp 1 backtrace 1 dlfcn 1 libpthread 1 llistxattr 1 setfsid 1 spinlock 1 epoll.h 1 xattr.h 1 st_atim.tv_nsec 1 package-string: glusterfs 7.6 /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x25e50)[0x7fbe02138e50] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(gf_print_trace+0x2f7)[0x7fbe021434b7] /lib/x86_64-linux-gnu/libc.so.6(+0x33060)[0x7fbe00b87060] /lib/x86_64-linux-gnu/libpthread.so.0(pthread_mutex_lock+0x0)[0x7fbe01396b40] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x32f5)[0x7fbdfa89b2f5] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3c52)[0x7fbdfa89bc52] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3dac)[0x7fbdfa89bdac] 
/usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3f73)[0x7fbdfa89bf73] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/md-cache.so(+0x4495)[0x7fbdfa46c495] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/debug/io-stats.so(+0x5f44)[0x7fbdfa23af44] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x1154f)[0x7fbdff7dc54f] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7775)[0x7fbdff7d2775] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74c8)[0x7fbdff7d24c8] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77be)[0x7fbdff7d27be] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x6ac3)[0x7fbdff7d1ac3] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7188)[0x7fbdff7d2188] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74e8)[0x7fbdff7d24e8] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x779e)[0x7fbdff7d279e] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77e0)[0x7fbdff7d27e0] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x83f9)[0x7fbdff7d33f9] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x21d3c)[0x7fbdff7ecd3c] /lib/x86_64-linux-gnu/libpthread.so.0(+0x74a4)[0x7fbe013944a4] /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)[0x7fbe00c3cd0f] -------------- next part -------------- An HTML attachment was scrubbed... URL: From jahernan at redhat.com Fri Jul 24 07:51:22 2020 From: jahernan at redhat.com (Xavi Hernandez) Date: Fri, 24 Jul 2020 09:51:22 +0200 Subject: [Gluster-users] fuse client 7.6 crashing regularly In-Reply-To: <1966362365.24562.1595573393679.JavaMail.zimbra@furyweb.fr> References: <1966362365.24562.1595573393679.JavaMail.zimbra@furyweb.fr> Message-ID: Hi, it seems to be crashing inside open-behind. There's a known bug in that xlator that caused a crash. It has been fixed in 7.7, recently released. Can you try to upgrade ? Xavi On Fri, Jul 24, 2020 at 8:50 AM wrote: > We're using gluster in a production environement, 3 nodes (2 data + 1 > arbiter). > One of our VM gluster fuse client is regularly crashing on a particular > volume, we recently upgraded all nodes and client to 7.6 but client is > still crashing. > > All cluster nodes & client are Debian stretch (9.12), gluster was > installed from our local gluster apt repository mirror and op-version is > set to 70200. > > Volume contains a lot of files & directories but performance doesn't > really matters, it seems to crash during this command : > find logscli -mtime +1 -type f | tar c -T - -f - --remove-files | tar xpf > - -C /drbd > > Volume was remonted this morning with DEBUG log level, waiting for next > crash. 
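(If moving to 7.7 has to wait, one possible stopgap, an assumption on my side rather than something verified in this thread, is to keep the client off the open-behind code path by disabling that xlator on the affected volume:

gluster volume set <VOLNAME> performance.open-behind off

The trade-off is losing whatever benefit open-behind gives this workload; the crash itself is only properly addressed by the fix shipped in 7.7.)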
> > Volume attributes are > Option Value > > ------ ----- > > cluster.lookup-unhashed on > > cluster.lookup-optimize on > > cluster.min-free-disk 10% > > cluster.min-free-inodes 5% > > cluster.rebalance-stats off > > cluster.subvols-per-directory (null) > > cluster.readdir-optimize off > > cluster.rsync-hash-regex (null) > > cluster.extra-hash-regex (null) > > cluster.dht-xattr-name trusted.glusterfs.dht > > cluster.randomize-hash-range-by-gfid off > > cluster.rebal-throttle normal > > cluster.lock-migration off > > cluster.force-migration off > > cluster.local-volume-name (null) > > cluster.weighted-rebalance on > > cluster.switch-pattern (null) > > cluster.entry-change-log on > > cluster.read-subvolume (null) > > cluster.read-subvolume-index -1 > > cluster.read-hash-mode 1 > > cluster.background-self-heal-count 8 > > cluster.metadata-self-heal off > > cluster.data-self-heal off > > cluster.entry-self-heal off > > cluster.self-heal-daemon enable > > cluster.heal-timeout 60 > > cluster.self-heal-window-size 1 > > cluster.data-change-log on > > cluster.metadata-change-log on > > cluster.data-self-heal-algorithm full > > cluster.eager-lock on > > disperse.eager-lock on > > disperse.other-eager-lock on > > disperse.eager-lock-timeout 1 > > disperse.other-eager-lock-timeout 1 > > cluster.quorum-type fixed > > cluster.quorum-count 1 > > cluster.choose-local true > > cluster.self-heal-readdir-size 1KB > > cluster.post-op-delay-secs 1 > > cluster.ensure-durability on > > cluster.consistent-metadata no > > cluster.heal-wait-queue-length 128 > > cluster.favorite-child-policy none > > cluster.full-lock yes > > cluster.optimistic-change-log on > > diagnostics.latency-measurement off > > diagnostics.dump-fd-stats off > > diagnostics.count-fop-hits off > > diagnostics.brick-log-level INFO > > diagnostics.client-log-level ERROR > > diagnostics.brick-sys-log-level CRITICAL > > diagnostics.client-sys-log-level CRITICAL > > diagnostics.brick-logger (null) > > diagnostics.client-logger (null) > > diagnostics.brick-log-format (null) > > diagnostics.client-log-format (null) > > diagnostics.brick-log-buf-size 5 > > diagnostics.client-log-buf-size 5 > > diagnostics.brick-log-flush-timeout 120 > > diagnostics.client-log-flush-timeout 120 > > diagnostics.stats-dump-interval 0 > > diagnostics.fop-sample-interval 0 > > diagnostics.stats-dump-format json > > diagnostics.fop-sample-buf-size 65535 > > diagnostics.stats-dnscache-ttl-sec 86400 > > performance.cache-max-file-size 0 > > performance.cache-min-file-size 0 > > performance.cache-refresh-timeout 1 > > performance.cache-priority > > performance.cache-size 32MB > > performance.io-thread-count 16 > > performance.high-prio-threads 16 > > performance.normal-prio-threads 16 > > performance.low-prio-threads 16 > > performance.least-prio-threads 1 > > performance.enable-least-priority on > > performance.iot-watchdog-secs (null) > > performance.iot-cleanup-disconnected-reqsoff > > performance.iot-pass-through false > > performance.io-cache-pass-through false > > performance.cache-size 128MB > > performance.qr-cache-timeout 1 > > performance.cache-invalidation false > > performance.ctime-invalidation false > > performance.flush-behind on > > performance.nfs.flush-behind on > > performance.write-behind-window-size 1MB > > performance.resync-failed-syncs-after-fsyncoff > > performance.nfs.write-behind-window-size1MB > > performance.strict-o-direct off > > performance.nfs.strict-o-direct off > > performance.strict-write-ordering off > > 
performance.nfs.strict-write-ordering off > > performance.write-behind-trickling-writeson > > performance.aggregate-size 128KB > > performance.nfs.write-behind-trickling-writeson > > performance.lazy-open yes > > performance.read-after-open yes > > performance.open-behind-pass-through false > > performance.read-ahead-page-count 4 > > performance.read-ahead-pass-through false > > performance.readdir-ahead-pass-through false > > performance.md-cache-pass-through false > > performance.md-cache-timeout 1 > > performance.cache-swift-metadata true > > performance.cache-samba-metadata false > > performance.cache-capability-xattrs true > > performance.cache-ima-xattrs true > > performance.md-cache-statfs off > > performance.xattr-cache-list > > performance.nl-cache-pass-through false > > network.frame-timeout 1800 > > network.ping-timeout 5 > > network.tcp-window-size (null) > > client.ssl on > > network.remote-dio disable > > client.event-threads 2 > > client.tcp-user-timeout 0 > > client.keepalive-time 20 > > client.keepalive-interval 2 > > client.keepalive-count 9 > > network.tcp-window-size (null) > > network.inode-lru-limit 16384 > > auth.allow * > > auth.reject (null) > > transport.keepalive 1 > > server.allow-insecure on > > server.root-squash off > > server.all-squash off > > server.anonuid 65534 > > server.anongid 65534 > > server.statedump-path /var/run/gluster > > server.outstanding-rpc-limit 64 > > server.ssl on > > auth.ssl-allow * > > server.manage-gids off > > server.dynamic-auth on > > client.send-gids on > > server.gid-timeout 300 > > server.own-thread (null) > > server.event-threads 2 > > server.tcp-user-timeout 42 > > server.keepalive-time 20 > > server.keepalive-interval 2 > > server.keepalive-count 9 > > transport.listen-backlog 1024 > > ssl.cipher-list HIGH:!SSLv2 > > transport.address-family inet > > performance.write-behind on > > performance.read-ahead on > > performance.readdir-ahead on > > performance.io-cache on > > performance.open-behind on > > performance.quick-read on > > performance.nl-cache off > > performance.stat-prefetch on > > performance.client-io-threads off > > performance.nfs.write-behind on > > performance.nfs.read-ahead off > > performance.nfs.io-cache off > > performance.nfs.quick-read off > > performance.nfs.stat-prefetch off > > performance.nfs.io-threads off > > performance.force-readdirp true > > performance.cache-invalidation false > > performance.global-cache-invalidation true > > features.uss off > > features.snapshot-directory .snaps > > features.show-snapshot-directory off > > features.tag-namespaces off > > network.compression off > > network.compression.window-size -15 > > network.compression.mem-level 8 > > network.compression.min-size 0 > > network.compression.compression-level -1 > > network.compression.debug false > > features.default-soft-limit 80% > > features.soft-timeout 60 > > features.hard-timeout 5 > > features.alert-time 86400 > > features.quota-deem-statfs off > > geo-replication.indexing off > > geo-replication.indexing off > > geo-replication.ignore-pid-check off > > geo-replication.ignore-pid-check off > > features.quota off > > features.inode-quota off > > features.bitrot disable > > debug.trace off > > debug.log-history no > > debug.log-file no > > debug.exclude-ops (null) > > debug.include-ops (null) > > debug.error-gen off > > debug.error-failure (null) > > debug.error-number (null) > > debug.random-failure off > > debug.error-fops (null) > > nfs.disable on > > features.read-only off > > features.worm off > > 
features.worm-file-level off > > features.worm-files-deletable on > > features.default-retention-period 120 > > features.retention-mode relax > > features.auto-commit-period 180 > > storage.linux-aio off > > storage.batch-fsync-mode reverse-fsync > > storage.batch-fsync-delay-usec 0 > > storage.owner-uid -1 > > storage.owner-gid -1 > > storage.node-uuid-pathinfo off > > storage.health-check-interval 30 > > storage.build-pgfid off > > storage.gfid2path on > > storage.gfid2path-separator : > > storage.reserve 1 > > storage.reserve-size 0 > > storage.health-check-timeout 10 > > storage.fips-mode-rchecksum off > > storage.force-create-mode 0000 > > storage.force-directory-mode 0000 > > storage.create-mask 0777 > > storage.create-directory-mask 0777 > > storage.max-hardlinks 100 > > features.ctime off > > config.gfproxyd off > > cluster.server-quorum-type off > > cluster.server-quorum-ratio 51 > > changelog.changelog off > > changelog.changelog-dir {{ brick.path > }}/.glusterfs/changelogs > changelog.encoding ascii > > changelog.rollover-time 15 > > changelog.fsync-interval 5 > > changelog.changelog-barrier-timeout 120 > > changelog.capture-del-path off > > features.barrier disable > > features.barrier-timeout 120 > > features.trash off > > features.trash-dir .trashcan > > features.trash-eliminate-path (null) > > features.trash-max-filesize 5MB > > features.trash-internal-op off > > cluster.enable-shared-storage disable > > locks.trace off > > locks.mandatory-locking off > > cluster.disperse-self-heal-daemon enable > > cluster.quorum-reads false > > client.bind-insecure (null) > > features.timeout 45 > > features.failover-hosts (null) > > features.shard off > > features.shard-block-size 64MB > > features.shard-lru-limit 16384 > > features.shard-deletion-rate 100 > > features.scrub-throttle lazy > > features.scrub-freq biweekly > > features.scrub false > > features.expiry-time 120 > > features.cache-invalidation off > > features.cache-invalidation-timeout 60 > > features.leases off > > features.lease-lock-recall-timeout 60 > > disperse.background-heals 8 > > disperse.heal-wait-qlength 128 > > cluster.heal-timeout 60 > > dht.force-readdirp on > > disperse.read-policy gfid-hash > > cluster.shd-max-threads 1 > > cluster.shd-wait-qlength 1024 > > cluster.locking-scheme full > > cluster.granular-entry-heal no > > features.locks-revocation-secs 0 > > features.locks-revocation-clear-all false > > features.locks-revocation-max-blocked 0 > > features.locks-monkey-unlocking false > > features.locks-notify-contention no > > features.locks-notify-contention-delay 5 > > disperse.shd-max-threads 1 > > disperse.shd-wait-qlength 1024 > > disperse.cpu-extensions auto > > disperse.self-heal-window-size 1 > > cluster.use-compound-fops off > > performance.parallel-readdir off > > performance.rda-request-size 131072 > > performance.rda-low-wmark 4096 > > performance.rda-high-wmark 128KB > > performance.rda-cache-limit 10MB > > performance.nl-cache-positive-entry false > > performance.nl-cache-limit 10MB > > performance.nl-cache-timeout 60 > > cluster.brick-multiplex disable > > glusterd.vol_count_per_thread 100 > > cluster.max-bricks-per-process 250 > > disperse.optimistic-change-log on > > disperse.stripe-cache 4 > > cluster.halo-enabled False > > cluster.halo-shd-max-latency 99999 > > cluster.halo-nfsd-max-latency 5 > > cluster.halo-max-latency 5 > > cluster.halo-max-replicas 99999 > > cluster.halo-min-replicas 2 > > features.selinux on > > cluster.daemon-log-level INFO > > debug.delay-gen off > > 
delay-gen.delay-percentage 10% > > delay-gen.delay-duration 100000 > > delay-gen.enable > > disperse.parallel-writes on > > features.sdfs off > > features.cloudsync off > > features.ctime off > > ctime.noatime on > > features.cloudsync-storetype (null) > > features.enforce-mandatory-lock off > > config.global-threading off > > config.client-threads 16 > > config.brick-threads 16 > > features.cloudsync-remote-read off > > features.cloudsync-store-id (null) > > features.cloudsync-product-id (null) > > > Crash log found in /var/log/glusterfs/partage-logscli.log > 2020-07-23 02:34:36 > configuration details: > argp 1 > backtrace 1 > dlfcn 1 > libpthread 1 > llistxattr 1 > setfsid 1 > spinlock 1 > epoll.h 1 > xattr.h 1 > st_atim.tv_nsec 1 > package-string: glusterfs 7.6 > /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x25e50)[0x7fbe02138e50] > > /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(gf_print_trace+0x2f7)[0x7fbe021434b7] > /lib/x86_64-linux-gnu/libc.so.6(+0x33060)[0x7fbe00b87060] > > /lib/x86_64-linux-gnu/libpthread.so.0(pthread_mutex_lock+0x0)[0x7fbe01396b40] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x32f5)[0x7fbdfa89b2f5] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3c52)[0x7fbdfa89bc52] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3dac)[0x7fbdfa89bdac] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3f73)[0x7fbdfa89bf73] > > /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/md-cache.so(+0x4495)[0x7fbdfa46c495] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/debug/io-stats.so(+0x5f44)[0x7fbdfa23af44] > > /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x1154f)[0x7fbdff7dc54f] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7775)[0x7fbdff7d2775] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74c8)[0x7fbdff7d24c8] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77be)[0x7fbdff7d27be] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x6ac3)[0x7fbdff7d1ac3] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7188)[0x7fbdff7d2188] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74e8)[0x7fbdff7d24e8] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x779e)[0x7fbdff7d279e] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77e0)[0x7fbdff7d27e0] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x83f9)[0x7fbdff7d33f9] > > /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x21d3c)[0x7fbdff7ecd3c] > /lib/x86_64-linux-gnu/libpthread.so.0(+0x74a4)[0x7fbe013944a4] > /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)[0x7fbe00c3cd0f] > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From archon810 at gmail.com Fri Jul 24 23:05:10 2020 From: archon810 at gmail.com (Artem Russakovskii) Date: Fri, 24 Jul 2020 16:05:10 -0700 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: Speaking of fio, could the gluster team please help me understand something? We've been having lots of performance issues related to gluster using attached block storage on Linode. At some point, I figured out that Linode has a cap of 500 IOPS on their block storage (with spikes to 1500 IOPS). The block storage we use is formatted xfs with 4KB bsize (block size). I then ran a bunch of fio tests on the block storage itself (not the gluster fuse mount), which performed horribly when the bs parameter was set to 4k: fio --randrepeat=1 --ioengine=libaio --direct=1 --gtod_reduce=1 --name=test --filename=test --bs=4k --iodepth=64 --size=4G --readwrite=randwrite --ramp_time=4 During these tests, fio ETA crawled to over an hour, at some point dropped to 45min and I did see 500-1500 IOPS flash by briefly, then it went back down to 0. I/O seems majorly choked for some reason, likely because gluster is using some of it. Transfer speed with such 4k block size is 2 MB/s with spikes to 6MB/s. This causes the load on the server to spike up to 100+ and brings down all our servers. Jobs: 1 (f=1): [w(1)][20.3%][r=0KiB/s,w=5908KiB/s][r=0,w=1477 IOPS][eta 43m:00s] Jobs: 1 (f=1): [w(1)][21.5%][r=0KiB/s,w=0KiB/s][r=0,w=0 IOPS][eta 44m:54s] xfs_info /mnt/citadel_block1 meta-data=/dev/sdc isize=512 agcount=103, agsize=26214400 blks = sectsz=512 attr=2, projid32bit=1 = crc=1 finobt=1, sparse=0, rmapbt=0 = reflink=0 data = bsize=4096 blocks=2684354560, imaxpct=25 = sunit=0 swidth=0 blks naming =version 2 bsize=4096 ascii-ci=0, ftype=1log =internal log bsize=4096 blocks=51200, version=2 = sectsz=512 sunit=0 blks, lazy-count=1 realtime =none extsz=4096 blocks=0, rtextents=0 When I increase the --bs param to fio from 4k to, say, 64k, transfer speed goes up significantly and is more like 50MB/s, and at 256k, it's 200MB/s. So what I'm trying to understand is: 1. How does the xfs block size (4KB) relate to the block size in fio tests? If we're limited by IOPS, and xfs block size is 4KB, how can fio produce better results with varying --bs param? 2. Would increasing the xfs data block size to something like 64-256KB help with our issue of choking IO and skyrocketing load? 3. The worst hangs and load spikes happen when we reboot one of the gluster servers, but not when it's down - when it comes back online. Even with gluster not showing anything pending heal, my guess is it's still trying to do lots of IO between the 4 nodes for some reason, but I don't understand why. I've been banging my head on the wall with this problem for months. Appreciate any feedback here. Thank you. 
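As a rough sanity check, assuming the Linode cap is applied per operation and each fio request maps to one block-device operation, throughput is roughly IOPS x request size:

500-1500 IOPS x 4 KiB = ~2-6 MB/s
500-1500 IOPS x 64 KiB = ~32-96 MB/s
500-1500 IOPS x 256 KiB = ~128-384 MB/s

which lines up with the 2-6 MB/s, ~50 MB/s and ~200 MB/s figures above. The xfs bsize of 4096 is only the filesystem's allocation unit; it does not force the size of each I/O request, which is what fio's --bs (and the application's own write size) determines.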
gluster volume info below Volume Name: SNIP_data1 Type: Replicate Volume ID: SNIP Status: Started Snapshot Count: 0 Number of Bricks: 1 x 4 = 4 Transport-type: tcp Bricks: Brick1: nexus2:/mnt/SNIP_block1/SNIP_data1 Brick2: forge:/mnt/SNIP_block1/SNIP_data1 Brick3: hive:/mnt/SNIP_block1/SNIP_data1 Brick4: citadel:/mnt/SNIP_block1/SNIP_data1 Options Reconfigured: cluster.quorum-count: 1 cluster.quorum-type: fixed network.ping-timeout: 5 network.remote-dio: enable performance.rda-cache-limit: 256MB performance.readdir-ahead: on performance.parallel-readdir: on network.inode-lru-limit: 500000 performance.md-cache-timeout: 600 performance.cache-invalidation: on performance.stat-prefetch: on features.cache-invalidation-timeout: 600 features.cache-invalidation: on cluster.readdir-optimize: on performance.io-thread-count: 32 server.event-threads: 4 client.event-threads: 4 performance.read-ahead: off cluster.lookup-optimize: on performance.cache-size: 1GB cluster.self-heal-daemon: enable transport.address-family: inet nfs.disable: on performance.client-io-threads: on cluster.granular-entry-heal: enable cluster.data-self-heal-algorithm: full Sincerely, Artem -- Founder, Android Police , APK Mirror , Illogical Robot LLC beerpla.net | @ArtemR On Thu, Jul 23, 2020 at 12:08 AM Qing Wang wrote: > Hi, > > I have one more question about the Gluster linear scale-out performance > regarding the "write-behind off" case specifically -- when "write-behind" > is off, and still the stripe volumes and other settings as early thread > posted, the storage I/O seems not to relate to the number of storage > nodes. In my experiment, no matter I have 2 brick server nodes or 8 brick > server nodes, the aggregated gluster I/O performance is ~100MB/sec. And fio > benchmark measurement gives the same result. If "write behind" is on, then > the storage performance is linear scale-out along with the # of brick > server nodes increasing. > > No matter the write behind option is on/off, I thought the gluster I/O > performance should be pulled and aggregated together as a whole. If that is > the case, why do I get a consistent gluster performance (~100MB/sec) when > "write behind" is off? Please advise me if I misunderstood something. > > Thanks, > Qing > > > > > On Tue, Jul 21, 2020 at 7:29 PM Qing Wang wrote: > >> fio gives me the correct linear scale-out results, and you're right, the >> storage cache is the root cause that makes the dd measurement results not >> accurate at all. >> >> Thanks, >> Qing >> >> >> On Tue, Jul 21, 2020 at 2:53 PM Yaniv Kaul wrote: >> >>> >>> >>> On Tue, 21 Jul 2020, 21:43 Qing Wang wrote: >>> >>>> Hi Yaniv, >>>> >>>> Thanks for the quick response. I forget to mention I am testing the >>>> writing performance, not reading. In this case, would the client cache hit >>>> rate still be a big issue? >>>> >>> >>> It's not hitting the storage directly. Since it's also single threaded, >>> it may also not saturate it. I highly recommend testing properly. >>> Y. >>> >>> >>>> I'll use fio to run my test once again, thanks for the suggestion. >>>> >>>> Thanks, >>>> Qing >>>> >>>> On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul wrote: >>>> >>>>> >>>>> >>>>> On Tue, 21 Jul 2020, 21:30 Qing Wang wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> I am trying to test Gluster linear scale-out performance by adding >>>>>> more storage server/bricks, and measure the storage I/O performance. 
To >>>>>> vary the storage server number, I create several "stripe" volumes that >>>>>> contain 2 brick servers, 3 brick servers, 4 brick servers, and so on. On >>>>>> gluster client side, I used "dd if=/dev/zero >>>>>> of=/mnt/glusterfs/dns_test_data_26g bs=1M count=26000" to create 26G data >>>>>> (or larger size), and those data will be distributed to the corresponding >>>>>> gluster servers (each has gluster brick on it) and "dd" returns the final >>>>>> I/O throughput. The Internet is 40G infiniband, although I didn't do any >>>>>> specific configurations to use advanced features. >>>>>> >>>>> >>>>> Your dd command is inaccurate, as it'll hit the client cache. It is >>>>> also single threaded. I suggest switching to fio. >>>>> Y. >>>>> >>>>> >>>>>> What confuses me is that the storage I/O seems not to relate to the >>>>>> number of storage nodes, but Gluster documents said it should be linear >>>>>> scaling. For example, when "write-behind" is on, and when Infiniband "jumbo >>>>>> frame" (connected mode) is on, I can get ~800 MB/sec reported by "dd", no >>>>>> matter I have 2 brick servers or 8 brick servers -- for 2 server case, each >>>>>> server can have ~400 MB/sec; for 4 server case, each server can have >>>>>> ~200MB/sec. That said, each server I/O does aggregate to the final storage >>>>>> I/O (800 MB/sec), but this is not "linear scale-out". >>>>>> >>>>>> Can somebody help me to understand why this is the case? I certainly >>>>>> can have some misunderstanding/misconfiguration here. Please correct me if >>>>>> I do, thanks! >>>>>> >>>>>> Best, >>>>>> Qing >>>>>> ________ >>>>>> >>>>>> >>>>>> >>>>>> Community Meeting Calendar: >>>>>> >>>>>> Schedule - >>>>>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>>>>> Bridge: https://bluejeans.com/441850968 >>>>>> >>>>>> Gluster-users mailing list >>>>>> Gluster-users at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>>> >>>>> ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From nico at furyweb.fr Sat Jul 25 04:24:14 2020 From: nico at furyweb.fr (nico at furyweb.fr) Date: Sat, 25 Jul 2020 06:24:14 +0200 (CEST) Subject: [Gluster-users] fuse client 7.6 crashing regularly In-Reply-To: References: <1966362365.24562.1595573393679.JavaMail.zimbra@furyweb.fr> Message-ID: <1151040829.26497.1595651054271.JavaMail.zimbra@furyweb.fr> Thank's Xavi. We'll try to upgrade, I'll keep you informed but it may take some time. KR, Nicolas. De: "Xavi Hernandez" ?: nico at furyweb.fr Cc: "gluster-users" Envoy?: Vendredi 24 Juillet 2020 09:51:22 Objet: Re: [Gluster-users] fuse client 7.6 crashing regularly Hi, it seems to be crashing inside open-behind. There's a known bug in that xlator that caused a crash. It has been fixed in 7.7, recently released. Can you try to upgrade ? Xavi On Fri, Jul 24, 2020 at 8:50 AM < [ mailto:nico at furyweb.fr | nico at furyweb.fr ] > wrote: We're using gluster in a production environement, 3 nodes (2 data + 1 arbiter). One of our VM gluster fuse client is regularly crashing on a particular volume, we recently upgraded all nodes and client to 7.6 but client is still crashing. 
All cluster nodes & client are Debian stretch (9.12), gluster was installed from our local gluster apt repository mirror and op-version is set to 70200. Volume contains a lot of files & directories but performance doesn't really matters, it seems to crash during this command : find logscli -mtime +1 -type f | tar c -T - -f - --remove-files | tar xpf - -C /drbd Volume was remonted this morning with DEBUG log level, waiting for next crash. Volume attributes are Option Value ------ ----- cluster.lookup-unhashed on cluster.lookup-optimize on cluster.min-free-disk 10% cluster.min-free-inodes 5% cluster.rebalance-stats off cluster.subvols-per-directory (null) cluster.readdir-optimize off cluster.rsync-hash-regex (null) cluster.extra-hash-regex (null) cluster.dht-xattr-name trusted.glusterfs.dht cluster.randomize-hash-range-by-gfid off cluster.rebal-throttle normal cluster.lock-migration off cluster.force-migration off cluster.local-volume-name (null) cluster.weighted-rebalance on cluster.switch-pattern (null) cluster.entry-change-log on cluster.read-subvolume (null) cluster.read-subvolume-index -1 cluster.read-hash-mode 1 cluster.background-self-heal-count 8 cluster.metadata-self-heal off cluster.data-self-heal off cluster.entry-self-heal off cluster.self-heal-daemon enable cluster.heal-timeout 60 cluster.self-heal-window-size 1 cluster.data-change-log on cluster.metadata-change-log on cluster.data-self-heal-algorithm full cluster.eager-lock on disperse.eager-lock on disperse.other-eager-lock on disperse.eager-lock-timeout 1 disperse.other-eager-lock-timeout 1 cluster.quorum-type fixed cluster.quorum-count 1 cluster.choose-local true cluster.self-heal-readdir-size 1KB cluster.post-op-delay-secs 1 cluster.ensure-durability on cluster.consistent-metadata no cluster.heal-wait-queue-length 128 cluster.favorite-child-policy none cluster.full-lock yes cluster.optimistic-change-log on diagnostics.latency-measurement off diagnostics.dump-fd-stats off diagnostics.count-fop-hits off diagnostics.brick-log-level INFO diagnostics.client-log-level ERROR diagnostics.brick-sys-log-level CRITICAL diagnostics.client-sys-log-level CRITICAL diagnostics.brick-logger (null) diagnostics.client-logger (null) diagnostics.brick-log-format (null) diagnostics.client-log-format (null) diagnostics.brick-log-buf-size 5 diagnostics.client-log-buf-size 5 diagnostics.brick-log-flush-timeout 120 diagnostics.client-log-flush-timeout 120 diagnostics.stats-dump-interval 0 diagnostics.fop-sample-interval 0 diagnostics.stats-dump-format json diagnostics.fop-sample-buf-size 65535 diagnostics.stats-dnscache-ttl-sec 86400 performance.cache-max-file-size 0 performance.cache-min-file-size 0 performance.cache-refresh-timeout 1 performance.cache-priority performance.cache-size 32MB performance.io-thread-count 16 performance.high-prio-threads 16 performance.normal-prio-threads 16 performance.low-prio-threads 16 performance.least-prio-threads 1 performance.enable-least-priority on performance.iot-watchdog-secs (null) performance.iot-cleanup-disconnected-reqsoff performance.iot-pass-through false performance.io-cache-pass-through false performance.cache-size 128MB performance.qr-cache-timeout 1 performance.cache-invalidation false performance.ctime-invalidation false performance.flush-behind on performance.nfs.flush-behind on performance.write-behind-window-size 1MB performance.resync-failed-syncs-after-fsyncoff performance.nfs.write-behind-window-size1MB performance.strict-o-direct off performance.nfs.strict-o-direct off 
performance.strict-write-ordering off performance.nfs.strict-write-ordering off performance.write-behind-trickling-writeson performance.aggregate-size 128KB performance.nfs.write-behind-trickling-writeson performance.lazy-open yes performance.read-after-open yes performance.open-behind-pass-through false performance.read-ahead-page-count 4 performance.read-ahead-pass-through false performance.readdir-ahead-pass-through false performance.md-cache-pass-through false performance.md-cache-timeout 1 performance.cache-swift-metadata true performance.cache-samba-metadata false performance.cache-capability-xattrs true performance.cache-ima-xattrs true performance.md-cache-statfs off performance.xattr-cache-list performance.nl-cache-pass-through false network.frame-timeout 1800 network.ping-timeout 5 network.tcp-window-size (null) client.ssl on network.remote-dio disable client.event-threads 2 client.tcp-user-timeout 0 client.keepalive-time 20 client.keepalive-interval 2 client.keepalive-count 9 network.tcp-window-size (null) network.inode-lru-limit 16384 auth.allow * auth.reject (null) transport.keepalive 1 server.allow-insecure on server.root-squash off server.all-squash off server.anonuid 65534 server.anongid 65534 server.statedump-path /var/run/gluster server.outstanding-rpc-limit 64 server.ssl on auth.ssl-allow * server.manage-gids off server.dynamic-auth on client.send-gids on server.gid-timeout 300 server.own-thread (null) server.event-threads 2 server.tcp-user-timeout 42 server.keepalive-time 20 server.keepalive-interval 2 server.keepalive-count 9 transport.listen-backlog 1024 ssl.cipher-list HIGH:!SSLv2 transport.address-family inet performance.write-behind on performance.read-ahead on performance.readdir-ahead on performance.io-cache on performance.open-behind on performance.quick-read on performance.nl-cache off performance.stat-prefetch on performance.client-io-threads off performance.nfs.write-behind on performance.nfs.read-ahead off performance.nfs.io-cache off performance.nfs.quick-read off performance.nfs.stat-prefetch off performance.nfs.io-threads off performance.force-readdirp true performance.cache-invalidation false performance.global-cache-invalidation true features.uss off features.snapshot-directory .snaps features.show-snapshot-directory off features.tag-namespaces off network.compression off network.compression.window-size -15 network.compression.mem-level 8 network.compression.min-size 0 network.compression.compression-level -1 network.compression.debug false features.default-soft-limit 80% features.soft-timeout 60 features.hard-timeout 5 features.alert-time 86400 features.quota-deem-statfs off geo-replication.indexing off geo-replication.indexing off geo-replication.ignore-pid-check off geo-replication.ignore-pid-check off features.quota off features.inode-quota off features.bitrot disable debug.trace off debug.log-history no debug.log-file no debug.exclude-ops (null) debug.include-ops (null) debug.error-gen off debug.error-failure (null) debug.error-number (null) debug.random-failure off debug.error-fops (null) nfs.disable on features.read-only off features.worm off features.worm-file-level off features.worm-files-deletable on features.default-retention-period 120 features.retention-mode relax features.auto-commit-period 180 storage.linux-aio off storage.batch-fsync-mode reverse-fsync storage.batch-fsync-delay-usec 0 storage.owner-uid -1 storage.owner-gid -1 storage.node-uuid-pathinfo off storage.health-check-interval 30 storage.build-pgfid off storage.gfid2path on 
storage.gfid2path-separator : storage.reserve 1 storage.reserve-size 0 storage.health-check-timeout 10 storage.fips-mode-rchecksum off storage.force-create-mode 0000 storage.force-directory-mode 0000 storage.create-mask 0777 storage.create-directory-mask 0777 storage.max-hardlinks 100 features.ctime off config.gfproxyd off cluster.server-quorum-type off cluster.server-quorum-ratio 51 changelog.changelog off changelog.changelog-dir {{ brick.path }}/.glusterfs/changelogs changelog.encoding ascii changelog.rollover-time 15 changelog.fsync-interval 5 changelog.changelog-barrier-timeout 120 changelog.capture-del-path off features.barrier disable features.barrier-timeout 120 features.trash off features.trash-dir .trashcan features.trash-eliminate-path (null) features.trash-max-filesize 5MB features.trash-internal-op off cluster.enable-shared-storage disable locks.trace off locks.mandatory-locking off cluster.disperse-self-heal-daemon enable cluster.quorum-reads false client.bind-insecure (null) features.timeout 45 features.failover-hosts (null) features.shard off features.shard-block-size 64MB features.shard-lru-limit 16384 features.shard-deletion-rate 100 features.scrub-throttle lazy features.scrub-freq biweekly features.scrub false features.expiry-time 120 features.cache-invalidation off features.cache-invalidation-timeout 60 features.leases off features.lease-lock-recall-timeout 60 disperse.background-heals 8 disperse.heal-wait-qlength 128 cluster.heal-timeout 60 dht.force-readdirp on disperse.read-policy gfid-hash cluster.shd-max-threads 1 cluster.shd-wait-qlength 1024 cluster.locking-scheme full cluster.granular-entry-heal no features.locks-revocation-secs 0 features.locks-revocation-clear-all false features.locks-revocation-max-blocked 0 features.locks-monkey-unlocking false features.locks-notify-contention no features.locks-notify-contention-delay 5 disperse.shd-max-threads 1 disperse.shd-wait-qlength 1024 disperse.cpu-extensions auto disperse.self-heal-window-size 1 cluster.use-compound-fops off performance.parallel-readdir off performance.rda-request-size 131072 performance.rda-low-wmark 4096 performance.rda-high-wmark 128KB performance.rda-cache-limit 10MB performance.nl-cache-positive-entry false performance.nl-cache-limit 10MB performance.nl-cache-timeout 60 cluster.brick-multiplex disable glusterd.vol_count_per_thread 100 cluster.max-bricks-per-process 250 disperse.optimistic-change-log on disperse.stripe-cache 4 cluster.halo-enabled False cluster.halo-shd-max-latency 99999 cluster.halo-nfsd-max-latency 5 cluster.halo-max-latency 5 cluster.halo-max-replicas 99999 cluster.halo-min-replicas 2 features.selinux on cluster.daemon-log-level INFO debug.delay-gen off delay-gen.delay-percentage 10% delay-gen.delay-duration 100000 delay-gen.enable disperse.parallel-writes on features.sdfs off features.cloudsync off features.ctime off ctime.noatime on features.cloudsync-storetype (null) features.enforce-mandatory-lock off config.global-threading off config.client-threads 16 config.brick-threads 16 features.cloudsync-remote-read off features.cloudsync-store-id (null) features.cloudsync-product-id (null) Crash log found in /var/log/glusterfs/partage-logscli.log 2020-07-23 02:34:36 configuration details: argp 1 backtrace 1 dlfcn 1 libpthread 1 llistxattr 1 setfsid 1 spinlock 1 epoll.h 1 xattr.h 1 st_atim.tv_nsec 1 package-string: glusterfs 7.6 /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(+0x25e50)[0x7fbe02138e50] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(gf_print_trace+0x2f7)[0x7fbe021434b7] 
/lib/x86_64-linux-gnu/libc.so.6(+0x33060)[0x7fbe00b87060] /lib/x86_64-linux-gnu/libpthread.so.0(pthread_mutex_lock+0x0)[0x7fbe01396b40] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x32f5)[0x7fbdfa89b2f5] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3c52)[0x7fbdfa89bc52] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3dac)[0x7fbdfa89bdac] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/open-behind.so(+0x3f73)[0x7fbdfa89bf73] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/performance/md-cache.so(+0x4495)[0x7fbdfa46c495] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/debug/io-stats.so(+0x5f44)[0x7fbdfa23af44] /usr/lib/x86_64-linux-gnu/libglusterfs.so.0(default_unlink+0xbc)[0x7fbe021c401c] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x1154f)[0x7fbdff7dc54f] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7775)[0x7fbdff7d2775] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74c8)[0x7fbdff7d24c8] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77be)[0x7fbdff7d27be] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x6ac3)[0x7fbdff7d1ac3] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x7188)[0x7fbdff7d2188] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x74e8)[0x7fbdff7d24e8] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x779e)[0x7fbdff7d279e] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x77e0)[0x7fbdff7d27e0] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x83f9)[0x7fbdff7d33f9] /usr/lib/x86_64-linux-gnu/glusterfs/7.6/xlator/mount/fuse.so(+0x21d3c)[0x7fbdff7ecd3c] /lib/x86_64-linux-gnu/libpthread.so.0(+0x74a4)[0x7fbe013944a4] /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)[0x7fbe00c3cd0f] ________ Community Meeting Calendar: Schedule - Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC Bridge: [ https://bluejeans.com/441850968 | https://bluejeans.com/441850968 ] Gluster-users mailing list [ mailto:Gluster-users at gluster.org | Gluster-users at gluster.org ] [ https://lists.gluster.org/mailman/listinfo/gluster-users | https://lists.gluster.org/mailman/listinfo/gluster-users ] -------------- next part -------------- An HTML attachment was scrubbed... URL: From amar at kadalu.io Mon Jul 27 10:35:51 2020 From: amar at kadalu.io (amar at kadalu.io) Date: Mon, 27 Jul 2020 10:35:51 +0000 Subject: [Gluster-users] Invitation: Gluter Community Meeting @ Tue Jul 28, 2020 2:30pm - 3:30pm (IST) (gluster-users@gluster.org) Message-ID: <00000000000001191005ab69e3fd@google.com> You have been invited to the following event. Title: Gluter Community Meeting Hi,Please join us for the Gluster community meeting. The meeting details are below.Bridge: https://bluejeans.com/441850968Agenda:  https://hackmd.io/DpzO_gRXTcSi5w_zXIB7DAPrevious Meeting notes:https://github.com/gluster/community/meetings When: Tue Jul 28, 2020 2:30pm ? 
3:30pm India Standard Time - Kolkata Where: https://bluejeans.com/441850968 Joining info: Join with Google Meet https://meet.google.com/yrj-wynz-eyz Join by phone +1 513-486-2458 (PIN: 274876546) More phone numbers: https://tel.meet/yrj-wynz-eyz?pin=2420181636757&hs=0 Calendar: gluster-users at gluster.org Who: * amar at kadalu.io - organizer * gluster-users at gluster.org * gluster-devel at gluster.org From amar at kadalu.io Mon Jul 27 10:36:09 2020 From: amar at kadalu.io (amar at kadalu.io) Date: Mon, 27 Jul 2020 10:36:09 +0000 Subject: [Gluster-users] Updated invitation: Gluter Community Meeting @ Tue Jul 28, 2020 2:30pm - 3:30pm (IST) (gluster-users@gluster.org) Message-ID: <00000000000010f76405ab69e4d4@google.com> This event has been changed. Title: Gluter Community Meeting Hi, Please join us for the Gluster community meeting. The meeting details are below. Bridge: https://bluejeans.com/441850968 Agenda: https://hackmd.io/DpzO_gRXTcSi5w_zXIB7DA Previous Meeting notes: https://github.com/gluster/community/meetings When: Tue Jul 28, 2020 2:30pm - 3:30pm India Standard Time - Kolkata Where: https://bluejeans.com/441850968 Calendar: gluster-users at gluster.org Who: * amar at kadalu.io - organizer * gluster-users at gluster.org * gluster-devel at gluster.org Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP.
Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 2031 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 2073 bytes Desc: not available URL: From gilberto.nunes32 at gmail.com Tue Jul 28 19:43:39 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Tue, 28 Jul 2020 16:43:39 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD Message-ID: Hi there.... 'till now, I am using glusterfs over XFS and so far so good. Using LVM too.... Unfortunately, there is no way with XFS to merge two or more HDD, in order to use more than one HDD, like RAID1 or RAID5. My primary goal is to use two server with GlusterFS on top of multiples HDDs for qemu images. I have think about BTRFS or mdadm. Anybody has some experience on this? Thanks a lot --- Gilberto Nunes Ferreira -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Tue Jul 28 20:16:47 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Tue, 28 Jul 2020 17:16:47 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: <331cc752-1d70-bc0f-db79-12c15b041f39@netvel.net> References: <331cc752-1d70-bc0f-db79-12c15b041f39@netvel.net> Message-ID: Good to know that... Thanks --- Gilberto Nunes Ferreira Em ter., 28 de jul. de 2020 ?s 17:08, Alvin Starr escreveu: > Having just been burnt by BTRFS I would stick with XFS and LVM/others. > > LVM will do disk replication or raid1. I do not believe that raid3,4,5,6.. > is supported. > mdadm does support all the various raid modes and I have used it quite > reliably for years. > You may want to look at the raid456 write-journal but that will require an > SSD or NVME deivce to be used effectively. > > > On 7/28/20 3:43 PM, Gilberto Nunes wrote: > > Hi there.... > > 'till now, I am using glusterfs over XFS and so far so good. > Using LVM too.... > Unfortunately, there is no way with XFS to merge two or more HDD, in order > to use more than one HDD, like RAID1 or RAID5. > My primary goal is to use two server with GlusterFS on top of multiples > HDDs for qemu images. > I have think about BTRFS or mdadm. > Anybody has some experience on this? > > Thanks a lot > > --- > Gilberto Nunes Ferreira > > > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users > > > -- > Alvin Starr || land: (647)478-6285 > Netvel Inc. || Cell: (416)806-0133alvin at netvel.net || > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Tue Jul 28 20:27:50 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Tue, 28 Jul 2020 17:27:50 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <331cc752-1d70-bc0f-db79-12c15b041f39@netvel.net> Message-ID: But still, have some doubt how LVM will handle in case of disk failure! Or even mdadm. Both need intervention when one or more disks die! What I need is something like ZFS but that uses less resources... 
Thanks any way --- Gilberto Nunes Ferreira Em ter., 28 de jul. de 2020 ?s 17:16, Gilberto Nunes < gilberto.nunes32 at gmail.com> escreveu: > Good to know that... > Thanks > --- > Gilberto Nunes Ferreira > > > > > Em ter., 28 de jul. de 2020 ?s 17:08, Alvin Starr > escreveu: > >> Having just been burnt by BTRFS I would stick with XFS and LVM/others. >> >> LVM will do disk replication or raid1. I do not believe that >> raid3,4,5,6.. is supported. >> mdadm does support all the various raid modes and I have used it quite >> reliably for years. >> You may want to look at the raid456 write-journal but that will require >> an SSD or NVME deivce to be used effectively. >> >> >> On 7/28/20 3:43 PM, Gilberto Nunes wrote: >> >> Hi there.... >> >> 'till now, I am using glusterfs over XFS and so far so good. >> Using LVM too.... >> Unfortunately, there is no way with XFS to merge two or more HDD, in >> order to use more than one HDD, like RAID1 or RAID5. >> My primary goal is to use two server with GlusterFS on top of multiples >> HDDs for qemu images. >> I have think about BTRFS or mdadm. >> Anybody has some experience on this? >> >> Thanks a lot >> >> --- >> Gilberto Nunes Ferreira >> >> >> >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> -- >> Alvin Starr || land: (647)478-6285 >> Netvel Inc. || Cell: (416)806-0133alvin at netvel.net || >> >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From budic at onholyground.com Tue Jul 28 21:10:44 2020 From: budic at onholyground.com (Darrell Budic) Date: Tue, 28 Jul 2020 16:10:44 -0500 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <331cc752-1d70-bc0f-db79-12c15b041f39@netvel.net> Message-ID: ZFS isn?t that resource intensive, although it does like RAM. But why not just add additional bricks? Gluster is kind of built to use disks as bricks, and rely on gluster replication to provide redundancy in the data. Either replication or distribute-replication can provide protection against disk failures. > On Jul 28, 2020, at 3:27 PM, Gilberto Nunes wrote: > > But still, have some doubt how LVM will handle in case of disk failure! > Or even mdadm. > Both need intervention when one or more disks die! > What I need is something like ZFS but that uses less resources... > > Thanks any way > > > > --- > Gilberto Nunes Ferreira > > > > > Em ter., 28 de jul. de 2020 ?s 17:16, Gilberto Nunes > escreveu: > Good to know that... > Thanks > --- > Gilberto Nunes Ferreira > > > > > Em ter., 28 de jul. de 2020 ?s 17:08, Alvin Starr > escreveu: > Having just been burnt by BTRFS I would stick with XFS and LVM/others. > > LVM will do disk replication or raid1. I do not believe that raid3,4,5,6.. is supported. > mdadm does support all the various raid modes and I have used it quite reliably for years. > You may want to look at the raid456 write-journal but that will require an SSD or NVME deivce to be used effectively. > > > On 7/28/20 3:43 PM, Gilberto Nunes wrote: >> Hi there.... >> >> 'till now, I am using glusterfs over XFS and so far so good. >> Using LVM too.... >> Unfortunately, there is no way with XFS to merge two or more HDD, in order to use more than one HDD, like RAID1 or RAID5. 
>> My primary goal is to use two server with GlusterFS on top of multiples HDDs for qemu images. >> I have think about BTRFS or mdadm. >> Anybody has some experience on this? >> >> Thanks a lot >> >> --- >> Gilberto Nunes Ferreira >> >> >> >> >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- > Alvin Starr || land: (647)478-6285 > Netvel Inc. || Cell: (416)806-0133 > alvin at netvel.net || > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 05:02:28 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 08:02:28 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <331cc752-1d70-bc0f-db79-12c15b041f39@netvel.net> Message-ID: <2B58B6B6-AD48-4427-9BA6-D71FA9573008@yahoo.com> Keep in mind that if you use each disk as a separate brick, use replica 3 volumes. Also, mdadm allows you to specify a hot spare, so in case of disk failure it will automatically kick in and replace the failed drive. I'm not sure if LVM has that option, but as it relies on the same mechanism mdadm is using -> it should be possible. Best Regards, Strahil Nikolov ?? 29 ??? 2020 ?. 0:10:44 GMT+03:00, Darrell Budic ??????: >ZFS isn?t that resource intensive, although it does like RAM. > >But why not just add additional bricks? Gluster is kind of built to use >disks as bricks, and rely on gluster replication to provide redundancy >in the data. Either replication or distribute-replication can provide >protection against disk failures. > >> On Jul 28, 2020, at 3:27 PM, Gilberto Nunes > wrote: >> >> But still, have some doubt how LVM will handle in case of disk >failure! >> Or even mdadm. >> Both need intervention when one or more disks die! >> What I need is something like ZFS but that uses less resources... >> >> Thanks any way >> >> >> >> --- >> Gilberto Nunes Ferreira >> >> >> >> >> Em ter., 28 de jul. de 2020 ?s 17:16, Gilberto Nunes >> >escreveu: >> Good to know that... >> Thanks >> --- >> Gilberto Nunes Ferreira >> >> >> >> >> Em ter., 28 de jul. de 2020 ?s 17:08, Alvin Starr > escreveu: >> Having just been burnt by BTRFS I would stick with XFS and >LVM/others. >> >> LVM will do disk replication or raid1. I do not believe that >raid3,4,5,6.. is supported. >> mdadm does support all the various raid modes and I have used it >quite reliably for years. >> You may want to look at the raid456 write-journal but that will >require an SSD or NVME deivce to be used effectively. >> >> >> On 7/28/20 3:43 PM, Gilberto Nunes wrote: >>> Hi there.... >>> >>> 'till now, I am using glusterfs over XFS and so far so good. >>> Using LVM too.... >>> Unfortunately, there is no way with XFS to merge two or more HDD, in >order to use more than one HDD, like RAID1 or RAID5. >>> My primary goal is to use two server with GlusterFS on top of >multiples HDDs for qemu images. >>> I have think about BTRFS or mdadm. >>> Anybody has some experience on this? 
>>> >>> Thanks a lot >>> >>> --- >>> Gilberto Nunes Ferreira >>> >>> >>> >>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 > >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users > >> >> -- >> Alvin Starr || land: (647)478-6285 >> Netvel Inc. || Cell: (416)806-0133 >> alvin at netvel.net || >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users From hunter86_bg at yahoo.com Thu Jul 30 04:58:59 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 07:58:59 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: LVM allows creating/converting striped/mirrored LVs without any dowtime and it's using the md module. Best Regards, Strahil Nikolov ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes ??????: >Hi there.... > >'till now, I am using glusterfs over XFS and so far so good. >Using LVM too.... >Unfortunately, there is no way with XFS to merge two or more HDD, in >order >to use more than one HDD, like RAID1 or RAID5. >My primary goal is to use two server with GlusterFS on top of multiples >HDDs for qemu images. >I have think about BTRFS or mdadm. >Anybody has some experience on this? > >Thanks a lot > >--- >Gilberto Nunes Ferreira From gilberto.nunes32 at gmail.com Thu Jul 30 12:39:18 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 09:39:18 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: Doing some research and the chvg command, which is responsable for create hotspare disks in LVM is available only in AIX! I have a Debian Buster box and there is no such command chvg. Correct me if I am wrong.... --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov escreveu: > LVM allows creating/converting striped/mirrored LVs without any dowtime > and it's using the md module. > > Best Regards, > Strahil Nikolov > > ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < > gilberto.nunes32 at gmail.com> ??????: > >Hi there.... > > > >'till now, I am using glusterfs over XFS and so far so good. > >Using LVM too.... > >Unfortunately, there is no way with XFS to merge two or more HDD, in > >order > >to use more than one HDD, like RAID1 or RAID5. > >My primary goal is to use two server with GlusterFS on top of multiples > >HDDs for qemu images. > >I have think about BTRFS or mdadm. > >Anybody has some experience on this? > > > >Thanks a lot > > > >--- > >Gilberto Nunes Ferreira > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 12:53:04 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 15:53:04 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> I guess there is no automatic hot-spare replacement in LVM, but mdadm has that functionality. Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 
15:39:18 GMT+03:00, Gilberto Nunes ??????: >Doing some research and the chvg command, which is responsable for >create >hotspare disks in LVM is available only in AIX! >I have a Debian Buster box and there is no such command chvg. Correct >me if >I am wrong.... >--- >Gilberto Nunes Ferreira > >(47) 3025-5907 >(47) 99676-7530 - Whatsapp / Telegram > >Skype: gilberto.nunes36 > > > > > >Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov > >escreveu: > >> LVM allows creating/converting striped/mirrored LVs without any >dowtime >> and it's using the md module. >> >> Best Regards, >> Strahil Nikolov >> >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >> gilberto.nunes32 at gmail.com> ??????: >> >Hi there.... >> > >> >'till now, I am using glusterfs over XFS and so far so good. >> >Using LVM too.... >> >Unfortunately, there is no way with XFS to merge two or more HDD, in >> >order >> >to use more than one HDD, like RAID1 or RAID5. >> >My primary goal is to use two server with GlusterFS on top of >multiples >> >HDDs for qemu images. >> >I have think about BTRFS or mdadm. >> >Anybody has some experience on this? >> > >> >Thanks a lot >> > >> >--- >> >Gilberto Nunes Ferreira >> From gilberto.nunes32 at gmail.com Thu Jul 30 12:59:57 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 09:59:57 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> References: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> Message-ID: Yes! But still with mdadm if you lose 1 disk and reboot the server, the system crashes. But with ZFS there's no crash. --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 09:55, Strahil Nikolov escreveu: > I guess there is no automatic hot-spare replacement in LVM, but > mdadm has that functionality. > > Best Regards, > Strahil Nikolov > > > ?? 30 ??? 2020 ?. 15:39:18 GMT+03:00, Gilberto Nunes < > gilberto.nunes32 at gmail.com> ??????: > >Doing some research and the chvg command, which is responsable for > >create > >hotspare disks in LVM is available only in AIX! > >I have a Debian Buster box and there is no such command chvg. Correct > >me if > >I am wrong.... > >--- > >Gilberto Nunes Ferreira > > > >(47) 3025-5907 > >(47) 99676-7530 - Whatsapp / Telegram > > > >Skype: gilberto.nunes36 > > > > > > > > > > > >Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov > > > >escreveu: > > > >> LVM allows creating/converting striped/mirrored LVs without any > >dowtime > >> and it's using the md module. > >> > >> Best Regards, > >> Strahil Nikolov > >> > >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < > >> gilberto.nunes32 at gmail.com> ??????: > >> >Hi there.... > >> > > >> >'till now, I am using glusterfs over XFS and so far so good. > >> >Using LVM too.... > >> >Unfortunately, there is no way with XFS to merge two or more HDD, in > >> >order > >> >to use more than one HDD, like RAID1 or RAID5. > >> >My primary goal is to use two server with GlusterFS on top of > >multiples > >> >HDDs for qemu images. > >> >I have think about BTRFS or mdadm. > >> >Anybody has some experience on this? > >> > > >> >Thanks a lot > >> > > >> >--- > >> >Gilberto Nunes Ferreira > >> > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From gilberto.nunes32 at gmail.com Thu Jul 30 13:08:13 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 10:08:13 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> Message-ID: I meant, if you power off the server, pull off 1 disk, and then power on we get system errors.... --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 09:59, Gilberto Nunes < gilberto.nunes32 at gmail.com> escreveu: > Yes! But still with mdadm if you lose 1 disk and reboot the server, the > system crashes. > But with ZFS there's no crash. > > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > > > Em qui., 30 de jul. de 2020 ?s 09:55, Strahil Nikolov < > hunter86_bg at yahoo.com> escreveu: > >> I guess there is no automatic hot-spare replacement in LVM, but >> mdadm has that functionality. >> >> Best Regards, >> Strahil Nikolov >> >> >> ?? 30 ??? 2020 ?. 15:39:18 GMT+03:00, Gilberto Nunes < >> gilberto.nunes32 at gmail.com> ??????: >> >Doing some research and the chvg command, which is responsable for >> >create >> >hotspare disks in LVM is available only in AIX! >> >I have a Debian Buster box and there is no such command chvg. Correct >> >me if >> >I am wrong.... >> >--- >> >Gilberto Nunes Ferreira >> > >> >(47) 3025-5907 >> >(47) 99676-7530 - Whatsapp / Telegram >> > >> >Skype: gilberto.nunes36 >> > >> > >> > >> > >> > >> >Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov >> > >> >escreveu: >> > >> >> LVM allows creating/converting striped/mirrored LVs without any >> >dowtime >> >> and it's using the md module. >> >> >> >> Best Regards, >> >> Strahil Nikolov >> >> >> >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >> >> gilberto.nunes32 at gmail.com> ??????: >> >> >Hi there.... >> >> > >> >> >'till now, I am using glusterfs over XFS and so far so good. >> >> >Using LVM too.... >> >> >Unfortunately, there is no way with XFS to merge two or more HDD, in >> >> >order >> >> >to use more than one HDD, like RAID1 or RAID5. >> >> >My primary goal is to use two server with GlusterFS on top of >> >multiples >> >> >HDDs for qemu images. >> >> >I have think about BTRFS or mdadm. >> >> >Anybody has some experience on this? >> >> > >> >> >Thanks a lot >> >> > >> >> >--- >> >> >Gilberto Nunes Ferreira >> >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alvin at netvel.net Thu Jul 30 13:11:46 2020 From: alvin at netvel.net (Alvin Starr) Date: Thu, 30 Jul 2020 09:11:46 -0400 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: LVM supports mirroring or raid1. So to create a raid1 you would do something like "lvcreate -n raidvol? -L 1T --mirrors 1? /dev/bigvg" You can add a mirror to an existing logical volume using lvconvert with something like "lvconvert --mirrors +1 bigvg/unmirroredlv" But other than simple mirroring you will need mdadm. On 7/30/20 8:39 AM, Gilberto Nunes wrote: > Doing some research and the chvg command, which is responsable?for > create hotspare disks in LVM is available only in AIX! > I have a Debian Buster box and there is no such command chvg. Correct > me if I am wrong.... > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > > > Em qui., 30 de jul. 
de 2020 ?s 02:05, Strahil Nikolov > > escreveu: > > LVM allows creating/converting striped/mirrored LVs without any > dowtime and it's using the md module. > > Best Regards, > Strahil Nikolov > > ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes > > > ??????: > >Hi there.... > > > >'till now, I am using glusterfs over XFS and so far so good. > >Using LVM too.... > >Unfortunately, there is no way with XFS to merge two or more HDD, in > >order > >to use more than one HDD, like RAID1 or RAID5. > >My primary goal is to use two server with GlusterFS on top of > multiples > >HDDs for qemu images. > >I have think about BTRFS or mdadm. > >Anybody has some experience on this? > > > >Thanks a lot > > > >--- > >Gilberto Nunes Ferreira > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -- Alvin Starr || land: (647)478-6285 Netvel Inc. || Cell: (416)806-0133 alvin at netvel.net || -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Thu Jul 30 13:23:19 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 10:23:19 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: Well, actually I am not concerned ONLY about mirroring the data, but with reliability on it. Here during my lab tests I notice if I pull off a disk and then power on the server, the LVM crashes... On the other hand, with ZFS even in degraded state the system booted normally. The same happens with mdadm. That's the point. --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr escreveu: > LVM supports mirroring or raid1. > So to create a raid1 you would do something like "lvcreate -n raidvol -L > 1T --mirrors 1 /dev/bigvg" > You can add a mirror to an existing logical volume using lvconvert with > something like "lvconvert --mirrors +1 bigvg/unmirroredlv" > > But other than simple mirroring you will need mdadm. > > > > On 7/30/20 8:39 AM, Gilberto Nunes wrote: > > Doing some research and the chvg command, which is responsable for create > hotspare disks in LVM is available only in AIX! > I have a Debian Buster box and there is no such command chvg. Correct me > if I am wrong.... > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > > > Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < > hunter86_bg at yahoo.com> escreveu: > >> LVM allows creating/converting striped/mirrored LVs without any dowtime >> and it's using the md module. >> >> Best Regards, >> Strahil Nikolov >> >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >> gilberto.nunes32 at gmail.com> ??????: >> >Hi there.... >> > >> >'till now, I am using glusterfs over XFS and so far so good. >> >Using LVM too.... >> >Unfortunately, there is no way with XFS to merge two or more HDD, in >> >order >> >to use more than one HDD, like RAID1 or RAID5. >> >My primary goal is to use two server with GlusterFS on top of multiples >> >HDDs for qemu images. >> >I have think about BTRFS or mdadm. >> >Anybody has some experience on this? 
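A sketch of the LVM variant of the same idea, along the lines of Alvin's lvcreate/lvconvert commands, using the raid1 segment type (recent lvm2 creates this type by default for --mirrors as well). VG name, LV name, sizes and device names are invented for illustration.

# one PV per physical disk, grouped into a single VG
pvcreate /dev/sdb /dev/sdc
vgcreate vg_bricks /dev/sdb /dev/sdc

# a mirrored LV spanning the two PVs
lvcreate --type raid1 -m 1 -L 500G -n lv_brick1 vg_bricks

# check the mirror state before and after pulling a disk
lvs -a -o +devices,lv_health_status vg_bricks

# after installing a replacement disk, rebuild the missing leg onto it
pvcreate /dev/sdd
vgextend vg_bricks /dev/sdd
lvconvert --repair vg_bricks/lv_brick1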
>> > >> >Thanks a lot >> > >> >--- >> >Gilberto Nunes Ferreira >> > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users > > > -- > Alvin Starr || land: (647)478-6285 > Netvel Inc. || Cell: (416)806-0133alvin at netvel.net || > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Thu Jul 30 13:25:47 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 10:25:47 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: But I will give it a try in the lvm mirroring process.... Thanks --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < gilberto.nunes32 at gmail.com> escreveu: > Well, actually I am not concerned ONLY about mirroring the data, but with > reliability on it. > Here during my lab tests I notice if I pull off a disk and then power on > the server, the LVM crashes... > On the other hand, with ZFS even in degraded state the system booted > normally. > The same happens with mdadm. That's the point. > > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > > > Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr > escreveu: > >> LVM supports mirroring or raid1. >> So to create a raid1 you would do something like "lvcreate -n raidvol -L >> 1T --mirrors 1 /dev/bigvg" >> You can add a mirror to an existing logical volume using lvconvert with >> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" >> >> But other than simple mirroring you will need mdadm. >> >> >> >> On 7/30/20 8:39 AM, Gilberto Nunes wrote: >> >> Doing some research and the chvg command, which is responsable for create >> hotspare disks in LVM is available only in AIX! >> I have a Debian Buster box and there is no such command chvg. Correct me >> if I am wrong.... >> --- >> Gilberto Nunes Ferreira >> >> (47) 3025-5907 >> (47) 99676-7530 - Whatsapp / Telegram >> >> Skype: gilberto.nunes36 >> >> >> >> >> >> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < >> hunter86_bg at yahoo.com> escreveu: >> >>> LVM allows creating/converting striped/mirrored LVs without any dowtime >>> and it's using the md module. >>> >>> Best Regards, >>> Strahil Nikolov >>> >>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >>> gilberto.nunes32 at gmail.com> ??????: >>> >Hi there.... >>> > >>> >'till now, I am using glusterfs over XFS and so far so good. >>> >Using LVM too.... >>> >Unfortunately, there is no way with XFS to merge two or more HDD, in >>> >order >>> >to use more than one HDD, like RAID1 or RAID5. >>> >My primary goal is to use two server with GlusterFS on top of multiples >>> >HDDs for qemu images. >>> >I have think about BTRFS or mdadm. >>> >Anybody has some experience on this? 
>>> > >>> >Thanks a lot >>> > >>> >--- >>> >Gilberto Nunes Ferreira >>> >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> -- >> Alvin Starr || land: (647)478-6285 >> Netvel Inc. || Cell: (416)806-0133alvin at netvel.net || >> >> >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From gilberto.nunes32 at gmail.com Thu Jul 30 14:45:07 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 11:45:07 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: Well... LVM doesn't do the job that I need... But I found this article https://www.gonscak.sk/?p=201 which makes the way.... Thanks for all the help and comments about this.... Cheers. --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 10:25, Gilberto Nunes < gilberto.nunes32 at gmail.com> escreveu: > But I will give it a try in the lvm mirroring process.... Thanks > --- > Gilberto Nunes Ferreira > > (47) 3025-5907 > (47) 99676-7530 - Whatsapp / Telegram > > Skype: gilberto.nunes36 > > > > > > Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < > gilberto.nunes32 at gmail.com> escreveu: > >> Well, actually I am not concerned ONLY about mirroring the data, but with >> reliability on it. >> Here during my lab tests I notice if I pull off a disk and then power on >> the server, the LVM crashes... >> On the other hand, with ZFS even in degraded state the system booted >> normally. >> The same happens with mdadm. That's the point. >> >> --- >> Gilberto Nunes Ferreira >> >> (47) 3025-5907 >> (47) 99676-7530 - Whatsapp / Telegram >> >> Skype: gilberto.nunes36 >> >> >> >> >> >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr >> escreveu: >> >>> LVM supports mirroring or raid1. >>> So to create a raid1 you would do something like "lvcreate -n raidvol >>> -L 1T --mirrors 1 /dev/bigvg" >>> You can add a mirror to an existing logical volume using lvconvert with >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" >>> >>> But other than simple mirroring you will need mdadm. >>> >>> >>> >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: >>> >>> Doing some research and the chvg command, which is responsable for >>> create hotspare disks in LVM is available only in AIX! >>> I have a Debian Buster box and there is no such command chvg. Correct me >>> if I am wrong.... >>> --- >>> Gilberto Nunes Ferreira >>> >>> (47) 3025-5907 >>> (47) 99676-7530 - Whatsapp / Telegram >>> >>> Skype: gilberto.nunes36 >>> >>> >>> >>> >>> >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < >>> hunter86_bg at yahoo.com> escreveu: >>> >>>> LVM allows creating/converting striped/mirrored LVs without any dowtime >>>> and it's using the md module. >>>> >>>> Best Regards, >>>> Strahil Nikolov >>>> >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >>>> gilberto.nunes32 at gmail.com> ??????: >>>> >Hi there.... 
>>>> > >>>> >'till now, I am using glusterfs over XFS and so far so good. >>>> >Using LVM too.... >>>> >Unfortunately, there is no way with XFS to merge two or more HDD, in >>>> >order >>>> >to use more than one HDD, like RAID1 or RAID5. >>>> >My primary goal is to use two server with GlusterFS on top of multiples >>>> >HDDs for qemu images. >>>> >I have think about BTRFS or mdadm. >>>> >Anybody has some experience on this? >>>> > >>>> >Thanks a lot >>>> > >>>> >--- >>>> >Gilberto Nunes Ferreira >>>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> -- >>> Alvin Starr || land: (647)478-6285 >>> Netvel Inc. || Cell: (416)806-0133alvin at netvel.net || >>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 15:12:02 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 18:12:02 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> Message-ID: If it crashes - that's a bug and worth checking. What OS do you use ? Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 15:59:57 GMT+03:00, Gilberto Nunes ??????: >Yes! But still with mdadm if you lose 1 disk and reboot the server, the >system crashes. >But with ZFS there's no crash. > >--- >Gilberto Nunes Ferreira > >(47) 3025-5907 >(47) 99676-7530 - Whatsapp / Telegram > >Skype: gilberto.nunes36 > > > > > >Em qui., 30 de jul. de 2020 ?s 09:55, Strahil Nikolov > >escreveu: > >> I guess there is no automatic hot-spare replacement in LVM, but >> mdadm has that functionality. >> >> Best Regards, >> Strahil Nikolov >> >> >> ?? 30 ??? 2020 ?. 15:39:18 GMT+03:00, Gilberto Nunes < >> gilberto.nunes32 at gmail.com> ??????: >> >Doing some research and the chvg command, which is responsable for >> >create >> >hotspare disks in LVM is available only in AIX! >> >I have a Debian Buster box and there is no such command chvg. >Correct >> >me if >> >I am wrong.... >> >--- >> >Gilberto Nunes Ferreira >> > >> >(47) 3025-5907 >> >(47) 99676-7530 - Whatsapp / Telegram >> > >> >Skype: gilberto.nunes36 >> > >> > >> > >> > >> > >> >Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov >> > >> >escreveu: >> > >> >> LVM allows creating/converting striped/mirrored LVs without any >> >dowtime >> >> and it's using the md module. >> >> >> >> Best Regards, >> >> Strahil Nikolov >> >> >> >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >> >> gilberto.nunes32 at gmail.com> ??????: >> >> >Hi there.... >> >> > >> >> >'till now, I am using glusterfs over XFS and so far so good. >> >> >Using LVM too.... >> >> >Unfortunately, there is no way with XFS to merge two or more HDD, >in >> >> >order >> >> >to use more than one HDD, like RAID1 or RAID5. >> >> >My primary goal is to use two server with GlusterFS on top of >> >multiples >> >> >HDDs for qemu images. >> >> >I have think about BTRFS or mdadm. >> >> >Anybody has some experience on this? 
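One thing worth ruling out when a node refuses to come up with a mirror leg missing (this is an assumption about the failure mode, not something confirmed in this thread): whether LVM is declining to activate the degraded LV at boot, and whether the brick's fstab entry is then blocking the rest of the boot. A rough checklist, with names carried over from the sketch above:

# recent lvm2 defaults to degraded activation; older configs may say "complete"
grep -n activation_mode /etc/lvm/lvm.conf

# from the emergency shell, activating by hand confirms whether this is the cause
lvchange -ay --activationmode degraded vg_bricks/lv_brick1

# let the boot continue even if the brick mount fails (illustrative fstab line):
# /dev/vg_bricks/lv_brick1  /bricks/brick1  xfs  defaults,nofail  0 0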
>> >> > >> >> >Thanks a lot >> >> > >> >> >--- >> >> >Gilberto Nunes Ferreira >> >> >> From gilberto.nunes32 at gmail.com Thu Jul 30 15:14:46 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 12:14:46 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> Message-ID: Debian Buster (Proxmox VE which is Debian Buster with Ubuntu kernel) --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 12:12, Strahil Nikolov escreveu: > If it crashes - that's a bug and worth checking. What OS do you use ? > > Best Regards, > Strahil Nikolov > > ?? 30 ??? 2020 ?. 15:59:57 GMT+03:00, Gilberto Nunes < > gilberto.nunes32 at gmail.com> ??????: > >Yes! But still with mdadm if you lose 1 disk and reboot the server, the > >system crashes. > >But with ZFS there's no crash. > > > >--- > >Gilberto Nunes Ferreira > > > >(47) 3025-5907 > >(47) 99676-7530 - Whatsapp / Telegram > > > >Skype: gilberto.nunes36 > > > > > > > > > > > >Em qui., 30 de jul. de 2020 ?s 09:55, Strahil Nikolov > > > >escreveu: > > > >> I guess there is no automatic hot-spare replacement in LVM, but > >> mdadm has that functionality. > >> > >> Best Regards, > >> Strahil Nikolov > >> > >> > >> ?? 30 ??? 2020 ?. 15:39:18 GMT+03:00, Gilberto Nunes < > >> gilberto.nunes32 at gmail.com> ??????: > >> >Doing some research and the chvg command, which is responsable for > >> >create > >> >hotspare disks in LVM is available only in AIX! > >> >I have a Debian Buster box and there is no such command chvg. > >Correct > >> >me if > >> >I am wrong.... > >> >--- > >> >Gilberto Nunes Ferreira > >> > > >> >(47) 3025-5907 > >> >(47) 99676-7530 - Whatsapp / Telegram > >> > > >> >Skype: gilberto.nunes36 > >> > > >> > > >> > > >> > > >> > > >> >Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov > >> > > >> >escreveu: > >> > > >> >> LVM allows creating/converting striped/mirrored LVs without any > >> >dowtime > >> >> and it's using the md module. > >> >> > >> >> Best Regards, > >> >> Strahil Nikolov > >> >> > >> >> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < > >> >> gilberto.nunes32 at gmail.com> ??????: > >> >> >Hi there.... > >> >> > > >> >> >'till now, I am using glusterfs over XFS and so far so good. > >> >> >Using LVM too.... > >> >> >Unfortunately, there is no way with XFS to merge two or more HDD, > >in > >> >> >order > >> >> >to use more than one HDD, like RAID1 or RAID5. > >> >> >My primary goal is to use two server with GlusterFS on top of > >> >multiples > >> >> >HDDs for qemu images. > >> >> >I have think about BTRFS or mdadm. > >> >> >Anybody has some experience on this? > >> >> > > >> >> >Thanks a lot > >> >> > > >> >> >--- > >> >> >Gilberto Nunes Ferreira > >> >> > >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 15:13:55 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 18:13:55 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: The crash with failed mdadm disk is not normal. We need to check it out. Are you using Legacy or UEFI ? Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 16:25:47 GMT+03:00, Gilberto Nunes ??????: >But I will give it a try in the lvm mirroring process.... 
Thanks >--- >Gilberto Nunes Ferreira > >(47) 3025-5907 >(47) 99676-7530 - Whatsapp / Telegram > >Skype: gilberto.nunes36 > > > > > >Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < >gilberto.nunes32 at gmail.com> escreveu: > >> Well, actually I am not concerned ONLY about mirroring the data, but >with >> reliability on it. >> Here during my lab tests I notice if I pull off a disk and then power >on >> the server, the LVM crashes... >> On the other hand, with ZFS even in degraded state the system booted >> normally. >> The same happens with mdadm. That's the point. >> >> --- >> Gilberto Nunes Ferreira >> >> (47) 3025-5907 >> (47) 99676-7530 - Whatsapp / Telegram >> >> Skype: gilberto.nunes36 >> >> >> >> >> >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr >> escreveu: >> >>> LVM supports mirroring or raid1. >>> So to create a raid1 you would do something like "lvcreate -n >raidvol -L >>> 1T --mirrors 1 /dev/bigvg" >>> You can add a mirror to an existing logical volume using lvconvert >with >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" >>> >>> But other than simple mirroring you will need mdadm. >>> >>> >>> >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: >>> >>> Doing some research and the chvg command, which is responsable for >create >>> hotspare disks in LVM is available only in AIX! >>> I have a Debian Buster box and there is no such command chvg. >Correct me >>> if I am wrong.... >>> --- >>> Gilberto Nunes Ferreira >>> >>> (47) 3025-5907 >>> (47) 99676-7530 - Whatsapp / Telegram >>> >>> Skype: gilberto.nunes36 >>> >>> >>> >>> >>> >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < >>> hunter86_bg at yahoo.com> escreveu: >>> >>>> LVM allows creating/converting striped/mirrored LVs without any >dowtime >>>> and it's using the md module. >>>> >>>> Best Regards, >>>> Strahil Nikolov >>>> >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >>>> gilberto.nunes32 at gmail.com> ??????: >>>> >Hi there.... >>>> > >>>> >'till now, I am using glusterfs over XFS and so far so good. >>>> >Using LVM too.... >>>> >Unfortunately, there is no way with XFS to merge two or more HDD, >in >>>> >order >>>> >to use more than one HDD, like RAID1 or RAID5. >>>> >My primary goal is to use two server with GlusterFS on top of >multiples >>>> >HDDs for qemu images. >>>> >I have think about BTRFS or mdadm. >>>> >Anybody has some experience on this? >>>> > >>>> >Thanks a lot >>>> > >>>> >--- >>>> >Gilberto Nunes Ferreira >>>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing >listGluster-users at gluster.orghttps://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> -- >>> Alvin Starr || land: (647)478-6285 >>> Netvel Inc. || Cell: >(416)806-0133alvin at netvel.net || >>> >>> >>> ________ >>> >>> >>> >>> Community Meeting Calendar: >>> >>> Schedule - >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> Bridge: https://bluejeans.com/441850968 >>> >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >> From gilberto.nunes32 at gmail.com Thu Jul 30 15:17:49 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 12:17:49 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: I am using Legacy... But I am using Virtualbox in my labs... Perhaps this is the problem... 
I don't do that in real hardware. But with spare disk (2 + 1) in mdadm it's fine. --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 12:16, Strahil Nikolov escreveu: > The crash with failed mdadm disk is not normal. We need to check it out. > > Are you using Legacy or UEFI ? > > Best Regards, > Strahil Nikolov > > ?? 30 ??? 2020 ?. 16:25:47 GMT+03:00, Gilberto Nunes < > gilberto.nunes32 at gmail.com> ??????: > >But I will give it a try in the lvm mirroring process.... Thanks > >--- > >Gilberto Nunes Ferreira > > > >(47) 3025-5907 > >(47) 99676-7530 - Whatsapp / Telegram > > > >Skype: gilberto.nunes36 > > > > > > > > > > > >Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < > >gilberto.nunes32 at gmail.com> escreveu: > > > >> Well, actually I am not concerned ONLY about mirroring the data, but > >with > >> reliability on it. > >> Here during my lab tests I notice if I pull off a disk and then power > >on > >> the server, the LVM crashes... > >> On the other hand, with ZFS even in degraded state the system booted > >> normally. > >> The same happens with mdadm. That's the point. > >> > >> --- > >> Gilberto Nunes Ferreira > >> > >> (47) 3025-5907 > >> (47) 99676-7530 - Whatsapp / Telegram > >> > >> Skype: gilberto.nunes36 > >> > >> > >> > >> > >> > >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr > >> escreveu: > >> > >>> LVM supports mirroring or raid1. > >>> So to create a raid1 you would do something like "lvcreate -n > >raidvol -L > >>> 1T --mirrors 1 /dev/bigvg" > >>> You can add a mirror to an existing logical volume using lvconvert > >with > >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" > >>> > >>> But other than simple mirroring you will need mdadm. > >>> > >>> > >>> > >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: > >>> > >>> Doing some research and the chvg command, which is responsable for > >create > >>> hotspare disks in LVM is available only in AIX! > >>> I have a Debian Buster box and there is no such command chvg. > >Correct me > >>> if I am wrong.... > >>> --- > >>> Gilberto Nunes Ferreira > >>> > >>> (47) 3025-5907 > >>> (47) 99676-7530 - Whatsapp / Telegram > >>> > >>> Skype: gilberto.nunes36 > >>> > >>> > >>> > >>> > >>> > >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < > >>> hunter86_bg at yahoo.com> escreveu: > >>> > >>>> LVM allows creating/converting striped/mirrored LVs without any > >dowtime > >>>> and it's using the md module. > >>>> > >>>> Best Regards, > >>>> Strahil Nikolov > >>>> > >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < > >>>> gilberto.nunes32 at gmail.com> ??????: > >>>> >Hi there.... > >>>> > > >>>> >'till now, I am using glusterfs over XFS and so far so good. > >>>> >Using LVM too.... > >>>> >Unfortunately, there is no way with XFS to merge two or more HDD, > >in > >>>> >order > >>>> >to use more than one HDD, like RAID1 or RAID5. > >>>> >My primary goal is to use two server with GlusterFS on top of > >multiples > >>>> >HDDs for qemu images. > >>>> >I have think about BTRFS or mdadm. > >>>> >Anybody has some experience on this? 
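If it helps with the VirtualBox lab, the "power off, pull a disk, power on" test can be checked from the booted system like this (assuming the array from the earlier sketch; a degraded RAID1 should still assemble and mount):

cat /proc/mdstat                                  # [U_] instead of [UU] means one leg is gone
mdadm --detail /dev/md0 | grep -E 'State|Failed|Spare'

# once the disk (or its replacement) is back, return it to the array
mdadm --manage /dev/md0 --add /dev/sdb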
> >>>> > > >>>> >Thanks a lot > >>>> > > >>>> >--- > >>>> >Gilberto Nunes Ferreira > >>>> > >>> > >>> ________ > >>> > >>> > >>> > >>> Community Meeting Calendar: > >>> > >>> Schedule - > >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > >>> Bridge: https://bluejeans.com/441850968 > >>> > >>> Gluster-users mailing > >listGluster-users at gluster.orghttps:// > lists.gluster.org/mailman/listinfo/gluster-users > >>> > >>> > >>> -- > >>> Alvin Starr || land: (647)478-6285 > >>> Netvel Inc. || Cell: > >(416)806-0133alvin at netvel.net || > >>> > >>> > >>> ________ > >>> > >>> > >>> > >>> Community Meeting Calendar: > >>> > >>> Schedule - > >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > >>> Bridge: https://bluejeans.com/441850968 > >>> > >>> Gluster-users mailing list > >>> Gluster-users at gluster.org > >>> https://lists.gluster.org/mailman/listinfo/gluster-users > >>> > >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 16:32:32 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 19:32:32 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: Message-ID: <61359FD5-58A6-48D2-BC9F-4682808A8964@yahoo.com> When using legacy, you need to prepare the MBR on each disk, so the BIOS will be able to boot from it and load grub. On UEFI, you will need 2 entries each pointing to the other disk. There is a thread for each approach in the CentOS7 forums and the peocedure is almost the same on all linux . Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 18:17:49 GMT+03:00, Gilberto Nunes ??????: >I am using Legacy... But I am using Virtualbox in my labs... Perhaps >this >is the problem... >I don't do that in real hardware. But with spare disk (2 + 1) in mdadm >it's >fine. > >--- >Gilberto Nunes Ferreira > >(47) 3025-5907 >(47) 99676-7530 - Whatsapp / Telegram > >Skype: gilberto.nunes36 > > > > > >Em qui., 30 de jul. de 2020 ?s 12:16, Strahil Nikolov > >escreveu: > >> The crash with failed mdadm disk is not normal. We need to check it >out. >> >> Are you using Legacy or UEFI ? >> >> Best Regards, >> Strahil Nikolov >> >> ?? 30 ??? 2020 ?. 16:25:47 GMT+03:00, Gilberto Nunes < >> gilberto.nunes32 at gmail.com> ??????: >> >But I will give it a try in the lvm mirroring process.... Thanks >> >--- >> >Gilberto Nunes Ferreira >> > >> >(47) 3025-5907 >> >(47) 99676-7530 - Whatsapp / Telegram >> > >> >Skype: gilberto.nunes36 >> > >> > >> > >> > >> > >> >Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < >> >gilberto.nunes32 at gmail.com> escreveu: >> > >> >> Well, actually I am not concerned ONLY about mirroring the data, >but >> >with >> >> reliability on it. >> >> Here during my lab tests I notice if I pull off a disk and then >power >> >on >> >> the server, the LVM crashes... >> >> On the other hand, with ZFS even in degraded state the system >booted >> >> normally. >> >> The same happens with mdadm. That's the point. >> >> >> >> --- >> >> Gilberto Nunes Ferreira >> >> >> >> (47) 3025-5907 >> >> (47) 99676-7530 - Whatsapp / Telegram >> >> >> >> Skype: gilberto.nunes36 >> >> >> >> >> >> >> >> >> >> >> >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr > >> >> escreveu: >> >> >> >>> LVM supports mirroring or raid1. 
>> >>> So to create a raid1 you would do something like "lvcreate -n >> >raidvol -L >> >>> 1T --mirrors 1 /dev/bigvg" >> >>> You can add a mirror to an existing logical volume using >lvconvert >> >with >> >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" >> >>> >> >>> But other than simple mirroring you will need mdadm. >> >>> >> >>> >> >>> >> >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: >> >>> >> >>> Doing some research and the chvg command, which is responsable >for >> >create >> >>> hotspare disks in LVM is available only in AIX! >> >>> I have a Debian Buster box and there is no such command chvg. >> >Correct me >> >>> if I am wrong.... >> >>> --- >> >>> Gilberto Nunes Ferreira >> >>> >> >>> (47) 3025-5907 >> >>> (47) 99676-7530 - Whatsapp / Telegram >> >>> >> >>> Skype: gilberto.nunes36 >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < >> >>> hunter86_bg at yahoo.com> escreveu: >> >>> >> >>>> LVM allows creating/converting striped/mirrored LVs without any >> >dowtime >> >>>> and it's using the md module. >> >>>> >> >>>> Best Regards, >> >>>> Strahil Nikolov >> >>>> >> >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >> >>>> gilberto.nunes32 at gmail.com> ??????: >> >>>> >Hi there.... >> >>>> > >> >>>> >'till now, I am using glusterfs over XFS and so far so good. >> >>>> >Using LVM too.... >> >>>> >Unfortunately, there is no way with XFS to merge two or more >HDD, >> >in >> >>>> >order >> >>>> >to use more than one HDD, like RAID1 or RAID5. >> >>>> >My primary goal is to use two server with GlusterFS on top of >> >multiples >> >>>> >HDDs for qemu images. >> >>>> >I have think about BTRFS or mdadm. >> >>>> >Anybody has some experience on this? >> >>>> > >> >>>> >Thanks a lot >> >>>> > >> >>>> >--- >> >>>> >Gilberto Nunes Ferreira >> >>>> >> >>> >> >>> ________ >> >>> >> >>> >> >>> >> >>> Community Meeting Calendar: >> >>> >> >>> Schedule - >> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> >>> Bridge: https://bluejeans.com/441850968 >> >>> >> >>> Gluster-users mailing >> >listGluster-users at gluster.orghttps:// >> lists.gluster.org/mailman/listinfo/gluster-users >> >>> >> >>> >> >>> -- >> >>> Alvin Starr || land: (647)478-6285 >> >>> Netvel Inc. || Cell: >> >(416)806-0133alvin at netvel.net || >> >>> >> >>> >> >>> ________ >> >>> >> >>> >> >>> >> >>> Community Meeting Calendar: >> >>> >> >>> Schedule - >> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> >>> Bridge: https://bluejeans.com/441850968 >> >>> >> >>> Gluster-users mailing list >> >>> Gluster-users at gluster.org >> >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >>> >> >> >> From hunter86_bg at yahoo.com Thu Jul 30 16:34:40 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 19:34:40 +0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: <61359FD5-58A6-48D2-BC9F-4682808A8964@yahoo.com> References: <61359FD5-58A6-48D2-BC9F-4682808A8964@yahoo.com> Message-ID: <9E427424-3EA8-4888-80E5-C9AC65EB7EC7@yahoo.com> Anyway in raid1+spare on a replica 2 volume you will be using 6 disks in total. It will be more optimal to get all those disks in 'replica 3' or 'replica 3 arbiter 1' (for the arbiter it would be optimal to have a small ssd and the data disks for tge actual data). Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 19:32:32 GMT+03:00, Strahil Nikolov ??????: >When using legacy, you need to prepare the MBR on each disk, so the >BIOS will be able to boot from it and load grub. 
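To make the point about preparing the MBR concrete: on a BIOS/legacy machine every member of the boot mirror needs its own boot loader, otherwise losing the first disk leaves nothing for the firmware to load. A sketch for a Debian-style install (assuming /boot sits on the RAID1; disk names are placeholders):

grub-install /dev/sda
grub-install /dev/sdb
update-grub        # grub2-mkconfig -o /boot/grub2/grub.cfg on EL-based systems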
> >On UEFI, you will need 2 entries each pointing to the other disk. There >is a thread for each approach in the CentOS7 forums and the peocedure >is almost the same on all linux . > >Best Regards, >Strahil Nikolov > >?? 30 ??? 2020 ?. 18:17:49 GMT+03:00, Gilberto Nunes > ??????: >>I am using Legacy... But I am using Virtualbox in my labs... Perhaps >>this >>is the problem... >>I don't do that in real hardware. But with spare disk (2 + 1) in mdadm >>it's >>fine. >> >>--- >>Gilberto Nunes Ferreira >> >>(47) 3025-5907 >>(47) 99676-7530 - Whatsapp / Telegram >> >>Skype: gilberto.nunes36 >> >> >> >> >> >>Em qui., 30 de jul. de 2020 ?s 12:16, Strahil Nikolov >> >>escreveu: >> >>> The crash with failed mdadm disk is not normal. We need to check >it >>out. >>> >>> Are you using Legacy or UEFI ? >>> >>> Best Regards, >>> Strahil Nikolov >>> >>> ?? 30 ??? 2020 ?. 16:25:47 GMT+03:00, Gilberto Nunes < >>> gilberto.nunes32 at gmail.com> ??????: >>> >But I will give it a try in the lvm mirroring process.... Thanks >>> >--- >>> >Gilberto Nunes Ferreira >>> > >>> >(47) 3025-5907 >>> >(47) 99676-7530 - Whatsapp / Telegram >>> > >>> >Skype: gilberto.nunes36 >>> > >>> > >>> > >>> > >>> > >>> >Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < >>> >gilberto.nunes32 at gmail.com> escreveu: >>> > >>> >> Well, actually I am not concerned ONLY about mirroring the data, >>but >>> >with >>> >> reliability on it. >>> >> Here during my lab tests I notice if I pull off a disk and then >>power >>> >on >>> >> the server, the LVM crashes... >>> >> On the other hand, with ZFS even in degraded state the system >>booted >>> >> normally. >>> >> The same happens with mdadm. That's the point. >>> >> >>> >> --- >>> >> Gilberto Nunes Ferreira >>> >> >>> >> (47) 3025-5907 >>> >> (47) 99676-7530 - Whatsapp / Telegram >>> >> >>> >> Skype: gilberto.nunes36 >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr >> >>> >> escreveu: >>> >> >>> >>> LVM supports mirroring or raid1. >>> >>> So to create a raid1 you would do something like "lvcreate -n >>> >raidvol -L >>> >>> 1T --mirrors 1 /dev/bigvg" >>> >>> You can add a mirror to an existing logical volume using >>lvconvert >>> >with >>> >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" >>> >>> >>> >>> But other than simple mirroring you will need mdadm. >>> >>> >>> >>> >>> >>> >>> >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: >>> >>> >>> >>> Doing some research and the chvg command, which is responsable >>for >>> >create >>> >>> hotspare disks in LVM is available only in AIX! >>> >>> I have a Debian Buster box and there is no such command chvg. >>> >Correct me >>> >>> if I am wrong.... >>> >>> --- >>> >>> Gilberto Nunes Ferreira >>> >>> >>> >>> (47) 3025-5907 >>> >>> (47) 99676-7530 - Whatsapp / Telegram >>> >>> >>> >>> Skype: gilberto.nunes36 >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < >>> >>> hunter86_bg at yahoo.com> escreveu: >>> >>> >>> >>>> LVM allows creating/converting striped/mirrored LVs without any >>> >dowtime >>> >>>> and it's using the md module. >>> >>>> >>> >>>> Best Regards, >>> >>>> Strahil Nikolov >>> >>>> >>> >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < >>> >>>> gilberto.nunes32 at gmail.com> ??????: >>> >>>> >Hi there.... >>> >>>> > >>> >>>> >'till now, I am using glusterfs over XFS and so far so good. >>> >>>> >Using LVM too.... 
>>> >>>> >Unfortunately, there is no way with XFS to merge two or more >>HDD, >>> >in >>> >>>> >order >>> >>>> >to use more than one HDD, like RAID1 or RAID5. >>> >>>> >My primary goal is to use two server with GlusterFS on top of >>> >multiples >>> >>>> >HDDs for qemu images. >>> >>>> >I have think about BTRFS or mdadm. >>> >>>> >Anybody has some experience on this? >>> >>>> > >>> >>>> >Thanks a lot >>> >>>> > >>> >>>> >--- >>> >>>> >Gilberto Nunes Ferreira >>> >>>> >>> >>> >>> >>> ________ >>> >>> >>> >>> >>> >>> >>> >>> Community Meeting Calendar: >>> >>> >>> >>> Schedule - >>> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> >>> Bridge: https://bluejeans.com/441850968 >>> >>> >>> >>> Gluster-users mailing >>> >listGluster-users at gluster.orghttps:// >>> lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >>> >>> >>> -- >>> >>> Alvin Starr || land: (647)478-6285 >>> >>> Netvel Inc. || Cell: >>> >(416)806-0133alvin at netvel.net || >>> >>> >>> >>> >>> >>> ________ >>> >>> >>> >>> >>> >>> >>> >>> Community Meeting Calendar: >>> >>> >>> >>> Schedule - >>> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >>> >>> Bridge: https://bluejeans.com/441850968 >>> >>> >>> >>> Gluster-users mailing list >>> >>> Gluster-users at gluster.org >>> >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >> >>> From gilberto.nunes32 at gmail.com Thu Jul 30 16:39:16 2020 From: gilberto.nunes32 at gmail.com (Gilberto Nunes) Date: Thu, 30 Jul 2020 13:39:16 -0300 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: <9E427424-3EA8-4888-80E5-C9AC65EB7EC7@yahoo.com> References: <61359FD5-58A6-48D2-BC9F-4682808A8964@yahoo.com> <9E427424-3EA8-4888-80E5-C9AC65EB7EC7@yahoo.com> Message-ID: I am dealing here with just 2 nodes and a bunch of disks on it... Just a scenario for study but real in many cases we face low budgets... Anyway thanks for the tips! --- Gilberto Nunes Ferreira (47) 3025-5907 (47) 99676-7530 - Whatsapp / Telegram Skype: gilberto.nunes36 Em qui., 30 de jul. de 2020 ?s 13:34, Strahil Nikolov escreveu: > Anyway in raid1+spare on a replica 2 volume you will be using 6 disks > in total. > It will be more optimal to get all those disks in 'replica 3' or > 'replica 3 arbiter 1' (for the arbiter it would be optimal to have a > small ssd and the data disks for tge actual data). > > Best Regards, > Strahil Nikolov > > ?? 30 ??? 2020 ?. 19:32:32 GMT+03:00, Strahil Nikolov < > hunter86_bg at yahoo.com> ??????: > >When using legacy, you need to prepare the MBR on each disk, so the > >BIOS will be able to boot from it and load grub. > > > >On UEFI, you will need 2 entries each pointing to the other disk. There > >is a thread for each approach in the CentOS7 forums and the peocedure > >is almost the same on all linux . > > > >Best Regards, > >Strahil Nikolov > > > >?? 30 ??? 2020 ?. 18:17:49 GMT+03:00, Gilberto Nunes > > ??????: > >>I am using Legacy... But I am using Virtualbox in my labs... Perhaps > >>this > >>is the problem... > >>I don't do that in real hardware. But with spare disk (2 + 1) in mdadm > >>it's > >>fine. > >> > >>--- > >>Gilberto Nunes Ferreira > >> > >>(47) 3025-5907 > >>(47) 99676-7530 - Whatsapp / Telegram > >> > >>Skype: gilberto.nunes36 > >> > >> > >> > >> > >> > >>Em qui., 30 de jul. de 2020 ?s 12:16, Strahil Nikolov > >> > >>escreveu: > >> > >>> The crash with failed mdadm disk is not normal. We need to check > >it > >>out. > >>> > >>> Are you using Legacy or UEFI ? > >>> > >>> Best Regards, > >>> Strahil Nikolov > >>> > >>> ?? 
30 ??? 2020 ?. 16:25:47 GMT+03:00, Gilberto Nunes < > >>> gilberto.nunes32 at gmail.com> ??????: > >>> >But I will give it a try in the lvm mirroring process.... Thanks > >>> >--- > >>> >Gilberto Nunes Ferreira > >>> > > >>> >(47) 3025-5907 > >>> >(47) 99676-7530 - Whatsapp / Telegram > >>> > > >>> >Skype: gilberto.nunes36 > >>> > > >>> > > >>> > > >>> > > >>> > > >>> >Em qui., 30 de jul. de 2020 ?s 10:23, Gilberto Nunes < > >>> >gilberto.nunes32 at gmail.com> escreveu: > >>> > > >>> >> Well, actually I am not concerned ONLY about mirroring the data, > >>but > >>> >with > >>> >> reliability on it. > >>> >> Here during my lab tests I notice if I pull off a disk and then > >>power > >>> >on > >>> >> the server, the LVM crashes... > >>> >> On the other hand, with ZFS even in degraded state the system > >>booted > >>> >> normally. > >>> >> The same happens with mdadm. That's the point. > >>> >> > >>> >> --- > >>> >> Gilberto Nunes Ferreira > >>> >> > >>> >> (47) 3025-5907 > >>> >> (47) 99676-7530 - Whatsapp / Telegram > >>> >> > >>> >> Skype: gilberto.nunes36 > >>> >> > >>> >> > >>> >> > >>> >> > >>> >> > >>> >> Em qui., 30 de jul. de 2020 ?s 10:18, Alvin Starr > >> > >>> >> escreveu: > >>> >> > >>> >>> LVM supports mirroring or raid1. > >>> >>> So to create a raid1 you would do something like "lvcreate -n > >>> >raidvol -L > >>> >>> 1T --mirrors 1 /dev/bigvg" > >>> >>> You can add a mirror to an existing logical volume using > >>lvconvert > >>> >with > >>> >>> something like "lvconvert --mirrors +1 bigvg/unmirroredlv" > >>> >>> > >>> >>> But other than simple mirroring you will need mdadm. > >>> >>> > >>> >>> > >>> >>> > >>> >>> On 7/30/20 8:39 AM, Gilberto Nunes wrote: > >>> >>> > >>> >>> Doing some research and the chvg command, which is responsable > >>for > >>> >create > >>> >>> hotspare disks in LVM is available only in AIX! > >>> >>> I have a Debian Buster box and there is no such command chvg. > >>> >Correct me > >>> >>> if I am wrong.... > >>> >>> --- > >>> >>> Gilberto Nunes Ferreira > >>> >>> > >>> >>> (47) 3025-5907 > >>> >>> (47) 99676-7530 - Whatsapp / Telegram > >>> >>> > >>> >>> Skype: gilberto.nunes36 > >>> >>> > >>> >>> > >>> >>> > >>> >>> > >>> >>> > >>> >>> Em qui., 30 de jul. de 2020 ?s 02:05, Strahil Nikolov < > >>> >>> hunter86_bg at yahoo.com> escreveu: > >>> >>> > >>> >>>> LVM allows creating/converting striped/mirrored LVs without any > >>> >dowtime > >>> >>>> and it's using the md module. > >>> >>>> > >>> >>>> Best Regards, > >>> >>>> Strahil Nikolov > >>> >>>> > >>> >>>> ?? 28 ??? 2020 ?. 22:43:39 GMT+03:00, Gilberto Nunes < > >>> >>>> gilberto.nunes32 at gmail.com> ??????: > >>> >>>> >Hi there.... > >>> >>>> > > >>> >>>> >'till now, I am using glusterfs over XFS and so far so good. > >>> >>>> >Using LVM too.... > >>> >>>> >Unfortunately, there is no way with XFS to merge two or more > >>HDD, > >>> >in > >>> >>>> >order > >>> >>>> >to use more than one HDD, like RAID1 or RAID5. > >>> >>>> >My primary goal is to use two server with GlusterFS on top of > >>> >multiples > >>> >>>> >HDDs for qemu images. > >>> >>>> >I have think about BTRFS or mdadm. > >>> >>>> >Anybody has some experience on this? 
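For completeness, a sketch of the 'replica 3 arbiter 1' layout suggested above; the arbiter brick stores only metadata, so the third box can be small (or SSD-backed) while the two data nodes keep the full copies. Hostnames and brick paths are invented:

gluster volume create vmimages replica 3 arbiter 1 \
    server1:/bricks/brick1/vmimages \
    server2:/bricks/brick1/vmimages \
    arbiter1:/bricks/arbiter/vmimages
gluster volume start vmimages
gluster volume info vmimages   # should report "Number of Bricks: 1 x (2 + 1) = 3"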
> >>> >>>> > > >>> >>>> >Thanks a lot > >>> >>>> > > >>> >>>> >--- > >>> >>>> >Gilberto Nunes Ferreira > >>> >>>> > >>> >>> > >>> >>> ________ > >>> >>> > >>> >>> > >>> >>> > >>> >>> Community Meeting Calendar: > >>> >>> > >>> >>> Schedule - > >>> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > >>> >>> Bridge: https://bluejeans.com/441850968 > >>> >>> > >>> >>> Gluster-users mailing > >>> >listGluster-users at gluster.orghttps:// > >>> lists.gluster.org/mailman/listinfo/gluster-users > >>> >>> > >>> >>> > >>> >>> -- > >>> >>> Alvin Starr || land: (647)478-6285 > >>> >>> Netvel Inc. || Cell: > >>> >(416)806-0133alvin at netvel.net || > >>> >>> > >>> >>> > >>> >>> ________ > >>> >>> > >>> >>> > >>> >>> > >>> >>> Community Meeting Calendar: > >>> >>> > >>> >>> Schedule - > >>> >>> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > >>> >>> Bridge: https://bluejeans.com/441850968 > >>> >>> > >>> >>> Gluster-users mailing list > >>> >>> Gluster-users at gluster.org > >>> >>> https://lists.gluster.org/mailman/listinfo/gluster-users > >>> >>> > >>> >> > >>> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From phaley at mit.edu Thu Jul 30 17:22:18 2020 From: phaley at mit.edu (Pat Haley) Date: Thu, 30 Jul 2020 13:22:18 -0400 Subject: [Gluster-users] gluster peer disconnected Message-ID: Hi, We have a cluster whose common storage is a gluster volume consisting of 4 bricks residing on 2 servers (more details at bottom).? Yesterday we experienced a power outage.? To start the gluster volume after the power came back I had to * manually start a gluster daemon on one of the servers (mseas-data3) * start the gluster volume on the other server (mseas-data2) o I had just tried starting the gluster volume without manually starting the other daemon but that was unsuccessful. After this my recollection is that the peers were talking to each other at that time. Today I was looking around and noticed that the mseas-data3 server is in a disconnected state (even though the compute nodes of our cluster are seeing the full gluster volume) ----------------------- [root at mseas-data2 ~]# gluster peer status Number of Peers: 1 Hostname: mseas-data3 Uuid: b39d4deb-c291-437e-8013-09050c1fa9e3 State: Peer in Cluster (Disconnected) ----------------------- Following the advice on https://lists.gluster.org/pipermail/gluster-users/2015-April/021597.html , I confirmed that the 2 servers can ping each other.? The gluster daemon on mseas-data2 is active but the daemon on mseas-data3 shows -------------------------------- [root at mseas-data3 ~]# service glusterd status glusterd dead but pid file exists -------------------------------- Is it safe to just restart that daemon on mseas-data3?? Is there some other procedure I should do? I ask because we have a number of job running that appear to be successfully writing to the gluster volume and I'd prefer that they continue if possible. Any advice would be appreciated.? 
Thanks --------------------------------------------------- [root at mseas-data2 ~]# gluster volume info Volume Name: data-volume Type: Distribute Volume ID: c162161e-2a2d-4dac-b015-f31fd89ceb18 Status: Started Number of Bricks: 4 Transport-type: tcp Bricks: Brick1: mseas-data2:/mnt/brick1 Brick2: mseas-data2:/mnt/brick2 Brick3: mseas-data3:/export/sda/brick3 Brick4: mseas-data3:/export/sdc/brick4 Options Reconfigured: diagnostics.client-log-level: ERROR network.inode-lru-limit: 50000 performance.md-cache-timeout: 60 performance.open-behind: off disperse.eager-lock: off auth.allow: * server.allow-insecure: on nfs.exports-auth-enable: on diagnostics.brick-sys-log-level: WARNING performance.readdir-ahead: on nfs.disable: on nfs.export-volumes: off cluster.min-free-disk: 1% -- -=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=- Pat Haley Email: phaley at mit.edu Center for Ocean Engineering Phone: (617) 253-6824 Dept. of Mechanical Engineering Fax: (617) 253-8125 MIT, Room 5-213 http://web.mit.edu/phaley/www/ 77 Massachusetts Avenue Cambridge, MA 02139-4301 -------------- next part -------------- An HTML attachment was scrubbed... URL: From archon810 at gmail.com Thu Jul 30 19:32:24 2020 From: archon810 at gmail.com (Artem Russakovskii) Date: Thu, 30 Jul 2020 12:32:24 -0700 Subject: [Gluster-users] [Gluster-devel] Announcing Gluster release 7.7 In-Reply-To: References: Message-ID: Hi, https://download.opensuse.org/repositories/home:/glusterfs:/Leap15.1-7/openSUSE_Leap_15.1/x86_64/ is still missing 7.7. Is there an ETA please? Thanks. Sincerely, Artem -- Founder, Android Police , APK Mirror , Illogical Robot LLC beerpla.net | @ArtemR On Wed, Jul 22, 2020 at 9:27 AM Rinku Kothiya wrote: > Hi, > > The Gluster community is pleased to announce the release of Gluster7.7 > (packages available at [1]). > Release notes for the release can be found at [2]. > > Major changes, features and limitations addressed in this release: > None > > Please Note: Some of the packages are unavailable and we are working on > it. We will release them soon. > > Thanks, > Gluster community > > References: > > [1] Packages for 7.7: > https://download.gluster.org/pub/gluster/glusterfs/7/7.7/ > > [2] Release notes for 7.7: > https://docs.gluster.org/en/latest/release-notes/7.7/ > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Thu Jul 30 20:14:32 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 23:14:32 +0300 Subject: [Gluster-users] gluster peer disconnected In-Reply-To: References: Message-ID: <81289C71-68D1-4253-982F-79A30FDFD8C4@yahoo.com> Is 'gluster pool list' consistent on all nodes? Do you have all your bricks properly mounted on the affected node? Bet Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 20:22:18 GMT+03:00, Pat Haley ??????: > >Hi, > >We have a cluster whose common storage is a gluster volume consisting >of >4 bricks residing on 2 servers (more details at bottom).? Yesterday we >experienced a power outage.? 
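On the question of whether restarting the dead glusterd is safe while jobs keep writing: glusterd is only the management daemon, so the brick processes (glusterfsd) and client mounts keep running without it, and restarting it should not interrupt in-flight I/O. A rough sequence for the affected node, offered as general guidance rather than something verified on this cluster:

# confirm the brick filesystems are mounted and the brick processes are alive
df -h /export/sda/brick3 /export/sdc/brick4
pgrep -af glusterfsd

# restart only the management daemon (clears the stale pid file)
service glusterd restart        # or: systemctl restart glusterd

# then check that both peers agree again
gluster pool list
gluster peer status
gluster volume status data-volume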
To start the gluster volume after the >power >came back I had to > > * manually start a gluster daemon on one of the servers (mseas-data3) > * start the gluster volume on the other server (mseas-data2) > o I had just tried starting the gluster volume without manually > starting the other daemon but that was unsuccessful. > >After this my recollection is that the peers were talking to each other > >at that time. > >Today I was looking around and noticed that the mseas-data3 server is >in >a disconnected state (even though the compute nodes of our cluster are >seeing the full gluster volume) > >----------------------- > >[root at mseas-data2 ~]# gluster peer status >Number of Peers: 1 > >Hostname: mseas-data3 >Uuid: b39d4deb-c291-437e-8013-09050c1fa9e3 >State: Peer in Cluster (Disconnected) > >----------------------- > >Following the advice on >https://lists.gluster.org/pipermail/gluster-users/2015-April/021597.html > >, I confirmed that the 2 servers can ping each other.? The gluster >daemon on mseas-data2 is active but the daemon on mseas-data3 shows > >-------------------------------- > >[root at mseas-data3 ~]# service glusterd status >glusterd dead but pid file exists > >-------------------------------- > >Is it safe to just restart that daemon on mseas-data3?? Is there some >other procedure I should do? I ask because we have a number of job >running that appear to be successfully writing to the gluster volume >and >I'd prefer that they continue if possible. > >Any advice would be appreciated.? Thanks > >--------------------------------------------------- > >[root at mseas-data2 ~]# gluster volume info > >Volume Name: data-volume >Type: Distribute >Volume ID: c162161e-2a2d-4dac-b015-f31fd89ceb18 >Status: Started >Number of Bricks: 4 >Transport-type: tcp >Bricks: >Brick1: mseas-data2:/mnt/brick1 >Brick2: mseas-data2:/mnt/brick2 >Brick3: mseas-data3:/export/sda/brick3 >Brick4: mseas-data3:/export/sdc/brick4 >Options Reconfigured: >diagnostics.client-log-level: ERROR >network.inode-lru-limit: 50000 >performance.md-cache-timeout: 60 >performance.open-behind: off >disperse.eager-lock: off >auth.allow: * >server.allow-insecure: on >nfs.exports-auth-enable: on >diagnostics.brick-sys-log-level: WARNING >performance.readdir-ahead: on >nfs.disable: on >nfs.export-volumes: off >cluster.min-free-disk: 1% From hunter86_bg at yahoo.com Thu Jul 30 20:15:50 2020 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Thu, 30 Jul 2020 23:15:50 +0300 Subject: [Gluster-users] [Gluster-devel] Announcing Gluster release 7.7 In-Reply-To: References: Message-ID: In CentOS7 , the packages were available several hours before the announcement. Best Regards, Strahil Nikolov ?? 30 ??? 2020 ?. 22:32:24 GMT+03:00, Artem Russakovskii ??????: >Hi, > >https://download.opensuse.org/repositories/home:/glusterfs:/Leap15.1-7/openSUSE_Leap_15.1/x86_64/ >is still missing 7.7. Is there an ETA please? > >Thanks. > > >Sincerely, >Artem > >-- >Founder, Android Police , APK Mirror >, Illogical Robot LLC >beerpla.net | @ArtemR > > >On Wed, Jul 22, 2020 at 9:27 AM Rinku Kothiya >wrote: > >> Hi, >> >> The Gluster community is pleased to announce the release of >Gluster7.7 >> (packages available at [1]). >> Release notes for the release can be found at [2]. >> >> Major changes, features and limitations addressed in this release: >> None >> >> Please Note: Some of the packages are unavailable and we are working >on >> it. We will release them soon. 
>> >> Thanks, >> Gluster community >> >> References: >> >> [1] Packages for 7.7: >> https://download.gluster.org/pub/gluster/glusterfs/7/7.7/ >> >> [2] Release notes for 7.7: >> https://docs.gluster.org/en/latest/release-notes/7.7/ >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> From ravishankar at redhat.com Fri Jul 31 03:20:32 2020 From: ravishankar at redhat.com (Ravishankar N) Date: Fri, 31 Jul 2020 08:50:32 +0530 Subject: [Gluster-users] Gluster linear scale-out performance In-Reply-To: References: Message-ID: On 25/07/20 4:35 am, Artem Russakovskii wrote: > Speaking of fio, could the gluster team please help me understand > something? > > We've been having lots of performance issues related to gluster using > attached block storage on Linode. At some point, I figured out that > Linode has a cap of 500 IOPS on their block storage > > (with spikes to 1500 IOPS). The block storage we use is formatted xfs > with 4KB bsize (block size). > > I then ran a bunch of fio tests on the block storage itself (not the > gluster fuse mount), which performed horribly when the bs parameter > was set to 4k: > fio--randrepeat=1--ioengine=libaio--direct=1--gtod_reduce=1--name=test--filename=test--bs=4k--iodepth=64--size=4G--readwrite=randwrite--ramp_time=4 > During these tests, fio ETA crawled to over an hour, at some point > dropped to 45min and I did see 500-1500 IOPS flash by briefly, then it > went back down to 0. I/O seems majorly choked for some reason,?likely > because gluster is using some of it. Transfer speed with such 4k block > size is 2 MB/s with spikes to 6MB/s. This causes the load on the > server to spike up to 100+ and brings down all our servers. > |Jobs: 1 (f=1): [w(1)][20.3%][r=0KiB/s,w=5908KiB/s][r=0,w=1477 > IOPS][eta 43m:00s] Jobs: 1 (f=1): > [w(1)][21.5%][r=0KiB/s,w=0KiB/s][r=0,w=0 IOPS][eta 44m:54s] | > |xfs_info /mnt/citadel_block1 meta-data=/dev/sdc isize=512 > agcount=103, agsize=26214400 blks = sectsz=512 attr=2, projid32bit=1 = > crc=1 finobt=1, sparse=0, rmapbt=0 = reflink=0 data = bsize=4096 > blocks=2684354560, imaxpct=25 = sunit=0 swidth=0 blks naming =version > 2 bsize=4096 ascii-ci=0, ftype=1 log =internal log bsize=4096 > blocks=51200, version=2 = sectsz=512 sunit=0 blks, lazy-count=1 > realtime =none extsz=4096 blocks=0, rtextents=0| > When I increase the --bs param to fio from 4k to, say, 64k, transfer > speed goes up significantly and is more like 50MB/s, and at 256k, it's > 200MB/s. > > So what I'm trying to understand is: > > 1. How does the xfs block size (4KB) relate to the block size in fio > tests? If we're limited by IOPS, and xfs block size is 4KB, how > can fio produce better results with varying --bs param? > 2. Would increasing the xfs data block size to something like > 64-256KB help with our issue of choking IO and skyrocketing load? > I have experienced similar behavior when running fio tests with bs=4k on a gluster volume backed by XFS with a high load (numjobs=32) . When I observed the strace of the brick processes (fsync -f -T -p $PID), I saw fysnc system calls taking around 2500 seconds which is insane. I'm not sure if this is specific to the way fio does its i/o pattern and the way XFS handles it. When I used 64k block sizes, the fio tests completed just fine. > > 1. 
The worst hangs and load spikes happen when we reboot one of the > gluster servers, but not when it's down - when it comes back > online. Even with gluster not showing anything pending heal, my > guess is it's still trying to do lots of IO between the 4 nodes > for some reason, but I don't understand why. > Do you kill all gluster processes (not just glusterd but even the brick processes) before issuing reboot? This is necessary to prevent I/O stalls. There is stop-all-gluster-processes.sh which should be available as a part of the gluster installation (maybe in /usr/share/glusterfs/scripts/) which you can use.? Can you check if this helps? Regards, Ravi > I've been banging my head on the wall with this problem for months. > Appreciate any feedback here. > > Thank you. > > gluster volume info below > |Volume Name: SNIP_data1 Type: Replicate Volume ID: SNIP Status: > Started Snapshot Count: 0 Number of Bricks: 1 x 4 = 4 Transport-type: > tcp Bricks: Brick1: nexus2:/mnt/SNIP_block1/SNIP_data1 Brick2: > forge:/mnt/SNIP_block1/SNIP_data1 Brick3: > hive:/mnt/SNIP_block1/SNIP_data1 Brick4: > citadel:/mnt/SNIP_block1/SNIP_data1 Options Reconfigured: > cluster.quorum-count: 1 cluster.quorum-type: fixed > network.ping-timeout: 5 network.remote-dio: enable > performance.rda-cache-limit: 256MB performance.readdir-ahead: on > performance.parallel-readdir: on network.inode-lru-limit: 500000 > performance.md-cache-timeout: 600 performance.cache-invalidation: on > performance.stat-prefetch: on features.cache-invalidation-timeout: 600 > features.cache-invalidation: on cluster.readdir-optimize: on > performance.io-thread-count: 32 server.event-threads: 4 > client.event-threads: 4 performance.read-ahead: off > cluster.lookup-optimize: on performance.cache-size: 1GB > cluster.self-heal-daemon: enable transport.address-family: inet > nfs.disable: on performance.client-io-threads: on > cluster.granular-entry-heal: enable cluster.data-self-heal-algorithm: full| > > Sincerely, > Artem > > -- > Founder, Android Police , APK Mirror > , Illogical Robot LLC > beerpla.net | @ArtemR > > > On Thu, Jul 23, 2020 at 12:08 AM Qing Wang > wrote: > > Hi, > > I have one more question about the Gluster linear scale-out > performance regarding the "write-behind off" case specifically -- > when "write-behind" is off, and still the stripe volumes and other > settings as early thread posted, the storage I/O seems not to > relate to the number of storage nodes. In my experiment, no matter > I have 2 brick server nodes or 8 brick server nodes, the > aggregated gluster I/O performance is ~100MB/sec. And fio > benchmark measurement gives the same result. If "write behind" is > on, then the storage performance is linear scale-out along with > the # of brick server nodes increasing. > > No matter the write behind option is on/off, I thought the gluster > I/O performance should be pulled and aggregated together as a > whole. If that?is the case, why do I get a consistent?gluster > performance (~100MB/sec) when "write behind" is off? Please?advise > me if I misunderstood something. > > Thanks, > Qing > > > > > On Tue, Jul 21, 2020 at 7:29 PM Qing Wang > wrote: > > fio gives me the correct linear scale-out results, and you're > right, the storage cache is the root cause that?makes the dd > measurement results not accurate at all. > > Thanks, > Qing > > > On Tue, Jul 21, 2020 at 2:53 PM Yaniv Kaul > wrote: > > > > On Tue, 21 Jul 2020, 21:43 Qing Wang > wrote: > > Hi?Yaniv, > > Thanks for the quick response. 
I forget to mention I > am testing the writing performance, not reading. In > this case, would the client cache hit rate still be a > big issue? > > > It's not hitting the storage directly. Since it's also > single threaded, it may also not saturate it. I highly > recommend testing properly. > Y. > > > I'll use fio to run my test once again, thanks for the > suggestion. > > Thanks, > Qing > > On Tue, Jul 21, 2020 at 2:38 PM Yaniv Kaul > > wrote: > > > > On Tue, 21 Jul 2020, 21:30 Qing Wang > > wrote: > > Hi, > > I am trying to test Gluster linear scale-out > performance by adding more storage > server/bricks, and measure the storage I/O > performance. To vary the storage server > number, I create several "stripe" volumes that > contain 2 brick servers, 3 brick servers, 4 > brick servers, and so on. On gluster client > side, I used "dd if=/dev/zero > of=/mnt/glusterfs/dns_test_data_26g bs=1M > count=26000" to create 26G data (or larger > size), and those data will be distributed to > the corresponding gluster?servers (each has > gluster brick on it) and "dd" returns the > final I/O throughput. The Internet is 40G > infiniband, although I didn't do any specific > configurations to use advanced features. > > > Your dd command is inaccurate, as it'll hit the > client cache. It is also single threaded. I > suggest switching to fio. > Y. > > > What confuses me is that the storage I/O seems > not to relate to the number of storage > nodes,?but Gluster documents said it should be > linear scaling. For example, when > "write-behind" is on, and when Infiniband > "jumbo frame" (connected mode) is on, I can > get ~800 MB/sec reported by "dd", no matter I > have 2 brick servers or 8 brick servers -- for > 2 server case, each server can have ~400 > MB/sec; for 4 server case, each server can > have ~200MB/sec. That said, each server I/O > does aggregate to the final storage I/O (800 > MB/sec), but this is not "linear scale-out". > > Can somebody help me to understand why this is > the case? I certainly can have some > misunderstanding/misconfiguration here. Please > correct me if I do, thanks! > > Best, > Qing > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From sacharya at redhat.com Fri Jul 31 05:38:54 2020 From: sacharya at redhat.com (Shwetha Acharya) Date: Fri, 31 Jul 2020 11:08:54 +0530 Subject: [Gluster-users] [Gluster-devel] Announcing Gluster release 7.7 In-Reply-To: References: Message-ID: Hi Artem, As per current Tentative plans for community packages we are supporting Leap15.2 only. 
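
For anyone already on (or moving to) Leap 15.2, a rough sketch of adding
the matching repository -- the path below is only an assumption
extrapolated from the 15.1 URL quoted above, so please check that it
resolves before relying on it:

$ zypper addrepo --refresh \
    https://download.opensuse.org/repositories/home:/glusterfs:/Leap15.2-7/openSUSE_Leap_15.2/ \
    glusterfs-leap152
$ zypper refresh glusterfs-leap152
$ zypper update glusterfs
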
Regards, Shwetha On Fri, Jul 31, 2020 at 1:03 AM Artem Russakovskii wrote: > Hi, > > > https://download.opensuse.org/repositories/home:/glusterfs:/Leap15.1-7/openSUSE_Leap_15.1/x86_64/ > is still missing 7.7. Is there an ETA please? > > Thanks. > > > Sincerely, > Artem > > -- > Founder, Android Police , APK Mirror > , Illogical Robot LLC > beerpla.net | @ArtemR > > > On Wed, Jul 22, 2020 at 9:27 AM Rinku Kothiya wrote: > >> Hi, >> >> The Gluster community is pleased to announce the release of Gluster7.7 >> (packages available at [1]). >> Release notes for the release can be found at [2]. >> >> Major changes, features and limitations addressed in this release: >> None >> >> Please Note: Some of the packages are unavailable and we are working on >> it. We will release them soon. >> >> Thanks, >> Gluster community >> >> References: >> >> [1] Packages for 7.7: >> https://download.gluster.org/pub/gluster/glusterfs/7/7.7/ >> >> [2] Release notes for 7.7: >> https://docs.gluster.org/en/latest/release-notes/7.7/ >> ________ >> >> >> >> Community Meeting Calendar: >> >> Schedule - >> Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC >> Bridge: https://bluejeans.com/441850968 >> >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> > ________ > > > > Community Meeting Calendar: > > Schedule - > Every 2nd and 4th Tuesday at 14:30 IST / 09:00 UTC > Bridge: https://bluejeans.com/441850968 > > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.danti at assyoma.it Fri Jul 31 10:18:24 2020 From: g.danti at assyoma.it (Gionatan Danti) Date: Fri, 31 Jul 2020 12:18:24 +0200 Subject: [Gluster-users] GlusterFS over multiples HDD In-Reply-To: References: <4B36B235-A79E-4C09-BE33-9F61BEAB8D14@yahoo.com> Message-ID: <67862aca73832294f60eb7f83ed3c5f1@assyoma.it> Il 2020-07-30 15:08 Gilberto Nunes ha scritto: > I meant, if you power off the server, pull off 1 disk, and then power > on we get system errors.... Hi, you are probably hitting some variant of this bug, rather than see LVM crashing: https://bugzilla.redhat.com/show_bug.cgi?id=1701504 If not, please write about your issue on the linux lvm mailing list. Thanks. -- Danti Gionatan Supporto Tecnico Assyoma S.r.l. - www.assyoma.it email: g.danti at assyoma.it - info at assyoma.it GPG public key ID: FF5F32A8
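
For the LVM angle above, a quick sketch of the evidence worth collecting
after such a failed boot, before writing to the linux-lvm list -- these
are generic LVM/systemd checks, not a fix:

$ lvs -a -o lv_name,vg_name,attr,devices          # did the logical volumes activate at all?
$ journalctl -b | grep -iE 'lvm|device-mapper'    # activation errors from the boot just done
$ systemctl --failed                              # a brick mount that never came up usually shows here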