From jenkins at build.gluster.org Mon Apr 1 01:45:03 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 1 Apr 2019 01:45:03 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <1758450900.40.1554083103600.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1688226 / core: Brick Still Died After Restart Glusterd & Glusterfsd Services https://bugzilla.redhat.com/1691833 / core: Client sends 128KByte network packet for 0 length file copy https://bugzilla.redhat.com/1685023 / core: FD processes for larger files are not closed soon after FOP finished https://bugzilla.redhat.com/1686396 / core: ls and rm run on contents of same directory from a single mount point results in ENOENT errors https://bugzilla.redhat.com/1689981 / geo-replication: OSError: [Errno 1] Operation not permitted - failing with socket files? https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job 'heketi-storage-copy-job' to complete on one-node k3s deployment. https://bugzilla.redhat.com/1694010 / glusterd: peer gets disconnected during a rolling upgrade. https://bugzilla.redhat.com/1690254 / glusterd: Volume create fails with "Commit failed" message if volumes is created using 3 nodes with glusterd restarts on 4th node. https://bugzilla.redhat.com/1690753 / glusterd: Volume stop when quorum not met is successful https://bugzilla.redhat.com/1686353 / libgfapi: flooding of "dict is NULL" logging https://bugzilla.redhat.com/1687063 / locks: glusterd :symbol lookup error: undefined symbol :use_spinlocks https://bugzilla.redhat.com/1690454 / posix-acl: mount-shared-storage.sh does not implement mount options https://bugzilla.redhat.com/1691617 / project-infrastructure: clang-scan tests are failing nightly. https://bugzilla.redhat.com/1691357 / project-infrastructure: core archive link from regression jobs throw not found error https://bugzilla.redhat.com/1692349 / project-infrastructure: gluster-csi-containers job is failing https://bugzilla.redhat.com/1685813 / project-infrastructure: Not able to run centos-regression getting exception error https://bugzilla.redhat.com/1693385 / project-infrastructure: request to change the version of fedora in fedora-smoke-job https://bugzilla.redhat.com/1693295 / project-infrastructure: rpc.statd not started on builder204.aws.gluster.org https://bugzilla.redhat.com/1691789 / project-infrastructure: rpc-statd service stops on AWS builders https://bugzilla.redhat.com/1694291 / project-infrastructure: Smoke test build artifacts do not contain gluster logs https://bugzilla.redhat.com/1686461 / quota: Quotad.log filled with 0-dict is not sent on wire [Invalid argument] messages https://bugzilla.redhat.com/1693184 / replicate: A brick process(glusterfsd) died with 'memory violation' [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... 
Name: build.log Type: application/octet-stream Size: 2887 bytes Desc: not available URL: From pkarampu at redhat.com Mon Apr 1 04:32:19 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Mon, 1 Apr 2019 10:02:19 +0530 Subject: [Gluster-devel] Issue with posix locks In-Reply-To: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> References: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> Message-ID: On Sun, Mar 31, 2019 at 11:29 PM Soumya Koduri wrote: > > > On 3/29/19 11:55 PM, Xavi Hernandez wrote: > > Hi all, > > > > there is one potential problem with posix locks when used in a > > replicated or dispersed volume. > > > > Some background: > > > > Posix locks allow any process to lock a region of a file multiple times, > > but a single unlock on a given region will release all previous locks. > > Locked regions can be different for each lock request and they can > > overlap. The resulting lock will cover the union of all locked regions. > > A single unlock (the region doesn't necessarily need to match any of the > > ranges used for locking) will create a "hole" in the currently locked > > region, independently of how many times a lock request covered that > region. > > > > For this reason, the locks xlator simply combines the locked regions > > that are requested, but it doesn't track each individual lock range. > > > > Under normal circumstances this works fine. But there are some cases > > where this behavior is not sufficient. For example, suppose we have a > > replica 3 volume with quorum = 2. Given the special nature of posix > > locks, AFR sends the lock request sequentially to each one of the > > bricks, to avoid that conflicting lock requests from other clients could > > require to unlock an already locked region on the client that has not > > got enough successful locks (i.e. quorum). An unlock here not only would > > cancel the current lock request. It would also cancel any previously > > acquired lock. > > > > I may not have fully understood, please correct me. AFAIU, lk xlator > merges locks only if both the lk-owner and the client opaque matches. > > In the case which you have mentioned above, considering clientA acquired > locks on majority of quorum (say nodeA and nodeB) and clientB on nodeC > alone- clientB now has to unlock/cancel the lock it acquired on nodeC. > > You are saying the it could pose a problem if there were already > successful locks taken by clientB for the same region which would get > unlocked by this particular unlock request..right? > > Assuming the previous locks acquired by clientB are shared (otherwise > clientA wouldn't have got granted lock for the same region on nodeA & > nodeB), they would still hold true on nodeA & nodeB as the unlock > request was sent to only nodeC. Since the majority of quorum nodes still > hold the locks by clientB, this isn't serious issue IMO. > > I haven't looked into heal part but would like to understand if this is > really an issue in normal scenarios as well. > This is how I understood the code. Consider the following case: Nodes A, B, C have locks with start and end offsets: 5-15 from mount-1 and lock-range 2-3 from mount-2. If mount-1 requests nonblocking lock with lock-range 1-7 and in parallel lets say mount-2 issued unlock of 2-3 as well. 
nodeA got unlock from mount-2 with range 2-3 then lock from mount-1 with range 1-7, so the lock is granted and merged to give 1-15 nodeB got lock from mount-1 with range 1-7 before unlock of 2-3 which leads to EAGAIN which will trigger unlocks on granted lock in mount-1 which will end up doing unlock of 1-7 on nodeA leading to lock-range 8-15 instead of the original 5-15 on nodeA. Whereas nodeB and nodeC will have range 5-15. Let me know if my understanding is wrong. > Thanks, > Soumya > > > However, when something goes wrong (a brick dies during a lock request, > > or there's a network partition or some other weird situation), it could > > happen that even using sequential locking, only one brick succeeds the > > lock request. In this case, AFR should cancel the previous lock (and it > > does), but this also cancels any previously acquired lock on that > > region, which is not good. > > > > A similar thing can happen if we try to recover (heal) posix locks that > > were active after a brick has been disconnected (for any reason) and > > then reconnected. > > > > To fix all these situations we need to change the way posix locks are > > managed by locks xlator. One possibility would be to embed the lock > > request inside an inode transaction using inodelk. Since inodelks do not > > suffer this problem, the follwing posix lock could be sent safely. > > However this implies an additional network request, which could cause > > some performance impact. Eager-locking could minimize the impact in some > > cases. However this approach won't work for lock recovery after a > > disconnect. > > > > Another possibility is to send a special partial posix lock request > > which won't be immediately merged with already existing locks once > > granted. An additional confirmation request of the partial posix lock > > will be required to fully grant the current lock and merge it with the > > existing ones. This requires a new network request, which will add > > latency, and makes everything more complex since there would be more > > combinations of states in which something could fail. > > > > So I think one possible solution would be the following: > > > > 1. Keep each posix lock as an independent object in locks xlator. This > > will make it possible to "invalidate" any already granted lock without > > affecting already established locks. > > > > 2. Additionally, we'll keep a sorted list of non-overlapping segments of > > locked regions. And we'll count, for each region, how many locks are > > referencing it. One lock can reference multiple segments, and each > > segment can be referenced by multiple locks. > > > > 3. An additional lock request that overlaps with an existing segment, > > can cause this segment to be split to satisfy the non-overlapping > property. > > > > 4. When an unlock request is received, all segments intersecting with > > the region are eliminated (it may require some segment splits on the > > edges), and the unlocked region is subtracted from each lock associated > > to the segment. If a lock gets an empty region, it's removed. > > > > 5. We'll create a special "remove lock" request that doesn't unlock a > > region but removes an already granted lock. This will decrease the > > number of references to each of the segments this lock was covering. If > > some segment reaches 0, it's removed. Otherwise it remains there. This > > special request will only be used internally to cancel already acquired > > locks that cannot be fully granted due to quorum issues or any other > > problem. 
> > > > In some weird cases, the list of segments can be huge (many locks > > overlapping only on a single byte, so each segment represents only one > > byte). We can try to find some smarter structure that minimizes this > > problem or limit the number of segments (for example returning ENOLCK > > when there are too many). > > > > What do you think ? > > > > Xavi > > > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhernandez at redhat.com Mon Apr 1 07:54:22 2019 From: xhernandez at redhat.com (Xavi Hernandez) Date: Mon, 1 Apr 2019 09:54:22 +0200 Subject: [Gluster-devel] Issue with posix locks In-Reply-To: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> References: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> Message-ID: On Sun, Mar 31, 2019 at 7:59 PM Soumya Koduri wrote: > > > On 3/29/19 11:55 PM, Xavi Hernandez wrote: > > Hi all, > > > > there is one potential problem with posix locks when used in a > > replicated or dispersed volume. > > > > Some background: > > > > Posix locks allow any process to lock a region of a file multiple times, > > but a single unlock on a given region will release all previous locks. > > Locked regions can be different for each lock request and they can > > overlap. The resulting lock will cover the union of all locked regions. > > A single unlock (the region doesn't necessarily need to match any of the > > ranges used for locking) will create a "hole" in the currently locked > > region, independently of how many times a lock request covered that > region. > > > > For this reason, the locks xlator simply combines the locked regions > > that are requested, but it doesn't track each individual lock range. > > > > Under normal circumstances this works fine. But there are some cases > > where this behavior is not sufficient. For example, suppose we have a > > replica 3 volume with quorum = 2. Given the special nature of posix > > locks, AFR sends the lock request sequentially to each one of the > > bricks, to avoid that conflicting lock requests from other clients could > > require to unlock an already locked region on the client that has not > > got enough successful locks (i.e. quorum). An unlock here not only would > > cancel the current lock request. It would also cancel any previously > > acquired lock. > > > > I may not have fully understood, please correct me. AFAIU, lk xlator > merges locks only if both the lk-owner and the client opaque matches. > > In the case which you have mentioned above, considering clientA acquired > locks on majority of quorum (say nodeA and nodeB) and clientB on nodeC > alone- clientB now has to unlock/cancel the lock it acquired on nodeC. > > You are saying the it could pose a problem if there were already > successful locks taken by clientB for the same region which would get > unlocked by this particular unlock request..right? > Yes > > Assuming the previous locks acquired by clientB are shared (otherwise > clientA wouldn't have got granted lock for the same region on nodeA & > nodeB), they would still hold true on nodeA & nodeB as the unlock > request was sent to only nodeC. Since the majority of quorum nodes still > hold the locks by clientB, this isn't serious issue IMO. > Partially true. 
But if one of nodeA or nodeB dies or gets disconnected, there won't be any majority of bricks with correct locks, even though there are still 2 alive bricks. At this point, another client could successfully acquire a lock that, in theory, is already acquired by another client. > I haven't looked into heal part but would like to understand if this is > really an issue in normal scenarios as well. > If we consider that a brick disconnection is a normal scenario (which I think it should be on a large scale distributed file system), then this issue exists. But even without brick disconnections we can get incorrect results, as Pranith has just explained. Xavi > > Thanks, > Soumya > > > However, when something goes wrong (a brick dies during a lock request, > > or there's a network partition or some other weird situation), it could > > happen that even using sequential locking, only one brick succeeds the > > lock request. In this case, AFR should cancel the previous lock (and it > > does), but this also cancels any previously acquired lock on that > > region, which is not good. > > > > A similar thing can happen if we try to recover (heal) posix locks that > > were active after a brick has been disconnected (for any reason) and > > then reconnected. > > > > To fix all these situations we need to change the way posix locks are > > managed by locks xlator. One possibility would be to embed the lock > > request inside an inode transaction using inodelk. Since inodelks do not > > suffer this problem, the follwing posix lock could be sent safely. > > However this implies an additional network request, which could cause > > some performance impact. Eager-locking could minimize the impact in some > > cases. However this approach won't work for lock recovery after a > > disconnect. > > > > Another possibility is to send a special partial posix lock request > > which won't be immediately merged with already existing locks once > > granted. An additional confirmation request of the partial posix lock > > will be required to fully grant the current lock and merge it with the > > existing ones. This requires a new network request, which will add > > latency, and makes everything more complex since there would be more > > combinations of states in which something could fail. > > > > So I think one possible solution would be the following: > > > > 1. Keep each posix lock as an independent object in locks xlator. This > > will make it possible to "invalidate" any already granted lock without > > affecting already established locks. > > > > 2. Additionally, we'll keep a sorted list of non-overlapping segments of > > locked regions. And we'll count, for each region, how many locks are > > referencing it. One lock can reference multiple segments, and each > > segment can be referenced by multiple locks. > > > > 3. An additional lock request that overlaps with an existing segment, > > can cause this segment to be split to satisfy the non-overlapping > property. > > > > 4. When an unlock request is received, all segments intersecting with > > the region are eliminated (it may require some segment splits on the > > edges), and the unlocked region is subtracted from each lock associated > > to the segment. If a lock gets an empty region, it's removed. > > > > 5. We'll create a special "remove lock" request that doesn't unlock a > > region but removes an already granted lock. This will decrease the > > number of references to each of the segments this lock was covering. If > > some segment reaches 0, it's removed. 
Otherwise it remains there. This > > special request will only be used internally to cancel already acquired > > locks that cannot be fully granted due to quorum issues or any other > > problem. > > > > In some weird cases, the list of segments can be huge (many locks > > overlapping only on a single byte, so each segment represents only one > > byte). We can try to find some smarter structure that minimizes this > > problem or limit the number of segments (for example returning ENOLCK > > when there are too many). > > > > What do you think ? > > > > Xavi > > > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From skoduri at redhat.com Mon Apr 1 08:15:21 2019 From: skoduri at redhat.com (Soumya Koduri) Date: Mon, 1 Apr 2019 13:45:21 +0530 Subject: [Gluster-devel] Issue with posix locks In-Reply-To: References: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> Message-ID: <6c12597f-634d-8d7e-5c47-2158f6989299@redhat.com> On 4/1/19 10:02 AM, Pranith Kumar Karampuri wrote: > > > On Sun, Mar 31, 2019 at 11:29 PM Soumya Koduri > wrote: > > > > On 3/29/19 11:55 PM, Xavi Hernandez wrote: > > Hi all, > > > > there is one potential problem with posix locks when used in a > > replicated or dispersed volume. > > > > Some background: > > > > Posix locks allow any process to lock a region of a file multiple > times, > > but a single unlock on a given region will release all previous > locks. > > Locked regions can be different for each lock request and they can > > overlap. The resulting lock will cover the union of all locked > regions. > > A single unlock (the region doesn't necessarily need to match any > of the > > ranges used for locking) will create a "hole" in the currently > locked > > region, independently of how many times a lock request covered > that region. > > > > For this reason, the locks xlator simply combines the locked regions > > that are requested, but it doesn't track each individual lock range. > > > > Under normal circumstances this works fine. But there are some cases > > where this behavior is not sufficient. For example, suppose we > have a > > replica 3 volume with quorum = 2. Given the special nature of posix > > locks, AFR sends the lock request sequentially to each one of the > > bricks, to avoid that conflicting lock requests from other > clients could > > require to unlock an already locked region on the client that has > not > > got enough successful locks (i.e. quorum). An unlock here not > only would > > cancel the current lock request. It would also cancel any previously > > acquired lock. > > > > I may not have fully understood, please correct me. AFAIU, lk xlator > merges locks only if both the lk-owner and the client opaque matches. > > In the case which you have mentioned above, considering clientA > acquired > locks on majority of quorum (say nodeA and nodeB) and clientB on nodeC > alone- clientB now has to unlock/cancel the lock it acquired on nodeC. > > You are saying the it could pose a problem if there were already > successful locks taken by clientB for the same region which would get > unlocked by this particular unlock request..right? > > Assuming the previous locks acquired by clientB are shared (otherwise > clientA wouldn't have got granted lock for the same region on nodeA & > nodeB), they would still hold true on nodeA & nodeB? 
as the unlock > request was sent to only nodeC. Since the majority of quorum nodes > still > hold the locks by clientB, this isn't serious issue IMO. > > I haven't looked into heal part but would like to understand if this is > really an issue in normal scenarios as well. > > > This is how I understood the code. Consider the following case: > Nodes A, B, C have locks with start and end offsets: 5-15 from mount-1 > and lock-range 2-3 from mount-2. > If mount-1 requests nonblocking lock with lock-range 1-7 and in parallel > lets say mount-2 issued unlock of 2-3 as well. > > nodeA got unlock from mount-2 with range 2-3 then lock from mount-1 with > range 1-7, so the lock is granted and merged to give 1-15 > nodeB got lock from mount-1 with range 1-7 before unlock of 2-3 which > leads to EAGAIN which will trigger unlocks on granted lock in mount-1 > which will end up doing unlock of 1-7 on nodeA leading to lock-range > 8-15 instead of the original 5-15 on nodeA. Whereas nodeB and nodeC will > have range 5-15. > > Let me know if my understanding is wrong. Both of us mentioned the same points. So in the example you gave , mount-1 lost its previous lock on nodeA but majority of the quorum (nodeB and nodeC) still have the previous lock (range: 5-15) intact. So this shouldn't ideally lead to any issues as other conflicting locks are blocked or failed by majority of the nodes (provided there are no brick dis/re-connects). Wrt to brick disconnects/re-connects, if we can get in general lock healing (not getting into implementation details atm) support, that should take care of correcting lock range on nodeA as well right? That said I am not suggesting that we should stick to existing behavior, just trying to get clarification to check if we can avoid any overhead/side-effects with maintaining multiple locks. Thanks, Soumya > > > Thanks, > Soumya > > > However, when something goes wrong (a brick dies during a lock > request, > > or there's a network partition or some other weird situation), it > could > > happen that even using sequential locking, only one brick > succeeds the > > lock request. In this case, AFR should cancel the previous lock > (and it > > does), but this also cancels any previously acquired lock on that > > region, which is not good. > > > > A similar thing can happen if we try to recover (heal) posix > locks that > > were active after a brick has been disconnected (for any reason) and > > then reconnected. > > > > To fix all these situations we need to change the way posix locks > are > > managed by locks xlator. One possibility would be to embed the lock > > request inside an inode transaction using inodelk. Since inodelks > do not > > suffer this problem, the follwing posix lock could be sent safely. > > However this implies an additional network request, which could > cause > > some performance impact. Eager-locking could minimize the impact > in some > > cases. However this approach won't work for lock recovery after a > > disconnect. > > > > Another possibility is to send a special partial posix lock request > > which won't be immediately merged with already existing locks once > > granted. An additional confirmation request of the partial posix > lock > > will be required to fully grant the current lock and merge it > with the > > existing ones. This requires a new network request, which will add > > latency, and makes everything more complex since there would be more > > combinations of states in which something could fail. 
> > > > So I think one possible solution would be the following: > > > > 1. Keep each posix lock as an independent object in locks xlator. > This > > will make it possible to "invalidate" any already granted lock > without > > affecting already established locks. > > > > 2. Additionally, we'll keep a sorted list of non-overlapping > segments of > > locked regions. And we'll count, for each region, how many locks are > > referencing it. One lock can reference multiple segments, and each > > segment can be referenced by multiple locks. > > > > 3. An additional lock request that overlaps with an existing > segment, > > can cause this segment to be split to satisfy the non-overlapping > property. > > > > 4. When an unlock request is received, all segments intersecting > with > > the region are eliminated (it may require some segment splits on the > > edges), and the unlocked region is subtracted from each lock > associated > > to the segment. If a lock gets an empty region, it's removed. > > > > 5. We'll create a special "remove lock" request that doesn't > unlock a > > region but removes an already granted lock. This will decrease the > > number of references to each of the segments this lock was > covering. If > > some segment reaches 0, it's removed. Otherwise it remains there. > This > > special request will only be used internally to cancel already > acquired > > locks that cannot be fully granted due to quorum issues or any other > > problem. > > > > In some weird cases, the list of segments can be huge (many locks > > overlapping only on a single byte, so each segment represents > only one > > byte). We can try to find some smarter structure that minimizes this > > problem or limit the number of segments (for example returning > ENOLCK > > when there are too many). > > > > What do you think ? > > > > Xavi > > > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > -- > Pranith From xhernandez at redhat.com Mon Apr 1 08:53:23 2019 From: xhernandez at redhat.com (Xavi Hernandez) Date: Mon, 1 Apr 2019 10:53:23 +0200 Subject: [Gluster-devel] Issue with posix locks In-Reply-To: <6c12597f-634d-8d7e-5c47-2158f6989299@redhat.com> References: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> <6c12597f-634d-8d7e-5c47-2158f6989299@redhat.com> Message-ID: On Mon, Apr 1, 2019 at 10:15 AM Soumya Koduri wrote: > > > On 4/1/19 10:02 AM, Pranith Kumar Karampuri wrote: > > > > > > On Sun, Mar 31, 2019 at 11:29 PM Soumya Koduri > > wrote: > > > > > > > > On 3/29/19 11:55 PM, Xavi Hernandez wrote: > > > Hi all, > > > > > > there is one potential problem with posix locks when used in a > > > replicated or dispersed volume. > > > > > > Some background: > > > > > > Posix locks allow any process to lock a region of a file multiple > > times, > > > but a single unlock on a given region will release all previous > > locks. > > > Locked regions can be different for each lock request and they can > > > overlap. The resulting lock will cover the union of all locked > > regions. > > > A single unlock (the region doesn't necessarily need to match any > > of the > > > ranges used for locking) will create a "hole" in the currently > > locked > > > region, independently of how many times a lock request covered > > that region. > > > > > > For this reason, the locks xlator simply combines the locked > regions > > > that are requested, but it doesn't track each individual lock > range. 
> > > > > > Under normal circumstances this works fine. But there are some > cases > > > where this behavior is not sufficient. For example, suppose we > > have a > > > replica 3 volume with quorum = 2. Given the special nature of > posix > > > locks, AFR sends the lock request sequentially to each one of the > > > bricks, to avoid that conflicting lock requests from other > > clients could > > > require to unlock an already locked region on the client that has > > not > > > got enough successful locks (i.e. quorum). An unlock here not > > only would > > > cancel the current lock request. It would also cancel any > previously > > > acquired lock. > > > > > > > I may not have fully understood, please correct me. AFAIU, lk xlator > > merges locks only if both the lk-owner and the client opaque matches. > > > > In the case which you have mentioned above, considering clientA > > acquired > > locks on majority of quorum (say nodeA and nodeB) and clientB on > nodeC > > alone- clientB now has to unlock/cancel the lock it acquired on > nodeC. > > > > You are saying the it could pose a problem if there were already > > successful locks taken by clientB for the same region which would get > > unlocked by this particular unlock request..right? > > > > Assuming the previous locks acquired by clientB are shared (otherwise > > clientA wouldn't have got granted lock for the same region on nodeA & > > nodeB), they would still hold true on nodeA & nodeB as the unlock > > request was sent to only nodeC. Since the majority of quorum nodes > > still > > hold the locks by clientB, this isn't serious issue IMO. > > > > I haven't looked into heal part but would like to understand if this > is > > really an issue in normal scenarios as well. > > > > > > This is how I understood the code. Consider the following case: > > Nodes A, B, C have locks with start and end offsets: 5-15 from mount-1 > > and lock-range 2-3 from mount-2. > > If mount-1 requests nonblocking lock with lock-range 1-7 and in parallel > > lets say mount-2 issued unlock of 2-3 as well. > > > > nodeA got unlock from mount-2 with range 2-3 then lock from mount-1 with > > range 1-7, so the lock is granted and merged to give 1-15 > > nodeB got lock from mount-1 with range 1-7 before unlock of 2-3 which > > leads to EAGAIN which will trigger unlocks on granted lock in mount-1 > > which will end up doing unlock of 1-7 on nodeA leading to lock-range > > 8-15 instead of the original 5-15 on nodeA. Whereas nodeB and nodeC will > > have range 5-15. > > > > Let me know if my understanding is wrong. > > Both of us mentioned the same points. So in the example you gave , > mount-1 lost its previous lock on nodeA but majority of the quorum > (nodeB and nodeC) still have the previous lock (range: 5-15) intact. So > this shouldn't ideally lead to any issues as other conflicting locks are > blocked or failed by majority of the nodes (provided there are no brick > dis/re-connects). > But brick disconnects will happen (upgrades, disk failures, server maintenance, ...). Anyway, even without brick disconnects, in the previous example we have nodeA with range 8-15, and nodes B and C with range 5-15. If another lock from mount-2 comes for range 5-7, it will succeed on nodeA, but it will block on nodeB. At this point, mount-1 could attempt a lock on same range. It will block on nodeA, so we have a deadlock. In general, having discrepancies between bricks is not good because sooner or later it will cause some bad inconsistency. 
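To make the effect of that ordering easier to follow, here is a small, purely illustrative Python model of the semantics discussed in this thread. It is not the locks xlator code; the merge/lock/unlock helpers and the per-brick replay are invented for this example, and lock owners are ignored because every merge in this replay involves ranges of the same owner (mount-1), which is consistent with the per-owner merging mentioned earlier. Granted locks are merged into a union of byte ranges, and an unlock punches a hole in whatever is currently held, no matter how many lock calls covered those bytes. Replaying Pranith's sequence on two simulated bricks reproduces the divergence: one brick ends up holding 8-15 while the others still hold 5-15.

# Illustrative model only -- NOT the locks xlator implementation.
# Ranges are inclusive [start, end]; we only track the union of granted
# bytes on one brick, which is enough to show how the bricks diverge.

def merge(ranges):
    """Collapse overlapping or adjacent ranges into their union."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def lock(held, start, end):
    """A granted lock is simply merged into the currently held region."""
    return merge(held + [(start, end)])

def unlock(held, start, end):
    """An unlock punches a hole, splitting held ranges at the edges."""
    result = []
    for s, e in held:
        if e < start or s > end:        # outside the unlocked region
            result.append((s, e))
            continue
        if s < start:                   # piece surviving on the left
            result.append((s, start - 1))
        if e > end:                     # piece surviving on the right
            result.append((end + 1, e))
    return result

# Start state on every brick: 5-15 held by mount-1, 2-3 held by mount-2.
start_state = [(2, 3), (5, 15)]

# nodeA: unlock 2-3 (mount-2), then lock 1-7 (mount-1, granted and merged),
# then the rollback unlock 1-7 after the lock fails on another brick.
node_a = unlock(start_state, 2, 3)      # [(5, 15)]
node_a = lock(node_a, 1, 7)             # [(1, 15)]
node_a = unlock(node_a, 1, 7)           # [(8, 15)] -- the original 5-7 is gone

# nodeB: lock 1-7 arrives first, conflicts with mount-2's 2-3 and fails with
# EAGAIN (nothing changes there), then mount-2's unlock 2-3 is applied.
node_b = unlock(start_state, 2, 3)      # [(5, 15)]

print("nodeA:", node_a)                 # nodeA: [(8, 15)]
print("nodeB:", node_b)                 # nodeB: [(5, 15)]

Once the bricks disagree like this, the deadlock sketched above follows naturally: a later lock on 5-7 only conflicts on the bricks that still hold 5-15.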
> Wrt to brick disconnects/re-connects, if we can get in general lock > healing (not getting into implementation details atm) support, that > should take care of correcting lock range on nodeA as well right? > The problem we have seen is that to be able to correctly heal currently acquired locks on brick reconnect, there are cases where we need to release a lock that has already been granted (because the current owner doesn't have enough quorum and a just recovered connection tries to claim/heal it). In this case we need to deal with locks that have already been merged, but without interfering with other existing locks that already have quorum. > That said I am not suggesting that we should stick to existing behavior, > just trying to get clarification to check if we can avoid any > overhead/side-effects with maintaining multiple locks. > Right now is the only way we have found to provide a correct solution both for some cases of concurrent lock/unlock requests, and lock healing. Regards, Xavi > Thanks, > Soumya > > > > > > > > Thanks, > > Soumya > > > > > However, when something goes wrong (a brick dies during a lock > > request, > > > or there's a network partition or some other weird situation), it > > could > > > happen that even using sequential locking, only one brick > > succeeds the > > > lock request. In this case, AFR should cancel the previous lock > > (and it > > > does), but this also cancels any previously acquired lock on that > > > region, which is not good. > > > > > > A similar thing can happen if we try to recover (heal) posix > > locks that > > > were active after a brick has been disconnected (for any reason) > and > > > then reconnected. > > > > > > To fix all these situations we need to change the way posix locks > > are > > > managed by locks xlator. One possibility would be to embed the > lock > > > request inside an inode transaction using inodelk. Since inodelks > > do not > > > suffer this problem, the follwing posix lock could be sent safely. > > > However this implies an additional network request, which could > > cause > > > some performance impact. Eager-locking could minimize the impact > > in some > > > cases. However this approach won't work for lock recovery after a > > > disconnect. > > > > > > Another possibility is to send a special partial posix lock > request > > > which won't be immediately merged with already existing locks once > > > granted. An additional confirmation request of the partial posix > > lock > > > will be required to fully grant the current lock and merge it > > with the > > > existing ones. This requires a new network request, which will add > > > latency, and makes everything more complex since there would be > more > > > combinations of states in which something could fail. > > > > > > So I think one possible solution would be the following: > > > > > > 1. Keep each posix lock as an independent object in locks xlator. > > This > > > will make it possible to "invalidate" any already granted lock > > without > > > affecting already established locks. > > > > > > 2. Additionally, we'll keep a sorted list of non-overlapping > > segments of > > > locked regions. And we'll count, for each region, how many locks > are > > > referencing it. One lock can reference multiple segments, and each > > > segment can be referenced by multiple locks. > > > > > > 3. An additional lock request that overlaps with an existing > > segment, > > > can cause this segment to be split to satisfy the non-overlapping > > property. > > > > > > 4. 
When an unlock request is received, all segments intersecting > > with > > > the region are eliminated (it may require some segment splits on > the > > > edges), and the unlocked region is subtracted from each lock > > associated > > > to the segment. If a lock gets an empty region, it's removed. > > > > > > 5. We'll create a special "remove lock" request that doesn't > > unlock a > > > region but removes an already granted lock. This will decrease the > > > number of references to each of the segments this lock was > > covering. If > > > some segment reaches 0, it's removed. Otherwise it remains there. > > This > > > special request will only be used internally to cancel already > > acquired > > > locks that cannot be fully granted due to quorum issues or any > other > > > problem. > > > > > > In some weird cases, the list of segments can be huge (many locks > > > overlapping only on a single byte, so each segment represents > > only one > > > byte). We can try to find some smarter structure that minimizes > this > > > problem or limit the number of segments (for example returning > > ENOLCK > > > when there are too many). > > > > > > What do you think ? > > > > > > Xavi > > > > > > _______________________________________________ > > > Gluster-devel mailing list > > > Gluster-devel at gluster.org > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > > -- > > Pranith > -------------- next part -------------- An HTML attachment was scrubbed... URL: From skoduri at redhat.com Mon Apr 1 10:10:21 2019 From: skoduri at redhat.com (Soumya Koduri) Date: Mon, 1 Apr 2019 15:40:21 +0530 Subject: [Gluster-devel] Issue with posix locks In-Reply-To: References: <7a1adf3a-bdc9-a1ef-6f3c-e69d17c37adc@redhat.com> <6c12597f-634d-8d7e-5c47-2158f6989299@redhat.com> Message-ID: <4d3d57ca-64a7-ed76-f98d-01d8dfb76c39@redhat.com> On 4/1/19 2:23 PM, Xavi Hernandez wrote: > On Mon, Apr 1, 2019 at 10:15 AM Soumya Koduri > wrote: > > > > On 4/1/19 10:02 AM, Pranith Kumar Karampuri wrote: > > > > > > On Sun, Mar 31, 2019 at 11:29 PM Soumya Koduri > > > >> wrote: > > > > > > > >? ? ?On 3/29/19 11:55 PM, Xavi Hernandez wrote: > >? ? ? > Hi all, > >? ? ? > > >? ? ? > there is one potential problem with posix locks when used in a > >? ? ? > replicated or dispersed volume. > >? ? ? > > >? ? ? > Some background: > >? ? ? > > >? ? ? > Posix locks allow any process to lock a region of a file > multiple > >? ? ?times, > >? ? ? > but a single unlock on a given region will release all > previous > >? ? ?locks. > >? ? ? > Locked regions can be different for each lock request and > they can > >? ? ? > overlap. The resulting lock will cover the union of all locked > >? ? ?regions. > >? ? ? > A single unlock (the region doesn't necessarily need to > match any > >? ? ?of the > >? ? ? > ranges used for locking) will create a "hole" in the currently > >? ? ?locked > >? ? ? > region, independently of how many times a lock request covered > >? ? ?that region. > >? ? ? > > >? ? ? > For this reason, the locks xlator simply combines the > locked regions > >? ? ? > that are requested, but it doesn't track each individual > lock range. > >? ? ? > > >? ? ? > Under normal circumstances this works fine. But there are > some cases > >? ? ? > where this behavior is not sufficient. For example, suppose we > >? ? ?have a > >? ? ? > replica 3 volume with quorum = 2. Given the special nature > of posix > >? ? ? > locks, AFR sends the lock request sequentially to each one > of the > >? ? ? 
> bricks, to avoid that conflicting lock requests from other > >? ? ?clients could > >? ? ? > require to unlock an already locked region on the client > that has > >? ? ?not > >? ? ? > got enough successful locks (i.e. quorum). An unlock here not > >? ? ?only would > >? ? ? > cancel the current lock request. It would also cancel any > previously > >? ? ? > acquired lock. > >? ? ? > > > > >? ? ?I may not have fully understood, please correct me. AFAIU, lk > xlator > >? ? ?merges locks only if both the lk-owner and the client opaque > matches. > > > >? ? ?In the case which you have mentioned above, considering clientA > >? ? ?acquired > >? ? ?locks on majority of quorum (say nodeA and nodeB) and clientB > on nodeC > >? ? ?alone- clientB now has to unlock/cancel the lock it acquired > on nodeC. > > > >? ? ?You are saying the it could pose a problem if there were already > >? ? ?successful locks taken by clientB for the same region which > would get > >? ? ?unlocked by this particular unlock request..right? > > > >? ? ?Assuming the previous locks acquired by clientB are shared > (otherwise > >? ? ?clientA wouldn't have got granted lock for the same region on > nodeA & > >? ? ?nodeB), they would still hold true on nodeA & nodeB? as the > unlock > >? ? ?request was sent to only nodeC. Since the majority of quorum > nodes > >? ? ?still > >? ? ?hold the locks by clientB, this isn't serious issue IMO. > > > >? ? ?I haven't looked into heal part but would like to understand > if this is > >? ? ?really an issue in normal scenarios as well. > > > > > > This is how I understood the code. Consider the following case: > > Nodes A, B, C have locks with start and end offsets: 5-15 from > mount-1 > > and lock-range 2-3 from mount-2. > > If mount-1 requests nonblocking lock with lock-range 1-7 and in > parallel > > lets say mount-2 issued unlock of 2-3 as well. > > > > nodeA got unlock from mount-2 with range 2-3 then lock from > mount-1 with > > range 1-7, so the lock is granted and merged to give 1-15 > > nodeB got lock from mount-1 with range 1-7 before unlock of 2-3 > which > > leads to EAGAIN which will trigger unlocks on granted lock in > mount-1 > > which will end up doing unlock of 1-7 on nodeA leading to lock-range > > 8-15 instead of the original 5-15 on nodeA. Whereas nodeB and > nodeC will > > have range 5-15. > > > > Let me know if my understanding is wrong. > > Both of us mentioned the same points. So in the example you gave , > mount-1 lost its previous lock on nodeA but majority of the quorum > (nodeB and nodeC) still have the previous lock? (range: 5-15) > intact. So > this shouldn't ideally lead to any issues as other conflicting locks > are > blocked or failed by majority of the nodes (provided there are no brick > dis/re-connects). > > > But brick disconnects will happen (upgrades, disk failures, server > maintenance, ...). Anyway, even without brick disconnects, in the > previous example we have nodeA with range 8-15, and nodes B and C with > range 5-15. If another lock from mount-2 comes for range 5-7, it will > succeed on nodeA, but it will block on nodeB. At this point, mount-1 > could attempt a lock on same range. It will block on nodeA, so we have a > deadlock. > > In general, having discrepancies between bricks is not good because > sooner or later it will cause some bad inconsistency. 
> > > Wrt to brick disconnects/re-connects, if we can get in general lock > healing (not getting into implementation details atm) support, that > should take care of correcting lock range on nodeA as well right? > > > The problem we have seen is that to be able to correctly heal currently > acquired locks on brick reconnect, there are cases where we need to > release a lock that has already been granted (because the current owner > doesn't have enough quorum and a just recovered connection tries to > claim/heal it). In this case we need to deal with locks that have > already been merged, but without interfering with other existing locks > that already have quorum. > Okay. Thanks for the detailed explanation. That clears my doubts. -Soumya From sabose at redhat.com Tue Apr 2 09:45:57 2019 From: sabose at redhat.com (Sahina Bose) Date: Tue, 2 Apr 2019 15:15:57 +0530 Subject: [Gluster-devel] [ovirt-users] oVirt Survey 2019 results In-Reply-To: References: Message-ID: On Tue, Apr 2, 2019 at 12:07 PM Sandro Bonazzola wrote: > Thanks to the 143 participants to oVirt Survey 2019! > The survey is now closed and results are publicly available at > https://bit.ly/2JYlI7U > We'll analyze collected data in order to improve oVirt thanks to your > feedback. > > As a first step after reading the results I'd like to invite the 30 > persons who replied they're willing to contribute code to send an email to > devel at ovirt.org introducing themselves: we'll be more than happy to > welcome them and helping them getting started. > > I would also like to invite the 17 people who replied they'd like to help > organizing oVirt events in their area to either get in touch with me or > introduce themselves to users at ovirt.org so we can discuss about events > organization. > > Last but not least I'd like to invite the 38 people willing to contribute > documentation and the one willing to contribute localization to introduce > themselves to devel at ovirt.org. > Thank you all for the feedback. I was looking at the feedback specific to Gluster. While it's disheartening to see "Gluster weakest link in oVirt", I can understand where the feedback and frustration is coming from. Over the past month and in this survey, the common themes that have come up - Ensure smoother upgrades for the hyperconverged deployments with GlusterFS. The oVirt 4.3 release with upgrade to gluster 5.3 caused disruption for many users and we want to ensure this does not happen again. To this end, we are working on adding upgrade tests to OST based CI . Contributions are welcome. - improve performance on gluster storage domain. While we have seen promising results with gluster 6 release this is an ongoing effort. Please help this effort with inputs on the specific workloads and usecases that you run, gathering data and running tests. - deployment issues. We have worked to improve the deployment flow in 4.3 by adding pre-checks and changing to gluster-ansible role based deployment. We would love to hear specific issues that you're facing around this - please raise bugs if you haven't already ( https://bugzilla.redhat.com/enter_bug.cgi?product=cockpit-ovirt) > Thanks! 
> -- > > SANDRO BONAZZOLA > > MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV > > Red Hat EMEA > > sbonazzo at redhat.com > > _______________________________________________ > Users mailing list -- users at ovirt.org > To unsubscribe send an email to users-leave at ovirt.org > Privacy Statement: https://www.ovirt.org/site/privacy-policy/ > oVirt Code of Conduct: > https://www.ovirt.org/community/about/community-guidelines/ > List Archives: > https://lists.ovirt.org/archives/list/users at ovirt.org/message/4N5DYCXY2S6ZAUI7BWD4DEKZ6JL6MSGN/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atin.mukherjee83 at gmail.com Tue Apr 2 09:51:35 2019 From: atin.mukherjee83 at gmail.com (Atin Mukherjee) Date: Tue, 2 Apr 2019 15:21:35 +0530 Subject: [Gluster-devel] [ovirt-users] oVirt Survey 2019 results In-Reply-To: References: Message-ID: Thanks Sahina for including Gluster community mailing lists. As Sahina already mentioned we had a strong focus on upgrade testing path before releasing glusterfs-6. We conducted test day and along with functional pieces, tested upgrade paths like from 3.12, 4 & 5 to release-6, we encountered problems but we fixed them before releasing glusterfs-6. So overall this experience should definitely improve with glusterfs-6. On Tue, 2 Apr 2019 at 15:16, Sahina Bose wrote: > > > On Tue, Apr 2, 2019 at 12:07 PM Sandro Bonazzola > wrote: > >> Thanks to the 143 participants to oVirt Survey 2019! >> The survey is now closed and results are publicly available at >> https://bit.ly/2JYlI7U >> We'll analyze collected data in order to improve oVirt thanks to your >> feedback. >> >> As a first step after reading the results I'd like to invite the 30 >> persons who replied they're willing to contribute code to send an email to >> devel at ovirt.org introducing themselves: we'll be more than happy to >> welcome them and helping them getting started. >> >> I would also like to invite the 17 people who replied they'd like to help >> organizing oVirt events in their area to either get in touch with me or >> introduce themselves to users at ovirt.org so we can discuss about events >> organization. >> >> Last but not least I'd like to invite the 38 people willing to contribute >> documentation and the one willing to contribute localization to introduce >> themselves to devel at ovirt.org. >> > > Thank you all for the feedback. > I was looking at the feedback specific to Gluster. While it's > disheartening to see "Gluster weakest link in oVirt", I can understand > where the feedback and frustration is coming from. > > Over the past month and in this survey, the common themes that have come up > - Ensure smoother upgrades for the hyperconverged deployments with > GlusterFS. The oVirt 4.3 release with upgrade to gluster 5.3 caused > disruption for many users and we want to ensure this does not happen again. > To this end, we are working on adding upgrade tests to OST based CI . > Contributions are welcome. > > - improve performance on gluster storage domain. While we have seen > promising results with gluster 6 release this is an ongoing effort. Please > help this effort with inputs on the specific workloads and usecases that > you run, gathering data and running tests. > > - deployment issues. We have worked to improve the deployment flow in 4.3 > by adding pre-checks and changing to gluster-ansible role based deployment. 
> We would love to hear specific issues that you're facing around this - > please raise bugs if you haven't already ( > https://bugzilla.redhat.com/enter_bug.cgi?product=cockpit-ovirt) > > > >> Thanks! >> -- >> >> SANDRO BONAZZOLA >> >> MANAGER, SOFTWARE ENGINEERING, EMEA R&D RHV >> >> Red Hat EMEA >> >> sbonazzo at redhat.com >> >> _______________________________________________ >> Users mailing list -- users at ovirt.org >> To unsubscribe send an email to users-leave at ovirt.org >> Privacy Statement: https://www.ovirt.org/site/privacy-policy/ >> oVirt Code of Conduct: >> https://www.ovirt.org/community/about/community-guidelines/ >> List Archives: >> https://lists.ovirt.org/archives/list/users at ovirt.org/message/4N5DYCXY2S6ZAUI7BWD4DEKZ6JL6MSGN/ >> > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- --Atin -------------- next part -------------- An HTML attachment was scrubbed... URL: From nux at li.nux.ro Tue Apr 2 15:37:17 2019 From: nux at li.nux.ro (Nux!) Date: Tue, 2 Apr 2019 16:37:17 +0100 (BST) Subject: [Gluster-devel] [Gluster-users] Prioritise local bricks for IO? In-Reply-To: References: <29221907.583.1553599314586.JavaMail.zimbra@li.nux.ro> Message-ID: <383369409.4472.1554219437440.JavaMail.zimbra@li.nux.ro> Ok, cool, thanks. So.. no go. Any other ideas on how to accomplish task then? -- Sent from the Delta quadrant using Borg technology! Nux! www.nux.ro ----- Original Message ----- > From: "Nithya Balachandran" > To: "Poornima Gurusiddaiah" > Cc: "Nux!" , "gluster-users" , "Gluster Devel" > Sent: Thursday, 28 March, 2019 09:38:16 > Subject: Re: [Gluster-users] Prioritise local bricks for IO? > On Wed, 27 Mar 2019 at 20:27, Poornima Gurusiddaiah > wrote: > >> This feature is not under active development as it was not used widely. >> AFAIK its not supported feature. >> +Nithya +Raghavendra for further clarifications. >> > > This is not actively supported - there has been no work done on this > feature for a long time. > > Regards, > Nithya > >> >> Regards, >> Poornima >> >> On Wed, Mar 27, 2019 at 12:33 PM Lucian wrote: >> >>> Oh, that's just what the doctor ordered! >>> Hope it works, thanks >>> >>> On 27 March 2019 03:15:57 GMT, Vlad Kopylov wrote: >>>> >>>> I don't remember if it still in works >>>> NUFA >>>> >>>> https://github.com/gluster/glusterfs-specs/blob/master/done/Features/nufa.md >>>> >>>> v >>>> >>>> On Tue, Mar 26, 2019 at 7:27 AM Nux! wrote: >>>> >>>>> Hello, >>>>> >>>>> I'm trying to set up a distributed backup storage (no replicas), but >>>>> I'd like to prioritise the local bricks for any IO done on the volume. >>>>> This will be a backup stor, so in other words, I'd like the files to be >>>>> written locally if there is space, so as to save the NICs for other traffic. >>>>> >>>>> Anyone knows how this might be achievable, if at all? >>>>> >>>>> -- >>>>> Sent from the Delta quadrant using Borg technology! >>>>> >>>>> Nux! >>>>> www.nux.ro >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>>> >>>> >>> -- >>> Sent from my Android device with K-9 Mail. Please excuse my brevity. 
>>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users From ykaul at redhat.com Tue Apr 2 18:16:02 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Tue, 2 Apr 2019 21:16:02 +0300 Subject: [Gluster-devel] [Gluster-users] Prioritise local bricks for IO? In-Reply-To: <383369409.4472.1554219437440.JavaMail.zimbra@li.nux.ro> References: <29221907.583.1553599314586.JavaMail.zimbra@li.nux.ro> <383369409.4472.1554219437440.JavaMail.zimbra@li.nux.ro> Message-ID: On Tue, Apr 2, 2019 at 6:37 PM Nux! wrote: > Ok, cool, thanks. So.. no go. > > Any other ideas on how to accomplish task then? > While not a solution, I believe https://review.gluster.org/#/c/glusterfs/+/21333/ - read selection based on latency, is an interesting path towards this. (Of course, you'd need later also add write...) Y. > -- > Sent from the Delta quadrant using Borg technology! > > Nux! > www.nux.ro > > ----- Original Message ----- > > From: "Nithya Balachandran" > > To: "Poornima Gurusiddaiah" > > Cc: "Nux!" , "gluster-users" , > "Gluster Devel" > > Sent: Thursday, 28 March, 2019 09:38:16 > > Subject: Re: [Gluster-users] Prioritise local bricks for IO? > > > On Wed, 27 Mar 2019 at 20:27, Poornima Gurusiddaiah > > > wrote: > > > >> This feature is not under active development as it was not used widely. > >> AFAIK its not supported feature. > >> +Nithya +Raghavendra for further clarifications. > >> > > > > This is not actively supported - there has been no work done on this > > feature for a long time. > > > > Regards, > > Nithya > > > >> > >> Regards, > >> Poornima > >> > >> On Wed, Mar 27, 2019 at 12:33 PM Lucian wrote: > >> > >>> Oh, that's just what the doctor ordered! > >>> Hope it works, thanks > >>> > >>> On 27 March 2019 03:15:57 GMT, Vlad Kopylov > wrote: > >>>> > >>>> I don't remember if it still in works > >>>> NUFA > >>>> > >>>> > https://github.com/gluster/glusterfs-specs/blob/master/done/Features/nufa.md > >>>> > >>>> v > >>>> > >>>> On Tue, Mar 26, 2019 at 7:27 AM Nux! wrote: > >>>> > >>>>> Hello, > >>>>> > >>>>> I'm trying to set up a distributed backup storage (no replicas), but > >>>>> I'd like to prioritise the local bricks for any IO done on the > volume. > >>>>> This will be a backup stor, so in other words, I'd like the files to > be > >>>>> written locally if there is space, so as to save the NICs for other > traffic. > >>>>> > >>>>> Anyone knows how this might be achievable, if at all? > >>>>> > >>>>> -- > >>>>> Sent from the Delta quadrant using Borg technology! > >>>>> > >>>>> Nux! > >>>>> www.nux.ro > >>>>> _______________________________________________ > >>>>> Gluster-users mailing list > >>>>> Gluster-users at gluster.org > >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users > >>>>> > >>>> > >>> -- > >>> Sent from my Android device with K-9 Mail. Please excuse my brevity. 
> >>> _______________________________________________ > >>> Gluster-users mailing list > >>> Gluster-users at gluster.org > >>> https://lists.gluster.org/mailman/listinfo/gluster-users > >> > >> _______________________________________________ > >> Gluster-users mailing list > >> Gluster-users at gluster.org > > > https://lists.gluster.org/mailman/listinfo/gluster-users > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Wed Apr 3 03:22:06 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 3 Apr 2019 08:52:06 +0530 Subject: [Gluster-devel] Backporting important fixes in release branches Message-ID: Off late my observation has been that we're missing to backport critical/important fixes into the release branches and we do a course of correction when users discover the problems which isn't a great experience. I request all developers and maintainers to pay some attention on (a) deciding on which patches from mainline should be backported to what release branches & (b) do the same right away once the patches are merged in mainline branch instead of waiting to do them later. -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Wed Apr 3 05:16:51 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 3 Apr 2019 10:46:51 +0530 Subject: [Gluster-devel] is_nfs_export_available from nfs.rc failing too often? Message-ID: I'm observing the above test function failing too often because of which arbiter-mount.t test fails in many regression jobs. Such frequency of failures wasn't there earlier. Does anyone know what has changed recently to cause these failures in regression? I also hear when such failure happens a reboot is required, is that true and if so why? One of the reference : https://build.gluster.org/job/centos7-regression/5340/consoleFull -------------- next part -------------- An HTML attachment was scrubbed... URL: From jthottan at redhat.com Wed Apr 3 06:26:20 2019 From: jthottan at redhat.com (Jiffin Thottan) Date: Wed, 3 Apr 2019 02:26:20 -0400 (EDT) Subject: [Gluster-devel] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: Message-ID: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> Hi, is_nfs_export_available is just a wrapper around "showmount" command AFAIR. I saw following messages in console output. mount.nfs: rpc.statd is not running but is required for remote locking. 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or start statd. 05:06:55 mount.nfs: an incorrect mount option was specified For me it looks rpcbind may not be running on the machine. Usually rpcbind starts automatically on machines, don't know whether it can happen or not. Regards, Jiffin ----- Original Message ----- From: "Atin Mukherjee" To: "gluster-infra" , "Gluster Devel" Sent: Wednesday, April 3, 2019 10:46:51 AM Subject: [Gluster-devel] is_nfs_export_available from nfs.rc failing too often? I'm observing the above test function failing too often because of which arbiter-mount.t test fails in many regression jobs. Such frequency of failures wasn't there earlier. Does anyone know what has changed recently to cause these failures in regression? I also hear when such failure happens a reboot is required, is that true and if so why? 
One of the reference : https://build.gluster.org/job/centos7-regression/5340/consoleFull _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel From amukherj at redhat.com Wed Apr 3 11:00:42 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 3 Apr 2019 16:30:42 +0530 Subject: [Gluster-devel] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> Message-ID: On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan wrote: > Hi, > > is_nfs_export_available is just a wrapper around "showmount" command AFAIR. > I saw following messages in console output. > mount.nfs: rpc.statd is not running but is required for remote locking. > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or start > statd. > 05:06:55 mount.nfs: an incorrect mount option was specified > > For me it looks rpcbind may not be running on the machine. > Usually rpcbind starts automatically on machines, don't know whether it > can happen or not. > That's precisely what the question is. Why suddenly we're seeing this happening too frequently. Today I saw atleast 4 to 5 such failures already. Deepshika - Can you please help in inspecting this? > Regards, > Jiffin > > > ----- Original Message ----- > From: "Atin Mukherjee" > To: "gluster-infra" , "Gluster Devel" < > gluster-devel at gluster.org> > Sent: Wednesday, April 3, 2019 10:46:51 AM > Subject: [Gluster-devel] is_nfs_export_available from nfs.rc failing too > often? > > I'm observing the above test function failing too often because of which > arbiter-mount.t test fails in many regression jobs. Such frequency of > failures wasn't there earlier. Does anyone know what has changed recently > to cause these failures in regression? I also hear when such failure > happens a reboot is required, is that true and if so why? > > One of the reference : > https://build.gluster.org/job/centos7-regression/5340/consoleFull > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Wed Apr 3 11:52:50 2019 From: mscherer at redhat.com (Michael Scherer) Date: Wed, 03 Apr 2019 13:52:50 +0200 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> Message-ID: <46932285269538f29a3bdd0ccb177bfce091bf85.camel@redhat.com> Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan > wrote: > > > Hi, > > > > is_nfs_export_available is just a wrapper around "showmount" > > command AFAIR. > > I saw following messages in console output. > > mount.nfs: rpc.statd is not running but is required for remote > > locking. > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or > > start > > statd. > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > For me it looks rpcbind may not be running on the machine. > > Usually rpcbind starts automatically on machines, don't know > > whether it > > can happen or not. > > > > That's precisely what the question is. 
Why suddenly we're seeing this > happening too frequently. Today I saw atleast 4 to 5 such failures > already. > > Deepshika - Can you please help in inspecting this? So in the past, this kind of stuff did happen with ipv6, so this could be a change on AWS and/or a upgrade. We are currently investigating a set of failure that happen after reboot (resulting in partial network bring up, causing all kind of weird issue), but it take some time to verify it, and since we lost 33% of the team with Nigel departure, stuff do not move as fast as before. -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From ykaul at redhat.com Wed Apr 3 12:12:16 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Wed, 3 Apr 2019 15:12:16 +0300 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: <46932285269538f29a3bdd0ccb177bfce091bf85.camel@redhat.com> References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <46932285269538f29a3bdd0ccb177bfce091bf85.camel@redhat.com> Message-ID: On Wed, Apr 3, 2019 at 2:53 PM Michael Scherer wrote: > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan > > wrote: > > > > > Hi, > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > command AFAIR. > > > I saw following messages in console output. > > > mount.nfs: rpc.statd is not running but is required for remote > > > locking. > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or > > > start > > > statd. > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > For me it looks rpcbind may not be running on the machine. > > > Usually rpcbind starts automatically on machines, don't know > > > whether it > > > can happen or not. > > > > > > > That's precisely what the question is. Why suddenly we're seeing this > > happening too frequently. Today I saw atleast 4 to 5 such failures > > already. > > > > Deepshika - Can you please help in inspecting this? > > So in the past, this kind of stuff did happen with ipv6, so this could > be a change on AWS and/or a upgrade. > We need to enable IPv6, for two reasons: 1. IPv6 is common these days, even if we don't test with it, it should be there. 2. We should test with IPv6... I'm not sure, but I suspect we do disable IPv6 here and there. Example[1]. Y. [1] https://github.com/gluster/centosci/blob/master/jobs/scripts/glusto/setup-glusto.yml > > We are currently investigating a set of failure that happen after > reboot (resulting in partial network bring up, causing all kind of > weird issue), but it take some time to verify it, and since we lost 33% > of the team with Nigel departure, stuff do not move as fast as before. > > > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Wed Apr 3 12:19:15 2019 From: mscherer at redhat.com (Michael Scherer) Date: Wed, 03 Apr 2019 14:19:15 +0200 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? 
In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <46932285269538f29a3bdd0ccb177bfce091bf85.camel@redhat.com> Message-ID: <1658d7c7b3170ad7abe6afbcdf769775e9274da3.camel@redhat.com> Le mercredi 03 avril 2019 ? 15:12 +0300, Yaniv Kaul a ?crit : > On Wed, Apr 3, 2019 at 2:53 PM Michael Scherer > wrote: > > > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan < > > > jthottan at redhat.com> > > > wrote: > > > > > > > Hi, > > > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > > command AFAIR. > > > > I saw following messages in console output. > > > > mount.nfs: rpc.statd is not running but is required for remote > > > > locking. > > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, > > > > or > > > > start > > > > statd. > > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > > > For me it looks rpcbind may not be running on the machine. > > > > Usually rpcbind starts automatically on machines, don't know > > > > whether it > > > > can happen or not. > > > > > > > > > > That's precisely what the question is. Why suddenly we're seeing > > > this > > > happening too frequently. Today I saw atleast 4 to 5 such > > > failures > > > already. > > > > > > Deepshika - Can you please help in inspecting this? > > > > So in the past, this kind of stuff did happen with ipv6, so this > > could > > be a change on AWS and/or a upgrade. > > > > We need to enable IPv6, for two reasons: > 1. IPv6 is common these days, even if we don't test with it, it > should be > there. > 2. We should test with IPv6... > > I'm not sure, but I suspect we do disable IPv6 here and there. > Example[1]. > Y. > > [1] > https://github.com/gluster/centosci/blob/master/jobs/scripts/glusto/setup-glusto.yml We do disable ipv6 for sure, Nigel spent 3 days just on that for the AWS migration, and we do have a dedicated playbook applied on all builders that try to disable everything in every possible way: https://github.com/gluster/gluster.org_ansible_configuration/blob/master/roles/jenkins_builder/tasks/disable_ipv6_linux.yml According to the comment, that's from 2016, and I am sure this go further in the past since it wasn't just documented before. -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From mscherer at redhat.com Wed Apr 3 13:56:36 2019 From: mscherer at redhat.com (Michael Scherer) Date: Wed, 03 Apr 2019 15:56:36 +0200 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> Message-ID: <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan > wrote: > > > Hi, > > > > is_nfs_export_available is just a wrapper around "showmount" > > command AFAIR. > > I saw following messages in console output. > > mount.nfs: rpc.statd is not running but is required for remote > > locking. > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or > > start > > statd. > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > For me it looks rpcbind may not be running on the machine. 
> > Usually rpcbind starts automatically on machines, don't know > > whether it > > can happen or not. > > > > That's precisely what the question is. Why suddenly we're seeing this > happening too frequently. Today I saw atleast 4 to 5 such failures > already. > > Deepshika - Can you please help in inspecting this? So we think (we are not sure) that the issue is a bit complex. What we were investigating was nightly run fail on aws. When the build crash, the builder is restarted, since that's the easiest way to clean everything (since even with a perfect test suite that would clean itself, we could always end in a corrupt state on the system, WRT mount, fs, etc). In turn, this seems to cause trouble on aws, since cloud-init or something rename eth0 interface to ens5, without cleaning to the network configuration. So the network init script fail (because the image say "start eth0" and that's not present), but fail in a weird way. Network is initialised and working (we can connect), but the dhclient process is not in the right cgroup, and network.service is in failed state. Restarting network didn't work. In turn, this mean that rpc-statd refuse to start (due to systemd dependencies), which seems to impact various NFS tests. We have also seen that on some builders, rpcbind pick some IP v6 autoconfiguration, but we can't reproduce that, and there is no ip v6 set up anywhere. I suspect the network.service failure is somehow involved, but fail to see how. In turn, rpcbind.socket not starting could cause NFS test troubles. Our current stop gap fix was to fix all the builders one by one. Remove the config, kill the rogue dhclient, restart network service. However, we can't be sure this is going to fix the problem long term since this only manifest after a crash of the test suite, and it doesn't happen so often. (plus, it was working before some day in the past, when something did make this fail, and I do not know if that's a system upgrade, or a test change, or both). So we are still looking at it to have a complete understanding of the issue, but so far, we hacked our way to make it work (or so do I think). Deepshika is working to fix it long term, by fixing the issue regarding eth0/ens5 with a new base image. -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From amukherj at redhat.com Thu Apr 4 06:17:59 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Thu, 4 Apr 2019 11:47:59 +0530 Subject: [Gluster-devel] shd multiplexing patch has introduced coverity defects Message-ID: Based on yesterday's coverity scan report, 6 defects are introduced because of the shd multiplexing patch. Could you address them, Rafi? -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Thu Apr 4 10:43:43 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Thu, 4 Apr 2019 16:13:43 +0530 Subject: [Gluster-devel] rebal-all-nodes-migrate.t always fails now Message-ID: Based on what I have seen that any multi node test case will fail and the above one is picked first from that group and If I am correct none of the code fixes will go through the regression until this is fixed. I suspect it to be an infra issue again. 
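As a rough illustration of the per-builder check and repair described above (a sketch only -- the eth0/ens5 rename, the failed network.service and the rpc-statd dependency chain are taken from the description in this thread; the commands themselves are just standard EL7 tooling, not anything scripted in the repo):

    # 1. Did cloud-init rename the interface (eth0 -> ens5) while the ifcfg
    #    files still reference eth0?
    ip -o link show
    ls /etc/sysconfig/network-scripts/ifcfg-*

    # 2. network.service can be in a failed state even though connectivity
    #    still works.
    systemctl is-active network.service || systemctl status network.service

    # 3. rpc-statd refuses to start while its dependencies are failed, which
    #    is what breaks the NFS tests.
    systemctl status rpcbind.socket rpcbind.service rpc-statd.service
    ss -tulpn | grep ':111'

    # 4. The manual fix applied to the affected builders was along these
    #    lines: kill the rogue dhclient and restart networking and the rpc
    #    services (cleaning up the stale ifcfg file is not shown here).
    pkill dhclient
    systemctl restart network.service rpcbind.socket rpc-statd.service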
If we look at https://review.gluster.org/#/c/glusterfs/+/22501/ & https://build.gluster.org/job/centos7-regression/5382/ peer handshaking is stuck as 127.1.1.1 is unable to receive a response back, did we end up having firewall and other n/w settings screwed up? The test never fails locally. *15:51:21* Number of Peers: 2*15:51:21* *15:51:21* Hostname: 127.1.1.2*15:51:21* Uuid: 0e689ca8-d522-4b2f-b437-9dcde3579401*15:51:21* State: Accepted peer request (Connected)*15:51:21* *15:51:21* Hostname: 127.1.1.3*15:51:21* Uuid: a83a3bfa-729f-4a1c-8f9a-ae7d04ee4544*15:51:21* State: Accepted peer request (Connected) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Thu Apr 4 11:53:21 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 04 Apr 2019 13:53:21 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: Message-ID: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Le jeudi 04 avril 2019 ? 16:13 +0530, Atin Mukherjee a ?crit : > Based on what I have seen that any multi node test case will fail and > the > above one is picked first from that group and If I am correct none of > the > code fixes will go through the regression until this is fixed. I > suspect it > to be an infra issue again. If we look at > https://review.gluster.org/#/c/glusterfs/+/22501/ & > https://build.gluster.org/job/centos7-regression/5382/ peer > handshaking is > stuck as 127.1.1.1 is unable to receive a response back, did we end > up > having firewall and other n/w settings screwed up? The test never > fails > locally. The firewall didn't change, and since the start has a line: "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost interface work. (I am not even sure that netfilter do anything meaningful on the loopback interface, but maybe I am wrong, and not keen on looking kernel code for that). Ping seems to work fine as well, so we can exclude a routing issue. Maybe we should look at the socket, does it listen to a specific address or not ? > *15:51:21* Number of Peers: 2*15:51:21* *15:51:21* Hostname: > 127.1.1.2*15:51:21* Uuid: > 0e689ca8-d522-4b2f-b437-9dcde3579401*15:51:21* State: Accepted peer > request (Connected)*15:51:21* *15:51:21* Hostname: > 127.1.1.3*15:51:21* > Uuid: a83a3bfa-729f-4a1c-8f9a-ae7d04ee4544*15:51:21* State: Accepted > peer request (Connected) > _______________________________________________ > Gluster-infra mailing list > Gluster-infra at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-infra -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From mscherer at redhat.com Thu Apr 4 13:19:25 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 04 Apr 2019 15:19:25 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: Le jeudi 04 avril 2019 ? 13:53 +0200, Michael Scherer a ?crit : > Le jeudi 04 avril 2019 ? 
16:13 +0530, Atin Mukherjee a ?crit : > > Based on what I have seen that any multi node test case will fail > > and > > the > > above one is picked first from that group and If I am correct none > > of > > the > > code fixes will go through the regression until this is fixed. I > > suspect it > > to be an infra issue again. If we look at > > https://review.gluster.org/#/c/glusterfs/+/22501/ & > > https://build.gluster.org/job/centos7-regression/5382/ peer > > handshaking is > > stuck as 127.1.1.1 is unable to receive a response back, did we end > > up > > having firewall and other n/w settings screwed up? The test never > > fails > > locally. > > The firewall didn't change, and since the start has a line: > "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost interface > work. (I am not even sure that netfilter do anything meaningful on > the > loopback interface, but maybe I am wrong, and not keen on looking > kernel code for that). > > > Ping seems to work fine as well, so we can exclude a routing issue. > > Maybe we should look at the socket, does it listen to a specific > address or not ? So, I did look at the 20 first ailure, removed all not related to rebal-all-nodes-migrate.t and seen all were run on builder203, who was freshly reinstalled. As Deepshika noticed today, this one had a issue with ipv6, the 2nd issue we were tracking. Summary, rpcbind.socket systemd unit listen on ipv6 despites ipv6 being disabled, and the fix is to reload systemd. We have so far no idea on why it happen, but suspect this might be related to the network issue we did identify, as that happen only after a reboot, that happen only if a build is cancelled/crashed/aborted. I apply the workaround on builder203, so if the culprit is that specific issue, guess that's fixed. I started a test to see how it go: https://build.gluster.org/job/centos7-regression/5383/ -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From mscherer at redhat.com Thu Apr 4 13:54:22 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 04 Apr 2019 15:54:22 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: Le jeudi 04 avril 2019 ? 15:19 +0200, Michael Scherer a ?crit : > Le jeudi 04 avril 2019 ? 13:53 +0200, Michael Scherer a ?crit : > > Le jeudi 04 avril 2019 ? 16:13 +0530, Atin Mukherjee a ?crit : > > > Based on what I have seen that any multi node test case will fail > > > and > > > the > > > above one is picked first from that group and If I am correct > > > none > > > of > > > the > > > code fixes will go through the regression until this is fixed. I > > > suspect it > > > to be an infra issue again. If we look at > > > https://review.gluster.org/#/c/glusterfs/+/22501/ & > > > https://build.gluster.org/job/centos7-regression/5382/ peer > > > handshaking is > > > stuck as 127.1.1.1 is unable to receive a response back, did we > > > end > > > up > > > having firewall and other n/w settings screwed up? The test never > > > fails > > > locally. > > > > The firewall didn't change, and since the start has a line: > > "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost > > interface > > work. 
(I am not even sure that netfilter do anything meaningful on > > the > > loopback interface, but maybe I am wrong, and not keen on looking > > kernel code for that). > > > > > > Ping seems to work fine as well, so we can exclude a routing issue. > > > > Maybe we should look at the socket, does it listen to a specific > > address or not ? > > So, I did look at the 20 first ailure, removed all not related to > rebal-all-nodes-migrate.t and seen all were run on builder203, who > was > freshly reinstalled. As Deepshika noticed today, this one had a issue > with ipv6, the 2nd issue we were tracking. > > Summary, rpcbind.socket systemd unit listen on ipv6 despites ipv6 > being > disabled, and the fix is to reload systemd. We have so far no idea on > why it happen, but suspect this might be related to the network issue > we did identify, as that happen only after a reboot, that happen only > if a build is cancelled/crashed/aborted. > > I apply the workaround on builder203, so if the culprit is that > specific issue, guess that's fixed. > > I started a test to see how it go: > https://build.gluster.org/job/centos7-regression/5383/ The test did just pass, so I would assume the problem was local to builder203. Not sure why it was always selected, except because this was the only one that failed, so was always up for getting new jobs. Maybe we should increase the number of builder so this doesn't happen, as I guess the others builders were busy at that time ? -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From amukherj at redhat.com Thu Apr 4 15:55:39 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Thu, 4 Apr 2019 21:25:39 +0530 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: Thanks misc. I have always seen a pattern that on a reattempt (recheck centos) the same builder is picked up many time even though it's promised to pick up the builders in a round robin manner. On Thu, Apr 4, 2019 at 7:24 PM Michael Scherer wrote: > Le jeudi 04 avril 2019 ? 15:19 +0200, Michael Scherer a ?crit : > > Le jeudi 04 avril 2019 ? 13:53 +0200, Michael Scherer a ?crit : > > > Le jeudi 04 avril 2019 ? 16:13 +0530, Atin Mukherjee a ?crit : > > > > Based on what I have seen that any multi node test case will fail > > > > and > > > > the > > > > above one is picked first from that group and If I am correct > > > > none > > > > of > > > > the > > > > code fixes will go through the regression until this is fixed. I > > > > suspect it > > > > to be an infra issue again. If we look at > > > > https://review.gluster.org/#/c/glusterfs/+/22501/ & > > > > https://build.gluster.org/job/centos7-regression/5382/ peer > > > > handshaking is > > > > stuck as 127.1.1.1 is unable to receive a response back, did we > > > > end > > > > up > > > > having firewall and other n/w settings screwed up? The test never > > > > fails > > > > locally. > > > > > > The firewall didn't change, and since the start has a line: > > > "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost > > > interface > > > work. (I am not even sure that netfilter do anything meaningful on > > > the > > > loopback interface, but maybe I am wrong, and not keen on looking > > > kernel code for that). 
> > > > > > > > > Ping seems to work fine as well, so we can exclude a routing issue. > > > > > > Maybe we should look at the socket, does it listen to a specific > > > address or not ? > > > > So, I did look at the 20 first ailure, removed all not related to > > rebal-all-nodes-migrate.t and seen all were run on builder203, who > > was > > freshly reinstalled. As Deepshika noticed today, this one had a issue > > with ipv6, the 2nd issue we were tracking. > > > > Summary, rpcbind.socket systemd unit listen on ipv6 despites ipv6 > > being > > disabled, and the fix is to reload systemd. We have so far no idea on > > why it happen, but suspect this might be related to the network issue > > we did identify, as that happen only after a reboot, that happen only > > if a build is cancelled/crashed/aborted. > > > > I apply the workaround on builder203, so if the culprit is that > > specific issue, guess that's fixed. > > > > I started a test to see how it go: > > https://build.gluster.org/job/centos7-regression/5383/ > > The test did just pass, so I would assume the problem was local to > builder203. Not sure why it was always selected, except because this > was the only one that failed, so was always up for getting new jobs. > > Maybe we should increase the number of builder so this doesn't happen, > as I guess the others builders were busy at that time ? > > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ykaul at redhat.com Thu Apr 4 16:10:34 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Thu, 4 Apr 2019 19:10:34 +0300 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: I'm not convinced this is solved. Just had what I believe is a similar failure: *00:12:02.532* A dependency job for rpc-statd.service failed. See 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is not running but is required for remote locking.*00:12:02.532* mount.nfs: Either use '-o nolock' to keep locks local, or start statd.*00:12:02.532* mount.nfs: an incorrect mount option was specified (of course, it can always be my patch!) https://build.gluster.org/job/centos7-regression/5384/console On Thu, Apr 4, 2019 at 6:56 PM Atin Mukherjee wrote: > Thanks misc. I have always seen a pattern that on a reattempt (recheck > centos) the same builder is picked up many time even though it's promised > to pick up the builders in a round robin manner. > > On Thu, Apr 4, 2019 at 7:24 PM Michael Scherer > wrote: > >> Le jeudi 04 avril 2019 ? 15:19 +0200, Michael Scherer a ?crit : >> > Le jeudi 04 avril 2019 ? 13:53 +0200, Michael Scherer a ?crit : >> > > Le jeudi 04 avril 2019 ? 16:13 +0530, Atin Mukherjee a ?crit : >> > > > Based on what I have seen that any multi node test case will fail >> > > > and >> > > > the >> > > > above one is picked first from that group and If I am correct >> > > > none >> > > > of >> > > > the >> > > > code fixes will go through the regression until this is fixed. I >> > > > suspect it >> > > > to be an infra issue again. If we look at >> > > > https://review.gluster.org/#/c/glusterfs/+/22501/ & >> > > > https://build.gluster.org/job/centos7-regression/5382/ peer >> > > > handshaking is >> > > > stuck as 127.1.1.1 is unable to receive a response back, did we >> > > > end >> > > > up >> > > > having firewall and other n/w settings screwed up? 
The test never >> > > > fails >> > > > locally. >> > > >> > > The firewall didn't change, and since the start has a line: >> > > "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost >> > > interface >> > > work. (I am not even sure that netfilter do anything meaningful on >> > > the >> > > loopback interface, but maybe I am wrong, and not keen on looking >> > > kernel code for that). >> > > >> > > >> > > Ping seems to work fine as well, so we can exclude a routing issue. >> > > >> > > Maybe we should look at the socket, does it listen to a specific >> > > address or not ? >> > >> > So, I did look at the 20 first ailure, removed all not related to >> > rebal-all-nodes-migrate.t and seen all were run on builder203, who >> > was >> > freshly reinstalled. As Deepshika noticed today, this one had a issue >> > with ipv6, the 2nd issue we were tracking. >> > >> > Summary, rpcbind.socket systemd unit listen on ipv6 despites ipv6 >> > being >> > disabled, and the fix is to reload systemd. We have so far no idea on >> > why it happen, but suspect this might be related to the network issue >> > we did identify, as that happen only after a reboot, that happen only >> > if a build is cancelled/crashed/aborted. >> > >> > I apply the workaround on builder203, so if the culprit is that >> > specific issue, guess that's fixed. >> > >> > I started a test to see how it go: >> > https://build.gluster.org/job/centos7-regression/5383/ >> >> The test did just pass, so I would assume the problem was local to >> builder203. Not sure why it was always selected, except because this >> was the only one that failed, so was always up for getting new jobs. >> >> Maybe we should increase the number of builder so this doesn't happen, >> as I guess the others builders were busy at that time ? >> >> -- >> Michael Scherer >> Sysadmin, Community Infrastructure and Platform, OSAS >> >> >> _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Thu Apr 4 16:24:56 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 04 Apr 2019 18:24:56 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > I'm not convinced this is solved. Just had what I believe is a > similar > failure: > > *00:12:02.532* A dependency job for rpc-statd.service failed. See > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is > not running but is required for remote locking.*00:12:02.532* > mount.nfs: Either use '-o nolock' to keep locks local, or start > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > specified > > (of course, it can always be my patch!) > > https://build.gluster.org/job/centos7-regression/5384/console same issue, different builder (206). I will check them all, as the issue is more widespread than I expected (or it did popup since last time I checked). > > On Thu, Apr 4, 2019 at 6:56 PM Atin Mukherjee > wrote: > > > Thanks misc. I have always seen a pattern that on a reattempt > > (recheck > > centos) the same builder is picked up many time even though it's > > promised > > to pick up the builders in a round robin manner. 
> > > > On Thu, Apr 4, 2019 at 7:24 PM Michael Scherer > > > > wrote: > > > > > Le jeudi 04 avril 2019 ? 15:19 +0200, Michael Scherer a ?crit : > > > > Le jeudi 04 avril 2019 ? 13:53 +0200, Michael Scherer a ?crit : > > > > > Le jeudi 04 avril 2019 ? 16:13 +0530, Atin Mukherjee a ?crit > > > > > : > > > > > > Based on what I have seen that any multi node test case > > > > > > will fail > > > > > > and > > > > > > the > > > > > > above one is picked first from that group and If I am > > > > > > correct > > > > > > none > > > > > > of > > > > > > the > > > > > > code fixes will go through the regression until this is > > > > > > fixed. I > > > > > > suspect it > > > > > > to be an infra issue again. If we look at > > > > > > https://review.gluster.org/#/c/glusterfs/+/22501/ & > > > > > > https://build.gluster.org/job/centos7-regression/5382/ peer > > > > > > handshaking is > > > > > > stuck as 127.1.1.1 is unable to receive a response back, > > > > > > did we > > > > > > end > > > > > > up > > > > > > having firewall and other n/w settings screwed up? The test > > > > > > never > > > > > > fails > > > > > > locally. > > > > > > > > > > The firewall didn't change, and since the start has a line: > > > > > "-A INPUT -i lo -j ACCEPT", so all traffic on the localhost > > > > > interface > > > > > work. (I am not even sure that netfilter do anything > > > > > meaningful on > > > > > the > > > > > loopback interface, but maybe I am wrong, and not keen on > > > > > looking > > > > > kernel code for that). > > > > > > > > > > > > > > > Ping seems to work fine as well, so we can exclude a routing > > > > > issue. > > > > > > > > > > Maybe we should look at the socket, does it listen to a > > > > > specific > > > > > address or not ? > > > > > > > > So, I did look at the 20 first ailure, removed all not related > > > > to > > > > rebal-all-nodes-migrate.t and seen all were run on builder203, > > > > who > > > > was > > > > freshly reinstalled. As Deepshika noticed today, this one had a > > > > issue > > > > with ipv6, the 2nd issue we were tracking. > > > > > > > > Summary, rpcbind.socket systemd unit listen on ipv6 despites > > > > ipv6 > > > > being > > > > disabled, and the fix is to reload systemd. We have so far no > > > > idea on > > > > why it happen, but suspect this might be related to the network > > > > issue > > > > we did identify, as that happen only after a reboot, that > > > > happen only > > > > if a build is cancelled/crashed/aborted. > > > > > > > > I apply the workaround on builder203, so if the culprit is that > > > > specific issue, guess that's fixed. > > > > > > > > I started a test to see how it go: > > > > https://build.gluster.org/job/centos7-regression/5383/ > > > > > > The test did just pass, so I would assume the problem was local > > > to > > > builder203. Not sure why it was always selected, except because > > > this > > > was the only one that failed, so was always up for getting new > > > jobs. > > > > > > Maybe we should increase the number of builder so this doesn't > > > happen, > > > as I guess the others builders were busy at that time ? 
> > > > > > -- > > > Michael Scherer > > > Sysadmin, Community Infrastructure and Platform, OSAS > > > > > > > > > _______________________________________________ > > > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From rkavunga at redhat.com Fri Apr 5 05:05:44 2019 From: rkavunga at redhat.com (Rafi Kavungal Chundattu Parambil) Date: Fri, 5 Apr 2019 01:05:44 -0400 (EDT) Subject: [Gluster-devel] shd multiplexing patch has introduced coverity defects In-Reply-To: References: Message-ID: <334597073.13088844.1554440744816.JavaMail.zimbra@redhat.com> Yes. I will work on this. Rafi KC ----- Original Message ----- From: "Atin Mukherjee" To: "Rafi Kavungal Chundattu Parambil" Cc: "Gluster Devel" Sent: Thursday, April 4, 2019 11:47:59 AM Subject: shd multiplexing patch has introduced coverity defects Based on yesterday's coverity scan report, 6 defects are introduced because of the shd multiplexing patch. Could you address them, Rafi? From mscherer at redhat.com Fri Apr 5 06:45:19 2019 From: mscherer at redhat.com (Michael Scherer) Date: Fri, 05 Apr 2019 08:45:19 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> Message-ID: <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > > I'm not convinced this is solved. Just had what I believe is a > > similar > > failure: > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. See > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is > > not running but is required for remote locking.*00:12:02.532* > > mount.nfs: Either use '-o nolock' to keep locks local, or start > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > > specified > > > > (of course, it can always be my patch!) > > > > https://build.gluster.org/job/centos7-regression/5384/console > > same issue, different builder (206). I will check them all, as the > issue is more widespread than I expected (or it did popup since last > time I checked). Deepshika did notice that the issue came back on one server (builder202) after a reboot, so the rpcbind issue is not related to the network initscript one, so the RCA continue. We are looking for another workaround involving fiddling with the socket (until we find why it do use ipv6 at boot, but not after, when ipv6 is disabled). Maybe we could run the test suite on a node without all the ipv6 disabling to see if that cause a issue ? -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... 
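For reference, the "fiddling with the socket" workaround mentioned above could look something like this on an EL7 builder (purely a sketch: the drop-in path is made up, and the listener list assumes the stock rpcbind.socket unit -- if the stock unit differs, copy its non-IPv6 listeners instead):

    # /etc/systemd/system/rpcbind.socket.d/no-ipv6.conf   (path illustrative)
    [Socket]
    # Empty assignments clear the listeners inherited from the stock unit;
    # only the IPv4 and local listeners are added back.
    ListenStream=
    ListenDatagram=
    ListenStream=0.0.0.0:111
    ListenDatagram=0.0.0.0:111
    ListenStream=/var/run/rpcbind.sock

    # then apply it:
    systemctl daemon-reload
    systemctl restart rpcbind.socket rpcbind.service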
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From dkhandel at redhat.com Fri Apr 5 06:55:02 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Fri, 5 Apr 2019 12:25:02 +0530 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> Message-ID: On Fri, Apr 5, 2019 at 12:16 PM Michael Scherer wrote: > Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : > > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > > > I'm not convinced this is solved. Just had what I believe is a > > > similar > > > failure: > > > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. See > > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is > > > not running but is required for remote locking.*00:12:02.532* > > > mount.nfs: Either use '-o nolock' to keep locks local, or start > > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > > > specified > > > > > > (of course, it can always be my patch!) > > > > > > https://build.gluster.org/job/centos7-regression/5384/console > > > > same issue, different builder (206). I will check them all, as the > > issue is more widespread than I expected (or it did popup since last > > time I checked). > > Deepshika did notice that the issue came back on one server > (builder202) after a reboot, so the rpcbind issue is not related to the > network initscript one, so the RCA continue. > > We are looking for another workaround involving fiddling with the > socket (until we find why it do use ipv6 at boot, but not after, when > ipv6 is disabled). > > Maybe we could run the test suite on a node without all the ipv6 > disabling to see if that cause a issue ? > Do our test regression suit started supporting ipv6 now? Else this investigation would lead to further issues. > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > _______________________________________________ > Gluster-infra mailing list > Gluster-infra at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-infra > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ykaul at redhat.com Fri Apr 5 07:09:44 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Fri, 5 Apr 2019 10:09:44 +0300 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> Message-ID: On Fri, Apr 5, 2019 at 9:55 AM Deepshikha Khandelwal wrote: > > > On Fri, Apr 5, 2019 at 12:16 PM Michael Scherer > wrote: > >> Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : >> > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : >> > > I'm not convinced this is solved. Just had what I believe is a >> > > similar >> > > failure: >> > > >> > > *00:12:02.532* A dependency job for rpc-statd.service failed. 
See >> > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is >> > > not running but is required for remote locking.*00:12:02.532* >> > > mount.nfs: Either use '-o nolock' to keep locks local, or start >> > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was >> > > specified >> > > >> > > (of course, it can always be my patch!) >> > > >> > > https://build.gluster.org/job/centos7-regression/5384/console >> > >> > same issue, different builder (206). I will check them all, as the >> > issue is more widespread than I expected (or it did popup since last >> > time I checked). >> >> Deepshika did notice that the issue came back on one server >> (builder202) after a reboot, so the rpcbind issue is not related to the >> network initscript one, so the RCA continue. >> >> We are looking for another workaround involving fiddling with the >> socket (until we find why it do use ipv6 at boot, but not after, when >> ipv6 is disabled). >> >> Maybe we could run the test suite on a node without all the ipv6 >> disabling to see if that cause a issue ? >> > Do our test regression suit started supporting ipv6 now? Else this > investigation would lead to further issues. > I suspect not yet. But we certainly would like to, at some point, to ensure we run with IPv6 as well! Y. > -- >> Michael Scherer >> Sysadmin, Community Infrastructure and Platform, OSAS >> >> >> _______________________________________________ >> Gluster-infra mailing list >> Gluster-infra at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-infra >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rkavunga at redhat.com Fri Apr 5 08:02:12 2019 From: rkavunga at redhat.com (RAFI KC) Date: Fri, 5 Apr 2019 13:32:12 +0530 Subject: [Gluster-devel] shd multiplexing patch has introduced coverity defects In-Reply-To: <334597073.13088844.1554440744816.JavaMail.zimbra@redhat.com> References: <334597073.13088844.1554440744816.JavaMail.zimbra@redhat.com> Message-ID: <9e5f6cbf-7b6d-62d8-f386-899ab047be44@redhat.com> This patch will try to address the issues reported https://review.gluster.org/#/c/glusterfs/+/22514/ Regards Rafi KC On 4/5/19 10:35 AM, Rafi Kavungal Chundattu Parambil wrote: > Yes. I will work on this. > > Rafi KC > > ----- Original Message ----- > From: "Atin Mukherjee" > To: "Rafi Kavungal Chundattu Parambil" > Cc: "Gluster Devel" > Sent: Thursday, April 4, 2019 11:47:59 AM > Subject: shd multiplexing patch has introduced coverity defects > > Based on yesterday's coverity scan report, 6 defects are introduced because > of the shd multiplexing patch. Could you address them, Rafi? > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel From nbalacha at redhat.com Fri Apr 5 11:25:58 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Fri, 5 Apr 2019 16:55:58 +0530 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> Message-ID: On Fri, 5 Apr 2019 at 12:16, Michael Scherer wrote: > Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : > > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > > > I'm not convinced this is solved. 
Just had what I believe is a > > > similar > > > failure: > > > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. See > > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: rpc.statd is > > > not running but is required for remote locking.*00:12:02.532* > > > mount.nfs: Either use '-o nolock' to keep locks local, or start > > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > > > specified > > > > > > (of course, it can always be my patch!) > > > > > > https://build.gluster.org/job/centos7-regression/5384/console > > > > same issue, different builder (206). I will check them all, as the > > issue is more widespread than I expected (or it did popup since last > > time I checked). > > Deepshika did notice that the issue came back on one server > (builder202) after a reboot, so the rpcbind issue is not related to the > network initscript one, so the RCA continue. > > We are looking for another workaround involving fiddling with the > socket (until we find why it do use ipv6 at boot, but not after, when > ipv6 is disabled). > Could this be relevant? https://access.redhat.com/solutions/2798411 > > Maybe we could run the test suite on a node without all the ipv6 > disabling to see if that cause a issue ? > > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From srangana at redhat.com Fri Apr 5 13:36:49 2019 From: srangana at redhat.com (Shyam Ranganathan) Date: Fri, 5 Apr 2019 09:36:49 -0400 Subject: [Gluster-devel] Announcing Gluster release 4.1.8 Message-ID: The Gluster community is pleased to announce the release of Gluster 4.1.8 (packages available at [1]). Release notes for the release can be found at [2]. Major changes, features and limitations addressed in this release: None Thanks, Gluster community [1] Packages for 4.1.8: https://download.gluster.org/pub/gluster/glusterfs/4.1/4.1.8/ [2] Release notes for 4.1.8: https://docs.gluster.org/en/latest/release-notes/4.1.8/ From srangana at redhat.com Fri Apr 5 14:33:05 2019 From: srangana at redhat.com (Shyam Ranganathan) Date: Fri, 5 Apr 2019 10:33:05 -0400 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th Message-ID: Hi, Expected tagging date for release-6.1 is on April, 10th, 2019. Please ensure required patches are backported and also are passing regressions and are appropriately reviewed for easy merging and tagging on the date. Thanks, Shyam From mscherer at redhat.com Fri Apr 5 14:40:08 2019 From: mscherer at redhat.com (Michael Scherer) Date: Fri, 05 Apr 2019 16:40:08 +0200 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> Message-ID: <090785225412c2b5b269454f8812d0a165aea62d.camel@redhat.com> Le vendredi 05 avril 2019 ? 16:55 +0530, Nithya Balachandran a ?crit : > On Fri, 5 Apr 2019 at 12:16, Michael Scherer > wrote: > > > Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : > > > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > > > > I'm not convinced this is solved. 
Just had what I believe is a > > > > similar > > > > failure: > > > > > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. > > > > See > > > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: > > > > rpc.statd is > > > > not running but is required for remote locking.*00:12:02.532* > > > > mount.nfs: Either use '-o nolock' to keep locks local, or start > > > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > > > > specified > > > > > > > > (of course, it can always be my patch!) > > > > > > > > https://build.gluster.org/job/centos7-regression/5384/console > > > > > > same issue, different builder (206). I will check them all, as > > > the > > > issue is more widespread than I expected (or it did popup since > > > last > > > time I checked). > > > > Deepshika did notice that the issue came back on one server > > (builder202) after a reboot, so the rpcbind issue is not related to > > the > > network initscript one, so the RCA continue. > > > > We are looking for another workaround involving fiddling with the > > socket (until we find why it do use ipv6 at boot, but not after, > > when > > ipv6 is disabled). > > > > Could this be relevant? > https://access.redhat.com/solutions/2798411 Good catch. So, we already do that, Nigel took care of that (after 2 days of research). But I didn't knew the exact symptoms, and decided to double check just in case. And... there is no sysctl.conf in the initrd. Running dracut -v -f do not change anything. Running "dracut -v -f -H" take care of that (and this fix the problem), but: - our ansible script already run that - -H is hostonly, which is already the default on EL7 according to the doc. However, if dracut-config-generic is installed, it doesn't build a hostonly initrd, and so do not include the sysctl.conf file (who break rpcbnd, who break the test suite). And for some reason, it is installed the image in ec2 (likely default), but not by default on the builders. So what happen is that after a kernel upgrade, dracut rebuild a generic initrd instead of a hostonly one, who break things. And kernel was likely upgraded recently (and upgrade happen nightly (for some value of "night"), so we didn't see that earlier, nor with a fresh system. So now, we have several solution: - be explicit on using hostonly in dracut, so this doesn't happen again (or not for this reason) - disable ipv6 in rpcbind in a cleaner way (to be tested) - get the test suite work with ip v6 In the long term, I also want to monitor the processes, but for that, I need a VPN between the nagios server and ec2, and that project got blocked by several issues (like EC2 not support ecdsa keys, and we use that for ansible, so we have to come back to RSA for full automated deployment, and openvon requires to use certificates, so I need a newer python openssl for doing what I want, and RHEL 7 is too old, etc, etc). As the weekend approach for me, I just rebuilt the initrd for the time being. I guess forcing hostonly is the safest fix for now, but this will be for monday. -- Michael Scherer Sysadmin, Community Infrastructure and Platform, OSAS -------------- next part -------------- A non-text attachment was scrubbed... 
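A condensed version of the check and of the "be explicit about hostonly" fix discussed above (a sketch; the conf.d file name is arbitrary, the rest mirrors the dracut commands already mentioned in this thread):

    # Is the generic-initrd package present, and does the current initramfs
    # actually carry the sysctl configuration that disables IPv6?
    rpm -q dracut-config-generic
    lsinitrd /boot/initramfs-$(uname -r).img | grep sysctl

    # Force host-only images so the next kernel update cannot silently
    # regenerate a generic initrd, then rebuild for the running kernel.
    echo 'hostonly="yes"' > /etc/dracut.conf.d/99-hostonly.conf
    dracut -v -f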
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From phlogistonjohn at asynchrono.us Fri Apr 5 19:30:29 2019 From: phlogistonjohn at asynchrono.us (John Mulligan) Date: Fri, 05 Apr 2019 15:30:29 -0400 Subject: [Gluster-devel] Heketi v9.0.0 available for download Message-ID: <2297373.DkY5oybNEL@abydos> Heketi v9.0.0 is now available [1]. This is the new stable version of Heketi. Major additions in this release: * Limit volumes per Gluster cluster * Prevent server from starting if db has unknown dbattributes * Support a default admin mode option * Add an option to enable strict zone checking on volume creation * Add automatic pending operation clean-up functionality * Configurable device formatting parameters * Add consistency check feature and state examiner debugging tools * The faulty and non-functional "db delete-pending-entries" command has been removed This release contains numerous stability and bug fixes. A more detailed changelog is available at the release page [1]. -- John M. on behalf of the Heketi team [1] https://github.com/heketi/heketi/releases/tag/v9.0.0 From ravishankar at redhat.com Sat Apr 6 06:29:25 2019 From: ravishankar at redhat.com (Ravishankar N) Date: Sat, 6 Apr 2019 11:59:25 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: Message-ID: Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in case anyone wants to add blocker bugs. On 05/04/19 8:03 PM, Shyam Ranganathan wrote: > Hi, > > Expected tagging date for release-6.1 is on April, 10th, 2019. > > Please ensure required patches are backported and also are passing > regressions and are appropriately reviewed for easy merging and tagging > on the date. > > Thanks, > Shyam > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel From amukherj at redhat.com Sat Apr 6 08:38:44 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Sat, 6 Apr 2019 14:08:44 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: Message-ID: Hi Mohit, https://review.gluster.org/22495 should get into 6.1 as it?s a regression. Can you please attach the respective bug to the tracker Ravi pointed out? On Sat, 6 Apr 2019 at 12:00, Ravishankar N wrote: > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in > case anyone wants to add blocker bugs. > > > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: > > Hi, > > > > Expected tagging date for release-6.1 is on April, 10th, 2019. > > > > Please ensure required patches are backported and also are passing > > regressions and are appropriately reviewed for easy merging and tagging > > on the date. > > > > Thanks, > > Shyam > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jenkins at build.gluster.org Mon Apr 8 01:45:03 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 8 Apr 2019 01:45:03 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <825338949.63.1554687903451.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1688226 / core: Brick Still Died After Restart Glusterd & Glusterfsd Services https://bugzilla.redhat.com/1695416 / core: client log flooding with intentional socket shutdown message when a brick is down https://bugzilla.redhat.com/1691833 / core: Client sends 128KByte network packet for 0 length file copy https://bugzilla.redhat.com/1695480 / core: Global Thread Pool https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down directory listing https://bugzilla.redhat.com/1696721 / geo-replication: geo-replication failing after upgrade from 5.5 to 6.0 https://bugzilla.redhat.com/1694637 / geo-replication: Geo-rep: Rename to an existing file name destroys its content on slave https://bugzilla.redhat.com/1689981 / geo-replication: OSError: [Errno 1] Operation not permitted - failing with socket files? https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job 'heketi-storage-copy-job' to complete on one-node k3s deployment. https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs processes keeps increasing, using all available resources https://bugzilla.redhat.com/1690454 / posix-acl: mount-shared-storage.sh does not implement mount options https://bugzilla.redhat.com/1696518 / project-infrastructure: builder203 does not have a valid hostname set https://bugzilla.redhat.com/1691617 / project-infrastructure: clang-scan tests are failing nightly. https://bugzilla.redhat.com/1691357 / project-infrastructure: core archive link from regression jobs throw not found error https://bugzilla.redhat.com/1692349 / project-infrastructure: gluster-csi-containers job is failing https://bugzilla.redhat.com/1693385 / project-infrastructure: request to change the version of fedora in fedora-smoke-job https://bugzilla.redhat.com/1693295 / project-infrastructure: rpc.statd not started on builder204.aws.gluster.org https://bugzilla.redhat.com/1691789 / project-infrastructure: rpc-statd service stops on AWS builders https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke fails with "Build root is locked by another process" https://bugzilla.redhat.com/1693184 / replicate: A brick process(glusterfsd) died with 'memory violation' https://bugzilla.redhat.com/1696075 / replicate: Client lookup is unable to heal missing directory GFID entry https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from /tests/bugs/ module failing on Intel https://bugzilla.redhat.com/1694976 / unclassified: On Fedora 29 GlusterFS 4.1 repo has bad/missing rpm signs [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 2926 bytes Desc: not available URL: From cynthia.zhou at nokia-sbell.com Mon Apr 8 02:12:25 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 8 Apr 2019 02:12:25 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 Message-ID: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Hi glusterfs experts, Good day! 
In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 
0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583
#7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659
#8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0
#9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6
(gdb) frame 2
#2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944
944 iobuf.c: No such file or directory.
(gdb) print *iobref
$1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1}
(gdb) quit
A debugging session is active.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From 1173701037 at qq.com Mon Apr 8 06:41:24 2019
From: 1173701037 at qq.com (=?utf-8?B?UFND?=)
Date: Mon, 8 Apr 2019 14:41:24 +0800
Subject: [Gluster-devel] Hello, I have a question about the erasure code translator, hope someone give me some advice, thank you!
Message-ID:

Hi, I am a storage software coder who is interested in Gluster. I am trying to improve its read/write performance.

I noticed that Gluster uses a Vandermonde matrix in the erasure code encoding and decoding process. However, it is quite complicated to generate the inverse matrix of a Vandermonde matrix, which is necessary for decoding. The cost is O(n³).

Using a Cauchy matrix can greatly cut down the cost of finding an inverse matrix, to O(n²).

I used the Intel storage acceleration library (ISA-L) to replace the original EC encode/decode part of Gluster, and it reduced the encode and decode time to about 50% of the original.

However, when I test the whole system, the read/write performance is almost the same as the original Gluster.

I tested it on three machines as servers. Each one has two bricks, both of them SSDs, so the total number of bricks is 6. Two of them are used as coding bricks; that is a 4+2 disperse volume configuration.

The network cards are 10000Mbps, so theoretically they can support read and write speeds above 1000MB/s.

The actual read performance is about 492MB/s.
The actual write performance is about 336MB/s.
The original code reads at 461MB/s and writes at 322MB/s.

Can someone give me some advice on how to improve its performance? Which part is the critical bottleneck for performance, if it's not the EC translator?

I did a time count on the translators. It shows that the EC translator takes just 7% of the whole read/write process, even though I know that some translators run asynchronously, so the real percentage may be somewhat larger than that.

Sincere thanks for your patience in reading my question!
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From jahernan at redhat.com Mon Apr 8 08:02:00 2019
From: jahernan at redhat.com (Xavi Hernandez)
Date: Mon, 8 Apr 2019 10:02:00 +0200
Subject: [Gluster-devel] Hello, I have a question about the erasure code translator, hope someone give me some advice, thank you!
In-Reply-To:
References:
Message-ID:

Hi,

On Mon, Apr 8, 2019 at 8:50 AM PSC <1173701037 at qq.com> wrote:

> Hi, I am a storage software coder who is interested in Gluster. I am
> trying to improve its read/write performance.
>
> I noticed that Gluster uses a Vandermonde matrix in the erasure code
> encoding and decoding process. However, it is quite complicated to
> generate the inverse matrix of a Vandermonde matrix, which is necessary
> for decoding. The cost is O(n³).

That's not true, actually. A Vandermonde matrix can be inverted in O(n^2), as the code currently does (look at ec_method_matrix_inverse() in ec-method.c). Additionally, the current code caches inverted matrices, so in normal circumstances there shouldn't be many inverse computations. Only when something changes (a brick dies or comes online) might a new inverted matrix be needed.

> Using a Cauchy matrix can greatly cut down the cost of finding an inverse
> matrix, to O(n²).
>
> I used the Intel storage acceleration library (ISA-L) to replace the
> original EC encode/decode part of Gluster, and it reduced the encode and
> decode time to about 50% of the original.

How do you test that? I also did some tests long ago and I didn't observe that difference. Doing a raw test of encoding/decoding performance of the current code using Intel AVX2 extensions, it's able to process 7.6 GiB/s on a single core of an Intel Xeon Silver 4114 when the L1 cache is used. Without relying on internal cache, it performs at 3.9 GiB/s. Does ISA-L provide better performance for a matrix of the same size (a 4+2 non-systematic matrix)?

> However, when I test the whole system, the read/write performance is
> almost the same as the original Gluster.

Yes, there are many more things involved in the read and write operations in Gluster. For the particular case of EC, having to deal with many bricks simultaneously (6 in this case) means that it's very sensitive to network latency and communication delays, and this is probably one of the biggest contributors. There are some other small latencies added by other xlators.

> I tested it on three machines as servers. Each one has two bricks, both
> of them SSDs, so the total number of bricks is 6. Two of them are used as
> coding bricks; that is a 4+2 disperse volume configuration.
>
> The network cards are 10000Mbps, so theoretically they can support read
> and write speeds above 1000MB/s.
>
> The actual read performance is about 492MB/s.
> The actual write performance is about 336MB/s.
> The original code reads at 461MB/s and writes at 322MB/s.
>
> Can someone give me some advice on how to improve its performance? Which
> part is the critical bottleneck for performance, if it's not the EC
> translator?
>
> I did a time count on the translators. It shows that the EC translator
> takes just 7% of the whole read/write process, even though I know that
> some translators run asynchronously, so the real percentage may be
> somewhat larger than that.
>
> Sincere thanks for your patience in reading my question!
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel at gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-devel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From srakonde at redhat.com Mon Apr 8 08:57:35 2019 From: srakonde at redhat.com (Sanju Rakonde) Date: Mon, 8 Apr 2019 14:27:35 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: Can you please capture output of "pstack $(pidof glusterd)" and send it to us? We need to capture this information when glusterd is struck. On Mon, Apr 8, 2019 at 8:05 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! > > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () 
from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. > > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", > __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced > = -1, used = -1} > > (gdb) quit > > A debugging session is active. > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 8 09:01:19 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 8 Apr 2019 09:01:19 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: Hi, The env is not there anymore, but I have collected the thread stack trace of glusterd with command ?thread apply all bt? Using host libthread_db library "/lib64/libthread_db.so.1". 
0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from /lib64/libpthread.so.0 Missing separate debuginfos, use: dnf debuginfo-install rcp-pack-glusterfs-1.12.0_1_gc999db1-RCP2.wf29.x86_64 (gdb) thread apply all bt Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 7 (Thread 0x7f9ee49b3700 (LWP 1931)): #0 0x00007f9ee9fd45bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9ee5e651b9 in hooks_worker (args=0x1813000) at glusterd-hooks.c:529 #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 6 (Thread 0x7f9ee692e700 (LWP 1762)): #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808e00) at syncop.c:603 #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808e00) at syncop.c:695 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 5 (Thread 0x7f9ee712f700 (LWP 1761)): ---Type to continue, or q to quit--- #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808a40) at syncop.c:603 #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808a40) at syncop.c:695 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 4 (Thread 0x7f9ee7930700 (LWP 1760)): #0 0x00007f9ee98725d0 in nanosleep () from /lib64/libc.so.6 #1 0x00007f9ee98724aa in sleep () from /lib64/libc.so.6 #2 0x00007f9eeb247fdf in pool_sweeper (arg=0x0) at mem-pool.c:481 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in 
clone () from /lib64/libc.so.6 Thread 3 (Thread 0x7f9ee8131700 (LWP 1759)): #0 0x00007f9ee97e3d7c in sigtimedwait () from /lib64/libc.so.6 #1 0x00007f9ee9fd8bac in sigwait () from /lib64/libpthread.so.0 #2 0x0000000000409ed7 in glusterfs_sigwaiter () #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 2 (Thread 0x7f9ee8932700 (LWP 1758)): #0 0x00007f9ee9fd83b0 in nanosleep () from /lib64/libpthread.so.0 #1 0x00007f9eeb224545 in gf_timer_proc (data=0x1808580) at timer.c:164 #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 1 (Thread 0x7f9eeb707780 (LWP 1757)): #0 0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from /lib64/libpthread.so.0 #1 0x00007f9eeb282b09 in event_dispatch_epoll (event_pool=0x17feb00) at event-epoll.c:746 #2 0x00007f9eeb246786 in event_dispatch (event_pool=0x17feb00) at event.c:124 #3 0x000000000040ab95 in main () (gdb) (gdb) (gdb) q! A syntax error in expression, near `'. (gdb) quit From: Sanju Rakonde Sent: Monday, April 08, 2019 4:58 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Raghavendra Gowdappa ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 Can you please capture output of "pstack $(pidof glusterd)" and send it to us? We need to capture this information when glusterd is struck. On Mon, Apr 8, 2019 at 8:05 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! 
SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} (gdb) quit A debugging session is active. _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From srakonde at redhat.com Tue Apr 9 07:08:16 2019 From: srakonde at redhat.com (Sanju Rakonde) Date: Tue, 9 Apr 2019 12:38:16 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: Hello, I'm unable to figure out the issue by just looking at the backtrace. You might be hitting https://bugzilla.redhat.com/show_bug.cgi?id=1650115. If you come across the same problem in future, please capture pstack output and share it with us. On Mon, Apr 8, 2019 at 2:31 PM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > The env is not there anymore, but I have collected the thread stack trace > of glusterd with command ?thread apply all bt? > > Using host libthread_db library "/lib64/libthread_db.so.1". > > 0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from /lib64/libpthread.so.0 > > Missing separate debuginfos, use: dnf debuginfo-install > rcp-pack-glusterfs-1.12.0_1_gc999db1-RCP2.wf29.x86_64 > > (gdb) thread apply all bt > > > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at 
event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 7 (Thread 0x7f9ee49b3700 (LWP 1931)): > > #0 0x00007f9ee9fd45bc in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #1 0x00007f9ee5e651b9 in hooks_worker (args=0x1813000) at > glusterd-hooks.c:529 > > #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 6 (Thread 0x7f9ee692e700 (LWP 1762)): > > #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808e00) at syncop.c:603 > > #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808e00) at > syncop.c:695 > > #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 5 (Thread 0x7f9ee712f700 (LWP 1761)): > > ---Type to continue, or q to quit--- > > #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808a40) at syncop.c:603 > > #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808a40) at > syncop.c:695 > > #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 4 (Thread 0x7f9ee7930700 (LWP 1760)): > > #0 0x00007f9ee98725d0 in nanosleep () from /lib64/libc.so.6 > > #1 0x00007f9ee98724aa in sleep () from /lib64/libc.so.6 > > #2 0x00007f9eeb247fdf in pool_sweeper (arg=0x0) at mem-pool.c:481 > > #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 3 (Thread 0x7f9ee8131700 (LWP 1759)): > > #0 0x00007f9ee97e3d7c in sigtimedwait () from /lib64/libc.so.6 > > #1 0x00007f9ee9fd8bac in sigwait () from /lib64/libpthread.so.0 > > #2 0x0000000000409ed7 in glusterfs_sigwaiter () > > #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 2 (Thread 0x7f9ee8932700 (LWP 1758)): > > #0 0x00007f9ee9fd83b0 in nanosleep () from /lib64/libpthread.so.0 > > #1 0x00007f9eeb224545 in gf_timer_proc (data=0x1808580) at timer.c:164 > > #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 1 (Thread 0x7f9eeb707780 (LWP 1757)): > > #0 0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from > /lib64/libpthread.so.0 > > #1 0x00007f9eeb282b09 in event_dispatch_epoll (event_pool=0x17feb00) at > event-epoll.c:746 > > #2 0x00007f9eeb246786 in event_dispatch (event_pool=0x17feb00) at > event.c:124 > > #3 0x000000000040ab95 in main () > > (gdb) > > (gdb) > > (gdb) q! > > A syntax error in expression, near `'. > > (gdb) quit > > > > *From:* Sanju Rakonde > *Sent:* Monday, April 08, 2019 4:58 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterd stuck for glusterfs with version > 3.12.15 > > > > Can you please capture output of "pstack $(pidof glusterd)" and send it to > us? We need to capture this information when glusterd is struck. 
> > > > On Mon, Apr 8, 2019 at 8:05 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! > > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) 
at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. > > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@ > \000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, > ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} > > (gdb) quit > > A debugging session is active. > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > -- > > Thanks, > > Sanju > -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Tue Apr 9 07:18:30 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Tue, 9 Apr 2019 07:18:30 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: <04bcfe872c314b7dbf7860dab74f8977@nokia-sbell.com> Hi, Gluster brick multiplex is not enabled in my env. My point is that : socket notify another around of poll in too early actually this round of poll in is not finished, thread 9 has not finished poll in while thread 8 has begun another round of poll in, thread 8 has called iobref_unref and iobref_unref does not release iobref->lock , so thread 9 may block. cynthia From: Sanju Rakonde Sent: Tuesday, April 09, 2019 3:08 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Raghavendra Gowdappa ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 Hello, I'm unable to figure out the issue by just looking at the backtrace. You might be hitting https://bugzilla.redhat.com/show_bug.cgi?id=1650115. If you come across the same problem in future, please capture pstack output and share it with us. On Mon, Apr 8, 2019 at 2:31 PM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, The env is not there anymore, but I have collected the thread stack trace of glusterd with command ?thread apply all bt? Using host libthread_db library "/lib64/libthread_db.so.1". 
0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from /lib64/libpthread.so.0 Missing separate debuginfos, use: dnf debuginfo-install rcp-pack-glusterfs-1.12.0_1_gc999db1-RCP2.wf29.x86_64 (gdb) thread apply all bt Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 7 (Thread 0x7f9ee49b3700 (LWP 1931)): #0 0x00007f9ee9fd45bc in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9ee5e651b9 in hooks_worker (args=0x1813000) at glusterd-hooks.c:529 #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 6 (Thread 0x7f9ee692e700 (LWP 1762)): #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808e00) at syncop.c:603 #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808e00) at syncop.c:695 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 5 (Thread 0x7f9ee712f700 (LWP 1761)): ---Type to continue, or q to quit--- #0 0x00007f9ee9fd497a in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x00007f9eeb25d904 in syncenv_task (proc=0x1808a40) at syncop.c:603 #2 0x00007f9eeb25db9f in syncenv_processor (thdata=0x1808a40) at syncop.c:695 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 4 (Thread 0x7f9ee7930700 (LWP 1760)): #0 0x00007f9ee98725d0 in nanosleep () from /lib64/libc.so.6 #1 0x00007f9ee98724aa in sleep () from /lib64/libc.so.6 #2 0x00007f9eeb247fdf in pool_sweeper (arg=0x0) at mem-pool.c:481 #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in 
clone () from /lib64/libc.so.6 Thread 3 (Thread 0x7f9ee8131700 (LWP 1759)): #0 0x00007f9ee97e3d7c in sigtimedwait () from /lib64/libc.so.6 #1 0x00007f9ee9fd8bac in sigwait () from /lib64/libpthread.so.0 #2 0x0000000000409ed7 in glusterfs_sigwaiter () #3 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #4 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 2 (Thread 0x7f9ee8932700 (LWP 1758)): #0 0x00007f9ee9fd83b0 in nanosleep () from /lib64/libpthread.so.0 #1 0x00007f9eeb224545 in gf_timer_proc (data=0x1808580) at timer.c:164 #2 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #3 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 1 (Thread 0x7f9eeb707780 (LWP 1757)): #0 0x00007f9ee9fcfa3d in __pthread_timedjoin_ex () from /lib64/libpthread.so.0 #1 0x00007f9eeb282b09 in event_dispatch_epoll (event_pool=0x17feb00) at event-epoll.c:746 #2 0x00007f9eeb246786 in event_dispatch (event_pool=0x17feb00) at event.c:124 #3 0x000000000040ab95 in main () (gdb) (gdb) (gdb) q! A syntax error in expression, near `'. (gdb) quit From: Sanju Rakonde > Sent: Monday, April 08, 2019 4:58 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 Can you please capture output of "pstack $(pidof glusterd)" and send it to us? We need to capture this information when glusterd is struck. On Mon, Apr 8, 2019 at 8:05 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! 
SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} (gdb) quit A debugging session is active. _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Thanks, Sanju -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Tue Apr 9 07:52:26 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Tue, 9 Apr 2019 13:22:26 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! 
> > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. 
> > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", > __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced > = -1, used = -1} > looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit > > A debugging session is active. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Tue Apr 9 07:57:15 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Tue, 9 Apr 2019 07:57:15 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. cynthia From: Raghavendra Gowdappa Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! 
SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From spisla80 at gmail.com Wed Apr 10 10:09:23 2019 From: spisla80 at gmail.com (David Spisla) Date: Wed, 10 Apr 2019 12:09:23 +0200 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> Message-ID: Hello Martin, look here: https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/pdf/administration_guide/Red_Hat_Gluster_Storage-3.4-Administration_Guide-en-US.pdf on page 324. There is a manual how to replace a brick in case of a hardware failure Regards David Spisla Am Mi., 10. Apr. 2019 um 11:42 Uhr schrieb Martin Toth : > Hi all, > > I am running replica 3 gluster with 3 bricks. One of my servers failed - > all disks are showing errors and raid is in fault state. > > Type: Replicate > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > Status: Started > Number of Bricks: 1 x 3 = 3 > Transport-type: tcp > Bricks: > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > So one of my bricks is totally failed (node2). It went down and all data > are lost (failed raid on node2). Now I am running only two bricks on 2 > servers out from 3. > This is really critical problem for us, we can lost all data. I want to > add new disks to node2, create new raid array on them and try to replace > failed brick on this node. > > What is the procedure of replacing Brick2 on node2, can someone advice? I > can?t find anything relevant in documentation. > > Thanks in advance, > Martin > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From rkavunga at redhat.com Wed Apr 10 10:16:25 2019 From: rkavunga at redhat.com (RAFI KC) Date: Wed, 10 Apr 2019 15:46:25 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> Message-ID: <3292fe0e-f164-43c0-f922-fa2176158749@redhat.com> reset brick is another way of replacing a brick. this usually helpful, when you want to replace the brick with same name. You can find the documentation here https://docs.gluster.org/en/latest/release-notes/3.9.0/#introducing-reset-brick-command. In your case, I think you can use replace brick. So you can initiate a reset-brick start, then you have to replace your failed disk and create new brick with same name . Once you have healthy disk and brick, you can commit the reset-brick. 
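For reference, the two approaches map to commands roughly like the following; the volume and brick names are only placeholders, and the exact syntax should be checked against the documentation linked above:

# keep the same brick path: take the brick out, rebuild the disk/brick, then commit
gluster volume reset-brick <VOLNAME> <HOST>:<BRICK_PATH> start
gluster volume reset-brick <VOLNAME> <HOST>:<BRICK_PATH> <HOST>:<BRICK_PATH> commit force

# or replace the failed brick with one that has a different path
gluster volume replace-brick <VOLNAME> <HOST>:<OLD_BRICK_PATH> <HOST>:<NEW_BRICK_PATH> commit force

# afterwards, confirm the brick is online and watch the self-heal progress
gluster volume status <VOLNAME>
gluster volume heal <VOLNAME> info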
Let's know if you have any question, Rafi KC On 4/10/19 3:39 PM, David Spisla wrote: > Hello Martin, > > look here: > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.4/pdf/administration_guide/Red_Hat_Gluster_Storage-3.4-Administration_Guide-en-US.pdf > on page 324. There is a manual how to replace a brick in case of a > hardware failure > > Regards > David Spisla > > Am Mi., 10. Apr. 2019 um 11:42?Uhr schrieb Martin Toth > >: > > Hi all, > > I am running replica 3 gluster with 3 bricks. One of my servers > failed - all disks are showing errors and raid is in fault state. > > Type: Replicate > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > Status: Started > Number of Bricks: 1 x 3 = 3 > Transport-type: tcp > Bricks: > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 is down > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > So one of my bricks is totally failed (node2). It went down and > all data are lost (failed raid on node2). Now I am running only > two bricks on 2 servers out from 3. > This is really critical problem for us, we can lost all data. I > want to add new disks to node2, create new raid array on them and > try to replace failed brick on this node. > > What is the procedure of replacing Brick2 on node2, can someone > advice? I can?t find anything relevant in documentation. > > Thanks in advance, > Martin > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Wed Apr 10 10:20:40 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Wed, 10 Apr 2019 15:50:40 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> Message-ID: Hi Martin, After you add the new disks and creating raid array, you can run the following command to replace the old brick with new one: - If you are going to use a different name to the new brick you can run gluster volume replace-brick commit force - If you are planning to use the same name for the new brick as well then you can use gluster volume reset-brick commit force Here old-brick & new-brick's hostname & path should be same. After replacing the brick, make sure the brick comes online using volume status. Heal should automatically start, you can check the heal status to see all the files gets replicated to the newly added brick. If it does not start automatically, you can manually start that by running gluster volume heal . HTH, Karthik On Wed, Apr 10, 2019 at 3:13 PM Martin Toth wrote: > Hi all, > > I am running replica 3 gluster with 3 bricks. One of my servers failed - > all disks are showing errors and raid is in fault state. > > Type: Replicate > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > Status: Started > Number of Bricks: 1 x 3 = 3 > Transport-type: tcp > Bricks: > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > So one of my bricks is totally failed (node2). 
It went down and all data > are lost (failed raid on node2). Now I am running only two bricks on 2 > servers out from 3. > This is really critical problem for us, we can lost all data. I want to > add new disks to node2, create new raid array on them and try to replace > failed brick on this node. > > What is the procedure of replacing Brick2 on node2, can someone advice? I > can?t find anything relevant in documentation. > > Thanks in advance, > Martin > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Wed Apr 10 11:37:52 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Wed, 10 Apr 2019 17:07:52 +0530 Subject: [Gluster-devel] SHD crash in https://build.gluster.org/job/centos7-regression/5510/consoleFull Message-ID: Hi, My patch is unlikely to have caused this as the changes are only in dht. Can someone take a look? Thanks, Nithya -------------- next part -------------- An HTML attachment was scrubbed... URL: From atin.mukherjee83 at gmail.com Wed Apr 10 12:17:16 2019 From: atin.mukherjee83 at gmail.com (Atin Mukherjee) Date: Wed, 10 Apr 2019 17:47:16 +0530 Subject: [Gluster-devel] SHD crash in https://build.gluster.org/job/centos7-regression/5510/consoleFull In-Reply-To: References: Message-ID: Rafi mentioned to me earlier that this will be fixed through https://review.gluster.org/22468 . This crash is more often seen in the nightly regression these days. Patch needs review and I'd request the respective maintainers to take a look at it. On Wed, Apr 10, 2019 at 5:08 PM Nithya Balachandran wrote: > Hi, > > My patch is unlikely to have caused this as the changes are only in dht. > Can someone take a look? > > Thanks, > Nithya > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Wed Apr 10 12:26:36 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Wed, 10 Apr 2019 17:56:36 +0530 Subject: [Gluster-devel] [Gluster-users] [External] Replica 3 - how to replace failed node (peer) In-Reply-To: <804C3826-0173-431C-A286-085E7E582212@gmail.com> References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> <804C3826-0173-431C-A286-085E7E582212@gmail.com> Message-ID: Hi Martin, The reset-brick command is introduced in 3.9.0 and not present in 3.7.6. You can try using the same replace-brick command with the force option even if you want to use the same name for the brick being replaced. 3.7.6 is EOLed long back and glusterfs-6 is the latest version with lots of improvements, bug fixes and new features. The release schedule can be found at [1]. Upgrading to one of the maintained branch is highly recommended. On Wed, Apr 10, 2019 at 4:14 PM Martin Toth wrote: > I?ve read this documentation but step 4 is really unclear to me. I don?t > understand related mkdir/rmdir/setfattr and so on. > > Step 4: > > *Using the gluster volume fuse mount (In this example: /mnt/r2) set up > metadata so that data will be synced to new brick (In this case it is > from Server1:/home/gfs/r2_1 to Server1:/home/gfs/r2_5)* > > Why should I change trusted.non-existent-key on this volume? 
> It is even more confusing because other mentioned howtos does not contain > this step at all. > Those steps were needed in the older releases to set some metadata on the good bricks so that heal should not happen from the replaced brick to good bricks, which can lead to data loss. Since you are on 3.7.6, we have automated all these steps for you in that branch. You just need to run the replace-brick command, which will take care of all those things. [1] https://www.gluster.org/release-schedule/ Regards, Karthik > > BR, > Martin > > On 10 Apr 2019, at 11:54, Davide Obbi wrote: > > > https://docs.gluster.org/en/v3/Administrator%20Guide/Managing%20Volumes/#replace-faulty-brick > > On Wed, Apr 10, 2019 at 11:42 AM Martin Toth wrote: > >> Hi all, >> >> I am running replica 3 gluster with 3 bricks. One of my servers failed - >> all disks are showing errors and raid is in fault state. >> >> Type: Replicate >> Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a >> Status: Started >> Number of Bricks: 1 x 3 = 3 >> Transport-type: tcp >> Bricks: >> Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 >> Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 >> >> So one of my bricks is totally failed (node2). It went down and all data >> are lost (failed raid on node2). Now I am running only two bricks on 2 >> servers out from 3. >> This is really critical problem for us, we can lost all data. I want to >> add new disks to node2, create new raid array on them and try to replace >> failed brick on this node. >> >> What is the procedure of replacing Brick2 on node2, can someone advice? I >> can?t find anything relevant in documentation. >> >> Thanks in advance, >> Martin >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > Davide Obbi > Senior System Administrator > > Booking.com B.V. > Vijzelstraat 66-80 Amsterdam 1017HL Netherlands > Direct +31207031558 > [image: Booking.com] > Empowering people to experience the world since 1996 > 43 languages, 214+ offices worldwide, 141,000+ global destinations, 29 > million reported listings > Subsidiary of Booking Holdings Inc. (NASDAQ: BKNG) > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Wed Apr 10 13:59:28 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 10 Apr 2019 19:29:28 +0530 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: And now for last 15 days: https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 ./tests/bitrot/bug-1373520.t 18 ==> Fixed through https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this failing in brick mux post 5th April ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, needs analysis. ./tests/basic/uss.t 15 ==> happens in both brick mux and non brick mux runs, test just simply times out. Needs urgent analysis. ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through https://review.gluster.org/#/c/22508/ , patch merged today. ./tests/basic/volfile-sanity.t 8 ==> Some race, though this succeeds in second attempt every time. There're plenty more with 5 instances of failure from many tests. 
We need all maintainers/owners to look through these failures and fix them, we certainly don't want to get into a stage where master is unstable and we have to lock down the merges till all these failures are resolved. So please help. (Please note fstat stats show up the retries as failures too which in a way is right) On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee wrote: > [1] captures the test failures report since last 30 days and we'd need > volunteers/component owners to see why the number of failures are so high > against few tests. > > [1] > https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Wed Apr 10 14:57:04 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Wed, 10 Apr 2019 20:27:04 +0530 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: Thanks for the summary Atin. On Wed, Apr 10, 2019 at 7:30 PM Atin Mukherjee wrote: > And now for last 15 days: > > https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 > > ./tests/bitrot/bug-1373520.t 18 ==> Fixed through > https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this > failing in brick mux post 5th April > ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, needs > analysis. > ./tests/basic/uss.t 15 ==> happens in both brick mux and non > brick mux runs, test just simply times out. Needs urgent analysis. > ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through > https://review.gluster.org/#/c/22508/ , patch merged today. > ./tests/basic/volfile-sanity.t 8 ==> Some race, though this succeeds > in second attempt every time. > > Can volfile-sanity.t be failing because of the 'hang' in uss.t ? It is possible as volfile-sanity.t runs after uss.t in regressions. I checked volfile-sanity.t, but it has 'cleanup' at the beginning, but not sure if there are any lingering things which caused these failures. > There're plenty more with 5 instances of failure from many tests. We need > all maintainers/owners to look through these failures and fix them, we > certainly don't want to get into a stage where master is unstable and we > have to lock down the merges till all these failures are resolved. So > please help. > > (Please note fstat stats show up the retries as failures too which in a > way is right) > > > On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee > wrote: > >> [1] captures the test failures report since last 30 days and we'd need >> volunteers/component owners to see why the number of failures are so high >> against few tests. >> >> [1] >> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >> > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jahernan at redhat.com Wed Apr 10 17:25:06 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Wed, 10 Apr 2019 19:25:06 +0200 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Wed, Apr 10, 2019 at 4:01 PM Atin Mukherjee wrote: > And now for last 15 days: > > https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 > > ./tests/bitrot/bug-1373520.t 18 ==> Fixed through > https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this > failing in brick mux post 5th April > ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, needs > analysis. > I've identified the problem here, but not the cause yet. There's a stale inodelk acquired by a process that is already dead, which causes inodelk requests from self-heal and other processes to block. The reason why it seemed to block in random places is that all commands are executed with the working directory pointing to a gluster directory which needs healing after the initial tests. Because of the stale inodelk, when any application tries to open a file in the working directory, it's blocked. I'll investigate what causes this. Xavi ./tests/basic/uss.t 15 ==> happens in both brick mux and non > brick mux runs, test just simply times out. Needs urgent analysis. > ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through > https://review.gluster.org/#/c/22508/ , patch merged today. > ./tests/basic/volfile-sanity.t 8 ==> Some race, though this succeeds > in second attempt every time. > > There're plenty more with 5 instances of failure from many tests. We need > all maintainers/owners to look through these failures and fix them, we > certainly don't want to get into a stage where master is unstable and we > have to lock down the merges till all these failures are resolved. So > please help. > > (Please note fstat stats show up the retries as failures too which in a > way is right) > > > On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee > wrote: > >> [1] captures the test failures report since last 30 days and we'd need >> volunteers/component owners to see why the number of failures are so high >> against few tests. >> >> [1] >> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >> > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From rabhat at redhat.com Wed Apr 10 20:07:50 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Wed, 10 Apr 2019 16:07:50 -0400 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Wed, Apr 10, 2019 at 9:59 AM Atin Mukherjee wrote: > And now for last 15 days: > > https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 > > ./tests/bitrot/bug-1373520.t 18 ==> Fixed through > https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this > failing in brick mux post 5th April > The above patch has been sent to fix the failure with brick mux enabled. > ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, needs > analysis. > ./tests/basic/uss.t 15 ==> happens in both brick mux and non > brick mux runs, test just simply times out. Needs urgent analysis. > Nothing has changed in snapview-server and snapview-client recently. Looking into it. 
./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through > https://review.gluster.org/#/c/22508/ , patch merged today. > ./tests/basic/volfile-sanity.t 8 ==> Some race, though this succeeds > in second attempt every time. > > There're plenty more with 5 instances of failure from many tests. We need > all maintainers/owners to look through these failures and fix them, we > certainly don't want to get into a stage where master is unstable and we > have to lock down the merges till all these failures are resolved. So > please help. > > (Please note fstat stats show up the retries as failures too which in a > way is right) > > > On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee > wrote: > >> [1] captures the test failures report since last 30 days and we'd need >> volunteers/component owners to see why the number of failures are so high >> against few tests. >> >> [1] >> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >> > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 04:34:23 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 10:04:23 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: References: Message-ID: Hi Strahil, Thank you for sharing your experience with reset-brick option. Since he is using the gluster version 3.7.6, we do not have the reset-brick [1] option implemented there. It is introduced in 3.9.0. He has to go with replace-brick with the force option if he wants to use the same path & name for the new brick. Yes, it is recommended to have the new brick to be of the same size as that of the other bricks. [1] https://docs.gluster.org/en/latest/release-notes/3.9.0/#introducing-reset-brick-command Regards, Karthik On Wed, Apr 10, 2019 at 10:31 PM Strahil wrote: > I have used reset-brick - but I have just changed the brick layout. > You may give it a try, but I guess you need your new brick to have same > amount of space (or more). > > Maybe someone more experienced should share a more sound solution. > > Best Regards, > Strahil NikolovOn Apr 10, 2019 12:42, Martin Toth > wrote: > > > > Hi all, > > > > I am running replica 3 gluster with 3 bricks. One of my servers failed - > all disks are showing errors and raid is in fault state. > > > > Type: Replicate > > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > > Status: Started > > Number of Bricks: 1 x 3 = 3 > > Transport-type: tcp > > Bricks: > > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 down > > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > > > So one of my bricks is totally failed (node2). It went down and all data > are lost (failed raid on node2). Now I am running only two bricks on 2 > servers out from 3. > > This is really critical problem for us, we can lost all data. I want to > add new disks to node2, create new raid array on them and try to replace > failed brick on this node. > > > > What is the procedure of replacing Brick2 on node2, can someone advice? > I can?t find anything relevant in documentation. 
> > > > Thanks in advance, > > Martin > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 04:53:37 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 10:23:37 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: References: Message-ID: Hi Strahil, Can you give us some more insights on - the volume configuration you were using? - why you wanted to replace your brick? - which brick(s) you tried replacing? - what problem(s) did you face? Regards, Karthik On Thu, Apr 11, 2019 at 10:14 AM Strahil wrote: > Hi Karthnik, > I used only once the brick replace function when I wanted to change my > Arbiter (v3.12.15 in oVirt 4.2.7) and it was a complete disaster. > Most probably I should have stopped the source arbiter before doing that, > but the docs didn't mention it. > > Thus I always use reset-brick, as it never let me down. > > Best Regards, > Strahil Nikolov > On Apr 11, 2019 07:34, Karthik Subrahmanya wrote: > > Hi Strahil, > > Thank you for sharing your experience with reset-brick option. > Since he is using the gluster version 3.7.6, we do not have the > reset-brick [1] option implemented there. It is introduced in 3.9.0. He has > to go with replace-brick with the force option if he wants to use the same > path & name for the new brick. > Yes, it is recommended to have the new brick to be of the same size as > that of the other bricks. > > [1] > https://docs.gluster.org/en/latest/release-notes/3.9.0/#introducing-reset-brick-command > > Regards, > Karthik > > On Wed, Apr 10, 2019 at 10:31 PM Strahil wrote: > > I have used reset-brick - but I have just changed the brick layout. > You may give it a try, but I guess you need your new brick to have same > amount of space (or more). > > Maybe someone more experienced should share a more sound solution. > > Best Regards, > Strahil NikolovOn Apr 10, 2019 12:42, Martin Toth > wrote: > > > > Hi all, > > > > I am running replica 3 gluster with 3 bricks. One of my servers failed - > all disks are showing errors and raid is in fault state. > > > > Type: Replicate > > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > > Status: Started > > Number of Bricks: 1 x 3 = 3 > > Transport-type: tcp > > Bricks: > > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 down > > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > > > So one of my bricks is totally failed (node2). It went down and all data > are lost (failed raid on node2). Now I am running only two bricks on 2 > servers out from 3. > > This is really critical problem for us, we can lost all data. I want to > add new disks to node2, create new raid array on them and try to replace > failed brick on this node. > > > > What is the procedure of replacing Brick2 on node2, can someone advice? > I can?t find anything relevant in documentation. 
> > > > Thanks in advance, > > Martin > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 04:55:37 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 10:25:37 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: References: Message-ID: On Thu, Apr 11, 2019 at 10:23 AM Karthik Subrahmanya wrote: > Hi Strahil, > > Can you give us some more insights on > - the volume configuration you were using? > - why you wanted to replace your brick? > - which brick(s) you tried replacing? > - if you remember the commands/steps that you followed, please give that as well. > - what problem(s) did you face? > > Regards, > Karthik > > On Thu, Apr 11, 2019 at 10:14 AM Strahil wrote: > >> Hi Karthnik, >> I used only once the brick replace function when I wanted to change my >> Arbiter (v3.12.15 in oVirt 4.2.7) and it was a complete disaster. >> Most probably I should have stopped the source arbiter before doing that, >> but the docs didn't mention it. >> >> Thus I always use reset-brick, as it never let me down. >> >> Best Regards, >> Strahil Nikolov >> On Apr 11, 2019 07:34, Karthik Subrahmanya wrote: >> >> Hi Strahil, >> >> Thank you for sharing your experience with reset-brick option. >> Since he is using the gluster version 3.7.6, we do not have the >> reset-brick [1] option implemented there. It is introduced in 3.9.0. He has >> to go with replace-brick with the force option if he wants to use the same >> path & name for the new brick. >> Yes, it is recommended to have the new brick to be of the same size as >> that of the other bricks. >> >> [1] >> https://docs.gluster.org/en/latest/release-notes/3.9.0/#introducing-reset-brick-command >> >> Regards, >> Karthik >> >> On Wed, Apr 10, 2019 at 10:31 PM Strahil wrote: >> >> I have used reset-brick - but I have just changed the brick layout. >> You may give it a try, but I guess you need your new brick to have same >> amount of space (or more). >> >> Maybe someone more experienced should share a more sound solution. >> >> Best Regards, >> Strahil NikolovOn Apr 10, 2019 12:42, Martin Toth >> wrote: >> > >> > Hi all, >> > >> > I am running replica 3 gluster with 3 bricks. One of my servers failed >> - all disks are showing errors and raid is in fault state. >> > >> > Type: Replicate >> > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a >> > Status: Started >> > Number of Bricks: 1 x 3 = 3 >> > Transport-type: tcp >> > Bricks: >> > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 >> > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 > down >> > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 >> > >> > So one of my bricks is totally failed (node2). It went down and all >> data are lost (failed raid on node2). Now I am running only two bricks on 2 >> servers out from 3. >> > This is really critical problem for us, we can lost all data. I want to >> add new disks to node2, create new raid array on them and try to replace >> failed brick on this node. >> > >> > What is the procedure of replacing Brick2 on node2, can someone advice? 
>> I can?t find anything relevant in documentation. >> > >> > Thanks in advance, >> > Martin >> > _______________________________________________ >> > Gluster-users mailing list >> > Gluster-users at gluster.org >> > https://lists.gluster.org/mailman/listinfo/gluster-users >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Thu Apr 11 07:15:45 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 11 Apr 2019 12:45:45 +0530 Subject: [Gluster-devel] Unplanned Jenkins restart Message-ID: Hello, I had to do an unplanned Jenkins restart. Jenkins was not responding to any of the requests and was not giving back the regression votes. I did update the vote verified values of regression jobs which seemed to change to 0 all of a sudden and was not giving back the vote. I'm investigating more on the root cause. I'll update on the bug[1] about the root cause. Centos regression jobs may have ended up canceled. Please retry them. [1] https://bugzilla.redhat.com/show_bug.cgi?id=1698716 -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Thu Apr 11 08:54:14 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Thu, 11 Apr 2019 14:24:14 +0530 Subject: [Gluster-devel] [Gluster-users] Proposal: Changes in Gluster Community meetings In-Reply-To: References: <62104B6F-99CF-4C22-80FC-9C177F73E897@onholyground.com> Message-ID: Hi All, Below is the final details of our community meeting, and I will be sending invites to mailing list following this email. You can add Gluster Community Calendar so you can get notifications on the meetings. We are starting the meetings from next week. For the first meeting, we need 1 volunteer from users to discuss the use case / what went well, and what went bad, etc. preferrably in APAC region. NA/EMEA region, next week. Draft Content: https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g ---- Gluster Community Meeting Previous Meeting minutes: - http://github.com/gluster/community Date/Time: Check the community calendar Bridge - APAC friendly hours - Bridge: https://bluejeans.com/836554017 - NA/EMEA - Bridge: https://bluejeans.com/486278655 ------------------------------ Attendance - Name, Company Host - Who will host next meeting? - Host will need to send out the agenda 24hr - 12hrs in advance to mailing list, and also make sure to send the meeting minutes. - Host will need to reach out to one user at least who can talk about their usecase, their experience, and their needs. - Host needs to send meeting minutes as PR to http://github.com/gluster/community User stories - Discuss 1 usecase from a user. - How was the architecture derived, what volume type used, options, etc? - What were the major issues faced ? How to improve them? - What worked good? - How can we all collaborate well, so it is win-win for the community and the user? How can we Community - Any release updates? - Blocker issues across the project? - Metrics - Number of new bugs since previous meeting. How many are not triaged? - Number of emails, anything unanswered? Conferences / Meetups - Any conference in next 1 month where gluster-developers are going? gluster-users are going? So we can meet and discuss. Developer focus - Any design specs to discuss? - Metrics of the week? 
- Coverity - Clang-Scan - Number of patches from new developers. - Did we increase test coverage? - [Atin] Also talk about most frequent test failures in the CI and carve out an AI to get them fixed. RoundTable - ---- Regards, Amar On Mon, Mar 25, 2019 at 8:53 PM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > Thanks for the feedback Darrell, > > The new proposal is to have one in North America 'morning' time. (10AM > PST), And another in ASIA day time, which is evening 7pm/6pm in Australia, > 9pm Newzealand, 5pm Tokyo, 4pm Beijing. > > For example, if we choose Every other Tuesday for meeting, and 1st of the > month is Tuesday, we would have North America time for 1st, and on 15th it > would be ASIA/Pacific time. > > Hopefully, this way, we can cover all the timezones, and meeting minutes > would be committed to github repo, so that way, it will be easier for > everyone to be aware of what is happening. > > Regards, > Amar > > On Mon, Mar 25, 2019 at 8:40 PM Darrell Budic > wrote: > >> As a user, I?d like to visit more of these, but the time slot is my 3AM. >> Any possibility for a rolling schedule (move meeting +6 hours each week >> with rolling attendance from maintainers?) or an occasional regional >> meeting 12 hours opposed to the one you?re proposing? >> >> -Darrell >> >> On Mar 25, 2019, at 4:25 AM, Amar Tumballi Suryanarayan < >> atumball at redhat.com> wrote: >> >> All, >> >> We currently have 3 meetings which are public: >> >> 1. Maintainer's Meeting >> >> - Runs once in 2 weeks (on Mondays), and current attendance is around 3-5 >> on an avg, and not much is discussed. >> - Without majority attendance, we can't take any decisions too. >> >> 2. Community meeting >> >> - Supposed to happen on #gluster-meeting, every 2 weeks, and is the only >> meeting which is for 'Community/Users'. Others are for developers as of >> now. >> Sadly attendance is getting closer to 0 in recent times. >> >> 3. GCS meeting >> >> - We started it as an effort inside Red Hat gluster team, and opened it >> up for community from Jan 2019, but the attendance was always from RHT >> members, and haven't seen any traction from wider group. >> >> So, I have a proposal to call out for cancelling all these meeting, and >> keeping just 1 weekly 'Community' meeting, where even topics related to >> maintainers and GCS and other projects can be discussed. >> >> I have a template of a draft template @ >> https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g >> >> Please feel free to suggest improvements, both in agenda and in timings. >> So, we can have more participation from members of community, which allows >> more user - developer interactions, and hence quality of project. >> >> Waiting for feedbacks, >> >> Regards, >> Amar >> >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> > > -- > Amar Tumballi (amarts) > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From amarts at gmail.com Thu Apr 11 08:56:48 2019 From: amarts at gmail.com (amarts at gmail.com) Date: Thu, 11 Apr 2019 08:56:48 +0000 Subject: [Gluster-devel] Invitation: Gluster Community Meeting (APAC friendly hours) @ Tue Apr 16, 2019 11:30am - 12:30pm (IST) (gluster-devel@gluster.org) Message-ID: <000000000000cc647605863d5d8c@google.com> You have been invited to the following event. 
Title: Gluster Community Meeting (APAC friendly hours) Bridge: https://bluejeans.com/836554017 Meeting minutes: https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g?both Previous Meeting notes: http://github.com/gluster/community When: Tue Apr 16, 2019 11:30am ? 12:30pm India Standard Time - Kolkata Where: https://bluejeans.com/836554017 Calendar: gluster-devel at gluster.org Who: * amarts at gmail.com - creator * gluster-users at gluster.org * maintainers at gluster.org * gluster-devel at gluster.org Event details: https://www.google.com/calendar/event?action=VIEW&eid=MjU2dWllNDQyM2tqaGs0ZjhidGl2YmdtM2YgZ2x1c3Rlci1kZXZlbEBnbHVzdGVyLm9yZw&tok=NTIjdmViajVibDBrbnNiOWQwY205ZWg5cGJsaTRAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbTE4ODM2ZDY3Mzk4MjRjNDc2OWE3NmEyMTY0ODEwMDg0ODI5ODNlZmY&ctz=Asia%2FKolkata&hl=en&es=0 Invitation from Google Calendar: https://www.google.com/calendar/ You are receiving this courtesy email at the account gluster-devel at gluster.org because you are an attendee of this event. To stop receiving future updates for this event, decline this event. Alternatively you can sign up for a Google account at https://www.google.com/calendar/ and control your notification settings for your entire calendar. Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP. Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 1821 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 1861 bytes Desc: not available URL: From amarts at gmail.com Thu Apr 11 08:57:51 2019 From: amarts at gmail.com (amarts at gmail.com) Date: Thu, 11 Apr 2019 08:57:51 +0000 Subject: [Gluster-devel] Invitation: Gluster Community Meeting (NA/EMEA friendly hours) @ Tue Apr 23, 2019 10:30pm - 11:30pm (IST) (gluster-devel@gluster.org) Message-ID: <000000000000913c6405863d6169@google.com> You have been invited to the following event. Title: Gluster Community Meeting (NA/EMEA friendly hours) Bridge: https://bluejeans.com/486278655 Meeting minutes: https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g?both Previous Meeting notes: http://github.com/gluster/community When: Tue Apr 23, 2019 10:30pm ? 11:30pm India Standard Time - Kolkata Where: https://bluejeans.com/486278655 Calendar: gluster-devel at gluster.org Who: * amarts at gmail.com - creator * gluster-users at gluster.org * maintainers at gluster.org * gluster-devel at gluster.org Event details: https://www.google.com/calendar/event?action=VIEW&eid=N3Y1NWZkZTkxNWQzc3QxcHR2OHJnNm4zNzYgZ2x1c3Rlci1kZXZlbEBnbHVzdGVyLm9yZw&tok=NTIjdmViajVibDBrbnNiOWQwY205ZWg5cGJsaTRAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbWYwYzdiMTk0ODRhYWY1MTBmNjU4NmQ0MGM2M2M1MWU3ZDg0ZDQzYzI&ctz=Asia%2FKolkata&hl=en&es=0 Invitation from Google Calendar: https://www.google.com/calendar/ You are receiving this courtesy email at the account gluster-devel at gluster.org because you are an attendee of this event. To stop receiving future updates for this event, decline this event. Alternatively you can sign up for a Google account at https://www.google.com/calendar/ and control your notification settings for your entire calendar. 
Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP. Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 1828 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 1868 bytes Desc: not available URL: From jahernan at redhat.com Thu Apr 11 09:28:48 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Thu, 11 Apr 2019 11:28:48 +0200 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Wed, Apr 10, 2019 at 7:25 PM Xavi Hernandez wrote: > On Wed, Apr 10, 2019 at 4:01 PM Atin Mukherjee > wrote: > >> And now for last 15 days: >> >> >> https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 >> >> ./tests/bitrot/bug-1373520.t 18 ==> Fixed through >> https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this >> failing in brick mux post 5th April >> ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, >> needs analysis. >> > > I've identified the problem here, but not the cause yet. There's a stale > inodelk acquired by a process that is already dead, which causes inodelk > requests from self-heal and other processes to block. > > The reason why it seemed to block in random places is that all commands > are executed with the working directory pointing to a gluster directory > which needs healing after the initial tests. Because of the stale inodelk, > when any application tries to open a file in the working directory, it's > blocked. > > I'll investigate what causes this. > I think I've found the problem. 
This is a fragment of the brick log that includes script steps, connections and disconnections of brick 0, and lock requests to the problematic lock: [2019-04-11 08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ [2019-04-11 08:22:23.709655] I [MSGID: 115029] [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client from CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2 (version: 7dev) with subvol /d/backends/patchy1 [2019-04-11 08:22:23.792204] I [common.c:234:pl_trace_in] 8-patchy-locks: [REQUEST] Locker = {Pid=29710, lk-owner=68580998b47f0000, Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: patchy-disperse-0, start=0, len=0, pid=0} [2019-04-11 08:22:23.792299] I [common.c:285:pl_trace_out] 8-patchy-locks: [GRANTED] Locker = {Pid=29710, lk-owner=68580998b47f0000, Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: patchy-disperse-0, start=0, len=0, pid=0} [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 68 5 online_brick_count ++++++++++ [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 71 ec_test_make ++++++++++ [2019-04-11 08:22:27.718963] I [MSGID: 115029] [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client from CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3 (version: 7dev) with subvol /d/backends/patchy1 [2019-04-11 08:22:27.801416] I [common.c:234:pl_trace_in] 8-patchy-locks: [REQUEST] Locker = {Pid=29885, lk-owner=68580998b47f0000, Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: patchy-disperse-0, start=0, len=0, pid=0} [2019-04-11 08:22:27.801434] E [inodelk.c:513:__inode_unlock_lock] 8-patchy-locks: Matching lock not found for unlock 0-9223372036854775807, by 68580998b47f0000 on 0x7f0ed0029190 [2019-04-11 08:22:27.801446] I [common.c:285:pl_trace_out] 8-patchy-locks: [Invalid argument] Locker = {Pid=29885, lk-owner=68580998b47f0000, Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: patchy-disperse-0, start=0, len=0, pid=0} This is a fragment of the client log: [2019-04-11 08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ 
[2019-04-11 08:22:20.675938] I [MSGID: 114018] [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from patchy-client-1. Client process will keep trying to connect to glusterd until brick's port is available [2019-04-11 08:22:21.674772] W [MSGID: 122035] [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation with some subvolumes unavailable. (6). FOP : 'INODELK' failed on '/test' with gfid 35743386-b7c2-41c9-aafd-6b13de216704 [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ [2019-04-11 08:22:23.691171] W [MSGID: 122035] [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation with some subvolumes unavailable. (8). FOP : 'INODELK' failed on '/test' with gfid 35743386-b7c2-41c9-aafd-6b13de216704 [2019-04-11 08:22:23.710420] I [MSGID: 114046] [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected to patchy-client-1, attached to remote volume '/d/backends/patchy1'. [2019-04-11 08:22:23.791635] W [MSGID: 122035] [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' with gfid 35743386-b7c2-41c9-aafd-6b13de216704 [2019-04-11 08:22:24.460529] I [MSGID: 114018] [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from patchy-client-1. Client process will keep trying to connect to glusterd until brick's port is available [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 68 5 online_brick_count ++++++++++ [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 71 ec_test_make ++++++++++ [2019-04-11 08:22:27.719299] I [MSGID: 114046] [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected to patchy-client-1, attached to remote volume '/d/backends/patchy1'. [2019-04-11 08:22:27.840342] W [MSGID: 122035] [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' with gfid 35743386-b7c2-41c9-aafd-6b13de216704 The problem happens for two things: 1. Brick 0 gets disconnected randomly (apparently), but the server side is not aware of these disconnections. This causes that at 08:22:24.460529, the client has already sent a successful INODELK request to brick 0. At this point the connection is broken on the client side, but server side doesn't get any notification, so it doesn't clear the locks. 2. When client reconnects at 08:22:27.719299, a new connection is created, and the servers does see this new connection (it creates a new client_t structure). Then the client sends the unlock request, which fails on brick 0 because locks xlators checks if the client is the same by comparing the pointers, but they are different because of the reconnection. So the lock is not unlocked and remains there, blocking all future inodelk requests. The first problem is why the client gets disconnected and the server doesn't get any notification. The script is stopping bricks 2 and 3 when this happens. Brick 0 shouldn't fail here. 
It seems related to the The second problem is that when we receive a new connection from a client we already consider connected, we don't cleanup the old connection, which should take care of the stale locks. The third problem is that locks xlator is checking if the client is the same by comparing pointers of client_t structs instead of comparing client_uid field, which remains the same. Adding +Raghavendra Gowdappa , +Pranith Kumar Karampuri , +Krutika Dhananjay , +Shyam Ranganathan and +Amar Tumballi to help me identify why this is happening and what's the best way to solve it. Xavi > Xavi > > ./tests/basic/uss.t 15 ==> happens in both brick mux and non >> brick mux runs, test just simply times out. Needs urgent analysis. >> ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through >> https://review.gluster.org/#/c/22508/ , patch merged today. >> ./tests/basic/volfile-sanity.t 8 ==> Some race, though this >> succeeds in second attempt every time. >> >> There're plenty more with 5 instances of failure from many tests. We need >> all maintainers/owners to look through these failures and fix them, we >> certainly don't want to get into a stage where master is unstable and we >> have to lock down the merges till all these failures are resolved. So >> please help. >> >> (Please note fstat stats show up the retries as failures too which in a >> way is right) >> >> >> On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee >> wrote: >> >>> [1] captures the test failures report since last 30 days and we'd need >>> volunteers/component owners to see why the number of failures are so high >>> against few tests. >>> >>> [1] >>> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >>> >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jahernan at redhat.com Thu Apr 11 09:40:27 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Thu, 11 Apr 2019 11:40:27 +0200 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Thu, Apr 11, 2019 at 11:28 AM Xavi Hernandez wrote: > On Wed, Apr 10, 2019 at 7:25 PM Xavi Hernandez > wrote: > >> On Wed, Apr 10, 2019 at 4:01 PM Atin Mukherjee >> wrote: >> >>> And now for last 15 days: >>> >>> >>> https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 >>> >>> ./tests/bitrot/bug-1373520.t 18 ==> Fixed through >>> https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this >>> failing in brick mux post 5th April >>> ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, >>> needs analysis. >>> >> >> I've identified the problem here, but not the cause yet. There's a stale >> inodelk acquired by a process that is already dead, which causes inodelk >> requests from self-heal and other processes to block. >> >> The reason why it seemed to block in random places is that all commands >> are executed with the working directory pointing to a gluster directory >> which needs healing after the initial tests. Because of the stale inodelk, >> when any application tries to open a file in the working directory, it's >> blocked. >> >> I'll investigate what causes this. >> > > I think I've found the problem. 
This is a fragment of the brick log that > includes script steps, connections and disconnections of brick 0, and lock > requests to the problematic lock: > > [2019-04-11 08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ > [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ > [2019-04-11 08:22:23.709655] I [MSGID: 115029] > [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client > from > CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2 > (version: 7dev) with subvol /d/backends/patchy1 > [2019-04-11 08:22:23.792204] I [common.c:234:pl_trace_in] 8-patchy-locks: > [REQUEST] Locker = {Pid=29710, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, > Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:23.792299] I [common.c:285:pl_trace_out] 8-patchy-locks: > [GRANTED] Locker = {Pid=29710, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, > Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 68 5 online_brick_count ++++++++++ > [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o > 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ > [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 71 ec_test_make ++++++++++ > [2019-04-11 08:22:27.718963] I [MSGID: 115029] > [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client > from > CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3 > (version: 7dev) with subvol /d/backends/patchy1 > [2019-04-11 08:22:27.801416] I [common.c:234:pl_trace_in] 8-patchy-locks: > [REQUEST] Locker = {Pid=29885, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, > Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:27.801434] E [inodelk.c:513:__inode_unlock_lock] > 8-patchy-locks: Matching lock not found for unlock 0-9223372036854775807, > by 68580998b47f0000 on 0x7f0ed0029190 > [2019-04-11 08:22:27.801446] I [common.c:285:pl_trace_out] 8-patchy-locks: > [Invalid argument] Locker = {Pid=29885, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, > Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: > patchy-disperse-0, start=0, len=0, pid=0} > > This is a fragment of the client log: > > [2019-04-11 
08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ > [2019-04-11 08:22:20.675938] I [MSGID: 114018] > [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from > patchy-client-1. Client process will keep trying to connect to glusterd > until brick's port is available > [2019-04-11 08:22:21.674772] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (6). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ > [2019-04-11 08:22:23.691171] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (8). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:23.710420] I [MSGID: 114046] > [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected > to patchy-client-1, attached to remote volume '/d/backends/patchy1'. > [2019-04-11 08:22:23.791635] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:24.460529] I [MSGID: 114018] > [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from > patchy-client-1. Client process will keep trying to connect to glusterd > until brick's port is available > [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 68 5 online_brick_count ++++++++++ > [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o > 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ > [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 71 ec_test_make ++++++++++ > [2019-04-11 08:22:27.719299] I [MSGID: 114046] > [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected > to patchy-client-1, attached to remote volume '/d/backends/patchy1'. > [2019-04-11 08:22:27.840342] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > > The problem happens for two things: > > 1. Brick 0 gets disconnected randomly (apparently), but the server side is > not aware of these disconnections. This causes that at 08:22:24.460529, the > client has already sent a successful INODELK request to brick 0. At this > point the connection is broken on the client side, but server side doesn't > get any notification, so it doesn't clear the locks. > 2. When client reconnects at 08:22:27.719299, a new connection is created, > and the servers does see this new connection (it creates a new client_t > structure). Then the client sends the unlock request, which fails on brick > 0 because locks xlators checks if the client is the same by comparing the > pointers, but they are different because of the reconnection. So the lock > is not unlocked and remains there, blocking all future inodelk requests. > > The first problem is why the client gets disconnected and the server > doesn't get any notification. 
The script is stopping bricks 2 and 3 when > this happens. Brick 0 shouldn't fail here. It seems related to the > > The second problem is that when we receive a new connection from a client > we already consider connected, we don't cleanup the old connection, which > should take care of the stale locks. > > The third problem is that locks xlator is checking if the client is the > same by comparing pointers of client_t structs instead of comparing > client_uid field, which remains the same. > > Adding +Raghavendra Gowdappa , +Pranith Kumar > Karampuri , +Krutika Dhananjay > , +Shyam Ranganathan and +Amar Tumballi > to help me identify why this is happening and > what's the best way to solve it. > BTW, while testing this I also hit another problem. I think it's not related, but just in case... When DEBUG or TRACE log levels are used sometimes (2 out of some tens) I get an assertion failed when releasing memory because apparently it's trying to release more memory than it was allocated (based on gluster's memory accounting). So memory accounting says that there remain 14 bytes allocated of "gf_common_mt_strdup" type but it's trying to free an object that has size 30. At first sight it doesn't seem memory corruption. I'll investigate it further. > Xavi > > >> Xavi >> >> ./tests/basic/uss.t 15 ==> happens in both brick mux and non >>> brick mux runs, test just simply times out. Needs urgent analysis. >>> ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through >>> https://review.gluster.org/#/c/22508/ , patch merged today. >>> ./tests/basic/volfile-sanity.t 8 ==> Some race, though this >>> succeeds in second attempt every time. >>> >>> There're plenty more with 5 instances of failure from many tests. We >>> need all maintainers/owners to look through these failures and fix them, we >>> certainly don't want to get into a stage where master is unstable and we >>> have to lock down the merges till all these failures are resolved. So >>> please help. >>> >>> (Please note fstat stats show up the retries as failures too which in a >>> way is right) >>> >>> >>> On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee >>> wrote: >>> >>>> [1] captures the test failures report since last 30 days and we'd need >>>> volunteers/component owners to see why the number of failures are so high >>>> against few tests. >>>> >>>> [1] >>>> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >>>> >>> _______________________________________________ >>> Gluster-devel mailing list >>> Gluster-devel at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 10:31:51 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 16:01:51 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: <69E7C95F-8A81-46CB-8BD8-F66B582144EC@gmail.com> References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> <1634978A-E849-48DB-A160-B1AC3DB56D38@gmail.com> <69E7C95F-8A81-46CB-8BD8-F66B582144EC@gmail.com> Message-ID: On Thu, Apr 11, 2019 at 12:43 PM Martin Toth wrote: > Hi Karthik, > > more over, I would like to ask if there are some recommended > settings/parameters for SHD in order to achieve good or fair I/O while > volume will be healed when I will replace Brick (this should trigger > healing process). 
> If I understand you concern correctly, you need to get fair I/O performance for clients while healing takes place as part of the replace brick operation. For this you can turn off the "data-self-heal" and "metadata-self-heal" options until the heal completes on the new brick. Turning off client side healing doesn't compromise data integrity and consistency. During the read request from client, pending xattr is evaluated for replica copies and read is only served from correct copy. During writes, IO will continue on both the replicas, SHD will take care of healing files. After replacing the brick, we strongly recommend you to consider upgrading your gluster to one of the maintained versions. We have many stability related fixes there, which can handle some critical issues and corner cases which you could hit during these kind of scenarios. Regards, Karthik > I had some problems in past when healing was triggered, VM disks became > unresponsive because healing took most of I/O. My volume containing only > big files with VM disks. > > Thanks for suggestions. > BR, > Martin > > On 10 Apr 2019, at 12:38, Martin Toth wrote: > > Thanks, this looks ok to me, I will reset brick because I don't have any > data anymore on failed node so I can use same path / brick name. > > Is reseting brick dangerous command? Should I be worried about some > possible failure that will impact remaining two nodes? I am running really > old 3.7.6 but stable version. > > Thanks, > BR! > > Martin > > > On 10 Apr 2019, at 12:20, Karthik Subrahmanya wrote: > > Hi Martin, > > After you add the new disks and creating raid array, you can run the > following command to replace the old brick with new one: > > - If you are going to use a different name to the new brick you can run > gluster volume replace-brick commit force > > - If you are planning to use the same name for the new brick as well then > you can use > gluster volume reset-brick commit force > Here old-brick & new-brick's hostname & path should be same. > > After replacing the brick, make sure the brick comes online using volume > status. > Heal should automatically start, you can check the heal status to see all > the files gets replicated to the newly added brick. If it does not start > automatically, you can manually start that by running gluster volume heal > . > > HTH, > Karthik > > On Wed, Apr 10, 2019 at 3:13 PM Martin Toth wrote: > >> Hi all, >> >> I am running replica 3 gluster with 3 bricks. One of my servers failed - >> all disks are showing errors and raid is in fault state. >> >> Type: Replicate >> Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a >> Status: Started >> Number of Bricks: 1 x 3 = 3 >> Transport-type: tcp >> Bricks: >> Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 >> Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 >> >> So one of my bricks is totally failed (node2). It went down and all data >> are lost (failed raid on node2). Now I am running only two bricks on 2 >> servers out from 3. >> This is really critical problem for us, we can lost all data. I want to >> add new disks to node2, create new raid array on them and try to replace >> failed brick on this node. >> >> What is the procedure of replacing Brick2 on node2, can someone advice? I >> can?t find anything relevant in documentation. 
>> >> Thanks in advance, >> Martin >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 10:45:52 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 16:15:52 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: <2030773662.1746728.1554970222668@mail.yahoo.com> References: <2030773662.1746728.1554970222668@mail.yahoo.com> Message-ID: On Thu, Apr 11, 2019 at 1:40 PM Strahil Nikolov wrote: > Hi Karthik, > > - the volume configuration you were using? > I used oVirt 4.2.6 Gluster Wizard, so I guess - we need to involve the > oVirt devs here. > - why you wanted to replace your brick? > I have deployed the arbiter on another location as I thought I can deploy > the Thin Arbiter (still waiting the docs to be updated), but once I > realized that GlusterD doesn't support Thin Arbiter, I had to build another > machine for a local arbiter - thus a replacement was needed. > We are working on supporting Thin-arbiter with GlusterD. Once done, we will update on the users list so that you can play with it and let us know your experience. > - which brick(s) you tried replacing? > I was replacing the old arbiter with a new one > - what problem(s) did you face? > All oVirt VMs got paused due to I/O errors. > There could be many reasons for this. Without knowing the exact state of the system at that time, I am afraid to make any comment on this. > > At the end, I have rebuild the whole setup and I never tried to replace > the brick this way (used only reset-brick which didn't cause any issues). > > As I mentioned that was on v3.12, which is not the default for oVirt > 4.3.x - so my guess is that it is OK now (current is v5.5). > I don't remember anyone complaining about this recently. This should work in the latest releases. > > Just sharing my experience. > Highly appreciated. Regards, Karthik > > Best Regards, > Strahil Nikolov > > ? ?????????, 11 ????? 2019 ?., 0:53:52 ?. ???????-4, Karthik Subrahmanya < > ksubrahm at redhat.com> ??????: > > > Hi Strahil, > > Can you give us some more insights on > - the volume configuration you were using? > - why you wanted to replace your brick? > - which brick(s) you tried replacing? > - what problem(s) did you face? > > Regards, > Karthik > > On Thu, Apr 11, 2019 at 10:14 AM Strahil wrote: > > Hi Karthnik, > I used only once the brick replace function when I wanted to change my > Arbiter (v3.12.15 in oVirt 4.2.7) and it was a complete disaster. > Most probably I should have stopped the source arbiter before doing that, > but the docs didn't mention it. > > Thus I always use reset-brick, as it never let me down. > > Best Regards, > Strahil Nikolov > On Apr 11, 2019 07:34, Karthik Subrahmanya wrote: > > Hi Strahil, > > Thank you for sharing your experience with reset-brick option. > Since he is using the gluster version 3.7.6, we do not have the > reset-brick [1] option implemented there. It is introduced in 3.9.0. He has > to go with replace-brick with the force option if he wants to use the same > path & name for the new brick. > Yes, it is recommended to have the new brick to be of the same size as > that of the other bricks. 
> > [1] > https://docs.gluster.org/en/latest/release-notes/3.9.0/#introducing-reset-brick-command > > Regards, > Karthik > > On Wed, Apr 10, 2019 at 10:31 PM Strahil wrote: > > I have used reset-brick - but I have just changed the brick layout. > You may give it a try, but I guess you need your new brick to have same > amount of space (or more). > > Maybe someone more experienced should share a more sound solution. > > Best Regards, > Strahil NikolovOn Apr 10, 2019 12:42, Martin Toth > wrote: > > > > Hi all, > > > > I am running replica 3 gluster with 3 bricks. One of my servers failed - > all disks are showing errors and raid is in fault state. > > > > Type: Replicate > > Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a > > Status: Started > > Number of Bricks: 1 x 3 = 3 > > Transport-type: tcp > > Bricks: > > Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 > > Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 down > > Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 > > > > So one of my bricks is totally failed (node2). It went down and all data > are lost (failed raid on node2). Now I am running only two bricks on 2 > servers out from 3. > > This is really critical problem for us, we can lost all data. I want to > add new disks to node2, create new raid array on them and try to replace > failed brick on this node. > > > > What is the procedure of replacing Brick2 on node2, can someone advice? > I can?t find anything relevant in documentation. > > > > Thanks in advance, > > Martin > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Thu Apr 11 13:40:30 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Thu, 11 Apr 2019 19:10:30 +0530 Subject: [Gluster-devel] [Gluster-users] Replica 3 - how to replace failed node (peer) In-Reply-To: <00009213-6BF3-4A7F-AFA7-AC076B04496C@gmail.com> References: <0917AF4A-76EC-4A9E-820F-E0ADA2DA899A@gmail.com> <1634978A-E849-48DB-A160-B1AC3DB56D38@gmail.com> <69E7C95F-8A81-46CB-8BD8-F66B582144EC@gmail.com> <00009213-6BF3-4A7F-AFA7-AC076B04496C@gmail.com> Message-ID: On Thu, Apr 11, 2019 at 6:38 PM Martin Toth wrote: > Hi Karthik, > > On Thu, Apr 11, 2019 at 12:43 PM Martin Toth wrote: > >> Hi Karthik, >> >> more over, I would like to ask if there are some recommended >> settings/parameters for SHD in order to achieve good or fair I/O while >> volume will be healed when I will replace Brick (this should trigger >> healing process). >> > If I understand you concern correctly, you need to get fair I/O > performance for clients while healing takes place as part of the replace > brick operation. For this you can turn off the "data-self-heal" and > "metadata-self-heal" options until the heal completes on the new brick. > > > This is exactly what I mean. I am running VM disks on remaining 2 (out of > 3 - one failed as mentioned) nodes and I need to ensure there will be fair > I/O performance available on these two nodes while replace brick operation > will heal volume. > I will not run any VMs on node where replace brick operation will be > running. 
So if I understand correctly, when I will set : > > # gluster volume set cluster.data-self-heal off > # gluster volume set cluster.metadata-self-heal off > > this will tell Gluster clients (libgfapi and FUSE mount) not to read from > node ?where replace brick operation? is in place but from remaing two > healthy nodes. Is this correct ? Thanks for clarification. > The reads will be served from one of the good bricks since the file will either be not present on the replaced brick at the time of read or it will be present but marked for heal if it is not already healed. If already healed by SHD, then it could be served from the new brick as well, but there won't be any problem in reading from there in that scenario. By setting these two options whenever a read comes from client it will not try to heal the file for data/metadata. Otherwise it would try to heal (if not already healed by SHD) when the read comes on this, hence slowing down the client. > > Turning off client side healing doesn't compromise data integrity and > consistency. During the read request from client, pending xattr is > evaluated for replica copies and read is only served from correct copy. > During writes, IO will continue on both the replicas, SHD will take care of > healing files. > After replacing the brick, we strongly recommend you to consider upgrading > your gluster to one of the maintained versions. We have many stability > related fixes there, which can handle some critical issues and corner cases > which you could hit during these kind of scenarios. > > > This will be first priority in infrastructure after fixing this cluster > back to fully functional replica3. I will upgrade to 3.12.x and then to > version 5 or 6. > Sounds good. If you are planning to have the same name for the new brick and if you get the error like "Brick may be containing or be contained by an existing brick" even after using the force option, try using a different name. That should work. Regards, Karthik > > BR, > Martin > > Regards, > Karthik > >> I had some problems in past when healing was triggered, VM disks became >> unresponsive because healing took most of I/O. My volume containing only >> big files with VM disks. >> >> Thanks for suggestions. >> BR, >> Martin >> >> On 10 Apr 2019, at 12:38, Martin Toth wrote: >> >> Thanks, this looks ok to me, I will reset brick because I don't have any >> data anymore on failed node so I can use same path / brick name. >> >> Is reseting brick dangerous command? Should I be worried about some >> possible failure that will impact remaining two nodes? I am running really >> old 3.7.6 but stable version. >> >> Thanks, >> BR! >> >> Martin >> >> >> On 10 Apr 2019, at 12:20, Karthik Subrahmanya >> wrote: >> >> Hi Martin, >> >> After you add the new disks and creating raid array, you can run the >> following command to replace the old brick with new one: >> >> - If you are going to use a different name to the new brick you can run >> gluster volume replace-brick commit >> force >> >> - If you are planning to use the same name for the new brick as well then >> you can use >> gluster volume reset-brick commit force >> Here old-brick & new-brick's hostname & path should be same. >> >> After replacing the brick, make sure the brick comes online using volume >> status. >> Heal should automatically start, you can check the heal status to see all >> the files gets replicated to the newly added brick. If it does not start >> automatically, you can manually start that by running gluster volume heal >> . 
>> >> HTH, >> Karthik >> >> On Wed, Apr 10, 2019 at 3:13 PM Martin Toth wrote: >> >>> Hi all, >>> >>> I am running replica 3 gluster with 3 bricks. One of my servers failed - >>> all disks are showing errors and raid is in fault state. >>> >>> Type: Replicate >>> Volume ID: 41d5c283-3a74-4af8-a55d-924447bfa59a >>> Status: Started >>> Number of Bricks: 1 x 3 = 3 >>> Transport-type: tcp >>> Bricks: >>> Brick1: node1.san:/tank/gluster/gv0imagestore/brick1 >>> Brick2: node2.san:/tank/gluster/gv0imagestore/brick1 >> down >>> Brick3: node3.san:/tank/gluster/gv0imagestore/brick1 >>> >>> So one of my bricks is totally failed (node2). It went down and all data >>> are lost (failed raid on node2). Now I am running only two bricks on 2 >>> servers out from 3. >>> This is really critical problem for us, we can lost all data. I want to >>> add new disks to node2, create new raid array on them and try to replace >>> failed brick on this node. >>> >>> What is the procedure of replacing Brick2 on node2, can someone advice? >>> I can?t find anything relevant in documentation. >>> >>> Thanks in advance, >>> Martin >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Thu Apr 11 16:43:54 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 11 Apr 2019 22:13:54 +0530 Subject: [Gluster-devel] [Gluster-infra] Upgrading build.gluster.org Message-ID: Hello, I?ve planned to do an upgrade of build.gluster.org tomorrow morning so as to install and pull in the latest security upgrade of the Jenkins plugins. I?ll stop all the running jobs and re-trigger them once I'm done with the upgrade. The downtime window will be from : UTC: 0330 to 0400 IST: 0900 to 0930 The outage is for 30 minutes. Please bear with us as we continue to ensure the latest plugins and fixes for build.gluster.org Thanks, Deepshikha -------------- next part -------------- An HTML attachment was scrubbed... URL: From rabhat at redhat.com Thu Apr 11 19:18:41 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Thu, 11 Apr 2019 15:18:41 -0400 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: While analysing the logs of the runs where uss.t failed made following observations. 1) In the first iteration of uss.t, the time difference between the first test of the .t file and the last test of the .t file is just within 1 minute. But, I think it is the cleanup sequence which is taking more time. One of the reasons I guess this is happening is, we dont see the brick process shutting down message in the logs. 2) In the 2nd iteration of uss.t (because 1st iteration failed because of timeout) it fails because something has not been completed in the cleanup sequence of the previous iteration. The volume start command itself fails in the 2nd iteration. 
Because of that the remaining tests also fail This is from cmd_history.log uster.org:/d/backends/2/patchy_snap_mnt builder202.int.aws.gluster.org:/d/backends/3/patchy_snap_mnt ++++++++++ [2019-04-10 19:54:09.145086] : volume create patchy builder202.int.aws.gluster.org:/d/backends/1/patchy_snap_mnt builder202.int.aws.gluster.org:/d/backends/2/patchy_snap_mnt builder202.int.aws.gluster.org:/d/backends/3/patchy_snap_mnt : SUCCESS [2019-04-10 19:54:09.156221]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 39 gluster --mode=script --wignore volume set patchy nfs.disable false ++++++++++ [2019-04-10 19:54:09.265138] : volume set patchy nfs.disable false : SUCCESS [2019-04-10 19:54:09.274386]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 42 gluster --mode=script --wignore volume start patchy ++++++++++ [2019-04-10 19:54:09.565086] : volume start patchy : FAILED : Commit failed on localhost. Please check log file for details. [2019-04-10 19:54:09.572753]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 44 _GFS --attribute-timeout=0 --entry-timeout=0 --volfile-server= builder202.int.aws.gluster.org --volfile-id=patchy /mnt/glusterfs/0 ++++++++++ And this is from the brick showing some issue with the export directory not being present properly. [2019-04-10 19:54:09.544476] I [MSGID: 100030] [glusterfsd.c:2857:main] 0-/build/install/sbin/glusterfsd: Started running /build/install/sbin/glusterfsd version 7dev (args: /build/install/sbin/glusterfsd -s buil der202.int.aws.gluster.org --volfile-id patchy.builder202.int.aws.gluster.org.d-backends-1-patchy_snap_mnt -p /var/run/gluster/vols/patchy/builder202.int.aws.gluster.org-d-backends-1-patchy_snap_mnt.pid -S /var/ run/gluster/7ac65190b72da80a.socket --brick-name /d/backends/1/patchy_snap_mnt -l /var/log/glusterfs/bricks/d-backends-1-patchy_snap_mnt.log --xlator-option *-posix.glusterd-uuid=695c060d-74d3-440e-8cdb-327ec297 f2d2 --process-name brick --brick-port 49152 --xlator-option patchy-server.listen-port=49152) [2019-04-10 19:54:09.549394] I [socket.c:962:__socket_server_bind] 0-socket.glusterfsd: closing (AF_UNIX) reuse check socket 9 [2019-04-10 19:54:09.553190] I [MSGID: 101190] [event-epoll.c:680:event_dispatch_epoll_worker] 0-epoll: Started thread with index 1 [2019-04-10 19:54:09.553209] I [MSGID: 101190] [event-epoll.c:680:event_dispatch_epoll_worker] 0-epoll: Started thread with index 0 [2019-04-10 19:54:09.556932] I [rpcsvc.c:2694:rpcsvc_set_outstanding_rpc_limit] 0-rpc-service: Configured rpc.outstanding-rpc-limit with value 64 [2019-04-10 19:54:09.557859] E [MSGID: 138001] [index.c:2392:init] 0-patchy-index: Failed to find parent dir (/d/backends/1/patchy_snap_mnt/.glusterfs) of index basepath /d/backends/1/patchy_snap_mnt/.glusterfs/ indices. 
[No such file or directory] ============================> (.glusterfs is absent) [2019-04-10 19:54:09.557884] E [MSGID: 101019] [xlator.c:629:xlator_init] 0-patchy-index: Initialization of volume 'patchy-index' failed, review your volfile again [2019-04-10 19:54:09.557892] E [MSGID: 101066] [graph.c:409:glusterfs_graph_init] 0-patchy-index: initializing translator failed [2019-04-10 19:54:09.557900] E [MSGID: 101176] [graph.c:772:glusterfs_graph_activate] 0-graph: init failed [2019-04-10 19:54:09.564154] I [io-stats.c:4033:fini] 0-patchy-io-stats: io-stats translator unloaded [2019-04-10 19:54:09.564748] W [glusterfsd.c:1592:cleanup_and_exit] (-->/build/install/sbin/glusterfsd(mgmt_getspec_cbk+0x806) [0x411f32] -->/build/install/sbin/glusterfsd(glusterfs_process_volfp+0x272) [0x40b9b 9] -->/build/install/sbin/glusterfsd(cleanup_and_exit+0x88) [0x4093a5] ) 0-: received signum (-1), shutting down And this is from the cmd_history.log file of the 2nd iteration uss.t from another jenkins run of uss.t [2019-04-10 15:35:51.927343]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 39 gluster --mode=script --wignore volume set patchy nfs.disable false ++++++++++ [2019-04-10 15:35:52.038072] : volume set patchy nfs.disable false : SUCCESS [2019-04-10 15:35:52.057582]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 42 gluster --mode=script --wignore volume start patchy ++++++++++ [2019-04-10 15:35:52.104288] : volume start patchy : FAILED : Failed to find brick directory /d/backends/1/patchy_snap_mnt for volume patchy. Reason : No such file or directory =========> (export directory is not present) [2019-04-10 15:35:52.117735]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 44 _GFS --attribute-timeout=0 --entry-timeout=0 --volfile-server= builder205.int.aws.gluster.org --volfile-id=patchy /mnt/glusterfs/0 ++++++++++ I suspect something wrong with the cleanup sequence which causes the timeout of the test in the 1st iteration and the export directory issues in the next iteration causes the failure of uss.t in the 2nd iteration. Regards, Raghavendra On Wed, Apr 10, 2019 at 4:07 PM FNU Raghavendra Manjunath wrote: > > > On Wed, Apr 10, 2019 at 9:59 AM Atin Mukherjee > wrote: > >> And now for last 15 days: >> >> >> https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 >> >> ./tests/bitrot/bug-1373520.t 18 ==> Fixed through >> https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this >> failing in brick mux post 5th April >> > > The above patch has been sent to fix the failure with brick mux enabled. > > >> ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, >> needs analysis. >> ./tests/basic/uss.t 15 ==> happens in both brick mux and non >> brick mux runs, test just simply times out. Needs urgent analysis. >> > > Nothing has changed in snapview-server and snapview-client recently. > Looking into it. > > ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through >> https://review.gluster.org/#/c/22508/ , patch merged today. >> ./tests/basic/volfile-sanity.t 8 ==> Some race, though this >> succeeds in second attempt every time. >> >> There're plenty more with 5 instances of failure from many tests. We need >> all maintainers/owners to look through these failures and fix them, we >> certainly don't want to get into a stage where master is unstable and we >> have to lock down the merges till all these failures are resolved. So >> please help. 
>> >> (Please note fstat stats show up the retries as failures too which in a >> way is right) >> >> >> On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee >> wrote: >> >>> [1] captures the test failures report since last 30 days and we'd need >>> volunteers/component owners to see why the number of failures are so high >>> against few tests. >>> >>> [1] >>> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >>> >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From moagrawa at redhat.com Fri Apr 12 02:56:06 2019 From: moagrawa at redhat.com (Mohit Agrawal) Date: Fri, 12 Apr 2019 08:26:06 +0530 Subject: [Gluster-devel] Be careful before closing fd in a default case Message-ID: Hi, I want to highlight recent bug( https://bugzilla.redhat.com/show_bug.cgi?id=1699025) due to raised after fixed one Coverity bug https://review.gluster.org/#/c/glusterfs/+/20720/ As we know all gluster processes initially keeping open standard fd's (0,1,2) at the time of daemonizing so that kernel don't assign these fd's to any fd open by gluster process. In this Coverity bug, we closed fd in changelog fini if fd value is not equal to -1. As we know GF_CALLOC initializes to all structure members to 0 so initial fd value was 0 and changelog_init did not open htime_fd because changelog was not active so at the time of calling changelog fini it closes fd(0). After closing fd(0) by changelog fini if any client(shd) is trying to establish a connection with the server(in the brick_mux environment), the server gets fd(0) as a socket fd. I have observed socket event framework (socket_event_handler) was not working perfectly for fd(0) while volumes are stopped in a loop in brick_mux environment and bricks are not detached successfully. So always we should careful at the time of closing fd, before closing fd in default case we should check fd should not be zero. I have fixed the same from ( https://review.gluster.org/#/c/glusterfs/+/22549/) and upload a .t also. Regards, Mohit Agrawal -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhernandez at redhat.com Fri Apr 12 12:34:54 2019 From: xhernandez at redhat.com (Xavi Hernandez) Date: Fri, 12 Apr 2019 14:34:54 +0200 Subject: [Gluster-devel] Possible issues with shared threads Message-ID: Hi, I've found some issues with memory accounting and I've written a patch [1] to fix them. However during the tests I've found another problem: In a brick-multiplexed environment, posix tries to start a single janitor thread shared by all posix xlator instances, however there are two issues: 1. The creation is not atomic and it could happen that more than one janitor thread is started (unless xlator init is serialized in some way) 2. Even though the thread is global, it's using information from a single instance (through 'this'). This means that once the first instance of posix xlator is stopped, 'this' can be destroyed, but the janitor will continue using it. From the memory accounting point of view, it means that whatever this thread does, is not tracked anymore. Note that we only need to write a log message to access 'this' and use dynamic memory. I detected this problem in the posix xlator, but since there are other threads that have been made global, maybe something similar could happen. 
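A minimal sketch of one way to handle both points for any of these shared
threads: pthread_once() for an atomic one-time start, and work items that
carry their own xlator reference instead of a 'this' captured from the
first instance. All names below are hypothetical, not the current posix
code.

/* Sketch only: a shared worker thread that is started exactly once and
 * never depends on the xlator instance that happened to start it. */
#include <pthread.h>
#include <unistd.h>

static pthread_once_t  shared_janitor_once = PTHREAD_ONCE_INIT;
static pthread_t       shared_janitor_tid;
static pthread_mutex_t shared_janitor_lock = PTHREAD_MUTEX_INITIALIZER;
static int             shared_janitor_users;   /* protected by the lock */

static void *
shared_janitor_fn (void *arg)
{
        (void) arg;
        /* Never dereference a single instance's 'this' here: each queued
         * work item should hold a reference on its own xlator, so logging
         * and memory accounting stay valid even after the instance that
         * started the thread has been stopped. */
        for (;;)
                sleep (1);   /* placeholder for waiting on a work queue */

        return NULL;
}

static void
shared_janitor_start (void)
{
        pthread_create (&shared_janitor_tid, NULL, shared_janitor_fn, NULL);
}

int
shared_janitor_get (void)        /* called from every instance's init() */
{
        pthread_mutex_lock (&shared_janitor_lock);
        shared_janitor_users++;
        pthread_mutex_unlock (&shared_janitor_lock);

        /* pthread_once() makes creation atomic even when several
         * instances initialize concurrently. */
        return pthread_once (&shared_janitor_once, shared_janitor_start);
}

void
shared_janitor_put (void)        /* called from every instance's fini() */
{
        pthread_mutex_lock (&shared_janitor_lock);
        if (--shared_janitor_users == 0) {
                /* last user: signal the thread to drain and exit */
        }
        pthread_mutex_unlock (&shared_janitor_lock);
}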
I think this need to be checked and fixed. Xavi [1] https://review.gluster.org/c/glusterfs/+/22554 -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenkins at build.gluster.org Mon Apr 15 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 15 Apr 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <1991779699.5.1555292702672.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1699023 / core: Brick is not able to detach successfully in brick_mux environment https://bugzilla.redhat.com/1695416 / core: client log flooding with intentional socket shutdown message when a brick is down https://bugzilla.redhat.com/1691833 / core: Client sends 128KByte network packet for 0 length file copy https://bugzilla.redhat.com/1695480 / core: Global Thread Pool https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down directory listing https://bugzilla.redhat.com/1698861 / disperse: Renaming a directory when 2 bricks of multiple disperse subvols are down leaves both old and new dirs on the bricks. https://bugzilla.redhat.com/1697293 / distribute: DHT: print hash and layout values in hexadecimal format in the logs https://bugzilla.redhat.com/1697971 / fuse: Segfault in FUSE process, potential use after free https://bugzilla.redhat.com/1696721 / geo-replication: geo-replication failing after upgrade from 5.5 to 6.0 https://bugzilla.redhat.com/1694637 / geo-replication: Geo-rep: Rename to an existing file name destroys its content on slave https://bugzilla.redhat.com/1689981 / geo-replication: OSError: [Errno 1] Operation not permitted - failing with socket files? https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job 'heketi-storage-copy-job' to complete on one-node k3s deployment. https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs processes keeps increasing, using all available resources https://bugzilla.redhat.com/1690454 / posix-acl: mount-shared-storage.sh does not implement mount options https://bugzilla.redhat.com/1696518 / project-infrastructure: builder203 does not have a valid hostname set https://bugzilla.redhat.com/1697890 / project-infrastructure: centos-regression is not giving its vote https://bugzilla.redhat.com/1697923 / project-infrastructure: CI: collect core file in a job artifacts https://bugzilla.redhat.com/1691617 / project-infrastructure: clang-scan tests are failing nightly. 
https://bugzilla.redhat.com/1691357 / project-infrastructure: core archive link from regression jobs throw not found error https://bugzilla.redhat.com/1692349 / project-infrastructure: gluster-csi-containers job is failing https://bugzilla.redhat.com/1698716 / project-infrastructure: Regression job did not vote for https://review.gluster.org/#/c/glusterfs/+/22366/ https://bugzilla.redhat.com/1698694 / project-infrastructure: regression job isn't voting back to gerrit https://bugzilla.redhat.com/1693385 / project-infrastructure: request to change the version of fedora in fedora-smoke-job https://bugzilla.redhat.com/1693295 / project-infrastructure: rpc.statd not started on builder204.aws.gluster.org https://bugzilla.redhat.com/1691789 / project-infrastructure: rpc-statd service stops on AWS builders https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke fails with "Build root is locked by another process" https://bugzilla.redhat.com/1693184 / replicate: A brick process(glusterfsd) died with 'memory violation' https://bugzilla.redhat.com/1698566 / selfheal: shd crashed while executing ./tests/bugs/core/bug-1432542-mpx-restart-crash.t in CI https://bugzilla.redhat.com/1699309 / snapshot: Gluster snapshot fails with systemd autmounted bricks https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from /tests/bugs/ module failing on Intel https://bugzilla.redhat.com/1697812 / website: mention a pointer to all the mailing lists available under glusterfs project(https://www.gluster.org/community/) [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 3938 bytes Desc: not available URL: From cynthia.zhou at nokia-sbell.com Mon Apr 15 02:40:48 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 15 Apr 2019 02:40:48 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> Message-ID: <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> Hi, I made a patch and according to my test, this glusterd stuck issue disappear with my patch. Only need to move event_handled to the end of socket_event_poll_in function. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } - if (notify_handled && (ret != -1)) - event_handled (ctx->event_pool, priv->sock, priv->idx, - priv->gen); @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } pthread_mutex_unlock (&priv->notify.lock); } + if (notify_handled && (ret != -1)) + event_handled (ctx->event_pool, priv->sock, priv->idx, + priv->gen); return ret; } cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Tuesday, April 09, 2019 3:57 PM To: 'Raghavendra Gowdappa' Cc: gluster-devel at gluster.org Subject: RE: glusterd stuck for glusterfs with version 3.12.15 Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. 
cynthia From: Raghavendra Gowdappa > Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in 
rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. (gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Mon Apr 15 03:51:48 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Mon, 15 Apr 2019 09:21:48 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> Message-ID: Cynthia, On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > I made a patch and according to my test, this glusterd stuck issue > disappear with my patch. Only need to move event_handled to the end of > socket_event_poll_in function. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > > > - if (notify_handled && (ret != -1)) > > - event_handled (ctx->event_pool, priv->sock, priv->idx, > > - priv->gen); > > @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > pthread_mutex_unlock (&priv->notify.lock); > > } > > + if (notify_handled && (ret != -1)) > > + event_handled (ctx->event_pool, priv->sock, priv->idx, > > + priv->gen); > Thanks for this tip. Though this helps in fixing the hang, this change has performance impact. Moving event_handled to end of poll_in means, socket will be added back for polling of new events only _after_ the rpc is msg is processed by higher layers (like EC) and higher layers can have significant latency for processing the msg. Which means, socket will be out of polling for longer periods of time which decreases the throughput (number of msgs read per second) affecting performance. However, this experiment definitely indicates there is a codepath where event_handled is not called (and hence causing the hang). I'll go through this codepath again. Thanks for that experiment :). 
return ret; > > } > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Tuesday, April 09, 2019 3:57 PM > *To:* 'Raghavendra Gowdappa' > *Cc:* gluster-devel at gluster.org > *Subject:* RE: glusterd stuck for glusterfs with version 3.12.15 > > > > Can you figure out some possible reason why iobref is corrupted, is it > possible that thread 8 has called poll in and iobref has been relased, but > the lock within it is not properly released (as I can not find any free > lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy > again, and so stuck on this lock > > Also, there should not be two thread handling the same socket at the same > time, although there has been a patch claimed to tackle this issue. > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Tuesday, April 09, 2019 3:52 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! 
> > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. 
> > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@ > \000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, > ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} > > > > looks like the iobref is corrupted here. It seems to be a use-after-free > issue. We need to dig into why a freed iobref is being accessed here. > > > > (gdb) quit > > A debugging session is active. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 15 05:38:27 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 15 Apr 2019 05:38:27 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> Message-ID: <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Ok, thanks for your comment! cynthia From: Raghavendra Gowdappa Sent: Monday, April 15, 2019 11:52 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 Cynthia, On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, I made a patch and according to my test, this glusterd stuck issue disappear with my patch. Only need to move event_handled to the end of socket_event_poll_in function. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } - if (notify_handled && (ret != -1)) - event_handled (ctx->event_pool, priv->sock, priv->idx, - priv->gen); @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } pthread_mutex_unlock (&priv->notify.lock); } + if (notify_handled && (ret != -1)) + event_handled (ctx->event_pool, priv->sock, priv->idx, + priv->gen); Thanks for this tip. Though this helps in fixing the hang, this change has performance impact. Moving event_handled to end of poll_in means, socket will be added back for polling of new events only _after_ the rpc is msg is processed by higher layers (like EC) and higher layers can have significant latency for processing the msg. Which means, socket will be out of polling for longer periods of time which decreases the throughput (number of msgs read per second) affecting performance. However, this experiment definitely indicates there is a codepath where event_handled is not called (and hence causing the hang). I'll go through this codepath again. Thanks for that experiment :). 
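To spell out that trade-off with a stripped-down illustration (plain epoll,
not gluster code): the re-arm call below plays the role of event_handled(),
and its position decides whether handling of one socket can be pipelined
across poller threads or is serialized.

/* Illustration only: why the position of the re-arm matters with
 * oneshot-style polling. */
#include <sys/epoll.h>
#include <unistd.h>

static void
rearm (int epfd, int fd, void *conn)
{
        struct epoll_event ev = { 0 };

        ev.events   = EPOLLIN | EPOLLONESHOT;
        ev.data.ptr = conn;
        epoll_ctl (epfd, EPOLL_CTL_MOD, fd, &ev);  /* fd is pollable again */
}

static void
handle_pollin (int epfd, int fd, void *conn)
{
        char    buf[4096];
        ssize_t n = read (fd, buf, sizeof (buf));  /* read one message */

        /* Ordering A (current code): re-arm before notifying the upper
         * layers. Another poller thread can pick up the next message on
         * this socket while this one is still being processed, which is
         * good for throughput but lets two handlers race on shared
         * connection state. */
        rearm (epfd, fd, conn);

        /* process_msg (conn, buf, n);  potentially slow (EC and friends) */

        /* Ordering B (the experiment above): move rearm() here, after the
         * processing. Handling is serialized per socket, which avoids the
         * race, but the socket stays out of the poll set for the whole
         * processing time, so fewer messages are read per second. */
        (void) n;
}

That is the performance concern behind keeping the early re-arm and fixing
the racy codepath instead of simply moving the call.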
return ret; } cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Tuesday, April 09, 2019 3:57 PM To: 'Raghavendra Gowdappa' > Cc: gluster-devel at gluster.org Subject: RE: glusterd stuck for glusterfs with version 3.12.15 Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. cynthia From: Raghavendra Gowdappa > Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! 
SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Mon Apr 15 06:35:37 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Mon, 15 Apr 2019 12:05:37 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Ok, thanks for your comment! > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Monday, April 15, 2019 11:52 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > Cynthia, > > > > On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > I made a patch and according to my test, this glusterd stuck issue > disappear with my patch. Only need to move event_handled to the end of > socket_event_poll_in function. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > > > - if (notify_handled && (ret != -1)) > > - event_handled (ctx->event_pool, priv->sock, priv->idx, > > - priv->gen); > > @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > pthread_mutex_unlock (&priv->notify.lock); > > } > > + if (notify_handled && (ret != -1)) > > + event_handled (ctx->event_pool, priv->sock, priv->idx, > > + priv->gen); > > > > Thanks for this tip. Though this helps in fixing the hang, this change has > performance impact. Moving event_handled to end of poll_in means, socket > will be added back for polling of new events only _after_ the rpc is msg is > processed by higher layers (like EC) and higher layers can have significant > latency for processing the msg. Which means, socket will be out of polling > for longer periods of time which decreases the throughput (number of msgs > read per second) affecting performance. However, this experiment definitely > indicates there is a codepath where event_handled is not called (and hence > causing the hang). I'll go through this codepath again. > Can you check whether patch [1] fixes the issue you are seeing? [1] https://review.gluster.org/#/c/glusterfs/+/22566 > > Thanks for that experiment :). 
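A note on the mechanics being discussed: per the description above, event_handled() is what adds the transport's socket back for polling, so its position inside socket_event_poll_in() decides how soon another epoll worker can be handed the same socket again. The standard-epoll analogue of that behaviour is a one-shot registration that has to be explicitly re-armed. The small standalone program below (plain epoll on a pipe, not GlusterFS code; names and timeouts invented for the demo) shows why the placement of the re-arm is a latency/concurrency trade-off: the fd reports nothing after the first notification until it is re-armed with EPOLL_CTL_MOD.

/* Standalone illustration of one-shot epoll registrations (plain epoll on a
 * pipe, not GlusterFS code): after the first notification the fd stays silent
 * until it is explicitly re-armed, which is the effect event_handled() has on
 * the transport's socket. Compile: gcc -o oneshot oneshot.c */
#include <stdio.h>
#include <sys/epoll.h>
#include <unistd.h>

int main(void)
{
    int pipefd[2];
    if (pipe(pipefd) != 0)
        return 1;

    int epfd = epoll_create1(0);
    struct epoll_event ev = { .events = EPOLLIN | EPOLLONESHOT,
                              .data.fd = pipefd[0] };
    epoll_ctl(epfd, EPOLL_CTL_ADD, pipefd[0], &ev);

    write(pipefd[1], "x", 1);           /* data becomes available */

    struct epoll_event out;
    printf("first wait:  %d event(s)\n", epoll_wait(epfd, &out, 1, 100)); /* 1 */

    /* The byte is still unread, but the one-shot registration is disarmed,
     * so nothing more is reported ... */
    printf("second wait: %d event(s)\n", epoll_wait(epfd, &out, 1, 100)); /* 0 */

    /* ... until the fd is re-armed. The later this re-arm happens in the
     * handler, the longer the fd stays out of polling (lower throughput), but
     * the smaller the window for a second worker to start on the same fd. */
    epoll_ctl(epfd, EPOLL_CTL_MOD, pipefd[0], &ev);
    printf("third wait:  %d event(s)\n", epoll_wait(epfd, &out, 1, 100)); /* 1 */
    return 0;
}

Under this model, re-arming early maximizes read throughput but opens the window in which a second worker can begin handling the same socket while the first is still tearing down its pollin state, which matches the race described in this thread.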
> > > > return ret; > > } > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Tuesday, April 09, 2019 3:57 PM > *To:* 'Raghavendra Gowdappa' > *Cc:* gluster-devel at gluster.org > *Subject:* RE: glusterd stuck for glusterfs with version 3.12.15 > > > > Can you figure out some possible reason why iobref is corrupted, is it > possible that thread 8 has called poll in and iobref has been relased, but > the lock within it is not properly released (as I can not find any free > lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy > again, and so stuck on this lock > > Also, there should not be two thread handling the same socket at the same > time, although there has been a patch claimed to tackle this issue. > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Tuesday, April 09, 2019 3:52 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! 
> > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. 
> > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@ > \000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, > ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} > > > > looks like the iobref is corrupted here. It seems to be a use-after-free > issue. We need to dig into why a freed iobref is being accessed here. > > > > (gdb) quit > > A debugging session is active. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 15 07:21:53 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 15 Apr 2019 07:21:53 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: Hi, The reason why I move event_handled to the end of socket_event_poll_in is because if event_handled is called before rpc_transport_pollin_destroy, it allowed another round of event_dispatch_epoll_handler happen before rpc_transport_pollin_destroy, in this way, when the latter poll in goes to rpc_transport_pollin_destroy , there is a chance that the pollin->iobref has already been destroyed by the first one(there is no lock destroy for iobref->lock in iobref_destroy by the way). That may cause stuck in ?LOCK (&iobref->lock);? I find the one of recent patch SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket I think the point is to avoid the concurrent handling of the same socket at the same time, but after my test with this patch this problem also exists, so I think event_handled is still called too early to allow concurrent handling of the same socket happen, and after move it to the end of socket_event_poll this glusterd stuck issue disappeared. cynthia From: Raghavendra Gowdappa Sent: Monday, April 15, 2019 2:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Ok, thanks for your comment! cynthia From: Raghavendra Gowdappa > Sent: Monday, April 15, 2019 11:52 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 Cynthia, On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, I made a patch and according to my test, this glusterd stuck issue disappear with my patch. Only need to move event_handled to the end of socket_event_poll_in function. 
--- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } - if (notify_handled && (ret != -1)) - event_handled (ctx->event_pool, priv->sock, priv->idx, - priv->gen); @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } pthread_mutex_unlock (&priv->notify.lock); } + if (notify_handled && (ret != -1)) + event_handled (ctx->event_pool, priv->sock, priv->idx, + priv->gen); Thanks for this tip. Though this helps in fixing the hang, this change has performance impact. Moving event_handled to end of poll_in means, socket will be added back for polling of new events only _after_ the rpc is msg is processed by higher layers (like EC) and higher layers can have significant latency for processing the msg. Which means, socket will be out of polling for longer periods of time which decreases the throughput (number of msgs read per second) affecting performance. However, this experiment definitely indicates there is a codepath where event_handled is not called (and hence causing the hang). I'll go through this codepath again. Can you check whether patch [1] fixes the issue you are seeing? [1] https://review.gluster.org/#/c/glusterfs/+/22566 Thanks for that experiment :). return ret; } cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Tuesday, April 09, 2019 3:57 PM To: 'Raghavendra Gowdappa' > Cc: gluster-devel at gluster.org Subject: RE: glusterd stuck for glusterfs with version 3.12.15 Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. cynthia From: Raghavendra Gowdappa > Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! 
SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Mon Apr 15 08:35:54 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Mon, 15 Apr 2019 14:05:54 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: On Mon, Apr 15, 2019 at 12:52 PM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > The reason why I move event_handled to the end of socket_event_poll_in is > because if event_handled is called before rpc_transport_pollin_destroy, it > allowed another round of event_dispatch_epoll_handler happen before > rpc_transport_pollin_destroy, in this way, when the latter poll in goes to > rpc_transport_pollin_destroy , there is a chance that the pollin->iobref > has already been destroyed by the first one(there is no lock destroy for > iobref->lock in iobref_destroy by the way). That may cause stuck in ?LOCK > (&iobref->lock);? > > I find the one of recent patch > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > I think the point is to avoid the concurrent handling of the same socket > at the same time, but after my test with this patch this problem also > exists, so I think event_handled is still called too early to allow > concurrent handling of the same socket happen, > But concurrent handling is required for performance. So, we cannot serialize it. and after move it to the end of socket_event_poll this glusterd stuck issue > disappeared. > Ideally there shouldn't be a single instance of datastructure that should be shared between two instances of pollin handling. My initial code-reading didn't find any issues with the way iobref is handled even when there is concurrent reading when the previous message was still not notified. I'll continue to investigate how objects are shared across two instances of pollin. Will post if I find anything interesting. cynthia > > *From:* Raghavendra Gowdappa > *Sent:* Monday, April 15, 2019 2:36 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Ok, thanks for your comment! 
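One way to picture the failure mode suspected above (a freed iobref whose embedded lock is later taken again) is the generic lifetime rule for a lock inside a reference-counted object: destroy the mutex before freeing the memory, and never lock it through a stale pointer afterwards. The sketch below is a standalone stand-in, not struct iobref or the iobuf.c code; obj_new/obj_unref and the struct name are invented for the illustration.

/* Standalone sketch of the lock-lifetime rule under discussion; the struct
 * and helpers are invented stand-ins, not GlusterFS's struct iobref.
 * Compile: gcc -pthread -o lock-lifetime lock-lifetime.c */
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>

struct iobref_like {
    pthread_mutex_t lock;
    int             ref;
};

static struct iobref_like *
obj_new(void)
{
    struct iobref_like *o = calloc(1, sizeof(*o));
    pthread_mutex_init(&o->lock, NULL);
    o->ref = 1;
    return o;
}

static void
obj_unref(struct iobref_like *o)
{
    pthread_mutex_lock(&o->lock);
    int ref = --o->ref;
    pthread_mutex_unlock(&o->lock);

    if (ref == 0) {
        /* Destroy the embedded lock before freeing -- the analogue of the
         * LOCK_DESTROY(&iobref->lock) addition suggested earlier in this
         * thread. */
        pthread_mutex_destroy(&o->lock);
        free(o);
    }
    /* Any code path that still holds a stale pointer and locks o->lock after
     * this point is in undefined-behaviour territory: the memory may already
     * have been reused (compare the garbage __owner/__kind fields in the gdb
     * dump above), and the caller can block forever, consistent with the hang
     * observed in iobref_unref(). */
}

int main(void)
{
    struct iobref_like *o = obj_new();
    obj_unref(o);    /* last reference: lock destroyed, then memory freed */
    printf("teardown done\n");
    return 0;
}

The sketch only restates the lifetime rule; whether the real hang comes from the missing LOCK_DESTROY or from the use-after-free suspected above is exactly what is still being investigated in this thread.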
> > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Monday, April 15, 2019 11:52 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > Cynthia, > > > > On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > I made a patch and according to my test, this glusterd stuck issue > disappear with my patch. Only need to move event_handled to the end of > socket_event_poll_in function. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > > > - if (notify_handled && (ret != -1)) > > - event_handled (ctx->event_pool, priv->sock, priv->idx, > > - priv->gen); > > @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > pthread_mutex_unlock (&priv->notify.lock); > > } > > + if (notify_handled && (ret != -1)) > > + event_handled (ctx->event_pool, priv->sock, priv->idx, > > + priv->gen); > > > > Thanks for this tip. Though this helps in fixing the hang, this change has > performance impact. Moving event_handled to end of poll_in means, socket > will be added back for polling of new events only _after_ the rpc is msg is > processed by higher layers (like EC) and higher layers can have significant > latency for processing the msg. Which means, socket will be out of polling > for longer periods of time which decreases the throughput (number of msgs > read per second) affecting performance. However, this experiment definitely > indicates there is a codepath where event_handled is not called (and hence > causing the hang). I'll go through this codepath again. > > > > Can you check whether patch [1] fixes the issue you are seeing? > > > > [1] https://review.gluster.org/#/c/glusterfs/+/22566 > > > > > > Thanks for that experiment :). > > > > return ret; > > } > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Tuesday, April 09, 2019 3:57 PM > *To:* 'Raghavendra Gowdappa' > *Cc:* gluster-devel at gluster.org > *Subject:* RE: glusterd stuck for glusterfs with version 3.12.15 > > > > Can you figure out some possible reason why iobref is corrupted, is it > possible that thread 8 has called poll in and iobref has been relased, but > the lock within it is not properly released (as I can not find any free > lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy > again, and so stuck on this lock > > Also, there should not be two thread handling the same socket at the same > time, although there has been a patch claimed to tackle this issue. > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Tuesday, April 09, 2019 3:52 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. 
When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! > > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at 
event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. > > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@ > \000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, > ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} > > > > looks like the iobref is corrupted here. It seems to be a use-after-free > issue. We need to dig into why a freed iobref is being accessed here. > > > > (gdb) quit > > A debugging session is active. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 15 08:38:41 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 15 Apr 2019 08:38:41 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: <5593f364bd7f4dbeb9d1db7f7d17eaf8@nokia-sbell.com> Ok, I got your point, thanks for responding! cynthia From: Raghavendra Gowdappa Sent: Monday, April 15, 2019 4:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 15, 2019 at 12:52 PM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, The reason why I move event_handled to the end of socket_event_poll_in is because if event_handled is called before rpc_transport_pollin_destroy, it allowed another round of event_dispatch_epoll_handler happen before rpc_transport_pollin_destroy, in this way, when the latter poll in goes to rpc_transport_pollin_destroy , there is a chance that the pollin->iobref has already been destroyed by the first one(there is no lock destroy for iobref->lock in iobref_destroy by the way). That may cause stuck in ?LOCK (&iobref->lock);? I find the one of recent patch SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket I think the point is to avoid the concurrent handling of the same socket at the same time, but after my test with this patch this problem also exists, so I think event_handled is still called too early to allow concurrent handling of the same socket happen, But concurrent handling is required for performance. So, we cannot serialize it. and after move it to the end of socket_event_poll this glusterd stuck issue disappeared. Ideally there shouldn't be a single instance of datastructure that should be shared between two instances of pollin handling. My initial code-reading didn't find any issues with the way iobref is handled even when there is concurrent reading when the previous message was still not notified. I'll continue to investigate how objects are shared across two instances of pollin. Will post if I find anything interesting. 
cynthia From: Raghavendra Gowdappa > Sent: Monday, April 15, 2019 2:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Ok, thanks for your comment! cynthia From: Raghavendra Gowdappa > Sent: Monday, April 15, 2019 11:52 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 Cynthia, On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, I made a patch and according to my test, this glusterd stuck issue disappear with my patch. Only need to move event_handled to the end of socket_event_poll_in function. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } - if (notify_handled && (ret != -1)) - event_handled (ctx->event_pool, priv->sock, priv->idx, - priv->gen); @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } pthread_mutex_unlock (&priv->notify.lock); } + if (notify_handled && (ret != -1)) + event_handled (ctx->event_pool, priv->sock, priv->idx, + priv->gen); Thanks for this tip. Though this helps in fixing the hang, this change has performance impact. Moving event_handled to end of poll_in means, socket will be added back for polling of new events only _after_ the rpc is msg is processed by higher layers (like EC) and higher layers can have significant latency for processing the msg. Which means, socket will be out of polling for longer periods of time which decreases the throughput (number of msgs read per second) affecting performance. However, this experiment definitely indicates there is a codepath where event_handled is not called (and hence causing the hang). I'll go through this codepath again. Can you check whether patch [1] fixes the issue you are seeing? [1] https://review.gluster.org/#/c/glusterfs/+/22566 Thanks for that experiment :). return ret; } cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Tuesday, April 09, 2019 3:57 PM To: 'Raghavendra Gowdappa' > Cc: gluster-devel at gluster.org Subject: RE: glusterd stuck for glusterfs with version 3.12.15 Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. cynthia From: Raghavendra Gowdappa > Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. 
When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 
(gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. (gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From pkarampu at redhat.com Mon Apr 15 09:08:30 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Mon, 15 Apr 2019 14:38:30 +0530 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Thu, Apr 11, 2019 at 2:59 PM Xavi Hernandez wrote: > On Wed, Apr 10, 2019 at 7:25 PM Xavi Hernandez > wrote: > >> On Wed, Apr 10, 2019 at 4:01 PM Atin Mukherjee >> wrote: >> >>> And now for last 15 days: >>> >>> >>> https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 >>> >>> ./tests/bitrot/bug-1373520.t 18 ==> Fixed through >>> https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this >>> failing in brick mux post 5th April >>> ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, >>> needs analysis. >>> >> >> I've identified the problem here, but not the cause yet. There's a stale >> inodelk acquired by a process that is already dead, which causes inodelk >> requests from self-heal and other processes to block. >> >> The reason why it seemed to block in random places is that all commands >> are executed with the working directory pointing to a gluster directory >> which needs healing after the initial tests. Because of the stale inodelk, >> when any application tries to open a file in the working directory, it's >> blocked. >> >> I'll investigate what causes this. >> > > I think I've found the problem. 
This is a fragment of the brick log that > includes script steps, connections and disconnections of brick 0, and lock > requests to the problematic lock: > > [2019-04-11 08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ > [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ > [2019-04-11 08:22:23.709655] I [MSGID: 115029] > [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client > from > CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2 > (version: 7dev) with subvol /d/backends/patchy1 > [2019-04-11 08:22:23.792204] I [common.c:234:pl_trace_in] 8-patchy-locks: > [REQUEST] Locker = {Pid=29710, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, > Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:23.792299] I [common.c:285:pl_trace_out] 8-patchy-locks: > [GRANTED] Locker = {Pid=29710, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, > Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 68 5 online_brick_count ++++++++++ > [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o > 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ > [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 71 ec_test_make ++++++++++ > [2019-04-11 08:22:27.718963] I [MSGID: 115029] > [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client > from > CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3 > (version: 7dev) with subvol /d/backends/patchy1 > [2019-04-11 08:22:27.801416] I [common.c:234:pl_trace_in] 8-patchy-locks: > [REQUEST] Locker = {Pid=29885, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, > Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: > patchy-disperse-0, start=0, len=0, pid=0} > [2019-04-11 08:22:27.801434] E [inodelk.c:513:__inode_unlock_lock] > 8-patchy-locks: Matching lock not found for unlock 0-9223372036854775807, > by 68580998b47f0000 on 0x7f0ed0029190 > [2019-04-11 08:22:27.801446] I [common.c:285:pl_trace_out] 8-patchy-locks: > [Invalid argument] Locker = {Pid=29885, lk-owner=68580998b47f0000, > Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, > Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), > path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: > patchy-disperse-0, start=0, len=0, pid=0} > > This is a fragment of the client log: > > [2019-04-11 
08:22:20.381398]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 66 kill_brick patchy jahernan /d/backends/patchy2 ++++++++++ > [2019-04-11 08:22:20.675938] I [MSGID: 114018] > [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from > patchy-client-1. Client process will keep trying to connect to glusterd > until brick's port is available > [2019-04-11 08:22:21.674772] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (6). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:22.532646]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 67 kill_brick patchy jahernan /d/backends/patchy3 ++++++++++ > [2019-04-11 08:22:23.691171] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (8). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:23.710420] I [MSGID: 114046] > [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected > to patchy-client-1, attached to remote volume '/d/backends/patchy1'. > [2019-04-11 08:22:23.791635] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > [2019-04-11 08:22:24.460529] I [MSGID: 114018] > [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from > patchy-client-1. Client process will keep trying to connect to glusterd > until brick's port is available > [2019-04-11 08:22:24.628478]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 68 5 online_brick_count ++++++++++ > [2019-04-11 08:22:26.097092]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o 14.o 15.o 16.o 17.o 18.o 19.o 1.o > 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ > [2019-04-11 08:22:26.333740]:++++++++++ G_LOG:tests/bugs/ec/bug-1236065.t: > TEST: 71 ec_test_make ++++++++++ > [2019-04-11 08:22:27.719299] I [MSGID: 114046] > [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected > to patchy-client-1, attached to remote volume '/d/backends/patchy1'. > [2019-04-11 08:22:27.840342] W [MSGID: 122035] > [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation > with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' > with gfid 35743386-b7c2-41c9-aafd-6b13de216704 > > The problem happens for two things: > > 1. Brick 0 gets disconnected randomly (apparently), but the server side is > not aware of these disconnections. This causes that at 08:22:24.460529, the > client has already sent a successful INODELK request to brick 0. At this > point the connection is broken on the client side, but server side doesn't > get any notification, so it doesn't clear the locks. > 2. When client reconnects at 08:22:27.719299, a new connection is created, > and the servers does see this new connection (it creates a new client_t > structure). Then the client sends the unlock request, which fails on brick > 0 because locks xlators checks if the client is the same by comparing the > pointers, but they are different because of the reconnection. So the lock > is not unlocked and remains there, blocking all future inodelk requests. > > The first problem is why the client gets disconnected and the server > doesn't get any notification. 
The script is stopping bricks 2 and 3 when > this happens. Brick 0 shouldn't fail here. It seems related to the > > The second problem is that when we receive a new connection from a client > we already consider connected, we don't cleanup the old connection, which > should take care of the stale locks. > > The third problem is that locks xlator is checking if the client is the > same by comparing pointers of client_t structs instead of comparing > client_uid field, which remains the same. > > Adding +Raghavendra Gowdappa , +Pranith Kumar > Karampuri , +Krutika Dhananjay > , +Shyam Ranganathan and +Amar Tumballi > to help me identify why this is happening and > what's the best way to solve it. > If server gets disconnect notification, then everything will be solved. I think we need to find RC for that. Were you able to recreate it locally even if it is happening once in a while that is fine. > > Xavi > > >> Xavi >> >> ./tests/basic/uss.t 15 ==> happens in both brick mux and non >>> brick mux runs, test just simply times out. Needs urgent analysis. >>> ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through >>> https://review.gluster.org/#/c/22508/ , patch merged today. >>> ./tests/basic/volfile-sanity.t 8 ==> Some race, though this >>> succeeds in second attempt every time. >>> >>> There're plenty more with 5 instances of failure from many tests. We >>> need all maintainers/owners to look through these failures and fix them, we >>> certainly don't want to get into a stage where master is unstable and we >>> have to lock down the merges till all these failures are resolved. So >>> please help. >>> >>> (Please note fstat stats show up the retries as failures too which in a >>> way is right) >>> >>> >>> On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee >>> wrote: >>> >>>> [1] captures the test failures report since last 30 days and we'd need >>>> volunteers/component owners to see why the number of failures are so high >>>> against few tests. >>>> >>>> [1] >>>> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >>>> >>> _______________________________________________ >>> Gluster-devel mailing list >>> Gluster-devel at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... URL: From jahernan at redhat.com Mon Apr 15 14:19:50 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Mon, 15 Apr 2019 16:19:50 +0200 Subject: [Gluster-devel] test failure reports for last 15 days In-Reply-To: References: Message-ID: On Mon, Apr 15, 2019 at 11:08 AM Pranith Kumar Karampuri < pkarampu at redhat.com> wrote: > > > On Thu, Apr 11, 2019 at 2:59 PM Xavi Hernandez > wrote: > >> On Wed, Apr 10, 2019 at 7:25 PM Xavi Hernandez >> wrote: >> >>> On Wed, Apr 10, 2019 at 4:01 PM Atin Mukherjee >>> wrote: >>> >>>> And now for last 15 days: >>>> >>>> >>>> https://fstat.gluster.org/summary?start_date=2019-03-25&end_date=2019-04-10 >>>> >>>> ./tests/bitrot/bug-1373520.t 18 ==> Fixed through >>>> https://review.gluster.org/#/c/glusterfs/+/22481/, I don't see this >>>> failing in brick mux post 5th April >>>> ./tests/bugs/ec/bug-1236065.t 17 ==> happens only in brick mux, >>>> needs analysis. >>>> >>> >>> I've identified the problem here, but not the cause yet. There's a stale >>> inodelk acquired by a process that is already dead, which causes inodelk >>> requests from self-heal and other processes to block. 
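Regarding the third problem listed above (the locks xlator deciding whether two connections belong to the same client by comparing client_t pointers rather than the client_uid string): a reconnect allocates a fresh client_t, so pointer identity fails even though the uid is unchanged. The standalone sketch below is not the locks xlator itself; struct client_like and the helper are invented, and the uid string is only modeled on the CTX_ID identifiers in the logs.

/* Standalone sketch (invented types and helpers, not the locks xlator) of why
 * pointer comparison breaks across a reconnect while comparison of the stable
 * uid string still matches. Compile: gcc -o clientcmp clientcmp.c */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

struct client_like {
    char uid[160];   /* stable identity, e.g. "CTX_ID:...-PID:28900-HOST:..." */
};

static struct client_like *
connect_client(const char *uid)
{
    /* Each (re)connection gets a freshly allocated handle. */
    struct client_like *c = calloc(1, sizeof(*c));
    snprintf(c->uid, sizeof(c->uid), "%s", uid);
    return c;
}

int main(void)
{
    const char *uid = "CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-PID:28900";

    struct client_like *before = connect_client(uid);  /* lock was granted here   */
    struct client_like *after  = connect_client(uid);  /* unlock arrives via this */

    /* Pointer identity: the two handles differ, so an unlock matched this way
     * misses the existing lock and the lock goes stale. */
    printf("same by pointer: %s\n", (before == after) ? "yes" : "no");   /* no  */

    /* Identity by uid string survives the reconnect. */
    printf("same by uid:     %s\n",
           strcmp(before->uid, after->uid) == 0 ? "yes" : "no");         /* yes */

    free(before);
    free(after);
    return 0;
}

Whether switching the comparison to client_uid is the right fix, versus cleaning up the old connection's locks when the new connection arrives as also suggested above, is left to the maintainers in this thread.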
>>> >>> The reason why it seemed to block in random places is that all commands >>> are executed with the working directory pointing to a gluster directory >>> which needs healing after the initial tests. Because of the stale inodelk, >>> when any application tries to open a file in the working directory, it's >>> blocked. >>> >>> I'll investigate what causes this. >>> >> >> I think I've found the problem. This is a fragment of the brick log that >> includes script steps, connections and disconnections of brick 0, and lock >> requests to the problematic lock: >> >> [2019-04-11 08:22:20.381398]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 66 kill_brick patchy jahernan >> /d/backends/patchy2 ++++++++++ >> [2019-04-11 08:22:22.532646]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 67 kill_brick patchy jahernan >> /d/backends/patchy3 ++++++++++ >> [2019-04-11 08:22:23.709655] I [MSGID: 115029] >> [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client >> from >> CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2 >> (version: 7dev) with subvol /d/backends/patchy1 >> [2019-04-11 08:22:23.792204] I [common.c:234:pl_trace_in] 8-patchy-locks: >> [REQUEST] Locker = {Pid=29710, lk-owner=68580998b47f0000, >> Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, >> Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), >> path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: >> patchy-disperse-0, start=0, len=0, pid=0} >> [2019-04-11 08:22:23.792299] I [common.c:285:pl_trace_out] >> 8-patchy-locks: [GRANTED] Locker = {Pid=29710, lk-owner=68580998b47f0000, >> Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-2, >> Frame=18676} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), >> path=/test} Lock = {lock=INODELK, cmd=SETLK, type=WRITE, domain: >> patchy-disperse-0, start=0, len=0, pid=0} >> [2019-04-11 08:22:24.628478]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 68 5 online_brick_count ++++++++++ >> [2019-04-11 08:22:26.097092]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o >> 14.o 15.o 16.o 17.o 18.o 19.o 1.o 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ >> [2019-04-11 08:22:26.333740]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 71 ec_test_make ++++++++++ >> [2019-04-11 08:22:27.718963] I [MSGID: 115029] >> [server-handshake.c:550:server_setvolume] 0-patchy-server: accepted client >> from >> CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3 >> (version: 7dev) with subvol /d/backends/patchy1 >> [2019-04-11 08:22:27.801416] I [common.c:234:pl_trace_in] 8-patchy-locks: >> [REQUEST] Locker = {Pid=29885, lk-owner=68580998b47f0000, >> Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, >> Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), >> path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: >> patchy-disperse-0, start=0, len=0, pid=0} >> [2019-04-11 08:22:27.801434] E [inodelk.c:513:__inode_unlock_lock] >> 8-patchy-locks: Matching lock not found for unlock 0-9223372036854775807, >> by 68580998b47f0000 on 0x7f0ed0029190 >> [2019-04-11 08:22:27.801446] I [common.c:285:pl_trace_out] >> 
8-patchy-locks: [Invalid argument] Locker = {Pid=29885, >> lk-owner=68580998b47f0000, >> Client=CTX_ID:1c2952c2-e90f-4631-8712-170b8c05aa6e-GRAPH_ID:0-PID:28900-HOST:jahernan-PC_NAME:patchy-client-1-RECON_NO:-3, >> Frame=19233} Lockee = {gfid=35743386-b7c2-41c9-aafd-6b13de216704, fd=(nil), >> path=/test} Lock = {lock=INODELK, cmd=SETLK, type=UNLOCK, domain: >> patchy-disperse-0, start=0, len=0, pid=0} >> >> This is a fragment of the client log: >> >> [2019-04-11 08:22:20.381398]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 66 kill_brick patchy jahernan >> /d/backends/patchy2 ++++++++++ >> [2019-04-11 08:22:20.675938] I [MSGID: 114018] >> [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from >> patchy-client-1. Client process will keep trying to connect to glusterd >> until brick's port is available >> [2019-04-11 08:22:21.674772] W [MSGID: 122035] >> [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation >> with some subvolumes unavailable. (6). FOP : 'INODELK' failed on '/test' >> with gfid 35743386-b7c2-41c9-aafd-6b13de216704 >> [2019-04-11 08:22:22.532646]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 67 kill_brick patchy jahernan >> /d/backends/patchy3 ++++++++++ >> [2019-04-11 08:22:23.691171] W [MSGID: 122035] >> [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation >> with some subvolumes unavailable. (8). FOP : 'INODELK' failed on '/test' >> with gfid 35743386-b7c2-41c9-aafd-6b13de216704 >> [2019-04-11 08:22:23.710420] I [MSGID: 114046] >> [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected >> to patchy-client-1, attached to remote volume '/d/backends/patchy1'. >> [2019-04-11 08:22:23.791635] W [MSGID: 122035] >> [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation >> with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' >> with gfid 35743386-b7c2-41c9-aafd-6b13de216704 >> [2019-04-11 08:22:24.460529] I [MSGID: 114018] >> [client.c:2333:client_rpc_notify] 0-patchy-client-1: disconnected from >> patchy-client-1. Client process will keep trying to connect to glusterd >> until brick's port is available >> [2019-04-11 08:22:24.628478]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 68 5 online_brick_count ++++++++++ >> [2019-04-11 08:22:26.097092]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 70 rm -f 0.o 10.o 11.o 12.o 13.o >> 14.o 15.o 16.o 17.o 18.o 19.o 1.o 2.o 3.o 4.o 5.o 6.o 7.o 8.o 9.o ++++++++++ >> [2019-04-11 08:22:26.333740]:++++++++++ >> G_LOG:tests/bugs/ec/bug-1236065.t: TEST: 71 ec_test_make ++++++++++ >> [2019-04-11 08:22:27.719299] I [MSGID: 114046] >> [client-handshake.c:1106:client_setvolume_cbk] 0-patchy-client-1: Connected >> to patchy-client-1, attached to remote volume '/d/backends/patchy1'. >> [2019-04-11 08:22:27.840342] W [MSGID: 122035] >> [ec-common.c:654:ec_child_select] 0-patchy-disperse-0: Executing operation >> with some subvolumes unavailable. (C). FOP : 'INODELK' failed on '/test' >> with gfid 35743386-b7c2-41c9-aafd-6b13de216704 >> >> The problem happens for two things: >> >> 1. Brick 0 gets disconnected randomly (apparently), but the server side >> is not aware of these disconnections. This causes that at 08:22:24.460529, >> the client has already sent a successful INODELK request to brick 0. At >> this point the connection is broken on the client side, but server side >> doesn't get any notification, so it doesn't clear the locks. >> > 2. 
When client reconnects at 08:22:27.719299, a new connection is created, >> and the servers does see this new connection (it creates a new client_t >> structure). Then the client sends the unlock request, which fails on brick >> 0 because locks xlators checks if the client is the same by comparing the >> pointers, but they are different because of the reconnection. So the lock >> is not unlocked and remains there, blocking all future inodelk requests. >> >> The first problem is why the client gets disconnected and the server >> doesn't get any notification. The script is stopping bricks 2 and 3 when >> this happens. Brick 0 shouldn't fail here. It seems related to the >> >> The second problem is that when we receive a new connection from a client >> we already consider connected, we don't cleanup the old connection, which >> should take care of the stale locks. >> >> The third problem is that locks xlator is checking if the client is the >> same by comparing pointers of client_t structs instead of comparing >> client_uid field, which remains the same. >> >> Adding +Raghavendra Gowdappa , +Pranith Kumar >> Karampuri , +Krutika Dhananjay >> , +Shyam Ranganathan and +Amar Tumballi >> to help me identify why this is happening and >> what's the best way to solve it. >> > > If server gets disconnect notification, then everything will be solved. I > think we need to find RC for that. Were you able to recreate it locally > even if it is happening once in a while that is fine. > I think the problem is not the a missed disconnection notification on the server. The problem is that sometimes (not always but relatively frequently running the test script), when the script kills one brick, another one also gets disconnected from the client point of view. This is not correct. If the script wants to kill one brick, only that one should die. Otherwise the script can fail unexpectedly. Apparently there's no other problem that could justify why the brick has been disconnected. Xavi > >> >> Xavi >> >> >>> Xavi >>> >>> ./tests/basic/uss.t 15 ==> happens in both brick mux and >>>> non brick mux runs, test just simply times out. Needs urgent analysis. >>>> ./tests/basic/ec/ec-fix-openfd.t 13 ==> Fixed through >>>> https://review.gluster.org/#/c/22508/ , patch merged today. >>>> ./tests/basic/volfile-sanity.t 8 ==> Some race, though this >>>> succeeds in second attempt every time. >>>> >>>> There're plenty more with 5 instances of failure from many tests. We >>>> need all maintainers/owners to look through these failures and fix them, we >>>> certainly don't want to get into a stage where master is unstable and we >>>> have to lock down the merges till all these failures are resolved. So >>>> please help. >>>> >>>> (Please note fstat stats show up the retries as failures too which in a >>>> way is right) >>>> >>>> >>>> On Tue, Feb 26, 2019 at 5:27 PM Atin Mukherjee >>>> wrote: >>>> >>>>> [1] captures the test failures report since last 30 days and we'd need >>>>> volunteers/component owners to see why the number of failures are so high >>>>> against few tests. >>>>> >>>>> [1] >>>>> https://fstat.gluster.org/summary?start_date=2019-01-26&end_date=2019-02-25&job=all >>>>> >>>> _______________________________________________ >>>> Gluster-devel mailing list >>>> Gluster-devel at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>> >>> > > -- > Pranith > -------------- next part -------------- An HTML attachment was scrubbed... 
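To make the third problem above concrete, the short sketch below contrasts the two ways of deciding whether an unlock request comes from the holder of an existing lock. The structures and helper names are simplified stand-ins rather than the real locks xlator code; the only point is that a pointer to the client_t connection object changes on every reconnect, while the client_uid string survives it, so matching by UID would let the stale lock be found and released instead of failing with "Matching lock not found for unlock".

    /* Minimal sketch, assuming simplified stand-ins for client_t and a
     * posix lock record; this is not the actual GlusterFS source. */
    #include <string.h>

    typedef struct client {
            char *client_uid;  /* stable identity string, kept across reconnects */
    } client_t;

    typedef struct posix_lock {
            client_t *client;  /* connection that acquired the lock */
    } posix_lock_t;

    /* Pointer comparison: after a reconnect the server allocates a new
     * client_t, so this never matches the old lock and the unlock fails. */
    static int
    same_client_by_ptr(const posix_lock_t *lock, const client_t *unlocker)
    {
            return lock->client == unlocker;
    }

    /* UID comparison: the identity string is the same before and after the
     * reconnect, so the stale lock can still be matched and released. */
    static int
    same_client_by_uid(const posix_lock_t *lock, const client_t *unlocker)
    {
            return strcmp(lock->client->client_uid, unlocker->client_uid) == 0;
    }

Either check still depends on the second problem being solved as well (cleaning up the old connection's state when the same client reconnects); the UID comparison only removes the false negative during the unlock itself.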
URL: From srangana at redhat.com Tue Apr 16 13:53:04 2019 From: srangana at redhat.com (Shyam Ranganathan) Date: Tue, 16 Apr 2019 09:53:04 -0400 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: Message-ID: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Status: Tagging pending Waiting on patches: (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic https://review.gluster.org/c/glusterfs/+/22579 Following patches will not be taken in if CentOS regression does not pass by tomorrow morning Eastern TZ, (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of inodelk/entrylk https://review.gluster.org/c/glusterfs/+/22385 (Aravinda) - geo-rep: IPv6 support https://review.gluster.org/c/glusterfs/+/22488 (Aravinda) - geo-rep: fix integer config validation https://review.gluster.org/c/glusterfs/+/22489 Tracker bug status: (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in RHGSWA. All patches are merged, but none of the patches adds the "Fixes" keyword, assume this is an oversight and that the bug is fixed in this release. (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for the same brick, causing transport endpoint not connected No work has occurred post logs upload to bug, restart of bircks and possibly glusterd is the existing workaround when the bug is hit. Moving this out of the tracker for 6.1. (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when replace-brick is executed Very recent bug (15th April), does not seem to have any critical data corruption or service availability issues, planning on not waiting for the fix in 6.1 - Shyam On 4/6/19 4:38 AM, Atin Mukherjee wrote: > Hi Mohit, > > https://review.gluster.org/22495 should get into 6.1 as it?s a > regression. Can you please attach the respective bug to the tracker Ravi > pointed out? > > > On Sat, 6 Apr 2019 at 12:00, Ravishankar N > wrote: > > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in > case anyone wants to add blocker bugs. > > > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: > > Hi, > > > > Expected tagging date for release-6.1 is on April, 10th, 2019. > > > > Please ensure required patches are backported and also are passing > > regressions and are appropriately reviewed for easy merging and > tagging > > on the date. 
> > > > Thanks, > > Shyam > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- > - Atin (atinm) > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > From amukherj at redhat.com Tue Apr 16 15:49:47 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Tue, 16 Apr 2019 21:19:47 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan wrote: > Status: Tagging pending > > Waiting on patches: > (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic > https://review.gluster.org/c/glusterfs/+/22579 The regression doesn't pass for the mainline patch. I believe master is broken now. With latest master sdfs-sanity.t always fail. We either need to fix it or mark it as bad test. root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t tests/basic/sdfs-sanity.t .. 1..7 ok 1, LINENUM:8 ok 2, LINENUM:9 ok 3, LINENUM:11 ok 4, LINENUM:12 ok 5, LINENUM:13 ok 6, LINENUM:16 mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid argument stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected not ok 7 , LINENUM:20 FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 Failed 1/7 subtests Test Summary Report ------------------- tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) Failed test: 7 Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr 0.67 csys = 1.27 CPU) Result: FAIL > > Following patches will not be taken in if CentOS regression does not > pass by tomorrow morning Eastern TZ, > (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of > inodelk/entrylk > https://review.gluster.org/c/glusterfs/+/22385 > (Aravinda) - geo-rep: IPv6 support > https://review.gluster.org/c/glusterfs/+/22488 > (Aravinda) - geo-rep: fix integer config validation > https://review.gluster.org/c/glusterfs/+/22489 > > Tracker bug status: > (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in > RHGSWA. > All patches are merged, but none of the patches adds the "Fixes" > keyword, assume this is an oversight and that the bug is fixed in this > release. > > (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for > the same brick, causing transport endpoint not connected > No work has occurred post logs upload to bug, restart of bircks and > possibly glusterd is the existing workaround when the bug is hit. Moving > this out of the tracker for 6.1. > > (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when > replace-brick is executed > Very recent bug (15th April), does not seem to have any critical data > corruption or service availability issues, planning on not waiting for > the fix in 6.1 > > - Shyam > On 4/6/19 4:38 AM, Atin Mukherjee wrote: > > Hi Mohit, > > > > https://review.gluster.org/22495 should get into 6.1 as it?s a > > regression. 
Can you please attach the respective bug to the tracker Ravi > > pointed out? > > > > > > On Sat, 6 Apr 2019 at 12:00, Ravishankar N > > wrote: > > > > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, > in > > case anyone wants to add blocker bugs. > > > > > > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: > > > Hi, > > > > > > Expected tagging date for release-6.1 is on April, 10th, 2019. > > > > > > Please ensure required patches are backported and also are passing > > > regressions and are appropriately reviewed for easy merging and > > tagging > > > on the date. > > > > > > Thanks, > > > Shyam > > > _______________________________________________ > > > Gluster-devel mailing list > > > Gluster-devel at gluster.org > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > -- > > - Atin (atinm) > > > > _______________________________________________ > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Tue Apr 16 16:56:54 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Tue, 16 Apr 2019 22:26:54 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee wrote: > > > On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan > wrote: > >> Status: Tagging pending >> >> Waiting on patches: >> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >> https://review.gluster.org/c/glusterfs/+/22579 > > > The regression doesn't pass for the mainline patch. I believe master is > broken now. With latest master sdfs-sanity.t always fail. We either need to > fix it or mark it as bad test. > commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) Author: Pranith Kumar K Date: Mon Apr 1 11:14:56 2019 +0530 features/locks: error-out {inode,entry}lk fops with all-zero lk-owner Problem: Sometimes we find that developers forget to assign lk-owner for an inodelk/entrylk/lk before writing code to wind these fops. locks xlator at the moment allows this operation. This leads to multiple threads in the same client being able to get locks on the inode because lk-owner is same and transport is same. So isolation with locks can't be achieved. Fix: Disallow locks with lk-owner zero. fixes bz#1624701 Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f Signed-off-by: Pranith Kumar K With the above commit sdfs-sanity.t started failing. But when I looked at the last regression vote at https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw it voted back positive but the bell rang when I saw the overall regression took less than 2 hours and when I opened the regression link I saw the test actually failed but still this job voted back +1 at gerrit. *Deepshika* - *This is a bad CI bug we have now and have to be addressed at earliest. 
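For readers skimming the thread, the fix described in the commit message quoted above boils down to a guard of roughly the following shape, plus callers assigning a real owner before winding lock fops. This is only an illustrative sketch with made-up types and helper names, not the code of commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 or of any follow-up patch; the idea is simply that an inodelk/entrylk/lk request whose lk-owner is all zeroes is answered with EINVAL instead of being merged with every other zero-owner lock from the same client and transport.

    /* Illustrative sketch only; types and names are simplified stand-ins. */
    #include <errno.h>
    #include <stdint.h>
    #include <string.h>

    typedef struct {
            int  len;
            char data[1024];
    } lk_owner_t;

    /* True when the caller never assigned an lk-owner before winding the fop. */
    static int
    lk_owner_is_zero(const lk_owner_t *owner)
    {
            for (int i = 0; i < owner->len; i++) {
                    if (owner->data[i] != 0)
                            return 0;
            }
            return 1;
    }

    /* Guard at the top of an inodelk/entrylk/lk handler: reject the request
     * rather than granting a lock that cannot be told apart from other locks
     * taken by the same client with the same zero owner. */
    static int
    validate_lock_owner(const lk_owner_t *owner)
    {
            if (owner->len == 0 || lk_owner_is_zero(owner))
                    return -EINVAL;  /* surfaces as "Invalid argument" in the logs */
            return 0;
    }

    /* What a caller is expected to do instead: derive a non-zero owner from
     * something unique to the in-flight operation, for example the frame
     * address, before winding the lock fop. */
    static void
    set_owner_from_ptr(lk_owner_t *owner, const void *ptr)
    {
            uint64_t value = (uint64_t)(uintptr_t)ptr;

            owner->len = (int)sizeof(value);
            memset(owner->data, 0, sizeof(owner->data));
            memcpy(owner->data, &value, sizeof(value));
    }

As the rest of the thread shows, a caller winding entrylk without ever setting an owner is exactly what makes sdfs-sanity.t fail once such a guard is in place.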
Please take a look at https://build.gluster.org/job/centos7-regression/5568/consoleFull and investigate why the regression vote wasn't negative.* Pranith - I request you to investigate on the sdfs-sanity.t failure because of this patch. *@Maintainers - Please open up every regression link to see the actual status of the job and don't blindly trust on the +1 vote back at gerrit till this is addressed.* As per the policy, I'm going to revert this commit, watch out for the patch. I request this to be directly pushed with out waiting for the regression vote as we had done before in such breakage. Amar/Shyam - I believe you have this permission? > root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t > tests/basic/sdfs-sanity.t .. > 1..7 > ok 1, LINENUM:8 > ok 2, LINENUM:9 > ok 3, LINENUM:11 > ok 4, LINENUM:12 > ok 5, LINENUM:13 > ok 6, LINENUM:16 > mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid > argument > stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument > tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected > not ok 7 , LINENUM:20 > FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 > Failed 1/7 subtests > > Test Summary Report > ------------------- > tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) > Failed test: 7 > Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr > 0.67 csys = 1.27 CPU) > Result: FAIL > > >> >> Following patches will not be taken in if CentOS regression does not >> pass by tomorrow morning Eastern TZ, >> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >> inodelk/entrylk >> https://review.gluster.org/c/glusterfs/+/22385 >> (Aravinda) - geo-rep: IPv6 support >> https://review.gluster.org/c/glusterfs/+/22488 >> (Aravinda) - geo-rep: fix integer config validation >> https://review.gluster.org/c/glusterfs/+/22489 >> >> Tracker bug status: >> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >> RHGSWA. >> All patches are merged, but none of the patches adds the "Fixes" >> keyword, assume this is an oversight and that the bug is fixed in this >> release. >> >> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for >> the same brick, causing transport endpoint not connected >> No work has occurred post logs upload to bug, restart of bircks and >> possibly glusterd is the existing workaround when the bug is hit. Moving >> this out of the tracker for 6.1. >> >> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >> replace-brick is executed >> Very recent bug (15th April), does not seem to have any critical data >> corruption or service availability issues, planning on not waiting for >> the fix in 6.1 >> >> - Shyam >> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >> > Hi Mohit, >> > >> > https://review.gluster.org/22495 should get into 6.1 as it?s a >> > regression. Can you please attach the respective bug to the tracker Ravi >> > pointed out? >> > >> > >> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N > > > wrote: >> > >> > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, >> in >> > case anyone wants to add blocker bugs. >> > >> > >> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >> > > Hi, >> > > >> > > Expected tagging date for release-6.1 is on April, 10th, 2019. >> > > >> > > Please ensure required patches are backported and also are passing >> > > regressions and are appropriately reviewed for easy merging and >> > tagging >> > > on the date. 
>> > > >> > > Thanks, >> > > Shyam >> > > _______________________________________________ >> > > Gluster-devel mailing list >> > > Gluster-devel at gluster.org >> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >> > _______________________________________________ >> > Gluster-devel mailing list >> > Gluster-devel at gluster.org >> > https://lists.gluster.org/mailman/listinfo/gluster-devel >> > >> > >> > -- >> > - Atin (atinm) >> > >> > _______________________________________________ >> > Gluster-devel mailing list >> > Gluster-devel at gluster.org >> > https://lists.gluster.org/mailman/listinfo/gluster-devel >> > >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Tue Apr 16 17:11:10 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Tue, 16 Apr 2019 22:41:10 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Tue, Apr 16, 2019 at 10:26 PM Atin Mukherjee wrote: > > > On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee > wrote: > >> >> >> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan >> wrote: >> >>> Status: Tagging pending >>> >>> Waiting on patches: >>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>> https://review.gluster.org/c/glusterfs/+/22579 >> >> >> The regression doesn't pass for the mainline patch. I believe master is >> broken now. With latest master sdfs-sanity.t always fail. We either need to >> fix it or mark it as bad test. >> > > commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) > Author: Pranith Kumar K > Date: Mon Apr 1 11:14:56 2019 +0530 > > features/locks: error-out {inode,entry}lk fops with all-zero lk-owner > > Problem: > Sometimes we find that developers forget to assign lk-owner for an > inodelk/entrylk/lk before writing code to wind these fops. locks > xlator at the moment allows this operation. This leads to multiple > threads in the same client being able to get locks on the inode > because lk-owner is same and transport is same. So isolation > with locks can't be achieved. > > Fix: > Disallow locks with lk-owner zero. > > fixes bz#1624701 > Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f > Signed-off-by: Pranith Kumar K > > With the above commit sdfs-sanity.t started failing. But when I looked at > the last regression vote at > https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw > it voted back positive but the bell rang when I saw the overall regression > took less than 2 hours and when I opened the regression link I saw the test > actually failed but still this job voted back +1 at gerrit. > > *Deepshika* - *This is a bad CI bug we have now and have to be addressed > at earliest. Please take a look at > https://build.gluster.org/job/centos7-regression/5568/consoleFull > and > investigate why the regression vote wasn't negative.* > > Pranith - I request you to investigate on the sdfs-sanity.t failure > because of this patch. > > *@Maintainers - Please open up every regression link to see the actual > status of the job and don't blindly trust on the +1 vote back at gerrit > till this is addressed.* > > As per the policy, I'm going to revert this commit, watch out for the > patch. 
> https://review.gluster.org/#/c/glusterfs/+/22581/ Please review and merge it. Also since we're already close to 23:00 in IST timezone, I need help from folks from other timezone in getting https://review.gluster.org/#/c/glusterfs/+/22578/ rebased and marked verified +1 once the above fix is merged. This is a blocker to glusterfs-6.1 as otherwise ctime feature option tuning isn't honoured. I request this to be directly pushed with out waiting for the regression > vote as we had done before in such breakage. Amar/Shyam - I believe you > have this permission? > > >> root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t >> tests/basic/sdfs-sanity.t .. >> 1..7 >> ok 1, LINENUM:8 >> ok 2, LINENUM:9 >> ok 3, LINENUM:11 >> ok 4, LINENUM:12 >> ok 5, LINENUM:13 >> ok 6, LINENUM:16 >> mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid >> argument >> stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument >> tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected >> not ok 7 , LINENUM:20 >> FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 >> Failed 1/7 subtests >> >> Test Summary Report >> ------------------- >> tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) >> Failed test: 7 >> Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr >> 0.67 csys = 1.27 CPU) >> Result: FAIL >> >> >>> >>> Following patches will not be taken in if CentOS regression does not >>> pass by tomorrow morning Eastern TZ, >>> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >>> inodelk/entrylk >>> https://review.gluster.org/c/glusterfs/+/22385 >>> (Aravinda) - geo-rep: IPv6 support >>> https://review.gluster.org/c/glusterfs/+/22488 >>> (Aravinda) - geo-rep: fix integer config validation >>> https://review.gluster.org/c/glusterfs/+/22489 >>> >>> Tracker bug status: >>> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >>> RHGSWA. >>> All patches are merged, but none of the patches adds the "Fixes" >>> keyword, assume this is an oversight and that the bug is fixed in this >>> release. >>> >>> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for >>> the same brick, causing transport endpoint not connected >>> No work has occurred post logs upload to bug, restart of bircks and >>> possibly glusterd is the existing workaround when the bug is hit. Moving >>> this out of the tracker for 6.1. >>> >>> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >>> replace-brick is executed >>> Very recent bug (15th April), does not seem to have any critical data >>> corruption or service availability issues, planning on not waiting for >>> the fix in 6.1 >>> >>> - Shyam >>> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >>> > Hi Mohit, >>> > >>> > https://review.gluster.org/22495 should get into 6.1 as it?s a >>> > regression. Can you please attach the respective bug to the tracker >>> Ravi >>> > pointed out? >>> > >>> > >>> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N >> > > wrote: >>> > >>> > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, >>> in >>> > case anyone wants to add blocker bugs. >>> > >>> > >>> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >>> > > Hi, >>> > > >>> > > Expected tagging date for release-6.1 is on April, 10th, 2019. >>> > > >>> > > Please ensure required patches are backported and also are >>> passing >>> > > regressions and are appropriately reviewed for easy merging and >>> > tagging >>> > > on the date. 
>>> > > >>> > > Thanks, >>> > > Shyam >>> > > _______________________________________________ >>> > > Gluster-devel mailing list >>> > > Gluster-devel at gluster.org >>> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > _______________________________________________ >>> > Gluster-devel mailing list >>> > Gluster-devel at gluster.org >>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > >>> > >>> > -- >>> > - Atin (atinm) >>> > >>> > _______________________________________________ >>> > Gluster-devel mailing list >>> > Gluster-devel at gluster.org >>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > >>> _______________________________________________ >>> Gluster-devel mailing list >>> Gluster-devel at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From sankarshan.mukhopadhyay at gmail.com Tue Apr 16 17:50:45 2019 From: sankarshan.mukhopadhyay at gmail.com (Sankarshan Mukhopadhyay) Date: Tue, 16 Apr 2019 23:20:45 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Tue, Apr 16, 2019 at 10:27 PM Atin Mukherjee wrote: > On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee wrote: >> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan wrote: >>> >>> Status: Tagging pending >>> >>> Waiting on patches: >>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>> https://review.gluster.org/c/glusterfs/+/22579 >> >> >> The regression doesn't pass for the mainline patch. I believe master is broken now. With latest master sdfs-sanity.t always fail. We either need to fix it or mark it as bad test. > > > commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) > Author: Pranith Kumar K > Date: Mon Apr 1 11:14:56 2019 +0530 > > features/locks: error-out {inode,entry}lk fops with all-zero lk-owner > > Problem: > Sometimes we find that developers forget to assign lk-owner for an > inodelk/entrylk/lk before writing code to wind these fops. locks > xlator at the moment allows this operation. This leads to multiple > threads in the same client being able to get locks on the inode > because lk-owner is same and transport is same. So isolation > with locks can't be achieved. > > Fix: > Disallow locks with lk-owner zero. > > fixes bz#1624701 > Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f > Signed-off-by: Pranith Kumar K > > With the above commit sdfs-sanity.t started failing. But when I looked at the last regression vote at https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw it voted back positive but the bell rang when I saw the overall regression took less than 2 hours and when I opened the regression link I saw the test actually failed but still this job voted back +1 at gerrit. > > Deepshika - This is a bad CI bug we have now and have to be addressed at earliest. Please take a look at https://build.gluster.org/job/centos7-regression/5568/consoleFull and investigate why the regression vote wasn't negative. Atin, we (Deepshikha and I) agree with your assessment. This is the kind of situation that reduces the trust in our application build pipeline. This is a result of a minor change introduced to fix the constant issue we have observed with non-voting. This is something that should not have slipped through and it did. 
We will be observing a random sampling of the jobs to ensure that we gate any such incidents that reduce the utility value of the pipeline. We will be reviewing the change to the scripts which have since also had the fix for the issue which led to this situation in the first place. From pkarampu at redhat.com Tue Apr 16 19:02:56 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Wed, 17 Apr 2019 00:32:56 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Tue, Apr 16, 2019 at 10:27 PM Atin Mukherjee wrote: > > > On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee > wrote: > >> >> >> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan >> wrote: >> >>> Status: Tagging pending >>> >>> Waiting on patches: >>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>> https://review.gluster.org/c/glusterfs/+/22579 >> >> >> The regression doesn't pass for the mainline patch. I believe master is >> broken now. With latest master sdfs-sanity.t always fail. We either need to >> fix it or mark it as bad test. >> > > commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) > Author: Pranith Kumar K > Date: Mon Apr 1 11:14:56 2019 +0530 > > features/locks: error-out {inode,entry}lk fops with all-zero lk-owner > > Problem: > Sometimes we find that developers forget to assign lk-owner for an > inodelk/entrylk/lk before writing code to wind these fops. locks > xlator at the moment allows this operation. This leads to multiple > threads in the same client being able to get locks on the inode > because lk-owner is same and transport is same. So isolation > with locks can't be achieved. > > Fix: > Disallow locks with lk-owner zero. > > fixes bz#1624701 > Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f > Signed-off-by: Pranith Kumar K > > With the above commit sdfs-sanity.t started failing. But when I looked at > the last regression vote at > https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw > it voted back positive but the bell rang when I saw the overall regression > took less than 2 hours and when I opened the regression link I saw the test > actually failed but still this job voted back +1 at gerrit. > > *Deepshika* - *This is a bad CI bug we have now and have to be addressed > at earliest. Please take a look at > https://build.gluster.org/job/centos7-regression/5568/consoleFull > and > investigate why the regression vote wasn't negative.* > > Pranith - I request you to investigate on the sdfs-sanity.t failure > because of this patch. > sdfs is supposed to serialize entry fops by doing entrylk, but all the locks are being done with all-zero lk-owner. In essence sdfs doesn't achieve its goal of mutual exclusion when conflicting operations are executed by same client because two locks on same entry with same all-zero-owner will get locks. The patch which lead to sdfs-sanity.t failure treats inodelk/entrylk/lk fops with all-zero lk-owner as Invalid request to prevent these kinds of bugs. So it exposed the bug in sdfs. I sent a fix for sdfs @ https://review.gluster.org/#/c/glusterfs/+/22582 > *@Maintainers - Please open up every regression link to see the actual > status of the job and don't blindly trust on the +1 vote back at gerrit > till this is addressed.* > > As per the policy, I'm going to revert this commit, watch out for the > patch. 
I request this to be directly pushed with out waiting for the > regression vote as we had done before in such breakage. Amar/Shyam - I > believe you have this permission? > > >> root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t >> tests/basic/sdfs-sanity.t .. >> 1..7 >> ok 1, LINENUM:8 >> ok 2, LINENUM:9 >> ok 3, LINENUM:11 >> ok 4, LINENUM:12 >> ok 5, LINENUM:13 >> ok 6, LINENUM:16 >> mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid >> argument >> stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument >> tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected >> not ok 7 , LINENUM:20 >> FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 >> Failed 1/7 subtests >> >> Test Summary Report >> ------------------- >> tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) >> Failed test: 7 >> Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr >> 0.67 csys = 1.27 CPU) >> Result: FAIL >> >> >>> >>> Following patches will not be taken in if CentOS regression does not >>> pass by tomorrow morning Eastern TZ, >>> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >>> inodelk/entrylk >>> https://review.gluster.org/c/glusterfs/+/22385 >>> (Aravinda) - geo-rep: IPv6 support >>> https://review.gluster.org/c/glusterfs/+/22488 >>> (Aravinda) - geo-rep: fix integer config validation >>> https://review.gluster.org/c/glusterfs/+/22489 >>> >>> Tracker bug status: >>> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >>> RHGSWA. >>> All patches are merged, but none of the patches adds the "Fixes" >>> keyword, assume this is an oversight and that the bug is fixed in this >>> release. >>> >>> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for >>> the same brick, causing transport endpoint not connected >>> No work has occurred post logs upload to bug, restart of bircks and >>> possibly glusterd is the existing workaround when the bug is hit. Moving >>> this out of the tracker for 6.1. >>> >>> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >>> replace-brick is executed >>> Very recent bug (15th April), does not seem to have any critical data >>> corruption or service availability issues, planning on not waiting for >>> the fix in 6.1 >>> >>> - Shyam >>> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >>> > Hi Mohit, >>> > >>> > https://review.gluster.org/22495 should get into 6.1 as it?s a >>> > regression. Can you please attach the respective bug to the tracker >>> Ravi >>> > pointed out? >>> > >>> > >>> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N >> > > wrote: >>> > >>> > Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, >>> in >>> > case anyone wants to add blocker bugs. >>> > >>> > >>> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >>> > > Hi, >>> > > >>> > > Expected tagging date for release-6.1 is on April, 10th, 2019. >>> > > >>> > > Please ensure required patches are backported and also are >>> passing >>> > > regressions and are appropriately reviewed for easy merging and >>> > tagging >>> > > on the date. 
>>> > > >>> > > Thanks, >>> > > Shyam >>> > > _______________________________________________ >>> > > Gluster-devel mailing list >>> > > Gluster-devel at gluster.org >>> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > _______________________________________________ >>> > Gluster-devel mailing list >>> > Gluster-devel at gluster.org >>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > >>> > >>> > -- >>> > - Atin (atinm) >>> > >>> > _______________________________________________ >>> > Gluster-devel mailing list >>> > Gluster-devel at gluster.org >>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>> > >>> _______________________________________________ >>> Gluster-devel mailing list >>> Gluster-devel at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Wed Apr 17 02:34:13 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 17 Apr 2019 08:04:13 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Wed, Apr 17, 2019 at 12:33 AM Pranith Kumar Karampuri < pkarampu at redhat.com> wrote: > > > On Tue, Apr 16, 2019 at 10:27 PM Atin Mukherjee > wrote: > >> >> >> On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee >> wrote: >> >>> >>> >>> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan >>> wrote: >>> >>>> Status: Tagging pending >>>> >>>> Waiting on patches: >>>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>>> https://review.gluster.org/c/glusterfs/+/22579 >>> >>> >>> The regression doesn't pass for the mainline patch. I believe master is >>> broken now. With latest master sdfs-sanity.t always fail. We either need to >>> fix it or mark it as bad test. >>> >> >> commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) >> Author: Pranith Kumar K >> Date: Mon Apr 1 11:14:56 2019 +0530 >> >> features/locks: error-out {inode,entry}lk fops with all-zero lk-owner >> >> Problem: >> Sometimes we find that developers forget to assign lk-owner for an >> inodelk/entrylk/lk before writing code to wind these fops. locks >> xlator at the moment allows this operation. This leads to multiple >> threads in the same client being able to get locks on the inode >> because lk-owner is same and transport is same. So isolation >> with locks can't be achieved. >> >> Fix: >> Disallow locks with lk-owner zero. >> >> fixes bz#1624701 >> Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f >> Signed-off-by: Pranith Kumar K >> >> With the above commit sdfs-sanity.t started failing. But when I looked at >> the last regression vote at >> https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw >> it voted back positive but the bell rang when I saw the overall regression >> took less than 2 hours and when I opened the regression link I saw the test >> actually failed but still this job voted back +1 at gerrit. >> >> *Deepshika* - *This is a bad CI bug we have now and have to be addressed >> at earliest. Please take a look at >> https://build.gluster.org/job/centos7-regression/5568/consoleFull >> and >> investigate why the regression vote wasn't negative.* >> >> Pranith - I request you to investigate on the sdfs-sanity.t failure >> because of this patch. >> > > sdfs is supposed to serialize entry fops by doing entrylk, but all the > locks are being done with all-zero lk-owner. 
In essence sdfs doesn't > achieve its goal of mutual exclusion when conflicting operations are > executed by same client because two locks on same entry with same > all-zero-owner will get locks. The patch which lead to sdfs-sanity.t > failure treats inodelk/entrylk/lk fops with all-zero lk-owner as Invalid > request to prevent these kinds of bugs. So it exposed the bug in sdfs. I > sent a fix for sdfs @ https://review.gluster.org/#/c/glusterfs/+/22582 > Since this patch hasn't passed the regression and now that I see tests/bugs/replicate/bug-1386188-sbrain-fav-child.t hanging and timing out in the latest nightly regression runs because of the above commit (tested locally and confirm) I still request that we first revert this commit, get master back to stable and then put back the required fixes. > >> *@Maintainers - Please open up every regression link to see the actual >> status of the job and don't blindly trust on the +1 vote back at gerrit >> till this is addressed.* >> >> As per the policy, I'm going to revert this commit, watch out for the >> patch. I request this to be directly pushed with out waiting for the >> regression vote as we had done before in such breakage. Amar/Shyam - I >> believe you have this permission? >> > >> >>> root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t >>> tests/basic/sdfs-sanity.t .. >>> 1..7 >>> ok 1, LINENUM:8 >>> ok 2, LINENUM:9 >>> ok 3, LINENUM:11 >>> ok 4, LINENUM:12 >>> ok 5, LINENUM:13 >>> ok 6, LINENUM:16 >>> mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid >>> argument >>> stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument >>> tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected >>> not ok 7 , LINENUM:20 >>> FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 >>> Failed 1/7 subtests >>> >>> Test Summary Report >>> ------------------- >>> tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) >>> Failed test: 7 >>> Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr >>> 0.67 csys = 1.27 CPU) >>> Result: FAIL >>> >>> >>>> >>>> Following patches will not be taken in if CentOS regression does not >>>> pass by tomorrow morning Eastern TZ, >>>> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >>>> inodelk/entrylk >>>> https://review.gluster.org/c/glusterfs/+/22385 >>>> (Aravinda) - geo-rep: IPv6 support >>>> https://review.gluster.org/c/glusterfs/+/22488 >>>> (Aravinda) - geo-rep: fix integer config validation >>>> https://review.gluster.org/c/glusterfs/+/22489 >>>> >>>> Tracker bug status: >>>> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >>>> RHGSWA. >>>> All patches are merged, but none of the patches adds the "Fixes" >>>> keyword, assume this is an oversight and that the bug is fixed in this >>>> release. >>>> >>>> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for >>>> the same brick, causing transport endpoint not connected >>>> No work has occurred post logs upload to bug, restart of bircks and >>>> possibly glusterd is the existing workaround when the bug is hit. Moving >>>> this out of the tracker for 6.1. 
>>>> >>>> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >>>> replace-brick is executed >>>> Very recent bug (15th April), does not seem to have any critical data >>>> corruption or service availability issues, planning on not waiting for >>>> the fix in 6.1 >>>> >>>> - Shyam >>>> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >>>> > Hi Mohit, >>>> > >>>> > https://review.gluster.org/22495 should get into 6.1 as it?s a >>>> > regression. Can you please attach the respective bug to the tracker >>>> Ravi >>>> > pointed out? >>>> > >>>> > >>>> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N >>> > > wrote: >>>> > >>>> > Tracker bug is >>>> https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in >>>> > case anyone wants to add blocker bugs. >>>> > >>>> > >>>> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >>>> > > Hi, >>>> > > >>>> > > Expected tagging date for release-6.1 is on April, 10th, 2019. >>>> > > >>>> > > Please ensure required patches are backported and also are >>>> passing >>>> > > regressions and are appropriately reviewed for easy merging and >>>> > tagging >>>> > > on the date. >>>> > > >>>> > > Thanks, >>>> > > Shyam >>>> > > _______________________________________________ >>>> > > Gluster-devel mailing list >>>> > > Gluster-devel at gluster.org >>>> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>> > _______________________________________________ >>>> > Gluster-devel mailing list >>>> > Gluster-devel at gluster.org >>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>> > >>>> > >>>> > -- >>>> > - Atin (atinm) >>>> > >>>> > _______________________________________________ >>>> > Gluster-devel mailing list >>>> > Gluster-devel at gluster.org >>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>> > >>>> _______________________________________________ >>>> Gluster-devel mailing list >>>> Gluster-devel at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>> >>> > > -- > Pranith > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Wed Apr 17 03:23:26 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Wed, 17 Apr 2019 08:53:26 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: My take is, lets disable sdfs for 6.1 (we also have issues with its performance anyways). We will fix it properly by 6.2 or 7.0. Continue with marking sdfs-sanity.t tests as bad in that case. -Amar On Wed, Apr 17, 2019 at 8:04 AM Atin Mukherjee wrote: > > > On Wed, Apr 17, 2019 at 12:33 AM Pranith Kumar Karampuri < > pkarampu at redhat.com> wrote: > >> >> >> On Tue, Apr 16, 2019 at 10:27 PM Atin Mukherjee >> wrote: >> >>> >>> >>> On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee >>> wrote: >>> >>>> >>>> >>>> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan >>>> wrote: >>>> >>>>> Status: Tagging pending >>>>> >>>>> Waiting on patches: >>>>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>>>> https://review.gluster.org/c/glusterfs/+/22579 >>>> >>>> >>>> The regression doesn't pass for the mainline patch. I believe master is >>>> broken now. With latest master sdfs-sanity.t always fail. We either need to >>>> fix it or mark it as bad test. 
>>>> >>> >>> commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) >>> Author: Pranith Kumar K >>> Date: Mon Apr 1 11:14:56 2019 +0530 >>> >>> features/locks: error-out {inode,entry}lk fops with all-zero lk-owner >>> >>> Problem: >>> Sometimes we find that developers forget to assign lk-owner for an >>> inodelk/entrylk/lk before writing code to wind these fops. locks >>> xlator at the moment allows this operation. This leads to multiple >>> threads in the same client being able to get locks on the inode >>> because lk-owner is same and transport is same. So isolation >>> with locks can't be achieved. >>> >>> Fix: >>> Disallow locks with lk-owner zero. >>> >>> fixes bz#1624701 >>> Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f >>> Signed-off-by: Pranith Kumar K >>> >>> With the above commit sdfs-sanity.t started failing. But when I looked >>> at the last regression vote at >>> https://build.gluster.org/job/centos7-regression/5568/consoleFull I saw >>> it voted back positive but the bell rang when I saw the overall regression >>> took less than 2 hours and when I opened the regression link I saw the test >>> actually failed but still this job voted back +1 at gerrit. >>> >>> *Deepshika* - *This is a bad CI bug we have now and have to be >>> addressed at earliest. Please take a look at >>> https://build.gluster.org/job/centos7-regression/5568/consoleFull >>> and >>> investigate why the regression vote wasn't negative.* >>> >>> Pranith - I request you to investigate on the sdfs-sanity.t failure >>> because of this patch. >>> >> >> sdfs is supposed to serialize entry fops by doing entrylk, but all the >> locks are being done with all-zero lk-owner. In essence sdfs doesn't >> achieve its goal of mutual exclusion when conflicting operations are >> executed by same client because two locks on same entry with same >> all-zero-owner will get locks. The patch which lead to sdfs-sanity.t >> failure treats inodelk/entrylk/lk fops with all-zero lk-owner as Invalid >> request to prevent these kinds of bugs. So it exposed the bug in sdfs. I >> sent a fix for sdfs @ https://review.gluster.org/#/c/glusterfs/+/22582 >> > > Since this patch hasn't passed the regression and now that I see > tests/bugs/replicate/bug-1386188-sbrain-fav-child.t hanging and timing out > in the latest nightly regression runs because of the above commit (tested > locally and confirm) I still request that we first revert this commit, get > master back to stable and then put back the required fixes. > > >> >>> *@Maintainers - Please open up every regression link to see the actual >>> status of the job and don't blindly trust on the +1 vote back at gerrit >>> till this is addressed.* >>> >>> As per the policy, I'm going to revert this commit, watch out for the >>> patch. I request this to be directly pushed with out waiting for the >>> regression vote as we had done before in such breakage. Amar/Shyam - I >>> believe you have this permission? >>> >> >>> >>>> root at a5f81bd447c2:/home/glusterfs# prove -vf tests/basic/sdfs-sanity.t >>>> tests/basic/sdfs-sanity.t .. 
>>>> 1..7 >>>> ok 1, LINENUM:8 >>>> ok 2, LINENUM:9 >>>> ok 3, LINENUM:11 >>>> ok 4, LINENUM:12 >>>> ok 5, LINENUM:13 >>>> ok 6, LINENUM:16 >>>> mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid >>>> argument >>>> stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument >>>> tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected >>>> not ok 7 , LINENUM:20 >>>> FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 >>>> Failed 1/7 subtests >>>> >>>> Test Summary Report >>>> ------------------- >>>> tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) >>>> Failed test: 7 >>>> Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr >>>> 0.67 csys = 1.27 CPU) >>>> Result: FAIL >>>> >>>> >>>>> >>>>> Following patches will not be taken in if CentOS regression does not >>>>> pass by tomorrow morning Eastern TZ, >>>>> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >>>>> inodelk/entrylk >>>>> https://review.gluster.org/c/glusterfs/+/22385 >>>>> (Aravinda) - geo-rep: IPv6 support >>>>> https://review.gluster.org/c/glusterfs/+/22488 >>>>> (Aravinda) - geo-rep: fix integer config validation >>>>> https://review.gluster.org/c/glusterfs/+/22489 >>>>> >>>>> Tracker bug status: >>>>> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >>>>> RHGSWA. >>>>> All patches are merged, but none of the patches adds the "Fixes" >>>>> keyword, assume this is an oversight and that the bug is fixed in this >>>>> release. >>>>> >>>>> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for >>>>> the same brick, causing transport endpoint not connected >>>>> No work has occurred post logs upload to bug, restart of bircks and >>>>> possibly glusterd is the existing workaround when the bug is hit. >>>>> Moving >>>>> this out of the tracker for 6.1. >>>>> >>>>> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >>>>> replace-brick is executed >>>>> Very recent bug (15th April), does not seem to have any critical data >>>>> corruption or service availability issues, planning on not waiting for >>>>> the fix in 6.1 >>>>> >>>>> - Shyam >>>>> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >>>>> > Hi Mohit, >>>>> > >>>>> > https://review.gluster.org/22495 should get into 6.1 as it?s a >>>>> > regression. Can you please attach the respective bug to the tracker >>>>> Ravi >>>>> > pointed out? >>>>> > >>>>> > >>>>> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N >>>> > > wrote: >>>>> > >>>>> > Tracker bug is >>>>> https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in >>>>> > case anyone wants to add blocker bugs. >>>>> > >>>>> > >>>>> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >>>>> > > Hi, >>>>> > > >>>>> > > Expected tagging date for release-6.1 is on April, 10th, 2019. >>>>> > > >>>>> > > Please ensure required patches are backported and also are >>>>> passing >>>>> > > regressions and are appropriately reviewed for easy merging and >>>>> > tagging >>>>> > > on the date. 
>>>>> > > >>>>> > > Thanks, >>>>> > > Shyam >>>>> > > _______________________________________________ >>>>> > > Gluster-devel mailing list >>>>> > > Gluster-devel at gluster.org >>>>> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>> > _______________________________________________ >>>>> > Gluster-devel mailing list >>>>> > Gluster-devel at gluster.org >>>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>> > >>>>> > >>>>> > -- >>>>> > - Atin (atinm) >>>>> > >>>>> > _______________________________________________ >>>>> > Gluster-devel mailing list >>>>> > Gluster-devel at gluster.org >>>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>> > >>>>> _______________________________________________ >>>>> Gluster-devel mailing list >>>>> Gluster-devel at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>>> >>>> >> >> -- >> Pranith >> > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Wed Apr 17 05:22:59 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Wed, 17 Apr 2019 05:22:59 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Message-ID: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option. If I monitor glusterfsd process RSS with command: pidstat -r -p 1 And at the same tme, execute command: gluster v heal info, i find that the RSS keep increasing until system Out of memory. I have tried even with latest glusterfs5.5 version, this issue still exists, Have you any idea about this issue? The config in my env: [root at mn-0:/var/lib/glusterd] # gluster v info ccs Volume Name: ccs Type: Replicate Volume ID: b565c958-2fef-4a39-811b-adce67aab20b Status: Started Snapshot Count: 0 Number of Bricks: 1 x 3 = 3 Transport-type: tcp Bricks: Brick1: mn-0.local:/mnt/bricks/ccs/brick Brick2: mn-1.local:/mnt/bricks/ccs/brick Brick3: dbm-0.local:/mnt/bricks/ccs/brick Options Reconfigured: nfs.disable: on transport.address-family: inet cluster.server-quorum-type: none cluster.quorum-reads: no ssl.private-key: /var/opt/nokia/certs/glusterfs/glusterfs.key cluster.favorite-child-policy: mtime cluster.consistent-metadata: on network.ping-timeout: 42 cluster.quorum-type: auto server.allow-insecure: on server.ssl: on ssl.own-cert: /var/opt/nokia/certs/glusterfs/glusterfs.pem ssl.ca-list: /var/opt/nokia/certs/glusterfs/glusterfs.ca client.ssl: on performance.client-io-threads: off cluster.server-quorum-ratio: 51% [root at mn-0:/var/lib/glusterd] # -------------- next part -------------- An HTML attachment was scrubbed... URL: From pkarampu at redhat.com Wed Apr 17 05:25:11 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Wed, 17 Apr 2019 10:55:11 +0530 Subject: [Gluster-devel] Release 6.1: Expected tagging on April 10th In-Reply-To: References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: On Wed, Apr 17, 2019 at 8:53 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > My take is, lets disable sdfs for 6.1 (we also have issues with its > performance anyways). We will fix it properly by 6.2 or 7.0. Continue with > marking sdfs-sanity.t tests as bad in that case. > It is better to revert the patch like Atin mentioned. The patch that was merged was intended to find the existing bugs with lk-owner before getting merged. 
We thought there were no bugs when regression passed. But that is not the case. So better to revert, fix other bugs found by this patch and then get it merged again. > > -Amar > > On Wed, Apr 17, 2019 at 8:04 AM Atin Mukherjee > wrote: > >> >> >> On Wed, Apr 17, 2019 at 12:33 AM Pranith Kumar Karampuri < >> pkarampu at redhat.com> wrote: >> >>> >>> >>> On Tue, Apr 16, 2019 at 10:27 PM Atin Mukherjee >>> wrote: >>> >>>> >>>> >>>> On Tue, Apr 16, 2019 at 9:19 PM Atin Mukherjee >>>> wrote: >>>> >>>>> >>>>> >>>>> On Tue, Apr 16, 2019 at 7:24 PM Shyam Ranganathan >>>>> wrote: >>>>> >>>>>> Status: Tagging pending >>>>>> >>>>>> Waiting on patches: >>>>>> (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic >>>>>> https://review.gluster.org/c/glusterfs/+/22579 >>>>> >>>>> >>>>> The regression doesn't pass for the mainline patch. I believe master >>>>> is broken now. With latest master sdfs-sanity.t always fail. We either need >>>>> to fix it or mark it as bad test. >>>>> >>>> >>>> commit 3883887427a7f2dc458a9773e05f7c8ce8e62301 (HEAD) >>>> Author: Pranith Kumar K >>>> Date: Mon Apr 1 11:14:56 2019 +0530 >>>> >>>> features/locks: error-out {inode,entry}lk fops with all-zero lk-owner >>>> >>>> Problem: >>>> Sometimes we find that developers forget to assign lk-owner for an >>>> inodelk/entrylk/lk before writing code to wind these fops. locks >>>> xlator at the moment allows this operation. This leads to multiple >>>> threads in the same client being able to get locks on the inode >>>> because lk-owner is same and transport is same. So isolation >>>> with locks can't be achieved. >>>> >>>> Fix: >>>> Disallow locks with lk-owner zero. >>>> >>>> fixes bz#1624701 >>>> Change-Id: I1c816280cffd150ebb392e3dcd4d21007cdd767f >>>> Signed-off-by: Pranith Kumar K >>>> >>>> With the above commit sdfs-sanity.t started failing. But when I looked >>>> at the last regression vote at >>>> https://build.gluster.org/job/centos7-regression/5568/consoleFull I >>>> saw it voted back positive but the bell rang when I saw the overall >>>> regression took less than 2 hours and when I opened the regression link I >>>> saw the test actually failed but still this job voted back +1 at gerrit. >>>> >>>> *Deepshika* - *This is a bad CI bug we have now and have to be >>>> addressed at earliest. Please take a look at >>>> https://build.gluster.org/job/centos7-regression/5568/consoleFull >>>> and >>>> investigate why the regression vote wasn't negative.* >>>> >>>> Pranith - I request you to investigate on the sdfs-sanity.t failure >>>> because of this patch. >>>> >>> >>> sdfs is supposed to serialize entry fops by doing entrylk, but all the >>> locks are being done with all-zero lk-owner. In essence sdfs doesn't >>> achieve its goal of mutual exclusion when conflicting operations are >>> executed by same client because two locks on same entry with same >>> all-zero-owner will get locks. The patch which lead to sdfs-sanity.t >>> failure treats inodelk/entrylk/lk fops with all-zero lk-owner as Invalid >>> request to prevent these kinds of bugs. So it exposed the bug in sdfs. I >>> sent a fix for sdfs @ https://review.gluster.org/#/c/glusterfs/+/22582 >>> >> >> Since this patch hasn't passed the regression and now that I see >> tests/bugs/replicate/bug-1386188-sbrain-fav-child.t hanging and timing out >> in the latest nightly regression runs because of the above commit (tested >> locally and confirm) I still request that we first revert this commit, get >> master back to stable and then put back the required fixes. 
>> >> >>> >>>> *@Maintainers - Please open up every regression link to see the actual >>>> status of the job and don't blindly trust on the +1 vote back at gerrit >>>> till this is addressed.* >>>> >>>> As per the policy, I'm going to revert this commit, watch out for the >>>> patch. I request this to be directly pushed with out waiting for the >>>> regression vote as we had done before in such breakage. Amar/Shyam - I >>>> believe you have this permission? >>>> >>> >>>> >>>>> root at a5f81bd447c2:/home/glusterfs# prove -vf >>>>> tests/basic/sdfs-sanity.t >>>>> tests/basic/sdfs-sanity.t .. >>>>> 1..7 >>>>> ok 1, LINENUM:8 >>>>> ok 2, LINENUM:9 >>>>> ok 3, LINENUM:11 >>>>> ok 4, LINENUM:12 >>>>> ok 5, LINENUM:13 >>>>> ok 6, LINENUM:16 >>>>> mkdir: cannot create directory ?/mnt/glusterfs/1/coverage?: Invalid >>>>> argument >>>>> stat: cannot stat '/mnt/glusterfs/1/coverage/dir': Invalid argument >>>>> tests/basic/rpc-coverage.sh: line 61: test: ==: unary operator expected >>>>> not ok 7 , LINENUM:20 >>>>> FAILED COMMAND: tests/basic/rpc-coverage.sh /mnt/glusterfs/1 >>>>> Failed 1/7 subtests >>>>> >>>>> Test Summary Report >>>>> ------------------- >>>>> tests/basic/sdfs-sanity.t (Wstat: 0 Tests: 7 Failed: 1) >>>>> Failed test: 7 >>>>> Files=1, Tests=7, 14 wallclock secs ( 0.02 usr 0.00 sys + 0.58 cusr >>>>> 0.67 csys = 1.27 CPU) >>>>> Result: FAIL >>>>> >>>>> >>>>>> >>>>>> Following patches will not be taken in if CentOS regression does not >>>>>> pass by tomorrow morning Eastern TZ, >>>>>> (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of >>>>>> inodelk/entrylk >>>>>> https://review.gluster.org/c/glusterfs/+/22385 >>>>>> (Aravinda) - geo-rep: IPv6 support >>>>>> https://review.gluster.org/c/glusterfs/+/22488 >>>>>> (Aravinda) - geo-rep: fix integer config validation >>>>>> https://review.gluster.org/c/glusterfs/+/22489 >>>>>> >>>>>> Tracker bug status: >>>>>> (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in >>>>>> RHGSWA. >>>>>> All patches are merged, but none of the patches adds the "Fixes" >>>>>> keyword, assume this is an oversight and that the bug is fixed in this >>>>>> release. >>>>>> >>>>>> (Atin) - Bug 1698131 - multiple glusterfsd processes being launched >>>>>> for >>>>>> the same brick, causing transport endpoint not connected >>>>>> No work has occurred post logs upload to bug, restart of bircks and >>>>>> possibly glusterd is the existing workaround when the bug is hit. >>>>>> Moving >>>>>> this out of the tracker for 6.1. >>>>>> >>>>>> (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when >>>>>> replace-brick is executed >>>>>> Very recent bug (15th April), does not seem to have any critical >>>>>> data >>>>>> corruption or service availability issues, planning on not waiting for >>>>>> the fix in 6.1 >>>>>> >>>>>> - Shyam >>>>>> On 4/6/19 4:38 AM, Atin Mukherjee wrote: >>>>>> > Hi Mohit, >>>>>> > >>>>>> > https://review.gluster.org/22495 should get into 6.1 as it?s a >>>>>> > regression. Can you please attach the respective bug to the tracker >>>>>> Ravi >>>>>> > pointed out? >>>>>> > >>>>>> > >>>>>> > On Sat, 6 Apr 2019 at 12:00, Ravishankar N >>>>> > > wrote: >>>>>> > >>>>>> > Tracker bug is >>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in >>>>>> > case anyone wants to add blocker bugs. >>>>>> > >>>>>> > >>>>>> > On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >>>>>> > > Hi, >>>>>> > > >>>>>> > > Expected tagging date for release-6.1 is on April, 10th, 2019. 
>>>>>> > > >>>>>> > > Please ensure required patches are backported and also are >>>>>> passing >>>>>> > > regressions and are appropriately reviewed for easy merging >>>>>> and >>>>>> > tagging >>>>>> > > on the date. >>>>>> > > >>>>>> > > Thanks, >>>>>> > > Shyam >>>>>> > > _______________________________________________ >>>>>> > > Gluster-devel mailing list >>>>>> > > Gluster-devel at gluster.org >>>>>> > > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>>> > _______________________________________________ >>>>>> > Gluster-devel mailing list >>>>>> > Gluster-devel at gluster.org >>>>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>>> > >>>>>> > >>>>>> > -- >>>>>> > - Atin (atinm) >>>>>> > >>>>>> > _______________________________________________ >>>>>> > Gluster-devel mailing list >>>>>> > Gluster-devel at gluster.org >>>>>> > https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>>> > >>>>>> _______________________________________________ >>>>>> Gluster-devel mailing list >>>>>> Gluster-devel at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>> >>>>> >>> >>> -- >>> Pranith >>> >> > > -- > Amar Tumballi (amarts) > -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... URL: From moagrawa at redhat.com Wed Apr 17 06:06:09 2019 From: moagrawa at redhat.com (Mohit Agrawal) Date: Wed, 17 Apr 2019 11:36:09 +0530 Subject: [Gluster-devel] Regards glusterd.service is not started automatically after reboot the node Message-ID: Hi, We are facing an issue after install glusterfs-6 rpms. After reboot, the node glusterd.service is not started automatically because glusterd.service is not enabled by the installation script.I am not able to find the patch from that we deleted command to enable service from glusterfs.spec.in. I have posted a patch(https://review.gluster.org/#/c/glusterfs/+/22584/) to resolve the same. Thanks, Mohit Agrawal -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndevos at redhat.com Wed Apr 17 09:58:15 2019 From: ndevos at redhat.com (Niels de Vos) Date: Wed, 17 Apr 2019 11:58:15 +0200 Subject: [Gluster-devel] Regards glusterd.service is not started automatically after reboot the node In-Reply-To: References: Message-ID: <20190417095815.GB19605@ndevos-x270> On Wed, Apr 17, 2019 at 11:36:09AM +0530, Mohit Agrawal wrote: > Hi, > > We are facing an issue after install glusterfs-6 rpms. After reboot, the > node glusterd.service is not started automatically because > glusterd.service is not enabled by the installation script.I am not able > to find the patch from that we deleted command to enable service > from glusterfs.spec.in. > I have posted a patch(https://review.gluster.org/#/c/glusterfs/+/22584/) > to resolve the same. This is not a bug, it is expected behaviour. Services are not allowed to get automatically enabled through RPM scriptlets. Distributions that want to enable glusterd by default should provide a systemd preset as explained in https://www.freedesktop.org/wiki/Software/systemd/Preset/ . This is something you could contribute to https://github.com/CentOS-Storage-SIG/centos-release-gluster/tree/6 HTH, Niels From srangana at redhat.com Wed Apr 17 18:31:55 2019 From: srangana at redhat.com (Shyam Ranganathan) Date: Wed, 17 Apr 2019 14:31:55 -0400 Subject: [Gluster-devel] Release 6.1: Tagged! 
In-Reply-To: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> References: <6866f19a-18f5-38cb-ab03-13de124f5ac5@redhat.com> Message-ID: Is now tagged and being packaged. If anyone gets a chance, please test the packages from CentOS SIG, as I am unavailable for the next 4 days. Thanks, Shyam On 4/16/19 9:53 AM, Shyam Ranganathan wrote: > Status: Tagging pending > > Waiting on patches: > (Kotresh/Atin) - glusterd: fix loading ctime in client graph logic > https://review.gluster.org/c/glusterfs/+/22579 > > Following patches will not be taken in if CentOS regression does not > pass by tomorrow morning Eastern TZ, > (Pranith/KingLongMee) - cluster-syncop: avoid duplicate unlock of > inodelk/entrylk > https://review.gluster.org/c/glusterfs/+/22385 > (Aravinda) - geo-rep: IPv6 support > https://review.gluster.org/c/glusterfs/+/22488 > (Aravinda) - geo-rep: fix integer config validation > https://review.gluster.org/c/glusterfs/+/22489 > > Tracker bug status: > (Ravi) - Bug 1693155 - Excessive AFR messages from gluster showing in > RHGSWA. > All patches are merged, but none of the patches adds the "Fixes" > keyword, assume this is an oversight and that the bug is fixed in this > release. > > (Atin) - Bug 1698131 - multiple glusterfsd processes being launched for > the same brick, causing transport endpoint not connected > No work has occurred post logs upload to bug, restart of bircks and > possibly glusterd is the existing workaround when the bug is hit. Moving > this out of the tracker for 6.1. > > (Xavi) - Bug 1699917 - I/O error on writes to a disperse volume when > replace-brick is executed > Very recent bug (15th April), does not seem to have any critical data > corruption or service availability issues, planning on not waiting for > the fix in 6.1 > > - Shyam > On 4/6/19 4:38 AM, Atin Mukherjee wrote: >> Hi Mohit, >> >> https://review.gluster.org/22495 should get into 6.1 as it?s a >> regression. Can you please attach the respective bug to the tracker Ravi >> pointed out? >> >> >> On Sat, 6 Apr 2019 at 12:00, Ravishankar N > > wrote: >> >> Tracker bug is https://bugzilla.redhat.com/show_bug.cgi?id=1692394, in >> case anyone wants to add blocker bugs. >> >> >> On 05/04/19 8:03 PM, Shyam Ranganathan wrote: >> > Hi, >> > >> > Expected tagging date for release-6.1 is on April, 10th, 2019. >> > >> > Please ensure required patches are backported and also are passing >> > regressions and are appropriately reviewed for easy merging and >> tagging >> > on the date. 
>> > >> > Thanks, >> > Shyam >> > _______________________________________________ >> > Gluster-devel mailing list >> > Gluster-devel at gluster.org >> > https://lists.gluster.org/mailman/listinfo/gluster-devel >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> >> -- >> - Atin (atinm) >> >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel >> > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > From amukherj at redhat.com Thu Apr 18 05:19:01 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Thu, 18 Apr 2019 10:49:01 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> Message-ID: On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Thu Apr 18 09:25:00 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Thu, 18 Apr 2019 09:25:00 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> Message-ID: I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Raghavendra Gowdappa ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Thu Apr 18 09:31:11 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Thu, 18 Apr 2019 09:31:11 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> Message-ID: <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. 
er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' Cc: Raghavendra Gowdappa ; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenkins at build.gluster.org Mon Apr 22 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 22 Apr 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <307671157.25.1555897502768.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1699023 / core: Brick is not able to detach successfully in brick_mux environment https://bugzilla.redhat.com/1695416 / core: client log flooding with intentional socket shutdown message when a brick is down https://bugzilla.redhat.com/1695480 / core: Global Thread Pool https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down directory listing https://bugzilla.redhat.com/1700295 / core: The data couldn't be flushed immediately even with O_SYNC in glfs_create or with glfs_fsync/glfs_fdatasync after glfs_write. https://bugzilla.redhat.com/1698861 / disperse: Renaming a directory when 2 bricks of multiple disperse subvols are down leaves both old and new dirs on the bricks. 
https://bugzilla.redhat.com/1697293 / distribute: DHT: print hash and layout values in hexadecimal format in the logs https://bugzilla.redhat.com/1701039 / distribute: gluster replica 3 arbiter Unfortunately data not distributed equally https://bugzilla.redhat.com/1697971 / fuse: Segfault in FUSE process, potential use after free https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job 'heketi-storage-copy-job' to complete on one-node k3s deployment. https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs processes keeps increasing, using all available resources https://bugzilla.redhat.com/1692349 / project-infrastructure: gluster-csi-containers job is failing https://bugzilla.redhat.com/1698716 / project-infrastructure: Regression job did not vote for https://review.gluster.org/#/c/glusterfs/+/22366/ https://bugzilla.redhat.com/1698694 / project-infrastructure: regression job isn't voting back to gerrit https://bugzilla.redhat.com/1699712 / project-infrastructure: regression job is voting Success even in case of failure https://bugzilla.redhat.com/1693385 / project-infrastructure: request to change the version of fedora in fedora-smoke-job https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke fails with "Build root is locked by another process" https://bugzilla.redhat.com/1693184 / replicate: A brick process(glusterfsd) died with 'memory violation' https://bugzilla.redhat.com/1698566 / selfheal: shd crashed while executing ./tests/bugs/core/bug-1432542-mpx-restart-crash.t in CI https://bugzilla.redhat.com/1699309 / snapshot: Gluster snapshot fails with systemd autmounted bricks https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from /tests/bugs/ module failing on Intel https://bugzilla.redhat.com/1697812 / website: mention a pointer to all the mailing lists available under glusterfs project(https://www.gluster.org/community/) [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 3020 bytes Desc: not available URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 01:48:16 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 01:48:16 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. 
--- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' Cc: 'Raghavendra Gowdappa' ; 'gluster-devel at gluster.org' Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? 
-- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From rgowdapp at redhat.com Mon Apr 22 02:08:37 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Mon, 22 Apr 2019 07:38:37 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: On Mon, Apr 22, 2019 at 7:18 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > > > Hi, > > From my code study it seems priv->ssl_ssl is not properly released, I made > a patch and the glusterfsd memory leak is alleviated with my patch, > This is a legitimate leak. Can you post a patch to gerrit? but some otherwhere is still leaking, I have no clue about the other leak > points. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:31 PM > *To:* 'Atin Mukherjee' > *Cc:* 'Raghavendra Gowdappa' ; ' > gluster-devel at gluster.org' > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > We scan it use memory-leak tool, there are following prints. We doubt some > open ssl lib malloc is is not properly freed by glusterfs code. 
> > er+0x2af [*libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda* [*libpthread-2.27.so > *]' > *13580* bytes in 175 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p*]' > *232904* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > [15:41:56] Top 10 stacks with outstanding allocations: > *8792* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *9408* bytes in 42 allocations from stack > b'CRYPTO_realloc+0x4d [*libcrypto.so.1.0.2p*]' > *9723* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *10696* bytes in 21 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11319* bytes in 602 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11431* bytes in 518 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11704* bytes in 371 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > > > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:25 PM > *To:* 'Atin Mukherjee' > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after > enable tls ssl socket, when execute gluster v heal info, will > trigger glfshel to connect glusterfsd process, and cause glusterfsd process > memory leak. Could you please try in your env? > > > > cynthia > > > > > > *From:* Atin Mukherjee > *Sent:* Thursday, April 18, 2019 1:19 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > > > > What gluster version are you testing? Would you be able to continue your > investigation and share the root cause? > > > > -- > > - Atin (atinm) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 02:14:00 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 02:14:00 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: <88c7d4971cdd4685a2b66f582ec1d9f6@nokia-sbell.com> Ok, I will post it later. cynthia From: Raghavendra Gowdappa Sent: Monday, April 22, 2019 10:09 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Mon, Apr 22, 2019 at 7:18 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, This is a legitimate leak. Can you post a patch to gerrit? but some otherwhere is still leaking, I have no clue about the other leak points. 
--- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? 
-- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mchangir at redhat.com Mon Apr 22 02:20:34 2019 From: mchangir at redhat.com (Milind Changire) Date: Mon, 22 Apr 2019 07:50:34 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: After patch 22334 , the priv->ssl_ctx is now maintained per socket connection and is no longer shared. So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > > > Hi, > > From my code study it seems priv->ssl_ssl is not properly released, I made > a patch and the glusterfsd memory leak is alleviated with my patch, but > some otherwhere is still leaking, I have no clue about the other leak > points. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:31 PM > *To:* 'Atin Mukherjee' > *Cc:* 'Raghavendra Gowdappa' ; ' > gluster-devel at gluster.org' > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > We scan it use memory-leak tool, there are following prints. We doubt some > open ssl lib malloc is is not properly freed by glusterfs code. 
> > er+0x2af [*libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda* [*libpthread-2.27.so > *]' > *13580* bytes in 175 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p*]' > *232904* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > [15:41:56] Top 10 stacks with outstanding allocations: > *8792* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *9408* bytes in 42 allocations from stack > b'CRYPTO_realloc+0x4d [*libcrypto.so.1.0.2p*]' > *9723* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *10696* bytes in 21 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11319* bytes in 602 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11431* bytes in 518 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11704* bytes in 371 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > > > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:25 PM > *To:* 'Atin Mukherjee' > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after > enable tls ssl socket, when execute gluster v heal info, will > trigger glfshel to connect glusterfsd process, and cause glusterfsd process > memory leak. Could you please try in your env? > > > > cynthia > > > > > > *From:* Atin Mukherjee > *Sent:* Thursday, April 18, 2019 1:19 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > > > > What gluster version are you testing? Would you be able to continue your > investigation and share the root cause? > > > > -- > > - Atin (atinm) > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 03:02:17 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 03:02:17 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: <9d40e0cab5e540faadd8b2885c45fb2e@nokia-sbell.com> Hi, Thanks for your quick responding! My test env is without patch 22334, so priv->ssl_ctx is shared one. In socket_reset there is no need to free it. but if with patch 22334, it is absolutely needed to free priv->ssl_ctx as well. 
Following is the log captured from glusterfsd side while connection establishment, when accept returns, a new new_trans is allocated, and the socket_init input param is this new_trans, because this new_trans has no option now, so in socket_init ssl_setup_connection_params is not called, so there should be no malloc done here, as showed from following log, ssl_setup_connection_params is called in socket_event_handler, and here the ssl_ctx had been assigned to the shared one. This issue is very easy to reproduce in my env, just ?while true; do gluster v heal info;done? and check the memory of the corresponding glusterfsd , it is very obvious increase. One more thing to confirm is that there is no need to free the ssl_sbio right? Ssl_free will handle it and just call ssl_free is enough? [2019-04-21 05:02:17.820346] T [socket.c:2776:socket_server_event_handler] 0-tcp.ccs-server: XXX server:192.168.1.13:53952, client:192.168.1.13:63683 [2019-04-21 05:02:17.820370] T [socket.c:2820:socket_server_event_handler] 0-tcp.ccs-server: ### use non-blocking IO [2019-04-21 05:02:17.820410] T [socket.c:2603:socket_event_handler] 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 [2019-04-21 05:02:17.820431] T [socket.c:2610:socket_event_handler] 0-tcp.ccs-server: server (sock:126) socket is not connected, completing connection [2019-04-21 05:02:17.820442] T [socket.c:3829:ssl_setup_connection_params] 0-tcp.ccs-server: found old SSL context! // context is shared between listening socket and accepted socket [2019-04-21 05:02:17.820451] T [socket.c:310:ssl_setup_connection_prefix] 0-tcp.ccs-server: + ssl_setup_connection_params() done! [2019-04-21 05:02:17.822559] T [socket.c:2617:socket_event_handler] 0-tcp.ccs-server: (sock:126) socket_complete_connection() returned 1 [2019-04-21 05:02:17.822585] T [socket.c:2621:socket_event_handler] 0-tcp.ccs-server: (sock:126) returning to wait on socket [2019-04-21 05:02:17.828455] T [socket.c:2603:socket_event_handler] 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 [2019-04-21 05:02:17.828483] T [socket.c:2610:socket_event_handler] 0-tcp.ccs-server: server (sock:126) socket is not connected, completing connection [2019-04-21 05:02:17.829130] D [socket.c:366:ssl_setup_connection_postfix] 0-tcp.ccs-server: peer CN = example ee certificate [2019-04-21 05:02:17.829157] D [socket.c:369:ssl_setup_connection_postfix] 0-tcp.ccs-server: SSL verification succeeded (client: 192.168.1.13:63683) (server: 192.168.1.13:53952) [2019-04-21 05:02:17.829171] T [socket.c:423:ssl_complete_connection] 0-tcp.ccs-server: ssl_accepted! 
[2019-04-21 05:02:17.829183] T [socket.c:2617:socket_event_handler] 0-tcp.ccs-server: (sock:126) socket_complete_connection() returned 1 [2019-04-21 05:02:17.829192] T [socket.c:2621:socket_event_handler] 0-tcp.ccs-server: (sock:126) returning to wait on socket [2019-04-21 05:02:17.829261] T [socket.c:2603:socket_event_handler] 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 [2019-04-21 05:02:17.829282] T [socket.c:2628:socket_event_handler] 0-tcp.ccs-server: Server socket (126) is already connected [2019-04-21 05:02:17.829294] T [socket.c:493:__socket_ssl_readv] 0-tcp.ccs-server: ***** reading over SSL [2019-04-21 05:02:17.829311] T [socket.c:493:__socket_ssl_readv] 0-tcp.ccs-server: ***** reading over SSL [2019-04-21 05:02:17.829337] T [socket.c:493:__socket_ssl_readv] 0-tcp.ccs-server: ***** reading over SSL cynthia From: Milind Changire Sent: Monday, April 22, 2019 10:21 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl After patch 22334, the priv->ssl_ctx is now maintained per socket connection and is no longer shared. So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. 
er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From mchangir at redhat.com Mon Apr 22 03:46:07 2019 From: mchangir at redhat.com (Milind Changire) Date: Mon, 22 Apr 2019 09:16:07 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: <9d40e0cab5e540faadd8b2885c45fb2e@nokia-sbell.com> References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <9d40e0cab5e540faadd8b2885c45fb2e@nokia-sbell.com> Message-ID: On Mon, Apr 22, 2019 at 8:32 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > Thanks for your quick responding! > > My test env is without patch 22334, so priv->ssl_ctx is shared one. In > socket_reset there is no need to free it. but if with patch 22334, it is > absolutely needed to free priv->ssl_ctx as well. 
> > Following is the log captured from glusterfsd side while connection > establishment, when accept returns, a new new_trans is allocated, and the > socket_init input param is this new_trans, because this new_trans has no > option now, so in socket_init ssl_setup_connection_params is not called, > so there should be no malloc done here, as showed from following log, > ssl_setup_connection_params is called in socket_event_handler, and here the > ssl_ctx had been assigned to the shared one. > > This issue is very easy to reproduce in my env, just ?while true; do > gluster v heal info;done? and check the memory of the > corresponding glusterfsd , it is very obvious increase. > > for pre-22334 code ssl_ctx free need not be done > > *One more thing to confirm is that there is no need to free the ssl_sbio > right? Ssl_free will handle it and just call ssl_free is enough?* > You need to call BIO_free() Check man bio. > > > > *[2019-04-21 05:02:17.820346] T > [socket.c:2776:socket_server_event_handler] 0-tcp.ccs-server: XXX > server:192.168.1.13:53952 , > client:192.168.1.13:63683 * > > *[2019-04-21 05:02:17.820370] T > [socket.c:2820:socket_server_event_handler] 0-tcp.ccs-server: ### use > non-blocking IO* > > [2019-04-21 05:02:17.820410] T [socket.c:2603:socket_event_handler] > 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 > > [2019-04-21 05:02:17.820431] T [socket.c:2610:socket_event_handler] > 0-tcp.ccs-server: server (sock:126) socket is not connected, completing > connection > > [2019-04-21 05:02:17.820442] T [socket.c:3829: > *ssl_setup_connection_params*] 0-tcp.ccs-server: *found old SSL > context! // context is shared between listening socket and accepted > socket* > > [2019-04-21 05:02:17.820451] T [socket.c:310:ssl_setup_connection_prefix] > 0-tcp.ccs-server: + ssl_setup_connection_params() done! > > [2019-04-21 05:02:17.822559] T [socket.c:2617:socket_event_handler] > 0-tcp.ccs-server: (sock:126) socket_complete_connection() returned 1 > > [2019-04-21 05:02:17.822585] T [socket.c:2621:socket_event_handler] > 0-tcp.ccs-server: (sock:126) returning to wait on socket > > [2019-04-21 05:02:17.828455] T [socket.c:2603:socket_event_handler] > 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 > > [2019-04-21 05:02:17.828483] T [socket.c:2610:socket_event_handler] > 0-tcp.ccs-server: server (sock:126) socket is not connected, completing > connection > > [2019-04-21 05:02:17.829130] D [socket.c:366:ssl_setup_connection_postfix] > 0-tcp.ccs-server: peer CN = example ee certificate > > [2019-04-21 05:02:17.829157] D [socket.c:369:ssl_setup_connection_postfix] > 0-tcp.ccs-server: SSL verification succeeded (client: 192.168.1.13:63683) > (server: 192.168.1.13:53952) > > [2019-04-21 05:02:17.829171] T [socket.c:423:ssl_complete_connection] > 0-tcp.ccs-server: ssl_accepted! 
> > [2019-04-21 05:02:17.829183] T [socket.c:2617:socket_event_handler] > 0-tcp.ccs-server: (sock:126) socket_complete_connection() returned 1 > > [2019-04-21 05:02:17.829192] T [socket.c:2621:socket_event_handler] > 0-tcp.ccs-server: (sock:126) returning to wait on socket > > [2019-04-21 05:02:17.829261] T [socket.c:2603:socket_event_handler] > 0-tcp.ccs-server: server (sock:126) in:1, out:0, err:0 > > [2019-04-21 05:02:17.829282] T [socket.c:2628:socket_event_handler] > 0-tcp.ccs-server: Server socket (126) is already connected > > [2019-04-21 05:02:17.829294] T [socket.c:493:__socket_ssl_readv] > 0-tcp.ccs-server: ***** reading over SSL > > [2019-04-21 05:02:17.829311] T [socket.c:493:__socket_ssl_readv] > 0-tcp.ccs-server: ***** reading over SSL > > [2019-04-21 05:02:17.829337] T [socket.c:493:__socket_ssl_readv] > 0-tcp.ccs-server: ***** reading over SSL > > > > > > cynthia > > *From:* Milind Changire > *Sent:* Monday, April 22, 2019 10:21 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Atin Mukherjee ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > After patch 22334 , the > priv->ssl_ctx is now maintained per socket connection and is no longer > shared. > > So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx > to NULL. > > > > There might be some strings that are duplicated (gf_strdup()) via the > socket_init() code path. Please take a look at those as well. > > > > Sorry about that. I missed it. > > > > > > On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > > > Hi, > > From my code study it seems priv->ssl_ssl is not properly released, I made > a patch and the glusterfsd memory leak is alleviated with my patch, but > some otherwhere is still leaking, I have no clue about the other leak > points. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:31 PM > *To:* 'Atin Mukherjee' > *Cc:* 'Raghavendra Gowdappa' ; ' > gluster-devel at gluster.org' > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > We scan it use memory-leak tool, there are following prints. We doubt some > open ssl lib malloc is is not properly freed by glusterfs code. 
> > er+0x2af [*libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda* [*libpthread-2.27.so > *]' > *13580* bytes in 175 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p*]' > *232904* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > [15:41:56] Top 10 stacks with outstanding allocations: > *8792* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *9408* bytes in 42 allocations from stack > b'CRYPTO_realloc+0x4d [*libcrypto.so.1.0.2p*]' > *9723* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *10696* bytes in 21 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11319* bytes in 602 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11431* bytes in 518 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11704* bytes in 371 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > > > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:25 PM > *To:* 'Atin Mukherjee' > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after > enable tls ssl socket, when execute gluster v heal info, will > trigger glfshel to connect glusterfsd process, and cause glusterfsd process > memory leak. Could you please try in your env? > > > > cynthia > > > > > > *From:* Atin Mukherjee > *Sent:* Thursday, April 18, 2019 1:19 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > > > > What gluster version are you testing? Would you be able to continue your > investigation and share the root cause? > > > > -- > > - Atin (atinm) > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > -- > > Milind > -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 05:15:31 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 05:15:31 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> Message-ID: <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Why there is no bio_free called in ssl_teardown_connection then? cynthia From: Milind Changire Sent: Monday, April 22, 2019 10:21 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl After patch 22334, the priv->ssl_ctx is now maintained per socket connection and is no longer shared. 
So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? 
cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From mchangir at redhat.com Mon Apr 22 05:34:30 2019 From: mchangir at redhat.com (Milind Changire) Date: Mon, 22 Apr 2019 11:04:30 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: This probably went unnoticed until now. On Mon, Apr 22, 2019 at 10:45 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Why there is no bio_free called in ssl_teardown_connection then? > > > > cynthia > > > > *From:* Milind Changire > *Sent:* Monday, April 22, 2019 10:21 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Atin Mukherjee ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > After patch 22334 , the > priv->ssl_ctx is now maintained per socket connection and is no longer > shared. > > So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx > to NULL. > > > > There might be some strings that are duplicated (gf_strdup()) via the > socket_init() code path. Please take a look at those as well. > > > > Sorry about that. I missed it. > > > > > > On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > > > Hi, > > From my code study it seems priv->ssl_ssl is not properly released, I made > a patch and the glusterfsd memory leak is alleviated with my patch, but > some otherwhere is still leaking, I have no clue about the other leak > points. 
> > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:31 PM > *To:* 'Atin Mukherjee' > *Cc:* 'Raghavendra Gowdappa' ; ' > gluster-devel at gluster.org' > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > We scan it use memory-leak tool, there are following prints. We doubt some > open ssl lib malloc is is not properly freed by glusterfs code. > > er+0x2af [*libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda* [*libpthread-2.27.so > *]' > *13580* bytes in 175 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p*]' > *232904* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > [15:41:56] Top 10 stacks with outstanding allocations: > *8792* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *9408* bytes in 42 allocations from stack > b'CRYPTO_realloc+0x4d [*libcrypto.so.1.0.2p*]' > *9723* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *10696* bytes in 21 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11319* bytes in 602 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11431* bytes in 518 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11704* bytes in 371 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > > > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:25 PM > *To:* 'Atin Mukherjee' > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after > enable tls ssl socket, when execute gluster v heal info, will > trigger glfshel to connect glusterfsd process, and cause glusterfsd process > memory leak. Could you please try in your env? 
> > > > cynthia > > > > > > *From:* Atin Mukherjee > *Sent:* Thursday, April 18, 2019 1:19 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > > > > What gluster version are you testing? Would you be able to continue your > investigation and share the root cause? > > > > -- > > - Atin (atinm) > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > -- > > Milind > -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 06:07:02 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 06:07:02 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: I tried to print priv->ssl_sbio after SSL_free() find this pointer is not null, so I add free ssl_sbio with BIO_free, however this cause glusterfd coredump (gdb) bt #0 0x00007f3047867f9b in raise () from /lib64/libc.so.6 #1 0x00007f3047869351 in abort () from /lib64/libc.so.6 #2 0x00007f30478aa8c7 in __libc_message () from /lib64/libc.so.6 #3 0x00007f30478b0e6a in malloc_printerr () from /lib64/libc.so.6 #4 0x00007f30478b2835 in _int_free () from /lib64/libc.so.6 #5 0x00007f3047c5bbbd in CRYPTO_free () from /lib64/libcrypto.so.10 #6 0x00007f3047d07582 in BIO_free () from /lib64/libcrypto.so.10 #7 0x00007f3043f9ba4b in __socket_reset (this=0x7f303c1ae710) at socket.c:1032 #8 0x00007f3043f9c4aa in socket_event_poll_err (this=0x7f303c1ae710, gen=1, idx=17) at socket.c:1232 #9 0x00007f3043fa1b7d in socket_event_handler (fd=26, idx=17, gen=1, data=0x7f303c1ae710, poll_in=1, poll_out=0, poll_err=0) at socket.c:2669 #10 0x00007f3049307984 in event_dispatch_epoll_handler (event_pool=0x1035610, event=0x7f3043b14e84) at event-epoll.c:587 #11 0x00007f3049307c5b in event_dispatch_epoll_worker (data=0x107e3e0) at event-epoll.c:663 #12 0x00007f30480535da in start_thread () from /lib64/libpthread.so.0 #13 0x00007f3047929eaf in clone () from /lib64/libc.so.6 @@ -1019,7 +1019,20 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p ",priv->sock,priv->ssl_sbio); + if(priv->ssl_sbio != NULL) + BIO_free(priv->ssl_sbio); + priv->ssl_ssl = NULL; + priv->ssl_sbio = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4251,20 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& 
priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p ",priv->sock,priv->ssl_sbio); + if(priv->ssl_sbio != NULL) + BIO_free(priv->ssl_sbio); + priv->ssl_ssl = NULL; + priv->ssl_sbio = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); From: Milind Changire Sent: Monday, April 22, 2019 1:35 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl This probably went unnoticed until now. On Mon, Apr 22, 2019 at 10:45 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Why there is no bio_free called in ssl_teardown_connection then? cynthia From: Milind Changire > Sent: Monday, April 22, 2019 10:21 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Atin Mukherjee >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl After patch 22334, the priv->ssl_ctx is now maintained per socket connection and is no longer shared. So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. 
er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 06:19:49 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 06:19:49 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: I do some google. Seems it is not needed to call BIO_free, SSL_free will free bio. https://groups.google.com/forum/#!topic/mailing.openssl.users/8i9cRQGlfDM From: Milind Changire Sent: Monday, April 22, 2019 1:35 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl This probably went unnoticed until now. On Mon, Apr 22, 2019 at 10:45 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Why there is no bio_free called in ssl_teardown_connection then? 
Cynthia From: Milind Changire > Sent: Monday, April 22, 2019 10:21 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Atin Mukherjee >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl After patch 22334, the priv->ssl_ctx is now maintained per socket connection and is no longer shared. So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. 
er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From mchangir at redhat.com Mon Apr 22 06:21:19 2019 From: mchangir at redhat.com (Milind Changire) Date: Mon, 22 Apr 2019 11:51:19 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: Looks like using BIO_free() is not right. Here's what the SSL_set_bio() man page says ... SSL_set_bio() is similar to SSL_set0_rbio() and SSL_set0_wbio() except that it connects both the rbio and the wbio at the same time, and *transfers the ownership of rbio and wbio to ssl* according to the following set of rules: So, I think you were right about SSL_free() doing the job for the bio. However, SSL_free() has no reason to set the priv->ssl_sbio pointer to NULL. I think priv->ssl_sbio should be set to NULL immediately after the call to SSL_set_bio() is successful. And we need to add a comment while setting priv->ssl_sbio to NULL that the ownership of the bio has now been transferred to SSL and SSL will free the related memory appropriately. 
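To make that ownership rule concrete, here is a small sketch (illustrative only; priv->sock, priv->ssl_ssl and priv->ssl_sbio are the existing socket transport fields, while the helper name and everything else are assumptions):

/* Sketch: SSL_set_bio() transfers the BIO to the SSL object, so the
 * cached pointer should be dropped right away and only SSL_free()
 * used at teardown. */
static int
ssl_attach_bio_sketch(socket_private_t *priv)
{
    priv->ssl_sbio = BIO_new_socket(priv->sock, BIO_NOCLOSE);
    if (!priv->ssl_sbio)
        return -1;

    SSL_set_bio(priv->ssl_ssl, priv->ssl_sbio, priv->ssl_sbio);

    /* The SSL object now owns the BIO; SSL_free(priv->ssl_ssl) will
     * release it, so a later BIO_free() on this pointer would be a
     * double free (which is what the CRYPTO_free abort in the
     * backtrace above shows). */
    priv->ssl_sbio = NULL;
    return 0;
}

With that in place, __socket_reset() and fini() only need SSL_shutdown() plus SSL_free() and must not touch the BIO at all.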
On Mon, Apr 22, 2019 at 11:37 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > I tried to print priv->ssl_sbio after SSL_free() find this pointer is not > null, so I add free ssl_sbio with BIO_free, however this cause glusterfd > coredump > > (gdb) bt > > #0 0x00007f3047867f9b in raise () from /lib64/libc.so.6 > > #1 0x00007f3047869351 in abort () from /lib64/libc.so.6 > > #2 0x00007f30478aa8c7 in __libc_message () from /lib64/libc.so.6 > > #3 0x00007f30478b0e6a in malloc_printerr () from /lib64/libc.so.6 > > #4 0x00007f30478b2835 in _int_free () from /lib64/libc.so.6 > > #5 0x00007f3047c5bbbd in CRYPTO_free () from /lib64/libcrypto.so.10 > > #6 0x00007f3047d07582 in BIO_free () from /lib64/libcrypto.so.10 > > #7 0x00007f3043f9ba4b in __socket_reset (this=0x7f303c1ae710) at > socket.c:1032 > > #8 0x00007f3043f9c4aa in socket_event_poll_err (this=0x7f303c1ae710, > gen=1, idx=17) at socket.c:1232 > > #9 0x00007f3043fa1b7d in socket_event_handler (fd=26, idx=17, gen=1, > data=0x7f303c1ae710, poll_in=1, poll_out=0, poll_err=0) at socket.c:2669 > > #10 0x00007f3049307984 in event_dispatch_epoll_handler > (event_pool=0x1035610, event=0x7f3043b14e84) at event-epoll.c:587 > > #11 0x00007f3049307c5b in event_dispatch_epoll_worker (data=0x107e3e0) at > event-epoll.c:663 > > #12 0x00007f30480535da in start_thread () from /lib64/libpthread.so.0 > > #13 0x00007f3047929eaf in clone () from /lib64/libc.so.6 > > > > @@ -1019,7 +1019,20 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_INFO, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p > ",priv->sock,priv->ssl_sbio); > > + if(priv->ssl_sbio != NULL) > > + BIO_free(priv->ssl_sbio); > > + priv->ssl_ssl = NULL; > > + priv->ssl_sbio = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4251,20 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_INFO, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p > ",priv->sock,priv->ssl_sbio); > > + if(priv->ssl_sbio != NULL) > > + BIO_free(priv->ssl_sbio); > > + priv->ssl_ssl = NULL; > > + priv->ssl_sbio = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > > > > > *From:* Milind Changire > *Sent:* Monday, April 22, 2019 1:35 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Atin Mukherjee ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > This probably went unnoticed until now. > > > > > > > > On Mon, Apr 22, 2019 at 10:45 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Why there is no bio_free called in ssl_teardown_connection then? 
> > > > cynthia > > > > *From:* Milind Changire > *Sent:* Monday, April 22, 2019 10:21 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Atin Mukherjee ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > After patch 22334 , the > priv->ssl_ctx is now maintained per socket connection and is no longer > shared. > > So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx > to NULL. > > > > There might be some strings that are duplicated (gf_strdup()) via the > socket_init() code path. Please take a look at those as well. > > > > Sorry about that. I missed it. > > > > > > On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > > > Hi, > > From my code study it seems priv->ssl_ssl is not properly released, I made > a patch and the glusterfsd memory leak is alleviated with my patch, but > some otherwhere is still leaking, I have no clue about the other leak > points. > > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_TRACE, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:31 PM > *To:* 'Atin Mukherjee' > *Cc:* 'Raghavendra Gowdappa' ; ' > gluster-devel at gluster.org' > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > We scan it use memory-leak tool, there are following prints. We doubt some > open ssl lib malloc is is not properly freed by glusterfs code. 
> > er+0x2af [*libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda* [ > libpthread-2.27.so]' > *13580* bytes in 175 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p*]' > *232904* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > [15:41:56] Top 10 stacks with outstanding allocations: > *8792* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *9408* bytes in 42 allocations from stack > b'CRYPTO_realloc+0x4d [*libcrypto.so.1.0.2p*]' > *9723* bytes in 14 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *10696* bytes in 21 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11319* bytes in 602 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11431* bytes in 518 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > *11704* bytes in 371 allocations from stack > b'CRYPTO_malloc+0x58 [*libcrypto.so.1.0.2p]\n\t\t[unknown* > ]' > > > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Thursday, April 18, 2019 5:25 PM > *To:* 'Atin Mukherjee' > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after > enable tls ssl socket, when execute gluster v heal info, will > trigger glfshel to connect glusterfsd process, and cause glusterfsd process > memory leak. Could you please try in your env? > > > > cynthia > > > > > > *From:* Atin Mukherjee > *Sent:* Thursday, April 18, 2019 1:19 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Raghavendra Gowdappa ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > In my recent test, I found that there are very severe glusterfsd memory > leak when enable socket ssl option > > > > What gluster version are you testing? Would you be able to continue your > investigation and share the root cause? > > > > -- > > - Atin (atinm) > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > -- > > Milind > > > > -- > > Milind > -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 06:30:45 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 06:30:45 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: Ok ,another question, why priv->ssl_sbio = BIO_new_socket(priv->sock, BIO_NOCLOSE); use NOCLOSE mode instead of BIO_CLOSE? cynthia From: Milind Changire Sent: Monday, April 22, 2019 2:21 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Looks like using BIO_free() is not right. Here's what the SSL_set_bio() man page says ... 
SSL_set_bio() is similar to SSL_set0_rbio() and SSL_set0_wbio() except that it connects both the rbio and the wbio at the same time, and transfers the ownership of rbio and wbio to ssl according to the following set of rules: So, I think you were right about SSL_free() doing the job for the bio. However, SSL_free() has no reason to set the priv->ssl_sbio pointer to NULL. I think priv->ssl_sbio should be set to NULL immediately after the call to SSL_set_bio() is successful. And we need to add a comment while setting priv->ssl_sbio to NULL that the ownership of the bio has now been transferred to SSL and SSL will free the related memory appropriately. On Mon, Apr 22, 2019 at 11:37 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: I tried to print priv->ssl_sbio after SSL_free() find this pointer is not null, so I add free ssl_sbio with BIO_free, however this cause glusterfd coredump (gdb) bt #0 0x00007f3047867f9b in raise () from /lib64/libc.so.6 #1 0x00007f3047869351 in abort () from /lib64/libc.so.6 #2 0x00007f30478aa8c7 in __libc_message () from /lib64/libc.so.6 #3 0x00007f30478b0e6a in malloc_printerr () from /lib64/libc.so.6 #4 0x00007f30478b2835 in _int_free () from /lib64/libc.so.6 #5 0x00007f3047c5bbbd in CRYPTO_free () from /lib64/libcrypto.so.10 #6 0x00007f3047d07582 in BIO_free () from /lib64/libcrypto.so.10 #7 0x00007f3043f9ba4b in __socket_reset (this=0x7f303c1ae710) at socket.c:1032 #8 0x00007f3043f9c4aa in socket_event_poll_err (this=0x7f303c1ae710, gen=1, idx=17) at socket.c:1232 #9 0x00007f3043fa1b7d in socket_event_handler (fd=26, idx=17, gen=1, data=0x7f303c1ae710, poll_in=1, poll_out=0, poll_err=0) at socket.c:2669 #10 0x00007f3049307984 in event_dispatch_epoll_handler (event_pool=0x1035610, event=0x7f3043b14e84) at event-epoll.c:587 #11 0x00007f3049307c5b in event_dispatch_epoll_worker (data=0x107e3e0) at event-epoll.c:663 #12 0x00007f30480535da in start_thread () from /lib64/libpthread.so.0 #13 0x00007f3047929eaf in clone () from /lib64/libc.so.6 @@ -1019,7 +1019,20 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p ",priv->sock,priv->ssl_sbio); + if(priv->ssl_sbio != NULL) + BIO_free(priv->ssl_sbio); + priv->ssl_ssl = NULL; + priv->ssl_sbio = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4251,20 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + gf_log(this->name, GF_LOG_INFO,"priv->ssl_sbio of socket(%d)is %p ",priv->sock,priv->ssl_sbio); + if(priv->ssl_sbio != NULL) + BIO_free(priv->ssl_sbio); + priv->ssl_ssl = NULL; + priv->ssl_sbio = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); From: Milind Changire > Sent: Monday, April 22, 2019 1:35 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Atin Mukherjee >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak 
issue found after enable ssl This probably went unnoticed until now. On Mon, Apr 22, 2019 at 10:45 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Why there is no bio_free called in ssl_teardown_connection then? cynthia From: Milind Changire > Sent: Monday, April 22, 2019 10:21 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Atin Mukherjee >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl After patch 22334, the priv->ssl_ctx is now maintained per socket connection and is no longer shared. So you might want to SSL_free(priv->ssl_ctx) as well and set priv->ssl_ctx to NULL. There might be some strings that are duplicated (gf_strdup()) via the socket_init() code path. Please take a look at those as well. Sorry about that. I missed it. On Mon, Apr 22, 2019 at 7:25 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, From my code study it seems priv->ssl_ssl is not properly released, I made a patch and the glusterfsd memory leak is alleviated with my patch, but some otherwhere is still leaking, I have no clue about the other leak points. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:31 PM To: 'Atin Mukherjee' > Cc: 'Raghavendra Gowdappa' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl We scan it use memory-leak tool, there are following prints. We doubt some open ssl lib malloc is is not properly freed by glusterfs code. 
er+0x2af [libglusterfs.so.0.0.1]\n\t\tstart_thread+0xda [libpthread-2.27.so]' 13580 bytes in 175 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]' 232904 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' [15:41:56] Top 10 stacks with outstanding allocations: 8792 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 9408 bytes in 42 allocations from stack b'CRYPTO_realloc+0x4d [libcrypto.so.1.0.2p]' 9723 bytes in 14 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 10696 bytes in 21 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11319 bytes in 602 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11431 bytes in 518 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' 11704 bytes in 371 allocations from stack b'CRYPTO_malloc+0x58 [libcrypto.so.1.0.2p]\n\t\t[unknown]' cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Thursday, April 18, 2019 5:25 PM To: 'Atin Mukherjee' > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl I?ve test on glusterfs3.12.15 and glusterfs5.5 all have this issue, after enable tls ssl socket, when execute gluster v heal info, will trigger glfshel to connect glusterfsd process, and cause glusterfsd process memory leak. Could you please try in your env? cynthia From: Atin Mukherjee > Sent: Thursday, April 18, 2019 1:19 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Raghavendra Gowdappa >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, 17 Apr 2019 at 10:53, Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, In my recent test, I found that there are very severe glusterfsd memory leak when enable socket ssl option What gluster version are you testing? Would you be able to continue your investigation and share the root cause? -- - Atin (atinm) _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Milind -- Milind -- Milind -------------- next part -------------- An HTML attachment was scrubbed... URL: From mchangir at redhat.com Mon Apr 22 06:36:04 2019 From: mchangir at redhat.com (Milind Changire) Date: Mon, 22 Apr 2019 12:06:04 +0530 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: According to BIO_new_socket() man page ... *If the close flag is set then the socket is shut down and closed when the BIO is freed.* For Gluster to have more control over the socket shutdown, the BIO_NOCLOSE flag is set. Otherwise, SSL takes control of socket shutdown whenever BIO is freed. -------------- next part -------------- An HTML attachment was scrubbed... 
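A minimal illustration of what that close flag changes (sketch only; sock_fd and the helper name are made up for the example):

#include <openssl/bio.h>

/* BIO_CLOSE: freeing the BIO also shuts down and closes sock_fd.
 * BIO_NOCLOSE: the caller keeps ownership of the fd, which lets the
 * gluster transport decide when to shutdown()/close() the socket
 * itself. */
static BIO *
make_socket_bio(int sock_fd)
{
    return BIO_new_socket(sock_fd, BIO_NOCLOSE);
}

So with BIO_NOCLOSE the transport remains responsible for closing priv->sock; SSL_free() only releases the SSL/BIO bookkeeping, not the descriptor.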
URL: From cynthia.zhou at nokia-sbell.com Mon Apr 22 08:07:39 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 22 Apr 2019 08:07:39 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> Message-ID: <5d0c2ed30e884b86ba29bff5a47c960e@nokia-sbell.com> Ok, I am clear now. I?ve added ssl_free in socket reset and socket finish function, though glusterfsd memory leak is not that much, still it is leaking, from source code I can not find anything else, Could you help to check if this issue exists in your env? If not I may have a try to merge your patch . Step 1> while true;do gluster v heal info, 2> check the vol-name glusterfsd memory usage, it is obviously increasing. cynthia From: Milind Changire Sent: Monday, April 22, 2019 2:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Atin Mukherjee ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl According to BIO_new_socket() man page ... If the close flag is set then the socket is shut down and closed when the BIO is freed. For Gluster to have more control over the socket shutdown, the BIO_NOCLOSE flag is set. Otherwise, SSL takes control of socket shutdown whenever BIO is freed. -------------- next part -------------- An HTML attachment was scrubbed... URL: From srangana at redhat.com Mon Apr 22 13:30:45 2019 From: srangana at redhat.com (Shyam Ranganathan) Date: Mon, 22 Apr 2019 09:30:45 -0400 Subject: [Gluster-devel] Announcing Gluster release 6.1 Message-ID: The Gluster community is pleased to announce the release of Gluster 6.1 (packages available at [1]). Release notes for the release can be found at [2]. Major changes, features and limitations addressed in this release: None Thanks, Gluster community [1] Packages for 6.1: https://download.gluster.org/pub/gluster/glusterfs/6/6.1/ [2] Release notes for 6.1: https://docs.gluster.org/en/latest/release-notes/6.1/ _______________________________________________ maintainers mailing list maintainers at gluster.org https://lists.gluster.org/mailman/listinfo/maintainers From dkhandel at redhat.com Mon Apr 22 14:09:46 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Mon, 22 Apr 2019 19:39:46 +0530 Subject: [Gluster-devel] ./tests/basic/afr/tarissue.t is failing on regression runs Message-ID: The above test is failing all day today because of rpc-statd going into inactive state on builders. I've been trying to enable this service going in dead state after every run. 1. rpcbind and rpcbind.socket is running on builders 2. ipv6 is disabled. 3. nfs-server is also working fine 4. is_nfs_export_available is failing for this test which means showmount is failing. Hence some issue with mount.nfs This test was not failing earlier. Does anyone know what has changed or how this can be fixed on builder? Jiffin and Soumya, can you please help me through this. -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Mon Apr 22 17:27:57 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Mon, 22 Apr 2019 22:57:57 +0530 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? 
In-Reply-To: <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> Message-ID: Is this back again? The recent patches are failing regression :-\ . On Wed, 3 Apr 2019 at 19:26, Michael Scherer wrote: > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan > > wrote: > > > > > Hi, > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > command AFAIR. > > > I saw following messages in console output. > > > mount.nfs: rpc.statd is not running but is required for remote > > > locking. > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, or > > > start > > > statd. > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > For me it looks rpcbind may not be running on the machine. > > > Usually rpcbind starts automatically on machines, don't know > > > whether it > > > can happen or not. > > > > > > > That's precisely what the question is. Why suddenly we're seeing this > > happening too frequently. Today I saw atleast 4 to 5 such failures > > already. > > > > Deepshika - Can you please help in inspecting this? > > So we think (we are not sure) that the issue is a bit complex. > > What we were investigating was nightly run fail on aws. When the build > crash, the builder is restarted, since that's the easiest way to clean > everything (since even with a perfect test suite that would clean > itself, we could always end in a corrupt state on the system, WRT > mount, fs, etc). > > In turn, this seems to cause trouble on aws, since cloud-init or > something rename eth0 interface to ens5, without cleaning to the > network configuration. > > So the network init script fail (because the image say "start eth0" and > that's not present), but fail in a weird way. Network is initialised > and working (we can connect), but the dhclient process is not in the > right cgroup, and network.service is in failed state. Restarting > network didn't work. In turn, this mean that rpc-statd refuse to start > (due to systemd dependencies), which seems to impact various NFS tests. > > We have also seen that on some builders, rpcbind pick some IP v6 > autoconfiguration, but we can't reproduce that, and there is no ip v6 > set up anywhere. I suspect the network.service failure is somehow > involved, but fail to see how. In turn, rpcbind.socket not starting > could cause NFS test troubles. > > Our current stop gap fix was to fix all the builders one by one. Remove > the config, kill the rogue dhclient, restart network service. > > However, we can't be sure this is going to fix the problem long term > since this only manifest after a crash of the test suite, and it > doesn't happen so often. (plus, it was working before some day in the > past, when something did make this fail, and I do not know if that's a > system upgrade, or a test change, or both). > > So we are still looking at it to have a complete understanding of the > issue, but so far, we hacked our way to make it work (or so do I > think). > > Deepshika is working to fix it long term, by fixing the issue regarding > eth0/ens5 with a new base image. > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From rabhat at redhat.com Mon Apr 22 21:20:58 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Mon, 22 Apr 2019 17:20:58 -0400 Subject: [Gluster-devel] [Gluster-users] Proposal: Changes in Gluster Community meetings In-Reply-To: References: <62104B6F-99CF-4C22-80FC-9C177F73E897@onholyground.com> Message-ID: Hi, This is the agenda for tomorrow's community meeting for NA/EMEA timezone. https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g?both ---- On Thu, Apr 11, 2019 at 4:56 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > Hi All, > > Below is the final details of our community meeting, and I will be sending > invites to mailing list following this email. You can add Gluster Community > Calendar so you can get notifications on the meetings. > > We are starting the meetings from next week. For the first meeting, we > need 1 volunteer from users to discuss the use case / what went well, and > what went bad, etc. preferrably in APAC region. NA/EMEA region, next week. > > Draft Content: https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g > ---- > Gluster Community Meeting > Previous > Meeting minutes: > > - http://github.com/gluster/community > > > Date/Time: > Check the community calendar > > Bridge > > - APAC friendly hours > - Bridge: https://bluejeans.com/836554017 > - NA/EMEA > - Bridge: https://bluejeans.com/486278655 > > ------------------------------ > Attendance > > - Name, Company > > Host > > - Who will host next meeting? > - Host will need to send out the agenda 24hr - 12hrs in advance to > mailing list, and also make sure to send the meeting minutes. > - Host will need to reach out to one user at least who can talk > about their usecase, their experience, and their needs. > - Host needs to send meeting minutes as PR to > http://github.com/gluster/community > > User stories > > - Discuss 1 usecase from a user. > - How was the architecture derived, what volume type used, options, > etc? > - What were the major issues faced ? How to improve them? > - What worked good? > - How can we all collaborate well, so it is win-win for the > community and the user? How can we > > Community > > - > > Any release updates? > - > > Blocker issues across the project? > - > > Metrics > - Number of new bugs since previous meeting. How many are not triaged? > - Number of emails, anything unanswered? > > Conferences > / Meetups > > - Any conference in next 1 month where gluster-developers are going? > gluster-users are going? So we can meet and discuss. > > Developer > focus > > - > > Any design specs to discuss? > - > > Metrics of the week? > - Coverity > - Clang-Scan > - Number of patches from new developers. > - Did we increase test coverage? > - [Atin] Also talk about most frequent test failures in the CI and > carve out an AI to get them fixed. > > RoundTable > > - > > ---- > > Regards, > Amar > > On Mon, Mar 25, 2019 at 8:53 PM Amar Tumballi Suryanarayan < > atumball at redhat.com> wrote: > >> Thanks for the feedback Darrell, >> >> The new proposal is to have one in North America 'morning' time. (10AM >> PST), And another in ASIA day time, which is evening 7pm/6pm in Australia, >> 9pm Newzealand, 5pm Tokyo, 4pm Beijing. >> >> For example, if we choose Every other Tuesday for meeting, and 1st of the >> month is Tuesday, we would have North America time for 1st, and on 15th it >> would be ASIA/Pacific time. 
>> >> Hopefully, this way, we can cover all the timezones, and meeting minutes >> would be committed to github repo, so that way, it will be easier for >> everyone to be aware of what is happening. >> >> Regards, >> Amar >> >> On Mon, Mar 25, 2019 at 8:40 PM Darrell Budic >> wrote: >> >>> As a user, I?d like to visit more of these, but the time slot is my 3AM. >>> Any possibility for a rolling schedule (move meeting +6 hours each week >>> with rolling attendance from maintainers?) or an occasional regional >>> meeting 12 hours opposed to the one you?re proposing? >>> >>> -Darrell >>> >>> On Mar 25, 2019, at 4:25 AM, Amar Tumballi Suryanarayan < >>> atumball at redhat.com> wrote: >>> >>> All, >>> >>> We currently have 3 meetings which are public: >>> >>> 1. Maintainer's Meeting >>> >>> - Runs once in 2 weeks (on Mondays), and current attendance is around >>> 3-5 on an avg, and not much is discussed. >>> - Without majority attendance, we can't take any decisions too. >>> >>> 2. Community meeting >>> >>> - Supposed to happen on #gluster-meeting, every 2 weeks, and is the >>> only meeting which is for 'Community/Users'. Others are for developers >>> as of now. >>> Sadly attendance is getting closer to 0 in recent times. >>> >>> 3. GCS meeting >>> >>> - We started it as an effort inside Red Hat gluster team, and opened it >>> up for community from Jan 2019, but the attendance was always from RHT >>> members, and haven't seen any traction from wider group. >>> >>> So, I have a proposal to call out for cancelling all these meeting, and >>> keeping just 1 weekly 'Community' meeting, where even topics related to >>> maintainers and GCS and other projects can be discussed. >>> >>> I have a template of a draft template @ >>> https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g >>> >>> Please feel free to suggest improvements, both in agenda and in timings. >>> So, we can have more participation from members of community, which allows >>> more user - developer interactions, and hence quality of project. >>> >>> Waiting for feedbacks, >>> >>> Regards, >>> Amar >>> >>> >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >> >> -- >> Amar Tumballi (amarts) >> > > > -- > Amar Tumballi (amarts) > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Tue Apr 23 14:14:49 2019 From: mscherer at redhat.com (Michael Scherer) Date: Tue, 23 Apr 2019 16:14:49 +0200 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> Message-ID: Le lundi 22 avril 2019 ? 22:57 +0530, Atin Mukherjee a ?crit : > Is this back again? The recent patches are failing regression :-\ . So, on builder206, it took me a while to find that the issue is that nfs (the service) was running. 
./tests/basic/afr/tarissue.t failed, because the nfs initialisation failed with a rather cryptic message: [2019-04-23 13:17:05.371733] I [socket.c:991:__socket_server_bind] 0- socket.nfs-server: process started listening on port (38465) [2019-04-23 13:17:05.385819] E [socket.c:972:__socket_server_bind] 0- socket.nfs-server: binding to failed: Address already in use [2019-04-23 13:17:05.385843] E [socket.c:974:__socket_server_bind] 0- socket.nfs-server: Port is already in use [2019-04-23 13:17:05.385852] E [socket.c:3788:socket_listen] 0- socket.nfs-server: __socket_server_bind failed;closing socket 14 I found where this came from, but a few stuff did surprised me: - the order of print is different that the order in the code - the message on "started listening" didn't take in account the fact that bind failed on: https://github.com/gluster/glusterfs/blob/master/rpc/rpc-transport/socket/src/socket.c#L967 The message about port 38465 also threw me off the track. The real issue is that the service nfs was already running, and I couldn't find anything listening on port 38465 once I do service nfs stop, it no longer failed. So far, I do know why nfs.service was activated. But at least, 206 should be fixed, and we know a bit more on what would be causing some failure. > On Wed, 3 Apr 2019 at 19:26, Michael Scherer > wrote: > > > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan < > > > jthottan at redhat.com> > > > wrote: > > > > > > > Hi, > > > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > > command AFAIR. > > > > I saw following messages in console output. > > > > mount.nfs: rpc.statd is not running but is required for remote > > > > locking. > > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, > > > > or > > > > start > > > > statd. > > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > > > For me it looks rpcbind may not be running on the machine. > > > > Usually rpcbind starts automatically on machines, don't know > > > > whether it > > > > can happen or not. > > > > > > > > > > That's precisely what the question is. Why suddenly we're seeing > > > this > > > happening too frequently. Today I saw atleast 4 to 5 such > > > failures > > > already. > > > > > > Deepshika - Can you please help in inspecting this? > > > > So we think (we are not sure) that the issue is a bit complex. > > > > What we were investigating was nightly run fail on aws. When the > > build > > crash, the builder is restarted, since that's the easiest way to > > clean > > everything (since even with a perfect test suite that would clean > > itself, we could always end in a corrupt state on the system, WRT > > mount, fs, etc). > > > > In turn, this seems to cause trouble on aws, since cloud-init or > > something rename eth0 interface to ens5, without cleaning to the > > network configuration. > > > > So the network init script fail (because the image say "start eth0" > > and > > that's not present), but fail in a weird way. Network is > > initialised > > and working (we can connect), but the dhclient process is not in > > the > > right cgroup, and network.service is in failed state. Restarting > > network didn't work. In turn, this mean that rpc-statd refuse to > > start > > (due to systemd dependencies), which seems to impact various NFS > > tests. 
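Coming back to the __socket_server_bind() messages earlier in this mail: one way to avoid the confusing ordering would be to log success only after the bind() result has been checked. A rough sketch, not the actual socket.c code (the helper name, the name/port parameters and the includes are assumptions):

#include <errno.h>
#include <string.h>
#include <sys/socket.h>

/* Log "listening" only once bind() has actually succeeded. */
static int
server_bind_sketch(int sock, struct sockaddr *addr, socklen_t len,
                   const char *name, int port)
{
    if (bind(sock, addr, len) != 0) {
        gf_log(name, GF_LOG_ERROR, "binding to port %d failed: %s",
               port, strerror(errno));
        return -1;
    }
    gf_log(name, GF_LOG_INFO, "process started listening on port (%d)", port);
    return 0;
}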
> > > > We have also seen that on some builders, rpcbind pick some IP v6 > > autoconfiguration, but we can't reproduce that, and there is no ip > > v6 > > set up anywhere. I suspect the network.service failure is somehow > > involved, but fail to see how. In turn, rpcbind.socket not starting > > could cause NFS test troubles. > > > > Our current stop gap fix was to fix all the builders one by one. > > Remove > > the config, kill the rogue dhclient, restart network service. > > > > However, we can't be sure this is going to fix the problem long > > term > > since this only manifest after a crash of the test suite, and it > > doesn't happen so often. (plus, it was working before some day in > > the > > past, when something did make this fail, and I do not know if > > that's a > > system upgrade, or a test change, or both). > > > > So we are still looking at it to have a complete understanding of > > the > > issue, but so far, we hacked our way to make it work (or so do I > > think). > > > > Deepshika is working to fix it long term, by fixing the issue > > regarding > > eth0/ens5 with a new base image. > > -- > > Michael Scherer > > Sysadmin, Community Infrastructure and Platform, OSAS > > > > > > -- > > - Atin (atinm) -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From nbalacha at redhat.com Wed Apr 24 03:14:10 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Wed, 24 Apr 2019 08:44:10 +0530 Subject: [Gluster-devel] BZ updates Message-ID: All, When working on a bug, please ensure that you update the BZ with any relevant information as well as the RCA. I have seen several BZs in the past which report crashes, however they do not have a bt or RCA captured. Having this information in the BZ makes it much easier to see if a newly reported issue has already been fixed. I propose that maintainers merge patches only if the BZs are updated with required information. It will take some time to make this a habit but it will pay off in the end. Regards, Nithya -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Wed Apr 24 03:25:50 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Wed, 24 Apr 2019 08:55:50 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] BZ updates In-Reply-To: References: Message-ID: Absolutely agree and I definitely think this would help going forward. On Wed, Apr 24, 2019 at 8:45 AM Nithya Balachandran wrote: > All, > > When working on a bug, please ensure that you update the BZ with any > relevant information as well as the RCA. I have seen several BZs in the > past which report crashes, however they do not have a bt or RCA captured. > Having this information in the BZ makes it much easier to see if a newly > reported issue has already been fixed. > > I propose that maintainers merge patches only if the BZs are updated with > required information. It will take some time to make this a habit but it > will pay off in the end. > > Regards, > Nithya > _______________________________________________ > maintainers mailing list > maintainers at gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jthottan at redhat.com Wed Apr 24 06:59:11 2019 From: jthottan at redhat.com (Jiffin Thottan) Date: Wed, 24 Apr 2019 02:59:11 -0400 (EDT) Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> Message-ID: <527845347.24314753.1556089151794.JavaMail.zimbra@redhat.com> Below looks like kernel nfs was started (may be enabled on the machine). Did u start rpcbind manually on that machine, if yes can u please check kernel nfs status before and after that service? -- Jiffin ----- Original Message ----- From: "Michael Scherer" To: "Atin Mukherjee" Cc: "Deepshikha Khandelwal" , "Gluster Devel" , "Jiffin Thottan" , "gluster-infra" Sent: Tuesday, April 23, 2019 7:44:49 PM Subject: Re: [Gluster-infra] [Gluster-devel] is_nfs_export_available from nfs.rc failing too often? Le lundi 22 avril 2019 ? 22:57 +0530, Atin Mukherjee a ?crit : > Is this back again? The recent patches are failing regression :-\ . So, on builder206, it took me a while to find that the issue is that nfs (the service) was running. ./tests/basic/afr/tarissue.t failed, because the nfs initialisation failed with a rather cryptic message: [2019-04-23 13:17:05.371733] I [socket.c:991:__socket_server_bind] 0- socket.nfs-server: process started listening on port (38465) [2019-04-23 13:17:05.385819] E [socket.c:972:__socket_server_bind] 0- socket.nfs-server: binding to failed: Address already in use [2019-04-23 13:17:05.385843] E [socket.c:974:__socket_server_bind] 0- socket.nfs-server: Port is already in use [2019-04-23 13:17:05.385852] E [socket.c:3788:socket_listen] 0- socket.nfs-server: __socket_server_bind failed;closing socket 14 I found where this came from, but a few stuff did surprised me: - the order of print is different that the order in the code - the message on "started listening" didn't take in account the fact that bind failed on: https://github.com/gluster/glusterfs/blob/master/rpc/rpc-transport/socket/src/socket.c#L967 The message about port 38465 also threw me off the track. The real issue is that the service nfs was already running, and I couldn't find anything listening on port 38465 once I do service nfs stop, it no longer failed. So far, I do know why nfs.service was activated. But at least, 206 should be fixed, and we know a bit more on what would be causing some failure. > On Wed, 3 Apr 2019 at 19:26, Michael Scherer > wrote: > > > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan < > > > jthottan at redhat.com> > > > wrote: > > > > > > > Hi, > > > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > > command AFAIR. > > > > I saw following messages in console output. > > > > mount.nfs: rpc.statd is not running but is required for remote > > > > locking. > > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, > > > > or > > > > start > > > > statd. > > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > > > For me it looks rpcbind may not be running on the machine. > > > > Usually rpcbind starts automatically on machines, don't know > > > > whether it > > > > can happen or not. > > > > > > > > > > That's precisely what the question is. Why suddenly we're seeing > > > this > > > happening too frequently. Today I saw atleast 4 to 5 such > > > failures > > > already. 
> > > > > > Deepshika - Can you please help in inspecting this? > > > > So we think (we are not sure) that the issue is a bit complex. > > > > What we were investigating was nightly run fail on aws. When the > > build > > crash, the builder is restarted, since that's the easiest way to > > clean > > everything (since even with a perfect test suite that would clean > > itself, we could always end in a corrupt state on the system, WRT > > mount, fs, etc). > > > > In turn, this seems to cause trouble on aws, since cloud-init or > > something rename eth0 interface to ens5, without cleaning to the > > network configuration. > > > > So the network init script fail (because the image say "start eth0" > > and > > that's not present), but fail in a weird way. Network is > > initialised > > and working (we can connect), but the dhclient process is not in > > the > > right cgroup, and network.service is in failed state. Restarting > > network didn't work. In turn, this mean that rpc-statd refuse to > > start > > (due to systemd dependencies), which seems to impact various NFS > > tests. > > > > We have also seen that on some builders, rpcbind pick some IP v6 > > autoconfiguration, but we can't reproduce that, and there is no ip > > v6 > > set up anywhere. I suspect the network.service failure is somehow > > involved, but fail to see how. In turn, rpcbind.socket not starting > > could cause NFS test troubles. > > > > Our current stop gap fix was to fix all the builders one by one. > > Remove > > the config, kill the rogue dhclient, restart network service. > > > > However, we can't be sure this is going to fix the problem long > > term > > since this only manifest after a crash of the test suite, and it > > doesn't happen so often. (plus, it was working before some day in > > the > > past, when something did make this fail, and I do not know if > > that's a > > system upgrade, or a test change, or both). > > > > So we are still looking at it to have a complete understanding of > > the > > issue, but so far, we hacked our way to make it work (or so do I > > think). > > > > Deepshika is working to fix it long term, by fixing the issue > > regarding > > eth0/ens5 with a new base image. > > -- > > Michael Scherer > > Sysadmin, Community Infrastructure and Platform, OSAS > > > > > > -- > > - Atin (atinm) -- Michael Scherer Sysadmin, Community Infrastructure From ndevos at redhat.com Wed Apr 24 07:44:58 2019 From: ndevos at redhat.com (Niels de Vos) Date: Wed, 24 Apr 2019 09:44:58 +0200 Subject: [Gluster-devel] [Gluster-Maintainers] BZ updates In-Reply-To: References: Message-ID: <20190424074458.GG14100@ndevos-x270> On Wed, Apr 24, 2019 at 08:44:10AM +0530, Nithya Balachandran wrote: > All, > > When working on a bug, please ensure that you update the BZ with any > relevant information as well as the RCA. I have seen several BZs in the > past which report crashes, however they do not have a bt or RCA captured. > Having this information in the BZ makes it much easier to see if a newly > reported issue has already been fixed. > > I propose that maintainers merge patches only if the BZs are updated with > required information. It will take some time to make this a habit but it > will pay off in the end. Great point! I really hope that most of the contributors know that debugging steps in bugs are extremely valuable. When documented in a bug, similar issues can be analyzed with the same techniques. 
As a reminder for this, I'm proposing this addition to the Maintainer Guidelines: https://github.com/gluster/glusterdocs/pull/471 - Ensure the related Bug or GitHub Issue has sufficient details about the cause of the problem, or description of the introduction for the change. I'd appreciate it when someone can approve and merge that. Of course, suggestions for rephrasing are welcome too. Niels From ykaul at redhat.com Wed Apr 24 11:59:11 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Wed, 24 Apr 2019 14:59:11 +0300 Subject: [Gluster-devel] [Gluster-infra] is_nfs_export_available from nfs.rc failing too often? In-Reply-To: References: <2056284426.17636953.1554272780313.JavaMail.zimbra@redhat.com> <797512f6ff7f1b9fedbf8b7968dd86a6968d9105.camel@redhat.com> Message-ID: On Tue, Apr 23, 2019 at 5:15 PM Michael Scherer wrote: > Le lundi 22 avril 2019 ? 22:57 +0530, Atin Mukherjee a ?crit : > > Is this back again? The recent patches are failing regression :-\ . > > So, on builder206, it took me a while to find that the issue is that > nfs (the service) was running. > > ./tests/basic/afr/tarissue.t failed, because the nfs initialisation > failed with a rather cryptic message: > > [2019-04-23 13:17:05.371733] I [socket.c:991:__socket_server_bind] 0- > socket.nfs-server: process started listening on port (38465) > [2019-04-23 13:17:05.385819] E [socket.c:972:__socket_server_bind] 0- > socket.nfs-server: binding to failed: Address already in use > [2019-04-23 13:17:05.385843] E [socket.c:974:__socket_server_bind] 0- > socket.nfs-server: Port is already in use > [2019-04-23 13:17:05.385852] E [socket.c:3788:socket_listen] 0- > socket.nfs-server: __socket_server_bind failed;closing socket 14 > > I found where this came from, but a few stuff did surprised me: > > - the order of print is different that the order in the code > Indeed strange... > - the message on "started listening" didn't take in account the fact > that bind failed on: > Shouldn't it bail out if it failed to bind? Some missing 'goto out' around line 975/976? Y. > > > > https://github.com/gluster/glusterfs/blob/master/rpc/rpc-transport/socket/src/socket.c#L967 > > The message about port 38465 also threw me off the track. The real > issue is that the service nfs was already running, and I couldn't find > anything listening on port 38465 > > once I do service nfs stop, it no longer failed. > > So far, I do know why nfs.service was activated. > > But at least, 206 should be fixed, and we know a bit more on what would > be causing some failure. > > > > > On Wed, 3 Apr 2019 at 19:26, Michael Scherer > > wrote: > > > > > Le mercredi 03 avril 2019 ? 16:30 +0530, Atin Mukherjee a ?crit : > > > > On Wed, Apr 3, 2019 at 11:56 AM Jiffin Thottan < > > > > jthottan at redhat.com> > > > > wrote: > > > > > > > > > Hi, > > > > > > > > > > is_nfs_export_available is just a wrapper around "showmount" > > > > > command AFAIR. > > > > > I saw following messages in console output. > > > > > mount.nfs: rpc.statd is not running but is required for remote > > > > > locking. > > > > > 05:06:55 mount.nfs: Either use '-o nolock' to keep locks local, > > > > > or > > > > > start > > > > > statd. > > > > > 05:06:55 mount.nfs: an incorrect mount option was specified > > > > > > > > > > For me it looks rpcbind may not be running on the machine. > > > > > Usually rpcbind starts automatically on machines, don't know > > > > > whether it > > > > > can happen or not. > > > > > > > > > > > > > That's precisely what the question is. 
Why suddenly we're seeing > > > > this > > > > happening too frequently. Today I saw atleast 4 to 5 such > > > > failures > > > > already. > > > > > > > > Deepshika - Can you please help in inspecting this? > > > > > > So we think (we are not sure) that the issue is a bit complex. > > > > > > What we were investigating was nightly run fail on aws. When the > > > build > > > crash, the builder is restarted, since that's the easiest way to > > > clean > > > everything (since even with a perfect test suite that would clean > > > itself, we could always end in a corrupt state on the system, WRT > > > mount, fs, etc). > > > > > > In turn, this seems to cause trouble on aws, since cloud-init or > > > something rename eth0 interface to ens5, without cleaning to the > > > network configuration. > > > > > > So the network init script fail (because the image say "start eth0" > > > and > > > that's not present), but fail in a weird way. Network is > > > initialised > > > and working (we can connect), but the dhclient process is not in > > > the > > > right cgroup, and network.service is in failed state. Restarting > > > network didn't work. In turn, this mean that rpc-statd refuse to > > > start > > > (due to systemd dependencies), which seems to impact various NFS > > > tests. > > > > > > We have also seen that on some builders, rpcbind pick some IP v6 > > > autoconfiguration, but we can't reproduce that, and there is no ip > > > v6 > > > set up anywhere. I suspect the network.service failure is somehow > > > involved, but fail to see how. In turn, rpcbind.socket not starting > > > could cause NFS test troubles. > > > > > > Our current stop gap fix was to fix all the builders one by one. > > > Remove > > > the config, kill the rogue dhclient, restart network service. > > > > > > However, we can't be sure this is going to fix the problem long > > > term > > > since this only manifest after a crash of the test suite, and it > > > doesn't happen so often. (plus, it was working before some day in > > > the > > > past, when something did make this fail, and I do not know if > > > that's a > > > system upgrade, or a test change, or both). > > > > > > So we are still looking at it to have a complete understanding of > > > the > > > issue, but so far, we hacked our way to make it work (or so do I > > > think). > > > > > > Deepshika is working to fix it long term, by fixing the issue > > > regarding > > > eth0/ens5 with a new base image. > > > -- > > > Michael Scherer > > > Sysadmin, Community Infrastructure and Platform, OSAS > > > > > > > > > -- > > > > - Atin (atinm) > -- > Michael Scherer > Sysadmin, Community Infrastructure > > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From rgowdapp at redhat.com Thu Apr 25 06:07:23 2019 From: rgowdapp at redhat.com (Raghavendra Gowdappa) Date: Thu, 25 Apr 2019 11:37:23 +0530 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: On Mon, Apr 15, 2019 at 12:52 PM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > The reason why I move event_handled to the end of socket_event_poll_in is > because if event_handled is called before rpc_transport_pollin_destroy, it > allowed another round of event_dispatch_epoll_handler happen before > rpc_transport_pollin_destroy, in this way, when the latter poll in goes to > rpc_transport_pollin_destroy , there is a chance that the pollin->iobref > has already been destroyed by the first one(there is no lock destroy for > iobref->lock in iobref_destroy by the way). That may cause stuck in ?LOCK > (&iobref->lock);? > But, priv->incoming.iobref (from which pollin->iobref is initialized from) is set to NULL in __socket_proto_state_machine: if (in->record_state == SP_STATE_COMPLETE) { in->record_state = SP_STATE_NADA; __socket_reset_priv (priv); } And since pollin is an allocated object only one instance of socket_event_poll_in will be aware of this object. IOW, multiple instances of socket_event_poll_in will get different pollin objects. So, the only way pollin->iobref could be shared across multiple invocations of socket_event_poll_in is due to common shared object priv->incoming.iobref. But that too is sanitized by the time __socket_proto_state_machine completes and __socket_proto_state_machine is executed under lock. So, I don't see how two different concurrent codepaths can get hold of same iobref. I find the one of recent patch > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > I think the point is to avoid the concurrent handling of the same socket > at the same time, but after my test with this patch this problem also > exists, so I think event_handled is still called too early to allow > concurrent handling of the same socket happen, and after move it to the end > of socket_event_poll this glusterd stuck issue disappeared. > > cynthia > > *From:* Raghavendra Gowdappa > *Sent:* Monday, April 15, 2019 2:36 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Ok, thanks for your comment! > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Monday, April 15, 2019 11:52 AM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > Cynthia, > > > > On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > I made a patch and according to my test, this glusterd stuck issue > disappear with my patch. Only need to move event_handled to the end of > socket_event_poll_in function. 
> > > > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > > > - if (notify_handled && (ret != -1)) > > - event_handled (ctx->event_pool, priv->sock, priv->idx, > > - priv->gen); > > @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, > gf_boolean_t notify_handled) > > } > > pthread_mutex_unlock (&priv->notify.lock); > > } > > + if (notify_handled && (ret != -1)) > > + event_handled (ctx->event_pool, priv->sock, priv->idx, > > + priv->gen); > > > > Thanks for this tip. Though this helps in fixing the hang, this change has > performance impact. Moving event_handled to end of poll_in means, socket > will be added back for polling of new events only _after_ the rpc is msg is > processed by higher layers (like EC) and higher layers can have significant > latency for processing the msg. Which means, socket will be out of polling > for longer periods of time which decreases the throughput (number of msgs > read per second) affecting performance. However, this experiment definitely > indicates there is a codepath where event_handled is not called (and hence > causing the hang). I'll go through this codepath again. > > > > Can you check whether patch [1] fixes the issue you are seeing? > > > > [1] https://review.gluster.org/#/c/glusterfs/+/22566 > > > > > > Thanks for that experiment :). > > > > return ret; > > } > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Tuesday, April 09, 2019 3:57 PM > *To:* 'Raghavendra Gowdappa' > *Cc:* gluster-devel at gluster.org > *Subject:* RE: glusterd stuck for glusterfs with version 3.12.15 > > > > Can you figure out some possible reason why iobref is corrupted, is it > possible that thread 8 has called poll in and iobref has been relased, but > the lock within it is not properly released (as I can not find any free > lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy > again, and so stuck on this lock > > Also, there should not be two thread handling the same socket at the same > time, although there has been a patch claimed to tackle this issue. > > > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Tuesday, April 09, 2019 3:52 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* gluster-devel at gluster.org > *Subject:* Re: glusterd stuck for glusterfs with version 3.12.15 > > > > > > > > On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi glusterfs experts, > > Good day! > > In my test env, sometimes glusterd stuck issue happened, and it is not > responding to any gluster commands, when I checked this issue I find that > glusterd thread 9 and thread 8 is dealing with the same socket, I thought > following patch should be able to solve this issue, however after I merged > this patch this issue still exist. When I looked into this code, it seems > socket_event_poll_in called event_handled before > rpc_transport_pollin_destroy, I think this gives the chance for another > poll for the exactly the same socket. And caused this glusterd stuck issue, > also, I find there is no LOCK_DESTROY(&iobref->lock) > > In iobref_destroy, I think it is better to add destroy lock. > > Following is the gdb info when this issue happened, I would like to know > your opinion on this issue, thanks! 
> > > > SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 > > > > ** socket: fix issue on concurrent handle of a socket* > > > > > > > > *GDB INFO:* > > Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in > iobref_unref, I think > > Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from > /lib64/libpthread.so.0 > > #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from > /lib64/libpthread.so.0 > > #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, > gen=4, idx=27) at socket.c:1201 > > #4 0x00007f9ee4fbf99c in socket_event_handler (*fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0*) at socket.c:2480 > > #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 > > #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at > event-epoll.c:659 > > #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > > > (gdb) thread 9 > > [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > (gdb) bt > > #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 > > #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy > (pollin=0x7f9ed00452d0) at rpc-transport.c:123 > > #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, > notify_handled=_gf_true) at socket.c:2322 > > #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, > data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 > > #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler > (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 > > #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at > event-epoll.c:659 > > #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 > > #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 > > (gdb) frame 2 > > #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at > iobuf.c:944 > > 944 iobuf.c: No such file or directory. 
> > (gdb) print *iobref > > $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, > __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, > > __elision = 0, __list = {__prev = 0x4000, __next = > 0x7f9ed00063b000}}, > > __size = > "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@ > \000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, > ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} > > > > looks like the iobref is corrupted here. It seems to be a use-after-free > issue. We need to dig into why a freed iobref is being accessed here. > > > > (gdb) quit > > A debugging session is active. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Fri Apr 26 07:54:00 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 26 Apr 2019 13:24:00 +0530 Subject: [Gluster-devel] One more way to contact Gluster team - Slack (gluster.slack.com) Message-ID: Hi All, We wanted to move to Slack from IRC for our official communication channel from sometime, but couldn't as we didn't had a proper URL for us to register. 'gluster' was taken and we didn't knew who had it registered. Thanks to constant ask from Satish, Slack team has now agreed to let us use https://gluster.slack.com and I am happy to invite you all there. (Use this link to join) Please note that, it won't be a replacement for mailing list. But can be used by all developers and users for quick communication. Also note that, no information there would be 'stored' beyond 10k lines as we are using the free version of Slack. Regards, Amar -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Fri Apr 26 08:16:39 2019 From: mscherer at redhat.com (Michael Scherer) Date: Fri, 26 Apr 2019 10:16:39 +0200 Subject: [Gluster-devel] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a ?crit : > Hi All, > > We wanted to move to Slack from IRC for our official communication > channel > from sometime, but couldn't as we didn't had a proper URL for us to > register. 'gluster' was taken and we didn't knew who had it > registered. > Thanks to constant ask from Satish, Slack team has now agreed to let > us use > https://gluster.slack.com and I am happy to invite you all there. > (Use this > link > < > https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA > > > to > join) > > Please note that, it won't be a replacement for mailing list. But can > be > used by all developers and users for quick communication. Also note > that, > no information there would be 'stored' beyond 10k lines as we are > using the > free version of Slack. Aren't we concerned about the ToS of slack ? Last time I did read them, they were quite scary (like, if you use your corporate email, you engage your employer, and that wasn't the worst part). Also, to anticipate the question, my employer Legal department told me to not setup a bridge between IRC and slack, due to the said ToS. -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From kkeithle at redhat.com Fri Apr 26 12:57:43 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Fri, 26 Apr 2019 08:57:43 -0400 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: On Fri, Apr 26, 2019 at 8:21 AM Harold Miller wrote: > Has Red Hat security cleared the Slack systems for confidential / customer > information? > > If not, it will make it difficult for support to collect/answer questions. > I'm pretty sure Amar meant as a replacement for the freenode #gluster and #gluster-dev channels, given that he sent this to the public gluster mailing lists @gluster.org. Nobody should have even been posting confidential and/or customer information to any of those lists or channels. And AFAIK nobody ever has. Amar, would you like to clarify which IRC channels you meant? > Harold Miller, Associate Manager, > Red Hat, Enterprise Cloud Support > Desk - US (650) 254-4346 > > > > On Fri, Apr 26, 2019 at 6:00 AM Scott Worthington < > scott.c.worthington at gmail.com> wrote: > >> Hello, are you not _BOTH_ Red Hat FTEs or contractors? >> >> On Fri, Apr 26, 2019, 3:16 AM Michael Scherer >> wrote: >> >>> Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a >>> ?crit : >>> > Hi All, >>> > >>> > We wanted to move to Slack from IRC for our official communication >>> > channel >>> > from sometime, but couldn't as we didn't had a proper URL for us to >>> > register. 'gluster' was taken and we didn't knew who had it >>> > registered. >>> > Thanks to constant ask from Satish, Slack team has now agreed to let >>> > us use >>> > https://gluster.slack.com and I am happy to invite you all there. >>> > (Use this >>> > link >>> > < >>> > >>> https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA >>> > > >>> > to >>> > join) >>> > >>> > Please note that, it won't be a replacement for mailing list. But can >>> > be >>> > used by all developers and users for quick communication. Also note >>> > that, >>> > no information there would be 'stored' beyond 10k lines as we are >>> > using the >>> > free version of Slack. >>> >>> Aren't we concerned about the ToS of slack ? Last time I did read them, >>> they were quite scary (like, if you use your corporate email, you >>> engage your employer, and that wasn't the worst part). >>> >>> Also, to anticipate the question, my employer Legal department told me >>> to not setup a bridge between IRC and slack, due to the said ToS. >>> >>> -- >>> Michael Scherer >>> Sysadmin, Community Infrastructure >>> >>> >>> >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > > HAROLD MILLER > > ASSOCIATE MANAGER, ENTERPRISE CLOUD SUPPORT > > Red Hat > > > > Harold at RedHat.com T: (650)-254-4346 > > TRIED. TESTED. TRUSTED. 
> _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Fri Apr 26 13:20:09 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 26 Apr 2019 18:50:09 +0530 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: On Fri, Apr 26, 2019 at 6:27 PM Kaleb Keithley wrote: > > > On Fri, Apr 26, 2019 at 8:21 AM Harold Miller wrote: > >> Has Red Hat security cleared the Slack systems for confidential / >> customer information? >> >> If not, it will make it difficult for support to collect/answer questions. >> > > I'm pretty sure Amar meant as a replacement for the freenode #gluster and > #gluster-dev channels, given that he sent this to the public gluster > mailing lists @gluster.org. Nobody should have even been posting > confidential and/or customer information to any of those lists or channels. > And AFAIK nobody ever has. > > Yep, I am only talking about IRC (from freenode, #gluster, #gluster-dev etc). Also, I am not saying we are 'replacing IRC'. Gluster as a project started in pre-Slack era, and we have many users who prefer to stay in IRC. So, for now, no pressure to make a statement calling Slack channel as a 'Replacement' to IRC. > Amar, would you like to clarify which IRC channels you meant? > > Thanks Kaleb. I was bit confused on why the concern of it came up in this group. > >> On Fri, Apr 26, 2019 at 6:00 AM Scott Worthington < >> scott.c.worthington at gmail.com> wrote: >> >>> Hello, are you not _BOTH_ Red Hat FTEs or contractors? >>> >>> Yes! but come from very different internal teams. Michael supports Gluster (the project) team's Infrastructure needs, and has valid concerns from his perspective :-) I, on the other hand, bother more about code, users, and how to make sure we are up-to-date with other technologies and communities, from the engineering view point. > On Fri, Apr 26, 2019, 3:16 AM Michael Scherer wrote: >>> >>>> Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a >>>> ?crit : >>>> > Hi All, >>>> > >>>> > We wanted to move to Slack from IRC for our official communication >>>> > channel >>>> > from sometime, but couldn't as we didn't had a proper URL for us to >>>> > register. 'gluster' was taken and we didn't knew who had it >>>> > registered. >>>> > Thanks to constant ask from Satish, Slack team has now agreed to let >>>> > us use >>>> > https://gluster.slack.com and I am happy to invite you all there. >>>> > (Use this >>>> > link >>>> > < >>>> > >>>> https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA >>>> > > >>>> > to >>>> > join) >>>> > >>>> > Please note that, it won't be a replacement for mailing list. But can >>>> > be >>>> > used by all developers and users for quick communication. Also note >>>> > that, >>>> > no information there would be 'stored' beyond 10k lines as we are >>>> > using the >>>> > free version of Slack. >>>> >>>> Aren't we concerned about the ToS of slack ? Last time I did read them, >>>> they were quite scary (like, if you use your corporate email, you >>>> engage your employer, and that wasn't the worst part). 
>>>> >>>> Also, to anticipate the question, my employer Legal department told me >>>> to not setup a bridge between IRC and slack, due to the said ToS. >>>> >>>> Again, re-iterating here. Not planning to use any bridges from IRC to Slack. I re-read the Slack API Terms and condition. And it makes sense. They surely don't want us to build another slack, or abuse slack with too many API requests made for collecting logs. Currently, to start with, we are not adding any bots (other than github bot). Hopefully, that will keep us under proper usage guidelines. -Amar > -- >>>> Michael Scherer >>>> Sysadmin, Community Infrastructure >>>> >>>> >>>> >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> >> -- >> >> HAROLD MILLER >> >> ASSOCIATE MANAGER, ENTERPRISE CLOUD SUPPORT >> >> Red Hat >> >> >> >> Harold at RedHat.com T: (650)-254-4346 >> >> TRIED. TESTED. TRUSTED. >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Fri Apr 26 14:43:53 2019 From: mscherer at redhat.com (Michael Scherer) Date: Fri, 26 Apr 2019 16:43:53 +0200 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: <52de190db15f5c1225029b6785f266d72751657e.camel@redhat.com> Le vendredi 26 avril 2019 ? 04:59 -0500, Scott Worthington a ?crit : > Hello, are you not _BOTH_ Red Hat FTEs or contractors? We do, that was just a turn of phrase to be more generic. > On Fri, Apr 26, 2019, 3:16 AM Michael Scherer > wrote: > > > Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan > > a > > ?crit : > > > Hi All, > > > > > > We wanted to move to Slack from IRC for our official > > > communication > > > channel > > > from sometime, but couldn't as we didn't had a proper URL for us > > > to > > > register. 'gluster' was taken and we didn't knew who had it > > > registered. > > > Thanks to constant ask from Satish, Slack team has now agreed to > > > let > > > us use > > > https://gluster.slack.com and I am happy to invite you all there. > > > (Use this > > > link > > > < > > > > > > > https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA > > > > > > > > > > to > > > join) > > > > > > Please note that, it won't be a replacement for mailing list. But > > > can > > > be > > > used by all developers and users for quick communication. Also > > > note > > > that, > > > no information there would be 'stored' beyond 10k lines as we are > > > using the > > > free version of Slack. > > > > Aren't we concerned about the ToS of slack ? Last time I did read > > them, > > they were quite scary (like, if you use your corporate email, you > > engage your employer, and that wasn't the worst part). > > > > Also, to anticipate the question, my employer Legal department told > > me > > to not setup a bridge between IRC and slack, due to the said ToS. 
> > > > -- > > Michael Scherer > > Sysadmin, Community Infrastructure > > > > > > > > _______________________________________________ > > Gluster-users mailing list > > Gluster-users at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-users -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From junsongli at fb.com Fri Apr 26 17:23:35 2019 From: junsongli at fb.com (Junsong Li) Date: Fri, 26 Apr 2019 17:23:35 +0000 Subject: [Gluster-devel] questions on callstubs and "link-count" in index translator Message-ID: Hello list, I have a couple of questions on index translator implementation. * Why does gluster need callstub and a different worker queue (and thread) to process those call stubs? Is it just to lower the priority of fops of internal inodes? * What?s the purpose of ?link-count? in xdata? It?s being used only in index_fstat and index_lookup. I see sometimes the key is assigned 0/1 after callback, and sometimes AFR uses it to store flag GF_XATTROP_INDEX_COUNT. Is the code purposely reusing the key? Thanks, Junsong -------------- next part -------------- An HTML attachment was scrubbed... URL: From ravishankar at redhat.com Sat Apr 27 06:20:01 2019 From: ravishankar at redhat.com (Ravishankar N) Date: Sat, 27 Apr 2019 11:50:01 +0530 Subject: [Gluster-devel] questions on callstubs and "link-count" in index translator In-Reply-To: References: Message-ID: <59a8ac38-4ae5-06e1-4b0d-e3e8efba16eb@redhat.com> On 26/04/19 10:53 PM, Junsong Li wrote: > > Hello list, > > I have a couple of questions on index translator implementation. > > * Why does gluster need callstub and a different worker queue (and > thread) to process those call stubs? Is it just to lower the > priority of fops of internal inodes? > As far as I know, this is to move the processing to background and free up the server thread to process other requests. > > * > > > * What?s the purpose of ?link-count? in xdata? It?s being used only > in index_fstat and index_lookup. I see sometimes the key is > assigned 0/1 after callback, and sometimes AFR uses it to store > flag GF_XATTROP_INDEX_COUNT. Is the code purposely reusing the key? > A non-zero link count means there are entries that are pending heal. AFR requests this information in lookup and fstat fops and updates priv->need_heal in the fop-callbacks. It then uses that information to not nullify the inodes of the entries fetched during a readdirp call, improving readdirp performance. https://review.gluster.org/#/c/glusterfs/+/12507/ is the patch that introduced it. HTH, Ravi > > * > > Thanks, > > Junsong > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From cynthia.zhou at nokia-sbell.com Sun Apr 28 02:12:10 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Sun, 28 Apr 2019 02:12:10 +0000 Subject: [Gluster-devel] glusterd stuck for glusterfs with version 3.12.15 In-Reply-To: References: <2c963e64775f4a35b43d651906ce30ef@nokia-sbell.com> <6ec3136489ac4d119e43ec24256c1240@nokia-sbell.com> <32f4f1f20d344a0285fd7c17647b879e@nokia-sbell.com> Message-ID: <22176e9b4f1746beb550f6b922e8940e@nokia-sbell.com> Hi, Ok, I see your point ,two rounds of pollin means two different iobref, so the first pollin will not affect the second. But from the stack the second poll in stuck in iobref_unref LOCK operation, Do you think it is possible that the iobref_destroy does not deal with this iobref->lock will cause it to be some unsanitized value and cause this stuck? cynthia From: Raghavendra Gowdappa Sent: Thursday, April 25, 2019 2:07 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 15, 2019 at 12:52 PM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, The reason why I move event_handled to the end of socket_event_poll_in is because if event_handled is called before rpc_transport_pollin_destroy, it allowed another round of event_dispatch_epoll_handler happen before rpc_transport_pollin_destroy, in this way, when the latter poll in goes to rpc_transport_pollin_destroy , there is a chance that the pollin->iobref has already been destroyed by the first one(there is no lock destroy for iobref->lock in iobref_destroy by the way). That may cause stuck in ?LOCK (&iobref->lock);? But, priv->incoming.iobref (from which pollin->iobref is initialized from) is set to NULL in __socket_proto_state_machine: if (in->record_state == SP_STATE_COMPLETE) { in->record_state = SP_STATE_NADA; __socket_reset_priv (priv); } And since pollin is an allocated object only one instance of socket_event_poll_in will be aware of this object. IOW, multiple instances of socket_event_poll_in will get different pollin objects. So, the only way pollin->iobref could be shared across multiple invocations of socket_event_poll_in is due to common shared object priv->incoming.iobref. But that too is sanitized by the time __socket_proto_state_machine completes and __socket_proto_state_machine is executed under lock. So, I don't see how two different concurrent codepaths can get hold of same iobref. I find the one of recent patch SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket I think the point is to avoid the concurrent handling of the same socket at the same time, but after my test with this patch this problem also exists, so I think event_handled is still called too early to allow concurrent handling of the same socket happen, and after move it to the end of socket_event_poll this glusterd stuck issue disappeared. cynthia From: Raghavendra Gowdappa > Sent: Monday, April 15, 2019 2:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 15, 2019 at 11:08 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Ok, thanks for your comment! 
cynthia From: Raghavendra Gowdappa > Sent: Monday, April 15, 2019 11:52 AM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 Cynthia, On Mon, Apr 15, 2019 at 8:10 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, I made a patch and according to my test, this glusterd stuck issue disappear with my patch. Only need to move event_handled to the end of socket_event_poll_in function. --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -2305,9 +2305,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } - if (notify_handled && (ret != -1)) - event_handled (ctx->event_pool, priv->sock, priv->idx, - priv->gen); @@ -2330,6 +2327,9 @@ socket_event_poll_in (rpc_transport_t *this, gf_boolean_t notify_handled) } pthread_mutex_unlock (&priv->notify.lock); } + if (notify_handled && (ret != -1)) + event_handled (ctx->event_pool, priv->sock, priv->idx, + priv->gen); Thanks for this tip. Though this helps in fixing the hang, this change has performance impact. Moving event_handled to end of poll_in means, socket will be added back for polling of new events only _after_ the rpc is msg is processed by higher layers (like EC) and higher layers can have significant latency for processing the msg. Which means, socket will be out of polling for longer periods of time which decreases the throughput (number of msgs read per second) affecting performance. However, this experiment definitely indicates there is a codepath where event_handled is not called (and hence causing the hang). I'll go through this codepath again. Can you check whether patch [1] fixes the issue you are seeing? [1] https://review.gluster.org/#/c/glusterfs/+/22566 Thanks for that experiment :). return ret; } cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Tuesday, April 09, 2019 3:57 PM To: 'Raghavendra Gowdappa' > Cc: gluster-devel at gluster.org Subject: RE: glusterd stuck for glusterfs with version 3.12.15 Can you figure out some possible reason why iobref is corrupted, is it possible that thread 8 has called poll in and iobref has been relased, but the lock within it is not properly released (as I can not find any free lock operation in iobref_destroy), then thread 9 called rpc_transport_pollin_destroy again, and so stuck on this lock Also, there should not be two thread handling the same socket at the same time, although there has been a patch claimed to tackle this issue. cynthia From: Raghavendra Gowdappa > Sent: Tuesday, April 09, 2019 3:52 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: gluster-devel at gluster.org Subject: Re: glusterd stuck for glusterfs with version 3.12.15 On Mon, Apr 8, 2019 at 7:42 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi glusterfs experts, Good day! In my test env, sometimes glusterd stuck issue happened, and it is not responding to any gluster commands, when I checked this issue I find that glusterd thread 9 and thread 8 is dealing with the same socket, I thought following patch should be able to solve this issue, however after I merged this patch this issue still exist. When I looked into this code, it seems socket_event_poll_in called event_handled before rpc_transport_pollin_destroy, I think this gives the chance for another poll for the exactly the same socket. And caused this glusterd stuck issue, also, I find there is no LOCK_DESTROY(&iobref->lock) In iobref_destroy, I think it is better to add destroy lock. 
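A minimal sketch of that "add destroy lock" suggestion, using hypothetical names rather than the real iobuf/iobref API: the mutex is destroyed on the destroy path, so a freed object never leaves an initialised lock behind. Note that this alone would not fix a use-after-free; if a stale pointer is unref'd after destroy, the code below would still be touching freed memory.

#include <pthread.h>
#include <stdlib.h>

struct iobref_sketch {
        pthread_mutex_t lock;
        int             ref;
};

static void
iobref_sketch_destroy (struct iobref_sketch *iobref)
{
        /* release the lock's resources before freeing the object */
        pthread_mutex_destroy (&iobref->lock);
        free (iobref);
}

static void
iobref_sketch_unref (struct iobref_sketch *iobref)
{
        int ref;

        pthread_mutex_lock (&iobref->lock);
        ref = --iobref->ref;
        pthread_mutex_unlock (&iobref->lock);

        if (ref == 0)
                iobref_sketch_destroy (iobref);
}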
Following is the gdb info when this issue happened, I would like to know your opinion on this issue, thanks! SHA-1: f747d55a7fd364e2b9a74fe40360ab3cb7b11537 * socket: fix issue on concurrent handle of a socket GDB INFO: Thread 8 is blocked on pthread_cond_wait, and thread 9 is blocked in iobref_unref, I think Thread 9 (Thread 0x7f9edf7fe700 (LWP 1933)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 Thread 8 (Thread 0x7f9edffff700 (LWP 1932)): #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fd2b42 in __pthread_mutex_cond_lock () from /lib64/libpthread.so.0 #2 0x00007f9ee9fd44a8 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #3 0x00007f9ee4fbadab in socket_event_poll_err (this=0x7f9ed0049cc0, gen=4, idx=27) at socket.c:1201 #4 0x00007f9ee4fbf99c in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2480 #5 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edfffee84) at event-epoll.c:583 #6 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180cf20) at event-epoll.c:659 #7 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #8 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) thread 9 [Switching to thread 9 (Thread 0x7f9edf7fe700 (LWP 1933))] #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 (gdb) bt #0 0x00007f9ee9fd785c in __lll_lock_wait () from /lib64/libpthread.so.0 #1 0x00007f9ee9fda657 in __lll_lock_elision () from /lib64/libpthread.so.0 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 #3 0x00007f9eeafd2f29 in rpc_transport_pollin_destroy (pollin=0x7f9ed00452d0) at rpc-transport.c:123 #4 0x00007f9ee4fbf319 in socket_event_poll_in (this=0x7f9ed0049cc0, notify_handled=_gf_true) at socket.c:2322 #5 0x00007f9ee4fbf932 in socket_event_handler (fd=36, idx=27, gen=4, data=0x7f9ed0049cc0, poll_in=1, poll_out=0, poll_err=0) at socket.c:2471 #6 0x00007f9eeb2825d4 in event_dispatch_epoll_handler (event_pool=0x17feb00, event=0x7f9edf7fde84) at event-epoll.c:583 #7 0x00007f9eeb2828ab in event_dispatch_epoll_worker (data=0x180d0c0) at event-epoll.c:659 #8 0x00007f9ee9fce5da in start_thread () from /lib64/libpthread.so.0 #9 0x00007f9ee98a4eaf in clone () from /lib64/libc.so.6 (gdb) frame 2 #2 0x00007f9eeb24cae6 in iobref_unref (iobref=0x7f9ed00063b0) at iobuf.c:944 944 iobuf.c: No such file or directory. 
(gdb) print *iobref $1 = {lock = {spinlock = 2, mutex = {__data = {__lock = 2, __count = 222, __owner = -2120437760, __nusers = 1, __kind = 8960, __spins = 512, __elision = 0, __list = {__prev = 0x4000, __next = 0x7f9ed00063b000}}, __size = "\002\000\000\000\336\000\000\000\000\260\234\201\001\000\000\000\000#\000\000\000\002\000\000\000@\000\000\000\000\000\000\000\260c\000?\177", __align = 953482739714}}, ref = -256, iobrefs = 0xffffffffffffffff, alloced = -1, used = -1} looks like the iobref is corrupted here. It seems to be a use-after-free issue. We need to dig into why a freed iobref is being accessed here. (gdb) quit A debugging session is active. -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Sun Apr 28 13:13:22 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Sun, 28 Apr 2019 18:43:22 +0530 Subject: [Gluster-devel] Weekly Untriaged Bugs In-Reply-To: <307671157.25.1555897502768.JavaMail.jenkins@jenkins-el7.rht.gluster.org> References: <307671157.25.1555897502768.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: While I understand this report captured bugs filed since last 1 week and do not have ?Triaged? keyword, does it make better sense to exclude bugs which aren?t in NEW state? I believe the intention of this report is to check what all bugs haven?t been looked at by maintainers/developers yet. BZs which are already fixed or in ASSIGNED/POST state need not to feature in this list is what I believe as otherwise it gives a false impression that too many bugs are getting unnoticed which isn?t the reality. Thoughts? On Mon, 22 Apr 2019 at 07:15, wrote: > [...truncated 6 lines...] > https://bugzilla.redhat.com/1699023 / core: Brick is not able to detach > successfully in brick_mux environment > https://bugzilla.redhat.com/1695416 / core: client log flooding with > intentional socket shutdown message when a brick is down > https://bugzilla.redhat.com/1695480 / core: Global Thread Pool > https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down > directory listing > https://bugzilla.redhat.com/1700295 / core: The data couldn't be flushed > immediately even with O_SYNC in glfs_create or with > glfs_fsync/glfs_fdatasync after glfs_write. > https://bugzilla.redhat.com/1698861 / disperse: Renaming a directory when > 2 bricks of multiple disperse subvols are down leaves both old and new dirs > on the bricks. > https://bugzilla.redhat.com/1697293 / distribute: DHT: print hash and > layout values in hexadecimal format in the logs > https://bugzilla.redhat.com/1701039 / distribute: gluster replica 3 > arbiter Unfortunately data not distributed equally > https://bugzilla.redhat.com/1697971 / fuse: Segfault in FUSE process, > potential use after free > https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job > 'heketi-storage-copy-job' to complete on one-node k3s deployment. 
> https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs > processes keeps increasing, using all available resources > https://bugzilla.redhat.com/1692349 / project-infrastructure: > gluster-csi-containers job is failing > https://bugzilla.redhat.com/1698716 / project-infrastructure: Regression > job did not vote for https://review.gluster.org/#/c/glusterfs/+/22366/ > https://bugzilla.redhat.com/1698694 / project-infrastructure: regression > job isn't voting back to gerrit > https://bugzilla.redhat.com/1699712 / project-infrastructure: regression > job is voting Success even in case of failure > https://bugzilla.redhat.com/1693385 / project-infrastructure: request to > change the version of fedora in fedora-smoke-job > https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke fails > with "Build root is locked by another process" > https://bugzilla.redhat.com/1693184 / replicate: A brick > process(glusterfsd) died with 'memory violation' > https://bugzilla.redhat.com/1698566 / selfheal: shd crashed while > executing ./tests/bugs/core/bug-1432542-mpx-restart-crash.t in CI > https://bugzilla.redhat.com/1699309 / snapshot: Gluster snapshot fails > with systemd autmounted bricks > https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from > /tests/bugs/ module failing on Intel > https://bugzilla.redhat.com/1697812 / website: mention a pointer to all > the mailing lists available under glusterfs project( > https://www.gluster.org/community/) > [...truncated 2 lines...]_______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenkins at build.gluster.org Mon Apr 29 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 29 Apr 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <468170951.44.1556502302538.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1699023 / core: Brick is not able to detach successfully in brick_mux environment https://bugzilla.redhat.com/1702316 / core: Cannot upgrade 5.x volume to 6.1 because of unused 'crypt' and 'bd' xlators https://bugzilla.redhat.com/1695416 / core: client log flooding with intentional socket shutdown message when a brick is down https://bugzilla.redhat.com/1695480 / core: Global Thread Pool https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down directory listing https://bugzilla.redhat.com/1700295 / core: The data couldn't be flushed immediately even with O_SYNC in glfs_create or with glfs_fsync/glfs_fdatasync after glfs_write. https://bugzilla.redhat.com/1698861 / disperse: Renaming a directory when 2 bricks of multiple disperse subvols are down leaves both old and new dirs on the bricks. https://bugzilla.redhat.com/1697293 / distribute: DHT: print hash and layout values in hexadecimal format in the logs https://bugzilla.redhat.com/1703322 / doc: Need to document about fips-mode-rchecksum in gluster-7 release notes. 
https://bugzilla.redhat.com/1702043 / fuse: Newly created files are inaccessible via FUSE https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs processes keeps increasing, using all available resources https://bugzilla.redhat.com/1703007 / glusterd: The telnet or something would cause high memory usage for glusterd & glusterfsd https://bugzilla.redhat.com/1703433 / project-infrastructure: gluster-block: setup GCOV & LCOV job https://bugzilla.redhat.com/1703435 / project-infrastructure: gluster-block: Upstream Jenkins job which get triggered at PR level https://bugzilla.redhat.com/1703329 / project-infrastructure: [gluster-infra]: Please create repo for plus one scale work https://bugzilla.redhat.com/1698694 / project-infrastructure: regression job isn't voting back to gerrit https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke fails with "Build root is locked by another process" https://bugzilla.redhat.com/1698566 / selfheal: shd crashed while executing ./tests/bugs/core/bug-1432542-mpx-restart-crash.t in CI https://bugzilla.redhat.com/1699309 / snapshot: Gluster snapshot fails with systemd autmounted bricks https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from /tests/bugs/ module failing on Intel https://bugzilla.redhat.com/1702289 / tiering: Promotion failed for a0afd3e3-0109-49b7-9b74-ba77bf653aba.11229 https://bugzilla.redhat.com/1697812 / website: mention a pointer to all the mailing lists available under glusterfs project(https://www.gluster.org/community/) [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 2993 bytes Desc: not available URL: From pkarampu at redhat.com Mon Apr 29 05:15:48 2019 From: pkarampu at redhat.com (Pranith Kumar Karampuri) Date: Mon, 29 Apr 2019 10:45:48 +0530 Subject: [Gluster-devel] questions on callstubs and "link-count" in index translator In-Reply-To: References: Message-ID: On Fri, Apr 26, 2019 at 10:55 PM Junsong Li wrote: > Hello list, > > > > I have a couple of questions on index translator implementation. > > - Why does gluster need callstub and a different worker queue (and > thread) to process those call stubs? Is it just to lower the priority of > fops of internal inodes? > > It is to make sure parallel [f]xattrops on same inode doesn't happen. Index xlator has to maintain indices in the internal directories based on whether or not the xattrs it tracks on the file are non-zero. When we allow parallel [f]xattrops, callbacks of the fops may not reach index xlator in the order they are wound to lower xlators. This can lead to index xlator not maintain indices correctly. > > - > - What?s the purpose of ?link-count? in xdata? It?s being used only in > index_fstat and index_lookup. I see sometimes the key is assigned 0/1 after > callback, and sometimes AFR uses it to store flag GF_XATTROP_INDEX_COUNT. > Is the code purposely reusing the key? > > Thanks, > > Junsong > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Pranith -------------- next part -------------- An HTML attachment was scrubbed... 
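A minimal sketch of the ordering guarantee described above: a single worker thread drains a FIFO of queued "stubs", so the deferred operations run strictly in the order they were queued, and two xattrops on the same inode can never be processed in parallel. This is an illustrative pthread queue, not the index xlator's actual call-stub code; all names here are assumptions.

#include <pthread.h>
#include <stdlib.h>

struct stub {
        void (*fn) (void *);    /* deferred operation */
        void *arg;              /* its argument       */
        struct stub *next;
};

static struct stub     *stub_head, *stub_tail;
static pthread_mutex_t  stub_lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t   stub_cond = PTHREAD_COND_INITIALIZER;

/* called from fop/callback context: defer the work instead of doing it */
static void
stub_enqueue (void (*fn) (void *), void *arg)
{
        struct stub *s = calloc (1, sizeof (*s));

        if (!s)
                return;
        s->fn  = fn;
        s->arg = arg;

        pthread_mutex_lock (&stub_lock);
        if (stub_tail)
                stub_tail->next = s;
        else
                stub_head = s;
        stub_tail = s;
        pthread_cond_signal (&stub_cond);
        pthread_mutex_unlock (&stub_lock);
}

/* single consumer: with only one worker, stubs are serialised */
static void *
stub_worker (void *unused)
{
        (void) unused;

        for (;;) {
                struct stub *s;

                pthread_mutex_lock (&stub_lock);
                while (!stub_head)
                        pthread_cond_wait (&stub_cond, &stub_lock);
                s = stub_head;
                stub_head = s->next;
                if (!stub_head)
                        stub_tail = NULL;
                pthread_mutex_unlock (&stub_lock);

                s->fn (s->arg);         /* runs strictly in queue order */
                free (s);
        }

        return NULL;
}

The worker would be started once with pthread_create(&tid, NULL, stub_worker, NULL); keeping a lone consumer is what provides the ordering, at the cost of the background queue becoming a serial bottleneck.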
URL: From dkhandel at redhat.com Mon Apr 29 05:22:26 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Mon, 29 Apr 2019 10:52:26 +0530 Subject: [Gluster-devel] [Gluster-infra] Weekly Untriaged Bugs In-Reply-To: References: <307671157.25.1555897502768.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: This list also captures the BZs which are in NEW state. The search description goes like this: - *Status:* NEW - *Product:* GlusterFS - *Changed:* (is greater than or equal to) -4w - *Creation date:* (changed after) -4w - *Keywords:* (does not contain the string) Triaged On Sun, Apr 28, 2019 at 6:44 PM Atin Mukherjee wrote: > While I understand this report captured bugs filed since last 1 week and > do not have ?Triaged? keyword, does it make better sense to exclude bugs > which aren?t in NEW state? > > I believe the intention of this report is to check what all bugs haven?t > been looked at by maintainers/developers yet. BZs which are already fixed > or in ASSIGNED/POST state need not to feature in this list is what I > believe as otherwise it gives a false impression that too many bugs are > getting unnoticed which isn?t the reality. Thoughts? > > On Mon, 22 Apr 2019 at 07:15, wrote: > >> [...truncated 6 lines...] >> https://bugzilla.redhat.com/1699023 / core: Brick is not able to detach >> successfully in brick_mux environment >> https://bugzilla.redhat.com/1695416 / core: client log flooding with >> intentional socket shutdown message when a brick is down >> https://bugzilla.redhat.com/1695480 / core: Global Thread Pool >> https://bugzilla.redhat.com/1694943 / core: parallel-readdir slows down >> directory listing >> https://bugzilla.redhat.com/1700295 / core: The data couldn't be flushed >> immediately even with O_SYNC in glfs_create or with >> glfs_fsync/glfs_fdatasync after glfs_write. >> https://bugzilla.redhat.com/1698861 / disperse: Renaming a directory >> when 2 bricks of multiple disperse subvols are down leaves both old and new >> dirs on the bricks. >> https://bugzilla.redhat.com/1697293 / distribute: DHT: print hash and >> layout values in hexadecimal format in the logs >> https://bugzilla.redhat.com/1701039 / distribute: gluster replica 3 >> arbiter Unfortunately data not distributed equally >> https://bugzilla.redhat.com/1697971 / fuse: Segfault in FUSE process, >> potential use after free >> https://bugzilla.redhat.com/1694139 / glusterd: Error waiting for job >> 'heketi-storage-copy-job' to complete on one-node k3s deployment. 
>> https://bugzilla.redhat.com/1695099 / glusterd: The number of glusterfs >> processes keeps increasing, using all available resources >> https://bugzilla.redhat.com/1692349 / project-infrastructure: >> gluster-csi-containers job is failing >> https://bugzilla.redhat.com/1698716 / project-infrastructure: Regression >> job did not vote for https://review.gluster.org/#/c/glusterfs/+/22366/ >> https://bugzilla.redhat.com/1698694 / project-infrastructure: regression >> job isn't voting back to gerrit >> https://bugzilla.redhat.com/1699712 / project-infrastructure: regression >> job is voting Success even in case of failure >> https://bugzilla.redhat.com/1693385 / project-infrastructure: request to >> change the version of fedora in fedora-smoke-job >> https://bugzilla.redhat.com/1695484 / project-infrastructure: smoke >> fails with "Build root is locked by another process" >> https://bugzilla.redhat.com/1693184 / replicate: A brick >> process(glusterfsd) died with 'memory violation' >> https://bugzilla.redhat.com/1698566 / selfheal: shd crashed while >> executing ./tests/bugs/core/bug-1432542-mpx-restart-crash.t in CI >> https://bugzilla.redhat.com/1699309 / snapshot: Gluster snapshot fails >> with systemd autmounted bricks >> https://bugzilla.redhat.com/1696633 / tests: GlusterFs v4.1.5 Tests from >> /tests/bugs/ module failing on Intel >> https://bugzilla.redhat.com/1697812 / website: mention a pointer to all >> the mailing lists available under glusterfs project( >> https://www.gluster.org/community/) >> [...truncated 2 lines...]_______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > -- > - Atin (atinm) > _______________________________________________ > Gluster-infra mailing list > Gluster-infra at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-infra -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Mon Apr 29 09:20:54 2019 From: mscherer at redhat.com (Michael Scherer) Date: Mon, 29 Apr 2019 11:20:54 +0200 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: <9ab07686a7beb07344fe39f76023dee4743619c8.camel@redhat.com> Le vendredi 26 avril 2019 ? 18:50 +0530, Amar Tumballi Suryanarayan a ?crit : > On Fri, Apr 26, 2019 at 6:27 PM Kaleb Keithley > wrote: > > > > > > > On Fri, Apr 26, 2019 at 8:21 AM Harold Miller > > wrote: > > > > > Has Red Hat security cleared the Slack systems for confidential / > > > customer information? > > > > > > If not, it will make it difficult for support to collect/answer > > > questions. > > > > > > > I'm pretty sure Amar meant as a replacement for the freenode > > #gluster and > > #gluster-dev channels, given that he sent this to the public > > gluster > > mailing lists @gluster.org. Nobody should have even been posting > > confidential and/or customer information to any of those lists or > > channels. > > And AFAIK nobody ever has. > > > > > > Yep, I am only talking about IRC (from freenode, #gluster, #gluster- > dev etc). Also, I am not saying we are 'replacing IRC'. Gluster as a > project started in pre-Slack era, and we have many users who prefer > to stay in IRC. > So, for now, no pressure to make a statement calling Slack channel as > a 'Replacement' to IRC. > > > > Amar, would you like to clarify which IRC channels you meant? > > > > > > Thanks Kaleb. 
I was bit confused on why the concern of it came up in > this group. Well, unless people start to be on both irc and slack and everything, that's fragmentation. Also, since people can't access old logs (per design with the free plan of slack), but they are still here on slack servers, how is it going to work from a GDPR point of view ? Shouldn't it requires a update to the privacy policy ? -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From aspandey at redhat.com Mon Apr 29 10:43:59 2019 From: aspandey at redhat.com (aspandey at redhat.com) Date: Mon, 29 Apr 2019 10:43:59 +0000 Subject: [Gluster-devel] Invitation: Gluster Community Meeting (APAC friendly hours) @ Tue Apr 30, 2019 11:30am - 12:30pm (IST) (gluster-devel@gluster.org) Message-ID: <0000000000004638d70587a8f638@google.com> You have been invited to the following event. Title: Gluster Community Meeting (APAC friendly hours) Bridge: https://bluejeans.com/836554017 Meeting minutes: https://hackmd.io/OqZbh7gfQe6uvVUXUVKJ5g?both Previous Meeting notes: http://github.com/gluster/community When: Tue Apr 30, 2019 11:30am ? 12:30pm India Standard Time - Kolkata Where: https://bluejeans.com/836554017 Calendar: gluster-devel at gluster.org Who: * aspandey at redhat.com - organizer * gluster-users at gluster.org * maintainers at gluster.org * gluster-devel at gluster.org Event details: https://www.google.com/calendar/event?action=VIEW&eid=N2NpMWp1YjRkbmZoYjhxNWMyZ2ZxdTB1dmUgZ2x1c3Rlci1kZXZlbEBnbHVzdGVyLm9yZw&tok=MTkjYXNwYW5kZXlAcmVkaGF0LmNvbWYzZDUzMmQ0MzQ5NmYxMDAwZThjNTk3ZmY3ZTY4M2Y1YTI3NzVlZDI&ctz=Asia%2FKolkata&hl=en&es=0 Invitation from Google Calendar: https://www.google.com/calendar/ You are receiving this courtesy email at the account gluster-devel at gluster.org because you are an attendee of this event. To stop receiving future updates for this event, decline this event. Alternatively you can sign up for a Google account at https://www.google.com/calendar/ and control your notification settings for your entire calendar. Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP. Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 1880 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 1921 bytes Desc: not available URL: From moagrawa at redhat.com Mon Apr 29 14:03:47 2019 From: moagrawa at redhat.com (Mohit Agrawal) Date: Mon, 29 Apr 2019 19:33:47 +0530 Subject: [Gluster-devel] Query regarding dictionary logic Message-ID: Hi All, I was just looking at the code of dict, I have one query current dictionary logic. I am not able to understand why we use hash_size is 1 for a dictionary.IMO with the hash_size of 1 dictionary always work like a list, not a hash, for every lookup in dictionary complexity is O(n). Before optimizing the code I just want to know what was the exact reason to define hash_size is 1? Please share your view on the same. 
Thanks, Mohit Agrawal -------------- next part -------------- An HTML attachment was scrubbed... URL: From vbellur at redhat.com Tue Apr 30 06:14:35 2019 From: vbellur at redhat.com (Vijay Bellur) Date: Mon, 29 Apr 2019 23:14:35 -0700 Subject: [Gluster-devel] Query regarding dictionary logic In-Reply-To: References: Message-ID: Hi Mohit, On Mon, Apr 29, 2019 at 7:15 AM Mohit Agrawal wrote: > Hi All, > > I was just looking at the code of dict, I have one query current > dictionary logic. > I am not able to understand why we use hash_size is 1 for a > dictionary.IMO with the > hash_size of 1 dictionary always work like a list, not a hash, for every > lookup > in dictionary complexity is O(n). > > Before optimizing the code I just want to know what was the exact reason > to define > hash_size is 1? > This is a good question. I looked up the source in gluster's historic repo [1] and hash_size is 1 even there. So, this could have been the case since the first version of the dictionary code. Would you be able to run some tests with a larger hash_size and share your observations? Thanks, Vijay [1] https://github.com/gluster/historic/blob/master/libglusterfs/src/dict.c > > Please share your view on the same. > > Thanks, > Mohit Agrawal > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: From moagrawa at redhat.com Tue Apr 30 06:31:13 2019 From: moagrawa at redhat.com (Mohit Agrawal) Date: Tue, 30 Apr 2019 12:01:13 +0530 Subject: [Gluster-devel] Query regarding dictionary logic In-Reply-To: References: Message-ID: sure Vijay, I will try and update. Regards, Mohit Agrawal On Tue, Apr 30, 2019 at 11:44 AM Vijay Bellur wrote: > Hi Mohit, > > On Mon, Apr 29, 2019 at 7:15 AM Mohit Agrawal wrote: > >> Hi All, >> >> I was just looking at the code of dict, I have one query current >> dictionary logic. >> I am not able to understand why we use hash_size is 1 for a >> dictionary.IMO with the >> hash_size of 1 dictionary always work like a list, not a hash, for >> every lookup >> in dictionary complexity is O(n). >> >> Before optimizing the code I just want to know what was the exact >> reason to define >> hash_size is 1? >> > > This is a good question. I looked up the source in gluster's historic repo > [1] and hash_size is 1 even there. So, this could have been the case since > the first version of the dictionary code. > > Would you be able to run some tests with a larger hash_size and share your > observations? > > Thanks, > Vijay > > [1] > https://github.com/gluster/historic/blob/master/libglusterfs/src/dict.c > > > >> >> Please share your view on the same. >> >> Thanks, >> Mohit Agrawal >> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jthottan at redhat.com Tue Apr 30 07:20:11 2019 From: jthottan at redhat.com (Jiffin Tony Thottan) Date: Tue, 30 Apr 2019 12:50:11 +0530 Subject: [Gluster-devel] Proposing to previous ganesha HA cluster solution back to gluster code as gluster-7 feature Message-ID: Hi all, Some of you folks may be familiar with HA solution provided for nfs-ganesha by gluster using pacemaker and corosync. 
That feature was removed in glusterfs 3.10 in favour for common HA project "Storhaug". Even Storhaug was not progressed much from last two years and current development is in halt state, hence planning to restore old HA ganesha solution back to gluster code repository with some improvement and targetting for next gluster release 7. I have opened up an issue [1] with details and posted initial set of patches [2] Please share your thoughts on the same Regards, Jiffin [1]https://github.com/gluster/glusterfs/issues/663 [2] https://review.gluster.org/#/q/topic:rfc-663+(status:open+OR+status:merged) -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Tue Apr 30 08:52:42 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Tue, 30 Apr 2019 14:22:42 +0530 Subject: [Gluster-devel] Query regarding dictionary logic In-Reply-To: References: Message-ID: Shreyas/Kevin tried to address it some time back using https://bugzilla.redhat.com/show_bug.cgi?id=1428049 ( https://review.gluster.org/16830) I vaguely remember the reason to keep the hash value 1 was done during the time when we had dictionary itself sent as on wire protocol, and in most other places, number of entries in dictionary was on an avg, 3. So, we felt, saving on a bit of memory for optimization was better at that time. -Amar On Tue, Apr 30, 2019 at 12:02 PM Mohit Agrawal wrote: > sure Vijay, I will try and update. > > Regards, > Mohit Agrawal > > On Tue, Apr 30, 2019 at 11:44 AM Vijay Bellur wrote: > >> Hi Mohit, >> >> On Mon, Apr 29, 2019 at 7:15 AM Mohit Agrawal >> wrote: >> >>> Hi All, >>> >>> I was just looking at the code of dict, I have one query current >>> dictionary logic. >>> I am not able to understand why we use hash_size is 1 for a >>> dictionary.IMO with the >>> hash_size of 1 dictionary always work like a list, not a hash, for >>> every lookup >>> in dictionary complexity is O(n). >>> >>> Before optimizing the code I just want to know what was the exact >>> reason to define >>> hash_size is 1? >>> >> >> This is a good question. I looked up the source in gluster's historic >> repo [1] and hash_size is 1 even there. So, this could have been the case >> since the first version of the dictionary code. >> >> Would you be able to run some tests with a larger hash_size and share >> your observations? >> >> Thanks, >> Vijay >> >> [1] >> https://github.com/gluster/historic/blob/master/libglusterfs/src/dict.c >> >> >> >>> >>> Please share your view on the same. >>> >>> Thanks, >>> Mohit Agrawal >>> _______________________________________________ >>> Gluster-devel mailing list >>> Gluster-devel at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From moagrawa at redhat.com Tue Apr 30 08:59:06 2019 From: moagrawa at redhat.com (Mohit Agrawal) Date: Tue, 30 Apr 2019 14:29:06 +0530 Subject: [Gluster-devel] Query regarding dictionary logic In-Reply-To: References: Message-ID: Thanks, Amar for sharing the patch, I will test and share the result. 
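For anyone who wants to measure the effect being discussed before changing dict.c itself, a stand-alone micro-benchmark along these lines should show it. This is a toy chained hash table, not the real dict_t; the key counts, key names and the djb2 hash are arbitrary choices for the sketch.

/* Toy benchmark sketch (not the GlusterFS dict_t code): with a single
 * bucket every lookup walks the whole chain, so lookup cost grows with
 * the number of keys; with more buckets the chains stay short. */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

struct pair  { char *key; struct pair *next; };
struct table { unsigned nbuckets; struct pair **buckets; };

static unsigned djb2(const char *s)          /* any string hash will do */
{
    unsigned h = 5381;
    while (*s)
        h = h * 33 + (unsigned char)*s++;
    return h;
}

static void put(struct table *t, const char *key)
{
    unsigned b = djb2(key) % t->nbuckets;
    struct pair *p = malloc(sizeof(*p));
    p->key = strdup(key);
    p->next = t->buckets[b];
    t->buckets[b] = p;
}

static struct pair *get(struct table *t, const char *key)
{
    unsigned b = djb2(key) % t->nbuckets;
    for (struct pair *p = t->buckets[b]; p; p = p->next)
        if (strcmp(p->key, key) == 0)
            return p;
    return NULL;
}

int main(int argc, char **argv)
{
    unsigned nbuckets = (argc > 1) ? (unsigned)atoi(argv[1]) : 1;
    if (nbuckets == 0)
        nbuckets = 1;                        /* 1 mimics the current dict */

    struct table t = { nbuckets, calloc(nbuckets, sizeof(struct pair *)) };
    enum { NKEYS = 2000, NLOOKUPS = 200000 };
    char key[32];

    for (int i = 0; i < NKEYS; i++) {
        snprintf(key, sizeof(key), "key-%d", i);
        put(&t, key);
    }

    clock_t start = clock();
    for (int i = 0; i < NLOOKUPS; i++) {
        snprintf(key, sizeof(key), "key-%d", i % NKEYS);
        get(&t, key);
    }
    printf("nbuckets=%u: %.3f seconds for %d lookups over %d keys\n",
           nbuckets, (double)(clock() - start) / CLOCKS_PER_SEC,
           NLOOKUPS, NKEYS);
    return 0;
}

Running it with 1 bucket versus, say, 64 or 1024 makes the linear chain walk visible; whether that matters for dict_t depends on how many keys a typical dict actually holds, which is exactly the point above about the on-wire era average of roughly 3 entries.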
On Tue, Apr 30, 2019 at 2:23 PM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > Shreyas/Kevin tried to address it some time back using > https://bugzilla.redhat.com/show_bug.cgi?id=1428049 ( > https://review.gluster.org/16830) > > I vaguely remember the reason to keep the hash value 1 was done during the > time when we had dictionary itself sent as on wire protocol, and in most > other places, number of entries in dictionary was on an avg, 3. So, we > felt, saving on a bit of memory for optimization was better at that time. > > -Amar > > On Tue, Apr 30, 2019 at 12:02 PM Mohit Agrawal > wrote: > >> sure Vijay, I will try and update. >> >> Regards, >> Mohit Agrawal >> >> On Tue, Apr 30, 2019 at 11:44 AM Vijay Bellur wrote: >> >>> Hi Mohit, >>> >>> On Mon, Apr 29, 2019 at 7:15 AM Mohit Agrawal >>> wrote: >>> >>>> Hi All, >>>> >>>> I was just looking at the code of dict, I have one query current >>>> dictionary logic. >>>> I am not able to understand why we use hash_size is 1 for a >>>> dictionary.IMO with the >>>> hash_size of 1 dictionary always work like a list, not a hash, for >>>> every lookup >>>> in dictionary complexity is O(n). >>>> >>>> Before optimizing the code I just want to know what was the exact >>>> reason to define >>>> hash_size is 1? >>>> >>> >>> This is a good question. I looked up the source in gluster's historic >>> repo [1] and hash_size is 1 even there. So, this could have been the case >>> since the first version of the dictionary code. >>> >>> Would you be able to run some tests with a larger hash_size and share >>> your observations? >>> >>> Thanks, >>> Vijay >>> >>> [1] >>> https://github.com/gluster/historic/blob/master/libglusterfs/src/dict.c >>> >>> >>> >>>> >>>> Please share your view on the same. >>>> >>>> Thanks, >>>> Mohit Agrawal >>>> _______________________________________________ >>>> Gluster-devel mailing list >>>> Gluster-devel at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>> >>> _______________________________________________ >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > -- > Amar Tumballi (amarts) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From skoduri at redhat.com Tue Apr 30 10:45:17 2019 From: skoduri at redhat.com (Soumya Koduri) Date: Tue, 30 Apr 2019 16:15:17 +0530 Subject: [Gluster-devel] Backlog/Improvements tracking Message-ID: Hi, To track any new feature or improvements we are currently using github . I assume those issues refer to the ones which are actively being worked upon. How do we track backlogs which may not get addressed (at least in the near future)? For eg., I am planning to close couple of RFE BZs [1]..[3] which were filed to improve upcall mechanism, as there is no active development happening in those aspects. But at the same time I like to retain the list for future reference (in case any new member like to take up). Can we use github itself to track all the feature-gaps of a component in one issue (note: it may be in open state forever) or is it better to document these as limitations in the admin/developer guide & close BZ/issue? 
Thanks, Soumya [1] https://bugzilla.redhat.com/show_bug.cgi?id=1214654 [2] https://bugzilla.redhat.com/show_bug.cgi?id=1214644 [3] https://bugzilla.redhat.com/show_bug.cgi?id=1200264 From kkeithle at redhat.com Tue Apr 30 11:06:35 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Tue, 30 Apr 2019 07:06:35 -0400 Subject: [Gluster-devel] Backlog/Improvements tracking In-Reply-To: References: Message-ID: Yes, please open github issues for these RFEs and close the BZs. Thanks On Tue, Apr 30, 2019 at 6:46 AM Soumya Koduri wrote: > Hi, > > To track any new feature or improvements we are currently using github . > I assume those issues refer to the ones which are actively being worked > upon. How do we track backlogs which may not get addressed (at least in > the near future)? > > For eg., I am planning to close couple of RFE BZs [1]..[3] which were > filed to improve upcall mechanism, as there is no active development > happening in those aspects. But at the same time I like to retain the > list for future reference (in case any new member like to take up). > > Can we use github itself to track all the feature-gaps of a component in > one issue (note: it may be in open state forever) or is it better to > document these as limitations in the admin/developer guide & close > BZ/issue? > > Thanks, > Soumya > > [1] https://bugzilla.redhat.com/show_bug.cgi?id=1214654 > [2] https://bugzilla.redhat.com/show_bug.cgi?id=1214644 > [3] https://bugzilla.redhat.com/show_bug.cgi?id=1200264 > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > -------------- next part -------------- An HTML attachment was scrubbed... URL: From srakonde at redhat.com Tue Apr 30 12:37:25 2019 From: srakonde at redhat.com (Sanju Rakonde) Date: Tue, 30 Apr 2019 18:07:25 +0530 Subject: [Gluster-devel] ./tests/basic/uss.t is timing out in release-6 branch Message-ID: Hi Raghavendra, ./tests/basic/uss.t is timing out in release-6 branch consistently. One such instance is https://review.gluster.org/#/c/glusterfs/+/22641/. Can you please look into this? -- Thanks, Sanju -------------- next part -------------- An HTML attachment was scrubbed... URL: From rabhat at redhat.com Tue Apr 30 14:42:34 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Tue, 30 Apr 2019 10:42:34 -0400 Subject: [Gluster-devel] ./tests/basic/uss.t is timing out in release-6 branch In-Reply-To: References: Message-ID: The failure looks similar to the issue I had mentioned in [1] In short for some reason the cleanup (the cleanup function that we call in our .t files) seems to be taking more time and also not cleaning up properly. This leads to problems for the 2nd iteration (where basic things such as volume creation or volume start itself fails due to ENODATA or ENOENT errors). The 2nd iteration of the uss.t ran had the following errors. "[2019-04-29 09:08:15.275773]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 39 gluster --mode=script --wignore volume set patchy nfs.disable false ++++++++++ [2019-04-29 09:08:15.390550] : volume set patchy nfs.disable false : SUCCESS [2019-04-29 09:08:15.404624]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: 42 gluster --mode=script --wignore volume start patchy ++++++++++ [2019-04-29 09:08:15.468780] : volume start patchy : FAILED : Failed to get extended attribute trusted.glusterfs.volume-id for brick dir /d/backends/3/patchy_snap_mnt. 
Reason : No data available " These are the initial steps to create and start volume. Why trusted.glusterfs.volume-id extended attribute is absent is not sure. The analysis in [1] had errors of ENOENT (i.e. export directory itself was absent). I suspect this to be because of some issue with the cleanup mechanism at the end of the tests. [1] https://lists.gluster.org/pipermail/gluster-devel/2019-April/056104.html On Tue, Apr 30, 2019 at 8:37 AM Sanju Rakonde wrote: > Hi Raghavendra, > > ./tests/basic/uss.t is timing out in release-6 branch consistently. One > such instance is https://review.gluster.org/#/c/glusterfs/+/22641/. Can > you please look into this? > > -- > Thanks, > Sanju > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rabhat at redhat.com Tue Apr 30 18:16:14 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Tue, 30 Apr 2019 14:16:14 -0400 Subject: [Gluster-devel] ./tests/basic/uss.t is timing out in release-6 branch In-Reply-To: References: Message-ID: To make things relatively easy for the cleanup () function in the test framework, I think it would be better to ensure that uss.t itself deletes snapshots and the volume once the tests are done. Patch [1] has been submitted for review. [1] https://review.gluster.org/#/c/glusterfs/+/22649/ Regards, Raghavendra On Tue, Apr 30, 2019 at 10:42 AM FNU Raghavendra Manjunath < rabhat at redhat.com> wrote: > > The failure looks similar to the issue I had mentioned in [1] > > In short for some reason the cleanup (the cleanup function that we call in > our .t files) seems to be taking more time and also not cleaning up > properly. This leads to problems for the 2nd iteration (where basic things > such as volume creation or volume start itself fails due to ENODATA or > ENOENT errors). > > The 2nd iteration of the uss.t ran had the following errors. > > "[2019-04-29 09:08:15.275773]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: > 39 gluster --mode=script --wignore volume set patchy nfs.disable false > ++++++++++ > [2019-04-29 09:08:15.390550] : volume set patchy nfs.disable false : > SUCCESS > [2019-04-29 09:08:15.404624]:++++++++++ G_LOG:./tests/basic/uss.t: TEST: > 42 gluster --mode=script --wignore volume start patchy ++++++++++ > [2019-04-29 09:08:15.468780] : volume start patchy : FAILED : Failed to > get extended attribute trusted.glusterfs.volume-id for brick dir > /d/backends/3/patchy_snap_mnt. Reason : No data available > " > > These are the initial steps to create and start volume. Why > trusted.glusterfs.volume-id extended attribute is absent is not sure. The > analysis in [1] had errors of ENOENT (i.e. export directory itself was > absent). > I suspect this to be because of some issue with the cleanup mechanism at > the end of the tests. > > [1] > https://lists.gluster.org/pipermail/gluster-devel/2019-April/056104.html > > On Tue, Apr 30, 2019 at 8:37 AM Sanju Rakonde wrote: > >> Hi Raghavendra, >> >> ./tests/basic/uss.t is timing out in release-6 branch consistently. One >> such instance is https://review.gluster.org/#/c/glusterfs/+/22641/. Can >> you please look into this? >> >> -- >> Thanks, >> Sanju >> > -------------- next part -------------- An HTML attachment was scrubbed... 
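A quick way to check that theory on a builder, together with the kind of explicit teardown being proposed, is sketched below using the usual test-framework helpers. These lines are illustrative only and are not the contents of the patch at https://review.gluster.org/#/c/glusterfs/+/22649/.

# Does the brick directory still carry the xattr that 'volume start' expects?
# An ENODATA here reproduces the failure quoted above.
getfattr -n trusted.glusterfs.volume-id -e hex /d/backends/3/patchy_snap_mnt

# Explicit teardown at the end of uss.t, so cleanup() has less left to undo:
TEST $CLI snapshot delete all
TEST $CLI volume stop $V0
TEST $CLI volume delete $V0
cleanup;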
URL:

From rabhat at redhat.com  Tue Apr 30 19:58:21 2019
From: rabhat at redhat.com (FNU Raghavendra Manjunath)
Date: Tue, 30 Apr 2019 15:58:21 -0400
Subject: [Gluster-devel] inode table destruction
Message-ID:

Hi All,

There is a good chance that the inode on which an unref comes has already
been zero-refed and added to the purge list. This can happen when the inode
table is being destroyed (glfs_fini is something which destroys the inode
table).

Consider a directory 'a' which has a file 'b'. As part of inode table
destruction, zero-refing of inodes does not happen from leaf to root; it
happens in the order the inodes are present in the list. So, in this example,
the dentry of 'b' has its parent set to the inode of 'a'. If 'a' gets
zero-refed first (as part of inode table cleanup) and then 'b' has to be
zero-refed, dentry_unset is called on the dentry of 'b', which goes on to
call inode_unref on b's parent, i.e. 'a'. At this point GF_ASSERT fires,
because the refcount of 'a' has already been set to zero.

Below is a snippet of the core file generated by such an assert in one of
the regression test runs.

"
No symbol table info available.
#1  0x00007f2a539fc8f8 in abort () from /lib64/libc.so.6
No symbol table info available.
#2  0x00007f2a539f4026 in __assert_fail_base () from /lib64/libc.so.6
No symbol table info available.
#3  0x00007f2a539f40d2 in __assert_fail () from /lib64/libc.so.6
No symbol table info available.
#4  0x00007f2a553e3208 in __inode_unref (inode=0x7f2a3c05edf8, clear=false)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:483
        index = 0
        this = 0x7f2a3c03e840
        nlookup = 0
        __PRETTY_FUNCTION__ = "__inode_unref"
#5  0x00007f2a553e2745 in __dentry_unset (dentry=0x7f2a3c064e48)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:212
        No locals.
#6  0x00007f2a553e308a in __inode_retire (inode=0x7f2a3c05ebc8)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:442
        dentry = 0x7f2a3c064e48
        t = 0x7f2a3c064398
#7  0x00007f2a553e392f in __inode_ref_reduce_by_n (inode=0x7f2a3c05ebc8, nref=0)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:708
        nlookup = 0
        __PRETTY_FUNCTION__ = "__inode_ref_reduce_by_n"
#8  0x00007f2a553e61d5 in inode_table_destroy (inode_table=0x7f2a28007f90)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:1867
        trav = 0x7f2a3c05ebc8
        __FUNCTION__ = "inode_table_destroy"
#9  0x00007f2a553e600c in inode_table_destroy_all (ctx=0x7f2a3c001170)
    at /home/jenkins/root/workspace/centos7-regression/libglusterfs/src/inode.c:1791
        trav_graph = 0x7f2a240041a0
        tmp = 0x7f2a3c0013c8
        tree = 0x7f2a24022af0
        inode_table = 0x7f2a28007f90
#10 0x00007f2a46a83390 in pub_glfs_fini (fs=0x7f2a3c000ff0)
    at /home/jenkins/root/workspace/centos7-regression/api/src/glfs.c:1346
        ret = 0
        countdown = 98
        subvol = 0x7f2a24022af0
        ctx = 0x7f2a3c001170
        graph = 0x7f2a240041a0
        call_pool = 0x7f2a3c000df0
        fs_init = 1
        err = 0
        old_THIS = 0x7f2a380084c0
        __FUNCTION__ = "pub_glfs_fini"
"

IIUC the solution for it would be to add a flag in the inode table which tells whether the cleanup of the
inode table has started or not. Do not call GF_ASSRT (inode->ref) if the inode table to which the inode being unrefed belongs to, is already getting cleaned up. A patch [1] has been submitted for review with the change mentioned above. [1] https://review.gluster.org/#/c/glusterfs/+/22650/ Regards, Raghavendra -------------- next part -------------- An HTML attachment was scrubbed... URL: From scott.c.worthington at gmail.com Fri Apr 26 09:59:38 2019 From: scott.c.worthington at gmail.com (Scott Worthington) Date: Fri, 26 Apr 2019 09:59:38 -0000 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: Hello, are you not _BOTH_ Red Hat FTEs or contractors? On Fri, Apr 26, 2019, 3:16 AM Michael Scherer wrote: > Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a > ?crit : > > Hi All, > > > > We wanted to move to Slack from IRC for our official communication > > channel > > from sometime, but couldn't as we didn't had a proper URL for us to > > register. 'gluster' was taken and we didn't knew who had it > > registered. > > Thanks to constant ask from Satish, Slack team has now agreed to let > > us use > > https://gluster.slack.com and I am happy to invite you all there. > > (Use this > > link > > < > > > https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA > > > > > to > > join) > > > > Please note that, it won't be a replacement for mailing list. But can > > be > > used by all developers and users for quick communication. Also note > > that, > > no information there would be 'stored' beyond 10k lines as we are > > using the > > free version of Slack. > > Aren't we concerned about the ToS of slack ? Last time I did read them, > they were quite scary (like, if you use your corporate email, you > engage your employer, and that wasn't the worst part). > > Also, to anticipate the question, my employer Legal department told me > to not setup a bridge between IRC and slack, due to the said ToS. > > -- > Michael Scherer > Sysadmin, Community Infrastructure > > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From harold at redhat.com Fri Apr 26 12:20:33 2019 From: harold at redhat.com (Harold Miller) Date: Fri, 26 Apr 2019 12:20:33 -0000 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: Has Red Hat security cleared the Slack systems for confidential / customer information? If not, it will make it difficult for support to collect/answer questions. Harold Miller, Associate Manager, Red Hat, Enterprise Cloud Support Desk - US (650) 254-4346 On Fri, Apr 26, 2019 at 6:00 AM Scott Worthington < scott.c.worthington at gmail.com> wrote: > Hello, are you not _BOTH_ Red Hat FTEs or contractors? > > On Fri, Apr 26, 2019, 3:16 AM Michael Scherer wrote: > >> Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a >> ?crit : >> > Hi All, >> > >> > We wanted to move to Slack from IRC for our official communication >> > channel >> > from sometime, but couldn't as we didn't had a proper URL for us to >> > register. 'gluster' was taken and we didn't knew who had it >> > registered. 
>> > Thanks to constant ask from Satish, Slack team has now agreed to let >> > us use >> > https://gluster.slack.com and I am happy to invite you all there. >> > (Use this >> > link >> > < >> > >> https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA >> > > >> > to >> > join) >> > >> > Please note that, it won't be a replacement for mailing list. But can >> > be >> > used by all developers and users for quick communication. Also note >> > that, >> > no information there would be 'stored' beyond 10k lines as we are >> > using the >> > free version of Slack. >> >> Aren't we concerned about the ToS of slack ? Last time I did read them, >> they were quite scary (like, if you use your corporate email, you >> engage your employer, and that wasn't the worst part). >> >> Also, to anticipate the question, my employer Legal department told me >> to not setup a bridge between IRC and slack, due to the said ToS. >> >> -- >> Michael Scherer >> Sysadmin, Community Infrastructure >> >> >> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -- HAROLD MILLER ASSOCIATE MANAGER, ENTERPRISE CLOUD SUPPORT Red Hat Harold at RedHat.com T: (650)-254-4346 TRIED. TESTED. TRUSTED. -------------- next part -------------- An HTML attachment was scrubbed... URL: From harold at redhat.com Fri Apr 26 13:26:47 2019 From: harold at redhat.com (Harold Miller) Date: Fri, 26 Apr 2019 13:26:47 -0000 Subject: [Gluster-devel] [Gluster-users] One more way to contact Gluster team - Slack (gluster.slack.com) In-Reply-To: References: Message-ID: Amar, Thanks for the clarification. I'll go climb back into my cave. :) Harold On Fri, Apr 26, 2019 at 9:20 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > > > On Fri, Apr 26, 2019 at 6:27 PM Kaleb Keithley > wrote: > >> >> >> On Fri, Apr 26, 2019 at 8:21 AM Harold Miller wrote: >> >>> Has Red Hat security cleared the Slack systems for confidential / >>> customer information? >>> >>> If not, it will make it difficult for support to collect/answer >>> questions. >>> >> >> I'm pretty sure Amar meant as a replacement for the freenode #gluster and >> #gluster-dev channels, given that he sent this to the public gluster >> mailing lists @gluster.org. Nobody should have even been posting >> confidential and/or customer information to any of those lists or channels. >> And AFAIK nobody ever has. >> >> > Yep, I am only talking about IRC (from freenode, #gluster, #gluster-dev > etc). Also, I am not saying we are 'replacing IRC'. Gluster as a project > started in pre-Slack era, and we have many users who prefer to stay in IRC. > So, for now, no pressure to make a statement calling Slack channel as a > 'Replacement' to IRC. > > >> Amar, would you like to clarify which IRC channels you meant? >> >> > > Thanks Kaleb. I was bit confused on why the concern of it came up in this > group. > > > >> >>> On Fri, Apr 26, 2019 at 6:00 AM Scott Worthington < >>> scott.c.worthington at gmail.com> wrote: >>> >>>> Hello, are you not _BOTH_ Red Hat FTEs or contractors? >>>> >>>> > Yes! but come from very different internal teams. 
> > Michael supports Gluster (the project) team's Infrastructure needs, and > has valid concerns from his perspective :-) I, on the other hand, bother > more about code, users, and how to make sure we are up-to-date with other > technologies and communities, from the engineering view point. > > >> On Fri, Apr 26, 2019, 3:16 AM Michael Scherer >>>> wrote: >>>> >>>>> Le vendredi 26 avril 2019 ? 13:24 +0530, Amar Tumballi Suryanarayan a >>>>> ?crit : >>>>> > Hi All, >>>>> > >>>>> > We wanted to move to Slack from IRC for our official communication >>>>> > channel >>>>> > from sometime, but couldn't as we didn't had a proper URL for us to >>>>> > register. 'gluster' was taken and we didn't knew who had it >>>>> > registered. >>>>> > Thanks to constant ask from Satish, Slack team has now agreed to let >>>>> > us use >>>>> > https://gluster.slack.com and I am happy to invite you all there. >>>>> > (Use this >>>>> > link >>>>> > < >>>>> > >>>>> https://join.slack.com/t/gluster/shared_invite/enQtNjIxMTA1MTk3MDE1LWIzZWZjNzhkYWEwNDdiZWRiOTczMTc4ZjdiY2JiMTc3MDE5YmEyZTRkNzg0MWJiMWM3OGEyMDU2MmYzMTViYTA >>>>> > > >>>>> > to >>>>> > join) >>>>> > >>>>> > Please note that, it won't be a replacement for mailing list. But can >>>>> > be >>>>> > used by all developers and users for quick communication. Also note >>>>> > that, >>>>> > no information there would be 'stored' beyond 10k lines as we are >>>>> > using the >>>>> > free version of Slack. >>>>> >>>>> Aren't we concerned about the ToS of slack ? Last time I did read them, >>>>> they were quite scary (like, if you use your corporate email, you >>>>> engage your employer, and that wasn't the worst part). >>>>> >>>>> Also, to anticipate the question, my employer Legal department told me >>>>> to not setup a bridge between IRC and slack, due to the said ToS. >>>>> >>>>> > Again, re-iterating here. Not planning to use any bridges from IRC to > Slack. I re-read the Slack API Terms and condition. And it makes sense. > They surely don't want us to build another slack, or abuse slack with too > many API requests made for collecting logs. > > Currently, to start with, we are not adding any bots (other than github > bot). Hopefully, that will keep us under proper usage guidelines. > > -Amar > > >> -- >>>>> Michael Scherer >>>>> Sysadmin, Community Infrastructure >>>>> >>>>> >>>>> >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >>> >>> -- >>> >>> HAROLD MILLER >>> >>> ASSOCIATE MANAGER, ENTERPRISE CLOUD SUPPORT >>> >>> Red Hat >>> >>> >>> >>> Harold at RedHat.com T: (650)-254-4346 >>> >>> TRIED. TESTED. TRUSTED. >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> > > -- > Amar Tumballi (amarts) > -- HAROLD MILLER ASSOCIATE MANAGER, ENTERPRISE CLOUD SUPPORT Red Hat Harold at RedHat.com T: (650)-254-4346 TRIED. TESTED. TRUSTED. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jim.kinney at gmail.com Tue Apr 30 12:19:47 2019 From: jim.kinney at gmail.com (Jim Kinney) Date: Tue, 30 Apr 2019 12:19:47 -0000 Subject: [Gluster-devel] [Gluster-users] Proposing to previous ganesha HA cluster solution back to gluster code as gluster-7 feature In-Reply-To: References: Message-ID: <9BE7F129-DE42-46A5-896B-81460E605E9E@gmail.com> +1! I'm using nfs-ganesha in my next upgrade so my client systems can use NFS instead of fuse mounts. Having an integrated, designed in process to coordinate multiple nodes into an HA cluster will very welcome. On April 30, 2019 3:20:11 AM EDT, Jiffin Tony Thottan wrote: >Hi all, > >Some of you folks may be familiar with HA solution provided for >nfs-ganesha by gluster using pacemaker and corosync. > >That feature was removed in glusterfs 3.10 in favour for common HA >project "Storhaug". Even Storhaug was not progressed > >much from last two years and current development is in halt state, >hence >planning to restore old HA ganesha solution back > >to gluster code repository with some improvement and targetting for >next >gluster release 7. > >I have opened up an issue [1] with details and posted initial set of >patches [2] > >Please share your thoughts on the same > >Regards, > >Jiffin > >[1]https://github.com/gluster/glusterfs/issues/663 > > >[2] >https://review.gluster.org/#/q/topic:rfc-663+(status:open+OR+status:merged) -- Sent from my Android device with K-9 Mail. All tyopes are thumb related and reflect authenticity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Tue Apr 30 13:04:29 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Tue, 30 Apr 2019 13:04:29 -0000 Subject: [Gluster-devel] [Gluster-users] Proposing to previous ganesha HA clustersolution back to gluster code as gluster-7 feature Message-ID: Keep in mind that corosync/pacemaker is hard for proper setup by new admins/users. I'm still trying to remediate the effects of poor configuration at work. Also, storhaug is nice for hyperconverged setups where the host is not only hosting bricks, but other workloads. Corosync/pacemaker require proper fencing to be setup and most of the stonith resources 'shoot the other node in the head'. I would be happy to see an easy to deploy (let say 'cluster.enable-ha-ganesha true') and gluster to be bringing up the Floating IPs and taking care of the NFS locks, so no disruption will be felt by the clients. Still, this will be a lot of work to achieve. Best Regards, Strahil NikolovOn Apr 30, 2019 15:19, Jim Kinney wrote: > > +1! > I'm using nfs-ganesha in my next upgrade so my client systems can use NFS instead of fuse mounts. Having an integrated, designed in process to coordinate multiple nodes into an HA cluster will very welcome. > > On April 30, 2019 3:20:11 AM EDT, Jiffin Tony Thottan wrote: >> >> Hi all, >> >> Some of you folks may be familiar with HA solution provided for nfs-ganesha by gluster using pacemaker and corosync. >> >> That feature was removed in glusterfs 3.10 in favour for common HA project "Storhaug". Even Storhaug was not progressed >> >> much from last two years and current development is in halt state, hence planning to restore old HA ganesha solution back >> >> to gluster code repository with some improvement and targetting for next gluster release 7. >> >> I have opened up an issue [1] with details and posted initial set of patches [2] >> >> Please share your thoughts on the same >> >> Regards, >> >> Jiffin?? 
>> >> [1] https://github.com/gluster/glusterfs/issues/663 >> >> [2] https://review.gluster.org/#/q/topic:rfc-663+(status:open+OR+status:merged) > > > -- > Sent from my Android device with K-9 Mail. All tyopes are thumb related and reflect authenticity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Renaud.Fortier at fsaa.ulaval.ca Tue Apr 30 13:21:26 2019 From: Renaud.Fortier at fsaa.ulaval.ca (Renaud Fortier) Date: Tue, 30 Apr 2019 13:21:26 -0000 Subject: [Gluster-devel] [Gluster-users] Proposing to previous ganesha HA cluster solution back to gluster code as gluster-7 feature In-Reply-To: <9BE7F129-DE42-46A5-896B-81460E605E9E@gmail.com> References: <9BE7F129-DE42-46A5-896B-81460E605E9E@gmail.com> Message-ID: <7d75b62f0eb0495782c46ef8521790d5@ul-exc-pr-mbx13.ulaval.ca> IMO, you should keep storhaug and maintain it. At the beginning, we were with pacemaker and corosync. Then we move to storhaug with the upgrade to gluster 4.1.x. Now you are talking about going back like it was. Maybe it will be better with pacemake and corosync but the important is to have a solution that will be stable and maintained. thanks Renaud De : gluster-users-bounces at gluster.org [mailto:gluster-users-bounces at gluster.org] De la part de Jim Kinney Envoy? : 30 avril 2019 08:20 ? : gluster-users at gluster.org; Jiffin Tony Thottan ; gluster-users at gluster.org; Gluster Devel ; gluster-maintainers at gluster.org; nfs-ganesha ; devel at lists.nfs-ganesha.org Objet : Re: [Gluster-users] Proposing to previous ganesha HA cluster solution back to gluster code as gluster-7 feature +1! I'm using nfs-ganesha in my next upgrade so my client systems can use NFS instead of fuse mounts. Having an integrated, designed in process to coordinate multiple nodes into an HA cluster will very welcome. On April 30, 2019 3:20:11 AM EDT, Jiffin Tony Thottan > wrote: Hi all, Some of you folks may be familiar with HA solution provided for nfs-ganesha by gluster using pacemaker and corosync. That feature was removed in glusterfs 3.10 in favour for common HA project "Storhaug". Even Storhaug was not progressed much from last two years and current development is in halt state, hence planning to restore old HA ganesha solution back to gluster code repository with some improvement and targetting for next gluster release 7. I have opened up an issue [1] with details and posted initial set of patches [2] Please share your thoughts on the same Regards, Jiffin [1] https://github.com/gluster/glusterfs/issues/663 [2] https://review.gluster.org/#/q/topic:rfc-663+(status:open+OR+status:merged) -- Sent from my Android device with K-9 Mail. All tyopes are thumb related and reflect authenticity. -------------- next part -------------- An HTML attachment was scrubbed... URL: From hunter86_bg at yahoo.com Tue Apr 30 13:29:56 2019 From: hunter86_bg at yahoo.com (Strahil Nikolov) Date: Tue, 30 Apr 2019 13:29:56 -0000 Subject: [Gluster-devel] [Gluster-users] Proposing to previous ganesha HA clustersolution back to gluster code as gluster-7 feature In-Reply-To: References: Message-ID: <1028413072.2343069.1556630991785@mail.yahoo.com> Hi, I'm posting this again as it got bounced. Keep in mind that corosync/pacemaker? is hard for proper setup by new admins/users. I'm still trying to remediate the effects of poor configuration at work. Also, storhaug is nice for hyperconverged setups where the host is not only hosting bricks, but? other? workloads. 
Corosync/pacemaker require proper fencing to be setup and most of the stonith
resources 'shoot the other node in the head'.

I would be happy to see an easy to deploy (let say 'cluster.enable-ha-ganesha
true') and gluster to be bringing up the Floating IPs and taking care of the
NFS locks, so no disruption will be felt by the clients.

Still, this will be a lot of work to achieve.

Best Regards,
Strahil Nikolov

On Apr 30, 2019 15:19, Jim Kinney wrote:
>
> +1!
> I'm using nfs-ganesha in my next upgrade so my client systems can use NFS
> instead of fuse mounts. Having an integrated, designed in process to
> coordinate multiple nodes into an HA cluster will very welcome.
>
> On April 30, 2019 3:20:11 AM EDT, Jiffin Tony Thottan wrote:
>>
>> Hi all,
>>
>> Some of you folks may be familiar with HA solution provided for
>> nfs-ganesha by gluster using pacemaker and corosync.
>>
>> That feature was removed in glusterfs 3.10 in favour for common HA
>> project "Storhaug". Even Storhaug was not progressed
>> much from last two years and current development is in halt state, hence
>> planning to restore old HA ganesha solution back
>> to gluster code repository with some improvement and targetting for next
>> gluster release 7.
>>
>> I have opened up an issue [1] with details and posted initial set of
>> patches [2]
>>
>> Please share your thoughts on the same
>>
>> Regards,
>>
>> Jiffin
>>
>> [1] https://github.com/gluster/glusterfs/issues/663
>>
>> [2] https://review.gluster.org/#/q/topic:rfc-663+(status:open+OR+status:merged)
>
> --
> Sent from my Android device with K-9 Mail. All tyopes are thumb related
> and reflect authenticity.
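For anyone who never ran the pre-3.10 solution being discussed in this thread, the old workflow was a small config file plus a couple of CLI calls, roughly as sketched below. The field and option names are recalled from the 3.7-era documentation and are only illustrative (the volume name myvol is a placeholder); the restored implementation Jiffin is proposing may well differ.

# /etc/ganesha/ganesha-ha.conf on every participating node (pacemaker,
# corosync and pcs already installed and the nodes authenticated with pcs)
HA_NAME="ganesha-ha-demo"
HA_CLUSTER_NODES="server1,server2,server3"
VIP_server1="192.0.2.11"
VIP_server2="192.0.2.12"
VIP_server3="192.0.2.13"

# shared storage for the ganesha configuration, then bring the cluster up
gluster volume set all cluster.enable-shared-storage enable
gluster nfs-ganesha enable

# export an individual volume through nfs-ganesha
gluster volume set myvol ganesha.enable on

In that old flow, the floating IPs and the NFS grace/lock handling Strahil mentions were set up by the pacemaker resources that the enable step configured, so the fencing and resource-agent details were largely hidden from the admin.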