[Bugs] [Bug 1443501] New: Don' t wind post-op on a brick where the fop phase failed.
bugzilla at redhat.com
bugzilla at redhat.com
Wed Apr 19 11:30:41 UTC 2017
https://bugzilla.redhat.com/show_bug.cgi?id=1443501
Bug ID: 1443501
Summary: Don't wind post-op on a brick where the fop phase
failed.
Product: GlusterFS
Version: 3.10
Component: replicate
Keywords: Triaged
Assignee: bugs at gluster.org
Reporter: ravishankar at redhat.com
CC: bugs at gluster.org
Depends On: 1438255
Blocks: 1394118, 1443319
+++ This bug was initially created as a clone of Bug #1438255 +++
Problem:
In afr-v2, self-blaming xattrs are not there by design. But if the FOP
failed on a brick due to an error other than ENOTCONN (or even due to
ENOTCONN, but we regained connection before postop was wound), we wind
the post-op also on the failed brick, leading to setting self-blaming
xattrs on that brick. This can lead to undesired results like healing of
files in split-brain etc.
Fix:
If a fop failed on a brick on which pre-op was successful, do not
perform post-op on it. This also produces the desired effect of not
resetting the dirty xattr on the brick, which is how it should be
because if the fop failed on a brick, there is no reason to clear the
dirty bit which actually serves as an indication of the failure.
--- Additional comment from Worker Ant on 2017-04-02 09:12:51 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#1) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-05 00:49:26 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#2) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-10 07:37:37 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#3) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-12 12:57:36 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#4) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-14 06:38:17 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#5) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-17 01:57:00 EDT ---
REVIEW: https://review.gluster.org/16976 (afr: don't do a post-op on a brick if
op failed) posted (#6) for review on master by Ravishankar N
(ravishankar at redhat.com)
--- Additional comment from Worker Ant on 2017-04-18 22:29:33 EDT ---
COMMIT: https://review.gluster.org/16976 committed in master by Pranith Kumar
Karampuri (pkarampu at redhat.com)
------
commit 10dad995c989e9d77c341135d7c48817baba966c
Author: Ravishankar N <ravishankar at redhat.com>
Date: Sun Apr 2 18:08:04 2017 +0530
afr: don't do a post-op on a brick if op failed
Problem:
In afr-v2, self-blaming xattrs are not there by design. But if the FOP
failed on a brick due to an error other than ENOTCONN (or even due to
ENOTCONN, but we regained connection before postop was wound), we wind
the post-op also on the failed brick, leading to setting self-blaming
xattrs on that brick. This can lead to undesired results like healing of
files in split-brain etc.
Fix:
If a fop failed on a brick on which pre-op was successful, do not
perform post-op on it. This also produces the desired effect of not
resetting the dirty xattr on the brick, which is how it should be
because if the fop failed on a brick, there is no reason to clear the
dirty bit which actually serves as an indication of the failure.
Change-Id: I5f1caf4d1b39f36cf8093ccef940118638caa9c4
BUG: 1438255
Signed-off-by: Ravishankar N <ravishankar at redhat.com>
Reviewed-on: https://review.gluster.org/16976
Smoke: Gluster Build System <jenkins at build.gluster.org>
NetBSD-regression: NetBSD Build System <jenkins at build.gluster.org>
CentOS-regression: Gluster Build System <jenkins at build.gluster.org>
Reviewed-by: Pranith Kumar Karampuri <pkarampu at redhat.com>
Referenced Bugs:
https://bugzilla.redhat.com/show_bug.cgi?id=1438255
[Bug 1438255] Don't wind post-op on a brick where the fop phase failed.
https://bugzilla.redhat.com/show_bug.cgi?id=1443319
[Bug 1443319] Don't wind post-op on a brick where the fop phase failed.
--
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.
More information about the Bugs
mailing list