[Bugs] [Bug 1229250] New: Data Tiering:Old copy of file still remaining on EC(disperse) layer, when edited after attaching tier(new copy is moved to hot tier)
bugzilla at redhat.com
bugzilla at redhat.com
Mon Jun 8 10:31:23 UTC 2015
https://bugzilla.redhat.com/show_bug.cgi?id=1229250
Bug ID: 1229250
Summary: Data Tiering:Old copy of file still remaining on
EC(disperse) layer, when edited after attaching
tier(new copy is moved to hot tier)
Product: Red Hat Gluster Storage
Version: 3.1
Component: glusterfs
Sub Component: tiering
Keywords: Triaged
Severity: urgent
Priority: urgent
Assignee: rhs-bugs at redhat.com
Reporter: nchilaka at redhat.com
QA Contact: nchilaka at redhat.com
CC: bugs at gluster.org, dlambrig at redhat.com,
josferna at redhat.com, vagarwal at redhat.com
Depends On: 1212037
Blocks: 1186580 (qe_tracker_everglades)
+++ This bug was initially created as a clone of Bug #1212037 +++
Description of problem:
======================
When we attach a tier to a EC volume which has files already, then on trying to
edit those files using vim, the file with new contents is getting saved to the
hot tier, but the old tier still has the old version of file but renamed to
<filename>~
Eg:
If a file f2 was existing on ec volume and after attaching a tier,
This means it will consume lot of disk space if they are huge files
Version-Release number of selected component (if applicable):
============================================================
[root at vertigo ~]# gluster --version
glusterfs 3.7dev built on Apr 13 2015 07:14:27
Repository revision: git://git.gluster.com/glusterfs.git
Copyright (c) 2006-2011 Gluster Inc. <http://www.gluster.com>
GlusterFS comes with ABSOLUTELY NO WARRANTY.
You may redistribute copies of GlusterFS under the terms of the GNU General
Public License.
[root at vertigo ~]# rpm -qa|grep gluster
glusterfs-server-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-rdma-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-api-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-regression-tests-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-devel-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-resource-agents-3.7dev-0.994.gitf522001.el6.noarch
glusterfs-libs-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-fuse-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-geo-replication-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-cli-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-api-devel-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-extra-xlators-3.7dev-0.994.gitf522001.el6.x86_64
glusterfs-debuginfo-3.7dev-0.994.gitf522001.el6.x86_64
How reproducible:
================
easily
Steps to Reproduce:
==================
1.create a EC volume
2.add files to it after mounting
3.Now attach a tier and try to edit one of those existing files, it can be seen
that while editing u get error, but on force edit, the file with edited
contents get saved in hot tier, while the old file copy still exists as
<filename> ~
Additional info:
=================
[root at ninja ~]# ls /rhs/brick*/rhat*
/rhs/brick1/rhatvol-10:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
/rhs/brick1/rhatvol-2:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
/rhs/brick2/rhatvol-12:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
/rhs/brick2/rhatvol-4:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
/rhs/brick3/rhatvol-6:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
/rhs/brick3/rhatvol-tier:
f2 f9 newfile
/rhs/brick4/rhatvol-8:
f1 f10 f11 f12 f13 f14 f15 f16 f17 f18 f19 f2~ f20 f3 f4 f5 f6
f7 f8 f9~
--- Additional comment from Anand Avati on 2015-04-24 10:56:30 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#2) for review on master by
Joseph Fernandes (josferna at redhat.com)
--- Additional comment from Anand Avati on 2015-04-29 05:36:07 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#3) for review on master by
Joseph Fernandes (josferna at redhat.com)
--- Additional comment from Anand Avati on 2015-05-02 03:08:02 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#4) for review on master by
Joseph Fernandes (josferna at redhat.com)
--- Additional comment from Anand Avati on 2015-05-03 03:01:37 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#5) for review on master by
Joseph Fernandes (josferna at redhat.com)
--- Additional comment from Anand Avati on 2015-05-03 14:47:07 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#6) for review on master by
Joseph Fernandes (josferna at redhat.com)
--- Additional comment from Anand Avati on 2015-05-05 14:52:04 EDT ---
REVIEW: http://review.gluster.org/10370 (ctr/xlator: Named lookup heal of
pre-existing files, before ctr was ON.) posted (#8) for review on master by Dan
Lambright (dlambrig at redhat.com)
--- Additional comment from Anand Avati on 2015-05-06 07:42:32 EDT ---
COMMIT: http://review.gluster.org/10370 committed in master by Vijay Bellur
(vbellur at redhat.com)
------
commit cb11dd91a6cc296e4a3808364077f4eacb810e48
Author: Joseph Fernandes <josferna at redhat.com>
Date: Fri Apr 24 19:22:44 2015 +0530
ctr/xlator: Named lookup heal of pre-existing files, before ctr was ON.
Problem: The CTR xlator records file meta (heat/hardlinks)
into the data. This works fine for files which are created
after ctr xlator is switched ON. But for files which were
created before CTR xlator is ON, CTR xlator is not able to
record either of the meta i.e heat or hardlinks. Thus making
those files immune to promotions/demotions.
Solution: The solution that is implemented in this patch is
do ctr-db heal of all those pre-existent files, using named lookup.
For this purpose we use the inode-xlator context variable option
in gluster.
The inode-xlator context variable for ctr xlator will have the
following,
a. A Lock for the context variable
b. A hardlink list: This list represents the successful looked
up hardlinks.
These are the scenarios when the hardlink list is updated:
1) Named-Lookup: Whenever a named lookup happens on a file, in the
wind path we copy all required hardlink and inode information to
ctr_db_record structure, which resides in the frame->local variable.
We dont update the database in wind. During the unwind, we read the
information from the ctr_db_record and ,
Check if the inode context variable is created, if not we create it.
Check if the hard link is there in the hardlink list.
If its not there we add it to the list and send a update to the
database using libgfdb.
Please note: The database transaction can fail(and we ignore) as
there
already might be a record in the db. This update to the db is to heal
if its not there.
If its there in the list we ignore it.
2) Inode Forget: Whenever an inode forget hits we clear the hardlink list
in
the inode context variable and delete the inode context variable.
Please note: An inode forget may happen for two reason,
a. when the inode is delete.
b. the in-memory inode is evicted from the inode table due to cache
limits.
3) create: whenever a create happens we create the inode context variable
and
add the hardlink. The database updation is done as usual by ctr.
4) link: whenever a hardlink is created for the inode, we create the inode
context
variable, if not present, and add the hardlink to the list.
5) unlink: whenever a unlink happens we delete the hardlink from the list.
6) mknod: same as create.
7) rename: whenever a rename happens we update the hardlink in list. if the
hardlink
was not present for updation, we add the hardlink to the list.
What is pending:
1) This solution will only work for named lookups.
2) We dont track afr-self-heal/dht-rebalancer traffic for healing.
Change-Id: Ia4bbaf84128ad6ce8c3ddd70bcfa82894c79585f
BUG: 1212037
Signed-off-by: Joseph Fernandes <josferna at redhat.com>
Signed-off-by: Dan Lambright <dlambrig at redhat.com>
Reviewed-on: http://review.gluster.org/10370
Reviewed-by: Jeff Darcy <jdarcy at redhat.com>
Tested-by: Gluster Build System <jenkins at build.gluster.com>
Tested-by: NetBSD Build System
Reviewed-by: Vijay Bellur <vbellur at redhat.com>
--- Additional comment from Niels de Vos on 2015-05-15 09:07:31 EDT ---
This change should not be in "ON_QA", the patch posted for this bug is only
available in the master branch and not in a release yet. Moving back to
MODIFIED until there is an beta release for the next GlusterFS version.
Referenced Bugs:
https://bugzilla.redhat.com/show_bug.cgi?id=1186580
[Bug 1186580] QE tracker bug for Everglades
https://bugzilla.redhat.com/show_bug.cgi?id=1212037
[Bug 1212037] Data Tiering:Old copy of file still remaining on EC(disperse)
layer, when edited after attaching tier(new copy is moved to hot tier)
--
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=njJkKh60ae&a=cc_unsubscribe
More information about the Bugs
mailing list