[Gluster-users] Hundreds of duplicate files
Joe Julian
joe at julianfamily.org
Wed Feb 18 19:51:25 UTC 2015
On 02/18/2015 11:43 AM, tbenzvi at 3vgeomatics.com wrote:
> Hi Olav,
>
> I have a hunch that our problem was caused by improper unmounting of
> the gluster volume, and have since found that the proper order should
> be: kill all jobs using volume -> unmount volume on clients -> gluster
> volume stop -> stop gluster service (if necessary)
> In my case, I wrote a Python script to find duplicate files on the
> mounted volume, then delete the corresponding link files on the bricks
> (making sure to also delete files in the .glusterfs directory)
> However, your find command was also suggested to me and I think it's a
> simpler solution. I believe removing all link files (even ones that
> are not causing duplicates) is fine since the next file access gluster
> will do a lookup on all bricks and recreate any link files if
> necessary. Hopefully a gluster expert can chime in on this point as
> I'm not completely sure.
You are correct.
> Keep in mind your setup is somewhat different than mine as I have only
> 5 bricks with no replication.
> Regards,
> Tom
>
> --------- Original Message ---------
> Subject: Re: [Gluster-users] Hundreds of duplicate files
> From: "Olav Peeters" <opeeters at gmail.com>
> Date: 2/18/15 10:52 am
> To: gluster-users at gluster.org, tbenzvi at 3vgeomatics.com
>
> Hi all,
> I'm have this problem after upgrading from 3.5.3 to 3.6.2.
> At the moment I am still waiting for a heal to finish (on a 31TB
> volume with 42 bricks, replicated over three nodes).
>
> Tom,
> how did you remove the duplicates?
> with 42 bricks I will not be able to do this manually..
> Did a:
> find $brick_root -type f -size 0 -perm 1000 -exec /bin/rm {} \;
> work for you?
>
> Should this type of thing ideally not be checked and mended by a heal?
>
> Does anyone have an idea yet how this happens in the first place?
> Can it be connected to upgrading?
>
> Cheers,
> Olav
>
>
>
> On 01/01/15 03:07, tbenzvi at 3vgeomatics.com wrote:
>
> No, the files can be read on a newly mounted client! I went
> ahead and deleted all of the link files associated with these
> duplicates, and then remounted the volume. The problem is fixed!
> Thanks again for the help, Joe and Vijay.
> Tom
>
> --------- Original Message ---------
> Subject: Re: [Gluster-users] Hundreds of duplicate files
> From: "Vijay Bellur" <vbellur at redhat.com>
> Date: 12/28/14 3:23 am
> To: tbenzvi at 3vgeomatics.com, gluster-users at gluster.org
>
> On 12/28/2014 01:20 PM, tbenzvi at 3vgeomatics.com wrote:
> > Hi Vijay,
> > Yes the files are still readable from the .glusterfs path.
> > There is no explicit error. However, trying to read a
> text file in
> > python simply gives me null characters:
> >
> > >>> open('ott_mf_itab').readlines()
> >
> ['\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00']
> >
> > And reading binary files does the same
> >
>
> Is this behavior seen with a freshly mounted client too?
>
> -Vijay
>
> > --------- Original Message ---------
> > Subject: Re: [Gluster-users] Hundreds of duplicate files
> > From: "Vijay Bellur" <vbellur at redhat.com>
> > Date: 12/27/14 9:57 pm
> > To: tbenzvi at 3vgeomatics.com, gluster-users at gluster.org
> >
> > On 12/28/2014 10:13 AM, tbenzvi at 3vgeomatics.com wrote:
> > > Thanks Joe, I've read your blog post as well as your post
> > regarding the
> > > .glusterfs directory.
> > > I found some unneeded duplicate files which were not
> being read
> > > properly. I then deleted the link file from the brick.
> This always
> > > removes the duplicate file from the listing, but the
> file does not
> > > always become readable. If I also delete the
> associated file in the
> > > .glusterfs directory on that brick, then some more
> files become
> > > readable. However this solution still doesn't work for
> all files.
> > > I know the file on the brick is not corrupt as it can
> be read
> > directly
> > > from the brick directory.
> >
> > For files that are not readable from the client, can you
> check if the
> > file is readable from the .glusterfs/ path?
> >
> > What is the specific error that is seen while trying to
> read one such
> > file from the client?
> >
> > Thanks,
> > Vijay
> >
> >
> >
> > _______________________________________________
> > Gluster-users mailing list
> > Gluster-users at gluster.org
> > http://www.gluster.org/mailman/listinfo/gluster-users
> >
>
>
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-users
>
>
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-users
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.gluster.org/pipermail/gluster-users/attachments/20150218/eccac1d5/attachment.html>
More information about the Gluster-users
mailing list