[Gluster-users] cleaning up duplicate files

Todd Pfaff pfaff at rhpcs.mcmaster.ca
Sun Feb 26 16:17:53 UTC 2012


I'm using gluster 3.2.5.  I have a situation where I've somehow gotten
multiple copies of some files on back-end bricks that are members of the
same distribute volume set.  Accessing these files from the front-end
volume results in an Input/Output error.  I don't know how I got into
this situation and I don't really care about that at the moment.  I'd
just like to fix the problem now without having to go to the extreme
of removing everything from the bricks.

I'd do the fixing manually if it were a small number of files but there
are thousands.

Is there any gluster operation that can automatically fix such cases?

Alternatively, short of removing everything from back-end bricks and
starting from a clean slate, has anyone written code to find and fix such
duplicate files?

Fortunately these files are backups so if I do have to remove them
completely the primary copy still exists elsewhere.

Regards,
Todd



More information about the Gluster-users mailing list