[Gluster-users] Client and server file "view", different results?! Client can't see the right file.

Martin Schenker martin.schenker at profitbricks.com
Mon May 16 09:19:29 UTC 2011


Hi all!

Here we have another mismatch between the client "view" and the server
mounts:

From the server side everything seems fine: the 20G file is visible and the
attributes seem to match:

	0 root@pserver5:~ # getfattr -R -d -e hex -m "trusted.afr." /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/

	# file: mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images//20964
	trusted.afr.storage0-client-2=0x000000000000000000000000
	trusted.afr.storage0-client-3=0x000000000000000000000000

	0 root@pserver5:~ # find /mnt/gluster/ -name 20964 | xargs -i ls -al {}
	-rwxrwx--- 1 libvirt-qemu vcb 21474836480 May 13 11:21 /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/20964
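
For reading the trusted.afr values above: as far as the AFR changelog
format is documented, each value is 12 bytes holding three big-endian
32-bit counters of pending data, metadata and entry operations, so all
zeros means nothing is pending against that brick. A minimal sketch of a
decoder follows; decode_afr is a made-up helper name, not a Gluster tool.

```shell
# Hypothetical helper: split a trusted.afr.* hex value (as printed by
# "getfattr -e hex") into its three 32-bit pending counters.
decode_afr() {
    hex=${1#0x}                      # drop the leading 0x if present
    case $hex in
        *[!0-9a-fA-F]*) echo "not a hex value: $1" >&2; return 1 ;;
    esac
    if [ "${#hex}" -ne 24 ]; then    # 12 bytes = 24 hex digits
        echo "expected 24 hex digits, got ${#hex}" >&2
        return 1
    fi
    data=$(printf '%s' "$hex" | cut -c1-8)
    meta=$(printf '%s' "$hex" | cut -c9-16)
    entry=$(printf '%s' "$hex" | cut -c17-24)
    # printf converts the 0x-prefixed fields to decimal
    printf 'data=%d metadata=%d entry=%d\n' "0x$data" "0x$meta" "0x$entry"
}

decode_afr 0x000000000000000000000000
decode_afr 0x0b0000090900000000000000
```

The second call uses the non-zero client-3 value from pserver3 quoted
further down in this thread; non-zero data and metadata counters would
normally mean operations are still recorded as pending for that brick.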

But the client view shows TWO files, both with 0 bytes size! And these aren't
link files created by Gluster (the ones with the T at the end of the
permissions):

	0 root@pserver5:~ # find /opt/profitbricks/storage/ -name 20964 | xargs -i ls -al {}
	-rwxrwx--- 1 libvirt-qemu kvm 0 May 13 11:24 /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/20964
	-rwxrwx--- 1 libvirt-qemu kvm 0 May 13 11:24 /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/20964
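
On the link-file point: on a brick, a DHT link file is normally a
zero-byte file whose mode is ---------T (octal 1000) and which carries a
trusted.glusterfs.dht.linkto xattr. The sketch below checks only mode and
size, using GNU stat; is_dht_linkfile is a hypothetical helper name, and
it is meant for brick paths rather than the client mount.

```shell
# Hypothetical helper: succeed if the given brick-side file looks like a
# DHT link file (regular file, mode 1000 i.e. ---------T, size 0).
is_dht_linkfile() {
    f=$1
    [ -f "$f" ] || return 1
    mode=$(stat -c '%a' "$f") || return 1   # octal permissions, e.g. 1000
    size=$(stat -c '%s' "$f") || return 1   # size in bytes
    [ "$mode" = "1000" ] && [ "$size" -eq 0 ]
}
```

A file listed as -rwxrwx--- (mode 770), like the two entries above, fails
this check. Where the attr tools are installed,
"getfattr -n trusted.glusterfs.dht.linkto -e text <brick-path>" could be
used to look for the link target as a second confirmation.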

I'm a bit stumped that we seem to have so many weird errors cropping up. Any
ideas? I've checked the ext4 filesystem on all boxes, no real problems. We
run a distributed cluster with 4 servers offering 2 bricks each.

Best, Martin




> -----Original Message-----
> From: Mohit Anchlia [mailto:mohitanchlia at gmail.com] 
> Sent: Monday, May 16, 2011 2:24 AM
> To: Martin Schenker
> Cc: gluster-users at gluster.org
> Subject: Re: [Gluster-users] Brick pair file mismatch, self-heal problems?
> 
> 
> Try this to trigger self heal:
> 
> find <gluster-mount> -noleaf -name <file name> -print0 | xargs --null stat >/dev/null
> 
> 
> 
> On Sun, May 15, 2011 at 11:20 AM, Martin Schenker <martin.schenker at profitbricks.com> wrote:
> > Can someone enlighten me what's going on here? We have two peers,
> > the file 21313 is shown through the client mountpoint as "1Jan1970",
> > attribs on server pserver3 don't match but NO self-heal or repair can
> > be triggered through "ls -alR"?!?
> >
> > Checking the files through the server mounts shows that two versions
> > are on the system. But the wrong one (as with the "1Jan1970") seems to
> > be the preferred one by the client?!?
> >
> > Do I need to use setfattr or what in order to get the client to see the
> > RIGHT version?!? This is not the ONLY file displaying this problematic
> > behaviour!
> >
> > Thanks for any feedback.
> >
> > Martin
> >
> > pserver5:
> >
> > 0 root@pserver5:~ # ls -al /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images
> >
> > -rwxrwx--- 1 libvirt-qemu vcb  483183820800 May 13 13:41 21313
> >
> > 0 root@pserver5:~ # getfattr -R -d -e hex -m "trusted.afr." /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > getfattr: Removing leading '/' from absolute path names
> > # file: mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > trusted.afr.storage0-client-2=0x000000000000000000000000
> > trusted.afr.storage0-client-3=0x000000000000000000000000
> >
> > 0 root@pserver5:~ # ls -alR /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > -rwxrwx--- 1 libvirt-qemu kvm 483183820800 Jan  1  1970 /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> >
> > pserver3:
> >
> > 0 root@pserver3:~ # ls -al /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images
> >
> > -rwxrwx--- 1 libvirt-qemu kvm  483183820800 Jan  1  1970 21313
> >
> > 0 root@pserver3:~ # ls -alR /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > -rwxrwx--- 1 libvirt-qemu kvm 483183820800 Jan  1  1970 /opt/profitbricks/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> >
> > 0 root@pserver3:~ # getfattr -R -d -e hex -m "trusted.afr." /mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > getfattr: Removing leading '/' from absolute path names
> > # file: mnt/gluster/brick1/storage/images/2078/ebb83b05-3a83-9d18-ad8f-8542864da6ef/hdd-images/21313
> > trusted.afr.storage0-client-2=0x000000000000000000000000
> > trusted.afr.storage0-client-3=0x0b0000090900000000000000  <- mismatch, should be targeted for self-heal/repair?
> > Why is there a difference in the views?
> >
> >
> > From the volfile:
> >
> > volume storage0-client-2
> >    type protocol/client
> >    option remote-host de-dc1-c1-pserver3
> >    option remote-subvolume /mnt/gluster/brick1/storage
> >    option transport-type rdma
> >    option ping-timeout 5
> > end-volume
> >
> > volume storage0-client-3
> >    type protocol/client
> >    option remote-host de-dc1-c1-pserver5
> >    option remote-subvolume /mnt/gluster/brick1/storage
> >    option transport-type rdma
> >    option ping-timeout 5
> > end-volume
> >