[Gluster-devel] split brain with all-zero pending matrix

Pranith Kumar Karampuri pkarampu at redhat.com
Sat Jun 22 14:52:30 UTC 2013


Emmanuel,
   What do the extended attributes look like in the backend for this file on the two bricks.

Pranith.

----- Original Message -----
> From: "Emmanuel Dreyfus" <manu at netbsd.org>
> To: "Gluster Devel" <gluster-devel at nongnu.org>
> Sent: Friday, June 21, 2013 6:23:31 AM
> Subject: [Gluster-devel] split brain with all-zero pending matrix
> 
> On 3.4.0beta3, after using the volume for a while, I get split-brain errors
> with an unhelpful pending matrix.
> 
> [2013-06-20 08:44:00.665731] E
> [afr-self-heal-common.c:197:afr_sh_print_split_brain_log]
> 0-gfs34-replicate-1: Unable to self-heal contents of
> '/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o' (possible
> split-brain). Please delete the file from all but the preferred subvolume.-
> Pending matrix:  [ [ 0 0 ] [ 0 0 ] ]
> [2013-06-20 08:44:00.666431] E
> [afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
> 0-gfs34-replicate-1: background  data self-heal failed on
> /manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o
> [2013-06-20 08:44:00.667201] W [afr-open.c:213:afr_open] 0-gfs34-replicate-1:
> failed to open as split brain seen, returning EIO
> [2013-06-20 08:44:00.668193] W [fuse-bridge.c:875:fuse_fd_cbk]
> 0-glusterfs-fuse: 8711927: OPEN()
> /manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o => -1 (Input/output
> error)
> 
> On the bricks (ls -l, md5 and first 32 bytes):
> brick0
> -rw-r--r--  2 manu  manu  7216 Jun 20 09:56
> /export/wd3a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o
> MD5 (/export/wd3a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o) =
> 24ee57aa8e2aeb6102ba170fb81bbf22
> 00000000  7f 45 4c 46 01 01 01 00  00 00 00 00 00 00 00 00
> |.ELF............|
> 00000010  01 00 03 00 01 00 00 00  00 00 00 00 00 00 00 00
> |................|
> brick1
> -rw-r--r--  2 manu  manu  7216 Jun 20 09:56
> /export/wd1a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o
> MD5 (/export/wd1a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o) =
> 24ee57aa8e2aeb6102ba170fb81bbf22
> 00000000  7f 45 4c 46 01 01 01 00  00 00 00 00 00 00 00 00
> |.ELF............|
> 00000010  01 00 03 00 01 00 00 00  00 00 00 00 00 00 00 00
> |................|
> brick2
> -rw-r--r--  2 manu  manu  6256 Jun 20 09:56
> /export/wd3a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o
> MD5 (/export/wd3a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o) =
> 58a4b8c5929cac2f799c60b3dd2acc2f
> brick3
> 00000000  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00
> |................|
> 00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00
> |................|
> -rw-r--r--  2 manu  manu  7216 Jun 20 09:56
> /export/wd1a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o
> MD5 (/export/wd1a/manu/netbsd/usr/src/tools/gcc/obj/build/gcc/cfgbuild.o) =
> 2bc383747432c49793a2cf1a2c0e8cfa
> 00000000  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00
> |................|
> 00000010  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00
> |................|
> 
> That shows the offending copies without any doubt, but I still wonder how
> we come to that situation. There are a lot of such files, making the
> glusterfs
> volume just unusable at this point.
> 
> --
> Emmanuel Dreyfus
> http://hcpnet.free.fr/pubz
> manu at netbsd.org
> 
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel at nongnu.org
> https://lists.nongnu.org/mailman/listinfo/gluster-devel
> 




More information about the Gluster-devel mailing list