[Gluster-devel] Cannot run VMware Virtual Machines on GlusterFS

Tomoaki Sato tsato at valinux.co.jp
Tue Jun 26 06:19:52 UTC 2012


Avati,

I truly appreciate your efforts in resolving the problem.
I've confirmed followings with a distributed volume and a distributed+replicated volume in my environment.

At the ESXi host (NFS client):
- mkdir, chdir and getcwd of some sub directories.
- create an empty VM in a sub directory.
- power-on and install CentOS on the VM.
- boot the VM with CentOS.
- login and shutdown the VM.

Fernando,

Could you try the patch with some striped volumes in your environment?

Regards,

Tomo

(2012/06/26 11:59), Anand Avati wrote:
> Please let me know if this patch fixes your problem:
>
> http://review.gluster.com/3617
>
> Thanks for your help and patience so far!
>
> Avati
>
> On Mon, Jun 25, 2012 at 7:50 PM, Anand Avati <anand.avati at gmail.com <mailto:anand.avati at gmail.com>> wrote:
>
>     Tomaoki, excellent debugging! Please add yourself to CC - https://bugzilla.redhat.com/show_bug.cgi?id=835336
>
>     Avati
>
>
>     On Sun, Jun 24, 2012 at 10:55 PM, Tomoaki Sato <tsato at valinux.co.jp <mailto:tsato at valinux.co.jp>> wrote:
>
>         Avati,
>
>         Are these intended ?:
>         - hashcount value of 'bar'(0) is not same as 'foo/..'(2) and,
>         - hashcount value of 'foo'(1) is not same as 'foo/../foo'(3).
>
>
>         # tshark -i 1 -R nfs
>         Running as user "root" and group "root". This could be dangerous.
>         Capturing on eth0
>           2.386732 192.168.1.23 -> 192.168.1.132 NFS V3 GETATTR Call, FH:0x43976ad5
>           2.387772 192.168.1.132 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 7)  Directory mode:0755 uid:0 gid:0
>           3.666252 192.168.1.23 -> 192.168.1.132 NFS V3 GETATTR Call, FH:0x43976ad5
>           3.667112 192.168.1.132 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 17)  Directory mode:0755 uid:0 gid:0
>           3.667260 192.168.1.23 -> 192.168.1.132 NFS V3 LOOKUP Call, DH:0x43976ad5/foo                  /* bar/foo */
>           3.668321 192.168.1.132 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 19), FH:0x3f9fd887
>           11.386638 192.168.1.23 -> 192.168.1.132 NFS V3 GETATTR Call, FH:0x43976ad5
>           11.387664 192.168.1.132 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 52)  Directory mode:0755 uid:0 gid:0
>           20.386438 192.168.1.23 -> 192.168.1.132 NFS V3 GETATTR Call, FH:0x43976ad5
>           20.387436 192.168.1.132 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 95)  Directory mode:0755 uid:0 gid:0
>           29.382531 192.168.1.23 -> 192.168.1.132 NFS V3 GETATTR Call, FH:0x43976ad5
>           29.383796 192.168.1.132 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 126)  Directory mode:0755 uid:0 gid:0
>           33.666658 192.168.1.23 -> 192.168.1.132 NFS V3 LOOKUP Call, DH:0x3f9fd887/..                   /* foo/.. */
>           33.668097 192.168.1.132 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 144), FH:0x42966b36
>           33.668310 192.168.1.23 -> 192.168.1.132 NFS V3 READDIRPLUS Call, FH:0x42966b36
>           33.669996 192.168.1.132 -> 192.168.1.23 NFS V3 READDIRPLUS Reply (Call In 146) .. foo .
>           33.670188 192.168.1.23 -> 192.168.1.132 NFS V3 LOOKUP Call, DH:0x42966b36/..                   /* bar/.. */
>           33.671279 192.168.1.132 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 148), FH:0xbc1b2900
>           33.671425 192.168.1.23 -> 192.168.1.132 NFS V3 LOOKUP Call, DH:0x42966b36/foo                  /* bar/foo */
>           33.672421 192.168.1.132 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 150), FH:0x3e9ed964
>         20 packets captured
>
>         # egrep "nfs3_log_fh_entry_call|nfs3___log_newfh_res" /var/log/glusterfs/nfs.log | tail -8
>         [2012-06-25 14:28:40.090333] D [nfs3-helpers.c:1645:nfs3_log___fh_entry_call] 0-nfs-nfsv3: XID: 3d78d872, LOOKUP: args: FH: hashcount 0, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 00000000-0000-0000-0000-__000000000001, name: foo                      /* bar/foo */
>         [2012-06-25 14:28:40.091108] D [nfs3-helpers.c:3462:nfs3_log___newfh_res] 0-nfs-nfsv3: XID: 3d78d872, LOOKUP: NFS: 0(Call completed successfully.), POSIX: 0(Success), FH: hashcount 1, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 7c4b5a51-0108-4ac9-8fd2-__4b843dcb2715
>         [2012-06-25 14:29:10.089791] D [nfs3-helpers.c:1645:nfs3_log___fh_entry_call] 0-nfs-nfsv3: XID: 3d78d879, LOOKUP: args: FH: hashcount 1, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 7c4b5a51-0108-4ac9-8fd2-__4b843dcb2715, name: ..                       /* foo/.. */
>         [2012-06-25 14:29:10.090872] D [nfs3-helpers.c:3462:nfs3_log___newfh_res] 0-nfs-nfsv3: XID: 3d78d879, LOOKUP: NFS: 0(Call completed successfully.), POSIX: 0(Success), FH: hashcount 2, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 00000000-0000-0000-0000-__000000000001
>         [2012-06-25 14:29:10.093266] D [nfs3-helpers.c:1645:nfs3_log___fh_entry_call] 0-nfs-nfsv3: XID: 3d78d87b, LOOKUP: args: FH: hashcount 2, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 00000000-0000-0000-0000-__000000000001, name: ..                       /* bar/.. */
>         [2012-06-25 14:29:10.094056] D [nfs3-helpers.c:3462:nfs3_log___newfh_res] 0-nfs-nfsv3: XID: 3d78d87b, LOOKUP: NFS: 0(Call completed successfully.), POSIX: 0(Success), FH: hashcount 3, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 6edd430d-bc57-470e-8e98-__eacfe1a91040
>         [2012-06-25 14:29:10.094498] D [nfs3-helpers.c:1645:nfs3_log___fh_entry_call] 0-nfs-nfsv3: XID: 3d78d87c, LOOKUP: args: FH: hashcount 2, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 00000000-0000-0000-0000-__000000000001, name: foo                      /* bar/foo */
>         [2012-06-25 14:29:10.095198] D [nfs3-helpers.c:3462:nfs3_log___newfh_res] 0-nfs-nfsv3: XID: 3d78d87c, LOOKUP: NFS: 0(Call completed successfully.), POSIX: 0(Success), FH: hashcount 3, exportid b2d75589-8370-4528-ab4e-__b543b3abdc3b, gfid 7c4b5a51-0108-4ac9-8fd2-__4b843dcb2715
>
>         Regards,
>
>         Tomo
>
>         Anand Avati wrote:
>
>             Tomoaki, this is very useful. I will look deeper soon.
>             Thanks!
>
>             Avati
>
>             On Thu, Jun 21, 2012 at 9:21 PM, Tomoaki Sato <tsato at valinux.co.jp <mailto:tsato at valinux.co.jp> <mailto:tsato at valinux.co.jp <mailto:tsato at valinux.co.jp>>> wrote:
>
>                 Avati,
>
>                 tshark says ...
>                 FH values that the linux kernel NFS server returns stays constant for every LOOKUP 'foo' but,
>                 FH values that the GlusterFS(NFS) returns are non-constant.
>
>                 operaions at the ESXi host:
>
>                 ~ # ./getcwd /vmfs/volumes/94925201-__78f190e0/foo
>                 ========= sleep 30 ================
>                 /vmfs/volumes/94925201-__78f190e0/foo
>                 ~ #
>
>                 tshark's output at the linux kernel NFS server:
>
>                 # tshark -i 2 -R nfs
>                 Running as user "root" and group "root". This could be dangerous.
>                 Capturing on br0
>                 /* chdir */
>                   2.056680 192.168.1.23 -> 192.168.1.254 NFS V3 GETATTR Call, FH:0x1ffd38ff
>                   2.056990 192.168.1.254 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 13)  Directory mode:0755 uid:0 gid:0
>                   9.848666 192.168.1.23 -> 192.168.1.254 NFS V3 GETATTR Call, FH:0x1ffd38ff
>                   9.848767 192.168.1.254 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 60)  Directory mode:0755 uid:0 gid:0
>                   9.848966 192.168.1.23 -> 192.168.1.254 NFS V3 LOOKUP Call, DH:0x1ffd38ff/foo
>                   9.849049 192.168.1.254 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 62), FH:0xdb05b90a         <=====
>                   20.055508 192.168.1.23 -> 192.168.1.254 NFS V3 GETATTR Call, FH:0x1ffd38ff
>                   20.055702 192.168.1.254 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 103)  Directory mode:0755 uid:0 gid:0
>                   29.054939 192.168.1.23 -> 192.168.1.254 NFS V3 GETATTR Call, FH:0x1ffd38ff
>                   29.055180 192.168.1.254 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 132)  Directory mode:0755 uid:0 gid:0
>                   38.054338 192.168.1.23 -> 192.168.1.254 NFS V3 GETATTR Call, FH:0x1ffd38ff
>                   38.054583 192.168.1.254 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 151)  Directory mode:0755 uid:0 gid:0
>                 /* getcwd */
>                   39.849107 192.168.1.23 -> 192.168.1.254 NFS V3 LOOKUP Call, DH:0xdb05b90a/..
>                   39.849449 192.168.1.254 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 170), FH:0x1ffd38ff
>                   39.849676 192.168.1.23 -> 192.168.1.254 NFS V3 READDIRPLUS Call, FH:0x1ffd38ff
>                   39.849833 192.168.1.254 -> 192.168.1.23 NFS V3 READDIRPLUS Reply (Call In 172) . .. foo
>                   39.850071 192.168.1.23 -> 192.168.1.254 NFS V3 LOOKUP Call, DH:0x1ffd38ff/foo
>                   39.850149 192.168.1.254 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 174), FH:0xdb05b90a
>                   39.850746 192.168.1.23 -> 192.168.1.254 NFS V3 LOOKUP Call, DH:0xdb05b90a/..
>                   39.850814 192.168.1.254 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 176), FH:0x1ffd38ff
>                   39.851014 192.168.1.23 -> 192.168.1.254 NFS V3 READDIRPLUS Call, FH:0x1ffd38ff
>                   39.851095 192.168.1.254 -> 192.168.1.23 NFS V3 READDIRPLUS Reply (Call In 178) . .. foo
>                   39.851329 192.168.1.23 -> 192.168.1.254 NFS V3 LOOKUP Call, DH:0x1ffd38ff/foo
>                   39.851438 192.168.1.254 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 180), FH:0xdb05b90a            <=====
>
>                 operations at the ESXi host:
>
>                 ~ # ./getcwd /vmfs/volumes/ef172a87-__e5ae817f/foo
>                 ========= sleep 30 ================
>                 getcwd: No such file or directory
>                 ~ #
>
>                 tshark's output at the GlusterFS(NFS) server:
>
>                 # tshark -i 1 -R nfs
>                 Running as user "root" and group "root". This could be dangerous.
>                 Capturing on eth0
>                 /* chdir */
>                   1.228396 192.168.1.23 -> 192.168.1.136 NFS V3 GETATTR Call, FH:0x43976ad5
>                   1.229406 192.168.1.136 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 6)  Directory mode:0755 uid:0 gid:0
>                   4.445894 192.168.1.23 -> 192.168.1.136 NFS V3 GETATTR Call, FH:0x43976ad5
>                   4.446916 192.168.1.136 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 16)  Directory mode:0755 uid:0 gid:0
>                   4.447099 192.168.1.23 -> 192.168.1.136 NFS V3 LOOKUP Call, DH:0x43976ad5/foo
>                   4.448147 192.168.1.136 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 18), FH:0x3f9fd887     <=====
>                   10.228438 192.168.1.23 -> 192.168.1.136 NFS V3 GETATTR Call, FH:0x43976ad5
>                   10.229432 192.168.1.136 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 31)  Directory mode:0755 uid:0 gid:0
>                   19.228321 192.168.1.23 -> 192.168.1.136 NFS V3 GETATTR Call, FH:0x43976ad5
>                   19.229309 192.168.1.136 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 47)  Directory mode:0755 uid:0 gid:0
>                   28.228139 192.168.1.23 -> 192.168.1.136 NFS V3 GETATTR Call, FH:0x43976ad5
>                   28.229112 192.168.1.136 -> 192.168.1.23 NFS V3 GETATTR Reply (Call In 70)  Directory mode:0755 uid:0 gid:0
>                 /* getcwd */
>                   34.448796 192.168.1.23 -> 192.168.1.136 NFS V3 LOOKUP Call, DH:0x3f9fd887/..
>                   34.450119 192.168.1.136 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 81), FH:0x42966b36
>                   34.450343 192.168.1.23 -> 192.168.1.136 NFS V3 READDIRPLUS Call, FH:0x42966b36
>                   34.452105 192.168.1.136 -> 192.168.1.23 NFS V3 READDIRPLUS Reply (Call In 83) .. foo .
>                   34.452311 192.168.1.23 -> 192.168.1.136 NFS V3 LOOKUP Call, DH:0x42966b36/..
>                   34.453464 192.168.1.136 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 85), FH:0xbc1b2900
>                   34.453648 192.168.1.23 -> 192.168.1.136 NFS V3 LOOKUP Call, DH:0x42966b36/foo
>                   34.454677 192.168.1.136 -> 192.168.1.23 NFS V3 LOOKUP Reply (Call In 87), FH:0x3e9ed964     <======
>
>                 Regards,
>
>                 Tomo
>
>                 (2012年06月20日 16:28), Tomoaki Sato wrote:
>                  > Avati,
>                  >
>                  > I've tried following:
>                  > 1) 'esxcfg-nas -d gluster_nfs' at the ESXi host.
>                  > 2) 'volume set bar nfs.enable-ino32 on' at the 192.168.1.136 host.
>                  > 3) 'volume stop bar' and 'volume start bar' at the 192.168.1.136 host.
>                  > 4) 'esxcfg-nas -a -o 192.168.1.136 -s /bar gluster_nfs' at the ESXi host.
>                  >
>                  > on the ESXi host:
>                  >
>                  > ~ # uname -m
>                  > x86_64
>                  > ~ # mkdir /vmfs/volumes/ef172a87-__e5ae817f/after-enable-ino32-on
>                  > ~ # ls -liR /vmfs/volumes/ef172a87-__e5ae817f
>                  > /vmfs/volumes/ef172a87-__e5ae817f:
>                  > -2118204814 drwxr-xr-x 1 root root 4096 Jun 20 07:13 after-enable-ino32-on
>                  > 1205893126 drwxr-xr-x 1 root root 4096 Jun 20 07:08 baz
>                  > -1291907235 drwx------ 1 root root 16384 Jun 6 23:41 lost+found
>                  >
>                  > /vmfs/volumes/ef172a87-__e5ae817f/after-enable-ino32-__on:
>                  >
>                  > /vmfs/volumes/ef172a87-__e5ae817f/baz:
>                  > -1374929331 drwxr-xr-x 1 root root 4096 Jun 19 06:41 foo
>                  >
>                  > /vmfs/volumes/ef172a87-__e5ae817f/baz/foo:
>                  >
>                  > /vmfs/volumes/ef172a87-__e5ae817f/lost+found:
>                  > ~ # ./getcwd /vmfs/volumes/ef172a87-__e5ae817f/after-enable-ino32-on
>                  > getcwd: No such file or directory
>                  > ~ #
>                  >
>                  > on the 192.168.1.136 host:
>                  >
>                  > # gluster volume info bar
>                  >
>                  > Volume Name: bar
>                  > Type: Distribute
>                  > Volume ID: b2d75589-8370-4528-ab4e-__b543b3abdc3b
>                  > Status: Started
>                  > Number of Bricks: 1
>                  > Transport-type: tcp
>                  > Bricks:
>                  > Brick1: bar-1-private:/mnt/brick
>                  > Options Reconfigured:
>                  > diagnostics.brick-log-level: TRACE
>                  > diagnostics.client-log-level: TRACE
>                  > nfs.enable-ino32: on
>                  >
>                  > please fine attached nfs.log5.
>                  >
>                  > Regards,
>                  >
>                  > Tomo
>                  >
>                  > (2012/06/20 16:11), Anand Avati wrote:
>                  >> -1374929331 drwxr-xr-x 1 root root 4096 Jun 19 06:41 foo
>                  >>
>                  >> ...
>                  >>
>                  >> 2920037965 drwxr-xr-x 2 root root 4096 Jun 19 15:41 foo
>                  >>
>                  >>
>                  >> Ouch!
>                  >>
>                  >> -1374929331 == (int32_t) 2920037965
>                  >>
>                  >> 'uname -m' from the ESXi host please! Is it a 32bit OS? Can you try 'gluster volume set bar nfs.enable-ino32 on' and retry?
>                  >>
>                  >> Avati
>                  >
>
>
>
>
>
>






More information about the Gluster-devel mailing list