[Gluster-users] gluster 1.4pre5 crashing

Keith Freedman freedman at FreeFormIT.com
Tue Sep 23 14:57:07 UTC 2008


ok, but what do I until then?  and it's so odd it just suddenly broke 
after all this time.


At 07:50 AM 9/23/2008, Raghavendra G wrote:
>Hi Keith,
>
>Work is being done on AFR to make it stable. Please wait for a stable release.
>
>On Tue, Sep 23, 2008 at 6:41 PM, Keith Freedman 
><<mailto:freedman at freeformit.com>freedman at freeformit.com> wrote:
>as a followup..  I have shutdown the "broken" one in the pair since
>it kept crashing.
>the working one is running on it's own but gluster dies every 10 mins or so.
>seems 1.4pre5 doesn't like being an AFR client all on it's own?
>
>I'm going to see if it works with only itself as the AFR subvolumes list
>
>
>
>
>2008-09-23 07:24:00 E [afr.c:3434:afr_statfs_cbk] home: (child=home2)
>op_ret=-1 op_errno=107(Transport endpoint is not connected)
>2008-09-23 07:24:03 E [afr.c:3434:afr_statfs_cbk] home: (child=home2)
>op_ret=-1 op_errno=107(Transport endpoint is not connected)
>2008-09-23 07:24:28 E [afr.c:4759:afr_create_cbk] home:
>(path=/glusterfile/tmp/1222179868.H882395P21565.HOSTNAME child=home2)
>op_ret=-1 op_errno=107(Transport endpoint is not connected)
>pending frames:
>
>Signal received: 11
>configuration details:argp 1
>backtrace 1
>dlfcn 1
>fdatasync 1
>libpthread 1
>llistxattr 1
>setfsid 1
>spinlock 1
>epoll.h 1
>xattr.h 1
>tv_nsec 1
>package-string: glusterfs 1.4.0pre5
>/lib64/libc.so.6[0x300d0322a0]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_incver_cbk+0x38)[0xe5250c]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/client.so(client_incver+0xb9)[0xa29072]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_lock_cbk+0x6d8)[0xe52d75]
>/usr/local/lib/libglusterfs.so.0[0x125c5b]
>/usr/local/lib/libglusterfs.so.0(mop_lock_impl+0x103)[0x12a6aa]
>/usr/local/lib/libglusterfs.so.0(default_lock+0x126)[0x125d88]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_incver_internal_fd+0x33a)[0xe530d1]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so(afr_close+0x26d)[0xe5c64d]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so[0x7299fa8]
>/lib64/libfuse.so.2[0x10824b2]
>/usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so[0x729cc35]
>/lib64/libpthread.so.0[0x300dc0729a]
>/lib64/libc.so.6(clone+0x6d)[0x300d0e439d]
>---------
>
>
>At 07:09 AM 9/23/2008, Keith Freedman wrote:
> >I had a pair of servers running 1.4pre5 in AFR.
> >they've been running fine for over a week, and suddenly today one of
> >them had decided it just will crash anytime it tries to AFR a file.
> >
> >strange is, it seems to get updates form the other server.
> >it's not up long enough to do any thorough testing, but when I do
> >this from the "good" server:
> >echo `hostname` `date` > /gluster/shared/file
> >I can read the correct hostname and date from the "bad" server, but
> >when I do the same thing on the "bad" server, it crashes instantly.
> >
> >running FC9 with default fuse:
> >fuse-2.7.4-8_10.fc9.i386
> >
> >I'm going to re-install fuse thinking that perhaps something got
> >corrupted, but it's odd it happened while the servers been goign just
> >fine for days.
> >
> >I turned on debugging and here's what it's producing
> >where the log ends is where the server crashed while I was tailing
> >the logfile:
> >2008-09-23 06:56:31 D [inode.c:311:__inode_retire] fuse/inode:
> >retiring inode(0) lru=21/0 active=21 purge=29
> >2008-09-23 06:56:31 D [fuse-bridge.c:437:fuse_lookup] glusterfs-fuse:
> >223: LOOKUP /uservideo/public_html/Guests/Images/Misc/.htaccess
> >2008-09-23 06:56:31 D [inode.c:443:__inode_create] fuse/inode: 
> create inode(0)
> >2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:
> >activating inode(0), lru=21/0 active=22 purge=29
> >2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]
> >glusterfs-fuse: 222: FLUSH() ERR => 0
> >2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]
> >glusterfs-fuse: 224: CLOSE 0x8fecf58
> >2008-09-23 06:56:31 D [fuse-bridge.c:562:fuse_getattr]
> >glusterfs-fuse: 225: FGETATTR 20971566
> >(/user2/public_html/shopping/var/run/classes/kernel/Profiler.php/0x8fece28)
> >2008-09-23 06:56:31 D [fuse-bridge.c:496:fuse_attr_cbk]
> >glusterfs-fuse: 225: FSTAT()
> >/user2/public_html/shopping/var/run/classes/kernel/Profiler.php => 20971566
> >2008-09-23 06:56:31 D [fuse-bridge.c:398:fuse_entry_cbk]
> >glusterfs-fuse: 223: LOOKUP()
> >/uservideo/public_html/Guests/Images/Misc/.htaccess => -1 (No such
> >file or directory)
> >2008-09-23 06:56:31 D [inode.c:311:__inode_retire] fuse/inode:
> >retiring inode(0) lru=21/0 active=21 purge=30
> >2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:
> >activating inode(11010602), lru=20/0 active=22 purge=30
> >2008-09-23 06:56:31 D [fuse-bridge.c:1429:fuse_open] glusterfs-fuse:
> >226: OPEN /uservideo/public_html/Guests/Images/Misc/userLogo.jpg
> >2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]
> >glusterfs-fuse: 224: CLOSE() ERR => 0
> >2008-09-23 06:56:31 D [fuse-bridge.c:1572:fuse_flush] glusterfs-fuse:
> >227: FLUSH 0x8fece28
> >2008-09-23 06:56:31 D [fuse-bridge.c:603:fuse_fd_cbk] glusterfs-fuse:
> >226: OPEN() /uservideo/public_html/Guests/Images/Misc/userLogo.jpg
> >=> 0x8fecd50
> >2008-09-23 06:56:31 D [fuse-bridge.c:1487:fuse_readv] glusterfs-fuse:
> >228: READ (0x8fecd50, size=4096, offset=0)
> >2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]
> >glusterfs-fuse: 227: FLUSH() ERR => 0
> >2008-09-23 06:56:31 D [fuse-bridge.c:1455:fuse_readv_cbk]
> >glusterfs-fuse: 228: READ => 3513/4096,0/3513
> >2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]
> >glusterfs-fuse: 229: CLOSE 0x8fece28
> >2008-09-23 06:56:31 D [fuse-bridge.c:437:fuse_lookup] glusterfs-fuse:
> >230: LOOKUP /user2/public_html/shopping/var/run/classes/kernel/Database.php
> >2008-09-23 06:56:31 D [inode.c:443:__inode_create] fuse/inode: 
> create inode(0)
> >2008-09-23 06:56:31 D [inode.c:268:__inode_activate] fuse/inode:
> >activating inode(0), lru=20/0 active=23 purge=30
> >2008-09-23 06:56:31 D [fuse-bridge.c:1572:fuse_flush] glusterfs-fuse:
> >231: FLUSH 0x8fecd50
> >2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]
> >glusterfs-fuse: 229: CLOSE() ERR => 0
> >2008-09-23 06:56:31 D [inode.c:287:__inode_passivate] fuse/inode:
> >passivating inode(20971566) lru=21/0 active=22 purge=30
> >2008-09-23 06:56:31 D [fuse-bridge.c:370:fuse_entry_cbk]
> >glusterfs-fuse: 230: LOOKUP()
> >/user2/public_html/shopping/var/run/classes/kernel/Database.php => 20971567
> >2008-09-23 06:56:31 D [inode.c:287:__inode_passivate] fuse/inode:
> >passivating inode(20971567) lru=22/0 active=21 purge=30
> >2008-09-23 06:56:31 D [fuse-bridge.c:857:fuse_err_cbk]
> >glusterfs-fuse: 231: FLUSH() ERR => 0
> >2008-09-23 06:56:31 D [fuse-bridge.c:1599:fuse_release]
> >glusterfs-fuse: 232: CLOSE 0x8fecd50
> >2008-09-23 06:56:31 D [fuse-b
> >
> >here's some more from when the server rebooted
> >+-----
> >2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for 'home1'
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/storage/posix.so
> >2008-09-23 07:04:39 D [spec.y:219:section_type] parser:
> >Type:home1:storage/posix
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home1:directory:/gluster/home
> >2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home1
> >2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for
> >'posix-locks-home1'
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file 
> /usr/local/lib/glusterfs/1.4.0pre5/xlator/features/posix-locks.so
> >2008-09-23 07:04:39 D [xlator.c:318:xlator_set_type]
> >posix-locks-home1: dlsym(notify) on
> >/usr/local/lib/glusterfs/1.4.0pre5/xlator/features/posix-locks.so:
> >undefined symbol: notify -- neglecting
> >2008-09-23 07:04:39 D [spec.y:219:section_type] parser:
> >Type:posix-locks-home1:features/posix-locks
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:posix-locks-home1:mandatory:on
> >2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:
> >child:posix-locks-home1->home1
> >2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:posix-locks-home1
> >2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for 'home2'
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/client.so
> >2008-09-23 07:04:39 D [spec.y:219:section_type] parser:
> >Type:home2:protocol/client
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home2:transport-type:tcp/client
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home2:remote-host:<http://72.36.173.218>72.36.173.218
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home2:remote-subvolume:posix-locks-home1
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home2:transport-timeout:10
> >2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home2
> >2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for 'server'
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/protocol/server.so
> >2008-09-23 07:04:39 D [spec.y:219:section_type] parser:
> >Type:server:protocol/server
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:server:transport-type:tcp/server
> >2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:
> >child:server->posix-locks-home1
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:server:auth.addr.posix-locks-home1.allow:<http://72.36.173.2 
> 18>72.36.173.218,<http://127.0.0.1>127.0.0.1
> >2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:server
> >2008-09-23 07:04:39 D [spec.y:194:new_section] parser: New node for 'home'
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/cluster/afr.so
> >2008-09-23 07:04:39 D [xlator.c:324:xlator_set_type] home: strict
> >option validation is not enforced -- neglecting
> >2008-09-23 07:04:39 D [spec.y:219:section_type] parser: 
> Type:home:cluster/afr
> >2008-09-23 07:04:39 D [spec.y:285:section_option] parser:
> >Option:home:read-subvolume:posix-locks-home1
> >2008-09-23 07:04:39 D [spec.y:352:section_sub] parser:
> >child:home->posix-locks-home1
> >2008-09-23 07:04:39 D [spec.y:352:section_sub] parser: child:home->home2
> >2008-09-23 07:04:39 D [spec.y:367:section_end] parser: end:home
> >2008-09-23 07:04:39 D [xlator.c:289:xlator_set_type] xlator: attempt
> >to load file /usr/local/lib/glusterfs/1.4.0pre5/xlator/mount/fuse.so
> >2008-09-23 07:04:39 D [xlator.c:324:xlator_set_type] fuse: strict
> >option validation is not enforced -- neglecting
> >2008-09-23 07:04:39 D [glusterfs.c:771:main] glusterfs: running in pid 1145
> >2008-09-23 07:04:39 D [fuse-options.c:140:fuse_options_validate]
> >fuse-options: using mount-point = /home
> >2008-09-23 07:04:39 D [fuse-options.c:147:fuse_options_validate]
> >fuse-options: using attr-timeout = 1
> >2008-09-23 07:04:39 D [fuse-options.c:159:fuse_options_validate]
> >fuse-options: using entry-timeout = 1
> >2008-09-23 07:04:39 D [fuse-options.c:171:fuse_options_validate]
> >fuse-options: using direct-io-mode = 1
> >2008-09-23 07:04:39 D [client-protocol.c:4383:init] home2: setting
> >transport-timeout to 10
> >2008-09-23 07:04:39 D [transport.c:104:transport_load] transport:
> >attempt to load file /usr/local/lib/glusterfs/1.4.0pre5/transport/socket.so
> >2008-09-23 07:04:39 D [client-protocol.c:4427:init] home2: defaulting
> >limits.transaction-size to 268435456
> >2008-09-23 07:04:39 D [afr.c:6397:init] home: self-heal is enabled (default)
> >2008-09-23 07:04:39 D [afr.c:6421:init] home: config: reads will be
> >done on posix-locks-home1
> >2008-09-23 07:04:39 D [afr.c:6309:notify] home: GF_EVENT_CHILD_UP
> >from posix-locks-home1
> >2008-09-23 07:04:39 D [afr.c:6241:afr_check_xattr_cbk] home:
> >'posix-locks-home1' supports Extended attribute
> >2008-09-23 07:04:39 D [inode.c:928:inode_table_new] fuse: creating
> >new inode table with lru_limit=0
> >2008-09-23 07:04:39 D [inode.c:443:__inode_create] fuse/inode: 
> create inode(0)
> >2008-09-23 07:04:39 D [client-protocol.c:4653:notify] home2: got
> >GF_EVENT_PARENT_UP, attempting connect on transport
> >2008-09-23 07:04:39 D [transport.c:104:transport_load] transport:
> >attempt to load file /usr/local/lib/glusterfs/1.4.0pre5/transport/socket.so
> >2008-09-23 07:04:39 E [name.c:344:af_inet_server_get_local_sockaddr]
> >server: getaddrinfo failed (Name or service not known)
> >2008-09-23 07:04:39 W [common-utils.c:158:gf_print_bytes] glusterfs:
> >Total data (in bytes): transfered (0), received (0)
> >pending frames:
> >
> >Signal received: 11
> >configuration details:argp 1
> >backtrace 1
> >dlfcn 1
> >fdatasync 1
> >libpthread 1
> >llistxattr 1
> >setfsid 1
> >spinlock 1
> >epoll.h 1
> >xattr.h 1
> >tv_nsec 1
> >package-string: glusterfs 1.4.0pre5
> >
> >
> >
> >_______________________________________________
> >Gluster-users mailing list
> ><mailto:Gluster-users at gluster.org>Gluster-users at gluster.org
> >http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users
>
>
>_______________________________________________
>Gluster-users mailing list
><mailto:Gluster-users at gluster.org>Gluster-users at gluster.org
>http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users
>
>
>
>
>--
>Raghavendra G
>
>A centipede was happy quite, until a toad in fun,
>Said, "Prey, which leg comes after which?",
>This raised his doubts to such a pitch,
>He fell flat into the ditch,
>Not knowing how to run.
>-Anonymous





More information about the Gluster-users mailing list