[Gluster-users] OS crash of client using version 3.1.1

Burnash, James jburnash at knight.com
Wed Apr 27 16:49:10 UTC 2011


Hello.

I have had two 3.1.1 client machines running CentOS 5.2 crash with no indication in /var/log/messages, but with this stanza in /var/log/messages/etc-glusterfs-glusterd.vol.log:

[2011-04-27 11:12:00.350935] I [glusterd.c:275:init] management: Using /etc/glusterd as working directory
[2011-04-27 11:12:00.379320] E [rpc-transport.c:905:rpc_transport_load] rpc-transport: /usr/lib64/glusterfs/3.1.1/rpc-transport/rdma.so: cannot open shared object file: No such file or directory
[2011-04-27 11:12:00.379340] E [rpc-transport.c:909:rpc_transport_load] rpc-transport: volume 'rdma.management': transport-type 'rdma' is not valid or not found on this machine
[2011-04-27 11:12:00.389775] I [glusterd.c:87:glusterd_uuid_init] glusterd: retrieved UUID: b03f0420-14cb-403b-86c5-bde8ef2d4a28
Given volfile:
+------------------------------------------------------------------------------+
  1: volume management
  2:     type mgmt/glusterd
  3:     option working-directory /etc/glusterd
  4:     option transport-type socket,rdma
  5:     option transport.socket.keepalive-time 10
  6:     option transport.socket.keepalive-interval 2
  7: end-volume
  8:



I'm not running RDMA, I'm running TCP over 1Gb ethernet.
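
In case it helps anyone sifting through a similar log, here is a minimal sketch that pulls out only the error-level ("E") entries. It assumes every entry follows the bracketed timestamp / level / source / component layout seen in the excerpt above. The regular expression and the error_entries helper are my own illustration, not part of GlusterFS.

```python
import re

# Assumed layout, taken from the excerpt above:
# [timestamp] LEVEL [file:line:function] component: message
LOG_LINE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]\s+"
    r"(?P<level>[A-Z])\s+"
    r"\[(?P<source>[^\]]+)\]\s+"
    r"(?P<component>[^:]+):\s*"
    r"(?P<message>.*)$"
)


def error_entries(lines):
    """Return parsed entries whose level is 'E' (hypothetical helper)."""
    entries = []
    for line in lines:
        match = LOG_LINE.match(line.rstrip("\n"))
        # Lines that do not fit the layout (e.g. the volfile dump) are skipped.
        if match and match.group("level") == "E":
            entries.append(match.groupdict())
    return entries


sample = [
    "[2011-04-27 11:12:00.350935] I [glusterd.c:275:init] management: "
    "Using /etc/glusterd as working directory",
    "[2011-04-27 11:12:00.379340] E [rpc-transport.c:909:rpc_transport_load] "
    "rpc-transport: volume 'rdma.management': transport-type 'rdma' is not "
    "valid or not found on this machine",
    "Given volfile:",
]

for entry in error_entries(sample):
    print(entry["timestamp"], entry["source"], entry["message"])
```

Entries that were wrapped across several lines when pasted would need to be rejoined before this pattern matches them.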

Volume created with:

gluster volume create pfs-ro1 replica 2 transport tcp <bricks ...>

Server info:
root@jc1letgfs18:/export/read-only# /usr/sbin/glusterfs -V
glusterfs 3.1.3 built on Mar 16 2011 01:01:54
Repository revision: v3.1.3

Client info:
rpm -qa "gluster*"
glusterfs-core-3.1.1-1.x86_64
glusterfs-fuse-3.1.1-1.x86_64
glusterfs-debuginfo-3.1.1-1.x86_64

Finally, this is (hopefully) the relevant section from the crashdump:

taps_linux64.os[10253]: segfault at 0000000000000000 rip 000000000042b057 rsp 0000000040c60f20 error 4
nfs: server pid3780@jc1lodin2:/net not responding, still trying
nfs: server pid3780@jc1lodin2:/net OK
Unable to handle kernel NULL pointer dereference at 00000000000000e8 RIP:
 [<ffffffff800095c6>] __link_path_walk+0x54/0xf42
PGD 730763067 PUD 7db2e3067 PMD 0
Oops: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:00.0/irq
CPU 0
Modules linked in: fuse(U) mptctl mptbase sg ipmi_si(U) ipmi_devintf(U) ipmi_msghandler(U) hpilo(U) nfs fscache nfsd exportfs lockd nfs_acl auth_rpcgss autofs4 sunrpc bonding dm_multipath video sbs backlight i2c_ec i2c_core button battery asus_acpi acpi_memhotplug ac parport_pc lp parport ata_piix libata ide_cd cdrom i5000_edac serio_raw edac_mc pcspkr shpchp bnx2 dm_snapshot dm_zero dm_mirror dm_mod cciss(U) sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 4004, comm: csh Tainted: G      2.6.18-92.el5 #1

Thanks,

James Burnash, Unix Engineering



