[Gluster-users] Glusterfs 13.12 client crashes; how to proceed?
Dan Farrell
dan at spore.ath.cx
Thu Oct 2 03:39:05 UTC 2008
Friends,
Hi! I am so besieged with gluster-setup problems, I am not sure where
to begin. I've compiled and glusterfs-1.3.12
(glusterfs--mainline--2.5--patch-797) on 3 seperate computers with the
intention of building an HA Cluster as in Daniel Maher's howto:
http://gluster.org/docs/index.php/High-availability_storage_using_server-side_AFR
glusterfsd -f /etc/glusterfs/glusterfs-server.vol -L DEBUG
-l /var/log/gluster Two of the servers have essentially the same server
config, as shown in the howto. They only differ in that they refer to
each other.
The client config is also pulled directly from the howto. I have
reason to believe the configs are working (fixed the typos already) but
cannot be sure. The servers are both started with the line:
glusterfsd -f /etc/glusterfs/glusterfs-server.vol -L DEBUG
-l /var/log/glusterfsd
The logs mention a rather worrying:
2008-10-01 22:15:56 D [xlator.c:145:xlator_set_type] xlator:
dlsym(notify)
on /usr/local/lib/glusterfs/1.3.12/xlator/features/posix-locks.so:
undefined symbol: notify -- neglecting
other than that, things seem to go well until I get the other server
fired up, they talk fora a while, and then one serve2008-10-01 22:16:19
D [ip.c:120:gf_auth] wwwfs-ds-locks: allowed = "172.17.1.*", received
ip addr = "172.17.1.101" 2008-10-01 22:16:19 D
[server-protocol.c:5674:mop_setvolume] server: accepted client from
172.17.1.101:1023 2008-10-01 22:16:19 D
[server-protocol.c:5717:mop_setvolume] server: creating inode table
with lru_limit=1024, xlator=wwwfs-ds-locks 2008-10-01 22:16:19 D
[inode.c:1163:inode_table_new] wwwfs-ds-locks: creating new inode table
with lru_limit=1024, sizeof(inode_t)=96 2008-10-01 22:16:19 D
[inode.c:577:__create_inode] wwwfs-ds-locks/inode: create inode(1)
2008-10-01 22:16:19 D [inode.c:367:__active_inode]
wwwfs-ds-locks/inode: activating inode(1), lru=0/1024 r ends with:
===
The other says:
2008-10-01 22:16:54 D [tcp-client.c:161:tcp_connect] wwwfs-cedar-ds:
connect on 7 in progress (non-blocking)
2008-10-01 22:16:54 D [tcp-client.c:205:tcp_connect] wwwfs-cedar-ds:
connection on 7 success
2008-10-01 22:16:54 D [client-protocol.c:5362:notify] wwwfs-cedar-ds:
got GF_EVENT_CHILD_UP
2008-10-01 22:16:54 D
[client-protocol.c:5103:client_protocol_handshake_reply]
wwwfs-cedar-ds: reply frame has callid: 424242
2008-10-01 22:16:54 D
[client-protocol.c:5137:client_protocol_handshake_reply]
wwwfs-cedar-ds: SETVOLUME on remote-host succeeded
2008-10-01 22:16:54 D [afr.c:6001:notify] wwwfs-ds-afr:
GF_EVENT_CHILD_UP from wwwfs-cedar-ds
2008-10-01 22:16:54 D [afr.c:5933:afr_check_xattr_cbk] wwwfs-ds-afr:
'wwwfs-cedar-ds' supports Extended attribute
2008-10-01 22:17:49 D
[client-protocol.c:4763:client_protocol_reconnect] wwwfs-cedar-ds:
breaking reconnect chain
/var/log/glusterfsd lines 138-149/
The client really bombs:
glusterfs -f /etc/glusterfs/glusterfs-client.vol -L DEBUG
-l /var/log/glusterfs /var/www/
It too says:
2008-10-01 21:29:46 D [xlator.c:145:xlator_set_type] xlator:
dlsym(notify)
on /usr/local/lib/glusterfs/1.3.12/xlator/performance/write-behind.so:
undefined symbol: notify -- neglecting
As well as
2008-10-01 22:34:08 E
[client-protocol.c:5133:client_protocol_handshake_reply] www-fs:
SETVOLUME on remote-host failed: ret=-1 error=remote-subvolume "www-fs"
is not found
Finally, it gives up:
2008-10-01 22:37:59 D [tcp.c:87:tcp_disconnect] www-fs: connection
disconnected
I am at something of a loss as to how to proceed. Is there a better
version to be running? Did I miss something building/installing? Is
the configuration botched? I would very much appreciate it if someone
could help.
SERVER 1, cedar
# dataspace local, on oak.
volume wwwfs-ds
type storage/posix
option directory /var/wwwfs-ds
end-volume
# posix locks
volume wwwfs-ds-locks
type features/posix-locks
subvolumes wwwfs-ds
end-volume
# dataspace on storage2
volume wwwfs-oak-ds
type protocol/client
option transport-type tcp/client
option remote-host oak.storage.farrellit.net
option remote-subvolume wwwfs-ds-locks
option transport-timeout 10
end-volume
# automatic file replication translator for dataspace
volume wwwfs-ds-afr
type cluster/afr
subvolumes wwwfs-ds-locks wwwfs-oak-ds
# the actual exported volume
volume wwwfs
type performance/io-threads
option thread-count 8
option cache-size 64MB
subvolumes wwwfs-ds-afr
end-volume
# finally, the server declaration
volume server
type protocol/server
option transport-type tcp/server
subvolumes wwwfs
# storage network access only
option auth.ip.wwwfs-ds-locks.allow 172.17.1.*,127.0.0.1
option auth.ip.wwwfs.allow 172.17.1.*
end-volume
#####################
server 2, oak
# dataspace local, on oak.
volume wwwfs-ds
type storage/posix
option directory /var/wwwfs-ds
end-volume
# posix locks
volume wwwfs-ds-locks
type features/posix-locks
subvolumes wwwfs-ds
end-volume
# dataspace on storage2
volume wwwfs-cedar-ds
type protocol/client
option transport-type tcp/client
option remote-host cedar.storage.farrellit.net
option remote-subvolume wwwfs-ds-locks
option transport-timeout 10
be set rel atively low
end-volume
# automatic file replication translator for dataspace
volume wwwfs-ds-afr
type cluster/afr
subvolumes wwwfs-ds-locks wwwfs-cedar-ds # local and remote
dataspaces end-volume
# the actual exported volume
volume wwwfs
type performance/io-threads
option thread-count 8
option cache-size 64MB
subvolumes wwwfs-ds-afr
end-volume
# finally, the server declaration
volume server
type protocol/server
option transport-type tcp/server
subvolumes wwwfs
# storage network access only
option auth.ip.wwwfs-ds-locks.allow 172.17.1.*,127.0.0.1
option auth.ip.wwwfs.allow 172.17.1.*
end-volume
Logs available upon request. Please, guide me!
Thank you!
Dan Farrell
More information about the Gluster-users
mailing list