[Gluster-devel] GlusterFS AFR not failing over

Gordan Bobic gordan at bobich.net
Fri Jun 6 19:34:24 UTC 2008


I have /home mounted from GlusterFS with AFR. If one of the servers
(the secondary) goes away, I cannot log in: sshd tries to read ~/.ssh
and bash tries to read ~/.bashrc, and these reads seem to fail, or at
least take a very long time to time out before falling back to the
remaining server (which verifiably works).
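
To put a number on the delay, something like the following could be run
on the client while the secondary is down. This is only a minimal sketch:
the helper name time_lookup is hypothetical, and it assumes that a plain
stat() of the same paths sshd and bash touch at login is enough to
trigger the stall.

```python
import os
import time


def time_lookup(path):
    """Return (elapsed_seconds, error) for a single stat() of path.

    error is None on success, or the OSError that stat() raised.
    """
    start = time.monotonic()
    try:
        os.stat(path)
        error = None
    except OSError as exc:
        error = exc
    return time.monotonic() - start, error


if __name__ == "__main__":
    # Assumed targets: the paths read at login on the AFR-backed /home.
    for name in ("~/.ssh", "~/.bashrc"):
        path = os.path.expanduser(name)
        elapsed, error = time_lookup(path)
        status = "ok" if error is None else f"failed: {error}"
        print(f"{path}: {elapsed:.3f}s ({status})")
```

Comparing the timings with both servers up against the timings with the
secondary down would show whether the lookups fail outright or only
succeed after a long timeout.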

I get this sort of thing in the logs:

E [tcp-client.c:190:tcp_connect] home2: non-blocking connect() returned: 110 (Connection timed out)
E [client-protocol.c:4423:client_lookup_cbk] home2: no proper reply from server, returning ENOTCONN
C [client-protocol.c:212:call_bail] home2: bailing transport

where home2 is the name of the GlusterFS export on the secondary.

Is this a known issue or have I managed to trip another error case?


