[Gluster-users] The continuing story ...
Jeff Evans
jeffe at tricab.com
Tue Sep 8 00:13:17 UTC 2009
> - server was ping'able
> - glusterfsd was disconnected by the client because of missing
> ping-pong - no login possible
> - no fs action (no lights on the hd-stack)
> - no screen (was blank, stayed blank)
This is very similar to what I have seen many times (even back on
1.3), and have also commented on the list.
It seems that we have quite a few ACK's on this, or similar problems.
The only thing different in my scenario, is that the console doesn't
stay blank. When attempting to login I get the last login message, and
nothing more, no prompt ever. Also, I can see that other processes are
still listening on sockets etc.. so it seems like the kernel just
can't grab new FD's.
I too found the hang happens more easily if a downed node from a
replicate pair re-joins after some time.
Following suggestions that this is all kernel related, I have just
moved up to RHEL 5.4 in the hope that the new kernel will
help.
This fix stood out as potentially related for me:
https://bugzilla.redhat.com/show_bug.cgi?id=445433
We also have a broadcom network card, which had reports of hangs under
load, the kernel has a patch for that too.
If I still run into the hangs, I'll try xfs.
Thanks, Jeff.
More information about the Gluster-users
mailing list