[Gluster-users] The continuing story ...

Jeff Evans jeffe at tricab.com
Tue Sep 8 00:13:17 UTC 2009


> - server was ping'able
> - glusterfsd was disconnected by the client because of missing
> ping-pong - no login possible
> - no fs action (no lights on the hd-stack)
> - no screen (was blank, stayed blank)

This is very similar to what I have seen many times (even back on
1.3), and I have commented on it on the list before.

It seems that we have quite a few ACKs on this, or on similar problems.

The only thing different in my scenario is that the console doesn't
stay blank. When attempting to log in I get the last-login message and
nothing more; no prompt ever appears. I can also see that other
processes are still listening on sockets etc., so it seems like the
kernel just can't grab new FDs.
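One way to test the FD theory might be to log the system-wide file
handle counters before the box hangs (once it has hung there is no way
to log in and look). A minimal sketch, assuming a Linux /proc
filesystem; the function names are my own, and whether these counters
actually climb before a hang is only a guess:

```python
from pathlib import Path


def parse_file_nr(text):
    """Parse /proc/sys/fs/file-nr: allocated, unused, and maximum handles."""
    allocated, unused, maximum = (int(field) for field in text.split())
    return {"allocated": allocated, "unused": unused, "max": maximum}


def fd_usage_ratio(stats):
    """Fraction of the system-wide file handle limit currently in use."""
    in_use = stats["allocated"] - stats["unused"]
    return in_use / stats["max"]


if __name__ == "__main__":
    # Read the live counters; run this periodically (e.g. from cron)
    # and keep the output so there is a record leading up to a hang.
    stats = parse_file_nr(Path("/proc/sys/fs/file-nr").read_text())
    print(stats, "usage: {:.1%}".format(fd_usage_ratio(stats)))
```

If the ratio stays low right up to a hang, the problem is probably
somewhere other than handle exhaustion.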

I too found the hang happens more easily if a downed node from a
replicate pair re-joins after some time.

Following suggestions that this is all kernel related, I have just
moved up to RHEL 5.4 in the hope that the new kernel will
help.

This fix stood out as potentially related for me:
https://bugzilla.redhat.com/show_bug.cgi?id=445433

We also have a Broadcom network card, for which there were reports of
hangs under load; the kernel has a patch for that too.

If I still run into the hangs, I'll try xfs.

Thanks, Jeff.
