[Bugs] [Bug 1205579] New: gluster nfs server process was crashed multiple time while mounting volume and starting volume using force option

bugzilla at redhat.com bugzilla at redhat.com
Wed Mar 25 08:59:19 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1205579

            Bug ID: 1205579
           Summary: gluster nfs server process was crashed multiple time
                    while mounting volume and starting volume using force
                    option
           Product: GlusterFS
           Version: mainline
         Component: nfs
          Severity: high
          Priority: high
          Assignee: ndevos at redhat.com
          Reporter: ndevos at redhat.com
                CC: bugs at gluster.org, gluster-bugs at redhat.com
            Blocks: 1196520



Description of problem:
=======================
while mounting volume using nfs it was hung and found nfs server process was
crashed and core was generated. After that killed some brick process and during
start force of that volume found crash again

nfs log snippet:-
[2014-08-09 03:25:04.297623] E [nfs.c:341:nfs_init_versions] 0-nfs: Program 
NLM4 registration failed
[2014-08-09 03:25:04.297642] E [nfs.c:1327:init] 0-nfs: Failed to initialize
protocols
[2014-08-09 03:25:04.297654] E [xlator.c:423:xlator_init] 0-nfs-server:
Initialization of volume 'nfs-server' failed, review your volfile again
[2014-08-09 03:25:04.297684] E [graph.c:292:glusterfs_graph_init] 0-nfs-server:
initializing translator failed
[2014-08-09 03:25:04.297698] E [graph.c:479:glusterfs_graph_activate] 0-graph:
init failed
pending frames:
frame : type(0) op(0)

patchset: git://git.gluster.com/glusterfs.git
signal received: 11

NLM (of NFS) failed to register with portmapper, so init() routine of NFS
xlator returns (-1) to xlator_init(). Call flow:

glusterfsd (NFS process)
  glusterfs_process_volfp() =>
     glusterfs_graph_activate() =>
         glusterfs_graph_init() =>
             xlator_init () => NFS init() routine

But somehow cleanup_and_exit() goes through some race which causes the SEGV.
But it clearly says the crash is in the exit code path i.e. NLM failed to
register with portmapper and NFS process was about to exit but segfaulted. 


The root cause:
The NLM was not able to register with portmapper which prohibited NFS to start.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=Yzzp1e7YHM&a=cc_unsubscribe


More information about the Bugs mailing list