[Gluster-devel] SSL enabled glusterd crash

Emmanuel Dreyfus manu at netbsd.org
Fri Aug 7 08:40:57 UTC 2015


On Thu, Aug 06, 2015 at 11:34:12PM -0400, Raghavendra Gowdappa wrote:
> What errors do you get? Is it a problem related to reconnect timer?

The behavior is not consistent accross retries; However I got another crash:

Program terminated with signal 11, Segmentation fault.
#0  0xbbbda222 in rpc_clnt_reconnect (conn_ptr=0xb99a2150) at rpc-clnt.c:413
413                     if ((conn->connected == 0) && !clnt->disabled) {
(gdb) bt
#0  0xbbbda222 in rpc_clnt_reconnect (conn_ptr=0xb99a2150) at rpc-clnt.c:413
#1  0xbbb33c2e in gf_timer_proc (ctx=Cannot access memory at address 
    0xba9fffd8) at timer.c:184
(gdb) list  
408                     if (conn->reconnect)
409                             gf_timer_call_cancel (clnt->ctx,
410                                                   conn->reconnect);
411                     conn->reconnect = 0;
412     
413                     if ((conn->connected == 0) && !clnt->disabled) {
414                             ts.tv_sec = 3;
415                             ts.tv_nsec = 0;
416     
417                             gf_log (conn->name, GF_LOG_TRACE,
(gdb) print conn
$1 = (rpc_clnt_connection_t *) 0xb99a2150
(gdb) print conn->connected
$2 = 0 '\000'
(gdb) print clnt
$3 = (struct rpc_clnt *) 0x0
(gdb) print conn->rpc_clnt
$4 = (struct rpc_clnt *) 0x0
(gdb) print *conn
$5 = {lock = {ptm_magic = 3369885922, ptm_errorcheck = 241 '\361', 
    ptm_pad1 = "de\361", ptm_interlock = 177 '\261', 
    ptm_pad2 = "\351\372\303", ptm_owner = 0xff14a3e2, 
    ptm_waiters = 0x3cf0070a, ptm_recursed = 3719490962, ptm_spare2 = 0x4}, 
  trans = 0x1, config = {rpc_timeout = 0, remote_port = 0, remote_host = 0x0}, 
  reconnect = 0x0, timer = 0x0, ping_timer = 0x0, rpc_clnt = 0x0, 
  connected = 0 '\000', saved_frames = 0x0, frame_timeout = 0, last_sent = {
    tv_sec = -5160433714497904384, tv_usec = 0}, last_received = {
    tv_sec = 30923764531201, tv_usec = 1438935992}, ping_started = 0, 
  name = 0xba1e5df4 "\001", ping_timeout = 0, pingcnt = 0, msgcnt = 0}

conn->lock.ptm_magic is invalid again.


-- 
Emmanuel Dreyfus
manu at netbsd.org


More information about the Gluster-devel mailing list