[Bugs] [Bug 1247833] New: sharding - OS installation on vm image hangs on a sharded volume
bugzilla at redhat.com
Wed Jul 29 03:49:42 UTC 2015
https://bugzilla.redhat.com/show_bug.cgi?id=1247833
Bug ID: 1247833
Summary: sharding - OS installation on vm image hangs on a sharded volume
Product: GlusterFS
Version: 3.7.3
Component: sharding
Keywords: Triaged
Assignee: kdhananj at redhat.com
Reporter: kdhananj at redhat.com
QA Contact: bugs at gluster.org
CC: bugs at gluster.org
Depends On: 1247108
+++ This bug was initially created as a clone of Bug #1247108 +++
Description of problem:
OS installation on a VM image stored on a sharded volume hangs partway through.
Statedumps of the FUSE client taken at several points show that a readv() fop
is hung:
<statedump>
...
...
[global.callpool.stack.1.frame.10]
frame=0x7f0b0bcfd150
ref_count=0
translator=dis-rep-shard
complete=0 <==== complete is 0.
parent=dis-rep-trace
wind_from=trace_readv
wind_to=FIRST_CHILD(this)->fops->readv
unwind_to=trace_readv_cbk
...
...
[global.callpool.stack.1.frame.14]
frame=0x7f0b0bcd6f40
ref_count=1
translator=dis-rep
complete=0 <======== complete is 0
parent=fuse
wind_from=fuse_readv_resume
wind_to=FIRST_CHILD(this)->fops->readv
unwind_to=fuse_readv_cbk
...
...
</statedump>
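For illustration only, a minimal sketch of how such a statedump could be
scanned for frames that never completed. It assumes the dump is available as
plain text with one key=value pair per line, as in the excerpt above;
count_incomplete_frames() is a hypothetical helper, not part of GlusterFS.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: count lines of statedump-style text that begin with
 * "complete=0". Trailing annotations after the value are tolerated. */
int count_incomplete_frames(const char *dump)
{
    static const char key[] = "complete=0";
    const size_t key_len = sizeof(key) - 1;
    int incomplete = 0;
    const char *line = dump;

    while (*line != '\0') {
        const char *end = strchr(line, '\n');
        size_t len = end ? (size_t)(end - line) : strlen(line);

        /* Match "complete=0" but not a longer number such as "complete=01". */
        if (len >= key_len && strncmp(line, key, key_len) == 0 &&
            (len == key_len || line[key_len] < '0' || line[key_len] > '9'))
            incomplete++;

        line += len;
        if (*line == '\n')
            line++;
    }
    return incomplete;
}
```

A non-zero result that persists across statedumps taken some time apart would
point at the same symptom as above: a frame that is never unwound.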
The hang was traced to call_count dropping to -1 at the end of
shard_common_lookup_shards(), so this particular stack is never unwound back
to FUSE:
(gdb) p (call_frame_t *)0x7f0b0bcfd150
$1 = (call_frame_t *) 0x7f0b0bcfd150
(gdb) p (shard_local_t *)$1->local
$2 = (shard_local_t *) 0x7f0b0086310c
(gdb) p $2->call_count
$3 = -1
(gdb) p $2->eexist_count
$4 = 1
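To make the failure mode concrete, here is a simplified model of a
countdown-style callback. It is a sketch under assumptions, not the shard
translator's code: it assumes the callback decrements call_count and unwinds
only when the counter reaches exactly 0, and that a stale value of 0 was left
in call_count before the lookups were wound. All sketch_* names are
hypothetical.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the per-operation state; only the counter seen
 * in the gdb session is modelled. */
struct sketch_local {
    int  call_count; /* replies still expected */
    bool unwound;    /* true once the parent frame has been resumed */
};

/* Runs once per lookup reply (assumed behaviour): the reply that brings the
 * counter to exactly 0 is the one that unwinds. */
void sketch_lookup_cbk(struct sketch_local *local)
{
    local->call_count--;
    if (local->call_count == 0)
        local->unwound = true;
}

/* Wind `shards` lookups and deliver all replies. With `reinit` set, the
 * counter is reset to the number of lookups before winding; without it,
 * whatever stale value was left behind is reused. */
struct sketch_local sketch_lookup_shards(int stale_count, int shards, bool reinit)
{
    struct sketch_local local = { .call_count = stale_count, .unwound = false };

    if (reinit)
        local.call_count = shards;

    for (int i = 0; i < shards; i++)
        sketch_lookup_cbk(&local);

    return local;
}
```

In this model, sketch_lookup_shards(0, 1, false) ends with call_count at -1
and unwound still false, which matches the value printed by gdb, while
sketch_lookup_shards(0, 1, true) ends with call_count at 0 and the frame
unwound.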
--- Additional comment from Krutika Dhananjay on 2015-07-28 03:37:32 EDT ---
http://review.gluster.org/#/c/11770/
--- Additional comment from Anand Avati on 2015-07-28 09:44:12 EDT ---
REVIEW: http://review.gluster.org/11778 (features/shard: Fix block size get
from xdata) posted (#1) for review on master by Pranith Kumar Karampuri
(pkarampu at redhat.com)
--- Additional comment from Anand Avati on 2015-07-28 21:53:52 EDT ---
COMMIT: http://review.gluster.org/11770 committed in master by Pranith Kumar
Karampuri (pkarampu at redhat.com)
------
commit d051bd14223d12ca8eaea85f6988ff41e5eef2c3
Author: Krutika Dhananjay <kdhananj at redhat.com>
Date: Tue Jul 28 11:25:55 2015 +0530
features/shard: (Re)initialize local->call_count before winding lookup
Change-Id: I616409c38b86c0acf1817b3472a1fed73db293f8
BUG: 1247108
Signed-off-by: Krutika Dhananjay <kdhananj at redhat.com>
Reviewed-on: http://review.gluster.org/11770
Tested-by: NetBSD Build System <jenkins at build.gluster.org>
Reviewed-by: Pranith Kumar Karampuri <pkarampu at redhat.com>
Tested-by: Gluster Build System <jenkins at build.gluster.com>
Referenced Bugs:
https://bugzilla.redhat.com/show_bug.cgi?id=1247108
[Bug 1247108] sharding - OS installation on vm image hangs on a sharded
volume