[Gluster-users] hanging httpd processes.
Yong Zhang
hiscal at outlook.com
Sat Apr 1 00:56:24 UTC 2017
Sorry, I replied under the wrong subject line; please disregard this.
From: Yong Zhang<mailto:hiscal at outlook.com>
Sent: Saturday, April 1, 2017 1:49 AM
To: Amar Tumballi<mailto:atumball at redhat.com>; Alvin Starr<mailto:alvin at netvel.net>
Cc: gluster-users at gluster.org List<mailto:gluster-users at gluster.org>
Subject: Re: [Gluster-users] hanging httpd processes.
Thanks Amar, I’ll consider your recommendations. But why is performance totally different on the two nodes? Will data be written to both nodes at the same time?
From: Amar Tumballi<mailto:atumball at redhat.com>
Sent: Friday, March 31, 2017 3:14 PM
To: Alvin Starr<mailto:alvin at netvel.net>
Cc: gluster-users at gluster.org List<mailto:gluster-users at gluster.org>
Subject: Re: [Gluster-users] hanging httpd processes.
On Fri, Mar 31, 2017 at 12:29 PM, Amar Tumballi <atumball at redhat.com<mailto:atumball at redhat.com>> wrote:
Hi Alvin,
Thanks for the dump output. It helped a bit.
For now, I recommend turning off the open-behind and read-ahead performance translators to get rid of this situation, as I noticed hung FLUSH operations from these translators.
Looks like I gave the wrong advice by looking at the snippet below:
[global.callpool.stack.61]
stack=0x7f6c6f628f04
uid=48
gid=48
pid=11077
unique=10048797
lk-owner=a73ae5bdb5fcd0d2
op=FLUSH
type=1
cnt=5
[global.callpool.stack.61.frame.1]
frame=0x7f6c6f793d88
ref_count=0
translator=edocs-production-write-behind
complete=0
parent=edocs-production-read-ahead
wind_from=ra_flush
wind_to=FIRST_CHILD (this)->fops->flush
unwind_to=ra_flush_cbk
[global.callpool.stack.61.frame.2]
frame=0x7f6c6f796c90
ref_count=1
translator=edocs-production-read-ahead
complete=0
parent=edocs-production-open-behind
wind_from=default_flush_resume
wind_to=FIRST_CHILD(this)->fops->flush
unwind_to=default_flush_cbk
[global.callpool.stack.61.frame.3]
frame=0x7f6c6f79b724
ref_count=1
translator=edocs-production-open-behind
complete=0
parent=edocs-production
wind_from=io_stats_flush
wind_to=FIRST_CHILD(this)->fops->flush
unwind_to=io_stats_flush_cbk
[global.callpool.stack.61.frame.4]
frame=0x7f6c6f79b474
ref_count=1
translator=edocs-production
complete=0
parent=fuse
wind_from=fuse_flush_resume
wind_to=FIRST_CHILD(this)->fops->flush
unwind_to=fuse_err_cbk
[global.callpool.stack.61.frame.5]
frame=0x7f6c6f796684
ref_count=1
translator=fuse
complete=0
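As a rough aid for reading a call-pool dump like the one above, here is a minimal Python sketch that lists the frames of one stack that are still marked complete=0. It is not part of Gluster; the function names parse_statedump and pending_frames are hypothetical, and SAMPLE is only an abridged copy of the snippet above. It assumes the dump consists of "[section]" headers followed by key=value lines, as shown in this thread.

```python
import re

SECTION_RE = re.compile(r"^\[(.+)\]$")


def parse_statedump(text):
    """Parse '[section]' headers and key=value lines into a dict of dicts."""
    sections = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = SECTION_RE.match(line)
        if match:
            current = sections.setdefault(match.group(1), {})
        elif current is not None and "=" in line:
            key, value = line.split("=", 1)
            current[key] = value
    return sections


def pending_frames(sections, stack_name):
    """Return (frame number, translator) for frames with complete=0, sorted by frame number."""
    prefix = stack_name + ".frame."
    frames = []
    for name, fields in sections.items():
        if name.startswith(prefix) and fields.get("complete") == "0":
            frames.append((int(name[len(prefix):]), fields.get("translator")))
    return sorted(frames)


# Abridged from the dump in this thread; only a few fields are kept.
SAMPLE = """\
[global.callpool.stack.61]
op=FLUSH
cnt=5
[global.callpool.stack.61.frame.1]
translator=edocs-production-write-behind
complete=0
parent=edocs-production-read-ahead
[global.callpool.stack.61.frame.2]
translator=edocs-production-read-ahead
complete=0
parent=edocs-production-open-behind
"""

if __name__ == "__main__":
    parsed = parse_statedump(SAMPLE)
    stack = "global.callpool.stack.61"
    print("op:", parsed[stack]["op"])
    for number, translator in pending_frames(parsed, stack):
        print(number, translator)
```

In this dump, frame.1 names the translator furthest from fuse that the FLUSH has reached (each frame's parent is the translator of the next-numbered frame), so the first entry returned is the one to look at first.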
Most probably, the issue is with write-behind's flush. So please turn off write-behind and test. If you no longer have any hung httpd processes, please let us know.
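For reference, turning a performance translator off is done with "gluster volume set". The small Python sketch below only builds and prints the command rather than running it. The volume name "edocs-production" is an assumption inferred from the translator names in the dump, and the helper name disable_option_cmd is hypothetical.

```python
# Assumption: the volume is called "edocs-production", inferred from
# translator names such as "edocs-production-write-behind" in the dump.
VOLUME = "edocs-production"

# Option to switch off for the test suggested above. The earlier suggestion
# would use "performance.open-behind" and "performance.read-ahead" instead.
OPTIONS = ("performance.write-behind",)


def disable_option_cmd(volume, option):
    """Build the argv for: gluster volume set <volume> <option> off"""
    return ["gluster", "volume", "set", volume, option, "off"]


if __name__ == "__main__":
    for option in OPTIONS:
        # Printed only; run the command by hand on a server node once reviewed.
        print(" ".join(disable_option_cmd(VOLUME, option)))
```

Printing instead of executing keeps the sketch safe to run anywhere; the printed line can be pasted into a shell on one of the Gluster servers.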
-Amar
On Wed, Mar 29, 2017 at 6:56 AM, Alvin Starr <alvin at netvel.net<mailto:alvin at netvel.net>> wrote:
We are running gluster 3.8.9-1 on CentOS 7.3.1611 on the servers and 3.7.11-2 on CentOS 6.8 on the clients.
We are seeing httpd processes hang in fuse_request_send or sync_page.
These calls are from PHP 5.3.3-48 scripts.
I am attaching a tgz file that contains the process dump from glusterfsd and the hung pids along with the offending pid's stacks from /proc/{pid}/stack.
This has been a low-level annoyance for a while, but it has become a much bigger issue because the number of hung processes went from a few a week to a few hundred a day.
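For anyone wanting to collect the same information, the following is a minimal Python sketch of one way to find processes whose kernel stack contains fuse_request_send or sync_page. It is an illustration, not the script used to produce the attachment. It assumes a Linux /proc layout, that /proc/{pid}/stack is readable (this normally requires root), and the helper names are hypothetical.

```python
import os

# Kernel functions named in the report above.
HUNG_SYMBOLS = ("fuse_request_send", "sync_page")


def read_text(path):
    """Return the file contents, or '' if it is missing or unreadable."""
    try:
        with open(path) as handle:
            return handle.read()
    except OSError:
        return ""


def process_name(status_text):
    """Extract the value of the 'Name:' line from /proc/<pid>/status text."""
    for line in status_text.splitlines():
        if line.startswith("Name:"):
            return line.split(":", 1)[1].strip()
    return ""


def matching_symbols(stack_text, symbols=HUNG_SYMBOLS):
    """Return the symbols that occur anywhere in a /proc/<pid>/stack dump."""
    return [symbol for symbol in symbols if symbol in stack_text]


def find_hung(name="httpd", proc_root="/proc"):
    """Map pid -> matched symbols for processes called `name`."""
    hits = {}
    if not os.path.isdir(proc_root):
        return hits
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        base = os.path.join(proc_root, entry)
        if process_name(read_text(os.path.join(base, "status"))) != name:
            continue
        found = matching_symbols(read_text(os.path.join(base, "stack")))
        if found:
            hits[int(entry)] = found
    return hits


if __name__ == "__main__":
    for pid, symbols in sorted(find_hung().items()):
        print(pid, ", ".join(symbols))
```

A single match only shows where a process was at that instant; running the scan a few times and keeping the pids that appear repeatedly gives a better indication of a real hang.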
--
Alvin Starr || voice: (905)513-7688
Netvel Inc. || Cell: (416)806-0133
alvin at netvel.net<mailto:alvin at netvel.net> ||
_______________________________________________
Gluster-users mailing list
Gluster-users at gluster.org<mailto:Gluster-users at gluster.org>
http://lists.gluster.org/mailman/listinfo/gluster-users
--
Amar Tumballi (amarts)