[Gluster-devel] regression failures on afr/split-brain-resolution

Ravishankar N ravishankar at redhat.com
Tue Jul 24 13:00:43 UTC 2018



On 07/24/2018 02:56 PM, Raghavendra Gowdappa wrote:
> All,
>
> I was trying to debug regression failures on [1] and observed that 
> split-brain-resolution.t was failing consistently.
>
> =========================
> TEST 45 (line 88): 0 get_pending_heal_count patchy
> ./tests/basic/afr/split-brain-resolution.t .. 45/45 RESULT 45: 1
> ./tests/basic/afr/split-brain-resolution.t .. Failed 17/45 subtests
>
> Test Summary Report
> -------------------
> ./tests/basic/afr/split-brain-resolution.t (Wstat: 0 Tests: 45 Failed: 17)
>   Failed tests:  24-26, 28-36, 41-45
>
>
> On probing deeper, I observed a curious fact - on most of the failures 
> stat was not served from md-cache, but instead was wound down to afr 
> which failed stat with EIO as the file was in split brain. So, I did 
> another test:
> * disabled md-cache
> * mounted glusterfs with attribute-timeout 0 and entry-timeout 0
>
> Now the test always fails. So, I think the test relied on stat 
> requests being absorbed by either the kernel attribute cache or 
> md-cache. When that does not happen, the stats reach afr and cause 
> commands such as getfattr to fail.

This indeed seems to be the case.  Is there any way we can avoid the 
stat? When a getfattr is performed on the mount, aren't lookup and 
getfattr the only fops that need to reach gluster?
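
To make the dependency concrete, here is a toy Python model of the behaviour described above. It is not GlusterFS code: SplitBrainAFR, StatCache and getfattr_via_mount are hypothetical names made up for illustration, and the model simply assumes that a stat precedes the getfattr and that afr fails that stat with EIO for a split-brained file.

```python
import errno


class SplitBrainAFR:
    """Toy stand-in for afr: stat on a split-brained file fails with EIO."""

    def __init__(self, split_brain_paths):
        self.split_brain_paths = set(split_brain_paths)
        self.stat_calls = 0  # how many stats actually reached "afr"

    def stat(self, path):
        self.stat_calls += 1
        if path in self.split_brain_paths:
            raise OSError(errno.EIO, "Input/output error", path)
        return {"path": path, "size": 0}


class StatCache:
    """Toy stand-in for md-cache or the kernel attribute cache."""

    def __init__(self, lower, enabled=True):
        self.lower = lower
        self.enabled = enabled
        self.entries = {}

    def prime(self, path, attrs):
        # Models attributes that were cached before the split-brain occurred.
        if self.enabled:
            self.entries[path] = attrs

    def stat(self, path):
        # A cache hit absorbs the stat; otherwise it is wound down to afr.
        if self.enabled and path in self.entries:
            return self.entries[path]
        return self.lower.stat(path)


def getfattr_via_mount(cache, path):
    """Assumed sequence: the stat comes first, and its failure fails the command."""
    try:
        cache.stat(path)
    except OSError as err:
        return errno.errorcode[err.errno]
    return "OK"


if __name__ == "__main__":
    afr = SplitBrainAFR(["/file"])
    attrs = {"path": "/file", "size": 0}

    warm = StatCache(afr, enabled=True)
    warm.prime("/file", attrs)
    print(getfattr_via_mount(warm, "/file"))  # OK: stat absorbed by the cache

    cold = StatCache(afr, enabled=False)      # caches disabled, timeouts 0
    cold.prime("/file", attrs)
    print(getfattr_via_mount(cold, "/file"))  # EIO: stat reached afr
```

With the cache enabled the stat never reaches the afr stand-in, which matches the runs that passed; with it disabled every stat is wound down and the command fails, which matches the always-failing run.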
-Ravi

> Thoughts?
>
> [1] https://review.gluster.org/#/c/20549/
>
