[Gluster-devel] Need help diagnosing regression-test crashes
Atin Mukherjee
atin.mukherjee83 at gmail.com
Sat Apr 9 08:20:19 UTC 2016
-Atin
Sent from one plus one
On 09-Apr-2016 9:57 am, "Vijay Bellur" <vbellur at redhat.com> wrote:
>
> On 04/09/2016 12:17 AM, Atin Mukherjee wrote:
>>
>> -Atin
>> Sent from one plus one
>> On 09-Apr-2016 9:32 am, "Rajesh Joseph" <rjoseph at redhat.com
>> <mailto:rjoseph at redhat.com>> wrote:
>> >
>> >
>> >
>> > On Sat, Apr 9, 2016 at 2:05 AM, Jeff Darcy <jdarcy at redhat.com
>> <mailto:jdarcy at redhat.com>> wrote:
>> >>
>> >> Upon further investigation, I've been able to determine that the
problem
>> >> lies in this line of our generic cleanup routine.
>> >>
>> >> type cleanup_lvm &>/dev/null && cleanup_lvm || true;
>> >>
>> >> This works great if snapshot.rc we're at the end of a test that
included
>> >> snapshot.rc (which defines cleanup_lvm), but we've generally been
moving
>> >> away from that in favor of calling it only at the beginning. Thus,
when
>> >> we go from a snapshot test to a non-snapshot test, the cleanup at the
>> >> beginning of the latter does *not* clean up any LVM stuff that's left
>> >> over. What might have been a simple and correctly attributed
failure in
>> >> the snapshot test can instead show up later. In this case, the
sequence
>> >> of events is as follows:
>> >>
>> >> 1) bug-1322772 (snapshot) test starts glusterd
>> >>
>> >> 2) bug-1322772 exits while the new glusterd is still initializing
>> >>
>> >> 3) run-tests.sh looks for new core files and finds none
>> >>
>> >> 4) run-tests.sh starts bug-1002207 (stripe) test
>> >>
>> >> 5) glusterd from bug-1322772 dumps core
>> >>
>> >> 6) bug-1002207 test completes
>> >>
>> >> 7) run-tests.sh sees new core and misattributes it to bug-1002207
>> >>
>> >> The question is what to do about this. Unconditionally calling
>> >> lvm_cleanup from generic cleanup is simple, but might make regression
>> >> tests noticeably slower. Another possibility would be to change all
>> >> snapshot tests to call cleanup (or at least cleanup_lvm) at the end,
or
>> >> use bash's "trap" mechanism to ensure the same. I'm not wild about
any
>> >> of those, but lean toward the "trap" approach. Anyone else have any
>> >> opinions?
>> >
>> >
>> > I think each snapshot test script should call cleanup_lvm and trap is
a
>> > great suggestion.
>> >
>> > atinm: Can you please look into the crash in the following test case?
>> > bugs/snapshot/bug-1322772-real-path-fix-for-snapshot.t
>>
>> Do we have the link to the crash?
>
>
> OT - Possibly unrelated glusterd crash in mainline [1]. This needs some
attention too.
Sure, will take a look.
>
> -Vijay
>
> [1] http://www.gluster.org/pipermail/maintainers/2016-April/000619.html
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.gluster.org/pipermail/gluster-devel/attachments/20160409/169a9604/attachment.html>
More information about the Gluster-devel
mailing list