<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Tue, May 2, 2017 at 2:36 AM, Jeff Darcy <span dir="ltr"><<a href="mailto:jeff@pl.atyp.us" target="_blank">jeff@pl.atyp.us</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Since the vast majority of our tests run without multiplexing, I'm going<br>
to start regular runs of all tests with multiplexing turned on.<br>
You can see the patch here:<br>
<br>
<a href="https://review.gluster.org/#/c/17145/" rel="noreferrer" target="_blank">https://review.gluster.org/#/<wbr>c/17145/</a><br>
<br>
There are currently two tests that fail with multiplexing. Note that<br>
these are all tests that passed as of when multiplexing was introduced.<br>
I don't know about these specific tests, but most tests had passed with<br>
multiplexing turned on *many times* - sometimes literally over a hundred,<br>
because I did more runs than that during development. These are tests<br>
that have been broken since then, because without regular tests the<br>
people making changes could not have known how their changes interact<br>
with multiplexing.<br>
<br>
19:14:41<br>
./tests/bugs/glusterd/bug-<wbr>1367478-volume-start-<wbr>validation-after-glusterd-<wbr>restart.t<br>
..<br>
19:14:41 not ok 17 Got "0" instead of "1", LINENUM:37<br>
19:14:41 FAILED COMMAND: 1 brick_up_status_1 patchy1 127.1.1.2<br>
/d/backends/2/patchy12<br></blockquote><div><br></div><div>This is one of the problems we are trying to address through <a href="https://review.gluster.org/#/c/17101">https://review.gluster.org/#/c/17101</a>; this test was broken by <a href="https://review.gluster.org/16866">https://review.gluster.org/16866</a>.<br><br><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">20:52:10 ./tests/features/trash.t ..<br>
20:52:10 not ok 53 Got "2" instead of "1", LINENUM:221<br>
20:52:10 FAILED COMMAND: 1 online_brick_count<br>
20:52:10 ok 54, LINENUM:223<br>
20:52:10 ok 55, LINENUM:226<br>
20:52:10 not ok 56 Got "3" instead of "2", LINENUM:227<br>
20:52:10 FAILED COMMAND: 2 online_brick_count<br>
20:52:10 ok 57, LINENUM:228<br>
20:52:10 ok 58, LINENUM:233<br>
20:52:10 ok 59, LINENUM:236<br>
20:52:10 ok 60, LINENUM:237<br>
20:52:10 not ok 61 , LINENUM:238<br>
20:52:10 FAILED COMMAND: [ -e /mnt/glusterfs/0/abc -a ! -e<br>
/mnt/glusterfs/0/.trashcan ]<br></blockquote><div><br></div><div>IMO, this is not specific to brick-mux. The online_brick_count function has a flaw: it looks for the pids of all the processes instead of only the bricks. In this test one of the volumes was replicate, so shd was up and you'd see one additional pidfile. This was actually caught by Mohit while we were (and still are) working on patch 17101. The last failure still needs to be looked at.<br><br> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
Do we have any volunteers to look into these? I looked at the first one<br>
a bit and didn't find any obvious clues; I haven't looked at the second.<br>
</blockquote></div><br></div></div>