[Bugs] [Bug 1312817] New: Bonnie test failing on cifs mount with can't sync file error and logs showing split-brain errors.

bugzilla at redhat.com bugzilla at redhat.com
Mon Feb 29 10:26:13 UTC 2016


https://bugzilla.redhat.com/show_bug.cgi?id=1312817

            Bug ID: 1312817
           Summary: Bonnie test failing on cifs mount with can't sync file
                    error and logs showing split-brain errors.
           Product: GlusterFS
           Version: 3.7.8
         Component: replicate
          Assignee: bugs at gluster.org
          Reporter: sbhaloth at redhat.com
                CC: bugs at gluster.org



Description of problem:
**********************************

While running bonnie on a cifs mount, the write test fails with a "Can't sync
file" error, and the logs show split-brain errors.

No operations were performed directly on the backend. All bricks are up and running.

********************************************************

Version-Release number of selected component (if applicable):
glusterfs-libs-3.7.8-1.el7.x86_64
glusterfs-cli-3.7.8-1.el7.x86_64
glusterfs-3.7.8-1.el7.x86_64
glusterfs-fuse-3.7.8-1.el7.x86_64
glusterfs-server-3.7.8-1.el7.x86_64
glusterfs-geo-replication-3.7.8-1.el7.x86_64
glusterfs-api-3.7.8-1.el7.x86_64
samba-vfs-glusterfs-4.2.4-12.el7rhgs.x86_64
glusterfs-client-xlators-3.7.8-1.el7.x86_64
glusterfs-rdma-3.7.8-1.el7.x86_64


How reproducible:
Always

Steps to Reproduce:
1. Install upstream gluster version 3.7.8.
2. Set up samba and mount the volume on a cifs client.
3. Start the bonnie test tool.

Actual results:
Bonnie fails with a "Can't sync file" error.

Expected results:

Bonnie should pass.


Additional info:
***********************************************************
Volume Name: testvol
Type: Distributed-Replicate
Volume ID: b946f59a-2397-4aa6-b3f8-f2ea994f7e82
Status: Started
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: 10.70.47.140:/bricks/brick0/testvol_brick0
Brick2: 10.70.47.19:/bricks/brick0/testvol_brick1
Brick3: 10.70.47.140:/bricks/brick1/testvol_brick2
Brick4: 10.70.47.19:/bricks/brick1/testvol_brick3
Options Reconfigured:
storage.batch-fsync-delay-usec: 0
performance.stat-prefetch: off
server.allow-insecure: on
performance.readdir-ahead: on
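
For reference when reading the logs: in a Distributed-Replicate volume, consecutive bricks in the listed order form one replica set, and the sets are numbered from 0. The following is a minimal sketch of that grouping for this volume, assuming a replica count of 2 (from "2 x 2 = 4"); the function name replica_sets is hypothetical and not part of any gluster tooling.

```python
# Bricks in the order listed by the volume info output above.
BRICKS = [
    "10.70.47.140:/bricks/brick0/testvol_brick0",
    "10.70.47.19:/bricks/brick0/testvol_brick1",
    "10.70.47.140:/bricks/brick1/testvol_brick2",
    "10.70.47.19:/bricks/brick1/testvol_brick3",
]


def replica_sets(volume, bricks, replica_count):
    """Group consecutive bricks into sets named <volume>-replicate-N."""
    if len(bricks) % replica_count != 0:
        raise ValueError("brick count must be a multiple of the replica count")
    return {
        f"{volume}-replicate-{i // replica_count}": bricks[i:i + replica_count]
        for i in range(0, len(bricks), replica_count)
    }


if __name__ == "__main__":
    for name, members in replica_sets("testvol", BRICKS, 2).items():
        print(name, "->", ", ".join(members))
```

Under that assumption, testvol-replicate-0 is the pair Brick1 and Brick2, so those two bricks are the ones to inspect for the affected gfid.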


[2016-02-29 05:18:38.302291] E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing GETXATTR on gfid 22885084-a015-429f-9847-83bf971d77aa: split-brain observed. [Input/output error]
[2016-02-29 05:18:38.320807] E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing FSTAT on gfid 22885084-a015-429f-9847-83bf971d77aa: split-brain observed. [Input/output error]
[2016-02-29 05:18:38.327422] E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing GETXATTR on gfid 22885084-a015-429f-9847-83bf971d77aa: split-brain observed. [Input/output error]
[2016-02-29 05:18:38.335338] E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing FSTAT on gfid 22885084-a015-429f-9847-83bf971d77aa: split-brain observed. [Input/output error]
[2016-02-29 05:18:39.345415] E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing GETXATTR on gfid 22885084-a015-429f-9847-83bf971d77aa: split-brain observed. [Input/output error]
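
To see at a glance which file and which operations are affected, the entries above can be tallied per replica set, gfid and failing operation. This is a rough sketch, assuming each log entry sits on a single line; the regular expression is written against the five entries shown here and may need adjusting for other message formats. The names ENTRY_RE and summarize are hypothetical.

```python
import re
from collections import Counter

# Matches the AFR "split-brain observed" entries in the form shown above.
ENTRY_RE = re.compile(
    r"\[(?P<ts>[\d-]+ [\d:.]+)\] (?P<level>[A-Z]) \[MSGID: (?P<msgid>\d+)\] "
    r"\[(?P<src>[^\]]+)\] (?P<xlator>\S+): Failing (?P<fop>\w+) on gfid "
    r"(?P<gfid>[0-9a-f-]{36}): split-brain observed\."
)

PREFIX = "E [MSGID: 108008] [afr-read-txn.c:89:afr_read_txn_refresh_done] 0-testvol-replicate-0: Failing"
GFID = "22885084-a015-429f-9847-83bf971d77aa"
SUFFIX = "split-brain observed. [Input/output error]"

# The five entries from the report, rebuilt one per line.
SAMPLE = "\n".join(
    f"[{ts}] {PREFIX} {fop} on gfid {GFID}: {SUFFIX}"
    for ts, fop in [
        ("2016-02-29 05:18:38.302291", "GETXATTR"),
        ("2016-02-29 05:18:38.320807", "FSTAT"),
        ("2016-02-29 05:18:38.327422", "GETXATTR"),
        ("2016-02-29 05:18:38.335338", "FSTAT"),
        ("2016-02-29 05:18:39.345415", "GETXATTR"),
    ]
)


def summarize(text):
    """Count split-brain failures per (xlator, gfid, operation)."""
    counts = Counter()
    for match in ENTRY_RE.finditer(text):
        counts[(match["xlator"], match["gfid"], match["fop"])] += 1
    return counts


if __name__ == "__main__":
    for (xlator, gfid, fop), n in sorted(summarize(SAMPLE).items()):
        print(f"{xlator} {gfid} {fop}: {n}")
```

All five entries refer to the same gfid on 0-testvol-replicate-0, which suggests a single file is involved.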


gluster vol status
Status of volume: testvol
Gluster process                                       TCP Port  RDMA Port  Online  Pid
----------------------------------------------------------------------------------------
Brick 10.70.47.140:/bricks/brick0/testvol_brick0      49176     0          Y       27357
Brick 10.70.47.19:/bricks/brick0/testvol_brick1       49176     0          Y       15047
Brick 10.70.47.140:/bricks/brick1/testvol_brick2      49177     0          Y       27376
Brick 10.70.47.19:/bricks/brick1/testvol_brick3       49177     0          Y       15066
NFS Server on localhost                               2049      0          Y       27399
Self-heal Daemon on localhost                         N/A       N/A        Y       27404
NFS Server on dhcp47-19.lab.eng.blr.redhat.com        2049      0          Y       15088
Self-heal Daemon on dhcp47-19.lab.eng.blr.redhat.com  N/A       N/A        Y       15094

Task Status of Volume testvol


Changing to the specified mountpoint
/mnt/cifs/run28894
executing bonnie
Using uid:0, gid:0.
Writing a byte at a time...done
Writing intelligently...Can't sync file.
done
Can't open file ./Bonnie.28924

real    9m34.342s
user    0m0.715s
sys    0m20.983s
bonnie failed
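
The "Can't sync file" message appears right after the block-write phase, which most likely means an fsync() call on the test file returned an error; that mapping is an assumption and is not confirmed by the output above. A small probe along these lines could help check whether a plain write followed by fsync fails on the mount independently of bonnie. The function name write_and_sync, the file name and the sizes are all hypothetical choices.

```python
import os
import sys
import tempfile


def write_and_sync(path, total_bytes=1 << 20, chunk_size=8192):
    """Write total_bytes to path in chunks, then fsync.

    Returns None on success, or the OSError raised by write/fsync.
    """
    chunk = b"\0" * chunk_size
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        written = 0
        while written < total_bytes:
            # os.write may write fewer bytes than requested.
            written += os.write(fd, chunk[: total_bytes - written])
        os.fsync(fd)
    except OSError as err:
        return err
    finally:
        os.close(fd)
    return None


if __name__ == "__main__":
    # Pass a directory on the cifs mount as the first argument;
    # falls back to the local temp directory otherwise.
    target_dir = sys.argv[1] if len(sys.argv) > 1 else tempfile.gettempdir()
    probe = os.path.join(target_dir, "fsync_probe.tmp")
    error = write_and_sync(probe)
    if error is None:
        print("write + fsync ok")
    else:
        print(f"failed: errno={error.errno} ({error.strerror})")
    if os.path.exists(probe):
        os.remove(probe)
```

If the probe reports an Input/output error on the cifs mount but succeeds locally, that would be consistent with the split-brain errors in the gluster logs surfacing through samba.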


