[Gluster-users] fix-layout stalls with xattr errors
Dan Bretherton
d.a.bretherton at reading.ac.uk
Thu Dec 29 00:35:37 UTC 2011
Hello All-
I am having problems with rebalance ... fix-layout in version 3.2.5. I
extended a volume with add-brick but the fix-layout stalls after a small
number of layout fixes and does not make any more progress. I have
tried the operation twice on different servers with the same result.
The following errors are found in the fuse mount log file on the
server carrying out the operation.
[2011-12-28 21:38:14.840013] I
[afr-common.c:1038:afr_launch_self_heal] 0-nemo2-replicate-4:
background data self-heal triggered. path:
/users/hzu/DATA/ERAINT/ORCA025/2010/snow_ERAINT_2010.nc
[2011-12-28 21:38:15.93079] E
[client3_1-fops.c:1498:client3_1_fxattrop_cbk] 0-nemo2-client-8:
remote operation failed: Operation not supported
[2011-12-28 21:38:15.93141] E
[client3_1-fops.c:1498:client3_1_fxattrop_cbk] 0-nemo2-client-9:
remote operation failed: Operation not supported
[2011-12-28 21:38:15.93385] I
[client3_1-fops.c:1187:client3_1_fstat_cbk] 0-nemo2-client-8: remote
operation failed: Operation not supported
[2011-12-28 21:38:15.93521] I
[client3_1-fops.c:1187:client3_1_fstat_cbk] 0-nemo2-client-9: remote
operation failed: Operation not supported
The file in the error message is a link, and it is not broken as seen
from the volume mount point or the bricks.
There are some worrying error messages in the brick log files for
nemo2-client-8 and nemo2-client-9. Here are some exerpts from the
nemo2-client-8 log, which is similar to the 0-nemo2-client-9 log.
[2011-12-28 21:23:05.827877] W [posix.c:3928:do_xattrop]
0-nemo2-posix: Extended attributes not supported by filesystem
[2011-12-28 21:23:05.827932] I
[server3_1-fops.c:1705:server_fxattrop_cbk] 0-nemo2-server: 8438:
FXATTROP 0 (-2111276040) ==> -1 (Operation not support
ed)
[2011-12-28 21:23:05.828848] E [posix.c:4200:posix_fstat]
0-nemo2-posix: fstat failed on fd=0x2aaaac703804: Operation not
supported
[2011-12-28 21:23:05.828879] I
[server3_1-fops.c:1113:server_fstat_cbk] 0-nemo2-server: 8439: FSTAT
0 (-2111276040) ==> -1 (Operation not supported)
[2011-12-28 21:29:29.871213] W
[socket.c:1494:__socket_proto_state_machine] 0-tcp.nemo2-server:
reading from socket failed. Error (Transport endpoint i
s not connected), peer (192.171.166.81:1003)
[2011-12-28 21:29:29.871305] I
[server-helpers.c:360:do_lock_table_cleanup] 0-nemo2-server: inodelk
released on /users/hzu/DATA/ERAINT/ORCA025/2010/sno
w_ERAINT_2010.nc
[2011-12-28 21:29:29.871345] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on
/users/hzu/DATA/ERAINT/ORCA025/2010/snow_ERAINT_2010.
nc
[2011-12-28 21:34:36.190023] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /
[2011-12-28 21:34:36.190055] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users
[2011-12-28 21:34:36.190086] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users/hzu
[2011-12-28 21:34:36.190102] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users/hzu/DATA
[2011-12-28 21:34:36.190135] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users/hzu/DATA/ERAINT
[2011-12-28 21:34:36.190154] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users/hzu/DATA/ERAINT/ORCA025
[2011-12-28 21:34:36.190171] I [server-helpers.c:485:do_fd_cleanup]
0-nemo2-server: fd cleanup on /users/hzu/DATA/ERAINT/ORCA025/2009
[2011-12-28 21:38:15.92433] I
[server3_1-fops.c:1705:server_fxattrop_cbk] 0-nemo2-server: 12228:
FXATTROP 7 (-2111276040) ==> -1 (Operation not supported)
[2011-12-28 21:38:15.92743] E [posix.c:4200:posix_fstat]
0-nemo2-posix: fstat failed on fd=0x2aaaac703804: Operation not
supported
[2011-12-28 21:38:15.92775] I
[server3_1-fops.c:1113:server_fstat_cbk] 0-nemo2-server: 12229:
FSTAT 7 (-2111276040) ==> -1 (Operation not supported)
The backend filesystems are ext4 and the are mounted with options
"acl,user_xattr". I tested extended attribute support (as suggested
here:
http://gluster.org/pipermail/gluster-users/2010-December/006257.html)
and could not find any problems, so I don't understand the "Extended
attributes not supported by filesystem" error. The only unusual thing
about the filesystems is the reduced number of filesystem features
enabled compared to other bricks. These are the ext4 features enabled.
has_journal ext_attr resize_inode dir_index filetype needs_recovery
sparse_super large_file
All the other bricks in the volume have these features plus extent,
flex_bg, huge_file, uninit_bg, dir_nlink and extra_isize. I don't know
if any of these missing ext4 features are part of the problem. Does
anybody know what's going on here?
Regards
Dan.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://supercolony.gluster.org/pipermail/gluster-users/attachments/20111229/a5cecfb0/attachment.html>
More information about the Gluster-users
mailing list