[Gluster-users] Input/output error

siga hiro hirokisiga at gmail.com
Wed Aug 24 08:41:32 UTC 2011


Thank you for the quick answer.

> 1) http://bugs.gluster.com/show_bug.cgi?id=2921 (most likely this)
Isn't this solved in GlusterFS 3.2.3?

I have installed GlusterFS 3.2.3 in 172.23.0.2.
(get from http://download.gluster.com/pub/gluster/glusterfs/LATEST/CentOS/)
And It confirmed that md5sum corresponded with 172.23.0.1 and 172.23.0.2.
# md5sum *
8012eaf68e8ee8153d1b4f317dea385d  error_log.txt
88f70311135f82578a69866bce0564ba  error.log

mount 172.23.0.2
  -> mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.2:/syncdata /syncdata

But...
[root at 172.23.0.2 /]# ls -al /syncdata/testdata/
ls: reading directory /syncdata/testdata/: Input/output error

/var/log/glusterfs/nfs.log
[2011-08-24 17:06:14.447688] I [rpc-clnt.c:1531:rpc_clnt_reconfig]
0-syncdata-client-0: changing port to 24009 (from 0)
[2011-08-24 17:06:17.453688] I
[client-handshake.c:1082:select_server_supported_programs]0-syncdata-client-1:
Using Program GlusterFS-3.1.0, Num (1298437), Version (310)
[2011-08-24 17:06:17.456448] I
[client-handshake.c:913:client_setvolume_cbk] 0-syncdata-client-1:
Connected to 172.23.11.121:24009, attached to remote volume
'/home/syncdata'.
[2011-08-24 17:06:17.456517] I [afr-common.c:2611:afr_notify]
0-syncdata-replicate-0: Subvolume 'syncdata-client-1' came back up;
going online.
[2011-08-24 17:06:17.456957] I
[client-handshake.c:1082:select_server_supported_programs]0-syncdata-client-0:
Using Program GlusterFS-3.1.0, Num (1298437), Version (310)
[2011-08-24 17:06:17.457937] I
[client-handshake.c:913:client_setvolume_cbk] 0-syncdata-client-0:
Connected to 172.23.3.4:24009, attached to remote volume
'/home/syncdata'.
[2011-08-24 17:06:17.458478] I [afr-common.c:912:afr_fresh_lookup_cbk]
0-syncdata-replicate-0: added root inode
[2011-08-24 17:06:52.479588] W
[afr-common.c:656:afr_lookup_self_heal_check] 0-syncdata-replicate-0:
/fastask: gfid different on subvolume
[2011-08-24 17:06:52.480560] I
[client3_1-fops.c:411:client3_1_stat_cbk] 0-syncdata-client-0: remote
operation failed: No such file or directory
[2011-08-24 17:06:52.481555] I
[client3_1-fops.c:1099:client3_1_access_cbk] 0-syncdata-client-0:
remote operation failed: No such file or directory
[2011-08-24 17:06:52.482554] I
[client3_1-fops.c:2132:client3_1_opendir_cbk] 0-syncdata-client-0:
remote operation failed: No such file or directory
[2011-08-24 17:06:52.482577] W
[client3_1-fops.c:5136:client3_1_readdir] 0-syncdata-client-0:
(689897478): failed to get fd ctx. EBADFD
[2011-08-24 17:06:52.482592] W
[client3_1-fops.c:5201:client3_1_readdir] 0-syncdata-client-0: failed
to send the fop: File descriptor in bad state
[2011-08-24 17:06:52.482608] I
[afr-dir-read.c:120:afr_examine_dir_readdir_cbk]
0-syncdata-replicate-0: /fastask: failed to do opendir on
syncdata-client-0
[2011-08-24 17:06:52.482811] I
[afr-dir-read.c:174:afr_examine_dir_readdir_cbk]
0-syncdata-replicate-0:  entry self-heal triggered. path: /fastask,
reason: checksums of directory differ, forced merge option set
[2011-08-24 17:06:52.483553] I
[client3_1-fops.c:1303:client3_1_entrylk_cbk] 0-syncdata-client-0:
remote operation failed: No such file or directory
[2011-08-24 17:06:52.483642] E
[afr-self-heal-entry.c:2292:afr_sh_post_nonblocking_entry_cbk]
0-syncdata-replicate-0: Non Blocking entrylks failed for /fastask.
[2011-08-24 17:06:52.483839] W [afr-common.c:122:afr_set_split_brain]
(-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_sh_post_nonblocking_entry_cbk+0xf5)
[0x2aaaaad137f5]
(-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_sh_entry_done+0x46)
[0x2aaaaad13646]
(-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_self_heal_completion_cbk+0x246)
[0x2aaaaad0cac6]))) 0-syncdata-replicate-0: invalid argument: inode
[2011-08-24 17:06:52.483864] E
[afr-self-heal-common.c:1554:afr_self_heal_completion_cbk]
0-syncdata-replicate-0: background  entry entry self-heal failed on
/fastask
[2011-08-24 17:06:52.483898] W
[client3_1-fops.c:5253:client3_1_readdirp] 0-syncdata-client-0:
(689897478): failed to get fd ctx. EBADFD
[2011-08-24 17:06:52.483913] W
[client3_1-fops.c:5317:client3_1_readdirp] 0-syncdata-client-0: failed
to send the fop: File descriptor in bad state

thanks.

> hi siga hiro,
>    I see the following warning:
> [2011-08-24 11:36:04.695145] W [afr-common.c:656:afr_lookup_self_heal_check]
> 0-syncdata-replicate-0: /testdata: gfid different on subvolume
>
> I also see that you have more than one mount on the volume. Most probably
> you are running into one of the following bugs:
> 1) http://bugs.gluster.com/show_bug.cgi?id=2921 (most likely this)
> 2) http://bugs.gluster.com/show_bug.cgi?id=2745
>
> If it is not the bug 2745, you can confirm it is the bug 2921 if the md5sums
> on the files match on both the machines 172.23.0.1, 172.23.0.2
>
> pranith.
>
> On 08/24/2011 11:48 AM, siga hiro wrote:
>
> Hi, everyone.
> Its nice meeting you.
> I am poor at English....
>
> I am writing this because I'd like to update GlusterFS to 3.2.2-1,and I want
> to change from gluster mount to nfs mount.
>
> I have installed GlusterFS 3.2.1 one week ago,and replication 2 server.
>
> OS:CentOS5.5 64bit
> RPM:glusterfs-core-3.2.1-1
>     glusterfs-fuse-3.2.1-1
>
> command
>  gluster volume create syncdata replica 2  transport tcp
> 172.23.0.1:/home/syncdata 172.23.0.2:/home/syncdata
>
> mount command
>  172.23.0.1 -> mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.1:/syncdata
> /syncdata
>  172.23.0.2 -> mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.2:/syncdata
> /syncdata
>
> So,Yesterday I update GlusterFS to 3.2.2-1 and use nfs mount.
>  172.23.0.2 -> mount -t nfs  -o nolock,nfsvers=3,tcp,hard,intr
> 172.23.0.2:/syncdata /syncdata
>
> [root at 172.23.0.2 /]# ls -al /syncdata/testdata/
> ls: reading directory /syncdata/testdata/: Input/output error
>
> /var/log/glusterfs/nfs.log
> [2011-08-24 11:35:16.319379] I
> [client-handshake.c:1082:select_server_supported_programs]
> 0-syncdata-client-1: Using Program GlusterFS-3.1.0, Num (1298437), Version
> (310)
> [2011-08-24 11:35:16.322126] I [client-handshake.c:913:client_setvolume_cbk]
> 0-syncdata-client-1: Connected to 172.23.0.2:24009, attached to remote
> volume '/home/syncdata'.
> [2011-08-24 11:35:16.322191] I [afr-common.c:2611:afr_notify]
> 0-syncdata-replicate-0: Subvolume 'syncdata-client-1' came back up; going
> online.
> [2011-08-24 11:35:16.323281] I
> [client-handshake.c:1082:select_server_supported_programs]
> 0-syncdata-client-0: Using Program GlusterFS-3.1.0, Num (1298437), Version
> (310)
> [2011-08-24 11:35:16.324274] I [client-handshake.c:913:client_setvolume_cbk]
> 0-syncdata-client-0: Connected to 172.23.0.1:24009, attached to remote
> volume '/home/syncdata'.
> [2011-08-24 11:35:16.324801] I [afr-common.c:912:afr_fresh_lookup_cbk]
> 0-syncdata-replicate-0: added root inode
> [2011-08-24 11:36:04.695145] W [afr-common.c:656:afr_lookup_self_heal_check]
> 0-syncdata-replicate-0: /testdata: gfid different on subvolume
> [2011-08-24 11:36:04.696121] I [client3_1-fops.c:411:client3_1_stat_cbk]
> 0-syncdata-client-0: remote operation failed: No such file or directory
> [2011-08-24 11:36:04.697121] I [client3_1-fops.c:1099:client3_1_access_cbk]
> 0-syncdata-client-0: remote operation failed: No such file or directory
> [2011-08-24 11:36:04.698118] I [client3_1-fops.c:2132:client3_1_opendir_cbk]
> 0-syncdata-client-0: remote operation failed: No such file or directory
> [2011-08-24 11:36:04.698140] W [client3_1-fops.c:5136:client3_1_readdir]
> 0-syncdata-client-0: (689897478): failed to get fd ctx. EBADFD
> [2011-08-24 11:36:04.698155] W [client3_1-fops.c:5201:client3_1_readdir]
> 0-syncdata-client-0: failed to send the fop: File descriptor in bad state
> [2011-08-24 11:36:04.698168] I
> [afr-dir-read.c:120:afr_examine_dir_readdir_cbk] 0-syncdata-replicate-0:
> /fastask: failed to do opendir on syncdata-client-0
>
> # gluster volume info all
>
> Volume Name: syncdata
> Type: Replicate
> Status: Started
> Number of Bricks: 2
> Transport-type: tcp
> Bricks:
> Brick1: 172.23.0.1:/home/syncdata
> Brick2: 172.23.0.2:/home/syncdata
>
>
> After an 172.23.0.2 server is made to work as usual, I want to do the work
> of the 172.23.0.1 server.
>
> Any ideas?
>
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://gluster.org/cgi-bin/mailman/listinfo/gluster-users
>
>



More information about the Gluster-users mailing list