[Gluster-users] Input/output error

Pranith Kumar K pranithk at gluster.com
Wed Aug 24 09:44:34 UTC 2011


Hi siga hiro,
      Can you provide the output of the following commands from both machines?

getfattr -d -m . /home/syncdata
getfattr -d -m . /home/syncdata/testdata

Pranith
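
A minimal sketch for comparing the two outputs, not part of the original exchange. The warning quoted in this thread is "gfid different on subvolume", so the attribute of interest is trusted.gfid. The sketch assumes the commands were run as root (trusted.* attributes are not visible to ordinary users), that getfattr's -e hex option was added so the values are printed in hex, and that each server's output was saved to a file. The function names are hypothetical.

```shell
# Print the trusted.gfid value found in a saved getfattr dump
# (assumed to come from: getfattr -d -m . -e hex <brick path>).
gfid_of() {
    sed -n 's/^trusted\.gfid=//p' "$1"
}

# Compare the gfid recorded for the same path on two bricks.
# Prints the result and returns 0 only when both dumps hold the same gfid.
compare_gfid() {
    a=$(gfid_of "$1")
    b=$(gfid_of "$2")
    if [ -n "$a" ] && [ "$a" = "$b" ]; then
        echo "gfid matches: $a"
    else
        echo "gfid differs: ${a:-<none>} vs ${b:-<none>}"
        return 1
    fi
}
```

For example, with the output for /home/syncdata/testdata saved as testdata.server1 and testdata.server2 (hypothetical file names), "compare_gfid testdata.server1 testdata.server2" reports whether the two bricks agree. A mismatch would be consistent with the warning in the log.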

On 08/24/2011 02:11 PM, siga hiro wrote:
> Thank you for the quick answer.
>
>> 1) http://bugs.gluster.com/show_bug.cgi?id=2921 (most likely this)
> Isn't this solved in GlusterFS 3.2.3?
>
> I have installed GlusterFS 3.2.3 on 172.23.0.2
> (downloaded from http://download.gluster.com/pub/gluster/glusterfs/LATEST/CentOS/),
> and I confirmed that the md5sums match on 172.23.0.1 and 172.23.0.2.
> # md5sum *
> 8012eaf68e8ee8153d1b4f317dea385d  error_log.txt
> 88f70311135f82578a69866bce0564ba  error.log
>
> mount 172.23.0.2
>    ->  mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.2:/syncdata /syncdata
>
> But...
> [root@172.23.0.2 /]# ls -al /syncdata/testdata/
> ls: reading directory /syncdata/testdata/: Input/output error
>
> /var/log/glusterfs/nfs.log
> [2011-08-24 17:06:14.447688] I [rpc-clnt.c:1531:rpc_clnt_reconfig]
> 0-syncdata-client-0: changing port to 24009 (from 0)
> [2011-08-24 17:06:17.453688] I
> [client-handshake.c:1082:select_server_supported_programs]0-syncdata-client-1:
> Using Program GlusterFS-3.1.0, Num (1298437), Version (310)
> [2011-08-24 17:06:17.456448] I
> [client-handshake.c:913:client_setvolume_cbk] 0-syncdata-client-1:
> Connected to 172.23.11.121:24009, attached to remote volume
> '/home/syncdata'.
> [2011-08-24 17:06:17.456517] I [afr-common.c:2611:afr_notify]
> 0-syncdata-replicate-0: Subvolume 'syncdata-client-1' came back up;
> going online.
> [2011-08-24 17:06:17.456957] I
> [client-handshake.c:1082:select_server_supported_programs]0-syncdata-client-0:
> Using Program GlusterFS-3.1.0, Num (1298437), Version (310)
> [2011-08-24 17:06:17.457937] I
> [client-handshake.c:913:client_setvolume_cbk] 0-syncdata-client-0:
> Connected to 172.23.3.4:24009, attached to remote volume
> '/home/syncdata'.
> [2011-08-24 17:06:17.458478] I [afr-common.c:912:afr_fresh_lookup_cbk]
> 0-syncdata-replicate-0: added root inode
> [2011-08-24 17:06:52.479588] W
> [afr-common.c:656:afr_lookup_self_heal_check] 0-syncdata-replicate-0:
> /fastask: gfid different on subvolume
> [2011-08-24 17:06:52.480560] I
> [client3_1-fops.c:411:client3_1_stat_cbk] 0-syncdata-client-0: remote
> operation failed: No such file or directory
> [2011-08-24 17:06:52.481555] I
> [client3_1-fops.c:1099:client3_1_access_cbk] 0-syncdata-client-0:
> remote operation failed: No such file or directory
> [2011-08-24 17:06:52.482554] I
> [client3_1-fops.c:2132:client3_1_opendir_cbk] 0-syncdata-client-0:
> remote operation failed: No such file or directory
> [2011-08-24 17:06:52.482577] W
> [client3_1-fops.c:5136:client3_1_readdir] 0-syncdata-client-0:
> (689897478): failed to get fd ctx. EBADFD
> [2011-08-24 17:06:52.482592] W
> [client3_1-fops.c:5201:client3_1_readdir] 0-syncdata-client-0: failed
> to send the fop: File descriptor in bad state
> [2011-08-24 17:06:52.482608] I
> [afr-dir-read.c:120:afr_examine_dir_readdir_cbk]
> 0-syncdata-replicate-0: /fastask: failed to do opendir on
> syncdata-client-0
> [2011-08-24 17:06:52.482811] I
> [afr-dir-read.c:174:afr_examine_dir_readdir_cbk]
> 0-syncdata-replicate-0:  entry self-heal triggered. path: /fastask,
> reason: checksums of directory differ, forced merge option set
> [2011-08-24 17:06:52.483553] I
> [client3_1-fops.c:1303:client3_1_entrylk_cbk] 0-syncdata-client-0:
> remote operation failed: No such file or directory
> [2011-08-24 17:06:52.483642] E
> [afr-self-heal-entry.c:2292:afr_sh_post_nonblocking_entry_cbk]
> 0-syncdata-replicate-0: Non Blocking entrylks failed for /fastask.
> [2011-08-24 17:06:52.483839] W [afr-common.c:122:afr_set_split_brain]
> (-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_sh_post_nonblocking_entry_cbk+0xf5)
> [0x2aaaaad137f5]
> (-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_sh_entry_done+0x46)
> [0x2aaaaad13646]
> (-->/opt/glusterfs/3.2.3/lib64/glusterfs/3.2.3/xlator/cluster/replicate.so(afr_self_heal_completion_cbk+0x246)
> [0x2aaaaad0cac6]))) 0-syncdata-replicate-0: invalid argument: inode
> [2011-08-24 17:06:52.483864] E
> [afr-self-heal-common.c:1554:afr_self_heal_completion_cbk]
> 0-syncdata-replicate-0: background  entry entry self-heal failed on
> /fastask
> [2011-08-24 17:06:52.483898] W
> [client3_1-fops.c:5253:client3_1_readdirp] 0-syncdata-client-0:
> (689897478): failed to get fd ctx. EBADFD
> [2011-08-24 17:06:52.483913] W
> [client3_1-fops.c:5317:client3_1_readdirp] 0-syncdata-client-0: failed
> to send the fop: File descriptor in bad state
>
> thanks.
>
>> hi siga hiro,
>>     I see the following warning:
>> [2011-08-24 11:36:04.695145] W [afr-common.c:656:afr_lookup_self_heal_check]
>> 0-syncdata-replicate-0: /testdata: gfid different on subvolume
>>
>> I also see that you have more than one mount on the volume. Most probably
>> you are running into one of the following bugs:
>> 1) http://bugs.gluster.com/show_bug.cgi?id=2921 (most likely this)
>> 2) http://bugs.gluster.com/show_bug.cgi?id=2745
>>
>> If it is not bug 2745, you can confirm that it is bug 2921 by checking that
>> the md5sums of the files match on both machines (172.23.0.1 and 172.23.0.2).
>>
>> pranith.
>>
>> On 08/24/2011 11:48 AM, siga hiro wrote:
>>
>> Hi, everyone.
>> It's nice to meet you.
>> I am poor at English....
>>
>> I am writing because I'd like to update GlusterFS to 3.2.2-1 and
>> change from a gluster mount to an NFS mount.
>>
>> I installed GlusterFS 3.2.1 one week ago, with replication across 2 servers.
>>
>> OS:CentOS5.5 64bit
>> RPM:glusterfs-core-3.2.1-1
>>      glusterfs-fuse-3.2.1-1
>>
>> Volume creation command:
>>   gluster volume create syncdata replica 2 transport tcp 172.23.0.1:/home/syncdata 172.23.0.2:/home/syncdata
>>
>> Mount commands:
>>   172.23.0.1 ->  mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.1:/syncdata /syncdata
>>   172.23.0.2 ->  mount -t glusterfs -o tcp,soft,timeo=3 172.23.0.2:/syncdata /syncdata
>>
>> Yesterday I updated GlusterFS to 3.2.2-1 and switched to an NFS mount:
>>   172.23.0.2 ->  mount -t nfs -o nolock,nfsvers=3,tcp,hard,intr 172.23.0.2:/syncdata /syncdata
>>
>> [root@172.23.0.2 /]# ls -al /syncdata/testdata/
>> ls: reading directory /syncdata/testdata/: Input/output error
>>
>> /var/log/glusterfs/nfs.log
>> [2011-08-24 11:35:16.319379] I
>> [client-handshake.c:1082:select_server_supported_programs]
>> 0-syncdata-client-1: Using Program GlusterFS-3.1.0, Num (1298437), Version
>> (310)
>> [2011-08-24 11:35:16.322126] I [client-handshake.c:913:client_setvolume_cbk]
>> 0-syncdata-client-1: Connected to 172.23.0.2:24009, attached to remote
>> volume '/home/syncdata'.
>> [2011-08-24 11:35:16.322191] I [afr-common.c:2611:afr_notify]
>> 0-syncdata-replicate-0: Subvolume 'syncdata-client-1' came back up; going
>> online.
>> [2011-08-24 11:35:16.323281] I
>> [client-handshake.c:1082:select_server_supported_programs]
>> 0-syncdata-client-0: Using Program GlusterFS-3.1.0, Num (1298437), Version
>> (310)
>> [2011-08-24 11:35:16.324274] I [client-handshake.c:913:client_setvolume_cbk]
>> 0-syncdata-client-0: Connected to 172.23.0.1:24009, attached to remote
>> volume '/home/syncdata'.
>> [2011-08-24 11:35:16.324801] I [afr-common.c:912:afr_fresh_lookup_cbk]
>> 0-syncdata-replicate-0: added root inode
>> [2011-08-24 11:36:04.695145] W [afr-common.c:656:afr_lookup_self_heal_check]
>> 0-syncdata-replicate-0: /testdata: gfid different on subvolume
>> [2011-08-24 11:36:04.696121] I [client3_1-fops.c:411:client3_1_stat_cbk]
>> 0-syncdata-client-0: remote operation failed: No such file or directory
>> [2011-08-24 11:36:04.697121] I [client3_1-fops.c:1099:client3_1_access_cbk]
>> 0-syncdata-client-0: remote operation failed: No such file or directory
>> [2011-08-24 11:36:04.698118] I [client3_1-fops.c:2132:client3_1_opendir_cbk]
>> 0-syncdata-client-0: remote operation failed: No such file or directory
>> [2011-08-24 11:36:04.698140] W [client3_1-fops.c:5136:client3_1_readdir]
>> 0-syncdata-client-0: (689897478): failed to get fd ctx. EBADFD
>> [2011-08-24 11:36:04.698155] W [client3_1-fops.c:5201:client3_1_readdir]
>> 0-syncdata-client-0: failed to send the fop: File descriptor in bad state
>> [2011-08-24 11:36:04.698168] I
>> [afr-dir-read.c:120:afr_examine_dir_readdir_cbk] 0-syncdata-replicate-0:
>> /fastask: failed to do opendir on syncdata-client-0
>>
>> # gluster volume info all
>>
>> Volume Name: syncdata
>> Type: Replicate
>> Status: Started
>> Number of Bricks: 2
>> Transport-type: tcp
>> Bricks:
>> Brick1: 172.23.0.1:/home/syncdata
>> Brick2: 172.23.0.2:/home/syncdata
>>
>>
>> Once the 172.23.0.2 server is working normally again, I want to do the
>> same work on the 172.23.0.1 server.
>>
>> Any ideas?
>>
>> _______________________________________________
>> Gluster-users mailing list
>> Gluster-users at gluster.org
>> http://gluster.org/cgi-bin/mailman/listinfo/gluster-users
>>
>>