[Gluster-users] GlusterFS

Chris Jin chris at pikicentral.com
Fri Mar 26 06:14:24 UTC 2010


Hello,

Our project has suffered from file system failures, so we decided to
build a new file system on GlusterFS 3.0.3.

We have 3 servers on each coast (east and west) and want to keep the
data synchronized between the two locations. Important files should
be replicated twice and normal files replicated once.

We designed the structure as follows:

Replicate(west/east)
	Distribute West
		Replicate 1
			Server 1 Brick 1
			Server 2 Brick 1
		Replicate 2
			Server 2 Brick 2
			Server 3 Brick 2
		Replicate 3
			Server 3 Brick 3
			Server 1 Brick 3
	Distribute East
		Replicate 1
			Server 1 Brick 1
			Server 2 Brick 1
		Replicate 2
			Server 2 Brick 2
			Server 3 Brick 2
		Replicate 3
			Server 3 Brick 3
			Server 1 Brick 3
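
The pairing rule in the structure above can be written down as a short Python sketch. This is only an illustration of the intended layout, not something GlusterFS provides: the function names and the "Server N" / "Brick N" labels are assumptions. Replica pair k uses brick k on server k and on the next server in the ring.

```python
def ring_replica_pairs(servers):
    """Pair each server with its ring successor; pair k uses brick k on both."""
    pairs = []
    n = len(servers)
    for k in range(n):
        brick = f"Brick {k + 1}"
        first = servers[k]
        second = servers[(k + 1) % n]  # wrap around: the last server pairs with the first
        pairs.append(((first, brick), (second, brick)))
    return pairs


def describe_site(name, servers):
    """Return the indented outline lines for one coast."""
    lines = [f"Distribute {name}"]
    for k, pair in enumerate(ring_replica_pairs(servers), start=1):
        lines.append(f"\tReplicate {k}")
        for server, brick in pair:
            lines.append(f"\t\t{server} {brick}")
    return lines


def describe_layout(sites):
    """Return the whole outline: one top-level replicate over all coasts."""
    lines = ["Replicate(west/east)"]
    for name, servers in sites:
        lines.extend("\t" + line for line in describe_site(name, servers))
    return "\n".join(lines)


SERVERS = ["Server 1", "Server 2", "Server 3"]

if __name__ == "__main__":
    print(describe_layout([("West", SERVERS), ("East", SERVERS)]))
```

With this rule every server appears in exactly two replica pairs, so losing one server on a coast still leaves one copy of each pair on that coast.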

We have tested the structure on our local machines.
Sometimes it worked fine, but at other times the mount stopped responding.

My questions are:
1. Can this structure be implemented?
2. Is there a better solution for our case?
3. The sample vol files and debug output are listed below. Are these
problems fixable?
4. Can we mount a GlusterFS volume on a Gluster server and use the
mount point as a destination folder there?




The vol file for the servers

# Sample vol for server 1
#####################################################
volume posix1
	type storage/posix
	option directory /data1/vi1
end-volume

volume locks1
	type features/locks
	subvolumes posix1
end-volume

volume brick1
  type performance/io-threads
  option thread-count 8
  subvolumes locks1
end-volume

volume posix2
	type storage/posix
	option directory /data2/vi1
end-volume

volume locks2
	type features/locks
	subvolumes posix2
end-volume

volume brick2
  type performance/io-threads
  option thread-count 8
  subvolumes locks2
end-volume

volume posix3
	type storage/posix
	option directory /data3/vi1
end-volume

volume locks3
	type features/locks
	subvolumes posix3
end-volume

volume brick3
  type performance/io-threads
  option thread-count 8
  subvolumes locks3
end-volume

volume server
  type protocol/server
  option transport-type tcp/server
  option transport.socket.listen-port 6996
  option transport.socket.bind-address fs1
  subvolumes brick1 brick2 brick3
  option auth.addr.brick1.allow 192.168.0.*
  option auth.addr.brick2.allow 192.168.0.*
  option auth.addr.brick3.allow 192.168.0.*
end-volume


# Sample vol file for the east coast client
######################################################
volume remote1-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6996
  option remote-subvolume brick1
end-volume

volume remote1-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6997
  option remote-subvolume brick1
end-volume

volume remote1-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6998
  option remote-subvolume brick1
end-volume

volume remote2-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6996
  option remote-subvolume brick2
end-volume

volume remote2-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6997
  option remote-subvolume brick2
end-volume

volume remote2-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6998
  option remote-subvolume brick2
end-volume

volume remote3-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6996
  option remote-subvolume brick3
end-volume

volume remote3-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6997
  option remote-subvolume brick3
end-volume

volume remote3-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.195
  option remote-port 6998
  option remote-subvolume brick3
end-volume

volume r1
  type cluster/replicate
  subvolumes remote1-a remote1-b
end-volume

volume r2
  type cluster/replicate
  subvolumes remote2-b remote2-c
end-volume

volume r3
  type cluster/replicate
  subvolumes remote3-c remote3-a
end-volume

volume d-east
  type cluster/distribute
  subvolumes r1 r2 r3
end-volume

volume w-remote1-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6996
  option remote-subvolume brick1
end-volume

volume w-remote1-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6997
  option remote-subvolume brick1
end-volume

volume w-remote1-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6998
  option remote-subvolume brick1
end-volume

volume w-remote2-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6996
  option remote-subvolume brick2
end-volume

volume w-remote2-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6997
  option remote-subvolume brick2
end-volume

volume w-remote2-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6998
  option remote-subvolume brick2
end-volume

volume w-remote3-a
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6996
  option remote-subvolume brick3
end-volume

volume w-remote3-b
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6997
  option remote-subvolume brick3
end-volume

volume w-remote3-c
  type protocol/client
  option transport-type tcp
  option remote-host 192.168.0.194
  option remote-port 6998
  option remote-subvolume brick3
end-volume

volume w-r1
  type cluster/replicate
  subvolumes w-remote1-a w-remote1-b
end-volume

volume w-r2
  type cluster/replicate
  subvolumes w-remote2-b w-remote2-c
end-volume

volume w-r3
  type cluster/replicate
  subvolumes w-remote3-c w-remote3-a
end-volume

volume d-west
  type cluster/distribute
  subvolumes w-r1 w-r2 w-r3
end-volume

volume client1
  type cluster/replicate
  option read-subvolume d-east
  subvolumes d-east d-west
end-volume

volume trace
  type debug/trace
  option include open,close,create,readdir,opendir,closedir
  subvolumes client1
end-volume
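
Several client volumes in this file are defined but never listed in any "subvolumes" line (for example remote1-c). A minimal Python sketch for spotting them is shown here. It is not a GlusterFS tool: the function name and the shortened sample text are assumptions made for illustration, and it assumes the last volume defined in the file is the top of the graph.

```python
def find_dangling(volfile_text):
    """Return volumes that are defined but never used as a subvolume.

    Assumption: the last volume defined in the file is the top of the
    graph, so it is not reported even though nothing references it.
    """
    defined = []
    referenced = set()
    for raw in volfile_text.splitlines():
        words = raw.split("#", 1)[0].split()  # drop comments, then tokenize
        if not words:
            continue
        if words[0] == "volume" and len(words) == 2:
            defined.append(words[1])
        elif words[0] == "subvolumes":
            referenced.update(words[1:])
    top = defined[-1] if defined else None
    return [name for name in defined if name not in referenced and name != top]


# Shortened sample in the same shape as the client vol file.
SAMPLE = """
volume remote1-a
  type protocol/client
  option remote-subvolume brick1
end-volume

volume remote1-b
  type protocol/client
  option remote-subvolume brick1
end-volume

volume remote1-c
  type protocol/client
  option remote-subvolume brick1
end-volume

volume r1
  type cluster/replicate
  subvolumes remote1-a remote1-b
end-volume
"""

if __name__ == "__main__":
    print(find_dangling(SAMPLE))
```

Run against the full client vol file, a check like this should list the six client volumes that no replicate volume uses.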


Debug information



> +------------------------------------------------------------------------------+
> [2010-03-26 14:20:09] W [xlator.c:656:validate_xlator_volume_options] trace: option 'include' is deprecated, preferred is 'include-ops', continuing with correction
> [2010-03-26 14:20:09] C [dict.c:1142:data_to_str] dict: @data=(nil)
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] w-remote3-b: Volume is dangling. 
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] w-remote2-a: Volume is dangling. 
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] w-remote1-c: Volume is dangling. 
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] remote3-b: Volume is dangling. 
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] remote2-a: Volume is dangling. 
> [2010-03-26 14:20:09] W [client-protocol.c:6586:init] remote1-c: Volume is dangling. 
> [2010-03-26 14:20:09] N [glusterfsd.c:1396:main] glusterfs: Successfully started
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote1-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r1: Subvolume 'remote1-b' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote1-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r1: Subvolume 'remote1-b' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [fuse-bridge.c:2942:fuse_init] glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.13 kernel 7.10
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote2-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r2: Subvolume 'remote2-b' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote2-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r2: Subvolume 'remote2-b' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote2-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote2-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote3-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r3: Subvolume 'remote3-c' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote3-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] r3: Subvolume 'remote3-c' came back up; going online.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r1: Subvolume 'w-remote1-a' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote1-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote1-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote3-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] remote3-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r1: Subvolume 'w-remote1-a' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r3: Subvolume 'w-remote3-a' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r3: Subvolume 'w-remote3-a' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r2: Subvolume 'w-remote2-c' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [afr.c:2627:notify] w-r2: Subvolume 'w-remote2-c' came back up; going online.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick3'.
> [2010-03-26 14:20:09] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick3'.
> [2010-03-26 14:25:09] N [trace.c:1769:trace_opendir] trace: 830:( loc {path=/, ino=1}, fd=0xd1eb80)
> [2010-03-26 14:25:09] N [trace.c:808:trace_opendir_cbk] trace: 830: (op_ret=0, op_errno=117, fd=0xd1eb80)
> [2010-03-26 14:25:11] E [afr-self-heal-algorithm.c:766:sh_diff_checksum_cbk] client1: checksum on /sample-file01 failed on subvolume d-east (File descriptor in bad state)
> [2010-03-26 14:25:11] E [afr-self-heal-algorithm.c:766:sh_diff_checksum_cbk] client1: checksum on /sample-file01 failed on subvolume d-west (File descriptor in bad state)
> pending frames:
> frame : type(1) op(STATFS)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> frame : type(1) op(READDIRP)
> patchset: v3.0.2-41-g029062c
> signal received: 11
> time of crash: 2010-03-26 14:25:11
> configuration details:
> argp 1
> backtrace 1
> dlfcn 1
> fdatasync 1
> libpthread 1
> llistxattr 1
> setfsid 1
> spinlock 1
> epoll.h 1
> xattr.h 1
> st_atim.tv_nsec 1
> package-string: glusterfs 3.0.3
> /lib/libc.so.6[0x7fb9051ee040]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so[0x7fb9045860dc]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so(afr_sh_algo_diff+0xea)[0x7fb90458675a]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so(afr_sh_data_sync_prepare+0xdf)[0x7fb904579b6f]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so(afr_sh_data_fix+0x1d4)[0x7fb904579e14]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so(afr_sh_data_fstat_cbk+0xa2)[0x7fb90457a082]
> /lib/glusterfs/3.0.3/xlator/cluster/distribute.so(dht_attr_cbk+0xd7)[0x7fb90433fdf7]
> /lib/glusterfs/3.0.3/xlator/cluster/replicate.so(afr_fstat_cbk+0xb4)[0x7fb90456c824]
> /lib/glusterfs/3.0.3/xlator/protocol/client.so(client_fstat_cbk+0x177)[0x7fb9047b1977]
> /lib/glusterfs/3.0.3/xlator/protocol/client.so(protocol_client_pollin+0xca)[0x7fb90479e1da]
> /lib/glusterfs/3.0.3/xlator/protocol/client.so(notify+0xe0)[0x7fb9047ade70]
> /lib/libglusterfs.so.0(xlator_notify+0x43)[0x7fb9059628f3]
> /lib/glusterfs/3.0.3/transport/socket.so(socket_event_handler+0xe0)[0x7fb903505720]
> /lib/libglusterfs.so.0[0x7fb90597d4cf]
> /sbin/glusterfs(main+0x90d)[0x40465d]
> /lib/libc.so.6(__libc_start_main+0xe6)[0x7fb9051d95a6]
> /sbin/glusterfs[0x402ae9]
> ---------
> ================================================================================




> [2010-03-26 14:47:32] W [xlator.c:656:validate_xlator_volume_options] trace: option 'include' is deprecated, preferred is 'include-ops', continuing with correction
> [2010-03-26 14:47:32] C [dict.c:1142:data_to_str] dict: @data=(nil)
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] w-remote3-b: Volume is dangling. 
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] w-remote2-a: Volume is dangling. 
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] w-remote1-c: Volume is dangling. 
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] remote3-b: Volume is dangling. 
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] remote2-a: Volume is dangling. 
> [2010-03-26 14:47:32] W [client-protocol.c:6586:init] remote1-c: Volume is dangling. 
> [2010-03-26 14:47:32] N [glusterfsd.c:1396:main] glusterfs: Successfully started
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote1-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] r1: Subvolume 'remote1-a' came back up; going online.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote1-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [fuse-bridge.c:2942:fuse_init] glusterfs-fuse: FUSE inited with protocol versions: glusterfs 7.13 kernel 7.10
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote1-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote1-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote2-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] r2: Subvolume 'remote2-b' came back up; going online.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote2-b: Connected to 192.168.0.195:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] r2: Subvolume 'remote2-b' came back up; going online.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote2-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote2-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote3-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] r3: Subvolume 'remote3-c' came back up; going online.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote3-c: Connected to 192.168.0.195:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] r3: Subvolume 'remote3-c' came back up; going online.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] client1: Subvolume 'd-east' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote3-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] remote3-a: Connected to 192.168.0.195:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r2: Subvolume 'w-remote2-c' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r2: Subvolume 'w-remote2-c' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r3: Subvolume 'w-remote3-c' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r1: Subvolume 'w-remote1-b' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-c: Connected to 192.168.0.194:6998, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r3: Subvolume 'w-remote3-c' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [afr.c:2627:notify] w-r1: Subvolume 'w-remote1-b' came back up; going online.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote2-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick2'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-b: Connected to 192.168.0.194:6997, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote1-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick1'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:47:32] N [client-protocol.c:6246:client_setvolume_cbk] w-remote3-a: Connected to 192.168.0.194:6996, attached to remote volume 'brick3'.
> [2010-03-26 14:47:45] N [trace.c:1769:trace_opendir] trace: 39:( loc {path=/, ino=1}, fd=0x22437c0)
> [2010-03-26 14:47:46] N [trace.c:808:trace_opendir_cbk] trace: 39: (op_ret=0, op_errno=117, fd=0x22437c0)
> [2010-03-26 14:47:46] E [afr-self-heal-algorithm.c:766:sh_diff_checksum_cbk] client1: checksum on /sample-file01 failed on subvolume d-east (File descriptor in bad state)
> [2010-03-26 14:47:46] E [afr-self-heal-algorithm.c:766:sh_diff_checksum_cbk] client1: checksum on /sample-file01 failed on subvolume d-west (File descriptor in bad state)
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote2-b: Server 192.168.0.195:6997 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote3-c: Server 192.168.0.195:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote3-c: Server 192.168.0.195:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote1-b: Server 192.168.0.194:6997 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote1-b: Server 192.168.0.194:6997 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote2-c: Server 192.168.0.194:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote3-c: Server 192.168.0.194:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote3-c: Server 192.168.0.194:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote1-b: Server 192.168.0.195:6997 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote1-a: Server 192.168.0.195:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote2-c: Server 192.168.0.195:6998 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote2-b: Server 192.168.0.194:6997 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:10] E [client-protocol.c:415:client_ping_timer_expired] w-remote1-a: Server 192.168.0.194:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:10] E [client-protocol.c:415:client_ping_timer_expired] w-remote3-a: Server 192.168.0.194:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:10] E [client-protocol.c:415:client_ping_timer_expired] remote3-a: Server 192.168.0.195:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:39] E [client-protocol.c:415:client_ping_timer_expired] remote3-a: Server 192.168.0.195:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:39] E [client-protocol.c:415:client_ping_timer_expired] w-remote1-a: Server 192.168.0.194:6996 has not responded in the last 42 seconds, disconnecting.
> [2010-03-26 14:49:39] E [client-protocol.c:415:client_ping_timer_expired] w-remote3-a: Server 192.168.0.194:6996 has not responded in the last 42 seconds, disconnecting.
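
To see at a glance which server endpoints stopped answering, the ping-timeout lines can be grouped by address. The following is a minimal Python sketch, assuming the log line format shown above; the function name and the regular expression are illustrative assumptions, not part of GlusterFS.

```python
import re
from collections import defaultdict

# Matches the "client_ping_timer_expired" error lines shown in the log above.
PING_RE = re.compile(
    r"\] E \[[^\]]*client_ping_timer_expired\] (?P<vol>[\w-]+): "
    r"Server (?P<addr>[\d.]+:\d+) has not responded"
)


def timed_out_by_server(log_lines):
    """Map each 'host:port' to the sorted client volumes that timed out on it."""
    result = defaultdict(set)
    for line in log_lines:
        match = PING_RE.search(line)
        if match:
            result[match.group("addr")].add(match.group("vol"))
    return {addr: sorted(vols) for addr, vols in result.items()}


# Three lines copied from the log above.
SAMPLE_LOG = [
    "> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote2-b: Server 192.168.0.195:6997 has not responded in the last 42 seconds, disconnecting.",
    "> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] remote1-b: Server 192.168.0.195:6997 has not responded in the last 42 seconds, disconnecting.",
    "> [2010-03-26 14:48:57] E [client-protocol.c:415:client_ping_timer_expired] w-remote3-c: Server 192.168.0.194:6998 has not responded in the last 42 seconds, disconnecting.",
]

if __name__ == "__main__":
    for addr, vols in sorted(timed_out_by_server(SAMPLE_LOG).items()):
        print(addr, ", ".join(vols))
```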






