[Gluster-users] Recovery from network failure
Georgecooldude
georgecooldude at gmail.com
Wed Sep 23 20:41:59 UTC 2009
It does seem to detect it in the log.
This is what and did and the attached log file
###
SRV01 - 192.168.1.1
SRV02 - 192.168.1.2
##
------------------------
Step 1: Copy large file to the gluster mount on server02
admin at srv02:/mnt/glusterfs$ ls -lh
total 1.2G
-rw-r--r-- 1 root root 584M 2009-09-23 21:29 test03
------------------------
------------------------
Step 2: Pull the cable from srv02
------------------------
------------------------
Step 3: ls on srv01 - See I have a partial file
admin at srv01:/mnt/glusterfs$ ls -lh
total 775M
-rw-r--r-- 1 root root 191M 2009-09-23 21:28 test03
------------------------
------------------------
Server02 Log file looks like this:
Version : glusterfs 2.0.6 built on Sep 19 2009 18:00:37
TLA Revision : v2.0.6
Starting Time: 2009-09-23 21:26:49
Command line : glusterfsd -f /etc/glusterfs/glusterfs-server.vol -l
/var/log/gluster/gluster-log.txt -L DEBUG --volfile-check
PID : 5085
System name : Linux
Nodename : srv02
Kernel Release : 2.6.24-24-server
Hardware Identifier: x86_64
Given volfile:
+------------------------------------------------------------------------------+
1: # file: /etc/glusterfs/glusterfs-server.vol
2:
3: volume posix
4: type storage/posix
5: option directory /data/export
6: end-volume
7:
8: volume locks
9: type features/locks
10: subvolumes posix
11: end-volume
12:
13: volume brick
14: type performance/io-threads
15: option thread-count 8
16: subvolumes locks
17: end-volume
18:
19: volume posix-ns
20: type storage/posix
21: option directory /data/export-ns
22: end-volume
23:
24: volume locks-ns
25: type features/locks
26: subvolumes posix-ns
27: end-volume
28:
29: volume brick-ns
30: type performance/io-threads
31: option thread-count 8
32: subvolumes locks-ns
33: end-volume
34:
35: volume server
36: type protocol/server
37: option transport-type tcp
38: option auth.addr.brick.allow *
39: option auth.addr.brick-ns.allow *
40: subvolumes brick brick-ns
41: end-volume
+------------------------------------------------------------------------------+
[2009-09-23 21:26:49] D [glusterfsd.c:1205:main] glusterfs: running in pid
5085
[2009-09-23 21:26:49] D [io-threads.c:2280:init] brick: io-threads:
Autoscaling: off, min_threads: 8, max_threads: 8
[2009-09-23 21:26:49] D [io-threads.c:2280:init] brick-ns: io-threads:
Autoscaling: off, min_threads: 8, max_threads: 8
[2009-09-23 21:26:49] D [transport.c:141:transport_load] transport: attempt
to load file /usr/local/lib/glusterfs/2.0.6/transport/socket.so
[2009-09-23 21:26:49] N [glusterfsd.c:1224:main] glusterfs: Successfully
started
[2009-09-23 21:26:56] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.2"
[2009-09-23 21:26:56] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.2:1021
[2009-09-23 21:26:56] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.2"
[2009-09-23 21:26:56] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.2:1020
[2009-09-23 21:26:56] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.2"
[2009-09-23 21:26:56] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.2:1017
[2009-09-23 21:26:56] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.2"
[2009-09-23 21:26:56] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.2:1016
[2009-09-23 21:27:16] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.1"
[2009-09-23 21:27:16] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1021
[2009-09-23 21:27:17] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.1"
[2009-09-23 21:27:17] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1020
[2009-09-23 21:27:17] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.1"
[2009-09-23 21:27:17] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1017
[2009-09-23 21:27:17] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.1"
[2009-09-23 21:27:17] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1016
[2009-09-23 21:29:21] N [server-protocol.c:7816:notify] server:
192.168.1.1:1021 disconnected
[2009-09-23 21:29:21] N [server-protocol.c:7816:notify] server:
192.168.1.1:1020 disconnected
[2009-09-23 21:29:37] N [server-protocol.c:7816:notify] server:
192.168.1.1:1017 disconnected
[2009-09-23 21:29:37] D [socket.c:1298:socket_submit] server: not connected
(priv->connected = 255)
[2009-09-23 21:29:37] N [server-helpers.c:779:server_connection_destroy]
server: destroyed connection of srv01-5127-2009/09/23-20:52:02:522004-brick2
[2009-09-23 21:29:37] N [server-protocol.c:7816:notify] server:
192.168.1.1:1016 disconnected
[2009-09-23 21:29:37] N [server-helpers.c:779:server_connection_destroy]
server: destroyed connection of
srv01-5127-2009/09/23-20:52:02:522004-brick2-ns
[2009-09-23 21:29:40] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.1"
[2009-09-23 21:29:40] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1015
[2009-09-23 21:29:40] D [addr.c:174:gf_auth] brick: allowed = "*", received
addr = "192.168.1.1"
[2009-09-23 21:29:40] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1014
[2009-09-23 21:29:40] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.1"
[2009-09-23 21:29:40] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1013
[2009-09-23 21:29:40] D [addr.c:174:gf_auth] brick-ns: allowed = "*",
received addr = "192.168.1.1"
[2009-09-23 21:29:40] N [server-protocol.c:7056:mop_setvolume] server:
accepted client from 192.168.1.1:1012
------------------------
No matter how many times I -ls the directory or file I cannot get it to
sync.
I can rename the files and have the name changes sync. Just not the files
themselves.
admin at srv02:/mnt/glusterfs$ ls -lh
-rw-r--r-- 1 root root 584M 2009-09-23 21:29 test03
admin at srv02:/mnt/glusterfs$ mv test03 test03a
admin at srv01:/mnt/glusterfs$ ls -lh (on server02 now)
-rw-r--r-- 1 root root 191M 2009-09-23 21:28 test03a
Any ideas what I might be doing wrong?
On Wed, Sep 23, 2009 at 5:55 AM, Anand Avati <avati at gluster.com> wrote:
> On 9/23/09, Georgecooldude <georgecooldude at gmail.com> wrote:
> > Anyone have any ideas on the below? Thanks.
> >
>
> Does the logfile of the server whose cable you pulled out, recognize
> the disconnection from the client?
>
> Avati
>
More information about the Gluster-users
mailing list