[Gluster-users] Self-heal working ? Seeing "background meta-data data self-heal failed" in logs
Torbjørn Thorsen
torbjorn at trollweb.no
Wed Apr 23 14:22:24 UTC 2014
Greetings.
We have a distributed-replicated setup in which one of the servers,
and therefore two of the bricks, has been offline for some time.
Now that I have brought the previously down server back online, I see
some worrying lines in the client log:
[2014-04-23 13:02:17.384463] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-0: background meta-data data self-heal failed on
/some-path-here/disk0
Not much traffic is moving between the client and the previously down server,
although running stat and getfattr on the file directly on the bricks
shows that writes are happening in both places.
I'm not at all sure the self-heal process is actually running, though.
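
For anyone reading the getfattr output, here is a minimal sketch of decoding the replication changelog attributes. It assumes getfattr was run with hex encoding (-e hex) and that the attributes of interest are the trusted.afr.<volume>-client-N ones, which as far as I understand hold 12 bytes: three big-endian 32-bit pending counters for data, metadata and entry operations. The function name and the sample value are hypothetical and not taken from the bricks in this report.

```python
import struct


def decode_afr_changelog(hex_value):
    """Split a hex-encoded trusted.afr.* value into its pending counters.

    Assumption: the value is 12 bytes, laid out as three big-endian
    unsigned 32-bit integers in the order data, metadata, entry.
    """
    text = hex_value.strip()
    if text.lower().startswith("0x"):
        text = text[2:]
    raw = bytes.fromhex(text)
    if len(raw) != 12:
        raise ValueError("expected 12 bytes, got %d" % len(raw))
    data, metadata, entry = struct.unpack(">III", raw)
    return {"data": data, "metadata": metadata, "entry": entry}


if __name__ == "__main__":
    # Hypothetical sample value, for illustration only.
    print(decode_afr_changelog("0x000000020000000100000000"))
```

Non-zero counters on one brick for the other brick's client attribute would suggest operations are still pending towards that brick, while all zeros on both sides would suggest nothing is left to heal for that file.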
We're on Gluster 3.4, but both servers went through a rolling
upgrade, and judging by the log we're still in a 3.3 state of
mind, so to speak.
Here's the full log from the client when the server comes back:
[2014-04-23 13:02:03.747626] I
[client-handshake.c:1658:select_server_supported_programs]
0-gluster0-client-2: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2014-04-23 13:02:03.763042] I
[client-handshake.c:1658:select_server_supported_programs]
0-gluster0-client-0: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2014-04-23 13:02:03.763399] I
[client-handshake.c:1456:client_setvolume_cbk] 0-gluster0-client-2:
Connected to 192.168.51.201:49153, attached to remote volume
'/srv/gluster/brick1'.
[2014-04-23 13:02:03.763418] I
[client-handshake.c:1468:client_setvolume_cbk] 0-gluster0-client-2:
Server and Client lk-version numbers are not same, reopening the fds
[2014-04-23 13:02:03.763468] I
[client-handshake.c:1308:client_post_handshake] 0-gluster0-client-2: 1
fds open - Delaying child_up until they are re-opened
[2014-04-23 13:02:03.764814] I
[client-handshake.c:1456:client_setvolume_cbk] 0-gluster0-client-0:
Connected to 192.168.51.201:49152, attached to remote volume
'/srv/gluster/brick0'.
[2014-04-23 13:02:03.764832] I
[client-handshake.c:1468:client_setvolume_cbk] 0-gluster0-client-0:
Server and Client lk-version numbers are not same, reopening the fds
[2014-04-23 13:02:03.764846] I
[client-handshake.c:1308:client_post_handshake] 0-gluster0-client-0: 2
fds open - Delaying child_up until they are re-opened
[2014-04-23 13:02:03.764952] I
[client-handshake.c:930:client_child_up_reopen_done]
0-gluster0-client-2: last fd open'd/lock-self-heal'd - notifying
CHILD-UP
[2014-04-23 13:02:03.766292] I
[client-handshake.c:930:client_child_up_reopen_done]
0-gluster0-client-0: last fd open'd/lock-self-heal'd - notifying
CHILD-UP
[2014-04-23 13:02:03.766379] I
[client-handshake.c:450:client_set_lk_version_cbk]
0-gluster0-client-2: Server lk version = 1
[2014-04-23 13:02:03.853489] I
[client-handshake.c:450:client_set_lk_version_cbk]
0-gluster0-client-0: Server lk version = 1
[2014-04-23 13:02:17.384463] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-0: background meta-data data self-heal failed on
/some-path-here/disk0
[2014-04-23 13:02:20.253380] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-1: background meta-data data self-heal failed on
/some-other-path/disk0
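
To keep track of which files are affected, a rough sketch like the following can pull the failed self-heal entries out of the client log. It assumes each entry is a single line in the actual log file (the entries above are wrapped by the mail client) and that the message format matches the two error lines shown here. The pattern and function name are my own and not part of Gluster.

```python
import re

# Matches error lines shaped like the two self-heal failures quoted above.
FAILED_HEAL = re.compile(
    r"\[(?P<ts>[^\]]+)\]\s+E\s+"
    r"\[afr-self-heal-common\.c:\d+:afr_self_heal_completion_cbk\]\s+"
    r"(?P<subvol>\S+):\s+background\s+(?P<kinds>.+?)\s+"
    r"self-heal failed on\s+(?P<path>.+)$"
)


def failed_self_heals(lines):
    """Return (timestamp, subvolume, heal kinds, path) for each failure line."""
    found = []
    for line in lines:
        # Collapse runs of whitespace so stray spacing does not break the match.
        match = FAILED_HEAL.search(" ".join(line.split()))
        if match:
            found.append((
                match.group("ts"),
                match.group("subvol"),
                match.group("kinds"),
                match.group("path"),
            ))
    return found


if __name__ == "__main__":
    sample = [
        "[2014-04-23 13:02:17.384463] E "
        "[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk] "
        "0-gluster0-replicate-0: background meta-data data self-heal "
        "failed on /some-path-here/disk0",
    ]
    for entry in failed_self_heals(sample):
        print(entry)
```

Passing an open log file instead of the sample list works the same way, since the function only iterates over lines.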
--
Kind regards
Torbjørn Thorsen
Developer / operations technician
Trollweb Solutions AS
- Professional Magento Partner
www.trollweb.no
Phone, daytime: +47 51215300
Phone, evenings/weekends: for customers with a service agreement
Visiting address: Luramyrveien 40, 4313 Sandnes
Postal address: Maurholen 57, 4316 Sandnes
Please note that all our standard terms and conditions always apply