[Gluster-users] Self-heal working ? Seeing "background meta-data data self-heal failed" in logs

Torbjørn Thorsen torbjorn at trollweb.no
Wed Apr 23 14:22:24 UTC 2014


Greetings.

We have a distributed and replicated setup where one of the servers,
which means two of the bricks, have been off-line for some time.

Now when I engage the previously-down server, I see some worrying
lines in the client log.

[2014-04-23 13:02:17.384463] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-0: background  meta-data data self-heal failed on
/some-path-here/disk0

Not much traffic is moving between the client and the previously-down server,
although I can see from stat-ing and using getfattr on the file
directly on the bricks that writes are happening both places.
I'm not at all sure I'm actually performing the self-heal process, though.

We're on Gluster 3.4, although both servers have been rolling
upgraded, so it seems from the log that we're still in a 3.3 state of
mind, so to speak.

Here's the full log from the client when the server comes back:
[2014-04-23 13:02:03.747626] I
[client-handshake.c:1658:select_server_supported_programs]
0-gluster0-client-2: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2014-04-23 13:02:03.763042] I
[client-handshake.c:1658:select_server_supported_programs]
0-gluster0-client-0: Using Program GlusterFS 3.3, Num (1298437),
Version (330)
[2014-04-23 13:02:03.763399] I
[client-handshake.c:1456:client_setvolume_cbk] 0-gluster0-client-2:
Connected to 192.168.51.201:49153, attached to remote volume
'/srv/gluster/brick1'.
[2014-04-23 13:02:03.763418] I
[client-handshake.c:1468:client_setvolume_cbk] 0-gluster0-client-2:
Server and Client lk-version numbers are not same, reopening the fds
[2014-04-23 13:02:03.763468] I
[client-handshake.c:1308:client_post_handshake] 0-gluster0-client-2: 1
fds open - Delaying child_up until they are re-opened
[2014-04-23 13:02:03.764814] I
[client-handshake.c:1456:client_setvolume_cbk] 0-gluster0-client-0:
Connected to 192.168.51.201:49152, attached to remote volume
'/srv/gluster/brick0'.
[2014-04-23 13:02:03.764832] I
[client-handshake.c:1468:client_setvolume_cbk] 0-gluster0-client-0:
Server and Client lk-version numbers are not same, reopening the fds
[2014-04-23 13:02:03.764846] I
[client-handshake.c:1308:client_post_handshake] 0-gluster0-client-0: 2
fds open - Delaying child_up until they are re-opened
[2014-04-23 13:02:03.764952] I
[client-handshake.c:930:client_child_up_reopen_done]
0-gluster0-client-2: last fd open'd/lock-self-heal'd - notifying
CHILD-UP
[2014-04-23 13:02:03.766292] I
[client-handshake.c:930:client_child_up_reopen_done]
0-gluster0-client-0: last fd open'd/lock-self-heal'd - notifying
CHILD-UP
[2014-04-23 13:02:03.766379] I
[client-handshake.c:450:client_set_lk_version_cbk]
0-gluster0-client-2: Server lk version = 1
[2014-04-23 13:02:03.853489] I
[client-handshake.c:450:client_set_lk_version_cbk]
0-gluster0-client-0: Server lk version = 1
[2014-04-23 13:02:17.384463] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-0: background  meta-data data self-heal failed on
/some-path-here/disk0
[2014-04-23 13:02:20.253380] E
[afr-self-heal-common.c:2212:afr_self_heal_completion_cbk]
0-gluster0-replicate-1: background  meta-data data self-heal failed on
/some-other-path/disk0



-- 
Vennlig hilsen
Torbjørn Thorsen
Utvikler / driftstekniker

Trollweb Solutions AS
- Professional Magento Partner
www.trollweb.no

Telefon dagtid: +47 51215300
Telefon kveld/helg: For kunder med Serviceavtale

Besøksadresse: Luramyrveien 40, 4313 Sandnes
Postadresse: Maurholen 57, 4316 Sandnes

Husk at alle våre standard-vilkår alltid er gjeldende



More information about the Gluster-users mailing list