[Bugs] [Bug 1224116] server crashed during rebalance in dht_selfheal_layout_new_directory

bugzilla at redhat.com bugzilla at redhat.com
Wed Jun 17 10:34:33 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1224116

Susant Kumar Palai <spalai at redhat.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |spalai at redhat.com



--- Comment #5 from Susant Kumar Palai <spalai at redhat.com> ---
Here are the rebalance logs for failure and stop scenarios.

Failed: 

8] I [dht-rebalance.c:2113:gf_defrag_process_dir] 0-testvol-dht: migrate data
called on /.trashcan
[2015-06-16 17:21:53.546314] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:21:53.546445] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:21:53.547448] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:21:53.547530] W [ec-common.c:867:ec_prepare_update_cbk]
0-testvol-disperse-1: Failed to get size and version (error 5: Input/output
error)

[2015-06-16 17:21:53.553661] E [MSGID: 109021]
[dht-rebalance.c:1866:gf_defrag_get_entry] 0-testvol-dht: /.trashcan: Migrate
data failed: Readdir returned Invalid argument. Aborting migrate-data

>>> here readdir got invalid argument. Hence, is the failuer.

[2015-06-16 17:21:53.553706] I [dht-rebalance.c:2283:gf_defrag_process_dir]
0-DHT: Found critical error from gf_defrag_get_entry
[2015-06-16 17:21:53.553736] E [MSGID: 109016]
[dht-rebalance.c:2542:gf_defrag_fix_layout] 0-testvol-dht: Fix layout failed
for /.trashcan
[2015-06-16 17:21:53.553836] I [dht-rebalance.c:1758:gf_defrag_task] 0-DHT:
Thread wokeup. defrag->current_thread_count: 3
[2015-06-16 17:21:53.553988] I [dht-rebalance.c:1758:gf_defrag_task] 0-DHT:
Thread wokeup. defrag->current_thread_count: 4
[2015-06-16 17:21:53.554611] I [MSGID: 109028]
[dht-rebalance.c:3026:gf_defrag_status_get] 0-testvol-dht: Rebalance is failed.
Time taken is 1.00 secs
[2015-06-16 17:21:53.554651] I [MSGID: 109028]
[dht-rebalance.c:3030:gf_defrag_status_get] 0-testvol-dht: Files migrated: 0,
size: 0, lookups: 3, failures: 2, skipped: 0




For stop :
0.509085] I [dht-rebalance.c:2113:gf_defrag_process_dir] 0-testvol-dht: migrate
data called on /.trashcan
[2015-06-16 17:36:50.517475] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:36:50.517994] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:36:50.518169] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:36:50.545902] N [ec-generic.c:1306:ec_combine_xattrop]
0-testvol-disperse-1: Mismatching dictionary in answers of 'GF_FOP_XATTROP'
[2015-06-16 17:36:50.546017] W [ec-common.c:865:ec_prepare_update_cbk]
0-testvol-disperse-1: Failed to get size and version (error 5: Input/output
error)
>>>>  The above log is similar to the failure case. In this case readdir never got response from ec. Henc, it never progressed.

[2015-06-16 17:37:43.136893] I [MSGID: 109028]
[dht-rebalance.c:3023:gf_defrag_status_get] 0-glusterfs: Rebalance is in
progress. Time taken is 55.00 secs
[2015-06-16 17:37:43.136973] I [MSGID: 109028]
[dht-rebalance.c:3027:gf_defrag_status_get] 0-glusterfs: Files migrated: 0,
size: 0, lookups: 3, failures: 0, skipped: 0


>From the above logs, it seems as a ec issue.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=04sOC7MS3J&a=cc_unsubscribe


More information about the Bugs mailing list