[Gluster-users] "No space left on device" during rebalance with failed brick on Gluster 4.1.7

Alan Orth alan.orth at gmail.com
Tue May 7 13:12:06 UTC 2019

Dear list,

We are running a Distributed-Replicate volume (replica 2) on Gluster 4.1.7
on CentOS 7. One of our nodes died recently, and we will soon add new nodes
and bricks to replace it. In preparation for that maintenance I started a
rebalance, hoping to reduce disk thrashing when we add and remove bricks,
but after eight hours of scanning the rebalance status shows millions of
"failures". The volume's rebalance log contains many errors like:

[2019-05-07 06:06:02.310843] E [MSGID: 109023]
[dht-rebalance.c:2907:gf_defrag_migrate_single_file] 0-data-dht:
migrate-data failed for
[No space left on device]

The bricks on the healthy nodes all have 1.5 TB of free space, so I'm not
sure what this error means. Could it be because one of the replicas is
unavailable? I saw a similar bug report¹ about that. In the meantime I have
started a simple fix-layout without data migration, and it is working fine.
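
To check whether "No space left on device" is the only reason behind the
failures, the rebalance log can be tallied by the bracketed reason at the end
of each entry. The following is a minimal Python sketch, not a Gluster tool:
the function name tally_failures and the regular expression are illustrative
assumptions based only on the log entry quoted above, and it assumes each
entry sits on a single line in the actual log file (the excerpt above is
wrapped by the mail client).

```python
import re
from collections import Counter

# Matches single-line entries shaped like the MSGID 109023 error quoted above:
#   [timestamp] LEVEL [MSGID: n] [source:line:function] domain: message [reason]
# The pattern is an assumption derived from that one example.
LINE_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\]\s+(?P<level>[A-Z])\s+"
    r"\[MSGID:\s*(?P<msgid>\d+)\]\s+"
    r".*\[(?P<reason>[^\]]*)\]\s*$"
)


def tally_failures(lines, msgid="109023"):
    """Count error-level entries with the given MSGID, grouped by trailing reason."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line)
        # Keep only "E" (error) entries carrying the requested message ID.
        if m and m.group("level") == "E" and m.group("msgid") == msgid:
            counts[m.group("reason")] += 1
    return counts
```

To use it, open the rebalance log and pass the file object to
tally_failures(), then print the result of .most_common(). The log location
is an assumption here: something like /var/log/glusterfs/data-rebalance.log,
with the volume name "data" inferred from the "0-data-dht" prefix in the
entry above. If every failure carries the same reason, that points to a
single cause rather than to individual bricks filling up.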

Thank you,

¹ https://access.redhat.com/solutions/456333
