[Gluster-users] Self Heal Confusion

Brett Holcomb biholcomb at l1049h.com
Sun Dec 23 04:17:32 UTC 2018


Very strange.  I see this in the glusterd.log

[2018-12-22 23:53:47.216743] E [MSGID: 101191] 
[event-epoll.c:671:event_dispatch_epoll_worker] 0-epoll: Failed to 
dispatch handler
(END)

After force starting the volume and doing a

gluster vol  heal projects full

This is in the glustershd log so I assume it started.

[2018-12-22 22:54:22.328897] I [MSGID: 114046] 
[client-handshake.c:1107:client_setvolume_cbk] 0-projects-client-5: 
Connected to projects-client-5, attached to remote volume 
'/srv/gfs01/Projects'.

This shows up in the glfsheal-projects.log file.

[2018-12-22 23:53:41.916773] E [MSGID: 101191] 
[event-epoll.c:671:event_dispatch_epoll_worker] 0-epoll: Failed to 
dispatch handler

I'm not sure what it's trying to tell me when it fails to dispatch a 
handler.

 From what I could find there were issues in the early 5.0 build with 
some of these errors coming up but that a patch was included early on.  
I am on 5.2

I'll keep digging.


On 12/20/18 8:26 PM, John Strunk wrote:
> Assuming your bricks are up... yes, the heal count should be decreasing.
>
> There is/was a bug wherein self-heal would stop healing but would 
> still be running. I don't know whether your version is affected, but 
> the remedy is to just restart the self-heal daemon.
> Force start one of the volumes that has heals pending. The bricks are 
> already running, but it will cause shd to restart and, assuming this 
> is the problem, healing should begin...
>
> $ gluster vol start my-pending-heal-vol force
>
> Others could better comment on the status of the bug.
>
> -John
>
>
> On Thu, Dec 20, 2018 at 5:45 PM Brett Holcomb <biholcomb at l1049h.com 
> <mailto:biholcomb at l1049h.com>> wrote:
>
>     I have one volume that has 85 pending entries in healing and two more
>     volumes with 58,854 entries in healing pending.  These numbers are
>     from
>     the volume heal info summary command.  They have stayed constant
>     for two
>     days now.  I've read the gluster docs and many more.  The Gluster
>     docs
>     just give some commands and non gluster docs basically repeat that.
>     Given that it appears no self-healing is going on for my volume I am
>     confused as to why.
>
>     1.  If a self-heal deamon is listed on a host (all of mine show
>     one with
>     a volume status command) can I assume it's enabled and running?
>
>     2.  I assume the volume that has all the self-heals pending has some
>     serious issues even though I can access the files and directories on
>     it.  If self-heal is running shouldn't the numbers be decreasing?
>
>     It appears to me self-heal is not working properly so how to I get
>     it to
>     start working or should I delete the volume and start over?
>
>     I'm running gluster 5.2 on Centos 7 latest and updated.
>
>     Thank you.
>
>
>     _______________________________________________
>     Gluster-users mailing list
>     Gluster-users at gluster.org <mailto:Gluster-users at gluster.org>
>     https://lists.gluster.org/mailman/listinfo/gluster-users
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.gluster.org/pipermail/gluster-users/attachments/20181222/2e87b820/attachment.html>


More information about the Gluster-users mailing list