[Bugs] [Bug 1623749] Geo-rep: Few workers fails to start with out any failure

bugzilla at redhat.com bugzilla at redhat.com
Thu Oct 11 07:27:18 UTC 2018


https://bugzilla.redhat.com/show_bug.cgi?id=1623749

Chandrakanth Pai <chpai at redhat.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |chpai at redhat.com,
                   |                            |sunkumar at redhat.com
              Flags|                            |needinfo?(sunkumar at redhat.c
                   |                            |om)


--- Doc Text *updated* ---
Previously, some workers failed to come up because they were stuck waiting for a lock. When a monitor starts the workers, each worker updates the status file and uses flock to synchronize the update. While worker one had the status file open for its update, worker two could be forked, so worker two inherited a reference to the same file descriptor. Because the lock was released only by closing the file descriptor, closing it in worker one did not release the lock while the inherited reference remained open in worker two. Worker two then deadlocked waiting for the lock and never came up. With this fix, the flock is unlocked explicitly and the status file is updated in a way that does not leak the reference to any worker or agent process. As a result, all workers come up without fail.
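
The behaviour described above can be reproduced with a small sketch. This is not the geo-replication code itself: the function names (update_status, can_lock, lock_survives_fork) and the status file path are hypothetical and exist only for illustration. It assumes a Linux or other POSIX system where flock locks belong to the open file description, so a descriptor inherited across fork keeps the lock alive until every copy is closed or the lock is explicitly released.

```python
import fcntl
import os


def update_status(path, text):
    """Write text to the status file under an exclusive flock.

    The lock is dropped with an explicit LOCK_UN instead of relying on
    close(), so a descriptor copy inherited by a forked process cannot
    keep the lock held.
    """
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o644)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX)
        try:
            os.ftruncate(fd, 0)
            os.write(fd, text.encode())
        finally:
            fcntl.flock(fd, fcntl.LOCK_UN)  # explicit unlock
    finally:
        os.close(fd)


def can_lock(path):
    """Return True if an exclusive lock can be taken without blocking."""
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o644)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        return False
    else:
        fcntl.flock(fd, fcntl.LOCK_UN)
        return True
    finally:
        os.close(fd)


def lock_survives_fork(path, explicit_unlock):
    """Report whether the lock is still held after the parent closes its
    descriptor while a forked child keeps an inherited copy open."""
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o644)
    fcntl.flock(fd, fcntl.LOCK_EX)
    r, w = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: keeps the inherited fd open until the parent closes the pipe.
        os.close(w)
        os.read(r, 1)
        os._exit(0)
    os.close(r)
    if explicit_unlock:
        fcntl.flock(fd, fcntl.LOCK_UN)
    os.close(fd)  # without LOCK_UN, the child's copy still holds the lock
    still_locked = not can_lock(path)
    os.close(w)  # child sees EOF and exits
    os.waitpid(pid, 0)
    return still_locked
```

Calling lock_survives_fork(path, explicit_unlock=False) reports that the lock is still held after the parent closes its descriptor, which is the situation that left the second worker waiting. With explicit_unlock=True the lock is free as soon as the parent unlocks, regardless of the copy held by the child.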



