[Bugs] [Bug 1279306] New: Dist-geo-rep : checkpoint doesn't reach even though all the files have been synced through hybrid crawl.

bugzilla at redhat.com bugzilla at redhat.com
Mon Nov 9 05:57:02 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1279306

            Bug ID: 1279306
           Summary: Dist-geo-rep : checkpoint doesn't reach even though
                    all the files have been synced through hybrid crawl.
           Product: GlusterFS
           Version: 3.7.5
         Component: geo-replication
          Severity: high
          Priority: medium
          Assignee: bugs at gluster.org
          Reporter: avishwan at redhat.com
                CC: annair at redhat.com, avishwan at redhat.com,
                    bugs at gluster.org, chrisw at redhat.com, csaba at redhat.com,
                    david.macdonald at redhat.com, gluster-bugs at redhat.com,
                    rhinduja at redhat.com, vkoppad at redhat.com,
                    vshankar at redhat.com
        Depends On: 1044645, 1247536, 1064309
            Blocks: 1202842, 1223636



+++ This bug was initially created as a clone of Bug #1247536 +++

+++ This bug was initially created as a clone of Bug #1044645 +++

Description of problem: geo-rep status checkpoint doesn't reach even though all
the files have been synced through hybrid crawl.


Version-Release number of selected component (if
applicable):glusterfs-3.4.0.51geo-1

How reproducible: didn't try to reproduce, but seems like consistently
reproducible. 


Steps to Reproduce:
1.create and start a geo-rep relationship between master and slave. 
2.stop geo-rep 
3.create some data on master.
4.set the checkpoint.
5.start geo-rep 
6. wait for the geo-rep to sync data. 
7. check geo-rep status whether checkpoint has reached or not. 

Actual results: checkpoint doesn't reach at all. 


Expected results: checkpoint should reach when all the files are synced. 


--- Additional comment from Aravinda VK on 2013-12-20 03:05:42 EST ---

During start of hybrid crawl, crawler stores masters xtime in memory. After
completion of crawl and sync, it will update the same xtime for slave. 

If files created after crawler started, then checkpoint time will be more than
the last saved xtime in memory, so even after completion it shows checkpoint is
not reached.

This is expected behavior, if we update the latest xtime instead of xtime
stored in memory, their are chances of data loss.

--- Additional comment from Venky Shankar on 2013-12-20 03:41:07 EST ---

Vijaykumar,

was I/O done on the mount after checkpoint was set? If yes, then isn't this the
expected behaviour?

--- Additional comment from Rahul Hinduja on 2015-07-07 06:58:30 EDT ---

Verified with build: glusterfs-3.7.1-7.el6rhs.x86_64

Tried both the below scenarios:

a. Have the files before creation of geo-rep session so as to use HYBRID CRAWL
b. Change the change_detector to xsync to use HYBRID CRAWL

In both the above cases, the last sync is not update. In the first case, Last
sync is N/A and in the second case, last sync shows when the last changelog was
synced.

Eventually in Hybrid Crawl, the checkpoint completed Remains always as NO even
when the files are synced to slave. Moving this bug to Assigned state.

--- Additional comment from Rahul Hinduja on 2015-07-07 07:00:25 EDT ---

For Scenario A:
===============

[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave status detail

MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE             
    SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED    ENTRY    DATA   
META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT
COMPLETION TIME   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A          
 0        3567    7797    0           2015-07-07 15:39:13    No                
     N/A                          
georep1        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A          
 0        3611    7845    0           2015-07-07 15:39:13    No                
     N/A                          
georep3        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A          
 0        3441    7611    0           2015-07-07 15:39:13    No                
     N/A                          
georep3        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A          
 0        3550    7726    0           2015-07-07 15:39:13    No                
     N/A                          
georep2        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
 N/A      N/A     N/A     N/A         N/A                    N/A               
     N/A                          
georep2        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
 N/A      N/A     N/A     N/A         N/A                    N/A               
     N/A                          
georep4        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
 N/A      N/A     N/A     N/A         N/A                    N/A               
     N/A                          
georep4        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
 N/A      N/A     N/A     N/A         N/A                    N/A               
     N/A                          
[root at georep1 scripts]# 



[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave status detail

MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE             
    SLAVE NODE      STATUS     CRAWL STATUS       LAST_SYNCED    ENTRY    DATA 
  META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT
COMPLETION TIME   
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    N/A       
    0        7798    0       0           2015-07-07 15:39:13    No             
        N/A                          
georep1        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    N/A       
    0        7847    0       0           2015-07-07 15:39:13    No             
        N/A                          
georep3        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl       N/A       
    0        3441    0       0           2015-07-07 15:39:13    No             
        N/A                          
georep3        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl       N/A       
    0        3550    0       0           2015-07-07 15:39:13    No             
        N/A                          
georep2        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
    N/A      N/A     N/A     N/A         N/A                    N/A            
        N/A                          
georep2        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
    N/A      N/A     N/A     N/A         N/A                    N/A            
        N/A                          
georep4        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
    N/A      N/A     N/A     N/A         N/A                    N/A            
        N/A                          
georep4        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
    N/A      N/A     N/A     N/A         N/A                    N/A            
        N/A                          
[root at georep1 scripts]# 



[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave status detail

MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE             
    SLAVE NODE      STATUS     CRAWL STATUS       LAST_SYNCED            ENTRY 
  DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED   
CHECKPOINT COMPLETION TIME   
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07
15:39:27    0        0       0       0           2015-07-07 15:39:13    Yes    
                2015-07-07 15:53:41          
georep1        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07
15:39:27    0        0       0       0           2015-07-07 15:39:13    Yes    
                2015-07-07 15:52:54          
georep2        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
            N/A      N/A     N/A     N/A         N/A                    N/A    
                N/A                          
georep2        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
            N/A      N/A     N/A     N/A         N/A                    N/A    
                N/A                          
georep3        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07
15:39:33    0        0       0       0           2015-07-07 15:39:13    Yes    
                2015-07-07 15:53:12          
georep3        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07
15:39:33    0        0       0       0           2015-07-07 15:39:13    Yes    
                2015-07-07 15:53:14          
georep4        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
            N/A      N/A     N/A     N/A         N/A                    N/A    
                N/A                          
georep4        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A       
            N/A      N/A     N/A     N/A         N/A                    N/A    
                N/A                          
[root at georep1 scripts]# 

[rahinuj at rahul Desktop]$

--- Additional comment from Rahul Hinduja on 2015-07-07 07:02:30 EDT ---

For Scenario B:
===============

[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave config change_detector
changelog
[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave config change_detector xsync
geo-replication config updated successfully
[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave config change_detector
xsync
[root at georep1 scripts]#


[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave status detail

MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE             
    SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED            ENTRY   
DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED   
CHECKPOINT COMPLETION TIME   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:27    0        154     0       0           2015-07-07 16:16:07    No     
                N/A                          
georep1        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:27    0        160     0       0           2015-07-07 16:16:07    No     
                N/A                          
georep3        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:33    0        156     0       0           2015-07-07 16:16:07    No     
                N/A                          
georep3        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:33    0        179     0       0           2015-07-07 16:16:07    No     
                N/A                          
georep4        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep4        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep2        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep2        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
[root at georep1 scripts]#

[root at georep1 scripts]# gluster volume geo-replication master
10.70.46.101::slave status detail

MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE             
    SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED            ENTRY   
DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED   
CHECKPOINT COMPLETION TIME   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:27    0        0       0       0           2015-07-07 16:16:07    No     
                N/A                          
georep1        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:27    0        0       0       0           2015-07-07 16:16:07    No     
                N/A                          
georep2        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep2        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep3        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:33    0        0       0       0           2015-07-07 16:16:07    No     
                N/A                          
georep3        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07
15:39:33    0        0       0       0           2015-07-07 16:16:07    No     
                N/A                          
georep4        master        /rhs/brick1/b1    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
georep4        master        /rhs/brick2/b2    root         
10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A          
         N/A      N/A     N/A     N/A         N/A                    N/A       
             N/A                          
[root at georep1 scripts]#

--- Additional comment from Anand Avati on 2015-07-28 05:26:01 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#1) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Anand Avati on 2015-08-05 00:52:30 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#2) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Anand Avati on 2015-08-12 05:57:41 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#3) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Anand Avati on 2015-08-19 01:59:19 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#4) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Anand Avati on 2015-08-26 01:56:59 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#5) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Vijay Bellur on 2015-09-08 13:17:45 EDT ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#6) for review on master by Aravinda VK (avishwan at redhat.com)

--- Additional comment from Vijay Bellur on 2015-11-03 15:23:31 EST ---

REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in
XSync) posted (#7) for review on master by Jeff Darcy (jdarcy at redhat.com)


Referenced Bugs:

https://bugzilla.redhat.com/show_bug.cgi?id=1044645
[Bug 1044645] Dist-geo-rep : checkpoint doesn't reach even though all the
files have been synced through hybrid crawl.
https://bugzilla.redhat.com/show_bug.cgi?id=1202842
[Bug 1202842] [TRACKER] RHGS 3.1 Tracker BZ
https://bugzilla.redhat.com/show_bug.cgi?id=1223636
[Bug 1223636] 3.1 QE Tracker
https://bugzilla.redhat.com/show_bug.cgi?id=1247536
[Bug 1247536] Dist-geo-rep : checkpoint doesn't reach even though all the
files have been synced through hybrid crawl.
-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list