[Gluster-users] Rsync

Hiren Joshi josh at moonfruit.com
Thu Sep 24 11:13:14 UTC 2009


It's been running for over 24 hours now.
Network traffic is nominal, top shows about 200-400% cpu (7 cores so
it's not too bad).
About 14G of memory used (the rest is being used as disk cache).

Thoughts?



<snip>
> > > > 
> > > > An update, after running the rsync for a day, I killed it 
> > > and remounted
> > > > all the disks (the underlying filesystem, not the gluster) 
> > > with noatime,
> > > > the rsync completed in about 600 minutes. I'm now going to 
> > > try one level
> > > > up (about 1,000,000,000 dirs).
> > > > 
> > > > > -----Original Message-----
> > > > > From: Pavan Vilas Sondur [mailto:pavan at gluster.com] 
> > > > > Sent: 23 September 2009 07:55
> > > > > To: Hiren Joshi
> > > > > Cc: gluster-users at gluster.org
> > > > > Subject: Re: Rsync
> > > > > 
> > > > > Hi Hiren,
> > > > > What glusterfs version are you using? Can you send us the 
> > > > > volfiles and the log files.
> > > > > 
> > > > > Pavan
> > > > > 
> > > > > On 22/09/09 16:01 +0100, Hiren Joshi wrote:
> > > > > > I forgot to mention, the mount is mounted with 
> > > direct-io, would this
> > > > > > make a difference? 
> > > > > > 
> > > > > > > -----Original Message-----
> > > > > > > From: gluster-users-bounces at gluster.org 
> > > > > > > [mailto:gluster-users-bounces at gluster.org] On Behalf Of 
> > > > > Hiren Joshi
> > > > > > > Sent: 22 September 2009 11:40
> > > > > > > To: gluster-users at gluster.org
> > > > > > > Subject: [Gluster-users] Rsync
> > > > > > > 
> > > > > > > Hello all,
> > > > > > >  
> > > > > > > I'm getting what I think is bizarre behaviour.... I have 
> > > > > about 400G to
> > > > > > > rsync (rsync -av) onto a gluster share, the data is 
> > > in a directory
> > > > > > > structure which has about 1000 directories per parent and 
> > > > > about 1000
> > > > > > > directories in each of them.
> > > > > > >  
> > > > > > > When I try to rsync an end leaf directory (this 
> has about 4 
> > > > > > > dirs and 100
> > > > > > > files in each) the operation takes about 10 
> seconds. When I 
> > > > > > > go one level
> > > > > > > above (1000 dirs with about 4 dirs in each with about 100 
> > > > > > > files in each)
> > > > > > > the operation takes about 10 minutes.
> > > > > > >  
> > > > > > > Now, if I then go one level above that (that's 1000 
> > dirs with 
> > > > > > > 1000 dirs
> > > > > > > in each with about 4 dirs in each with about 100 files in 
> > > > > each) the
> > > > > > > operation takes days! Top shows glusterfsd takes 300-600% 
> > > > > cpu usage
> > > > > > > (2X4core), I have about 48G of memory (usage is 0% as 
> > > expected).
> > > > > > >  
> > > > > > > Has anyone seen anything like this? How can I speed it up?
> > > > > > >  
> > > > > > > Thanks,
> > > > > > >  
> > > > > > > Josh.
> > > > > > > 
> > > > > > _______________________________________________
> > > > > > Gluster-users mailing list
> > > > > > Gluster-users at gluster.org
> > > > > > http://gluster.org/cgi-bin/mailman/listinfo/gluster-users
> > > > > 
> > > 
> > _______________________________________________
> > Gluster-users mailing list
> > Gluster-users at gluster.org
> > http://gluster.org/cgi-bin/mailman/listinfo/gluster-users
> > 



More information about the Gluster-users mailing list