[Gluster-devel] Doing LS with a lot of directory, files
Krishna Srinivas
krishna at zresearch.com
Thu Apr 24 11:43:55 UTC 2008
Tom,
Are you using latest glusterfs code? There was a change recently
which makes readdir() faster in AFR.
Also note that in your case unify has one subvol, so unify is not
serving any purpose here.
Krishna
On Thu, Apr 24, 2008 at 4:17 PM, Tom Myny <tom.myny at tigron.be> wrote:
> Hello,
>
> I'm running afr on two storage servers, with three clients.
> For the moment, we have copied over 500 million small files on it, splitting
> into each directory which contains 1000 files.
>
> When doing ls in directory containing 1000 directory's we have the following
> issue:
>
>
> - Ls is taking more then 15 minutes to complete in a directory with 1000
> folders. (this will be split also to 100 folders later, but it's now a big
> problem)
> -> Yes, for now its ls --color=auto by default on debian :D
> - When doing copies from other clients, those copies halt until that ls is
> complete.
>
>
> Is there a way to
>
> 1) Do a ls faster (ok, I know it can be that fast like on the filesystem
> itself, but on the filesystem (or an nfs system) it takes max 15 seconds)
> 2) When someone is doing an ls, the other processes are not freesing.
> (checking on the storage servers, we have a load of 0.00)
>
> The filesystems we use are based on xfs.
> An example of a server config:
>
> volume sas-ds
> type storage/posix
> option directory /sas/data
> end-volume
>
> volume sas-ns
> type storage/posix
> option directory /sas/ns
> end-volume
>
> volume sata-ds
> type storage/posix
> option directory /sata/data
> end-volume
>
> volume sata-ns
> type storage/posix
> option directory /sata/ns
> end-volume
>
> volume sas-backup-ds
> type protocol/client
> option transport-type tcp/client
> option remote-host x.x.x.x
> option remote-subvolume sas-ds
> end-volume
>
> volume sas-backup-ns
> type protocol/client
> option transport-type tcp/client
> option remote-host x.x.x.x
> option remote-subvolume sas-ns
> end-volume
>
> ...
>
> volume sas-unify
> type cluster/unify
> subvolumes sas-ds-afr
> option namespace sas-ns-afr
> option scheduler rr
> end-volume
>
> volume sata-unify
> type cluster/unify
> subvolumes sata-ds-afr
> option namespace sata-ns-afr
> option scheduler rr
> end-volume
>
> volume sas
> type performance/io-threads
> option thread-count 16
> option cache-size 256MB
> subvolumes sas-unify
> end-volume
>
> volume sata
> type performance/io-threads
> option thread-count 16
> option cache-size 256MB
> subvolumes sata-unify
> end-volume
>
> ..
>
> I hope to fix this, because we want to double this next year :)
>
>
> Regards,
> Tom
>
>
>
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel at nongnu.org
> http://lists.nongnu.org/mailman/listinfo/gluster-devel
>
More information about the Gluster-devel
mailing list