[Gluster-users] MacOSX Finder performance woes

Bryan Whitehead driver at megahappy.net
Thu Dec 20 06:50:58 UTC 2012


This sounds like a perfect use case for
https://github.com/jdarcy/negative-lookup

Jeff Darcy made this as an example of building your own translators. But it
basically caches nagative-lookups so repeated asking of the same questions
go away.

Hell, might be beneficial to make a translator that
just immediately replies "does not exist" for all filenames starting ._*


On Wed, Dec 19, 2012 at 8:00 PM, Adam Tygart <mozes at k-state.edu> wrote:

> Dan,
>
> Does something like this help for SAMBA?
> http://www.e-rave.nl/disable-creation-of-ds_store-files-on-samba-shares
>
> Negative lookups are a particularly expensive operation on distributed
> filesystems, GlusterFS in particular. It has to check through each
> brick to see that the file *really* doesn't exist, increasing the
> lookup overhead immensely. If samba doesn't pass these queries to
> GlusterFS, it might help a lot.
>
> --
> Adam Tygart
> Beocat Sysadmin
>
> On Wed, Dec 19, 2012 at 8:29 PM, Daniel Mons <daemons at kanuka.com.au>
> wrote:
> > I'm rolling out 4 lots of 6-node GlusterFS setups for my employer.  Each
> > node is ~33TB of RAID6 backed storage (16x 3TB SATA disks in RAID6 with a
> > hot spare hanging off an LSI controller, with 2x SSDs configured for
> > caching), and Gluster is configured in distribute-replicate.  Each
> cluster
> > is 200TB of raw space, 100TB usable after replication.  When complete,
> there
> > will be 4 of these clusters.
> >
> > Nodes are configured as XFS with 512byte inodes, running a fully patched
> > CentOS6 and Gluster 3.3.1.  Each node has a 6 core Xeon processor (with
> HT
> > for 12 threads) with 32GB of RAM.  Each node runs 2x 10Gbps Ethernet over
> > fiber in a bonded configuration (single IP address per node) for a full
> > 20Gbits per node.
> >
> > GlusterFS FUSE performance under Linux is great (clients run a mix of
> Ubuntu
> > 12.04 LTS for workstations and CentOS6 for servers).  Samba performance
> back
> > to Windows 7 clients is great.  NFS performance via both Gluster's
> userspace
> > setup as well as CentOS6's native NFS4 kernel server are great to most
> other
> > systems where we can't get the Gluster FUSE client loaded (large
> > industry-specific Linux boxes that are provided by vendors as a "black
> box"
> > solution, and only allow limited access via NFS or SMB/CIFS).  All
> testing
> > so far under those conditions proves orders of magnitude faster
> throughput
> > than our existing single NAS solutions.
> >
> > MacOSX Finder performance is a problem, however.  There's a huge bug in
> > MacOSX itself that prevents using NFS at all (discussions on other
> mailing
> > lists suggest it occurred somewhere around 10.6, and continues through
> into
> > 10.7 and 10.8).
> >
> > Mounting via SMB under OSX is more stable than NFS, however in folders
> with
> > a large amount of files, Finder goes looking for a corresponding Apple
> > Resource Fork file (for every "filename.ext", it looks for a
> > "._filename.ext").  Running tcpdump and wireshark on the Gluster nodes
> shows
> > that the resulting "FILE_NOT_FOUND" error back to the client takes a very
> > long time.  Configuring a single node as a pure NAS with the same
> software
> > (but no Gluster implementation) is lightening fast.   As soon as
> GlusterFS
> > comes in to play, reporting of each "FILE_NOT_FOUND" slows down the
> process
> > dramatically, causing a directory with ~1000 images in it to take well
> over
> > 5 minutes to display the contents in MacOSX finder.
> >
> > This problem is resolved somewhat by switching to AFP (via Netatalk
> loaded
> > on the GlusterFS nodes), but it has it's own problems unique to that
> > protocol, and I'd rather stick to GlusterFS-FUSE, NFS or SMB in that
> order
> > of preference.
> >
> > It's worth noting that through the terminal, these problems don't exist.
> > Mounting via SMB, browsing to the volume in terminal and running "ls" or
> > "find" style commands retrieve file listings at a similar speed to Linux
> and
> > Windows.  The problem is limited to clients using Finder to browse
> > directories, and again particularly ones with a large number of files
> that
> > don't have matching Apple Resource Fork files.  (Of note, creating empty
> > files of the matching "._filename.ext" format solves the performance
> > problem, but litters our filestores with millions of empty files, which
> we
> > don't want).
> >
> > I understand the problem is not strictly Gluster's issue.  Finder is
> looking
> > for a heck of a lot of files that don't exist (which is a pretty silly
> > design), and it tends to occur only with Samba re-exporting GlusterFS
> > volumes that we can see.  And likewise Apple's NFS bug that has now been
> in
> > existence across three releases of their OS is pretty horrible.   But
> > hopefully I can at least describe the problem and prompt some testing by
> > others.
> >
> > I haven't had a chance to test a MacOSX FUSE client due to time
> constraints,
> > but that would at least answer the question if the problem is Gluster's
> lag
> > in reporting of files not found, or Samba's.
> >
> > -Dan
> >
> >
> > _______________________________________________
> > Gluster-users mailing list
> > Gluster-users at gluster.org
> > http://supercolony.gluster.org/mailman/listinfo/gluster-users
> _______________________________________________
> Gluster-users mailing list
> Gluster-users at gluster.org
> http://supercolony.gluster.org/mailman/listinfo/gluster-users
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://supercolony.gluster.org/pipermail/gluster-users/attachments/20121219/6565cac0/attachment.html>


More information about the Gluster-users mailing list