[Gluster-users] Performance translators - a overview.

Mon Jun 4 05:43:53 UTC 2012

Hi,

The purpose of performance translators is to decrease system call latency
of applications and increase responsiveness of glusterfs.

The standard approach used within glusterfs to decrease system call latency
is making sure we avoid network roundtrip time as part of the fop
processing. And based on what fop we are dealing with, we have different
translators like read-ahead, io-cache, write-behind, quick-read, md-cache.

   - Though read-ahead and io-cache both serve read calls, the difference
   lies in that read-ahead can even serve first read on an offset (since it
   would have read-ahead on a read with lesser offset) and io-cache can serve
   only requests after first read on an offset from its cache. read-ahead can
   have negative performance impact in the form of cache maintanence on random
   reads.In case of read-ahead, cache is maintained per-fd basis and io-cache
   maintains per-inode cache. Ceiling for cache limits can be configured.
   - write-behind takes the responsibility of storing writes in its cache
   and syncing it with disk in background. Because of this fact, we may not
   able to find out the fate of a write from an application in return value of
   that write. However write-behind communicates errors to application either
   in return value of current or future writes or close call. Paranoid
   applications which need to know errors during any writes previously done,
   should do an fsync. There is another option flush-behind which when turned
   on, makes flush calls sent as part of close, background. The consequence of
   doing flush in background is that posix locks on that fd might not be
   cleared as soon as close returns.
   - quick-read optimizes reads by storing small files in its cache. It
   gets the contents of entire file as part of lookup call done during path to
   inode conversion. It assumes that all opens are done with an intention of
   doing reads and hence doesn't really send open to translators below if the
   file is cached. However, it maintains the abstraction by doing open as part
   of other fd based fops (like fstat, fsync etc). Because of this,
   read-intensive applications like a web-server serving lots of small files,
   can save network round trip for two fops - open and read (It used to save
   close roundtrip call too, but with close implemented as a callback of
   fd-destroy, network roundtrip time is eliminated altogether).
   - md-cache is a translator that caches metadata like stats, certain
   extended attributes of files.

One of the strategies to increase responsiveness is to introduce
asynchronous nature - one doesn't block on a single operation to complete
before taking another - during fop processing. Again asynchronous nature
can be achieved using single or multiple threads. The first approach is
effective only when there are blocking components in the system, like I/O
with network or disk. Performance translators does not do anything helpful
in this aspect (STACK_WIND and STACK_UNWIND macros, non-blocking sockets
etc help here). It is in introducing parallel processing as call proceeds
through gluster translator graph where io-threads (a performance
translator) comes into picture. Apart from introducing parallelism,
io-threads implements priority based processing of fops, which helps to
increase responsiveness. There are other threads within a glusterfs process
which are not maintained by io-threads like fuse-reader, posix janitor, a
thread which polls on network sockets, threads processing send/receive
completion queues in infiniband, threads introduced by syncops, thread
processing timer events etc.

regards,
-- 
Raghavendra G
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://supercolony.gluster.org/pipermail/gluster-users/attachments/20120604/16182ef8/attachment.html>