[Bugs] [Bug 1535852] New: glusterfind is extremely slow if there are lots of changes

Thu Jan 18 07:41:19 UTC 2018

https://bugzilla.redhat.com/show_bug.cgi?id=1535852

            Bug ID: 1535852
           Summary: glusterfind is extremely slow if there are lots of
                    changes
           Product: Red Hat Gluster Storage
           Version: 3.3
         Component: glusterfind
          Assignee: mchangir at redhat.com
          Reporter: mchangir at redhat.com
        QA Contact: rhinduja at redhat.com
                CC: avishwan at redhat.com, bugs at gluster.org,
                    khiremat at redhat.com, nh2-redhatbugzilla at deditus.de,
                    rhs-bugs at redhat.com, storage-qa-internal at redhat.com
        Depends On: 1529883

+++ This bug was initially created as a clone of Bug #1529883 +++

Description of problem:

I noticed that my glusterfind on 3.12.3 ran for hundreds of hours straight
without terminating.

A quick strace showed that there were tons of pread64() syscalls in between
each open() of a CHANGELOG.* file.

Looking in /proc/$(pidof glusterfind)/fd, I found that the file it's
pread64()ing from is the `tmp_output_1` sqlite file. It was clearly reading the
entire database in via those syscalls for each *line* of each CHANGELOG.* file.

To make it very clear, it was doing:

for each CHANGELOG file:
  for each line in that file:
     read in the entire SQL database contents (9 MB in my case)

Looking into the code, it became clear that glusterfind implements a simple
check for whether a given line of a CHANGELOG.* file is already in the DB:
it checks whether that line's `gfid` is already present in the `gfid` column.

Unfortunately that column didn't have an SQL index defined, thus resulting in a
full scan over the database for each check if the line already exists.

If you use sqlite you must really make sure to use indexes, because otherwise
any O(1) or O(log n) lookup turns into an O(n) operation. Doing one such lookup
per changelog line gives glusterfind O(n²) complexity.
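
The effect described above can be reproduced in a minimal sketch. This is not
the actual glusterfind code: the table name `gfidpath`, its two columns, and
the index name `gfid_index` are assumptions made for illustration. It uses
Python's standard `sqlite3` module and sqlite's `EXPLAIN QUERY PLAN` to show
the same lookup changing from a full table scan to an index search once the
`gfid` column has an index.

```python
import sqlite3

# Hypothetical schema for illustration only; not glusterfind's real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE gfidpath (gfid TEXT, path TEXT)")
conn.executemany(
    "INSERT INTO gfidpath VALUES (?, ?)",
    ((f"gfid-{i}", f"/file-{i}") for i in range(1000)),
)

# Same shape as the existence check quoted in this report.
QUERY = "SELECT COUNT(1) FROM gfidpath WHERE 1=1 AND gfid = ?"


def query_plan(connection, gfid):
    """Return sqlite's plan for the lookup as a single string."""
    rows = connection.execute("EXPLAIN QUERY PLAN " + QUERY, (gfid,)).fetchall()
    # The last column of each plan row is the human-readable detail.
    return "; ".join(row[-1] for row in rows)


plan_before = query_plan(conn, "gfid-42")

# Assumed index name; any index on the gfid column has the same effect.
conn.execute("CREATE INDEX gfid_index ON gfidpath (gfid)")

plan_after = query_plan(conn, "gfid-42")

print("without index:", plan_before)
print("with index:   ", plan_after)
```

The exact wording of the plan text differs between sqlite versions, but
without the index it should report a scan of the table, and with the index a
search using `gfid_index`.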

I will submit a patch.

It makes glusterfind 150x faster for me.

--- Additional comment from Worker Ant on 2017-12-31 02:08:54 IST ---

REVIEW: https://review.gluster.org/19114 (glusterfind: Speed up gfid lookup
100x by using an SQL index) posted (#1) for review on master by Niklas
Hambüchen

--- Additional comment from Worker Ant on 2017-12-31 15:03:38 IST ---

COMMIT: https://review.gluster.org/19114 committed in master by  with a commit
message- glusterfind: Speed up gfid lookup 100x by using an SQL index

Fixes #1529883.

This fixes some bits of `glusterfind`'s horrible performance,
making it 100x faster.

Until now, glusterfind was, for each line in each CHANGELOG.* file,
linearly reading the entire contents of the sqlite database in
4096-byte pread64() syscalls when executing the

  SELECT COUNT(1) FROM %s WHERE 1=1 AND gfid = ?

query through the code path:

  get_changes()
    parse_changelog_to_db()
      when_data_meta()
        gfidpath_exists()
          _exists()

In a quick benchmark on my laptop, doing one such `SELECT` query
took ~75ms on a 10MB-sized sqlite DB, while doing the same query
with an index took < 1ms.

Change-Id: I8e7fe60f1f45a06c102f56b54d2ead9e0377794e
BUG: 1529883
Signed-off-by: Niklas Hambüchen <mail at nh2.me>
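
A benchmark along the lines of the one described in the commit message could
look roughly like the sketch below. It is not the author's benchmark: the
table name `gfidpath`, the index name `gfid_index`, the row count, and the
number of lookups are all assumptions, and it uses an in-memory database
rather than a 10MB file on disk, so absolute timings will differ from the
figures quoted above.

```python
import sqlite3
import time

ROWS = 50_000   # assumed table size, chosen only for illustration
LOOKUPS = 200   # number of existence checks to time

# Hypothetical schema; not glusterfind's real one.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE gfidpath (gfid TEXT, path TEXT)")
conn.executemany(
    "INSERT INTO gfidpath VALUES (?, ?)",
    ((f"gfid-{i}", f"/file-{i}") for i in range(ROWS)),
)
conn.commit()

# Same shape as the query quoted in the commit message.
QUERY = "SELECT COUNT(1) FROM gfidpath WHERE 1=1 AND gfid = ?"


def time_lookups(connection):
    """Run LOOKUPS existence checks; return (elapsed seconds, rows found)."""
    found = 0
    start = time.perf_counter()
    for i in range(LOOKUPS):
        (count,) = connection.execute(QUERY, (f"gfid-{i * 7}",)).fetchone()
        found += count
    return time.perf_counter() - start, found


# First without an index (full scan per lookup), then with one.
seconds_scan, found_scan = time_lookups(conn)
conn.execute("CREATE INDEX gfid_index ON gfidpath (gfid)")
seconds_index, found_index = time_lookups(conn)

print(f"without index: {seconds_scan / LOOKUPS * 1000:.3f} ms per lookup")
print(f"with index:    {seconds_index / LOOKUPS * 1000:.3f} ms per lookup")
```

Both runs return the same answers; only the time per lookup changes, and the
gap widens as the table grows because the unindexed lookup scans every row.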

Referenced Bugs:

https://bugzilla.redhat.com/show_bug.cgi?id=1529883
[Bug 1529883] glusterfind is extremely slow if there are lots of changes