<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Jul 23, 2018 at 4:16 PM, Hu Bert <span dir="ltr"><<a href="mailto:revirii@googlemail.com" target="_blank">revirii@googlemail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Well, over the weekend about 200GB were copied, so now there are<br>
~400GB copied to the brick. That's far beyond a speed of 10GB per<br>
hour. If I copied the 1.6 TB directly, that would be done within max 2<br>
days. But with the self heal this will take at least 20 days minimum.<br>
<br>
Why is the performance that bad? No chance of speeding this up?<br></blockquote><div><br></div><div>What kind of data do you have?</div><div>How many directories in the filesystem?</div><div>On average how many files per directory?</div><div>What is the depth of your directory hierarchy on average?</div><div>What is average filesize?</div><div><br></div><div>Based on this data we can see if anything can be improved. Or if there are some</div><div>enhancements that need to be implemented in gluster to address this kind of</div><div>data layout<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
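If it is easier, a small script along these lines, run directly against one of
the bricks (or the fuse mount), should give rough numbers for all of the above.
This is only a sketch; the brick path is an example and needs to be adjusted to
your setup:

#!/usr/bin/env python
# Rough sketch: walk a brick (or the mount point) and report the number of
# directories, average files per directory, average directory depth and
# average file size. The path is an example; adjust it to your setup.
import os

ROOT = "/gluster/bricksdd1_new/shared"  # example brick path

dirs = files = total_size = total_depth = 0
for path, dirnames, filenames in os.walk(ROOT):
    # Skip gluster's internal .glusterfs directory when scanning a brick,
    # so its hardlinks do not distort the statistics.
    if ".glusterfs" in dirnames:
        dirnames.remove(".glusterfs")
    dirs += 1
    files += len(filenames)
    total_depth += path[len(ROOT):].count(os.sep)
    for name in filenames:
        try:
            total_size += os.lstat(os.path.join(path, name)).st_size
        except OSError:
            pass  # file vanished or is unreadable; ignore it

print("directories:          %d" % dirs)
print("files:                %d" % files)
print("avg files per dir:    %.1f" % (float(files) / max(dirs, 1)))
print("avg directory depth:  %.1f" % (float(total_depth) / max(dirs, 1)))
print("avg file size (KB):   %.1f" % (total_size / 1024.0 / max(files, 1)))

Running it on a brick instead of the mount avoids the network round trips, so
it should finish much faster on a dataset of this size.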
<div class="HOEnZb"><div class="h5"><br>
> 2018-07-20 9:41 GMT+02:00 Hu Bert <revirii@googlemail.com>:
> > hmm... no one has any idea?
> >
> > An additional question: the hdd on server gluster12 was replaced; so far
> > ~220 GB have been copied. On the other 2 servers I see a lot of entries in
> > glustershd.log, about 312,000 and 336,000 entries respectively
> > yesterday, most of them (current log output) looking like this:
> >
> > [2018-07-20 07:30:49.757595] I [MSGID: 108026]
> > [afr-self-heal-common.c:1724:afr_log_selfheal] 0-shared-replicate-3:
> > Completed data selfheal on 0d863a62-0dd8-401c-b699-2b642d9fd2b6.
> > sources=0 [2] sinks=1
> > [2018-07-20 07:30:49.992398] I [MSGID: 108026]
> > [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do]
> > 0-shared-replicate-3: performing metadata selfheal on
> > 0d863a62-0dd8-401c-b699-2b642d9fd2b6
> > [2018-07-20 07:30:50.243551] I [MSGID: 108026]
> > [afr-self-heal-common.c:1724:afr_log_selfheal] 0-shared-replicate-3:
> > Completed metadata selfheal on 0d863a62-0dd8-401c-b699-2b642d9fd2b6.
> > sources=0 [2] sinks=1
> >
> > or like this:
> >
> > [2018-07-20 07:38:41.726943] I [MSGID: 108026]
> > [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do]
> > 0-shared-replicate-3: performing metadata selfheal on
> > 9276097a-cdac-4d12-9dc6-04b1ea4458ba
> > [2018-07-20 07:38:41.855737] I [MSGID: 108026]
> > [afr-self-heal-common.c:1724:afr_log_selfheal] 0-shared-replicate-3:
> > Completed metadata selfheal on 9276097a-cdac-4d12-9dc6-04b1ea4458ba.
> > sources=[0] 2 sinks=1
> > [2018-07-20 07:38:44.755800] I [MSGID: 108026]
> > [afr-self-heal-entry.c:887:afr_selfheal_entry_do]
> > 0-shared-replicate-3: performing entry selfheal on
> > 9276097a-cdac-4d12-9dc6-04b1ea4458ba
> >
> > Is this behaviour normal? I'd expect these messages on the server with
> > the failed brick, not on the other ones.
> >
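To get a better feel for how fast the heal is actually progressing, those
"Completed ... selfheal" messages can simply be tallied per heal type, for
example with a small script like the one below. It is only a sketch; the log
path is the usual default location and may differ on your systems.

#!/usr/bin/env python
# Sketch: count "Completed <type> selfheal" messages in glustershd.log per
# heal type (data / metadata / entry), to see how many heals have finished.
# The log path is the usual default and may differ on your installation.
import re
from collections import Counter

LOG = "/var/log/glusterfs/glustershd.log"  # default location, adjust if needed

pattern = re.compile(r"Completed (data|metadata|entry) selfheal")
counts = Counter()

with open(LOG) as logfile:
    for line in logfile:
        match = pattern.search(line)
        if match:
            counts[match.group(1)] += 1

for heal_type in sorted(counts):
    print("%-8s selfheals completed: %d" % (heal_type, counts[heal_type]))

Comparing the counts from one day to the next gives a rough completion rate in
files per day, which is often more telling than the copied gigabytes when the
files are small.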
> > 2018-07-19 8:31 GMT+02:00 Hu Bert <revirii@googlemail.com>:
> >> Hi there,
> >>
> >> I sent this mail yesterday, but somehow it didn't go through and wasn't
> >> archived, so please bear with me if you receive it twice :-)
> >>
> >> We are currently running a replicate setup and are experiencing quite
> >> poor performance. It got even worse when, within a couple of weeks,
> >> 2 bricks (disks) crashed. Some general information about our setup:
> >>
> >> 3 Dell PowerEdge R530 (Xeon E5-1650 v3 Hexa-Core, 64 GB DDR4, OS on
> >> separate disks); each server has 4 x 10TB disks -> each is a brick;
> >> replica 3 setup (see gluster volume status below). Debian stretch,
> >> kernel 4.9.0, gluster version 3.12.12. Servers and clients are
> >> connected via 10 GBit ethernet.
> >>
> >> About a month ago and again 2 days ago a disk died (on different
> >> servers); the disks were replaced, brought back into the volume, and a
> >> full self heal was started. But the speed of this is quite...
> >> disappointing. Each brick has ~1.6TB of data on it (mostly the infamous
> >> small files). The full heal I started yesterday copied only ~50GB within
> >> 24 hours (48 hours: about 100GB); at this rate it will take weeks until
> >> the self heal finishes.
> >>
> >> After the first heal finished (started on gluster13 about a month ago,
> >> it took about 3 weeks), we had terrible performance; CPU usage on one or
> >> two of the nodes (gluster11, gluster12) went up to 1200%, consumed by
> >> the brick process of the previously crashed brick (bricksdd1),
> >> interestingly not on the server with the failed disk, but on the other
> >> 2 ones...
> >>
> >> Well... am I doing something wrong? Are some options wrongly configured?
> >> Is it a terrible setup? Does anyone have an idea? Is any additional
> >> information needed?
> >>
> >>
> >> Thx in advance :-)
> >>
> >> gluster volume status
> >>
> >> Volume Name: shared
> >> Type: Distributed-Replicate
> >> Volume ID: e879d208-1d8c-4089-85f3-ef1b3aa45d36
> >> Status: Started
> >> Snapshot Count: 0
> >> Number of Bricks: 4 x 3 = 12
> >> Transport-type: tcp
> >> Bricks:
> >> Brick1: gluster11:/gluster/bricksda1/shared
> >> Brick2: gluster12:/gluster/bricksda1/shared
> >> Brick3: gluster13:/gluster/bricksda1/shared
> >> Brick4: gluster11:/gluster/bricksdb1/shared
> >> Brick5: gluster12:/gluster/bricksdb1/shared
> >> Brick6: gluster13:/gluster/bricksdb1/shared
> >> Brick7: gluster11:/gluster/bricksdc1/shared
> >> Brick8: gluster12:/gluster/bricksdc1/shared
> >> Brick9: gluster13:/gluster/bricksdc1/shared
> >> Brick10: gluster11:/gluster/bricksdd1/shared
> >> Brick11: gluster12:/gluster/bricksdd1_new/shared
> >> Brick12: gluster13:/gluster/bricksdd1_new/shared
> >> Options Reconfigured:
> >> cluster.shd-max-threads: 4
> >> performance.md-cache-timeout: 60
> >> cluster.lookup-optimize: on
> >> cluster.readdir-optimize: on
> >> performance.cache-refresh-timeout: 4
> >> performance.parallel-readdir: on
> >> server.event-threads: 8
> >> client.event-threads: 8
> >> performance.cache-max-file-size: 128MB
> >> performance.write-behind-window-size: 16MB
> >> performance.io-thread-count: 64
> >> cluster.min-free-disk: 1%
> >> performance.cache-size: 24GB
> >> nfs.disable: on
> >> transport.address-family: inet
> >> performance.high-prio-threads: 32
> >> performance.normal-prio-threads: 32
> >> performance.low-prio-threads: 32
> >> performance.least-prio-threads: 8
> >> performance.io-cache: on
> >> server.allow-insecure: on
> >> performance.strict-o-direct: off
> >> transport.listen-backlog: 100
> >> server.outstanding-rpc-limit: 128
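On the heal speed itself: one way to watch how much backlog is left on each
brick is to poll the heal counters and compare the numbers over time. Below is
a rough sketch of that idea; it assumes the gluster CLI is in PATH, that it is
run on one of the servers with privileges to talk to glusterd, and it uses the
volume name "shared" from the info above.

#!/usr/bin/env python
# Sketch: print the pending self-heal entry count per brick, so the heal rate
# can be estimated by comparing two runs some hours apart.
# Assumes the gluster CLI is available and the script is run with privileges
# to talk to glusterd (typically as root).
import subprocess

VOLUME = "shared"  # volume name taken from the volume info above

output = subprocess.check_output(
    ["gluster", "volume", "heal", VOLUME, "statistics", "heal-count"])

# Keep only the brick names and their pending-entry counts.
for line in output.decode("utf-8", "replace").splitlines():
    line = line.strip()
    if line.startswith("Brick") or line.startswith("Number of entries"):
        print(line)

Running this a few times several hours apart and looking at the difference in
the counts should give a much better estimate of the real heal rate (and of how
long the remaining data will take) than the copied gigabytes alone.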
--
Pranith