<div dir="ltr"><div dir="ltr">Hi,<div><br></div><div>I&#39;ve been doing some tests with the global thread pool [1], and I&#39;ve observed one important thing:</div><div><br></div><div>Since this new thread pool has very low contention (apparently), it exposes other problems when the number of threads grows. What I&#39;ve seen is that some workloads use all available threads on bricks to do I/O, causing avgload to grow rapidly and saturating the machine (or it seems so), which really makes everything slower. Reducing the maximum number of threads improves performance actually. Other workloads, though, do little I/O (probably most is locking or smallfile operations). In this case limiting the number of threads to a small value causes a performance reduction. To increase performance we need more threads.</div><div><br></div><div>So this is making me thing that maybe we should implement some sort of I/O queue with a maximum I/O depth for each brick (or disk if bricks share same disk). This way we can limit the amount of requests physically accessing the underlying FS concurrently, without actually limiting the number of threads that can be doing other things on each brick. I think this could improve performance.</div><div><br></div><div>Maybe this approach could also be useful in client side, but I think it&#39;s not so critical there.</div><div><br></div><div>What do you think ?</div><div><br></div><div>Xavi</div><div><br></div><div>[1] <a href="https://review.gluster.org/c/glusterfs/+/20636">https://review.gluster.org/c/glusterfs/+/20636</a></div></div></div>