[Bugs] [Bug 1185950] adding replication to a distributed volume makes the volume unavailable

bugzilla at redhat.com bugzilla at redhat.com
Thu Jan 29 15:10:22 UTC 2015


https://bugzilla.redhat.com/show_bug.cgi?id=1185950



--- Comment #5 from pille <pille+redhat+bugzilla at struction.de> ---
i did some more research on that. there seem to be some broken files in the
gluster-mountpoint. whenever you stat them, the mountpoint disconnects:

# ls -lisa

./8634:
ls: cannot access ./8634/copy: Software caused connection abort
ls: cannot access ./8634/random.part: Transport endpoint is not connected
ls: reading directory ./8634: Transport endpoint is not connected
total 215040
?????????? ? ?    ?            ?            ? copy
-rw-r--r-- 3 root root 104857600 Jan  4  2013 file
-rw-r--r-- 1 root root  10485760 Jan  4  2013 file.1st_copy 
-rw-r--r-- 3 root root 104857600 Jan  4  2013 file.2nd_copy 
?????????? ? ?    ?            ?            ? random.part

on the bricks this directory looks like:
storage01:
    total 419840
    25769803936      0 drwxr-xr-x 2 root root       110 Sep  7 16:46 .
             99      0 drwxr-xr-x 6 root root       165 Aug 29 08:52 ..
    25769804012 102400 -rw-r--r-- 4 root root 104857600 Jan  4  2013 copy
    25769804012 102400 -rw-r--r-- 4 root root 104857600 Jan  4  2013 file
    25769804011  10240 -rw-r--r-- 2 root root  10485760 Jan  4  2013
file.1st_copy
    25769804012 102400 -rw-r--r-- 4 root root 104857600 Jan  4  2013
file.2nd_copy
    25769804013 102400 -rw-r--r-- 2 root root 104857600 Jan  4  2013
random.part
storage02:
    total 215040
    21474836640      0 drwxr-xr-x 2 root root        87 Sep  7 16:46 .
    85899346016      0 drwxr-xr-x 6 root root        73 Aug 29 08:52 ..
    21474836716      0 ---------T 2 root root         0 Jan 19 21:25 copy
    21474836715 102400 -rw-r--r-- 2 root root 104857600 Jan  4  2013 file.big
    21474836718  10240 -rw-r--r-- 2 root root  10485760 Jan  4  2013 file.small
    21474836719 102400 -rw-r--r-- 2 root root 104857600 Jan  4  2013
random.full
storage05:
    total 0
    77322450916 0 drwxr-xr-x 2 root root 110 Jan 28 07:45 .
    62290717518 0 drwxr-xr-x 6 root root 165 Jan 27 15:29 ..
    77322572228 0 -rw-r--r-- 4 root root   0 Jan 27 15:30 copy
    77322572228 0 -rw-r--r-- 4 root root   0 Jan 27 15:30 file
    77322572227 0 -rw-r--r-- 2 root root   0 Jan 27 15:30 file.1st_copy
    77322572228 0 -rw-r--r-- 4 root root   0 Jan 27 15:30 file.2nd_copy
    77322572237 0 -rw-r--r-- 2 root root   0 Jan 28 07:45 random.part
storage06:
    total 0
    75162023687 0 drwxr-xr-x 2 root root 87 Jan 28 07:45 .
    53687280280 0 drwxr-xr-x 6 root root 73 Jan 27 10:04 ..
    75163018369 0 ---------T 2 root root  0 Jan 27 15:30 copy
    75163018372 0 -rw-r--r-- 2 root root  0 Jan 28 07:45 file.big
    75163018373 0 -rw-r--r-- 2 root root  0 Jan 28 07:45 file.small
    75163018374 0 -rw-r--r-- 2 root root  0 Jan 28 07:45 random.full

for comparison, this is the directory on the source i rsynced this from:
total 634888
 4294967424      4 drwxr-xr-x 2 root root      4096 Sep  7 16:46 .
60467001539      4 drwxr-xr-x 6 root root      4096 Aug 29 08:52 ..
 4294968218 102400 -rw-r--r-- 3 root root 104857600 Jan  4  2013 copy
 4294968218 102400 -rw-r--r-- 3 root root 104857600 Jan  4  2013 file
 4294968221  10240 -rw-r--r-- 1 root root  10485760 Jan  4  2013 file.1st_copy
 4294968218 102400 -rw-r--r-- 3 root root 104857600 Jan  4  2013 file.2nd_copy
 4294968223 102400 -rw-r--r-- 1 root root 104857600 Jan  4  2013 file.big
 4294968225  10240 -rw-r--r-- 1 root root  10485760 Jan  4  2013 file.small
 4294968227 102400 -rw-r--r-- 1 root root 104857600 Jan  4  2013 random.full
 4294968235 102400 -rw-r--r-- 1 root root 104857600 Jan  4  2013 random.part


there a lots of locations, where those unreadable stat-crashing files are.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are the assignee for the bug.


More information about the Bugs mailing list