From amukherj at redhat.com Sat Jun 1 11:25:12 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Sat, 1 Jun 2019 16:55:12 +0530 Subject: [Gluster-devel] Fwd: [Gluster-Maintainers] Build failed in Jenkins: regression-test-with-multiplex #1359 In-Reply-To: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> References: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: subdir-mount.t has started failing in brick mux regression nightly. This needs to be fixed. Raghavendra - did we manage to get any further clue on uss.t failure? ---------- Forwarded message --------- From: Date: Fri, 31 May 2019 at 23:34 Subject: [Gluster-Maintainers] Build failed in Jenkins: regression-test-with-multiplex #1359 To: , , , < amukherj at redhat.com>, See < https://build.gluster.org/job/regression-test-with-multiplex/1359/display/redirect?page=changes > Changes: [atin] glusterd: add an op-version check [atin] glusterd/svc: glusterd_svcs_stop should call individual wrapper function [atin] glusterd/svc: Stop stale process using the glusterd_proc_stop [Amar Tumballi] lcov: more coverage to shard, old-protocol, sdfs [Kotresh H R] tests/geo-rep: Add EC volume test case [Amar Tumballi] glusterfsd/cleanup: Protect graph object under a lock [Mohammed Rafi KC] glusterd/shd: Optimize the glustershd manager to send reconfigure [Kotresh H R] tests/geo-rep: Add tests to cover glusterd geo-rep [atin] glusterd: Optimize code to copy dictionary in handshake code path ------------------------------------------ [...truncated 3.18 MB...] ./tests/basic/afr/stale-file-lookup.t - 9 second ./tests/basic/afr/granular-esh/replace-brick.t - 9 second ./tests/basic/afr/granular-esh/add-brick.t - 9 second ./tests/basic/afr/gfid-mismatch.t - 9 second ./tests/performance/open-behind.t - 8 second ./tests/features/ssl-authz.t - 8 second ./tests/features/readdir-ahead.t - 8 second ./tests/bugs/upcall/bug-1458127.t - 8 second ./tests/bugs/transport/bug-873367.t - 8 second ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 8 second ./tests/bugs/replicate/bug-1132102.t - 8 second ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove-quota-quota-deem-statfs.t - 8 second ./tests/bugs/quota/bug-1104692.t - 8 second ./tests/bugs/posix/bug-1360679.t - 8 second ./tests/bugs/posix/bug-1122028.t - 8 second ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second ./tests/bugs/glusterfs/bug-861015-log.t - 8 second ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second ./tests/bugs/glusterd/bug-1696046.t - 8 second ./tests/bugs/fuse/bug-983477.t - 8 second ./tests/bugs/ec/bug-1227869.t - 8 second ./tests/bugs/distribute/bug-1088231.t - 8 second ./tests/bugs/distribute/bug-1086228.t - 8 second ./tests/bugs/cli/bug-1087487.t - 8 second ./tests/bugs/cli/bug-1022905.t - 8 second ./tests/bugs/bug-1258069.t - 8 second ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub-info.t - 8 second ./tests/basic/xlator-pass-through-sanity.t - 8 second ./tests/basic/quota-nfs.t - 8 second ./tests/basic/glusterd/arbiter-volume.t - 8 second ./tests/basic/ctime/ctime-noatime.t - 8 second ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second ./tests/gfid2path/get-gfid-to-path.t - 7 second ./tests/bugs/upcall/bug-1369430.t - 7 second ./tests/bugs/snapshot/bug-1260848.t - 7 second ./tests/bugs/shard/shard-inode-refcount-test.t - 7 second ./tests/bugs/shard/bug-1258334.t - 7 second ./tests/bugs/replicate/bug-767585-gfid.t - 7 second 
./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 second ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second ./tests/bugs/posix/bug-1175711.t - 7 second ./tests/bugs/nfs/bug-915280.t - 7 second ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second ./tests/bugs/md-cache/bug-1211863_unlink.t - 7 second ./tests/bugs/glusterfs/bug-848251.t - 7 second ./tests/bugs/distribute/bug-1122443.t - 7 second ./tests/bugs/changelog/bug-1208470.t - 7 second ./tests/bugs/bug-1702299.t - 7 second ./tests/bugs/bug-1371806_2.t - 7 second ./tests/bugs/bitrot/1209818-vol-info-show-scrub-process-properly.t - 7 second ./tests/bugs/bitrot/1209751-bitrot-scrub-tunable-reset.t - 7 second ./tests/bugs/bitrot/1207029-bitrot-daemon-should-start-on-valid-node.t - 7 second ./tests/bitrot/br-stub.t - 7 second ./tests/basic/glusterd/arbiter-volume-probe.t - 7 second ./tests/basic/gfapi/libgfapi-fini-hang.t - 7 second ./tests/basic/fencing/fencing-crash-conistency.t - 7 second ./tests/basic/distribute/file-create.t - 7 second ./tests/basic/afr/tarissue.t - 7 second ./tests/basic/afr/gfid-heal.t - 7 second ./tests/bugs/snapshot/bug-1178079.t - 6 second ./tests/bugs/snapshot/bug-1064768.t - 6 second ./tests/bugs/shard/bug-1342298.t - 6 second ./tests/bugs/shard/bug-1259651.t - 6 second ./tests/bugs/replicate/bug-1686568-send-truncate-on-arbiter-from-shd.t - 6 second ./tests/bugs/replicate/bug-1626994-info-split-brain.t - 6 second ./tests/bugs/replicate/bug-1325792.t - 6 second ./tests/bugs/replicate/bug-1101647.t - 6 second ./tests/bugs/quota/bug-1243798.t - 6 second ./tests/bugs/protocol/bug-1321578.t - 6 second ./tests/bugs/nfs/bug-877885.t - 6 second ./tests/bugs/nfs/bug-1143880-fix-gNFSd-auth-crash.t - 6 second ./tests/bugs/md-cache/bug-1476324.t - 6 second ./tests/bugs/md-cache/afr-stale-read.t - 6 second ./tests/bugs/io-cache/bug-858242.t - 6 second ./tests/bugs/glusterfs/bug-893378.t - 6 second ./tests/bugs/glusterfs/bug-856455.t - 6 second ./tests/bugs/glusterd/quorum-value-check.t - 6 second ./tests/bugs/ec/bug-1179050.t - 6 second ./tests/bugs/distribute/bug-912564.t - 6 second ./tests/bugs/distribute/bug-884597.t - 6 second ./tests/bugs/distribute/bug-1368012.t - 6 second ./tests/bugs/core/bug-986429.t - 6 second ./tests/bugs/core/bug-1699025-brick-mux-detach-brick-fd-issue.t - 6 second ./tests/bugs/core/bug-1168803-snapd-option-validation-fix.t - 6 second ./tests/bugs/bug-1371806_1.t - 6 second ./tests/bugs/bitrot/bug-1229134-bitd-not-support-vol-set.t - 6 second ./tests/bugs/bitrot/bug-1210684-scrub-pause-resume-error-handling.t - 6 second ./tests/bitrot/bug-1221914.t - 6 second ./tests/basic/trace.t - 6 second ./tests/basic/playground/template-xlator-sanity.t - 6 second ./tests/basic/ec/nfs.t - 6 second ./tests/basic/ec/ec-read-policy.t - 6 second ./tests/basic/ec/ec-anonymous-fd.t - 6 second ./tests/basic/distribute/non-root-unlink-stale-linkto.t - 6 second ./tests/basic/changelog/changelog-rename.t - 6 second ./tests/basic/afr/heal-info.t - 6 second ./tests/basic/afr/afr-read-hash-mode.t - 6 second ./tests/gfid2path/gfid2path_nfs.t - 5 second ./tests/bugs/upcall/bug-1422776.t - 5 second ./tests/bugs/replicate/bug-886998.t - 5 second ./tests/bugs/replicate/bug-1365455.t - 5 second ./tests/bugs/readdir-ahead/bug-1670253-consistent-metadata.t - 5 second ./tests/bugs/posix/bug-gfid-path.t - 5 second ./tests/bugs/posix/bug-765380.t - 5 second ./tests/bugs/nfs/bug-847622.t - 5 second ./tests/bugs/nfs/bug-1116503.t - 5 second ./tests/bugs/io-stats/bug-1598548.t - 5 second 
./tests/bugs/glusterfs-server/bug-877992.t - 5 second ./tests/bugs/glusterfs-server/bug-873549.t - 5 second ./tests/bugs/glusterfs/bug-895235.t - 5 second ./tests/bugs/fuse/bug-1126048.t - 5 second ./tests/bugs/distribute/bug-907072.t - 5 second ./tests/bugs/core/bug-913544.t - 5 second ./tests/bugs/core/bug-908146.t - 5 second ./tests/bugs/access-control/bug-1051896.t - 5 second ./tests/basic/ec/ec-internal-xattrs.t - 5 second ./tests/basic/ec/ec-fallocate.t - 5 second ./tests/basic/distribute/bug-1265677-use-readdirp.t - 5 second ./tests/basic/afr/arbiter-remove-brick.t - 5 second ./tests/performance/quick-read.t - 4 second ./tests/gfid2path/block-mount-access.t - 4 second ./tests/features/delay-gen.t - 4 second ./tests/bugs/upcall/bug-upcall-stat.t - 4 second ./tests/bugs/upcall/bug-1394131.t - 4 second ./tests/bugs/unclassified/bug-1034085.t - 4 second ./tests/bugs/snapshot/bug-1111041.t - 4 second ./tests/bugs/shard/bug-1272986.t - 4 second ./tests/bugs/shard/bug-1256580.t - 4 second ./tests/bugs/shard/bug-1250855.t - 4 second ./tests/bugs/shard/bug-1245547.t - 4 second ./tests/bugs/rpc/bug-954057.t - 4 second ./tests/bugs/replicate/bug-976800.t - 4 second ./tests/bugs/replicate/bug-880898.t - 4 second ./tests/bugs/replicate/bug-1480525.t - 4 second ./tests/bugs/read-only/bug-1134822-read-only-default-in-graph.t - 4 second ./tests/bugs/readdir-ahead/bug-1446516.t - 4 second ./tests/bugs/readdir-ahead/bug-1439640.t - 4 second ./tests/bugs/readdir-ahead/bug-1390050.t - 4 second ./tests/bugs/quota/bug-1287996.t - 4 second ./tests/bugs/quick-read/bug-846240.t - 4 second ./tests/bugs/posix/disallow-gfid-volumeid-removexattr.t - 4 second ./tests/bugs/posix/bug-1619720.t - 4 second ./tests/bugs/nl-cache/bug-1451588.t - 4 second ./tests/bugs/nfs/zero-atime.t - 4 second ./tests/bugs/nfs/subdir-trailing-slash.t - 4 second ./tests/bugs/nfs/socket-as-fifo.t - 4 second ./tests/bugs/nfs/showmount-many-clients.t - 4 second ./tests/bugs/nfs/bug-1210338.t - 4 second ./tests/bugs/nfs/bug-1166862.t - 4 second ./tests/bugs/nfs/bug-1161092-nfs-acls.t - 4 second ./tests/bugs/md-cache/bug-1632503.t - 4 second ./tests/bugs/glusterfs-server/bug-864222.t - 4 second ./tests/bugs/glusterfs/bug-1482528.t - 4 second ./tests/bugs/glusterd/bug-948729/bug-948729-mode-script.t - 4 second ./tests/bugs/glusterd/bug-948729/bug-948729-force.t - 4 second ./tests/bugs/glusterd/bug-1482906-peer-file-blank-line.t - 4 second ./tests/bugs/glusterd/bug-1091935-brick-order-check-from-cli-to-glusterd.t - 4 second ./tests/bugs/geo-replication/bug-1296496.t - 4 second ./tests/bugs/fuse/bug-1336818.t - 4 second ./tests/bugs/fuse/bug-1283103.t - 4 second ./tests/bugs/core/io-stats-1322825.t - 4 second ./tests/bugs/core/bug-834465.t - 4 second ./tests/bugs/core/bug-1135514-allow-setxattr-with-null-value.t - 4 second ./tests/bugs/core/949327.t - 4 second ./tests/bugs/cli/bug-977246.t - 4 second ./tests/bugs/cli/bug-961307.t - 4 second ./tests/bugs/cli/bug-1004218.t - 4 second ./tests/bugs/bug-1138841.t - 4 second ./tests/bugs/access-control/bug-1387241.t - 4 second ./tests/bitrot/bug-internal-xattrs-check-1243391.t - 4 second ./tests/basic/quota-rename.t - 4 second ./tests/basic/hardlink-limit.t - 4 second ./tests/basic/ec/dht-rename.t - 4 second ./tests/basic/distribute/lookup.t - 4 second ./tests/line-coverage/meta-max-coverage.t - 3 second ./tests/gfid2path/gfid2path_fuse.t - 3 second ./tests/bugs/unclassified/bug-991622.t - 3 second ./tests/bugs/trace/bug-797171.t - 3 second ./tests/bugs/glusterfs-server/bug-861542.t - 3 second 
./tests/bugs/glusterfs/bug-869724.t - 3 second ./tests/bugs/glusterfs/bug-860297.t - 3 second ./tests/bugs/glusterfs/bug-844688.t - 3 second ./tests/bugs/glusterd/bug-948729/bug-948729.t - 3 second ./tests/bugs/distribute/bug-1204140.t - 3 second ./tests/bugs/core/bug-924075.t - 3 second ./tests/bugs/core/bug-845213.t - 3 second ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second ./tests/bugs/core/bug-1119582.t - 3 second ./tests/bugs/core/bug-1117951.t - 3 second ./tests/bugs/cli/bug-983317-volume-get.t - 3 second ./tests/bugs/cli/bug-867252.t - 3 second ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second ./tests/basic/fops-sanity.t - 3 second ./tests/basic/fencing/test-fence-option.t - 3 second ./tests/basic/distribute/debug-xattrs.t - 3 second ./tests/basic/afr/ta-check-locks.t - 3 second ./tests/line-coverage/volfile-with-all-graph-syntax.t - 2 second ./tests/line-coverage/some-features-in-libglusterfs.t - 2 second ./tests/bugs/shard/bug-1261773.t - 2 second ./tests/bugs/replicate/bug-884328.t - 2 second ./tests/bugs/readdir-ahead/bug-1512437.t - 2 second ./tests/bugs/nfs/bug-970070.t - 2 second ./tests/bugs/nfs/bug-1302948.t - 2 second ./tests/bugs/logging/bug-823081.t - 2 second ./tests/bugs/glusterfs-server/bug-889996.t - 2 second ./tests/bugs/glusterfs/bug-892730.t - 2 second ./tests/bugs/glusterfs/bug-811493.t - 2 second ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second ./tests/bugs/distribute/bug-924265.t - 2 second ./tests/bugs/core/log-bug-1362520.t - 2 second ./tests/bugs/core/bug-903336.t - 2 second ./tests/bugs/core/bug-1111557.t - 2 second ./tests/bugs/cli/bug-969193.t - 2 second ./tests/bugs/cli/bug-949298.t - 2 second ./tests/bugs/cli/bug-921215.t - 2 second ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second ./tests/basic/peer-parsing.t - 2 second ./tests/basic/md-cache/bug-1418249.t - 2 second ./tests/basic/afr/arbiter-cli.t - 2 second ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second ./tests/bugs/glusterfs/bug-853690.t - 1 second ./tests/bugs/cli/bug-764638.t - 1 second ./tests/bugs/cli/bug-1047378.t - 1 second ./tests/basic/netgroup_parsing.t - 1 second ./tests/basic/gfapi/sink.t - 1 second ./tests/basic/exports_parsing.t - 1 second ./tests/basic/posixonly.t - 0 second ./tests/basic/glusterfsd-args.t - 0 second 2 test(s) failed ./tests/basic/uss.t ./tests/features/subdir-mount.t 0 test(s) generated core 5 test(s) needed retry ./tests/basic/afr/split-brain-favorite-child-policy.t ./tests/basic/ec/self-heal.t ./tests/basic/uss.t ./tests/basic/volfile-sanity.t ./tests/features/subdir-mount.t Result is 1 tar: Removing leading `/' from member names kernel.core_pattern = /%e-%p.core Build step 'Execute shell' marked build as failure _______________________________________________ maintainers mailing list maintainers at gluster.org https://lists.gluster.org/mailman/listinfo/maintainers -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... 
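A practical way to chase spurious failures like uss.t and subdir-mount.t above is to loop the single test locally until it trips, instead of waiting for the next nightly run. The helper below is only a rough sketch and not part of the tree: the test path is just an example, and it assumes the perl 'prove' TAP harness used to drive .t files is available along with the usual regression prerequisites (a built tree, root access).

#!/usr/bin/env python
# loop-test.py: rerun one regression .t file until it fails, to help
# reproduce a spurious failure locally. Rough sketch, not part of the tree.
import subprocess
import sys

def loop_test(test_path, max_runs=50):
    for run in range(1, max_runs + 1):
        # -v keeps the per-TEST output so the failing step can be inspected.
        ret = subprocess.call(['prove', '-v', test_path])
        print('run %d of %s: exit code %d' % (run, test_path, ret))
        if ret != 0:
            return run
    return 0

if __name__ == '__main__':
    # e.g. python loop-test.py ./tests/features/subdir-mount.t
    sys.exit(1 if loop_test(sys.argv[1]) else 0)

Capturing /var/log/glusterfs from the first failing iteration usually gives more to work with than the Jenkins console output alone.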
URL: From amukherj at redhat.com Sat Jun 1 11:27:20 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Sat, 1 Jun 2019 16:57:20 +0530 Subject: [Gluster-devel] Fwd: [Gluster-Maintainers] Build failed in Jenkins: regression-test-with-multiplex #1357 In-Reply-To: <727602310.89.1559238721974.JavaMail.jenkins@jenkins-el7.rht.gluster.org> References: <1056764480.87.1559168297540.JavaMail.jenkins@jenkins-el7.rht.gluster.org> <727602310.89.1559238721974.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: Rafi - tests/bugs/glusterd/serializ e-shd-manager-glusterd- restart.t seems to be failing often. Can you please investigate the reason of this spurious failure? ---------- Forwarded message --------- From: Date: Thu, 30 May 2019 at 23:22 Subject: [Gluster-Maintainers] Build failed in Jenkins: regression-test-with-multiplex #1357 To: , See < https://build.gluster.org/job/regression-test-with-multiplex/1357/display/redirect?page=changes > Changes: [Xavi Hernandez] tests: add tests for different signal handling [Xavi Hernandez] marker: remove some unused functions [Xavi Hernandez] glusterd: coverity fix ------------------------------------------ [...truncated 2.92 MB...] ./tests/basic/ec/ec-root-heal.t - 9 second ./tests/basic/afr/ta-write-on-bad-brick.t - 9 second ./tests/basic/afr/ta.t - 9 second ./tests/basic/afr/gfid-mismatch.t - 9 second ./tests/performance/open-behind.t - 8 second ./tests/features/ssl-authz.t - 8 second ./tests/features/readdir-ahead.t - 8 second ./tests/features/lock-migration/lkmigration-set-option.t - 8 second ./tests/bugs/replicate/bug-921231.t - 8 second ./tests/bugs/replicate/bug-1686568-send-truncate-on-arbiter-from-shd.t - 8 second ./tests/bugs/replicate/bug-1132102.t - 8 second ./tests/bugs/posix/bug-990028.t - 8 second ./tests/bugs/posix/bug-1360679.t - 8 second ./tests/bugs/nfs/bug-915280.t - 8 second ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second ./tests/bugs/glusterfs/bug-872923.t - 8 second ./tests/bugs/glusterfs/bug-861015-log.t - 8 second ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second ./tests/bugs/glusterd/bug-1696046.t - 8 second ./tests/bugs/distribute/bug-1088231.t - 8 second ./tests/bugs/distribute/bug-1086228.t - 8 second ./tests/bugs/cli/bug-1087487.t - 8 second ./tests/bugs/bug-1258069.t - 8 second ./tests/bugs/bitrot/1209818-vol-info-show-scrub-process-properly.t - 8 second ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub-info.t - 8 second ./tests/basic/quota-nfs.t - 8 second ./tests/basic/ec/statedump.t - 8 second ./tests/basic/ctime/ctime-noatime.t - 8 second ./tests/basic/afr/ta-shd.t - 8 second ./tests/basic/afr/arbiter-remove-brick.t - 8 second ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second ./tests/gfid2path/get-gfid-to-path.t - 7 second ./tests/gfid2path/block-mount-access.t - 7 second ./tests/bugs/upcall/bug-1369430.t - 7 second ./tests/bugs/transport/bug-873367.t - 7 second ./tests/bugs/snapshot/bug-1260848.t - 7 second ./tests/bugs/snapshot/bug-1064768.t - 7 second ./tests/bugs/shard/shard-inode-refcount-test.t - 7 second ./tests/bugs/shard/bug-1258334.t - 7 second ./tests/bugs/replicate/bug-1626994-info-split-brain.t - 7 second ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 7 second ./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 second ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second ./tests/bugs/replicate/bug-1101647.t - 7 second ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove-quota-quota-deem-statfs.t - 7 second 
./tests/bugs/quota/bug-1104692.t - 7 second ./tests/bugs/posix/bug-1175711.t - 7 second ./tests/bugs/posix/bug-1122028.t - 7 second ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second ./tests/bugs/glusterfs/bug-848251.t - 7 second ./tests/bugs/ec/bug-1227869.t - 7 second ./tests/bugs/distribute/bug-884597.t - 7 second ./tests/bugs/distribute/bug-1122443.t - 7 second ./tests/bugs/changelog/bug-1208470.t - 7 second ./tests/bugs/bug-1371806_2.t - 7 second ./tests/bugs/bitrot/1209751-bitrot-scrub-tunable-reset.t - 7 second ./tests/bitrot/bug-1221914.t - 7 second ./tests/bitrot/br-stub.t - 7 second ./tests/basic/xlator-pass-through-sanity.t - 7 second ./tests/basic/trace.t - 7 second ./tests/basic/glusterd/arbiter-volume-probe.t - 7 second ./tests/basic/gfapi/libgfapi-fini-hang.t - 7 second ./tests/basic/distribute/file-create.t - 7 second ./tests/basic/afr/tarissue.t - 7 second ./tests/basic/afr/gfid-heal.t - 7 second ./tests/bugs/shard/bug-1342298.t - 6 second ./tests/bugs/shard/bug-1272986.t - 6 second ./tests/bugs/shard/bug-1259651.t - 6 second ./tests/bugs/replicate/bug-767585-gfid.t - 6 second ./tests/bugs/replicate/bug-1325792.t - 6 second ./tests/bugs/readdir-ahead/bug-1670253-consistent-metadata.t - 6 second ./tests/bugs/quota/bug-1243798.t - 6 second ./tests/bugs/protocol/bug-1321578.t - 6 second ./tests/bugs/posix/bug-765380.t - 6 second ./tests/bugs/nfs/bug-877885.t - 6 second ./tests/bugs/nfs/bug-847622.t - 6 second ./tests/bugs/nfs/bug-1143880-fix-gNFSd-auth-crash.t - 6 second ./tests/bugs/md-cache/bug-1211863_unlink.t - 6 second ./tests/bugs/io-stats/bug-1598548.t - 6 second ./tests/bugs/io-cache/bug-858242.t - 6 second ./tests/bugs/glusterfs/bug-893378.t - 6 second ./tests/bugs/glusterfs/bug-856455.t - 6 second ./tests/bugs/glusterd/quorum-value-check.t - 6 second ./tests/bugs/fuse/bug-1126048.t - 6 second ./tests/bugs/ec/bug-1179050.t - 6 second ./tests/bugs/distribute/bug-912564.t - 6 second ./tests/bugs/distribute/bug-1368012.t - 6 second ./tests/bugs/core/bug-986429.t - 6 second ./tests/bugs/core/bug-1699025-brick-mux-detach-brick-fd-issue.t - 6 second ./tests/bugs/core/bug-1168803-snapd-option-validation-fix.t - 6 second ./tests/bugs/bug-1702299.t - 6 second ./tests/bugs/bug-1371806_1.t - 6 second ./tests/bugs/bitrot/1207029-bitrot-daemon-should-start-on-valid-node.t - 6 second ./tests/basic/playground/template-xlator-sanity.t - 6 second ./tests/basic/fencing/fencing-crash-conistency.t - 6 second ./tests/basic/ec/nfs.t - 6 second ./tests/basic/ec/ec-read-policy.t - 6 second ./tests/basic/ec/ec-anonymous-fd.t - 6 second ./tests/basic/afr/afr-read-hash-mode.t - 6 second ./tests/gfid2path/gfid2path_nfs.t - 5 second ./tests/features/delay-gen.t - 5 second ./tests/bugs/unclassified/bug-1034085.t - 5 second ./tests/bugs/snapshot/bug-1178079.t - 5 second ./tests/bugs/snapshot/bug-1111041.t - 5 second ./tests/bugs/shard/bug-1256580.t - 5 second ./tests/bugs/replicate/bug-1365455.t - 5 second ./tests/bugs/posix/bug-gfid-path.t - 5 second ./tests/bugs/nfs/bug-1166862.t - 5 second ./tests/bugs/md-cache/bug-1632503.t - 5 second ./tests/bugs/md-cache/afr-stale-read.t - 5 second ./tests/bugs/glusterfs-server/bug-877992.t - 5 second ./tests/bugs/glusterfs-server/bug-873549.t - 5 second ./tests/bugs/glusterfs-server/bug-864222.t - 5 second ./tests/bugs/glusterfs/bug-895235.t - 5 second ./tests/bugs/glusterfs/bug-1482528.t - 5 second ./tests/bugs/glusterd/bug-948729/bug-948729-force.t - 5 second ./tests/bugs/glusterd/bug-1091935-brick-order-check-from-cli-to-glusterd.t - 5 second 
./tests/bugs/geo-replication/bug-1296496.t - 5 second ./tests/bugs/distribute/bug-907072.t - 5 second ./tests/bugs/core/bug-913544.t - 5 second ./tests/bugs/core/bug-834465.t - 5 second ./tests/bugs/bitrot/bug-1229134-bitd-not-support-vol-set.t - 5 second ./tests/bugs/bitrot/bug-1210684-scrub-pause-resume-error-handling.t - 5 second ./tests/bugs/access-control/bug-1051896.t - 5 second ./tests/basic/hardlink-limit.t - 5 second ./tests/basic/ec/ec-fallocate.t - 5 second ./tests/basic/ec/dht-rename.t - 5 second ./tests/basic/distribute/non-root-unlink-stale-linkto.t - 5 second ./tests/basic/changelog/changelog-rename.t - 5 second ./tests/basic/afr/heal-info.t - 5 second ./tests/performance/quick-read.t - 4 second ./tests/gfid2path/gfid2path_fuse.t - 4 second ./tests/bugs/upcall/bug-upcall-stat.t - 4 second ./tests/bugs/upcall/bug-1422776.t - 4 second ./tests/bugs/upcall/bug-1394131.t - 4 second ./tests/bugs/trace/bug-797171.t - 4 second ./tests/bugs/shard/bug-1250855.t - 4 second ./tests/bugs/rpc/bug-954057.t - 4 second ./tests/bugs/replicate/bug-976800.t - 4 second ./tests/bugs/replicate/bug-886998.t - 4 second ./tests/bugs/replicate/bug-880898.t - 4 second ./tests/bugs/replicate/bug-1480525.t - 4 second ./tests/bugs/read-only/bug-1134822-read-only-default-in-graph.t - 4 second ./tests/bugs/readdir-ahead/bug-1446516.t - 4 second ./tests/bugs/readdir-ahead/bug-1439640.t - 4 second ./tests/bugs/readdir-ahead/bug-1390050.t - 4 second ./tests/bugs/quota/bug-1287996.t - 4 second ./tests/bugs/quick-read/bug-846240.t - 4 second ./tests/bugs/nl-cache/bug-1451588.t - 4 second ./tests/bugs/nfs/subdir-trailing-slash.t - 4 second ./tests/bugs/nfs/socket-as-fifo.t - 4 second ./tests/bugs/nfs/showmount-many-clients.t - 4 second ./tests/bugs/nfs/bug-1210338.t - 4 second ./tests/bugs/nfs/bug-1161092-nfs-acls.t - 4 second ./tests/bugs/nfs/bug-1116503.t - 4 second ./tests/bugs/md-cache/bug-1476324.t - 4 second ./tests/bugs/glusterfs/bug-869724.t - 4 second ./tests/bugs/glusterd/bug-948729/bug-948729.t - 4 second ./tests/bugs/fuse/bug-1283103.t - 4 second ./tests/bugs/core/io-stats-1322825.t - 4 second ./tests/bugs/core/bug-924075.t - 4 second ./tests/bugs/core/bug-908146.t - 4 second ./tests/bugs/core/949327.t - 4 second ./tests/bugs/cli/bug-983317-volume-get.t - 4 second ./tests/bugs/cli/bug-977246.t - 4 second ./tests/bugs/cli/bug-961307.t - 4 second ./tests/bugs/cli/bug-1004218.t - 4 second ./tests/bugs/bug-1138841.t - 4 second ./tests/bugs/access-control/bug-1387241.t - 4 second ./tests/basic/quota-rename.t - 4 second ./tests/basic/fencing/test-fence-option.t - 4 second ./tests/basic/ec/ec-internal-xattrs.t - 4 second ./tests/basic/distribute/lookup.t - 4 second ./tests/basic/distribute/bug-1265677-use-readdirp.t - 4 second ./tests/line-coverage/volfile-with-all-graph-syntax.t - 3 second ./tests/line-coverage/some-features-in-libglusterfs.t - 3 second ./tests/bugs/unclassified/bug-991622.t - 3 second ./tests/bugs/readdir-ahead/bug-1512437.t - 3 second ./tests/bugs/posix/disallow-gfid-volumeid-removexattr.t - 3 second ./tests/bugs/posix/bug-1619720.t - 3 second ./tests/bugs/nfs/zero-atime.t - 3 second ./tests/bugs/glusterfs/bug-844688.t - 3 second ./tests/bugs/glusterd/bug-948729/bug-948729-mode-script.t - 3 second ./tests/bugs/glusterd/bug-1482906-peer-file-blank-line.t - 3 second ./tests/bugs/fuse/bug-1336818.t - 3 second ./tests/bugs/core/log-bug-1362520.t - 3 second ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second ./tests/bugs/core/bug-1135514-allow-setxattr-with-null-value.t - 3 second 
./tests/bugs/core/bug-1119582.t - 3 second ./tests/bugs/core/bug-1117951.t - 3 second ./tests/bugs/cli/bug-867252.t - 3 second ./tests/bitrot/bug-internal-xattrs-check-1243391.t - 3 second ./tests/basic/md-cache/bug-1418249.t - 3 second ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second ./tests/basic/fops-sanity.t - 3 second ./tests/basic/distribute/debug-xattrs.t - 3 second ./tests/line-coverage/meta-max-coverage.t - 2 second ./tests/bugs/shard/bug-1261773.t - 2 second ./tests/bugs/shard/bug-1245547.t - 2 second ./tests/bugs/replicate/bug-884328.t - 2 second ./tests/bugs/nfs/bug-970070.t - 2 second ./tests/bugs/nfs/bug-1302948.t - 2 second ./tests/bugs/logging/bug-823081.t - 2 second ./tests/bugs/glusterfs-server/bug-889996.t - 2 second ./tests/bugs/glusterfs-server/bug-861542.t - 2 second ./tests/bugs/glusterfs/bug-892730.t - 2 second ./tests/bugs/glusterfs/bug-860297.t - 2 second ./tests/bugs/glusterfs/bug-853690.t - 2 second ./tests/bugs/glusterfs/bug-811493.t - 2 second ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second ./tests/bugs/distribute/bug-924265.t - 2 second ./tests/bugs/distribute/bug-1204140.t - 2 second ./tests/bugs/core/bug-903336.t - 2 second ./tests/bugs/core/bug-845213.t - 2 second ./tests/bugs/core/bug-1111557.t - 2 second ./tests/bugs/cli/bug-969193.t - 2 second ./tests/bugs/cli/bug-764638.t - 2 second ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second ./tests/basic/peer-parsing.t - 2 second ./tests/basic/gfapi/sink.t - 2 second ./tests/basic/afr/ta-check-locks.t - 2 second ./tests/basic/afr/arbiter-cli.t - 2 second ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second ./tests/bugs/cli/bug-949298.t - 1 second ./tests/bugs/cli/bug-921215.t - 1 second ./tests/bugs/cli/bug-1047378.t - 1 second ./tests/basic/posixonly.t - 1 second ./tests/basic/netgroup_parsing.t - 1 second ./tests/basic/exports_parsing.t - 1 second ./tests/basic/glusterfsd-args.t - 0 second 2 test(s) failed ./tests/basic/uss.t ./tests/bugs/glusterd/serialize-shd-manager-glusterd-restart.t 0 test(s) generated core 3 test(s) needed retry ./tests/basic/uss.t ./tests/basic/volfile-sanity.t ./tests/bugs/glusterd/serialize-shd-manager-glusterd-restart.t Result is 124 tar: Removing leading `/' from member names kernel.core_pattern = /%e-%p.core Build step 'Execute shell' marked build as failure _______________________________________________ maintainers mailing list maintainers at gluster.org https://lists.gluster.org/mailman/listinfo/maintainers -- - Atin (atinm) -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenkins at build.gluster.org Mon Jun 3 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 3 Jun 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <506588597.104.1559526303096.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1714851 / core: issues with 'list.h' elements in clang-scan https://bugzilla.redhat.com/1714895 / libglusterfsclient: Glusterfs(fuse) client crash https://bugzilla.redhat.com/1716097 / project-infrastructure: infra: create suse-packing at lists.nfs-ganesha.org alias [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... 
Name: build.log Type: application/octet-stream Size: 670 bytes Desc: not available URL: From hgowtham at redhat.com Mon Jun 3 08:18:46 2019 From: hgowtham at redhat.com (Hari Gowtham) Date: Mon, 3 Jun 2019 13:48:46 +0530 Subject: [Gluster-devel] Release 6.3: Expected tagging on June 7th Message-ID: Hi, Expected tagging date for release-6.3 is on June, 7th, 2019. Please ensure required patches are back-ported and also are passing regressions and are appropriately reviewed for easy merging and tagging on the date. -- Regards, Hari Gowtham. From rabhat at redhat.com Mon Jun 3 14:20:30 2019 From: rabhat at redhat.com (FNU Raghavendra Manjunath) Date: Mon, 3 Jun 2019 10:20:30 -0400 Subject: [Gluster-devel] [Gluster-Maintainers] Build failed in Jenkins: regression-test-with-multiplex #1359 In-Reply-To: References: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: Yes. I have sent this patch [1] for review. It is now not failing in regression tests. (i.e. uss.t is not failing) [1] https://review.gluster.org/#/c/glusterfs/+/22728/ Regards, Raghavendra On Sat, Jun 1, 2019 at 7:25 AM Atin Mukherjee wrote: > subdir-mount.t has started failing in brick mux regression nightly. This > needs to be fixed. > > Raghavendra - did we manage to get any further clue on uss.t failure? > > ---------- Forwarded message --------- > From: > Date: Fri, 31 May 2019 at 23:34 > Subject: [Gluster-Maintainers] Build failed in Jenkins: > regression-test-with-multiplex #1359 > To: , , , > , > > > See < > https://build.gluster.org/job/regression-test-with-multiplex/1359/display/redirect?page=changes > > > > Changes: > > [atin] glusterd: add an op-version check > > [atin] glusterd/svc: glusterd_svcs_stop should call individual wrapper > function > > [atin] glusterd/svc: Stop stale process using the glusterd_proc_stop > > [Amar Tumballi] lcov: more coverage to shard, old-protocol, sdfs > > [Kotresh H R] tests/geo-rep: Add EC volume test case > > [Amar Tumballi] glusterfsd/cleanup: Protect graph object under a lock > > [Mohammed Rafi KC] glusterd/shd: Optimize the glustershd manager to send > reconfigure > > [Kotresh H R] tests/geo-rep: Add tests to cover glusterd geo-rep > > [atin] glusterd: Optimize code to copy dictionary in handshake code path > > ------------------------------------------ > [...truncated 3.18 MB...] 
> ./tests/basic/afr/stale-file-lookup.t - 9 second > ./tests/basic/afr/granular-esh/replace-brick.t - 9 second > ./tests/basic/afr/granular-esh/add-brick.t - 9 second > ./tests/basic/afr/gfid-mismatch.t - 9 second > ./tests/performance/open-behind.t - 8 second > ./tests/features/ssl-authz.t - 8 second > ./tests/features/readdir-ahead.t - 8 second > ./tests/bugs/upcall/bug-1458127.t - 8 second > ./tests/bugs/transport/bug-873367.t - 8 second > ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 8 second > ./tests/bugs/replicate/bug-1132102.t - 8 second > ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove-quota-quota-deem-statfs.t > - 8 second > ./tests/bugs/quota/bug-1104692.t - 8 second > ./tests/bugs/posix/bug-1360679.t - 8 second > ./tests/bugs/posix/bug-1122028.t - 8 second > ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second > ./tests/bugs/glusterfs/bug-861015-log.t - 8 second > ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second > ./tests/bugs/glusterd/bug-1696046.t - 8 second > ./tests/bugs/fuse/bug-983477.t - 8 second > ./tests/bugs/ec/bug-1227869.t - 8 second > ./tests/bugs/distribute/bug-1088231.t - 8 second > ./tests/bugs/distribute/bug-1086228.t - 8 second > ./tests/bugs/cli/bug-1087487.t - 8 second > ./tests/bugs/cli/bug-1022905.t - 8 second > ./tests/bugs/bug-1258069.t - 8 second > ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub-info.t > - 8 second > ./tests/basic/xlator-pass-through-sanity.t - 8 second > ./tests/basic/quota-nfs.t - 8 second > ./tests/basic/glusterd/arbiter-volume.t - 8 second > ./tests/basic/ctime/ctime-noatime.t - 8 second > ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second > ./tests/gfid2path/get-gfid-to-path.t - 7 second > ./tests/bugs/upcall/bug-1369430.t - 7 second > ./tests/bugs/snapshot/bug-1260848.t - 7 second > ./tests/bugs/shard/shard-inode-refcount-test.t - 7 second > ./tests/bugs/shard/bug-1258334.t - 7 second > ./tests/bugs/replicate/bug-767585-gfid.t - 7 second > ./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 second > ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second > ./tests/bugs/posix/bug-1175711.t - 7 second > ./tests/bugs/nfs/bug-915280.t - 7 second > ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second > ./tests/bugs/md-cache/bug-1211863_unlink.t - 7 second > ./tests/bugs/glusterfs/bug-848251.t - 7 second > ./tests/bugs/distribute/bug-1122443.t - 7 second > ./tests/bugs/changelog/bug-1208470.t - 7 second > ./tests/bugs/bug-1702299.t - 7 second > ./tests/bugs/bug-1371806_2.t - 7 second > ./tests/bugs/bitrot/1209818-vol-info-show-scrub-process-properly.t - 7 > second > ./tests/bugs/bitrot/1209751-bitrot-scrub-tunable-reset.t - 7 second > ./tests/bugs/bitrot/1207029-bitrot-daemon-should-start-on-valid-node.t - > 7 second > ./tests/bitrot/br-stub.t - 7 second > ./tests/basic/glusterd/arbiter-volume-probe.t - 7 second > ./tests/basic/gfapi/libgfapi-fini-hang.t - 7 second > ./tests/basic/fencing/fencing-crash-conistency.t - 7 second > ./tests/basic/distribute/file-create.t - 7 second > ./tests/basic/afr/tarissue.t - 7 second > ./tests/basic/afr/gfid-heal.t - 7 second > ./tests/bugs/snapshot/bug-1178079.t - 6 second > ./tests/bugs/snapshot/bug-1064768.t - 6 second > ./tests/bugs/shard/bug-1342298.t - 6 second > ./tests/bugs/shard/bug-1259651.t - 6 second > ./tests/bugs/replicate/bug-1686568-send-truncate-on-arbiter-from-shd.t - > 6 second > ./tests/bugs/replicate/bug-1626994-info-split-brain.t - 6 second > ./tests/bugs/replicate/bug-1325792.t - 6 
second > ./tests/bugs/replicate/bug-1101647.t - 6 second > ./tests/bugs/quota/bug-1243798.t - 6 second > ./tests/bugs/protocol/bug-1321578.t - 6 second > ./tests/bugs/nfs/bug-877885.t - 6 second > ./tests/bugs/nfs/bug-1143880-fix-gNFSd-auth-crash.t - 6 second > ./tests/bugs/md-cache/bug-1476324.t - 6 second > ./tests/bugs/md-cache/afr-stale-read.t - 6 second > ./tests/bugs/io-cache/bug-858242.t - 6 second > ./tests/bugs/glusterfs/bug-893378.t - 6 second > ./tests/bugs/glusterfs/bug-856455.t - 6 second > ./tests/bugs/glusterd/quorum-value-check.t - 6 second > ./tests/bugs/ec/bug-1179050.t - 6 second > ./tests/bugs/distribute/bug-912564.t - 6 second > ./tests/bugs/distribute/bug-884597.t - 6 second > ./tests/bugs/distribute/bug-1368012.t - 6 second > ./tests/bugs/core/bug-986429.t - 6 second > ./tests/bugs/core/bug-1699025-brick-mux-detach-brick-fd-issue.t - 6 > second > ./tests/bugs/core/bug-1168803-snapd-option-validation-fix.t - 6 second > ./tests/bugs/bug-1371806_1.t - 6 second > ./tests/bugs/bitrot/bug-1229134-bitd-not-support-vol-set.t - 6 second > ./tests/bugs/bitrot/bug-1210684-scrub-pause-resume-error-handling.t - 6 > second > ./tests/bitrot/bug-1221914.t - 6 second > ./tests/basic/trace.t - 6 second > ./tests/basic/playground/template-xlator-sanity.t - 6 second > ./tests/basic/ec/nfs.t - 6 second > ./tests/basic/ec/ec-read-policy.t - 6 second > ./tests/basic/ec/ec-anonymous-fd.t - 6 second > ./tests/basic/distribute/non-root-unlink-stale-linkto.t - 6 second > ./tests/basic/changelog/changelog-rename.t - 6 second > ./tests/basic/afr/heal-info.t - 6 second > ./tests/basic/afr/afr-read-hash-mode.t - 6 second > ./tests/gfid2path/gfid2path_nfs.t - 5 second > ./tests/bugs/upcall/bug-1422776.t - 5 second > ./tests/bugs/replicate/bug-886998.t - 5 second > ./tests/bugs/replicate/bug-1365455.t - 5 second > ./tests/bugs/readdir-ahead/bug-1670253-consistent-metadata.t - 5 second > ./tests/bugs/posix/bug-gfid-path.t - 5 second > ./tests/bugs/posix/bug-765380.t - 5 second > ./tests/bugs/nfs/bug-847622.t - 5 second > ./tests/bugs/nfs/bug-1116503.t - 5 second > ./tests/bugs/io-stats/bug-1598548.t - 5 second > ./tests/bugs/glusterfs-server/bug-877992.t - 5 second > ./tests/bugs/glusterfs-server/bug-873549.t - 5 second > ./tests/bugs/glusterfs/bug-895235.t - 5 second > ./tests/bugs/fuse/bug-1126048.t - 5 second > ./tests/bugs/distribute/bug-907072.t - 5 second > ./tests/bugs/core/bug-913544.t - 5 second > ./tests/bugs/core/bug-908146.t - 5 second > ./tests/bugs/access-control/bug-1051896.t - 5 second > ./tests/basic/ec/ec-internal-xattrs.t - 5 second > ./tests/basic/ec/ec-fallocate.t - 5 second > ./tests/basic/distribute/bug-1265677-use-readdirp.t - 5 second > ./tests/basic/afr/arbiter-remove-brick.t - 5 second > ./tests/performance/quick-read.t - 4 second > ./tests/gfid2path/block-mount-access.t - 4 second > ./tests/features/delay-gen.t - 4 second > ./tests/bugs/upcall/bug-upcall-stat.t - 4 second > ./tests/bugs/upcall/bug-1394131.t - 4 second > ./tests/bugs/unclassified/bug-1034085.t - 4 second > ./tests/bugs/snapshot/bug-1111041.t - 4 second > ./tests/bugs/shard/bug-1272986.t - 4 second > ./tests/bugs/shard/bug-1256580.t - 4 second > ./tests/bugs/shard/bug-1250855.t - 4 second > ./tests/bugs/shard/bug-1245547.t - 4 second > ./tests/bugs/rpc/bug-954057.t - 4 second > ./tests/bugs/replicate/bug-976800.t - 4 second > ./tests/bugs/replicate/bug-880898.t - 4 second > ./tests/bugs/replicate/bug-1480525.t - 4 second > ./tests/bugs/read-only/bug-1134822-read-only-default-in-graph.t - 4 > second > 
./tests/bugs/readdir-ahead/bug-1446516.t - 4 second > ./tests/bugs/readdir-ahead/bug-1439640.t - 4 second > ./tests/bugs/readdir-ahead/bug-1390050.t - 4 second > ./tests/bugs/quota/bug-1287996.t - 4 second > ./tests/bugs/quick-read/bug-846240.t - 4 second > ./tests/bugs/posix/disallow-gfid-volumeid-removexattr.t - 4 second > ./tests/bugs/posix/bug-1619720.t - 4 second > ./tests/bugs/nl-cache/bug-1451588.t - 4 second > ./tests/bugs/nfs/zero-atime.t - 4 second > ./tests/bugs/nfs/subdir-trailing-slash.t - 4 second > ./tests/bugs/nfs/socket-as-fifo.t - 4 second > ./tests/bugs/nfs/showmount-many-clients.t - 4 second > ./tests/bugs/nfs/bug-1210338.t - 4 second > ./tests/bugs/nfs/bug-1166862.t - 4 second > ./tests/bugs/nfs/bug-1161092-nfs-acls.t - 4 second > ./tests/bugs/md-cache/bug-1632503.t - 4 second > ./tests/bugs/glusterfs-server/bug-864222.t - 4 second > ./tests/bugs/glusterfs/bug-1482528.t - 4 second > ./tests/bugs/glusterd/bug-948729/bug-948729-mode-script.t - 4 second > ./tests/bugs/glusterd/bug-948729/bug-948729-force.t - 4 second > ./tests/bugs/glusterd/bug-1482906-peer-file-blank-line.t - 4 second > ./tests/bugs/glusterd/bug-1091935-brick-order-check-from-cli-to-glusterd.t > - 4 second > ./tests/bugs/geo-replication/bug-1296496.t - 4 second > ./tests/bugs/fuse/bug-1336818.t - 4 second > ./tests/bugs/fuse/bug-1283103.t - 4 second > ./tests/bugs/core/io-stats-1322825.t - 4 second > ./tests/bugs/core/bug-834465.t - 4 second > ./tests/bugs/core/bug-1135514-allow-setxattr-with-null-value.t - 4 second > ./tests/bugs/core/949327.t - 4 second > ./tests/bugs/cli/bug-977246.t - 4 second > ./tests/bugs/cli/bug-961307.t - 4 second > ./tests/bugs/cli/bug-1004218.t - 4 second > ./tests/bugs/bug-1138841.t - 4 second > ./tests/bugs/access-control/bug-1387241.t - 4 second > ./tests/bitrot/bug-internal-xattrs-check-1243391.t - 4 second > ./tests/basic/quota-rename.t - 4 second > ./tests/basic/hardlink-limit.t - 4 second > ./tests/basic/ec/dht-rename.t - 4 second > ./tests/basic/distribute/lookup.t - 4 second > ./tests/line-coverage/meta-max-coverage.t - 3 second > ./tests/gfid2path/gfid2path_fuse.t - 3 second > ./tests/bugs/unclassified/bug-991622.t - 3 second > ./tests/bugs/trace/bug-797171.t - 3 second > ./tests/bugs/glusterfs-server/bug-861542.t - 3 second > ./tests/bugs/glusterfs/bug-869724.t - 3 second > ./tests/bugs/glusterfs/bug-860297.t - 3 second > ./tests/bugs/glusterfs/bug-844688.t - 3 second > ./tests/bugs/glusterd/bug-948729/bug-948729.t - 3 second > ./tests/bugs/distribute/bug-1204140.t - 3 second > ./tests/bugs/core/bug-924075.t - 3 second > ./tests/bugs/core/bug-845213.t - 3 second > ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second > ./tests/bugs/core/bug-1119582.t - 3 second > ./tests/bugs/core/bug-1117951.t - 3 second > ./tests/bugs/cli/bug-983317-volume-get.t - 3 second > ./tests/bugs/cli/bug-867252.t - 3 second > ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second > ./tests/basic/fops-sanity.t - 3 second > ./tests/basic/fencing/test-fence-option.t - 3 second > ./tests/basic/distribute/debug-xattrs.t - 3 second > ./tests/basic/afr/ta-check-locks.t - 3 second > ./tests/line-coverage/volfile-with-all-graph-syntax.t - 2 second > ./tests/line-coverage/some-features-in-libglusterfs.t - 2 second > ./tests/bugs/shard/bug-1261773.t - 2 second > ./tests/bugs/replicate/bug-884328.t - 2 second > ./tests/bugs/readdir-ahead/bug-1512437.t - 2 second > ./tests/bugs/nfs/bug-970070.t - 2 second > ./tests/bugs/nfs/bug-1302948.t - 2 second > ./tests/bugs/logging/bug-823081.t - 2 second > 
./tests/bugs/glusterfs-server/bug-889996.t - 2 second > ./tests/bugs/glusterfs/bug-892730.t - 2 second > ./tests/bugs/glusterfs/bug-811493.t - 2 second > ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second > ./tests/bugs/distribute/bug-924265.t - 2 second > ./tests/bugs/core/log-bug-1362520.t - 2 second > ./tests/bugs/core/bug-903336.t - 2 second > ./tests/bugs/core/bug-1111557.t - 2 second > ./tests/bugs/cli/bug-969193.t - 2 second > ./tests/bugs/cli/bug-949298.t - 2 second > ./tests/bugs/cli/bug-921215.t - 2 second > ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second > ./tests/basic/peer-parsing.t - 2 second > ./tests/basic/md-cache/bug-1418249.t - 2 second > ./tests/basic/afr/arbiter-cli.t - 2 second > ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second > ./tests/bugs/glusterfs/bug-853690.t - 1 second > ./tests/bugs/cli/bug-764638.t - 1 second > ./tests/bugs/cli/bug-1047378.t - 1 second > ./tests/basic/netgroup_parsing.t - 1 second > ./tests/basic/gfapi/sink.t - 1 second > ./tests/basic/exports_parsing.t - 1 second > ./tests/basic/posixonly.t - 0 second > ./tests/basic/glusterfsd-args.t - 0 second > > > 2 test(s) failed > ./tests/basic/uss.t > ./tests/features/subdir-mount.t > > 0 test(s) generated core > > > 5 test(s) needed retry > ./tests/basic/afr/split-brain-favorite-child-policy.t > ./tests/basic/ec/self-heal.t > ./tests/basic/uss.t > ./tests/basic/volfile-sanity.t > ./tests/features/subdir-mount.t > > Result is 1 > > tar: Removing leading `/' from member names > kernel.core_pattern = /%e-%p.core > Build step 'Execute shell' marked build as failure > _______________________________________________ > maintainers mailing list > maintainers at gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers > > > -- > - Atin (atinm) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From abhishpaliwal at gmail.com Tue Jun 4 10:09:59 2019 From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL) Date: Tue, 4 Jun 2019 15:39:59 +0530 Subject: [Gluster-devel] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Team, Please respond on the issue which I raised. Regards, Abhishek On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL wrote: > Anyone please reply.... > > On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL > wrote: > >> Hi Team, >> >> I upload some valgrind logs from my gluster 5.4 setup. This is writing to >> the volume every 15 minutes. I stopped glusterd and then copy away the >> logs. The test was running for some simulated days. They are zipped in >> valgrind-54.zip. >> >> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >> glusterfs and even some definitely lost bytes. >> >> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record 391 >> of 391 >> ==2737== at 0x4C29C25: calloc (in >> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >> ==2737== by 0xA22485E: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA217C94: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21D9F8: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21DED9: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA21E685: ??? (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0xA1B9D8C: init (in >> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x4E8A2B8: ??? 
(in /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >> /usr/lib64/libglusterfs.so.0.0.1) >> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >> ==2737== >> ==2737== LEAK SUMMARY: >> ==2737== definitely lost: 1,053 bytes in 10 blocks >> ==2737== indirectly lost: 317 bytes in 3 blocks >> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >> ==2737== still reachable: 53,277 bytes in 201 blocks >> ==2737== suppressed: 0 bytes in 0 blocks >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> > -- Regards Abhishek Paliwal -------------- next part -------------- An HTML attachment was scrubbed... URL: From zgrep at 139.com Tue Jun 4 11:33:54 2019 From: zgrep at 139.com (=?utf-8?B?WGllIENoYW5nbG9uZw==?=) Date: 04 Jun 2019 19:33:54 +0800 Subject: [Gluster-devel] GETXATTR op pending on index xlator for more than 10 hours Message-ID: 2019060419335438074695@139.com> Hi all, Today, i found gnfs GETXATTR bailing out on gluster release 3.12.0. I have a simple 4*2 Distributed-Rep volume. [2019-06-03 19:58:33.085880] E [rpc-clnt.c:185:Call_bail] 0-cl25vol01-client-4: bailing out frame type(GlusterFS 3.3) op(GETXATTR(18)) xid=0x21de4275 sent = 2019-06-03 19:28:30.552356. timeout = 1800 for 10.3.133.57:49153 xid= 0x21de4275 = 568214133 Then i try to dump brick 10.3.133.57:49153, and find the GETXATTR op pending on index xlator for more than 10 hours! 1111MicrosoftInternetExplorer402DocumentNotSpecified7.8 ?Normal0 [root at node0001 gluster]# grep -rn 568214133 gluster-brick-1-cl25vol01.6078.dump.15596* gluster-brick-1-cl25vol01.6078.dump.1559617125:5093:unique=568214133 gluster-brick-1-cl25vol01.6078.dump.1559618121:5230:unique=568214133 gluster-brick-1-cl25vol01.6078.dump.1559618912:5434:unique=568214133 gluster-brick-1-cl25vol01.6078.dump.1559628467:6921:unique=568214133 [root at node0001 gluster]# date -d @1559617125 Tue Jun 4 10:58:45 CST 2019 [root at node0001 gluster]# date -d @1559628467 Tue Jun 4 14:07:47 CST 2019 1111MicrosoftInternetExplorer402DocumentNotSpecified7.8 ?Normal0 [root at node0001 gluster]# [global.callpool.stack.115] stack=0x7f8b342623c0 uid=500 gid=500 pid=-6 unique=568214133 lk-owner=faffffff op=stack type=0 cnt=4 [global.callpool.stack.115.frame.1] frame=0x7f8b1d6fb540 ref_count=0 translator=cl25vol01-index complete=0 parent=cl25vol01-quota wind_from=quota_getxattr wind_to=(this->children->xlator)->fops->getxattr unwind_to=default_getxattr_cbk [global.callpool.stack.115.frame.2] frame=0x7f8b30a14da0 ref_count=1 translator=cl25vol01-quota complete=0 parent=cl25vol01-io-stats wind_from=io_stats_getxattr wind_to=(this->children->xlator)->fops->getxattr unwind_to=io_stats_getxattr_cbk [global.callpool.stack.115.frame.3] frame=0x7f8b6debada0 ref_count=1 translator=cl25vol01-io-stats complete=0 parent=cl25vol01-server wind_from=server_getxattr_resume wind_to=FIRST_CHILD(this)->fops->getxattr unwind_to=server_getxattr_cbk [global.callpool.stack.115.frame.4] frame=0x7f8b21962a60 ref_count=1 translator=cl25vol01-server complete=0 I've checked the code logic and got nothing, any advice? I still have the scene on my side, so we can dig more. Thanks -------------- next part -------------- An HTML attachment was scrubbed... 
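When a fop is parked on one translator like the GETXATTR above, it can help to pull every call frame that has not been unwound out of the statedump instead of grepping for a single xid. The script below is only a rough sketch built around the key=value layout shown in the dump excerpt (section headers such as [global.callpool.stack.115.frame.1], with complete=0 marking a frame that is still pending); it is not an official tool, and the dump filename in the example is taken from the mail above.

#!/usr/bin/env python
# stuck-frames.py: list call frames in a glusterfs statedump that are
# still pending (complete=0), with the translator they are parked on.
import re
import sys

SECTION_RE = re.compile(r'\[(global\.callpool\.stack\.\d+(?:\.frame\.\d+)?)\]')

def stuck_frames(path):
    section, keys = None, {}
    with open(path) as dump:
        for raw in dump:
            line = raw.strip()
            if line.startswith('['):
                # flush the previous section before starting a new one
                if keys.get('complete') == '0':
                    yield section, keys
                match = SECTION_RE.match(line)
                section = match.group(1) if match else None
                keys = {}
            elif section and '=' in line:
                key, _, value = line.partition('=')
                keys[key] = value
    if keys.get('complete') == '0':
        yield section, keys

if __name__ == '__main__':
    # e.g. python stuck-frames.py gluster-brick-1-cl25vol01.6078.dump.1559617125
    for name, frame in stuck_frames(sys.argv[1]):
        print('%s: translator=%s wind_from=%s' % (
            name, frame.get('translator'), frame.get('wind_from')))

Comparing two dumps taken a few minutes apart shows which of these frames are genuinely stuck rather than just in flight.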
URL: From ykaul at redhat.com Tue Jun 4 13:27:04 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Tue, 4 Jun 2019 16:27:04 +0300 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: <090785225412c2b5b269454f8812d0a165aea62d.camel@redhat.com> References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> <090785225412c2b5b269454f8812d0a165aea62d.camel@redhat.com> Message-ID: What was the result of this investigation? I suspect seeing the same issue on builder209[1]. Y. [1] https://build.gluster.org/job/centos7-regression/6302/consoleFull On Fri, Apr 5, 2019 at 5:40 PM Michael Scherer wrote: > Le vendredi 05 avril 2019 ? 16:55 +0530, Nithya Balachandran a ?crit : > > On Fri, 5 Apr 2019 at 12:16, Michael Scherer > > wrote: > > > > > Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : > > > > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : > > > > > I'm not convinced this is solved. Just had what I believe is a > > > > > similar > > > > > failure: > > > > > > > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. > > > > > See > > > > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: > > > > > rpc.statd is > > > > > not running but is required for remote locking.*00:12:02.532* > > > > > mount.nfs: Either use '-o nolock' to keep locks local, or start > > > > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was > > > > > specified > > > > > > > > > > (of course, it can always be my patch!) > > > > > > > > > > https://build.gluster.org/job/centos7-regression/5384/console > > > > > > > > same issue, different builder (206). I will check them all, as > > > > the > > > > issue is more widespread than I expected (or it did popup since > > > > last > > > > time I checked). > > > > > > Deepshika did notice that the issue came back on one server > > > (builder202) after a reboot, so the rpcbind issue is not related to > > > the > > > network initscript one, so the RCA continue. > > > > > > We are looking for another workaround involving fiddling with the > > > socket (until we find why it do use ipv6 at boot, but not after, > > > when > > > ipv6 is disabled). > > > > > > > Could this be relevant? > > https://access.redhat.com/solutions/2798411 > > Good catch. > > So, we already do that, Nigel took care of that (after 2 days of > research). But I didn't knew the exact symptoms, and decided to double > check just in case. > > And... there is no sysctl.conf in the initrd. Running dracut -v -f do > not change anything. > > Running "dracut -v -f -H" take care of that (and this fix the problem), > but: > - our ansible script already run that > - -H is hostonly, which is already the default on EL7 according to the > doc. > > However, if dracut-config-generic is installed, it doesn't build a > hostonly initrd, and so do not include the sysctl.conf file (who break > rpcbnd, who break the test suite). > > And for some reason, it is installed the image in ec2 (likely default), > but not by default on the builders. > > So what happen is that after a kernel upgrade, dracut rebuild a generic > initrd instead of a hostonly one, who break things. And kernel was > likely upgraded recently (and upgrade happen nightly (for some value of > "night"), so we didn't see that earlier, nor with a fresh system. 
> > > So now, we have several solution: > - be explicit on using hostonly in dracut, so this doesn't happen again > (or not for this reason) > > - disable ipv6 in rpcbind in a cleaner way (to be tested) > > - get the test suite work with ip v6 > > In the long term, I also want to monitor the processes, but for that, I > need a VPN between the nagios server and ec2, and that project got > blocked by several issues (like EC2 not support ecdsa keys, and we use > that for ansible, so we have to come back to RSA for full automated > deployment, and openvon requires to use certificates, so I need a newer > python openssl for doing what I want, and RHEL 7 is too old, etc, etc). > > As the weekend approach for me, I just rebuilt the initrd for the time > being. I guess forcing hostonly is the safest fix for now, but this > will be for monday. > -- > Michael Scherer > Sysadmin, Community Infrastructure and Platform, OSAS > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Wed Jun 5 06:57:21 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Wed, 5 Jun 2019 12:27:21 +0530 Subject: [Gluster-devel] [Gluster-infra] rebal-all-nodes-migrate.t always fails now In-Reply-To: References: <94bd8147c5035da76c3ac3ae90a8a02ed000106a.camel@redhat.com> <0ca34e42063ad77f323155c85a7bb3ba7a79931b.camel@redhat.com> <090785225412c2b5b269454f8812d0a165aea62d.camel@redhat.com> Message-ID: I recently added 3 builders builder208, builder209, builder210 to the regression pool. Network to these new builders did not come up because it was looking for non-existing ethernet card eth0 on reboot and hence failing. I'll reconnect them back and update here once I fix the issue today. Sorry for the inconvenience. On Tue, Jun 4, 2019 at 7:07 PM Yaniv Kaul wrote: > What was the result of this investigation? I suspect seeing the same issue > on builder209[1]. > Y. > > [1] https://build.gluster.org/job/centos7-regression/6302/consoleFull > > On Fri, Apr 5, 2019 at 5:40 PM Michael Scherer > wrote: > >> Le vendredi 05 avril 2019 ? 16:55 +0530, Nithya Balachandran a ?crit : >> > On Fri, 5 Apr 2019 at 12:16, Michael Scherer >> > wrote: >> > >> > > Le jeudi 04 avril 2019 ? 18:24 +0200, Michael Scherer a ?crit : >> > > > Le jeudi 04 avril 2019 ? 19:10 +0300, Yaniv Kaul a ?crit : >> > > > > I'm not convinced this is solved. Just had what I believe is a >> > > > > similar >> > > > > failure: >> > > > > >> > > > > *00:12:02.532* A dependency job for rpc-statd.service failed. >> > > > > See >> > > > > 'journalctl -xe' for details.*00:12:02.532* mount.nfs: >> > > > > rpc.statd is >> > > > > not running but is required for remote locking.*00:12:02.532* >> > > > > mount.nfs: Either use '-o nolock' to keep locks local, or start >> > > > > statd.*00:12:02.532* mount.nfs: an incorrect mount option was >> > > > > specified >> > > > > >> > > > > (of course, it can always be my patch!) >> > > > > >> > > > > https://build.gluster.org/job/centos7-regression/5384/console >> > > > >> > > > same issue, different builder (206). I will check them all, as >> > > > the >> > > > issue is more widespread than I expected (or it did popup since >> > > > last >> > > > time I checked). >> > > >> > > Deepshika did notice that the issue came back on one server >> > > (builder202) after a reboot, so the rpcbind issue is not related to >> > > the >> > > network initscript one, so the RCA continue. 
>> > > >> > > We are looking for another workaround involving fiddling with the >> > > socket (until we find why it do use ipv6 at boot, but not after, >> > > when >> > > ipv6 is disabled). >> > > >> > >> > Could this be relevant? >> > https://access.redhat.com/solutions/2798411 >> >> Good catch. >> >> So, we already do that, Nigel took care of that (after 2 days of >> research). But I didn't knew the exact symptoms, and decided to double >> check just in case. >> >> And... there is no sysctl.conf in the initrd. Running dracut -v -f do >> not change anything. >> >> Running "dracut -v -f -H" take care of that (and this fix the problem), >> but: >> - our ansible script already run that >> - -H is hostonly, which is already the default on EL7 according to the >> doc. >> >> However, if dracut-config-generic is installed, it doesn't build a >> hostonly initrd, and so do not include the sysctl.conf file (who break >> rpcbnd, who break the test suite). >> >> And for some reason, it is installed the image in ec2 (likely default), >> but not by default on the builders. >> >> So what happen is that after a kernel upgrade, dracut rebuild a generic >> initrd instead of a hostonly one, who break things. And kernel was >> likely upgraded recently (and upgrade happen nightly (for some value of >> "night"), so we didn't see that earlier, nor with a fresh system. >> >> >> So now, we have several solution: >> - be explicit on using hostonly in dracut, so this doesn't happen again >> (or not for this reason) >> >> - disable ipv6 in rpcbind in a cleaner way (to be tested) >> >> - get the test suite work with ip v6 >> >> In the long term, I also want to monitor the processes, but for that, I >> need a VPN between the nagios server and ec2, and that project got >> blocked by several issues (like EC2 not support ecdsa keys, and we use >> that for ansible, so we have to come back to RSA for full automated >> deployment, and openvon requires to use certificates, so I need a newer >> python openssl for doing what I want, and RHEL 7 is too old, etc, etc). >> >> As the weekend approach for me, I just rebuilt the initrd for the time >> being. I guess forcing hostonly is the safest fix for now, but this >> will be for monday. >> -- >> Michael Scherer >> Sysadmin, Community Infrastructure and Platform, OSAS >> >> >> _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Wed Jun 5 07:00:16 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Wed, 5 Jun 2019 12:30:16 +0530 Subject: [Gluster-devel] Update: GlusterFS code coverage Message-ID: All, I just wanted to update everyone about one of the initiatives we have undertaken, ie, increasing the overall code coverage of GlusterFS above 70%. 
You can have a look at current code coverage here: https://build.gluster.org/job/line-coverage/lastCompletedBuild/Line_20Coverage_20Report/ (This shows the latest all the time) The daily job, and its details are captured @ https://build.gluster.org/job/line-coverage/ When we started focus on code coverage 3 months back, our code coverage was around 60% overall. We kept the ambitious goal of increasing the code coverage by 10% before glusterfs-7.0 release, and I am happy to announce that we met this goal, before the branching. Before talking about next goals, I want to thank and call out few developers who made this happen. * Xavier Hernandez - Made EC cross 90% from < 70%. * Glusterd Team (Sanju, Rishub, Mohit, Atin) - Increased CLI/glusterd coverage * Geo-Rep Team (Kotresh, Sunny, Shwetha, Aravinda). * Sheetal (help to increase glfs-api test cases, which indirectly helped cover more code across). Also note that, Some components like AFR/replicate was already at 80%+ before we started the efforts. Now, our next goal is to make sure we have above 80% functions coverage in all of the top level components shown. Once that is done, we will focus on 75% code coverage across all components. (ie, no 'Red' in top level page). While it was possible to meet our goal of increasing the overall code coverage from 60% - 70%, increasing it above 70% is not going to be easy, mainly because it involves adding more tests for negative test cases, and adding tests with different options (currently >300 of them across). We also need to look at details from code coverage tests, and reverse engineer to see how to write a test to hit the particular line in the code. I personally invite everyone who is interested to contribute to gluster project to get involved in this effort. Help us write test cases, suggest how to improve it. Help by assigning interns write them for us (if your team has some of them). This is a good way to understand glusterfs code too. We are happy to organize sessions on how to walk through the code etc if required. Happy to hear feedback and see more contribution in this area. Regards, Amar -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Wed Jun 5 13:52:56 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Wed, 5 Jun 2019 19:22:56 +0530 Subject: [Gluster-devel] [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi, Writing to a volume should not affect glusterd. The stack you have shown in the valgrind looks like the memory used to initialise the structures glusterd uses and will free only when it is stopped. Can you provide more details to what it is you are trying to test? Regards, Nithya On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL wrote: > Hi Team, > > Please respond on the issue which I raised. > > Regards, > Abhishek > > On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL > wrote: > >> Anyone please reply.... >> >> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >> wrote: >> >>> Hi Team, >>> >>> I upload some valgrind logs from my gluster 5.4 setup. This is writing >>> to the volume every 15 minutes. I stopped glusterd and then copy away the >>> logs. The test was running for some simulated days. They are zipped in >>> valgrind-54.zip. >>> >>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>> glusterfs and even some definitely lost bytes. 
>>> >>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>> 391 of 391 >>> ==2737== at 0x4C29C25: calloc (in >>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>> ==2737== by 0xA22485E: ??? (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0xA217C94: ??? (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0xA21D9F8: ??? (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0xA21DED9: ??? (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0xA21E685: ??? (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0xA1B9D8C: init (in >>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1) >>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>> /usr/lib64/libglusterfs.so.0.0.1) >>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>> ==2737== >>> ==2737== LEAK SUMMARY: >>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>> ==2737== indirectly lost: 317 bytes in 3 blocks >>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>> ==2737== still reachable: 53,277 bytes in 201 blocks >>> ==2737== suppressed: 0 bytes in 0 blocks >>> >>> -- >>> >>> >>> >>> >>> Regards >>> Abhishek Paliwal >>> >> > > -- > > > > > Regards > Abhishek Paliwal > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From sankarshan.mukhopadhyay at gmail.com Thu Jun 6 03:42:30 2019 From: sankarshan.mukhopadhyay at gmail.com (Sankarshan Mukhopadhyay) Date: Thu, 6 Jun 2019 09:12:30 +0530 Subject: [Gluster-devel] More intelligent file distribution across subvols of DHT when file size is known In-Reply-To: References: Message-ID: On Wed, May 22, 2019 at 6:53 PM Krutika Dhananjay wrote: > > Hi, > > I've proposed a solution to the problem of space running out in some children of DHT even when its other children have free space available, here - https://github.com/gluster/glusterfs/issues/675. > > The proposal aims to solve a very specific instance of this generic class of problems where fortunately the size of the file that is getting created is known beforehand. > > Requesting feedback on the proposal or even alternate solutions, if you have any. There has not been much commentary on this issue in the last 10 odd days. What is the next step? From ykaul at redhat.com Thu Jun 6 06:17:25 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Thu, 6 Jun 2019 09:17:25 +0300 Subject: [Gluster-devel] CI failure - NameError: name 'unicode' is not defined (related to changelogparser.py) Message-ID: >From [1]. I think it's a Python2/3 thing, so perhaps a CI issue additionally (though if our code is not Python 3 ready, let's ensure we use Python 2 explicitly until we fix this). 
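That guess is easy to confirm on a builder; a minimal sketch, assuming both interpreters are installed under their usual names:

python2 -c 'print(unicode("ok"))'   # works: unicode() is a builtin on python2
python3 -c 'print(unicode("ok"))'   # NameError: name 'unicode' is not defined
command -v python python2 python3   # see what the regression node actually resolves

The failing line in changelogparser.py hits exactly this builtin, as the log below shows.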
*00:47:05.207* ok 14 [ 13/ 386] < 34> 'gluster --mode=script --wignore volume start patchy'*00:47:05.207* ok 15 [ 13/ 70] < 36> '_GFS --attribute-timeout=0 --entry-timeout=0 --volfile-id=patchy --volfile-server=builder208.int.aws.gluster.org /mnt/glusterfs/0'*00:47:05.207* Traceback (most recent call last):*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 233, in *00:47:05.207* parse(sys.argv[1])*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 221, in parse*00:47:05.207* process_record(data, tokens, changelog_ts, callback)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 178, in process_record*00:47:05.207* callback(record)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 182, in default_callback*00:47:05.207* sys.stdout.write(u"{0}\n".format(record))*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 128, in __str__*00:47:05.207* return unicode(self).encode('utf-8')*00:47:05.207* NameError: name 'unicode' is not defined*00:47:05.207* not ok 16 [ 53/ 39] < 42> '2 check_changelog_op /d/backends/patchy0/.glusterfs/changelogs RENAME' -> 'Got "0" instead of "2"' Y. [1] https://build.gluster.org/job/centos7-regression/6318/console -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhernandez at redhat.com Thu Jun 6 06:32:59 2019 From: xhernandez at redhat.com (Xavi Hernandez) Date: Thu, 6 Jun 2019 08:32:59 +0200 Subject: [Gluster-devel] Should we enable contention notification by default ? In-Reply-To: References: <2044282595.16006319.1556799471980.JavaMail.zimbra@redhat.com> Message-ID: On Thu, May 2, 2019 at 5:45 PM Atin Mukherjee wrote: > > > On Thu, 2 May 2019 at 20:38, Xavi Hernandez wrote: > >> On Thu, May 2, 2019 at 4:06 PM Atin Mukherjee >> wrote: >> >>> >>> >>> On Thu, 2 May 2019 at 19:14, Xavi Hernandez >>> wrote: >>> >>>> On Thu, 2 May 2019, 15:37 Milind Changire, wrote: >>>> >>>>> On Thu, May 2, 2019 at 6:44 PM Xavi Hernandez >>>>> wrote: >>>>> >>>>>> Hi Ashish, >>>>>> >>>>>> On Thu, May 2, 2019 at 2:17 PM Ashish Pandey >>>>>> wrote: >>>>>> >>>>>>> Xavi, >>>>>>> >>>>>>> I would like to keep this option (features.lock-notify-contention) >>>>>>> enabled by default. >>>>>>> However, I can see that there is one more option which will impact >>>>>>> the working of this option which is "notify-contention-delay" >>>>>>> >>>>>> >>>>> Just a nit. I wish the option was called "notify-contention-interval" >>>>> The "delay" part doesn't really emphasize where the delay would be put >>>>> in. >>>>> >>>> >>>> It makes sense. Maybe we can also rename it or add a second name >>>> (alias). If there are no objections, I will send a patch with the change. >>>> >>>> Xavi >>>> >>>> >>>>> >>>>>> .description = "This value determines the minimum amount of time >>>>>>> " >>>>>>> "(in seconds) between upcall contention >>>>>>> notifications " >>>>>>> "on the same inode. If multiple lock requests >>>>>>> are " >>>>>>> "received during this period, only one upcall >>>>>>> will " >>>>>>> "be sent."}, >>>>>>> >>>>>>> I am not sure what should be the best value for this option if we >>>>>>> want to keep features.lock-notify-contention ON by default? >>>>>>> It looks like if we keep the value of notify-contention-delay more, >>>>>>> say 5 sec, it will wait for this much time to send up call >>>>>>> notification which does not look good. >>>>>>> >>>>>> >>>>>> No, the first notification is sent immediately. 
What this option does >>>>>> is to define the minimum interval between notifications. This interval is >>>>>> per lock. This is done to avoid storms of notifications if many requests >>>>>> come referencing the same lock. >>>>>> >>>>>> Is my understanding correct? >>>>>>> What will be impact of this value and what should be the default >>>>>>> value of this option? >>>>>>> >>>>>> >>>>>> I think the current default value of 5 seconds seems good enough. If >>>>>> there are many bricks, each brick could send a notification per lock. 1000 >>>>>> bricks would mean a client would receive 1000 notifications every 5 >>>>>> seconds. It doesn't seem too much, but in those cases 10, and considering >>>>>> we could have other locks, maybe a higher value could be better. >>>>>> >>>>>> Xavi >>>>>> >>>>>> >>>>>>> >>>>>>> --- >>>>>>> Ashish >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> ------------------------------ >>>>>>> *From: *"Xavi Hernandez" >>>>>>> *To: *"gluster-devel" >>>>>>> *Cc: *"Pranith Kumar Karampuri" , "Ashish >>>>>>> Pandey" , "Amar Tumballi" >>>>>>> *Sent: *Thursday, May 2, 2019 4:15:38 PM >>>>>>> *Subject: *Should we enable contention notification by default ? >>>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> there's a feature in the locks xlator that sends a notification to >>>>>>> current owner of a lock when another client tries to acquire the same lock. >>>>>>> This way the current owner is made aware of the contention and can release >>>>>>> the lock as soon as possible to allow the other client to proceed. >>>>>>> >>>>>>> This is specially useful when eager-locking is used and multiple >>>>>>> clients access the same files and directories. Currently both replicated >>>>>>> and dispersed volumes use eager-locking and can use contention notification >>>>>>> to force an early release of the lock. >>>>>>> >>>>>>> Eager-locking reduces the number of network requests required for >>>>>>> each operation, improving performance, but could add delays to other >>>>>>> clients while it keeps the inode or entry locked. With the contention >>>>>>> notification feature we avoid this delay, so we get the best performance >>>>>>> with minimal issues in multiclient environments. >>>>>>> >>>>>>> Currently the contention notification feature is controlled by the >>>>>>> 'features.lock-notify-contention' option and it's disabled by default. >>>>>>> Should we enable it by default ? >>>>>>> >>>>>>> I don't see any reason to keep it disabled by default. Does anyone >>>>>>> foresee any problem ? >>>>>>> >>>>>> >>> Is it a server only option? Otherwise it will break backward >>> compatibility if we rename the key, If alias can get this fixed, that?s a >>> better choice but I?m not sure if it solves all the problems. >>> >> >> It's a server side option. I though that an alias didn't have any other >> implication than accept two names for the same option. Is there anything >> else I need to consider ? >> > > If it?s a server side option then there?s no challenge in alias. If you do > rename then in heterogeneous server versions volume set wouldn?t work > though. > I created a patch to change this and set notify-contention to 'yes' by default. I'll test upgrade paths to make sure that nothing breaks. 
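Until that default lands, the behaviour can be enabled per volume from the CLI; a minimal sketch, where <volname> is a placeholder and the exact key of the throttle option may differ between versions:

gluster volume set <volname> features.lock-notify-contention on
gluster volume get <volname> features.lock-notify-contention
gluster volume get <volname> all | grep -i contention   # lists the related notify-contention-delay key your version exposes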
Xavi > >> >>> >>>>>>> Regards, >>>>>>> >>>>>>> Xavi >>>>>>> >>>>>>> _______________________________________________ >>>>>> Gluster-devel mailing list >>>>>> Gluster-devel at gluster.org >>>>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>> >>>>> >>>>> >>>>> -- >>>>> Milind >>>>> >>>>> _______________________________________________ >>>> Gluster-devel mailing list >>>> Gluster-devel at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>> >>> -- >>> --Atin >>> >> -- > --Atin > -------------- next part -------------- An HTML attachment was scrubbed... URL: From xhernandez at redhat.com Thu Jun 6 06:38:45 2019 From: xhernandez at redhat.com (Xavi Hernandez) Date: Thu, 6 Jun 2019 08:38:45 +0200 Subject: [Gluster-devel] Should we enable contention notification by default ? In-Reply-To: References: <2044282595.16006319.1556799471980.JavaMail.zimbra@redhat.com> Message-ID: Missed the patch link: https://review.gluster.org/c/glusterfs/+/22828 On Thu, Jun 6, 2019 at 8:32 AM Xavi Hernandez wrote: > On Thu, May 2, 2019 at 5:45 PM Atin Mukherjee > wrote: > >> >> >> On Thu, 2 May 2019 at 20:38, Xavi Hernandez >> wrote: >> >>> On Thu, May 2, 2019 at 4:06 PM Atin Mukherjee < >>> atin.mukherjee83 at gmail.com> wrote: >>> >>>> >>>> >>>> On Thu, 2 May 2019 at 19:14, Xavi Hernandez >>>> wrote: >>>> >>>>> On Thu, 2 May 2019, 15:37 Milind Changire, >>>>> wrote: >>>>> >>>>>> On Thu, May 2, 2019 at 6:44 PM Xavi Hernandez >>>>>> wrote: >>>>>> >>>>>>> Hi Ashish, >>>>>>> >>>>>>> On Thu, May 2, 2019 at 2:17 PM Ashish Pandey >>>>>>> wrote: >>>>>>> >>>>>>>> Xavi, >>>>>>>> >>>>>>>> I would like to keep this option (features.lock-notify-contention) >>>>>>>> enabled by default. >>>>>>>> However, I can see that there is one more option which will impact >>>>>>>> the working of this option which is "notify-contention-delay" >>>>>>>> >>>>>>> >>>>>> Just a nit. I wish the option was called "notify-contention-interval" >>>>>> The "delay" part doesn't really emphasize where the delay would be >>>>>> put in. >>>>>> >>>>> >>>>> It makes sense. Maybe we can also rename it or add a second name >>>>> (alias). If there are no objections, I will send a patch with the change. >>>>> >>>>> Xavi >>>>> >>>>> >>>>>> >>>>>>> .description = "This value determines the minimum amount of >>>>>>>> time " >>>>>>>> "(in seconds) between upcall contention >>>>>>>> notifications " >>>>>>>> "on the same inode. If multiple lock requests >>>>>>>> are " >>>>>>>> "received during this period, only one upcall >>>>>>>> will " >>>>>>>> "be sent."}, >>>>>>>> >>>>>>>> I am not sure what should be the best value for this option if we >>>>>>>> want to keep features.lock-notify-contention ON by default? >>>>>>>> It looks like if we keep the value of notify-contention-delay more, >>>>>>>> say 5 sec, it will wait for this much time to send up call >>>>>>>> notification which does not look good. >>>>>>>> >>>>>>> >>>>>>> No, the first notification is sent immediately. What this option >>>>>>> does is to define the minimum interval between notifications. This interval >>>>>>> is per lock. This is done to avoid storms of notifications if many requests >>>>>>> come referencing the same lock. >>>>>>> >>>>>>> Is my understanding correct? >>>>>>>> What will be impact of this value and what should be the default >>>>>>>> value of this option? >>>>>>>> >>>>>>> >>>>>>> I think the current default value of 5 seconds seems good enough. If >>>>>>> there are many bricks, each brick could send a notification per lock. 
1000 >>>>>>> bricks would mean a client would receive 1000 notifications every 5 >>>>>>> seconds. It doesn't seem too much, but in those cases 10, and considering >>>>>>> we could have other locks, maybe a higher value could be better. >>>>>>> >>>>>>> Xavi >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> --- >>>>>>>> Ashish >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> ------------------------------ >>>>>>>> *From: *"Xavi Hernandez" >>>>>>>> *To: *"gluster-devel" >>>>>>>> *Cc: *"Pranith Kumar Karampuri" , "Ashish >>>>>>>> Pandey" , "Amar Tumballi" >>>>>>> > >>>>>>>> *Sent: *Thursday, May 2, 2019 4:15:38 PM >>>>>>>> *Subject: *Should we enable contention notification by default ? >>>>>>>> >>>>>>>> Hi all, >>>>>>>> >>>>>>>> there's a feature in the locks xlator that sends a notification to >>>>>>>> current owner of a lock when another client tries to acquire the same lock. >>>>>>>> This way the current owner is made aware of the contention and can release >>>>>>>> the lock as soon as possible to allow the other client to proceed. >>>>>>>> >>>>>>>> This is specially useful when eager-locking is used and multiple >>>>>>>> clients access the same files and directories. Currently both replicated >>>>>>>> and dispersed volumes use eager-locking and can use contention notification >>>>>>>> to force an early release of the lock. >>>>>>>> >>>>>>>> Eager-locking reduces the number of network requests required for >>>>>>>> each operation, improving performance, but could add delays to other >>>>>>>> clients while it keeps the inode or entry locked. With the contention >>>>>>>> notification feature we avoid this delay, so we get the best performance >>>>>>>> with minimal issues in multiclient environments. >>>>>>>> >>>>>>>> Currently the contention notification feature is controlled by the >>>>>>>> 'features.lock-notify-contention' option and it's disabled by default. >>>>>>>> Should we enable it by default ? >>>>>>>> >>>>>>>> I don't see any reason to keep it disabled by default. Does anyone >>>>>>>> foresee any problem ? >>>>>>>> >>>>>>> >>>> Is it a server only option? Otherwise it will break backward >>>> compatibility if we rename the key, If alias can get this fixed, that?s a >>>> better choice but I?m not sure if it solves all the problems. >>>> >>> >>> It's a server side option. I though that an alias didn't have any other >>> implication than accept two names for the same option. Is there anything >>> else I need to consider ? >>> >> >> If it?s a server side option then there?s no challenge in alias. If you >> do rename then in heterogeneous server versions volume set wouldn?t work >> though. >> > > I created a patch to change this and set notify-contention to 'yes' by > default. I'll test upgrade paths to make sure that nothing breaks. > > Xavi > > >> >>> >>>> >>>>>>>> Regards, >>>>>>>> >>>>>>>> Xavi >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>> Gluster-devel mailing list >>>>>>> Gluster-devel at gluster.org >>>>>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Milind >>>>>> >>>>>> _______________________________________________ >>>>> Gluster-devel mailing list >>>>> Gluster-devel at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-devel >>>> >>>> -- >>>> --Atin >>>> >>> -- >> --Atin >> > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From abhishpaliwal at gmail.com Thu Jun 6 06:38:20 2019 From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL) Date: Thu, 6 Jun 2019 12:08:20 +0530 Subject: [Gluster-devel] [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Nithya, Here is the Setup details and test which we are doing as below: One client, two gluster Server. The client is writing and deleting one file each 15 minutes by script test_v4.15.sh. IP Server side: 128.224.98.157 /gluster/gv0/ 128.224.98.159 /gluster/gv0/ Client side: 128.224.98.160 /gluster_mount/ Server side: gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ 128.224.98.159:/gluster/gv0/ force gluster volume start gv0 root at 128:/tmp/brick/gv0# gluster volume info Volume Name: gv0 Type: Replicate Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 Status: Started Snapshot Count: 0 Number of Bricks: 1 x 2 = 2 Transport-type: tcp Bricks: Brick1: 128.224.98.157:/gluster/gv0 Brick2: 128.224.98.159:/gluster/gv0 Options Reconfigured: transport.address-family: inet nfs.disable: on performance.client-io-threads: off exec script: ./ps_mem.py -p 605 -w 61 > log root at 128:/# ./ps_mem.py -p 605 Private + Shared = RAM used Program 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd --------------------------------- 24856.0 KiB ================================= Client side: mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 /gluster_mount We are using the below script write and delete the file. *test_v4.15.sh * Also the below script to see the memory increase whihle the script is above script is running in background. *ps_mem.py* I am attaching the script files as well as the result got after testing the scenario. On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran wrote: > Hi, > > Writing to a volume should not affect glusterd. The stack you have shown > in the valgrind looks like the memory used to initialise the structures > glusterd uses and will free only when it is stopped. > > Can you provide more details to what it is you are trying to test? > > Regards, > Nithya > > > On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL > wrote: > >> Hi Team, >> >> Please respond on the issue which I raised. >> >> Regards, >> Abhishek >> >> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL >> wrote: >> >>> Anyone please reply.... >>> >>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>> wrote: >>> >>>> Hi Team, >>>> >>>> I upload some valgrind logs from my gluster 5.4 setup. This is writing >>>> to the volume every 15 minutes. I stopped glusterd and then copy away the >>>> logs. The test was running for some simulated days. They are zipped in >>>> valgrind-54.zip. >>>> >>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>> glusterfs and even some definitely lost bytes. >>>> >>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>>> 391 of 391 >>>> ==2737== at 0x4C29C25: calloc (in >>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>> ==2737== by 0xA22485E: ??? (in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0xA217C94: ??? (in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0xA21D9F8: ??? (in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0xA21DED9: ??? (in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0xA21E685: ??? 
(in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0xA1B9D8C: init (in >>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>> ==2737== by 0x4E511CE: xlator_init (in /usr/lib64/libglusterfs.so.0.0.1) >>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>> /usr/lib64/libglusterfs.so.0.0.1) >>>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>> ==2737== >>>> ==2737== LEAK SUMMARY: >>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>> ==2737== suppressed: 0 bytes in 0 blocks >>>> >>>> -- >>>> >>>> >>>> >>>> >>>> Regards >>>> Abhishek Paliwal >>>> >>> >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > -- Regards Abhishek Paliwal -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem.py Type: text/x-python Size: 18465 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: test_v4.15.sh Type: application/x-shellscript Size: 660 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem_server1.log Type: text/x-log Size: 135168 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ps_mem_server2.log Type: text/x-log Size: 135168 bytes Desc: not available URL: From spisla80 at gmail.com Thu Jun 6 06:46:52 2019 From: spisla80 at gmail.com (David Spisla) Date: Thu, 6 Jun 2019 08:46:52 +0200 Subject: [Gluster-devel] Bitrot: Segmentation fault found in bitrot stub Message-ID: Dear Gluster Devel, all informations are here: https://bugzilla.redhat.com/show_bug.cgi?id=1717757 Also a full backtrace is provided. The place of of the seg fault is located Regards David Spisla -------------- next part -------------- An HTML attachment was scrubbed... URL: From nbalacha at redhat.com Thu Jun 6 10:38:17 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Thu, 6 Jun 2019 16:08:17 +0530 Subject: [Gluster-devel] [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Abhishek, I am still not clear as to the purpose of the tests. Can you clarify why you are using valgrind and why you think there is a memory leak? Regards, Nithya On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL wrote: > Hi Nithya, > > Here is the Setup details and test which we are doing as below: > > > One client, two gluster Server. > The client is writing and deleting one file each 15 minutes by script > test_v4.15.sh. 
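The attached test_v4.15.sh itself is not reproduced in the archive; a hypothetical loop of roughly this shape matches the description, with /gluster_mount taken from the client details that follow:

# hypothetical stand-in for the attached test_v4.15.sh, not the actual script
while true; do
    dd if=/dev/urandom of=/gluster_mount/testfile bs=1M count=1
    rm -f /gluster_mount/testfile
    sleep 900    # one write/delete cycle every 15 minutes
done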
> > IP > Server side: > 128.224.98.157 /gluster/gv0/ > 128.224.98.159 /gluster/gv0/ > > Client side: > 128.224.98.160 /gluster_mount/ > > Server side: > gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ > 128.224.98.159:/gluster/gv0/ force > gluster volume start gv0 > > root at 128:/tmp/brick/gv0# gluster volume info > > Volume Name: gv0 > Type: Replicate > Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 > Status: Started > Snapshot Count: 0 > Number of Bricks: 1 x 2 = 2 > Transport-type: tcp > Bricks: > Brick1: 128.224.98.157:/gluster/gv0 > Brick2: 128.224.98.159:/gluster/gv0 > Options Reconfigured: > transport.address-family: inet > nfs.disable: on > performance.client-io-threads: off > > exec script: ./ps_mem.py -p 605 -w 61 > log > root at 128:/# ./ps_mem.py -p 605 > Private + Shared = RAM used Program > 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd > --------------------------------- > 24856.0 KiB > ================================= > > > Client side: > mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 > /gluster_mount > > > We are using the below script write and delete the file. > > *test_v4.15.sh * > > Also the below script to see the memory increase whihle the script is > above script is running in background. > > *ps_mem.py* > > I am attaching the script files as well as the result got after testing > the scenario. > > On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran > wrote: > >> Hi, >> >> Writing to a volume should not affect glusterd. The stack you have shown >> in the valgrind looks like the memory used to initialise the structures >> glusterd uses and will free only when it is stopped. >> >> Can you provide more details to what it is you are trying to test? >> >> Regards, >> Nithya >> >> >> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >> wrote: >> >>> Hi Team, >>> >>> Please respond on the issue which I raised. >>> >>> Regards, >>> Abhishek >>> >>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>> abhishpaliwal at gmail.com> wrote: >>> >>>> Anyone please reply.... >>>> >>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>> wrote: >>>> >>>>> Hi Team, >>>>> >>>>> I upload some valgrind logs from my gluster 5.4 setup. This is writing >>>>> to the volume every 15 minutes. I stopped glusterd and then copy away the >>>>> logs. The test was running for some simulated days. They are zipped in >>>>> valgrind-54.zip. >>>>> >>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>> glusterfs and even some definitely lost bytes. >>>>> >>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>>>> 391 of 391 >>>>> ==2737== at 0x4C29C25: calloc (in >>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>> ==2737== by 0xA22485E: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA217C94: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21D9F8: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21DED9: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA21E685: ??? (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0xA1B9D8C: init (in >>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x4E8A2B8: ??? 
(in /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in /usr/sbin/glusterfsd) >>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>>> ==2737== >>>>> ==2737== LEAK SUMMARY: >>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>> >>>>> -- >>>>> >>>>> >>>>> >>>>> >>>>> Regards >>>>> Abhishek Paliwal >>>>> >>>> >>> >>> -- >>> >>> >>> >>> >>> Regards >>> Abhishek Paliwal >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> > > -- > > > > > Regards > Abhishek Paliwal > -------------- next part -------------- An HTML attachment was scrubbed... URL: From abhishpaliwal at gmail.com Fri Jun 7 02:43:03 2019 From: abhishpaliwal at gmail.com (ABHISHEK PALIWAL) Date: Fri, 7 Jun 2019 08:13:03 +0530 Subject: [Gluster-devel] [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Nithya, We are having the setup where copying the file to and deleting it from gluster mount point to update the latest file. We noticed due to this having some memory increase in glusterfsd process. To find the memory leak we are using valgrind but didn't get any help. That's why contacted to glusterfs community. Regards, Abhishek On Thu, Jun 6, 2019, 16:08 Nithya Balachandran wrote: > Hi Abhishek, > > I am still not clear as to the purpose of the tests. Can you clarify why > you are using valgrind and why you think there is a memory leak? > > Regards, > Nithya > > On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL > wrote: > >> Hi Nithya, >> >> Here is the Setup details and test which we are doing as below: >> >> >> One client, two gluster Server. >> The client is writing and deleting one file each 15 minutes by script >> test_v4.15.sh. >> >> IP >> Server side: >> 128.224.98.157 /gluster/gv0/ >> 128.224.98.159 /gluster/gv0/ >> >> Client side: >> 128.224.98.160 /gluster_mount/ >> >> Server side: >> gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ >> 128.224.98.159:/gluster/gv0/ force >> gluster volume start gv0 >> >> root at 128:/tmp/brick/gv0# gluster volume info >> >> Volume Name: gv0 >> Type: Replicate >> Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 >> Status: Started >> Snapshot Count: 0 >> Number of Bricks: 1 x 2 = 2 >> Transport-type: tcp >> Bricks: >> Brick1: 128.224.98.157:/gluster/gv0 >> Brick2: 128.224.98.159:/gluster/gv0 >> Options Reconfigured: >> transport.address-family: inet >> nfs.disable: on >> performance.client-io-threads: off >> >> exec script: ./ps_mem.py -p 605 -w 61 > log >> root at 128:/# ./ps_mem.py -p 605 >> Private + Shared = RAM used Program >> 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd >> --------------------------------- >> 24856.0 KiB >> ================================= >> >> >> Client side: >> mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 >> /gluster_mount >> >> >> We are using the below script write and delete the file. >> >> *test_v4.15.sh * >> >> Also the below script to see the memory increase whihle the script is >> above script is running in background. 
>> >> *ps_mem.py* >> >> I am attaching the script files as well as the result got after testing >> the scenario. >> >> On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran >> wrote: >> >>> Hi, >>> >>> Writing to a volume should not affect glusterd. The stack you have shown >>> in the valgrind looks like the memory used to initialise the structures >>> glusterd uses and will free only when it is stopped. >>> >>> Can you provide more details to what it is you are trying to test? >>> >>> Regards, >>> Nithya >>> >>> >>> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >>> wrote: >>> >>>> Hi Team, >>>> >>>> Please respond on the issue which I raised. >>>> >>>> Regards, >>>> Abhishek >>>> >>>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>>> abhishpaliwal at gmail.com> wrote: >>>> >>>>> Anyone please reply.... >>>>> >>>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>>> wrote: >>>>> >>>>>> Hi Team, >>>>>> >>>>>> I upload some valgrind logs from my gluster 5.4 setup. This is >>>>>> writing to the volume every 15 minutes. I stopped glusterd and then copy >>>>>> away the logs. The test was running for some simulated days. They are >>>>>> zipped in valgrind-54.zip. >>>>>> >>>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>>> glusterfs and even some definitely lost bytes. >>>>>> >>>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss record >>>>>> 391 of 391 >>>>>> ==2737== at 0x4C29C25: calloc (in >>>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>>> ==2737== by 0xA22485E: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA217C94: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21D9F8: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21DED9: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA21E685: ??? (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0xA1B9D8C: init (in >>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in >>>>>> /usr/sbin/glusterfsd) >>>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in /usr/sbin/glusterfsd) >>>>>> ==2737== >>>>>> ==2737== LEAK SUMMARY: >>>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>>> >>>>>> -- >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> Regards >>>>>> Abhishek Paliwal >>>>>> >>>>> >>>> >>>> -- >>>> >>>> >>>> >>>> >>>> Regards >>>> Abhishek Paliwal >>>> _______________________________________________ >>>> Gluster-users mailing list >>>> Gluster-users at gluster.org >>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>> >>> >> >> -- >> >> >> >> >> Regards >> Abhishek Paliwal >> > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From nbalacha at redhat.com Fri Jun 7 03:09:03 2019 From: nbalacha at redhat.com (Nithya Balachandran) Date: Fri, 7 Jun 2019 08:39:03 +0530 Subject: [Gluster-devel] [Gluster-users] Memory leak in glusterfs In-Reply-To: References: Message-ID: Hi Abhishek, Please use statedumps taken at intervals to determine where the memory is increasing. See [1] for details. Regards, Nithya [1] https://docs.gluster.org/en/latest/Troubleshooting/statedump/ On Fri, 7 Jun 2019 at 08:13, ABHISHEK PALIWAL wrote: > Hi Nithya, > > We are having the setup where copying the file to and deleting it from > gluster mount point to update the latest file. We noticed due to this > having some memory increase in glusterfsd process. > > To find the memory leak we are using valgrind but didn't get any help. > > That's why contacted to glusterfs community. > > Regards, > Abhishek > > On Thu, Jun 6, 2019, 16:08 Nithya Balachandran > wrote: > >> Hi Abhishek, >> >> I am still not clear as to the purpose of the tests. Can you clarify why >> you are using valgrind and why you think there is a memory leak? >> >> Regards, >> Nithya >> >> On Thu, 6 Jun 2019 at 12:09, ABHISHEK PALIWAL >> wrote: >> >>> Hi Nithya, >>> >>> Here is the Setup details and test which we are doing as below: >>> >>> >>> One client, two gluster Server. >>> The client is writing and deleting one file each 15 minutes by script >>> test_v4.15.sh. >>> >>> IP >>> Server side: >>> 128.224.98.157 /gluster/gv0/ >>> 128.224.98.159 /gluster/gv0/ >>> >>> Client side: >>> 128.224.98.160 /gluster_mount/ >>> >>> Server side: >>> gluster volume create gv0 replica 2 128.224.98.157:/gluster/gv0/ >>> 128.224.98.159:/gluster/gv0/ force >>> gluster volume start gv0 >>> >>> root at 128:/tmp/brick/gv0# gluster volume info >>> >>> Volume Name: gv0 >>> Type: Replicate >>> Volume ID: 7105a475-5929-4d60-ba23-be57445d97b5 >>> Status: Started >>> Snapshot Count: 0 >>> Number of Bricks: 1 x 2 = 2 >>> Transport-type: tcp >>> Bricks: >>> Brick1: 128.224.98.157:/gluster/gv0 >>> Brick2: 128.224.98.159:/gluster/gv0 >>> Options Reconfigured: >>> transport.address-family: inet >>> nfs.disable: on >>> performance.client-io-threads: off >>> >>> exec script: ./ps_mem.py -p 605 -w 61 > log >>> root at 128:/# ./ps_mem.py -p 605 >>> Private + Shared = RAM used Program >>> 23668.0 KiB + 1188.0 KiB = 24856.0 KiB glusterfsd >>> --------------------------------- >>> 24856.0 KiB >>> ================================= >>> >>> >>> Client side: >>> mount -t glusterfs -o acl -o resolve-gids 128.224.98.157:gv0 >>> /gluster_mount >>> >>> >>> We are using the below script write and delete the file. >>> >>> *test_v4.15.sh * >>> >>> Also the below script to see the memory increase whihle the script is >>> above script is running in background. >>> >>> *ps_mem.py* >>> >>> I am attaching the script files as well as the result got after testing >>> the scenario. >>> >>> On Wed, Jun 5, 2019 at 7:23 PM Nithya Balachandran >>> wrote: >>> >>>> Hi, >>>> >>>> Writing to a volume should not affect glusterd. The stack you have >>>> shown in the valgrind looks like the memory used to initialise the >>>> structures glusterd uses and will free only when it is stopped. >>>> >>>> Can you provide more details to what it is you are trying to test? >>>> >>>> Regards, >>>> Nithya >>>> >>>> >>>> On Tue, 4 Jun 2019 at 15:41, ABHISHEK PALIWAL >>>> wrote: >>>> >>>>> Hi Team, >>>>> >>>>> Please respond on the issue which I raised. 
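A minimal sketch of that statedump approach, using the gv0 volume from this thread; the pid and the dump location are whatever your setup reports (server.statedump-path defaults to /var/run/gluster):

gluster volume statedump gv0       # dump the state of all brick processes of the volume
kill -USR1 <pid-of-glusterfsd>     # or signal one specific gluster process directly
ls /var/run/gluster/*.dump.*       # one timestamped file per process
# take a second set after the write/delete loop has run for a while and compare the
# per-xlator memory accounting (num_allocs / total_allocs / size) between the dumps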
>>>>> >>>>> Regards, >>>>> Abhishek >>>>> >>>>> On Fri, May 17, 2019 at 2:46 PM ABHISHEK PALIWAL < >>>>> abhishpaliwal at gmail.com> wrote: >>>>> >>>>>> Anyone please reply.... >>>>>> >>>>>> On Thu, May 16, 2019, 10:49 ABHISHEK PALIWAL >>>>>> wrote: >>>>>> >>>>>>> Hi Team, >>>>>>> >>>>>>> I upload some valgrind logs from my gluster 5.4 setup. This is >>>>>>> writing to the volume every 15 minutes. I stopped glusterd and then copy >>>>>>> away the logs. The test was running for some simulated days. They are >>>>>>> zipped in valgrind-54.zip. >>>>>>> >>>>>>> Lots of info in valgrind-2730.log. Lots of possibly lost bytes in >>>>>>> glusterfs and even some definitely lost bytes. >>>>>>> >>>>>>> ==2737== 1,572,880 bytes in 1 blocks are possibly lost in loss >>>>>>> record 391 of 391 >>>>>>> ==2737== at 0x4C29C25: calloc (in >>>>>>> /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so) >>>>>>> ==2737== by 0xA22485E: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA217C94: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21D9F8: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21DED9: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA21E685: ??? (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0xA1B9D8C: init (in >>>>>>> /usr/lib64/glusterfs/5.4/xlator/mgmt/glusterd.so) >>>>>>> ==2737== by 0x4E511CE: xlator_init (in >>>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x4E8A2B8: ??? (in /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x4E8AAB3: glusterfs_graph_activate (in >>>>>>> /usr/lib64/libglusterfs.so.0.0.1) >>>>>>> ==2737== by 0x409C35: glusterfs_process_volfp (in >>>>>>> /usr/sbin/glusterfsd) >>>>>>> ==2737== by 0x409D99: glusterfs_volumes_init (in >>>>>>> /usr/sbin/glusterfsd) >>>>>>> ==2737== >>>>>>> ==2737== LEAK SUMMARY: >>>>>>> ==2737== definitely lost: 1,053 bytes in 10 blocks >>>>>>> ==2737== indirectly lost: 317 bytes in 3 blocks >>>>>>> ==2737== possibly lost: 2,374,971 bytes in 524 blocks >>>>>>> ==2737== still reachable: 53,277 bytes in 201 blocks >>>>>>> ==2737== suppressed: 0 bytes in 0 blocks >>>>>>> >>>>>>> -- >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> Regards >>>>>>> Abhishek Paliwal >>>>>>> >>>>>> >>>>> >>>>> -- >>>>> >>>>> >>>>> >>>>> >>>>> Regards >>>>> Abhishek Paliwal >>>>> _______________________________________________ >>>>> Gluster-users mailing list >>>>> Gluster-users at gluster.org >>>>> https://lists.gluster.org/mailman/listinfo/gluster-users >>>> >>>> >>> >>> -- >>> >>> >>> >>> >>> Regards >>> Abhishek Paliwal >>> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Fri Jun 7 04:36:25 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 7 Jun 2019 10:06:25 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Fwd: Build failed in Jenkins: regression-test-with-multiplex #1359 In-Reply-To: References: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: Got time to test subdir-mount.t failing in brick-mux scenario. I noticed some issues, where I need further help from glusterd team. subdir-mount.t expects 'hook' script to run after add-brick to make sure the required subdirectories are healed and are present in new bricks. This is important as subdir mount expects the subdirs to exist for successful mount. 
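For context, a minimal sketch of the dependency being described; host names, brick paths and mount points are placeholders:

gluster volume create patchy host1:/bricks/patchy1 host2:/bricks/patchy2
gluster volume start patchy
mkdir -p /mnt/patchy /mnt/subdir1
mount -t glusterfs host1:/patchy /mnt/patchy            # mount of the volume root
mkdir /mnt/patchy/subdir1                               # create the directory inside the volume
mount -t glusterfs host1:/patchy/subdir1 /mnt/subdir1   # subdir mount; fails if subdir1 is missing
# after add-brick, the S13-create-subdir-mount.sh post hook is expected to make sure such
# directories are healed onto the newly added bricks before the next subdir mount is attempted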
But in case of brick-mux setup, I see that in some cases (6/10), hook script (add-brick/post-hook/S13-create-subdir-mount.sh) started getting executed after 20second of finishing the add-brick command. Due to this, the mount which we execute after add-brick failed. My question is, what is making post hook script to run so late ?? I can recreate the issues locally on my laptop too. On Sat, Jun 1, 2019 at 4:55 PM Atin Mukherjee wrote: > subdir-mount.t has started failing in brick mux regression nightly. This > needs to be fixed. > > Raghavendra - did we manage to get any further clue on uss.t failure? > > ---------- Forwarded message --------- > From: > Date: Fri, 31 May 2019 at 23:34 > Subject: [Gluster-Maintainers] Build failed in Jenkins: > regression-test-with-multiplex #1359 > To: , , , > , > > > See < > https://build.gluster.org/job/regression-test-with-multiplex/1359/display/redirect?page=changes > > > > Changes: > > [atin] glusterd: add an op-version check > > [atin] glusterd/svc: glusterd_svcs_stop should call individual wrapper > function > > [atin] glusterd/svc: Stop stale process using the glusterd_proc_stop > > [Amar Tumballi] lcov: more coverage to shard, old-protocol, sdfs > > [Kotresh H R] tests/geo-rep: Add EC volume test case > > [Amar Tumballi] glusterfsd/cleanup: Protect graph object under a lock > > [Mohammed Rafi KC] glusterd/shd: Optimize the glustershd manager to send > reconfigure > > [Kotresh H R] tests/geo-rep: Add tests to cover glusterd geo-rep > > [atin] glusterd: Optimize code to copy dictionary in handshake code path > > ------------------------------------------ > [...truncated 3.18 MB...] > ./tests/basic/afr/stale-file-lookup.t - 9 second > ./tests/basic/afr/granular-esh/replace-brick.t - 9 second > ./tests/basic/afr/granular-esh/add-brick.t - 9 second > ./tests/basic/afr/gfid-mismatch.t - 9 second > ./tests/performance/open-behind.t - 8 second > ./tests/features/ssl-authz.t - 8 second > ./tests/features/readdir-ahead.t - 8 second > ./tests/bugs/upcall/bug-1458127.t - 8 second > ./tests/bugs/transport/bug-873367.t - 8 second > ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 8 second > ./tests/bugs/replicate/bug-1132102.t - 8 second > ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove-quota-quota-deem-statfs.t > - 8 second > ./tests/bugs/quota/bug-1104692.t - 8 second > ./tests/bugs/posix/bug-1360679.t - 8 second > ./tests/bugs/posix/bug-1122028.t - 8 second > ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second > ./tests/bugs/glusterfs/bug-861015-log.t - 8 second > ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second > ./tests/bugs/glusterd/bug-1696046.t - 8 second > ./tests/bugs/fuse/bug-983477.t - 8 second > ./tests/bugs/ec/bug-1227869.t - 8 second > ./tests/bugs/distribute/bug-1088231.t - 8 second > ./tests/bugs/distribute/bug-1086228.t - 8 second > ./tests/bugs/cli/bug-1087487.t - 8 second > ./tests/bugs/cli/bug-1022905.t - 8 second > ./tests/bugs/bug-1258069.t - 8 second > ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub-info.t > - 8 second > ./tests/basic/xlator-pass-through-sanity.t - 8 second > ./tests/basic/quota-nfs.t - 8 second > ./tests/basic/glusterd/arbiter-volume.t - 8 second > ./tests/basic/ctime/ctime-noatime.t - 8 second > ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second > ./tests/gfid2path/get-gfid-to-path.t - 7 second > ./tests/bugs/upcall/bug-1369430.t - 7 second > ./tests/bugs/snapshot/bug-1260848.t - 7 second > ./tests/bugs/shard/shard-inode-refcount-test.t - 7 
second > ./tests/bugs/shard/bug-1258334.t - 7 second > ./tests/bugs/replicate/bug-767585-gfid.t - 7 second > ./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 second > ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second > ./tests/bugs/posix/bug-1175711.t - 7 second > ./tests/bugs/nfs/bug-915280.t - 7 second > ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second > ./tests/bugs/md-cache/bug-1211863_unlink.t - 7 second > ./tests/bugs/glusterfs/bug-848251.t - 7 second > ./tests/bugs/distribute/bug-1122443.t - 7 second > ./tests/bugs/changelog/bug-1208470.t - 7 second > ./tests/bugs/bug-1702299.t - 7 second > ./tests/bugs/bug-1371806_2.t - 7 second > ./tests/bugs/bitrot/1209818-vol-info-show-scrub-process-properly.t - 7 > second > ./tests/bugs/bitrot/1209751-bitrot-scrub-tunable-reset.t - 7 second > ./tests/bugs/bitrot/1207029-bitrot-daemon-should-start-on-valid-node.t - > 7 second > ./tests/bitrot/br-stub.t - 7 second > ./tests/basic/glusterd/arbiter-volume-probe.t - 7 second > ./tests/basic/gfapi/libgfapi-fini-hang.t - 7 second > ./tests/basic/fencing/fencing-crash-conistency.t - 7 second > ./tests/basic/distribute/file-create.t - 7 second > ./tests/basic/afr/tarissue.t - 7 second > ./tests/basic/afr/gfid-heal.t - 7 second > ./tests/bugs/snapshot/bug-1178079.t - 6 second > ./tests/bugs/snapshot/bug-1064768.t - 6 second > ./tests/bugs/shard/bug-1342298.t - 6 second > ./tests/bugs/shard/bug-1259651.t - 6 second > ./tests/bugs/replicate/bug-1686568-send-truncate-on-arbiter-from-shd.t - > 6 second > ./tests/bugs/replicate/bug-1626994-info-split-brain.t - 6 second > ./tests/bugs/replicate/bug-1325792.t - 6 second > ./tests/bugs/replicate/bug-1101647.t - 6 second > ./tests/bugs/quota/bug-1243798.t - 6 second > ./tests/bugs/protocol/bug-1321578.t - 6 second > ./tests/bugs/nfs/bug-877885.t - 6 second > ./tests/bugs/nfs/bug-1143880-fix-gNFSd-auth-crash.t - 6 second > ./tests/bugs/md-cache/bug-1476324.t - 6 second > ./tests/bugs/md-cache/afr-stale-read.t - 6 second > ./tests/bugs/io-cache/bug-858242.t - 6 second > ./tests/bugs/glusterfs/bug-893378.t - 6 second > ./tests/bugs/glusterfs/bug-856455.t - 6 second > ./tests/bugs/glusterd/quorum-value-check.t - 6 second > ./tests/bugs/ec/bug-1179050.t - 6 second > ./tests/bugs/distribute/bug-912564.t - 6 second > ./tests/bugs/distribute/bug-884597.t - 6 second > ./tests/bugs/distribute/bug-1368012.t - 6 second > ./tests/bugs/core/bug-986429.t - 6 second > ./tests/bugs/core/bug-1699025-brick-mux-detach-brick-fd-issue.t - 6 > second > ./tests/bugs/core/bug-1168803-snapd-option-validation-fix.t - 6 second > ./tests/bugs/bug-1371806_1.t - 6 second > ./tests/bugs/bitrot/bug-1229134-bitd-not-support-vol-set.t - 6 second > ./tests/bugs/bitrot/bug-1210684-scrub-pause-resume-error-handling.t - 6 > second > ./tests/bitrot/bug-1221914.t - 6 second > ./tests/basic/trace.t - 6 second > ./tests/basic/playground/template-xlator-sanity.t - 6 second > ./tests/basic/ec/nfs.t - 6 second > ./tests/basic/ec/ec-read-policy.t - 6 second > ./tests/basic/ec/ec-anonymous-fd.t - 6 second > ./tests/basic/distribute/non-root-unlink-stale-linkto.t - 6 second > ./tests/basic/changelog/changelog-rename.t - 6 second > ./tests/basic/afr/heal-info.t - 6 second > ./tests/basic/afr/afr-read-hash-mode.t - 6 second > ./tests/gfid2path/gfid2path_nfs.t - 5 second > ./tests/bugs/upcall/bug-1422776.t - 5 second > ./tests/bugs/replicate/bug-886998.t - 5 second > ./tests/bugs/replicate/bug-1365455.t - 5 second > ./tests/bugs/readdir-ahead/bug-1670253-consistent-metadata.t - 5 second 
> ./tests/bugs/posix/bug-gfid-path.t - 5 second > ./tests/bugs/posix/bug-765380.t - 5 second > ./tests/bugs/nfs/bug-847622.t - 5 second > ./tests/bugs/nfs/bug-1116503.t - 5 second > ./tests/bugs/io-stats/bug-1598548.t - 5 second > ./tests/bugs/glusterfs-server/bug-877992.t - 5 second > ./tests/bugs/glusterfs-server/bug-873549.t - 5 second > ./tests/bugs/glusterfs/bug-895235.t - 5 second > ./tests/bugs/fuse/bug-1126048.t - 5 second > ./tests/bugs/distribute/bug-907072.t - 5 second > ./tests/bugs/core/bug-913544.t - 5 second > ./tests/bugs/core/bug-908146.t - 5 second > ./tests/bugs/access-control/bug-1051896.t - 5 second > ./tests/basic/ec/ec-internal-xattrs.t - 5 second > ./tests/basic/ec/ec-fallocate.t - 5 second > ./tests/basic/distribute/bug-1265677-use-readdirp.t - 5 second > ./tests/basic/afr/arbiter-remove-brick.t - 5 second > ./tests/performance/quick-read.t - 4 second > ./tests/gfid2path/block-mount-access.t - 4 second > ./tests/features/delay-gen.t - 4 second > ./tests/bugs/upcall/bug-upcall-stat.t - 4 second > ./tests/bugs/upcall/bug-1394131.t - 4 second > ./tests/bugs/unclassified/bug-1034085.t - 4 second > ./tests/bugs/snapshot/bug-1111041.t - 4 second > ./tests/bugs/shard/bug-1272986.t - 4 second > ./tests/bugs/shard/bug-1256580.t - 4 second > ./tests/bugs/shard/bug-1250855.t - 4 second > ./tests/bugs/shard/bug-1245547.t - 4 second > ./tests/bugs/rpc/bug-954057.t - 4 second > ./tests/bugs/replicate/bug-976800.t - 4 second > ./tests/bugs/replicate/bug-880898.t - 4 second > ./tests/bugs/replicate/bug-1480525.t - 4 second > ./tests/bugs/read-only/bug-1134822-read-only-default-in-graph.t - 4 > second > ./tests/bugs/readdir-ahead/bug-1446516.t - 4 second > ./tests/bugs/readdir-ahead/bug-1439640.t - 4 second > ./tests/bugs/readdir-ahead/bug-1390050.t - 4 second > ./tests/bugs/quota/bug-1287996.t - 4 second > ./tests/bugs/quick-read/bug-846240.t - 4 second > ./tests/bugs/posix/disallow-gfid-volumeid-removexattr.t - 4 second > ./tests/bugs/posix/bug-1619720.t - 4 second > ./tests/bugs/nl-cache/bug-1451588.t - 4 second > ./tests/bugs/nfs/zero-atime.t - 4 second > ./tests/bugs/nfs/subdir-trailing-slash.t - 4 second > ./tests/bugs/nfs/socket-as-fifo.t - 4 second > ./tests/bugs/nfs/showmount-many-clients.t - 4 second > ./tests/bugs/nfs/bug-1210338.t - 4 second > ./tests/bugs/nfs/bug-1166862.t - 4 second > ./tests/bugs/nfs/bug-1161092-nfs-acls.t - 4 second > ./tests/bugs/md-cache/bug-1632503.t - 4 second > ./tests/bugs/glusterfs-server/bug-864222.t - 4 second > ./tests/bugs/glusterfs/bug-1482528.t - 4 second > ./tests/bugs/glusterd/bug-948729/bug-948729-mode-script.t - 4 second > ./tests/bugs/glusterd/bug-948729/bug-948729-force.t - 4 second > ./tests/bugs/glusterd/bug-1482906-peer-file-blank-line.t - 4 second > ./tests/bugs/glusterd/bug-1091935-brick-order-check-from-cli-to-glusterd.t > - 4 second > ./tests/bugs/geo-replication/bug-1296496.t - 4 second > ./tests/bugs/fuse/bug-1336818.t - 4 second > ./tests/bugs/fuse/bug-1283103.t - 4 second > ./tests/bugs/core/io-stats-1322825.t - 4 second > ./tests/bugs/core/bug-834465.t - 4 second > ./tests/bugs/core/bug-1135514-allow-setxattr-with-null-value.t - 4 second > ./tests/bugs/core/949327.t - 4 second > ./tests/bugs/cli/bug-977246.t - 4 second > ./tests/bugs/cli/bug-961307.t - 4 second > ./tests/bugs/cli/bug-1004218.t - 4 second > ./tests/bugs/bug-1138841.t - 4 second > ./tests/bugs/access-control/bug-1387241.t - 4 second > ./tests/bitrot/bug-internal-xattrs-check-1243391.t - 4 second > ./tests/basic/quota-rename.t - 4 second > 
./tests/basic/hardlink-limit.t - 4 second > ./tests/basic/ec/dht-rename.t - 4 second > ./tests/basic/distribute/lookup.t - 4 second > ./tests/line-coverage/meta-max-coverage.t - 3 second > ./tests/gfid2path/gfid2path_fuse.t - 3 second > ./tests/bugs/unclassified/bug-991622.t - 3 second > ./tests/bugs/trace/bug-797171.t - 3 second > ./tests/bugs/glusterfs-server/bug-861542.t - 3 second > ./tests/bugs/glusterfs/bug-869724.t - 3 second > ./tests/bugs/glusterfs/bug-860297.t - 3 second > ./tests/bugs/glusterfs/bug-844688.t - 3 second > ./tests/bugs/glusterd/bug-948729/bug-948729.t - 3 second > ./tests/bugs/distribute/bug-1204140.t - 3 second > ./tests/bugs/core/bug-924075.t - 3 second > ./tests/bugs/core/bug-845213.t - 3 second > ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second > ./tests/bugs/core/bug-1119582.t - 3 second > ./tests/bugs/core/bug-1117951.t - 3 second > ./tests/bugs/cli/bug-983317-volume-get.t - 3 second > ./tests/bugs/cli/bug-867252.t - 3 second > ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second > ./tests/basic/fops-sanity.t - 3 second > ./tests/basic/fencing/test-fence-option.t - 3 second > ./tests/basic/distribute/debug-xattrs.t - 3 second > ./tests/basic/afr/ta-check-locks.t - 3 second > ./tests/line-coverage/volfile-with-all-graph-syntax.t - 2 second > ./tests/line-coverage/some-features-in-libglusterfs.t - 2 second > ./tests/bugs/shard/bug-1261773.t - 2 second > ./tests/bugs/replicate/bug-884328.t - 2 second > ./tests/bugs/readdir-ahead/bug-1512437.t - 2 second > ./tests/bugs/nfs/bug-970070.t - 2 second > ./tests/bugs/nfs/bug-1302948.t - 2 second > ./tests/bugs/logging/bug-823081.t - 2 second > ./tests/bugs/glusterfs-server/bug-889996.t - 2 second > ./tests/bugs/glusterfs/bug-892730.t - 2 second > ./tests/bugs/glusterfs/bug-811493.t - 2 second > ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second > ./tests/bugs/distribute/bug-924265.t - 2 second > ./tests/bugs/core/log-bug-1362520.t - 2 second > ./tests/bugs/core/bug-903336.t - 2 second > ./tests/bugs/core/bug-1111557.t - 2 second > ./tests/bugs/cli/bug-969193.t - 2 second > ./tests/bugs/cli/bug-949298.t - 2 second > ./tests/bugs/cli/bug-921215.t - 2 second > ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second > ./tests/basic/peer-parsing.t - 2 second > ./tests/basic/md-cache/bug-1418249.t - 2 second > ./tests/basic/afr/arbiter-cli.t - 2 second > ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second > ./tests/bugs/glusterfs/bug-853690.t - 1 second > ./tests/bugs/cli/bug-764638.t - 1 second > ./tests/bugs/cli/bug-1047378.t - 1 second > ./tests/basic/netgroup_parsing.t - 1 second > ./tests/basic/gfapi/sink.t - 1 second > ./tests/basic/exports_parsing.t - 1 second > ./tests/basic/posixonly.t - 0 second > ./tests/basic/glusterfsd-args.t - 0 second > > > 2 test(s) failed > ./tests/basic/uss.t > ./tests/features/subdir-mount.t > > 0 test(s) generated core > > > 5 test(s) needed retry > ./tests/basic/afr/split-brain-favorite-child-policy.t > ./tests/basic/ec/self-heal.t > ./tests/basic/uss.t > ./tests/basic/volfile-sanity.t > ./tests/features/subdir-mount.t > > Result is 1 > > tar: Removing leading `/' from member names > kernel.core_pattern = /%e-%p.core > Build step 'Execute shell' marked build as failure > _______________________________________________ > maintainers mailing list > maintainers at gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers > > > -- > - Atin (atinm) > _______________________________________________ > maintainers mailing list > maintainers at 
gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Fri Jun 7 04:54:53 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Fri, 7 Jun 2019 10:24:53 +0530 Subject: [Gluster-devel] CI failure - NameError: name 'unicode' is not defined (related to changelogparser.py) In-Reply-To: References: Message-ID: Hi Yaniv, We are working on this. The builders are picking up python3.6 which is leading to modules missing and such undefined errors. Kotresh has sent a patch https://review.gluster.org/#/c/glusterfs/+/22829/ to fix the issue. On Thu, Jun 6, 2019 at 11:49 AM Yaniv Kaul wrote: > From [1]. > > I think it's a Python2/3 thing, so perhaps a CI issue additionally (though > if our code is not Python 3 ready, let's ensure we use Python 2 explicitly > until we fix this). > > *00:47:05.207* ok 14 [ 13/ 386] < 34> 'gluster --mode=script --wignore volume start patchy'*00:47:05.207* ok 15 [ 13/ 70] < 36> '_GFS --attribute-timeout=0 --entry-timeout=0 --volfile-id=patchy --volfile-server=builder208.int.aws.gluster.org /mnt/glusterfs/0'*00:47:05.207* Traceback (most recent call last):*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 233, in *00:47:05.207* parse(sys.argv[1])*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 221, in parse*00:47:05.207* process_record(data, tokens, changelog_ts, callback)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 178, in process_record*00:47:05.207* callback(record)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 182, in default_callback*00:47:05.207* sys.stdout.write(u"{0}\n".format(record))*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 128, in __str__*00:47:05.207* return unicode(self).encode('utf-8')*00:47:05.207* NameError: name 'unicode' is not defined*00:47:05.207* not ok 16 [ 53/ 39] < 42> '2 check_changelog_op /d/backends/patchy0/.glusterfs/changelogs RENAME' -> 'Got "0" instead of "2"' > > > Y. > > [1] https://build.gluster.org/job/centos7-regression/6318/console > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Sun Jun 9 04:48:48 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Sun, 9 Jun 2019 10:18:48 +0530 Subject: [Gluster-devel] CI failure - NameError: name 'unicode' is not defined (related to changelogparser.py) In-Reply-To: References: Message-ID: Update: The issue happened because python3 got installed on centos7.x series of builders due to other package dependencies. And considering GlusterFS picks python3 as priority even if python2 is default, the tests started to fail. We had completed the work of migrating the code to work smoothly with python3 by glusterfs-6.0 release, but had not noticed issues with regression framework as it was running only on centos7 (python2) earlier. 
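For context, the failure boils down to Python 3 having dropped the `unicode` builtin that changelogparser.py's __str__ relies on. A python2/python3-compatible version of that method would look roughly like the sketch below; this is an illustration only (the Record fields shown are made up), not necessarily the exact approach taken in the patch under review:

    import sys

    class Record(object):
        """Minimal stand-in for changelogparser's Record; real field names differ."""

        def __init__(self, ts, fop):
            self.ts = ts
            self.fop = fop

        def __unicode__(self):
            return u"{0} {1}".format(self.ts, self.fop)

        def __str__(self):
            if sys.version_info[0] >= 3:
                # On Python 3, str already is unicode; returning bytes here would be wrong
                return self.__unicode__()
            # On Python 2, __str__ must return bytes, so encode the unicode text
            return self.__unicode__().encode('utf-8')

    print(str(Record("1559912825", "RENAME")))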
With this event, our regression tests are also now compatible with python3 (thanks to the below mentioned patch of Kotresh). We were able to mark a few spurious failures as BAD_TEST, and fix all the python3 related issues in regression by EOD Friday, and after watching regression tests for 1 more day, I can say that the issues are now resolved. Please resubmit (or rebase in the gerrit web) before triggering the 'recheck centos' in the submitted patch(es). Thanks everyone who responded quickly once the issue was noticed, and we are back to GREEN again. Regards, Amar On Fri, Jun 7, 2019 at 10:26 AM Deepshikha Khandelwal wrote: > Hi Yaniv, > > We are working on this. The builders are picking up python3.6 which is > leading to modules missing and such undefined errors. > > Kotresh has sent a patch https://review.gluster.org/#/c/glusterfs/+/22829/ > to fix the issue. > > > > On Thu, Jun 6, 2019 at 11:49 AM Yaniv Kaul wrote: > >> From [1]. >> >> I think it's a Python2/3 thing, so perhaps a CI issue additionally >> (though if our code is not Python 3 ready, let's ensure we use Python 2 >> explicitly until we fix this). >> >> *00:47:05.207* ok 14 [ 13/ 386] < 34> 'gluster --mode=script --wignore volume start patchy'*00:47:05.207* ok 15 [ 13/ 70] < 36> '_GFS --attribute-timeout=0 --entry-timeout=0 --volfile-id=patchy --volfile-server=builder208.int.aws.gluster.org /mnt/glusterfs/0'*00:47:05.207* Traceback (most recent call last):*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 233, in *00:47:05.207* parse(sys.argv[1])*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 221, in parse*00:47:05.207* process_record(data, tokens, changelog_ts, callback)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 178, in process_record*00:47:05.207* callback(record)*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 182, in default_callback*00:47:05.207* sys.stdout.write(u"{0}\n".format(record))*00:47:05.207* File "./tests/basic/changelog/../../utils/changelogparser.py", line 128, in __str__*00:47:05.207* return unicode(self).encode('utf-8')*00:47:05.207* NameError: name 'unicode' is not defined*00:47:05.207* not ok 16 [ 53/ 39] < 42> '2 check_changelog_op /d/backends/patchy0/.glusterfs/changelogs RENAME' -> 'Got "0" instead of "2"' >> >> >> Y. >> >> [1] https://build.gluster.org/job/centos7-regression/6318/console >> >> _______________________________________________ >> >> Community Meeting Calendar: >> >> APAC Schedule - >> Every 2nd and 4th Tuesday at 11:30 AM IST >> Bridge: https://bluejeans.com/836554017 >> >> NA/EMEA Schedule - >> Every 1st and 3rd Tuesday at 01:00 PM EDT >> Bridge: https://bluejeans.com/486278655 >> >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel >> >> _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed...
URL: From jenkins at build.gluster.org Mon Jun 10 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 10 Jun 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <1648858451.122.1560131103113.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 6 lines...] https://bugzilla.redhat.com/1714851 / core: issues with 'list.h' elements in clang-scan https://bugzilla.redhat.com/1716790 / geo-replication: geo-rep: Rename with same destination name test case occasionally fails on EC Volume https://bugzilla.redhat.com/1716812 / glusterd: Failed to create volume which transport_type is "tcp,rdma" https://bugzilla.redhat.com/1716875 / gluster-smb: Inode Unref Assertion failed: inode->ref https://bugzilla.redhat.com/1716455 / gluster-smb: OS X error -50 when creating sub-folder on Samba share when using Gluster VFS https://bugzilla.redhat.com/1716440 / gluster-smb: SMBD thread panics when connected to from OS X machine https://bugzilla.redhat.com/1714895 / libglusterfsclient: Glusterfs(fuse) client crash https://bugzilla.redhat.com/1717824 / locks: Fencing: Added the tcmu-runner ALUA feature support but after one of node is rebooted the glfs_file_lock() get stucked https://bugzilla.redhat.com/1718562 / locks: flock failure (regression) https://bugzilla.redhat.com/1718227 / scripts: SELinux context labels are missing for newly added bricks using add-brick command [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 1492 bytes Desc: not available URL: From ykaul at redhat.com Mon Jun 10 06:40:27 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Mon, 10 Jun 2019 09:40:27 +0300 Subject: [Gluster-devel] Test failed - due to out of memory on builder201? Message-ID: >From [1], we can see that non-root-unlink-stale-linkto.t failed on: useradd: /etc/passwd.30380: Cannot allocate memory useradd: cannot lock /etc/passwd; try again later. My patch[2] only removed include statements that were not needed. I'm not sure how it can cause a memory issue. So it's either we have some regression, or the slaves do not have enough memory. It was running on builder201.aws.gluster.org . Checking it, I see other jobs[3] failing on the same issue. Perhaps it is the slave? Any ideas? TIA, Y. [1] https://build.gluster.org/job/centos7-regression/6382/consoleFull [2] https://review.gluster.org/#/c/glusterfs/+/22844/ [3] https://build.gluster.org/job/centos7-regression/6385/console BTW, we REALLY need to fix this message - it's clueless: E [MSGID: 106176] [glusterd-handshake.c:1038:__server_getspec] 0-management: Failed to mount the volume -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Mon Jun 10 07:05:45 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Mon, 10 Jun 2019 12:35:45 +0530 Subject: [Gluster-devel] Test failed - due to out of memory on builder201? In-Reply-To: References: Message-ID: Hi Yaniv, I'm working on it. Looking at the logs and further health checkups, I did not find any memory issue tracebacks on builder201. On Mon, Jun 10, 2019 at 12:12 PM Yaniv Kaul wrote: > From [1], we can see that non-root-unlink-stale-linkto.t failed on: > useradd: /etc/passwd.30380: Cannot allocate memory > useradd: cannot lock /etc/passwd; try again later. > > My patch[2] only removed include statements that were not needed. > I'm not sure how it can cause a memory issue. 
> So it's either we have some regression, or the slaves do not have enough > memory. > It was running on builder201.aws.gluster.org . > > Checking it, I see other jobs[3] failing on the same issue. Perhaps it is > the slave? > > Any ideas? > TIA, > Y. > > [1] https://build.gluster.org/job/centos7-regression/6382/consoleFull > [2] https://review.gluster.org/#/c/glusterfs/+/22844/ > [3] https://build.gluster.org/job/centos7-regression/6385/console > > BTW, we REALLY need to fix this message - it's clueless: > E [MSGID: 106176] [glusterd-handshake.c:1038:__server_getspec] > 0-management: Failed to mount the volume > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cynthia.zhou at nokia-sbell.com Mon Jun 10 07:42:35 2019 From: cynthia.zhou at nokia-sbell.com (Zhou, Cynthia (NSB - CN/Hangzhou)) Date: Mon, 10 Jun 2019 07:42:35 +0000 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> <5d0c2ed30e884b86ba29bff5a47c960e@nokia-sbell.com> <6d3f68f73e6d440dab19028526745171@nokia-sbell.com> <0d7934cac01f4a43b4581a2f74928dbc@nokia-sbell.com> <9ea2678487544232bfe66e0e7c6d3091@nokia-sbell.com> Message-ID: <217c6a2dbe704777bd8c3662683e75ad@nokia-sbell.com> Hi, How about this patch? I see there is a failed test, is that related to my change? cynthia From: Raghavendra Gowdappa Sent: Thursday, May 09, 2019 12:13 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) Cc: Amar Tumballi Suryanarayan ; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Thanks!! On Thu, May 9, 2019 at 8:34 AM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi, Ok, It is posted to https://review.gluster.org/#/c/glusterfs/+/22687/ From: Raghavendra Gowdappa > Sent: Wednesday, May 08, 2019 7:35 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Amar Tumballi Suryanarayan >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl On Wed, May 8, 2019 at 1:29 PM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Hi 'Milind Changire', The leak is getting more and more clear to me now. The unsolved memory leak is because, in glusterfs version 3.12.15 (in my env), the SSL context is a shared one; when we do ssl_accept, SSL allocates read/write buffers for the SSL object, but on ssl_free in socket_reset or the fini function of socket.c those buffers are returned to the SSL context's free list instead of being completely freed. Thanks Cynthia for your efforts in identifying and fixing the leak. If you post a patch to gerrit, I'll be happy to merge it and get the fix into the codebase.
So following patch is able to fix the memory leak issue completely.(created for gluster master branch) --- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -446,6 +446,7 @@ ssl_setup_connection_postfix(rpc_transport_t *this) gf_log(this->name, GF_LOG_DEBUG, "SSL verification succeeded (client: %s) (server: %s)", this->peerinfo.identifier, this->myinfo.identifier); + X509_free(peer); return gf_strdup(peer_CN); /* Error paths. */ @@ -1157,7 +1158,21 @@ __socket_reset(rpc_transport_t *this) memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + if(priv->ssl_ctx) + { + SSL_CTX_free(priv->ssl_ctx); + priv->ssl_ctx = NULL; + } + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4675,6 +4690,21 @@ fini(rpc_transport_t *this) pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_TRACE, + "clear and reset for socket(%d), free ssl ", + priv->sock); + if(priv->ssl_ctx) + { + SSL_CTX_free(priv->ssl_ctx); + priv->ssl_ctx = NULL; + } + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Monday, May 06, 2019 2:12 PM To: 'Amar Tumballi Suryanarayan' > Cc: 'Milind Changire' >; 'gluster-devel at gluster.org' > Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Hi, From our test valgrind and libleak all blame ssl3_accept ///////////////////////////from valgrind attached to glusterfds/////////////////////////////////////////// ==16673== 198,720 bytes in 12 blocks are definitely lost in loss record 1,114 of 1,123 ==16673== at 0x4C2EB7B: malloc (vg_replace_malloc.c:299) ==16673== by 0x63E1977: CRYPTO_malloc (in /usr/lib64/libcrypto.so.1.0.2p) ==16673== by 0xA855E0C: ssl3_setup_write_buffer (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA855E77: ssl3_setup_buffers (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA8485D9: ssl3_accept (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA610DDF: ssl_complete_connection (socket.c:400) ==16673== by 0xA617F38: ssl_handle_server_connection_attempt (socket.c:2409) ==16673== by 0xA618420: socket_complete_connection (socket.c:2554) ==16673== by 0xA618788: socket_event_handler (socket.c:2613) ==16673== by 0x4ED6983: event_dispatch_epoll_handler (event-epoll.c:587) ==16673== by 0x4ED6C5A: event_dispatch_epoll_worker (event-epoll.c:663) ==16673== by 0x615C5D9: start_thread (in /usr/lib64/libpthread-2.27.so) ==16673== ==16673== 200,544 bytes in 12 blocks are definitely lost in loss record 1,115 of 1,123 ==16673== at 0x4C2EB7B: malloc (vg_replace_malloc.c:299) ==16673== by 0x63E1977: CRYPTO_malloc (in /usr/lib64/libcrypto.so.1.0.2p) ==16673== by 0xA855D12: ssl3_setup_read_buffer (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA855E68: ssl3_setup_buffers (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA8485D9: ssl3_accept (in /usr/lib64/libssl.so.1.0.2p) ==16673== by 0xA610DDF: ssl_complete_connection (socket.c:400) ==16673== by 0xA617F38: ssl_handle_server_connection_attempt (socket.c:2409) ==16673== by 0xA618420: socket_complete_connection (socket.c:2554) ==16673== by 0xA618788: 
socket_event_handler (socket.c:2613) ==16673== by 0x4ED6983: event_dispatch_epoll_handler (event-epoll.c:587) ==16673== by 0x4ED6C5A: event_dispatch_epoll_worker (event-epoll.c:663) ==16673== by 0x615C5D9: start_thread (in /usr/lib64/libpthread-2.27.so) ==16673== valgrind --leak-check=f ////////////////////////////////////with libleak attached to glusterfsd///////////////////////////////////////// callstack[2419] expires. count=1 size=224/224 alloc=362 free=350 /home/robot/libleak/libleak.so(malloc+0x25) [0x7f1460604065] /lib64/libcrypto.so.10(CRYPTO_malloc+0x58) [0x7f145ecd9978] /lib64/libcrypto.so.10(EVP_DigestInit_ex+0x2a9) [0x7f145ed95749] /lib64/libssl.so.10(ssl3_digest_cached_records+0x11d) [0x7f145abb6ced] /lib64/libssl.so.10(ssl3_accept+0xc8f) [0x7f145abadc4f] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(ssl_complete_connection+0x5e) [0x7f145ae00f3a] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc16d) [0x7f145ae0816d] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc68a) [0x7f145ae0868a] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc9f2) [0x7f145ae089f2] /lib64/libglusterfs.so.0(+0x9b96f) [0x7f146038596f] /lib64/libglusterfs.so.0(+0x9bc46) [0x7f1460385c46] /lib64/libpthread.so.0(+0x75da) [0x7f145f0d15da] /lib64/libc.so.6(clone+0x3f) [0x7f145e9a7eaf] callstack[2432] expires. count=1 size=104/104 alloc=362 free=0 /home/robot/libleak/libleak.so(malloc+0x25) [0x7f1460604065] /lib64/libcrypto.so.10(CRYPTO_malloc+0x58) [0x7f145ecd9978] /lib64/libcrypto.so.10(BN_MONT_CTX_new+0x17) [0x7f145ed48627] /lib64/libcrypto.so.10(BN_MONT_CTX_set_locked+0x6d) [0x7f145ed489fd] /lib64/libcrypto.so.10(+0xff4d9) [0x7f145ed6a4d9] /lib64/libcrypto.so.10(int_rsa_verify+0x1cd) [0x7f145ed6d41d] /lib64/libcrypto.so.10(RSA_verify+0x32) [0x7f145ed6d972] /lib64/libcrypto.so.10(+0x107ff5) [0x7f145ed72ff5] /lib64/libcrypto.so.10(EVP_VerifyFinal+0x211) [0x7f145ed9dd51] /lib64/libssl.so.10(ssl3_get_cert_verify+0x5bb) [0x7f145abac06b] /lib64/libssl.so.10(ssl3_accept+0x988) [0x7f145abad948] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(ssl_complete_connection+0x5e) [0x7f145ae00f3a] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc16d) [0x7f145ae0816d] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc68a) [0x7f145ae0868a] /usr/lib64/glusterfs/3.12.15/rpc-transport/socket.so(+0xc9f2) [0x7f145ae089f2] /lib64/libglusterfs.so.0(+0x9b96f) [0x7f146038596f] /lib64/libglusterfs.so.0(+0x9bc46) [0x7f1460385c46] /lib64/libpthread.so.0(+0x75da) [0x7f145f0d15da] /lib64/libc.so.6(clone+0x3f) [0x7f145e9a7eaf] one interesting thing is that the memory goes up to about 300m then it stopped increasing !!! I am wondering if this is caused by open-ssl library? But when I search from openssl community, there is no such issue reported before. Is glusterfs using ssl_accept correctly? cynthia From: Zhou, Cynthia (NSB - CN/Hangzhou) Sent: Monday, May 06, 2019 10:34 AM To: 'Amar Tumballi Suryanarayan' > Cc: Milind Changire >; gluster-devel at gluster.org Subject: RE: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Hi, Sorry, I am so busy with other issues these days, could you help me to submit my patch for review? It is based on glusterfs3.12.15 code. But even with this patch , memory leak still exists, from memory leak tool it should be related with ssl_accept, not sure if it is because of openssl library or because improper use of ssl interfaces. 
--- a/rpc/rpc-transport/socket/src/socket.c +++ b/rpc/rpc-transport/socket/src/socket.c @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { memset(&priv->incoming, 0, sizeof(priv->incoming)); event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); - + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } priv->sock = -1; priv->idx = -1; priv->connected = -1; @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { pthread_mutex_destroy(&priv->out_lock); pthread_mutex_destroy(&priv->cond_lock); pthread_cond_destroy(&priv->cond); + if(priv->use_ssl&& priv->ssl_ssl) + { + gf_log(this->name, GF_LOG_INFO, + "clear and reset for socket(%d), free ssl ", + priv->sock); + SSL_shutdown(priv->ssl_ssl); + SSL_clear(priv->ssl_ssl); + SSL_free(priv->ssl_ssl); + priv->ssl_ssl = NULL; + } if (priv->ssl_private_key) { GF_FREE(priv->ssl_private_key); } From: Amar Tumballi Suryanarayan > Sent: Wednesday, May 01, 2019 8:43 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Milind Changire >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl Hi Cynthia Zhou, Can you post the patch which fixes the issue of missing free? We will continue to investigate the leak further, but would really appreciate getting the patch which is already worked on land into upstream master. -Amar On Mon, Apr 22, 2019 at 1:38 PM Zhou, Cynthia (NSB - CN/Hangzhou) > wrote: Ok, I am clear now. I?ve added ssl_free in socket reset and socket finish function, though glusterfsd memory leak is not that much, still it is leaking, from source code I can not find anything else, Could you help to check if this issue exists in your env? If not I may have a try to merge your patch . Step 1> while true;do gluster v heal info, 2> check the vol-name glusterfsd memory usage, it is obviously increasing. cynthia From: Milind Changire > Sent: Monday, April 22, 2019 2:36 PM To: Zhou, Cynthia (NSB - CN/Hangzhou) > Cc: Atin Mukherjee >; gluster-devel at gluster.org Subject: Re: [Gluster-devel] glusterfsd memory leak issue found after enable ssl According to BIO_new_socket() man page ... If the close flag is set then the socket is shut down and closed when the BIO is freed. For Gluster to have more control over the socket shutdown, the BIO_NOCLOSE flag is set. Otherwise, SSL takes control of socket shutdown whenever BIO is freed. _______________________________________________ Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -- Amar Tumballi (amarts) _______________________________________________ Community Meeting Calendar: APAC Schedule - Every 2nd and 4th Tuesday at 11:30 AM IST Bridge: https://bluejeans.com/836554017 NA/EMEA Schedule - Every 1st and 3rd Tuesday at 01:00 PM EDT Bridge: https://bluejeans.com/486278655 Gluster-devel mailing list Gluster-devel at gluster.org https://lists.gluster.org/mailman/listinfo/gluster-devel -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ykaul at redhat.com Mon Jun 10 08:31:18 2019 From: ykaul at redhat.com (Yaniv Kaul) Date: Mon, 10 Jun 2019 11:31:18 +0300 Subject: [Gluster-devel] glusterfsd memory leak issue found after enable ssl In-Reply-To: <217c6a2dbe704777bd8c3662683e75ad@nokia-sbell.com> References: <07cb1c3aa08b414dbe37442955ddad36@nokia-sbell.com> <6ce04fb69243465295a71b6953eafa19@nokia-sbell.com> <3cd91d1ce39541e7ad30c60ef15000aa@nokia-sbell.com> <5d0c2ed30e884b86ba29bff5a47c960e@nokia-sbell.com> <6d3f68f73e6d440dab19028526745171@nokia-sbell.com> <0d7934cac01f4a43b4581a2f74928dbc@nokia-sbell.com> <9ea2678487544232bfe66e0e7c6d3091@nokia-sbell.com> <217c6a2dbe704777bd8c3662683e75ad@nokia-sbell.com> Message-ID: On Mon, Jun 10, 2019 at 10:43 AM Zhou, Cynthia (NSB - CN/Hangzhou) < cynthia.zhou at nokia-sbell.com> wrote: > Hi, > > How about this patch? I see there is a failed test, is that related to my > change? > Quite likely. Have you looked at the failure? It produces a stack which looks close to where your patch is: 01:02:58.118 Thread 1 (Thread 0x7efe40930700 (LWP 17150)): 01:02:58.118 #0 0x00007efe4dfd359c in free () from /lib64/libc.so.6 01:02:58.118 No symbol table info available. 01:02:58.118 #1 0x00007efe4e38970d in CRYPTO_free () from /lib64/libcrypto.so.10 01:02:58.118 No symbol table info available. 01:02:58.118 #2 0x00007efe4e4400e7 in sk_free () from /lib64/libcrypto.so.10 01:02:58.118 No symbol table info available. 01:02:58.118 #3 0x00007efe4e4863de in x509_verify_param_zero () from /lib64/libcrypto.so.10 01:02:58.118 No symbol table info available. 01:02:58.118 #4 0x00007efe4e48644e in X509_VERIFY_PARAM_free () from /lib64/libcrypto.so.10 01:02:58.118 No symbol table info available. 01:02:58.118 #5 0x00007efe42a107d9 in SSL_CTX_free () from /lib64/libssl.so.10 01:02:58.120 No symbol table info available. 01:02:58.120 #6 0x00007efe42a12cc0 in SSL_free () from /lib64/libssl.so.10 01:02:58.122 No symbol table info available. 01:02:58.122 #7 0x00007efe42c463eb in __socket_reset (this=0x7efe34001240) at /home/jenkins/root/workspace/centos7-regression/rpc/rpc-transport/socket/src/socket.c:1170 01:02:58.123 priv = 0x7efe340017a0 01:02:58.123 __FUNCTION__ = "__socket_reset" 01:02:58.123 #8 0x00007efe42c46e43 in socket_event_poll_err (this=0x7efe34001240, gen=4, idx=2) at /home/jenkins/root/workspace/centos7-regression/rpc/rpc-transport/socket/src/socket.c:1383 01:02:58.123 priv = 0x7efe340017a0 01:02:58.123 socket_closed = false 01:02:58.123 __FUNCTION__ = "socket_event_poll_err" 01:02:58.123 #9 0x00007efe42c4d056 in socket_event_handler (fd=6, idx=2, gen=4, data=0x7efe34001240, poll_in=1, poll_out=0, poll_err=16, event_thread_died=0 '\000') at /home/jenkins/root/workspace/centos7-regression/rpc/rpc-transport/socket/src/socket.c:3037 > > cynthia > > > > *From:* Raghavendra Gowdappa > *Sent:* Thursday, May 09, 2019 12:13 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Amar Tumballi Suryanarayan ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > Thanks!! 
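One plausible reading of that stack: in OpenSSL, SSL_new() takes its own reference on the SSL_CTX and SSL_free() drops it again, which is exactly why SSL_CTX_free shows up underneath SSL_free above. If the transport additionally calls SSL_CTX_free() per connection on a context that is shared across connections, the context can end up being torn down while other connections still use it. The sketch below only illustrates that OpenSSL ownership split; the names are made up and this is not the actual socket.c fix:

    /* Hypothetical sketch: per-connection SSL object vs. shared SSL_CTX. */
    #include <stddef.h>
    #include <openssl/ssl.h>

    typedef struct conn {
        SSL     *ssl;   /* per-connection object, owned by this connection    */
        SSL_CTX *ctx;   /* borrowed pointer to a context shared by many conns */
    } conn_t;

    static void conn_teardown(conn_t *c)
    {
        if (c->ssl != NULL) {
            SSL_shutdown(c->ssl);
            SSL_free(c->ssl);   /* releases the per-connection state and drops the
                                 * reference SSL_new() took on c->ctx; no explicit
                                 * SSL_CTX_free() here for a shared context        */
            c->ssl = NULL;
        }
        c->ctx = NULL;          /* the shared SSL_CTX is freed exactly once, by
                                 * whoever created it (e.g. at transport fini)     */
    }

With that split there is only ever one owner calling SSL_CTX_free(), which avoids the kind of premature context teardown the backtrace above suggests.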
> > > > On Thu, May 9, 2019 at 8:34 AM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi, > > Ok, It is posted to https://review.gluster.org/#/c/glusterfs/+/22687/ > > > > > > > > *From:* Raghavendra Gowdappa > *Sent:* Wednesday, May 08, 2019 7:35 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Amar Tumballi Suryanarayan ; > gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > > > > > On Wed, May 8, 2019 at 1:29 PM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Hi 'Milind Changire' , > > The leak is getting more and more clear to me now. the unsolved memory > leak is because of in gluterfs version 3.12.15 (in my env)the ssl context > is a shared one, while we do ssl_acept, ssl will allocate some read/write > buffer to ssl object, however, ssl_free in socket_reset or fini function of > socket.c, the buffer is returened back to ssl context free list instead of > completely freed. > > > > Thanks Cynthia for your efforts in identifying and fixing the leak. If you > post a patch to gerrit, I'll be happy to merge it and get the fix into the > codebase. > > > > > > So following patch is able to fix the memory leak issue > completely.(created for gluster master branch) > > > > --- a/rpc/rpc-transport/socket/src/socket.c > +++ b/rpc/rpc-transport/socket/src/socket.c > @@ -446,6 +446,7 @@ ssl_setup_connection_postfix(rpc_transport_t *this) > gf_log(this->name, GF_LOG_DEBUG, > "SSL verification succeeded (client: %s) (server: %s)", > this->peerinfo.identifier, this->myinfo.identifier); > + X509_free(peer); > return gf_strdup(peer_CN); > > /* Error paths. */ > @@ -1157,7 +1158,21 @@ __socket_reset(rpc_transport_t *this) > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > - > + if(priv->use_ssl&& priv->ssl_ssl) > + { > + gf_log(this->name, GF_LOG_TRACE, > + "clear and reset for socket(%d), free ssl ", > + priv->sock); > + if(priv->ssl_ctx) > + { > + SSL_CTX_free(priv->ssl_ctx); > + priv->ssl_ctx = NULL; > + } > + SSL_shutdown(priv->ssl_ssl); > + SSL_clear(priv->ssl_ssl); > + SSL_free(priv->ssl_ssl); > + priv->ssl_ssl = NULL; > + } > priv->sock = -1; > priv->idx = -1; > priv->connected = -1; > @@ -4675,6 +4690,21 @@ fini(rpc_transport_t *this) > pthread_mutex_destroy(&priv->out_lock); > pthread_mutex_destroy(&priv->cond_lock); > pthread_cond_destroy(&priv->cond); > + if(priv->use_ssl&& priv->ssl_ssl) > + { > + gf_log(this->name, GF_LOG_TRACE, > + "clear and reset for socket(%d), free ssl > ", > + priv->sock); > + if(priv->ssl_ctx) > + { > + SSL_CTX_free(priv->ssl_ctx); > + priv->ssl_ctx = NULL; > + } > + SSL_shutdown(priv->ssl_ssl); > + SSL_clear(priv->ssl_ssl); > + SSL_free(priv->ssl_ssl); > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Monday, May 06, 2019 2:12 PM > *To:* 'Amar Tumballi Suryanarayan' > *Cc:* 'Milind Changire' ; 'gluster-devel at gluster.org' > > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > Hi, > > From our test valgrind and libleak all blame ssl3_accept > > ///////////////////////////from valgrind attached to > glusterfds/////////////////////////////////////////// > > ==16673== 198,720 bytes in 12 blocks are definitely lost in loss record > 1,114 of 1,123 > ==16673== at 0x4C2EB7B: malloc (vg_replace_malloc.c:299) > ==16673== by 0x63E1977: CRYPTO_malloc (in /usr/lib64/ > *libcrypto.so.1.0.2p*) > ==16673== by 
0xA855E0C: ssl3_setup_write_buffer (in /usr/lib64/ > *libssl.so.1.0.2p*) > ==16673== by 0xA855E77: ssl3_setup_buffers (in /usr/lib64/ > *libssl.so.1.0.2p*) > ==16673== by 0xA8485D9: ssl3_accept (in /usr/lib64/*libssl.so.1.0.2p*) > ==16673== by 0xA610DDF: ssl_complete_connection (socket.c:400) > ==16673== by 0xA617F38: ssl_handle_server_connection_attempt > (socket.c:2409) > ==16673== by 0xA618420: socket_complete_connection (socket.c:2554) > ==16673== by 0xA618788: socket_event_handler (socket.c:2613) > ==16673== by 0x4ED6983: event_dispatch_epoll_handler (event-epoll.c:587) > ==16673== by 0x4ED6C5A: event_dispatch_epoll_worker (event-epoll.c:663) > ==16673== by 0x615C5D9: start_thread (in /usr/lib64/*libpthread-2.27.so > *) > ==16673== > ==16673== 200,544 bytes in 12 blocks are definitely lost in loss record > 1,115 of 1,123 > ==16673== at 0x4C2EB7B: malloc (vg_replace_malloc.c:299) > ==16673== by 0x63E1977: CRYPTO_malloc (in /usr/lib64/ > *libcrypto.so.1.0.2p*) > ==16673== by 0xA855D12: ssl3_setup_read_buffer (in /usr/lib64/ > *libssl.so.1.0.2p*) > ==16673== by 0xA855E68: ssl3_setup_buffers (in /usr/lib64/ > *libssl.so.1.0.2p*) > ==16673== by 0xA8485D9: ssl3_accept (in /usr/lib64/*libssl.so.1.0.2p*) > ==16673== by 0xA610DDF: ssl_complete_connection (socket.c:400) > ==16673== by 0xA617F38: ssl_handle_server_connection_attempt > (socket.c:2409) > ==16673== by 0xA618420: socket_complete_connection (socket.c:2554) > ==16673== by 0xA618788: socket_event_handler (socket.c:2613) > ==16673== by 0x4ED6983: event_dispatch_epoll_handler (event-epoll.c:587) > ==16673== by 0x4ED6C5A: event_dispatch_epoll_worker (event-epoll.c:663) > ==16673== by 0x615C5D9: start_thread (in /usr/lib64/*libpthread-2.27.so > *) > ==16673== > valgrind --leak-check=f > > > > > > ////////////////////////////////////with libleak attached to > glusterfsd///////////////////////////////////////// > > callstack[2419] expires. count=1 size=224/224 alloc=362 free=350 > /home/robot/libleak/*libleak.so(malloc+0x25*) [0x7f1460604065] > /lib64/*libcrypto.so.10(CRYPTO_malloc+0x58*) [0x7f145ecd9978] > /lib64/*libcrypto.so.10(EVP_DigestInit_ex+0x2a9*) [0x7f145ed95749] > /lib64/*libssl.so.10(ssl3_digest_cached_records+0x11d*) > [0x7f145abb6ced] > /lib64/*libssl.so.10(**ssl3_accept**+0xc8f*) [0x7f145abadc4f] > /usr/lib64/glusterfs/3.12.15/rpc-transport/ > *socket.so(ssl_complete_connection+0x5e*) [0x7f145ae00f3a] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc16d*) > [0x7f145ae0816d] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc68a*) > [0x7f145ae0868a] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc9f2*) > [0x7f145ae089f2] > /lib64/*libglusterfs.so.0(+0x9b96f*) [0x7f146038596f] > /lib64/*libglusterfs.so.0(+0x9bc46*) [0x7f1460385c46] > /lib64/*libpthread.so.0(+0x75da*) [0x7f145f0d15da] > /lib64/*libc.so.6(clone+0x3f*) [0x7f145e9a7eaf] > > callstack[2432] expires. 
count=1 size=104/104 alloc=362 free=0 > /home/robot/libleak/*libleak.so(malloc+0x25*) [0x7f1460604065] > /lib64/*libcrypto.so.10(CRYPTO_malloc+0x58*) [0x7f145ecd9978] > /lib64/*libcrypto.so.10(BN_MONT_CTX_new+0x17*) [0x7f145ed48627] > /lib64/*libcrypto.so.10(BN_MONT_CTX_set_locked+0x6d*) [0x7f145ed489fd] > /lib64/*libcrypto.so.10(+0xff4d9*) [0x7f145ed6a4d9] > /lib64/*libcrypto.so.10(int_rsa_verify+0x1cd*) [0x7f145ed6d41d] > /lib64/*libcrypto.so.10(RSA_verify+0x32*) [0x7f145ed6d972] > /lib64/*libcrypto.so.10(+0x107ff5*) [0x7f145ed72ff5] > /lib64/*libcrypto.so.10(EVP_VerifyFinal+0x211*) [0x7f145ed9dd51] > /lib64/*libssl.so.10(ssl3_get_cert_verify+0x5bb*) [0x7f145abac06b] > /lib64/*libssl.so.10(**ssl3_accept**+0x988*) [0x7f145abad948] > /usr/lib64/glusterfs/3.12.15/rpc-transport/ > *socket.so(ssl_complete_connection+0x5e*) [0x7f145ae00f3a] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc16d*) > [0x7f145ae0816d] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc68a*) > [0x7f145ae0868a] > /usr/lib64/glusterfs/3.12.15/rpc-transport/*socket.so(+0xc9f2*) > [0x7f145ae089f2] > /lib64/*libglusterfs.so.0(+0x9b96f*) [0x7f146038596f] > /lib64/*libglusterfs.so.0(+0x9bc46*) [0x7f1460385c46] > /lib64/*libpthread.so.0(+0x75da*) [0x7f145f0d15da] > /lib64/*libc.so.6(clone+0x3f*) [0x7f145e9a7eaf] > > > > one interesting thing is that the memory goes up to about 300m then it > stopped increasing !!! > > I am wondering if this is caused by open-ssl library? But when I search > from openssl community, there is no such issue reported before. > > Is glusterfs using ssl_accept correctly? > > > > cynthia > > *From:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Sent:* Monday, May 06, 2019 10:34 AM > *To:* 'Amar Tumballi Suryanarayan' > *Cc:* Milind Changire ; gluster-devel at gluster.org > *Subject:* RE: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > Hi, > > Sorry, I am so busy with other issues these days, could you help me to > submit my patch for review? It is based on glusterfs3.12.15 code. But even > with this patch , memory leak still exists, from memory leak tool it should > be related with ssl_accept, not sure if it is because of openssl library or > because improper use of ssl interfaces. 
> > --- a/rpc/rpc-transport/socket/src/socket.c > > +++ b/rpc/rpc-transport/socket/src/socket.c > > @@ -1019,7 +1019,16 @@ static void __socket_reset(rpc_transport_t *this) { > > memset(&priv->incoming, 0, sizeof(priv->incoming)); > > > > event_unregister_close(this->ctx->event_pool, priv->sock, priv->idx); > > - > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_INFO, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > priv->sock = -1; > > priv->idx = -1; > > priv->connected = -1; > > @@ -4238,6 +4250,16 @@ void fini(rpc_transport_t *this) { > > pthread_mutex_destroy(&priv->out_lock); > > pthread_mutex_destroy(&priv->cond_lock); > > pthread_cond_destroy(&priv->cond); > > + if(priv->use_ssl&& priv->ssl_ssl) > > + { > > + gf_log(this->name, GF_LOG_INFO, > > + "clear and reset for socket(%d), free ssl ", > > + priv->sock); > > + SSL_shutdown(priv->ssl_ssl); > > + SSL_clear(priv->ssl_ssl); > > + SSL_free(priv->ssl_ssl); > > + priv->ssl_ssl = NULL; > > + } > > if (priv->ssl_private_key) { > > GF_FREE(priv->ssl_private_key); > > } > > > > > > *From:* Amar Tumballi Suryanarayan > *Sent:* Wednesday, May 01, 2019 8:43 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Milind Changire ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > Hi Cynthia Zhou, > > > > Can you post the patch which fixes the issue of missing free? We will > continue to investigate the leak further, but would really appreciate > getting the patch which is already worked on land into upstream master. > > > > -Amar > > > > On Mon, Apr 22, 2019 at 1:38 PM Zhou, Cynthia (NSB - CN/Hangzhou) < > cynthia.zhou at nokia-sbell.com> wrote: > > Ok, I am clear now. > > I?ve added ssl_free in socket reset and socket finish function, though > glusterfsd memory leak is not that much, still it is leaking, from source > code I can not find anything else, > > Could you help to check if this issue exists in your env? If not I may > have a try to merge your patch . > > Step > > 1> while true;do gluster v heal info, > > 2> check the vol-name glusterfsd memory usage, it is obviously > increasing. > > cynthia > > > > *From:* Milind Changire > *Sent:* Monday, April 22, 2019 2:36 PM > *To:* Zhou, Cynthia (NSB - CN/Hangzhou) > *Cc:* Atin Mukherjee ; gluster-devel at gluster.org > *Subject:* Re: [Gluster-devel] glusterfsd memory leak issue found after > enable ssl > > > > According to BIO_new_socket() man page ... > > > > *If the close flag is set then the socket is shut down and closed when the > BIO is freed.* > > > > For Gluster to have more control over the socket shutdown, the BIO_NOCLOSE > flag is set. Otherwise, SSL takes control of socket shutdown whenever BIO > is freed. 
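The close-flag behaviour quoted above is easiest to see side by side; the snippet below is a plain-OpenSSL illustration (not Gluster code) of the two choices:

    /* Illustration of BIO_new_socket()'s close flag; error handling omitted. */
    #include <openssl/bio.h>
    #include <openssl/ssl.h>

    void attach_bio(SSL *ssl, int sock)
    {
        /* BIO_NOCLOSE: freeing the BIO (e.g. indirectly via SSL_free) leaves the
         * fd open, so the caller keeps ownership of the socket and must close it
         * itself - the behaviour the transport wants for controlled shutdown.    */
        BIO *bio = BIO_new_socket(sock, BIO_NOCLOSE);

        /* With BIO_CLOSE instead, BIO_free() would also shut down and close the
         * fd, i.e. the SSL/BIO layer would own the socket's lifetime.            */
        SSL_set_bio(ssl, bio, bio);   /* ssl takes ownership of bio */
    }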
> > > > _______________________________________________ > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > -- > > Amar Tumballi (amarts) > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amukherj at redhat.com Mon Jun 10 13:12:21 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Mon, 10 Jun 2019 18:42:21 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Fwd: Build failed in Jenkins: regression-test-with-multiplex #1359 In-Reply-To: References: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: On Fri, Jun 7, 2019 at 10:07 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > Got time to test subdir-mount.t failing in brick-mux scenario. > > I noticed some issues, where I need further help from glusterd team. > > subdir-mount.t expects 'hook' script to run after add-brick to make sure > the required subdirectories are healed and are present in new bricks. This > is important as subdir mount expects the subdirs to exist for successful > mount. > > But in case of brick-mux setup, I see that in some cases (6/10), hook > script (add-brick/post-hook/S13-create-subdir-mount.sh) started getting > executed after 20second of finishing the add-brick command. Due to this, > the mount which we execute after add-brick failed. > > My question is, what is making post hook script to run so late ?? > It's not only the add-brick in the post hook. Given post hook scripts are async in nature, I see the respective hook scripts of create/start/set volume operation have executed quite a late which is very surprising until and unless some thread has been stuck for quite a while. Unfortunately for both Mohit and I, the issue isn't reproducible locally. Mohit would give it a try in softserve infra but at this point of time, there's no conclusive evidence, the analysis continues. Amar - would it be possible for you to do a git blame given you can reproduce this? May 31 nightly ( https://build.gluster.org/job/regression-test-with-multiplex/1359/) is when this test started failing. > I can recreate the issues locally on my laptop too. > > > On Sat, Jun 1, 2019 at 4:55 PM Atin Mukherjee wrote: > >> subdir-mount.t has started failing in brick mux regression nightly. This >> needs to be fixed. >> >> Raghavendra - did we manage to get any further clue on uss.t failure? 
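To make the race concrete: the mount in the test is issued as soon as add-brick returns, while S13-create-subdir-mount.sh runs asynchronously, so any delay in the post hook means the subdir is not yet present on the new brick when the mount happens. A test-side sketch of waiting for the hook's effect before mounting is below; the brick path and subdir name are assumptions for illustration, not taken verbatim from subdir-mount.t:

    # add the brick; the post hook that heals the subdirs onto it runs async
    gluster --mode=script volume add-brick patchy $HOSTNAME:/d/backends/patchy3

    # poll (up to ~30s) for the subdir to appear on the new brick before mounting
    for i in $(seq 1 30); do
        test -d /d/backends/patchy3/subdir1 && break
        sleep 1
    done

    mount -t glusterfs $HOSTNAME:/patchy/subdir1 /mnt/glusterfs/1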
>> >> ---------- Forwarded message --------- >> From: >> Date: Fri, 31 May 2019 at 23:34 >> Subject: [Gluster-Maintainers] Build failed in Jenkins: >> regression-test-with-multiplex #1359 >> To: , , , >> , >> >> >> See < >> https://build.gluster.org/job/regression-test-with-multiplex/1359/display/redirect?page=changes >> > >> >> Changes: >> >> [atin] glusterd: add an op-version check >> >> [atin] glusterd/svc: glusterd_svcs_stop should call individual wrapper >> function >> >> [atin] glusterd/svc: Stop stale process using the glusterd_proc_stop >> >> [Amar Tumballi] lcov: more coverage to shard, old-protocol, sdfs >> >> [Kotresh H R] tests/geo-rep: Add EC volume test case >> >> [Amar Tumballi] glusterfsd/cleanup: Protect graph object under a lock >> >> [Mohammed Rafi KC] glusterd/shd: Optimize the glustershd manager to send >> reconfigure >> >> [Kotresh H R] tests/geo-rep: Add tests to cover glusterd geo-rep >> >> [atin] glusterd: Optimize code to copy dictionary in handshake code path >> >> ------------------------------------------ >> [...truncated 3.18 MB...] >> ./tests/basic/afr/stale-file-lookup.t - 9 second >> ./tests/basic/afr/granular-esh/replace-brick.t - 9 second >> ./tests/basic/afr/granular-esh/add-brick.t - 9 second >> ./tests/basic/afr/gfid-mismatch.t - 9 second >> ./tests/performance/open-behind.t - 8 second >> ./tests/features/ssl-authz.t - 8 second >> ./tests/features/readdir-ahead.t - 8 second >> ./tests/bugs/upcall/bug-1458127.t - 8 second >> ./tests/bugs/transport/bug-873367.t - 8 second >> ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 8 second >> ./tests/bugs/replicate/bug-1132102.t - 8 second >> ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove-quota-quota-deem-statfs.t >> - 8 second >> ./tests/bugs/quota/bug-1104692.t - 8 second >> ./tests/bugs/posix/bug-1360679.t - 8 second >> ./tests/bugs/posix/bug-1122028.t - 8 second >> ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second >> ./tests/bugs/glusterfs/bug-861015-log.t - 8 second >> ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second >> ./tests/bugs/glusterd/bug-1696046.t - 8 second >> ./tests/bugs/fuse/bug-983477.t - 8 second >> ./tests/bugs/ec/bug-1227869.t - 8 second >> ./tests/bugs/distribute/bug-1088231.t - 8 second >> ./tests/bugs/distribute/bug-1086228.t - 8 second >> ./tests/bugs/cli/bug-1087487.t - 8 second >> ./tests/bugs/cli/bug-1022905.t - 8 second >> ./tests/bugs/bug-1258069.t - 8 second >> ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub-info.t >> - 8 second >> ./tests/basic/xlator-pass-through-sanity.t - 8 second >> ./tests/basic/quota-nfs.t - 8 second >> ./tests/basic/glusterd/arbiter-volume.t - 8 second >> ./tests/basic/ctime/ctime-noatime.t - 8 second >> ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second >> ./tests/gfid2path/get-gfid-to-path.t - 7 second >> ./tests/bugs/upcall/bug-1369430.t - 7 second >> ./tests/bugs/snapshot/bug-1260848.t - 7 second >> ./tests/bugs/shard/shard-inode-refcount-test.t - 7 second >> ./tests/bugs/shard/bug-1258334.t - 7 second >> ./tests/bugs/replicate/bug-767585-gfid.t - 7 second >> ./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 second >> ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second >> ./tests/bugs/posix/bug-1175711.t - 7 second >> ./tests/bugs/nfs/bug-915280.t - 7 second >> ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second >> ./tests/bugs/md-cache/bug-1211863_unlink.t - 7 second >> ./tests/bugs/glusterfs/bug-848251.t - 7 second >> 
[... per-test timing listing trimmed; identical to the build report quoted earlier in this thread ...]
./tests/bugs/glusterfs-server/bug-861542.t - 3 second >> ./tests/bugs/glusterfs/bug-869724.t - 3 second >> ./tests/bugs/glusterfs/bug-860297.t - 3 second >> ./tests/bugs/glusterfs/bug-844688.t - 3 second >> ./tests/bugs/glusterd/bug-948729/bug-948729.t - 3 second >> ./tests/bugs/distribute/bug-1204140.t - 3 second >> ./tests/bugs/core/bug-924075.t - 3 second >> ./tests/bugs/core/bug-845213.t - 3 second >> ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second >> ./tests/bugs/core/bug-1119582.t - 3 second >> ./tests/bugs/core/bug-1117951.t - 3 second >> ./tests/bugs/cli/bug-983317-volume-get.t - 3 second >> ./tests/bugs/cli/bug-867252.t - 3 second >> ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second >> ./tests/basic/fops-sanity.t - 3 second >> ./tests/basic/fencing/test-fence-option.t - 3 second >> ./tests/basic/distribute/debug-xattrs.t - 3 second >> ./tests/basic/afr/ta-check-locks.t - 3 second >> ./tests/line-coverage/volfile-with-all-graph-syntax.t - 2 second >> ./tests/line-coverage/some-features-in-libglusterfs.t - 2 second >> ./tests/bugs/shard/bug-1261773.t - 2 second >> ./tests/bugs/replicate/bug-884328.t - 2 second >> ./tests/bugs/readdir-ahead/bug-1512437.t - 2 second >> ./tests/bugs/nfs/bug-970070.t - 2 second >> ./tests/bugs/nfs/bug-1302948.t - 2 second >> ./tests/bugs/logging/bug-823081.t - 2 second >> ./tests/bugs/glusterfs-server/bug-889996.t - 2 second >> ./tests/bugs/glusterfs/bug-892730.t - 2 second >> ./tests/bugs/glusterfs/bug-811493.t - 2 second >> ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second >> ./tests/bugs/distribute/bug-924265.t - 2 second >> ./tests/bugs/core/log-bug-1362520.t - 2 second >> ./tests/bugs/core/bug-903336.t - 2 second >> ./tests/bugs/core/bug-1111557.t - 2 second >> ./tests/bugs/cli/bug-969193.t - 2 second >> ./tests/bugs/cli/bug-949298.t - 2 second >> ./tests/bugs/cli/bug-921215.t - 2 second >> ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second >> ./tests/basic/peer-parsing.t - 2 second >> ./tests/basic/md-cache/bug-1418249.t - 2 second >> ./tests/basic/afr/arbiter-cli.t - 2 second >> ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second >> ./tests/bugs/glusterfs/bug-853690.t - 1 second >> ./tests/bugs/cli/bug-764638.t - 1 second >> ./tests/bugs/cli/bug-1047378.t - 1 second >> ./tests/basic/netgroup_parsing.t - 1 second >> ./tests/basic/gfapi/sink.t - 1 second >> ./tests/basic/exports_parsing.t - 1 second >> ./tests/basic/posixonly.t - 0 second >> ./tests/basic/glusterfsd-args.t - 0 second >> >> >> 2 test(s) failed >> ./tests/basic/uss.t >> ./tests/features/subdir-mount.t >> >> 0 test(s) generated core >> >> >> 5 test(s) needed retry >> ./tests/basic/afr/split-brain-favorite-child-policy.t >> ./tests/basic/ec/self-heal.t >> ./tests/basic/uss.t >> ./tests/basic/volfile-sanity.t >> ./tests/features/subdir-mount.t >> >> Result is 1 >> >> tar: Removing leading `/' from member names >> kernel.core_pattern = /%e-%p.core >> Build step 'Execute shell' marked build as failure >> _______________________________________________ >> maintainers mailing list >> maintainers at gluster.org >> https://lists.gluster.org/mailman/listinfo/maintainers >> >> >> -- >> - Atin (atinm) >> _______________________________________________ >> maintainers mailing list >> maintainers at gluster.org >> https://lists.gluster.org/mailman/listinfo/maintainers >> > > > -- > Amar Tumballi (amarts) > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From hgowtham at redhat.com Mon Jun 10 13:28:10 2019 From: hgowtham at redhat.com (hgowtham at redhat.com) Date: Mon, 10 Jun 2019 13:28:10 +0000 Subject: [Gluster-devel] Invitation: Gluster Community Meeting (APAC friendly hours) @ Tue Jun 11, 2019 11:30am - 12:30pm (IST) (gluster-devel@gluster.org) Message-ID: <000000000000c3110d058af82636@google.com> You have been invited to the following event. Title: Gluster Community Meeting (APAC friendly hours) Bridge: https://bluejeans.com/836554017 Meeting minutes: https://hackmd.io/A07qMrezSOyeUUGxPhBHqQ?both Previous Meeting notes: http://github.com/gluster/community When: Tue Jun 11, 2019 11:30am ? 12:30pm India Standard Time - Kolkata Where: https://bluejeans.com/836554017 Calendar: gluster-devel at gluster.org Who: * hgowtham at redhat.com - organizer * gluster-users at gluster.org * gluster-devel at gluster.org Event details: https://www.google.com/calendar/event?action=VIEW&eid=MmFrY3BnZ3I4MG5kdmcxbmJnYzlkcDBycmwgZ2x1c3Rlci1kZXZlbEBnbHVzdGVyLm9yZw&tok=MTkjaGdvd3RoYW1AcmVkaGF0LmNvbWI3NDk5MGVkZDkyOWZjNzhjNTVmNmU0YWY2NWQzNjk4NzYzODdiM2Q&ctz=Asia%2FKolkata&hl=en&es=0 Invitation from Google Calendar: https://www.google.com/calendar/ You are receiving this courtesy email at the account gluster-devel at gluster.org because you are an attendee of this event. To stop receiving future updates for this event, decline this event. Alternatively you can sign up for a Google account at https://www.google.com/calendar/ and control your notification settings for your entire calendar. Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP. Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 1721 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 1759 bytes Desc: not available URL: From hgowtham at redhat.com Mon Jun 10 14:07:24 2019 From: hgowtham at redhat.com (Hari Gowtham) Date: Mon, 10 Jun 2019 19:37:24 +0530 Subject: [Gluster-devel] [Gluster-users] No healing on peer disconnect - is it correct? In-Reply-To: <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com> References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com> Message-ID: On Mon, Jun 10, 2019 at 7:21 PM snowmailer wrote: > > Can someone advice on this, please? > > BR! > > D?a 3. 6. 2019 o 18:58 u??vate? Martin nap?sal: > > > Hi all, > > > > I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have simple Replica 3 - Number of Bricks: 1 x 3 = 3. > > > > When one of my hypervisor is disconnected as peer, i.e. gluster process is down but bricks running, other two healthy nodes start signalling that they lost one peer. This is correct. > > Next, I restart gluster process on node where gluster process failed and I thought It should trigger healing of files on failed node but nothing is happening. > > > > I run VMs disks on this gluster volume. No healing is triggered after gluster restart, remaining two nodes get peer back after restart of gluster and everything is running without down time. 
> > Even VMs that are running on 'failed' node where gluster process was down (bricks were up) are running without down time. I assume your VMs use gluster as the storage. In that case, the gluster volume might be mounted on all the hypervisors. The mount/client is smart enough to give the correct data from the other two machines which were always up. This is the reason things are working fine. Gluster should heal the brick. Adding people who can help you better with the heal part. @Karthik Subrahmanya @Ravishankar N do take a look and answer this part. > > > > Is this behaviour correct? I mean No healing is triggered after peer is reconnected back and VMs. > > > > Thanks for explanation. > > > > BR! > > Martin > > > > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users -- Regards, Hari Gowtham. From atumball at redhat.com Mon Jun 10 17:15:27 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Mon, 10 Jun 2019 22:45:27 +0530 Subject: [Gluster-devel] Regression failure continues: 'tests/basic/afr/split-brain-favorite-child-policy.t` Message-ID: Fails with: *20:56:58* ok 132 [ 8/ 82] < 194> 'gluster --mode=script --wignore volume heal patchy'*20:56:58* not ok 133 [ 8/ 80260] < 195> '^0$ get_pending_heal_count patchy' -> 'Got "2" instead of "^0$"'*20:56:58* ok 134 [ 18/ 2] < 197> '0 echo 0' Looks like when the error occurred, it took 80 seconds. I see 2 different patches fail on this, would be good to analyze it further. Regards, Amar -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Mon Jun 10 17:17:57 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Mon, 10 Jun 2019 22:47:57 +0530 Subject: [Gluster-devel] Regression failure continues: 'tests/basic/afr/split-brain-favorite-child-policy.t` In-Reply-To: References: Message-ID: Hi Amar, I found the issue, will be sending a patch in a while. Regards, Karthik On Mon, Jun 10, 2019 at 10:46 PM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > Fails with: > > *20:56:58* ok 132 [ 8/ 82] < 194> 'gluster --mode=script --wignore volume heal patchy'*20:56:58* not ok 133 [ 8/ 80260] < 195> '^0$ get_pending_heal_count patchy' -> 'Got "2" instead of "^0$"'*20:56:58* ok 134 [ 18/ 2] < 197> '0 echo 0' > > > Looks like when the error occurred, it took 80 seconds. > > > I see 2 different patches fail on this, would be good to analyze it further. > > > Regards, > > Amar > > > -- > Amar Tumballi (amarts) > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ksubrahm at redhat.com Mon Jun 10 18:16:11 2019 From: ksubrahm at redhat.com (Karthik Subrahmanya) Date: Mon, 10 Jun 2019 23:46:11 +0530 Subject: [Gluster-devel] Regression failure continues: 'tests/basic/afr/split-brain-favorite-child-policy.t` In-Reply-To: References: Message-ID: Patch posted: https://review.gluster.org/#/c/glusterfs/+/22850/ -Karthik On Mon, Jun 10, 2019 at 10:47 PM Karthik Subrahmanya wrote: > Hi Amar, > > I found the issue, will be sending a patch in a while.
> > Regards, > Karthik > > On Mon, Jun 10, 2019 at 10:46 PM Amar Tumballi Suryanarayan < > atumball at redhat.com> wrote: > >> Fails with: >> >> *20:56:58* ok 132 [ 8/ 82] < 194> 'gluster --mode=script --wignore volume heal patchy'*20:56:58* not ok 133 [ 8/ 80260] < 195> '^0$ get_pending_heal_count patchy' -> 'Got "2" instead of "^0$"'*20:56:58* ok 134 [ 18/ 2] < 197> '0 echo 0' >> >> >> Looks like when the error occurred, it took 80seconds. >> >> >> I see 2 different patches fail on this, would be good to analyze it further. >> >> >> Regards, >> >> Amar >> >> >> -- >> Amar Tumballi (amarts) >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ravishankar at redhat.com Tue Jun 11 04:50:10 2019 From: ravishankar at redhat.com (Ravishankar N) Date: Tue, 11 Jun 2019 10:20:10 +0530 Subject: [Gluster-devel] [Gluster-users] No healing on peer disconnect - is it correct? In-Reply-To: References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com> Message-ID: <28417fb7-5081-cc8e-7ffc-625f9905f9c2@redhat.com> There will be pending heals only when the brick process goes down or there is a disconnect between the client and that brick. When you say " gluster process is down but bricks running", I'm guessing you killed only glusterd and not the glusterfsd brick process. That won't cause any pending heals. If there is something to be healed, `gluster volume heal $volname info` will display the list of files. Hope that helps, Ravi On 10/06/19 7:53 PM, Martin wrote: > My VMs using Gluster as storage through libgfapi support in Qemu. But > I dont see any healing of reconnected brick. > > Thanks Karthik /?Ravishankar in advance! > >> On 10 Jun 2019, at 16:07, Hari Gowtham > > wrote: >> >> On Mon, Jun 10, 2019 at 7:21 PM snowmailer > > wrote: >>> >>> Can someone advice on this, please? >>> >>> BR! >>> >>> D?a 3. 6. 2019 o 18:58 u??vate? Martin >> > nap?sal: >>> >>>> Hi all, >>>> >>>> I need someone to explain if my gluster behaviour is correct. I am >>>> not sure if my gluster works as it should. I have simple Replica 3 >>>> - Number of Bricks: 1 x 3 = 3. >>>> >>>> When one of my hypervisor is disconnected as peer, i.e. gluster >>>> process is down but bricks running, other two healthy nodes start >>>> signalling that they lost one peer. This is correct. >>>> Next, I restart gluster process on node where gluster process >>>> failed and I thought It should trigger healing of files on failed >>>> node but nothing is happening. >>>> >>>> I run VMs disks on this gluster volume. No healing is triggered >>>> after gluster restart, remaining two nodes get peer back after >>>> restart of gluster and everything is running without down time. >>>> Even VMs that are running on ?failed? node where gluster process >>>> was down (bricks were up) are running without down time. >> >> I assume your VMs use gluster as the storage. In that case, the >> gluster volume might be mounted on all the hypervisors. >> The mount/ client is smart enough to give the correct data from the >> other two machines which were always up. >> This is the reason things are working fine. >> >> Gluster should heal the brick. >> Adding people how can help you better with the heal part. >> @Karthik Subrahmanya ?@Ravishankar N do take a look and answer this part. >> >>>> >>>> Is this behaviour correct? I mean No healing is triggered after >>>> peer is reconnected back and VMs. >>>> >>>> Thanks for explanation. >>>> >>>> BR! 
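Following up on the glusterd vs. glusterfsd distinction above, here is a rough sketch of how one might verify the state from any of the three nodes; the volume name 'myvol' is assumed and should be replaced with the real one:

    # glusterd (management daemon) and glusterfsd (brick processes) are separate
    pgrep -a glusterd        # management daemon
    pgrep -a glusterfsd      # one process per brick; these serve the data

    # confirm each brick is shown as online with a PID
    gluster volume status myvol

    # list entries still pending heal; a fully caught-up replica reports
    # "Number of entries: 0" for every brick
    gluster volume heal myvol info

If only glusterd was restarted while the glusterfsd brick processes stayed up, the heal info output stays empty, which is consistent with the behaviour described above.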
>>>> Martin >>>> >>>> >>> _______________________________________________ >>> Gluster-users mailing list >>> Gluster-users at gluster.org >>> https://lists.gluster.org/mailman/listinfo/gluster-users >> >> >> >> -- >> Regards, >> Hari Gowtham. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From anoopcs at cryptolab.net Tue Jun 11 05:03:43 2019 From: anoopcs at cryptolab.net (Anoop C S) Date: Tue, 11 Jun 2019 10:33:43 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Fwd: Build failed in Jenkins: regression-test-with-multiplex #1359 In-Reply-To: References: <24208463.92.1559325814227.JavaMail.jenkins@jenkins-el7.rht.gluster.org> Message-ID: <5f05e6372e856402bb344107920cc1693a47be13.camel@cryptolab.net> On Fri, 2019-06-07 at 10:06 +0530, Amar Tumballi Suryanarayan wrote: > Got time to test subdir-mount.t failing in brick-mux scenario. > > I noticed some issues, where I need further help from glusterd team. > > subdir-mount.t expects 'hook' script to run after add-brick to make > sure the required subdirectories are healed and are present in new > bricks. This is important as subdir mount expects the subdirs to > exist for successful mount. > > But in case of brick-mux setup, I see that in some cases (6/10), hook > script (add-brick/post-hook/S13-create-subdir-mount.sh) started > getting executed after 20second of finishing the add-brick command. > Due to this, the mount which we execute after add-brick failed. > > My question is, what is making post hook script to run so late ?? Note:- I would like to add that I have a patch[1] under review adding another post add-brick hook script which might further result in delayed execution of S13create-subdir-mounts.sh. Because S10selinux-label- brick.sh from my change comes before the existing hook script. [1] https://review.gluster.org/c/glusterfs/+/22834 > I can recreate the issues locally on my laptop too. > > > On Sat, Jun 1, 2019 at 4:55 PM Atin Mukherjee > wrote: > > subdir-mount.t has started failing in brick mux regression nightly. > > This needs to be fixed. > > > > Raghavendra - did we manage to get any further clue on uss.t > > failure? > > > > ---------- Forwarded message --------- > > From: > > Date: Fri, 31 May 2019 at 23:34 > > Subject: [Gluster-Maintainers] Build failed in Jenkins: regression- > > test-with-multiplex #1359 > > To: , , < > > amarts at redhat.com>, , > > > > > > See < > > https://build.gluster.org/job/regression-test-with-multiplex/1359/display/redirect?page=changes > > > > > > > Changes: > > > > [atin] glusterd: add an op-version check > > > > [atin] glusterd/svc: glusterd_svcs_stop should call individual > > wrapper function > > > > [atin] glusterd/svc: Stop stale process using the > > glusterd_proc_stop > > > > [Amar Tumballi] lcov: more coverage to shard, old-protocol, sdfs > > > > [Kotresh H R] tests/geo-rep: Add EC volume test case > > > > [Amar Tumballi] glusterfsd/cleanup: Protect graph object under a > > lock > > > > [Mohammed Rafi KC] glusterd/shd: Optimize the glustershd manager to > > send reconfigure > > > > [Kotresh H R] tests/geo-rep: Add tests to cover glusterd geo-rep > > > > [atin] glusterd: Optimize code to copy dictionary in handshake code > > path > > > > ------------------------------------------ > > [...truncated 3.18 MB...] 
> > ./tests/basic/afr/stale-file-lookup.t - 9 second > > ./tests/basic/afr/granular-esh/replace-brick.t - 9 second > > ./tests/basic/afr/granular-esh/add-brick.t - 9 second > > ./tests/basic/afr/gfid-mismatch.t - 9 second > > ./tests/performance/open-behind.t - 8 second > > ./tests/features/ssl-authz.t - 8 second > > ./tests/features/readdir-ahead.t - 8 second > > ./tests/bugs/upcall/bug-1458127.t - 8 second > > ./tests/bugs/transport/bug-873367.t - 8 second > > ./tests/bugs/replicate/bug-1498570-client-iot-graph-check.t - 8 > > second > > ./tests/bugs/replicate/bug-1132102.t - 8 second > > ./tests/bugs/quota/bug-1250582-volume-reset-should-not-remove- > > quota-quota-deem-statfs.t - 8 second > > ./tests/bugs/quota/bug-1104692.t - 8 second > > ./tests/bugs/posix/bug-1360679.t - 8 second > > ./tests/bugs/posix/bug-1122028.t - 8 second > > ./tests/bugs/nfs/bug-1157223-symlink-mounting.t - 8 second > > ./tests/bugs/glusterfs/bug-861015-log.t - 8 second > > ./tests/bugs/glusterd/sync-post-glusterd-restart.t - 8 second > > ./tests/bugs/glusterd/bug-1696046.t - 8 second > > ./tests/bugs/fuse/bug-983477.t - 8 second > > ./tests/bugs/ec/bug-1227869.t - 8 second > > ./tests/bugs/distribute/bug-1088231.t - 8 second > > ./tests/bugs/distribute/bug-1086228.t - 8 second > > ./tests/bugs/cli/bug-1087487.t - 8 second > > ./tests/bugs/cli/bug-1022905.t - 8 second > > ./tests/bugs/bug-1258069.t - 8 second > > ./tests/bugs/bitrot/1209752-volume-status-should-show-bitrot-scrub- > > info.t - 8 second > > ./tests/basic/xlator-pass-through-sanity.t - 8 second > > ./tests/basic/quota-nfs.t - 8 second > > ./tests/basic/glusterd/arbiter-volume.t - 8 second > > ./tests/basic/ctime/ctime-noatime.t - 8 second > > ./tests/line-coverage/cli-peer-and-volume-operations.t - 7 second > > ./tests/gfid2path/get-gfid-to-path.t - 7 second > > ./tests/bugs/upcall/bug-1369430.t - 7 second > > ./tests/bugs/snapshot/bug-1260848.t - 7 second > > ./tests/bugs/shard/shard-inode-refcount-test.t - 7 second > > ./tests/bugs/shard/bug-1258334.t - 7 second > > ./tests/bugs/replicate/bug-767585-gfid.t - 7 second > > ./tests/bugs/replicate/bug-1448804-check-quorum-type-values.t - 7 > > second > > ./tests/bugs/replicate/bug-1250170-fsync.t - 7 second > > ./tests/bugs/posix/bug-1175711.t - 7 second > > ./tests/bugs/nfs/bug-915280.t - 7 second > > ./tests/bugs/md-cache/setxattr-prepoststat.t - 7 second > > ./tests/bugs/md-cache/bug-1211863_unlink.t - 7 second > > ./tests/bugs/glusterfs/bug-848251.t - 7 second > > ./tests/bugs/distribute/bug-1122443.t - 7 second > > ./tests/bugs/changelog/bug-1208470.t - 7 second > > ./tests/bugs/bug-1702299.t - 7 second > > ./tests/bugs/bug-1371806_2.t - 7 second > > ./tests/bugs/bitrot/1209818-vol-info-show-scrub-process-properly.t > > - 7 second > > ./tests/bugs/bitrot/1209751-bitrot-scrub-tunable-reset.t - 7 > > second > > ./tests/bugs/bitrot/1207029-bitrot-daemon-should-start-on-valid- > > node.t - 7 second > > ./tests/bitrot/br-stub.t - 7 second > > ./tests/basic/glusterd/arbiter-volume-probe.t - 7 second > > ./tests/basic/gfapi/libgfapi-fini-hang.t - 7 second > > ./tests/basic/fencing/fencing-crash-conistency.t - 7 second > > ./tests/basic/distribute/file-create.t - 7 second > > ./tests/basic/afr/tarissue.t - 7 second > > ./tests/basic/afr/gfid-heal.t - 7 second > > ./tests/bugs/snapshot/bug-1178079.t - 6 second > > ./tests/bugs/snapshot/bug-1064768.t - 6 second > > ./tests/bugs/shard/bug-1342298.t - 6 second > > ./tests/bugs/shard/bug-1259651.t - 6 second > > 
./tests/bugs/replicate/bug-1686568-send-truncate-on-arbiter-from- > > shd.t - 6 second > > ./tests/bugs/replicate/bug-1626994-info-split-brain.t - 6 second > > ./tests/bugs/replicate/bug-1325792.t - 6 second > > ./tests/bugs/replicate/bug-1101647.t - 6 second > > ./tests/bugs/quota/bug-1243798.t - 6 second > > ./tests/bugs/protocol/bug-1321578.t - 6 second > > ./tests/bugs/nfs/bug-877885.t - 6 second > > ./tests/bugs/nfs/bug-1143880-fix-gNFSd-auth-crash.t - 6 second > > ./tests/bugs/md-cache/bug-1476324.t - 6 second > > ./tests/bugs/md-cache/afr-stale-read.t - 6 second > > ./tests/bugs/io-cache/bug-858242.t - 6 second > > ./tests/bugs/glusterfs/bug-893378.t - 6 second > > ./tests/bugs/glusterfs/bug-856455.t - 6 second > > ./tests/bugs/glusterd/quorum-value-check.t - 6 second > > ./tests/bugs/ec/bug-1179050.t - 6 second > > ./tests/bugs/distribute/bug-912564.t - 6 second > > ./tests/bugs/distribute/bug-884597.t - 6 second > > ./tests/bugs/distribute/bug-1368012.t - 6 second > > ./tests/bugs/core/bug-986429.t - 6 second > > ./tests/bugs/core/bug-1699025-brick-mux-detach-brick-fd-issue.t - > > 6 second > > ./tests/bugs/core/bug-1168803-snapd-option-validation-fix.t - 6 > > second > > ./tests/bugs/bug-1371806_1.t - 6 second > > ./tests/bugs/bitrot/bug-1229134-bitd-not-support-vol-set.t - 6 > > second > > ./tests/bugs/bitrot/bug-1210684-scrub-pause-resume-error- > > handling.t - 6 second > > ./tests/bitrot/bug-1221914.t - 6 second > > ./tests/basic/trace.t - 6 second > > ./tests/basic/playground/template-xlator-sanity.t - 6 second > > ./tests/basic/ec/nfs.t - 6 second > > ./tests/basic/ec/ec-read-policy.t - 6 second > > ./tests/basic/ec/ec-anonymous-fd.t - 6 second > > ./tests/basic/distribute/non-root-unlink-stale-linkto.t - 6 > > second > > ./tests/basic/changelog/changelog-rename.t - 6 second > > ./tests/basic/afr/heal-info.t - 6 second > > ./tests/basic/afr/afr-read-hash-mode.t - 6 second > > ./tests/gfid2path/gfid2path_nfs.t - 5 second > > ./tests/bugs/upcall/bug-1422776.t - 5 second > > ./tests/bugs/replicate/bug-886998.t - 5 second > > ./tests/bugs/replicate/bug-1365455.t - 5 second > > ./tests/bugs/readdir-ahead/bug-1670253-consistent-metadata.t - 5 > > second > > ./tests/bugs/posix/bug-gfid-path.t - 5 second > > ./tests/bugs/posix/bug-765380.t - 5 second > > ./tests/bugs/nfs/bug-847622.t - 5 second > > ./tests/bugs/nfs/bug-1116503.t - 5 second > > ./tests/bugs/io-stats/bug-1598548.t - 5 second > > ./tests/bugs/glusterfs-server/bug-877992.t - 5 second > > ./tests/bugs/glusterfs-server/bug-873549.t - 5 second > > ./tests/bugs/glusterfs/bug-895235.t - 5 second > > ./tests/bugs/fuse/bug-1126048.t - 5 second > > ./tests/bugs/distribute/bug-907072.t - 5 second > > ./tests/bugs/core/bug-913544.t - 5 second > > ./tests/bugs/core/bug-908146.t - 5 second > > ./tests/bugs/access-control/bug-1051896.t - 5 second > > ./tests/basic/ec/ec-internal-xattrs.t - 5 second > > ./tests/basic/ec/ec-fallocate.t - 5 second > > ./tests/basic/distribute/bug-1265677-use-readdirp.t - 5 second > > ./tests/basic/afr/arbiter-remove-brick.t - 5 second > > ./tests/performance/quick-read.t - 4 second > > ./tests/gfid2path/block-mount-access.t - 4 second > > ./tests/features/delay-gen.t - 4 second > > ./tests/bugs/upcall/bug-upcall-stat.t - 4 second > > ./tests/bugs/upcall/bug-1394131.t - 4 second > > ./tests/bugs/unclassified/bug-1034085.t - 4 second > > ./tests/bugs/snapshot/bug-1111041.t - 4 second > > ./tests/bugs/shard/bug-1272986.t - 4 second > > ./tests/bugs/shard/bug-1256580.t - 4 second > > 
./tests/bugs/shard/bug-1250855.t - 4 second > > ./tests/bugs/shard/bug-1245547.t - 4 second > > ./tests/bugs/rpc/bug-954057.t - 4 second > > ./tests/bugs/replicate/bug-976800.t - 4 second > > ./tests/bugs/replicate/bug-880898.t - 4 second > > ./tests/bugs/replicate/bug-1480525.t - 4 second > > ./tests/bugs/read-only/bug-1134822-read-only-default-in-graph.t - > > 4 second > > ./tests/bugs/readdir-ahead/bug-1446516.t - 4 second > > ./tests/bugs/readdir-ahead/bug-1439640.t - 4 second > > ./tests/bugs/readdir-ahead/bug-1390050.t - 4 second > > ./tests/bugs/quota/bug-1287996.t - 4 second > > ./tests/bugs/quick-read/bug-846240.t - 4 second > > ./tests/bugs/posix/disallow-gfid-volumeid-removexattr.t - 4 > > second > > ./tests/bugs/posix/bug-1619720.t - 4 second > > ./tests/bugs/nl-cache/bug-1451588.t - 4 second > > ./tests/bugs/nfs/zero-atime.t - 4 second > > ./tests/bugs/nfs/subdir-trailing-slash.t - 4 second > > ./tests/bugs/nfs/socket-as-fifo.t - 4 second > > ./tests/bugs/nfs/showmount-many-clients.t - 4 second > > ./tests/bugs/nfs/bug-1210338.t - 4 second > > ./tests/bugs/nfs/bug-1166862.t - 4 second > > ./tests/bugs/nfs/bug-1161092-nfs-acls.t - 4 second > > ./tests/bugs/md-cache/bug-1632503.t - 4 second > > ./tests/bugs/glusterfs-server/bug-864222.t - 4 second > > ./tests/bugs/glusterfs/bug-1482528.t - 4 second > > ./tests/bugs/glusterd/bug-948729/bug-948729-mode-script.t - 4 > > second > > ./tests/bugs/glusterd/bug-948729/bug-948729-force.t - 4 second > > ./tests/bugs/glusterd/bug-1482906-peer-file-blank-line.t - 4 > > second > > ./tests/bugs/glusterd/bug-1091935-brick-order-check-from-cli-to- > > glusterd.t - 4 second > > ./tests/bugs/geo-replication/bug-1296496.t - 4 second > > ./tests/bugs/fuse/bug-1336818.t - 4 second > > ./tests/bugs/fuse/bug-1283103.t - 4 second > > ./tests/bugs/core/io-stats-1322825.t - 4 second > > ./tests/bugs/core/bug-834465.t - 4 second > > ./tests/bugs/core/bug-1135514-allow-setxattr-with-null-value.t - > > 4 second > > ./tests/bugs/core/949327.t - 4 second > > ./tests/bugs/cli/bug-977246.t - 4 second > > ./tests/bugs/cli/bug-961307.t - 4 second > > ./tests/bugs/cli/bug-1004218.t - 4 second > > ./tests/bugs/bug-1138841.t - 4 second > > ./tests/bugs/access-control/bug-1387241.t - 4 second > > ./tests/bitrot/bug-internal-xattrs-check-1243391.t - 4 second > > ./tests/basic/quota-rename.t - 4 second > > ./tests/basic/hardlink-limit.t - 4 second > > ./tests/basic/ec/dht-rename.t - 4 second > > ./tests/basic/distribute/lookup.t - 4 second > > ./tests/line-coverage/meta-max-coverage.t - 3 second > > ./tests/gfid2path/gfid2path_fuse.t - 3 second > > ./tests/bugs/unclassified/bug-991622.t - 3 second > > ./tests/bugs/trace/bug-797171.t - 3 second > > ./tests/bugs/glusterfs-server/bug-861542.t - 3 second > > ./tests/bugs/glusterfs/bug-869724.t - 3 second > > ./tests/bugs/glusterfs/bug-860297.t - 3 second > > ./tests/bugs/glusterfs/bug-844688.t - 3 second > > ./tests/bugs/glusterd/bug-948729/bug-948729.t - 3 second > > ./tests/bugs/distribute/bug-1204140.t - 3 second > > ./tests/bugs/core/bug-924075.t - 3 second > > ./tests/bugs/core/bug-845213.t - 3 second > > ./tests/bugs/core/bug-1421721-mpx-toggle.t - 3 second > > ./tests/bugs/core/bug-1119582.t - 3 second > > ./tests/bugs/core/bug-1117951.t - 3 second > > ./tests/bugs/cli/bug-983317-volume-get.t - 3 second > > ./tests/bugs/cli/bug-867252.t - 3 second > > ./tests/basic/glusterd/check-cloudsync-ancestry.t - 3 second > > ./tests/basic/fops-sanity.t - 3 second > > ./tests/basic/fencing/test-fence-option.t - 3 second > > 
./tests/basic/distribute/debug-xattrs.t - 3 second > > ./tests/basic/afr/ta-check-locks.t - 3 second > > ./tests/line-coverage/volfile-with-all-graph-syntax.t - 2 second > > ./tests/line-coverage/some-features-in-libglusterfs.t - 2 second > > ./tests/bugs/shard/bug-1261773.t - 2 second > > ./tests/bugs/replicate/bug-884328.t - 2 second > > ./tests/bugs/readdir-ahead/bug-1512437.t - 2 second > > ./tests/bugs/nfs/bug-970070.t - 2 second > > ./tests/bugs/nfs/bug-1302948.t - 2 second > > ./tests/bugs/logging/bug-823081.t - 2 second > > ./tests/bugs/glusterfs-server/bug-889996.t - 2 second > > ./tests/bugs/glusterfs/bug-892730.t - 2 second > > ./tests/bugs/glusterfs/bug-811493.t - 2 second > > ./tests/bugs/glusterd/bug-1085330-and-bug-916549.t - 2 second > > ./tests/bugs/distribute/bug-924265.t - 2 second > > ./tests/bugs/core/log-bug-1362520.t - 2 second > > ./tests/bugs/core/bug-903336.t - 2 second > > ./tests/bugs/core/bug-1111557.t - 2 second > > ./tests/bugs/cli/bug-969193.t - 2 second > > ./tests/bugs/cli/bug-949298.t - 2 second > > ./tests/bugs/cli/bug-921215.t - 2 second > > ./tests/bugs/cli/bug-1378842-volume-get-all.t - 2 second > > ./tests/basic/peer-parsing.t - 2 second > > ./tests/basic/md-cache/bug-1418249.t - 2 second > > ./tests/basic/afr/arbiter-cli.t - 2 second > > ./tests/bugs/replicate/ta-inode-refresh-read.t - 1 second > > ./tests/bugs/glusterfs/bug-853690.t - 1 second > > ./tests/bugs/cli/bug-764638.t - 1 second > > ./tests/bugs/cli/bug-1047378.t - 1 second > > ./tests/basic/netgroup_parsing.t - 1 second > > ./tests/basic/gfapi/sink.t - 1 second > > ./tests/basic/exports_parsing.t - 1 second > > ./tests/basic/posixonly.t - 0 second > > ./tests/basic/glusterfsd-args.t - 0 second > > > > > > 2 test(s) failed > > ./tests/basic/uss.t > > ./tests/features/subdir-mount.t > > > > 0 test(s) generated core > > > > > > 5 test(s) needed retry > > ./tests/basic/afr/split-brain-favorite-child-policy.t > > ./tests/basic/ec/self-heal.t > > ./tests/basic/uss.t > > ./tests/basic/volfile-sanity.t > > ./tests/features/subdir-mount.t > > > > Result is 1 > > > > tar: Removing leading `/' from member names > > kernel.core_pattern = /%e-%p.core > > Build step 'Execute shell' marked build as failure > > _______________________________________________ > > maintainers mailing list > > maintainers at gluster.org > > https://lists.gluster.org/mailman/listinfo/maintainers > > > > > > -- > > - Atin (atinm) > > _______________________________________________ > > maintainers mailing list > > maintainers at gluster.org > > https://lists.gluster.org/mailman/listinfo/maintainers > > > _______________________________________________ > maintainers mailing list > maintainers at gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers From amukherj at redhat.com Tue Jun 11 08:51:35 2019 From: amukherj at redhat.com (Atin Mukherjee) Date: Tue, 11 Jun 2019 14:21:35 +0530 Subject: [Gluster-devel] https://build.gluster.org/job/centos7-regression/6404/consoleFull - Problem accessing //job/centos7-regression/6404/consoleFull. Reason: Not found Message-ID: https://bugzilla.redhat.com/show_bug.cgi?id=1719174 The patch which failed the regression is https://review.gluster.org/22851 . -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From linux at eikelenboom.it Tue Jun 11 10:46:38 2019 From: linux at eikelenboom.it (Sander Eikelenboom) Date: Tue, 11 Jun 2019 12:46:38 +0200 Subject: [Gluster-devel] Linux 5.2-RC regression bisected, mounting glusterfs volumes fails after commit: fuse: require /dev/fuse reads to have enough buffer capacity Message-ID: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> L.S., While testing a linux 5.2 kernel I noticed it fails to mount my glusterfs volumes. It repeatedly fails with: [2019-06-11 09:15:27.106946] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) [2019-06-11 09:15:27.106955] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) [2019-06-11 09:15:27.106963] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) [2019-06-11 09:15:27.106971] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) etc. etc. Bisecting turned up as culprit: commit d4b13963f217dd947da5c0cabd1569e914d21699: fuse: require /dev/fuse reads to have enough buffer capacity The glusterfs version i'm using is from Debian stable: ii glusterfs-client 3.8.8-1 amd64 clustered file-system (client package) ii glusterfs-common 3.8.8-1 amd64 GlusterFS common libraries and translator modules A 5.1.* kernel works fine, as does a 5.2-rc4 kernel with said commit reverted. -- Sander From atumball at redhat.com Tue Jun 11 11:40:40 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Tue, 11 Jun 2019 17:10:40 +0530 Subject: [Gluster-devel] Linux 5.2-RC regression bisected, mounting glusterfs volumes fails after commit: fuse: require /dev/fuse reads to have enough buffer capacity In-Reply-To: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> Message-ID: Thanks for the heads up! We will see how to revert / fix the issue properly for 5.2 kernel. -Amar On Tue, Jun 11, 2019 at 4:34 PM Sander Eikelenboom wrote: > L.S., > > While testing a linux 5.2 kernel I noticed it fails to mount my glusterfs > volumes. > > It repeatedly fails with: > [2019-06-11 09:15:27.106946] W [fuse-bridge.c:4993:fuse_thread_proc] > 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106955] W [fuse-bridge.c:4993:fuse_thread_proc] > 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106963] W [fuse-bridge.c:4993:fuse_thread_proc] > 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106971] W [fuse-bridge.c:4993:fuse_thread_proc] > 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > etc. > etc. > > Bisecting turned up as culprit: > commit d4b13963f217dd947da5c0cabd1569e914d21699: fuse: require > /dev/fuse reads to have enough buffer capacity > > The glusterfs version i'm using is from Debian stable: > ii glusterfs-client 3.8.8-1 > amd64 clustered file-system (client package) > ii glusterfs-common 3.8.8-1 > amd64 GlusterFS common libraries and translator modules > > > A 5.1.* kernel works fine, as does a 5.2-rc4 kernel with said commit > reverted. 
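For reference, a minimal sketch of double-checking the bisect result on a test machine by reverting the suspect commit on top of 5.2-rc4; the build and install steps are only indicative and depend on the local setup:

    cd linux
    git checkout v5.2-rc4
    git revert d4b13963f217dd947da5c0cabd1569e914d21699
    make -j"$(nproc)"
    sudo make modules_install
    sudo make install

Booting the resulting kernel and mounting a glusterfs volume over FUSE should succeed again, matching the report above.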
> > -- > Sander > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From hgowtham at redhat.com Wed Jun 12 09:14:04 2019 From: hgowtham at redhat.com (Hari Gowtham) Date: Wed, 12 Jun 2019 14:44:04 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 Message-ID: Hi, Due to the recent changes we made. we have a build issue because of glupy. As glupy is already removed from master, we are thinking of removing it in 5.7 as well rather than fixing the issue. The release of 5.7 will be delayed as we have send a patch to fix this issue. And if anyone has any concerns, do let us know. -- Regards, Hari Gowtham. From kirr at nexedi.com Wed Jun 12 11:25:51 2019 From: kirr at nexedi.com (Kirill Smelkov) Date: Wed, 12 Jun 2019 11:25:51 +0000 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> Message-ID: <20190612112544.GA21465@deco.navytux.spb.ru> On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: > On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: > > > Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for > > header room change be accepted? > > Yes, next cycle. For 4.2 I'll just push the revert. Thanks Miklos. Please consider queuing the following patch for 5.3. Sander, could you please confirm that glusterfs is not broken with this version of the check? Thanks beforehand, Kirill ---- 8< ---- >From 24a04e8be9bbf6e67de9e1908dcbe95d426d2521 Mon Sep 17 00:00:00 2001 From: Kirill Smelkov Date: Wed, 27 Mar 2019 10:15:15 +0000 Subject: [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) [ This retries commit d4b13963f217 which was reverted in 766741fcaa1f. In this version we require only `sizeof(fuse_in_header) + sizeof(fuse_write_in)` instead of 4K for FUSE request header room, because, contrary to libfuse and kernel client behaviour, GlusterFS actually provides only so much room for request header. ] A FUSE filesystem server queues /dev/fuse sys_read calls to get filesystem requests to handle. It does not know in advance what would be that request as it can be anything that client issues - LOOKUP, READ, WRITE, ... Many requests are short and retrieve data from the filesystem. However WRITE and NOTIFY_REPLY write data into filesystem. Before getting into operation phase, FUSE filesystem server and kernel client negotiate what should be the maximum write size the client will ever issue. After negotiation the contract in between server/client is that the filesystem server then should queue /dev/fuse sys_read calls with enough buffer capacity to receive any client request - WRITE in particular, while FUSE client should not, in particular, send WRITE requests with > negotiated max_write payload. FUSE client in kernel and libfuse historically reserve 4K for request header. 
However an existing filesystem server - GlusterFS - was found which reserves only 80 bytes for header room (= `sizeof(fuse_in_header) + sizeof(fuse_write_in)`). https://lore.kernel.org/linux-fsdevel/20190611202738.GA22556 at deco.navytux.spb.ru/ https://github.com/gluster/glusterfs/blob/v3.8.15-0-gd174f021a/xlators/mount/fuse/src/fuse-bridge.c#L4894 Since `sizeof(fuse_in_header) + sizeof(fuse_write_in)` == `sizeof(fuse_in_header) + sizeof(fuse_read_in)` == `sizeof(fuse_in_header) + sizeof(fuse_notify_retrieve_in)` is the absolute minimum any sane filesystem should be using for header room, the contract is that filesystem server should queue sys_reads with `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + max_write buffer. If the filesystem server does not follow this contract, what can happen is that fuse_dev_do_read will see that request size is > buffer size, and then it will return EIO to client who issued the request but won't indicate in any way that there is a problem to filesystem server. This can be hard to diagnose because for some requests, e.g. for NOTIFY_REPLY which mimics WRITE, there is no client thread that is waiting for request completion and that EIO goes nowhere, while on filesystem server side things look like the kernel is not replying back after successful NOTIFY_RETRIEVE request made by the server. We can make the problem easy to diagnose if we indicate via error return to filesystem server when it is violating the contract. This should not practically cause problems because if a filesystem server is using shorter buffer, writes to it were already very likely to cause EIO, and if the filesystem is read-only it should be too following FUSE_MIN_READ_BUFFER minimum buffer size. Please see [1] for context where the problem of stuck filesystem was hit for real (because kernel client was incorrectly sending more than max_write data with NOTIFY_REPLY; see also previous patch), how the situation was traced and for more involving patch that did not make it into the tree. [1] https://marc.info/?l=linux-fsdevel&m=155057023600853&w=2 Signed-off-by: Kirill Smelkov Cc: Han-Wen Nienhuys Cc: Jakob Unterwurzacher Signed-off-by: Miklos Szeredi --- fs/fuse/dev.c | 19 +++++++++++++++++++ 1 file changed, 19 insertions(+) diff --git a/fs/fuse/dev.c b/fs/fuse/dev.c index ea8237513dfa..15531ba560b5 100644 --- a/fs/fuse/dev.c +++ b/fs/fuse/dev.c @@ -1317,6 +1317,25 @@ static ssize_t fuse_dev_do_read(struct fuse_dev *fud, struct file *file, unsigned reqsize; unsigned int hash; + /* + * Require sane minimum read buffer - that has capacity for fixed part + * of any request header + negotiated max_write room for data. If the + * requirement is not satisfied return EINVAL to the filesystem server + * to indicate that it is not following FUSE server/client contract. + * Don't dequeue / abort any request. + * + * Historically libfuse reserves 4K for fixed header room, but e.g. + * GlusterFS reserves only 80 bytes + * + * = `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + * + * which is the absolute minimum any sane filesystem should be using + * for header room. 
+ */ + if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, + sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) + return -EINVAL; + restart: spin_lock(&fiq->waitq.lock); err = -EAGAIN; -- 2.20.1 From linux at eikelenboom.it Wed Jun 12 12:11:37 2019 From: linux at eikelenboom.it (Sander Eikelenboom) Date: Wed, 12 Jun 2019 14:11:37 +0200 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: <20190612112544.GA21465@deco.navytux.spb.ru> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> <20190612112544.GA21465@deco.navytux.spb.ru> Message-ID: <97c87eb3-5b95-c848-8c50-ed7b535220b0@eikelenboom.it> On 12/06/2019 13:25, Kirill Smelkov wrote: > On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: >> On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: >> >>> Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for >>> header room change be accepted? >> >> Yes, next cycle. For 4.2 I'll just push the revert. > > Thanks Miklos. Please consider queuing the following patch for 5.3. > Sander, could you please confirm that glusterfs is not broken with this > version of the check? > > Thanks beforehand, > Kirill Sure will give it a spin this evening and report back. -- Sander From linux at eikelenboom.it Wed Jun 12 13:03:49 2019 From: linux at eikelenboom.it (Sander Eikelenboom) Date: Wed, 12 Jun 2019 15:03:49 +0200 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: <20190612112544.GA21465@deco.navytux.spb.ru> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> <20190612112544.GA21465@deco.navytux.spb.ru> Message-ID: On 12/06/2019 13:25, Kirill Smelkov wrote: > On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: >> On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: >> >>> Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for >>> header room change be accepted? >> >> Yes, next cycle. For 4.2 I'll just push the revert. > > Thanks Miklos. Please consider queuing the following patch for 5.3. > Sander, could you please confirm that glusterfs is not broken with this > version of the check? > > Thanks beforehand, > Kirill Hmm unfortunately it doesn't build, see below. -- Sander In file included from ./include/linux/list.h:9:0, from ./include/linux/wait.h:7, from ./include/linux/wait_bit.h:8, from ./include/linux/fs.h:6, from fs/fuse/fuse_i.h:17, from fs/fuse/dev.c:9: fs/fuse/dev.c: In function ?fuse_dev_do_read?: fs/fuse/dev.c:1336:14: error: ?fuse_in_header? undeclared (first use in this function) sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) ^ ./include/linux/kernel.h:818:40: note: in definition of macro ?__typecheck? (!!(sizeof((typeof(x) *)1 == (typeof(y) *)1))) ^ ./include/linux/kernel.h:842:24: note: in expansion of macro ?__safe_cmp? __builtin_choose_expr(__safe_cmp(x, y), \ ^~~~~~~~~~ ./include/linux/kernel.h:918:27: note: in expansion of macro ?__careful_cmp? #define max_t(type, x, y) __careful_cmp((type)(x), (type)(y), >) ^~~~~~~~~~~~~ fs/fuse/dev.c:1335:15: note: in expansion of macro ?max_t? 
if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, ^~~~~ fs/fuse/dev.c:1336:14: note: each undeclared identifier is reported only once for each function it appears in sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) ^ ./include/linux/kernel.h:818:40: note: in definition of macro ?__typecheck? (!!(sizeof((typeof(x) *)1 == (typeof(y) *)1))) ^ ./include/linux/kernel.h:842:24: note: in expansion of macro ?__safe_cmp? __builtin_choose_expr(__safe_cmp(x, y), \ ^~~~~~~~~~ ./include/linux/kernel.h:918:27: note: in expansion of macro ?__careful_cmp? #define max_t(type, x, y) __careful_cmp((type)(x), (type)(y), >) ^~~~~~~~~~~~~ fs/fuse/dev.c:1335:15: note: in expansion of macro ?max_t? if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, ^~~~~ fs/fuse/dev.c:1336:39: error: ?fuse_write_in? undeclared (first use in this function) sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) ^ ./include/linux/kernel.h:818:40: note: in definition of macro ?__typecheck? (!!(sizeof((typeof(x) *)1 == (typeof(y) *)1))) ^ ./include/linux/kernel.h:842:24: note: in expansion of macro ?__safe_cmp? __builtin_choose_expr(__safe_cmp(x, y), \ ^~~~~~~~~~ ./include/linux/kernel.h:918:27: note: in expansion of macro ?__careful_cmp? #define max_t(type, x, y) __careful_cmp((type)(x), (type)(y), >) ^~~~~~~~~~~~~ fs/fuse/dev.c:1335:15: note: in expansion of macro ?max_t? if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, ^~~~~ ./include/linux/kernel.h:842:2: error: first argument to ?__builtin_choose_expr? not a constant __builtin_choose_expr(__safe_cmp(x, y), \ ^ ./include/linux/kernel.h:918:27: note: in expansion of macro ?__careful_cmp? #define max_t(type, x, y) __careful_cmp((type)(x), (type)(y), >) ^~~~~~~~~~~~~ fs/fuse/dev.c:1335:15: note: in expansion of macro ?max_t? if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, ^~~~~ scripts/Makefile.build:278: recipe for target 'fs/fuse/dev.o' failed make[3]: *** [fs/fuse/dev.o] Error 1 scripts/Makefile.build:489: recipe for target 'fs/fuse' failed make[2]: *** [fs/fuse] Error 2 From ndevos at redhat.com Wed Jun 12 13:34:37 2019 From: ndevos at redhat.com (Niels de Vos) Date: Wed, 12 Jun 2019 15:34:37 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: Message-ID: <20190612133437.GK8725@ndevos-x270> On Wed, Jun 12, 2019 at 02:44:04PM +0530, Hari Gowtham wrote: > Hi, > > Due to the recent changes we made. we have a build issue because of glupy. > As glupy is already removed from master, we are thinking of removing > it in 5.7 as well rather than fixing the issue. > > The release of 5.7 will be delayed as we have send a patch to fix this issue. > And if anyone has any concerns, do let us know. Could you link to the BZ with the build error and patches that attempt fixing it? We normally do not remove features with minor updates. Fixing the build error would be the preferred approach. Thanks, Niels From hgowtham at redhat.com Wed Jun 12 14:24:17 2019 From: hgowtham at redhat.com (Hari Gowtham) Date: Wed, 12 Jun 2019 19:54:17 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190612133437.GK8725@ndevos-x270> References: <20190612133437.GK8725@ndevos-x270> Message-ID: We haven't sent any patch to fix it. Waiting for the decision to be made. 
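One quick way to narrow down where glupy goes missing is to compare the 'make dist' tarball with the source tree; a rough sketch, with an assumed tarball name:

    # if glupy's Makefile.in is absent from the tarball, config.status cannot
    # create xlators/features/glupy/Makefile and the rpm build fails later
    tar tzf glusterfs-5.7.tar.gz | grep -i glupy

    # what the source tree itself ships
    git ls-files | grep -i glupy

An empty first listing would point at the tarball generation step rather than at the spec file.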
The bz: https://bugzilla.redhat.com/show_bug.cgi?id=1719778 The link to the build log: https://build.gluster.org/job/strfmt_errors/18888/artifact/RPMS/el6/i686/build.log The last few messages in the log: config.status: creating xlators/features/changelog/lib/src/Makefile config.status: creating xlators/features/changetimerecorder/Makefile config.status: creating xlators/features/changetimerecorder/src/Makefile BUILDSTDERR: config.status: error: cannot find input file: xlators/features/glupy/Makefile.in RPM build errors: BUILDSTDERR: error: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) BUILDSTDERR: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) Child return code was: 1 EXCEPTION: [Error()] Traceback (most recent call last): File "/usr/lib/python3.6/site-packages/mockbuild/trace_decorator.py", line 96, in trace result = func(*args, **kw) File "/usr/lib/python3.6/site-packages/mockbuild/util.py", line 736, in do_with_status raise exception.Error("Command failed: \n # %s\n%s" % (command, output), child.returncode) mockbuild.exception.Error: Command failed: # bash --login -c /usr/bin/rpmbuild -bb --target i686 --nodeps /builddir/build/SPECS/glusterfs.spec On Wed, Jun 12, 2019 at 7:04 PM Niels de Vos wrote: > > On Wed, Jun 12, 2019 at 02:44:04PM +0530, Hari Gowtham wrote: > > Hi, > > > > Due to the recent changes we made. we have a build issue because of glupy. > > As glupy is already removed from master, we are thinking of removing > > it in 5.7 as well rather than fixing the issue. > > > > The release of 5.7 will be delayed as we have send a patch to fix this issue. > > And if anyone has any concerns, do let us know. > > Could you link to the BZ with the build error and patches that attempt > fixing it? > > We normally do not remove features with minor updates. Fixing the build > error would be the preferred approach. > > Thanks, > Niels -- Regards, Hari Gowtham. From kirr at nexedi.com Wed Jun 12 14:12:26 2019 From: kirr at nexedi.com (Kirill Smelkov) Date: Wed, 12 Jun 2019 14:12:26 +0000 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> <20190612112544.GA21465@deco.navytux.spb.ru> Message-ID: <20190612141220.GA25389@deco.navytux.spb.ru> On Wed, Jun 12, 2019 at 03:03:49PM +0200, Sander Eikelenboom wrote: > On 12/06/2019 13:25, Kirill Smelkov wrote: > > On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: > >> On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: > >> > >>> Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for > >>> header room change be accepted? > >> > >> Yes, next cycle. For 4.2 I'll just push the revert. > > > > Thanks Miklos. Please consider queuing the following patch for 5.3. > > Sander, could you please confirm that glusterfs is not broken with this > > version of the check? > > > > Thanks beforehand, > > Kirill > > > Hmm unfortunately it doesn't build, see below. > [...] > fs/fuse/dev.c:1336:14: error: ?fuse_in_header? undeclared (first use in this function) > sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) Sorry, my bad, it was missing "struct" before fuse_in_header. I originally compile-tested the patch with `make -j4`, was distracted onto other topic and did not see the error after returning due to long tail of successful CC lines. Apologize for the inconvenience. 
Below is a fixed patch that was both compile-tested and runtime-tested with my FUSE workloads (non-glusterfs). Kirill ---- 8< ---- >From 98fd29bb6789d5f6c346274b99d47008ad856607 Mon Sep 17 00:00:00 2001 From: Kirill Smelkov Date: Wed, 12 Jun 2019 17:06:18 +0300 Subject: [PATCH v2] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) [ This retries commit d4b13963f217 which was reverted in 766741fcaa1f. In this version we require only `sizeof(fuse_in_header) + sizeof(fuse_write_in)` instead of 4K for FUSE request header room, because, contrary to libfuse and kernel client behaviour, GlusterFS actually provides only so much room for request header. ] A FUSE filesystem server queues /dev/fuse sys_read calls to get filesystem requests to handle. It does not know in advance what would be that request as it can be anything that client issues - LOOKUP, READ, WRITE, ... Many requests are short and retrieve data from the filesystem. However WRITE and NOTIFY_REPLY write data into filesystem. Before getting into operation phase, FUSE filesystem server and kernel client negotiate what should be the maximum write size the client will ever issue. After negotiation the contract in between server/client is that the filesystem server then should queue /dev/fuse sys_read calls with enough buffer capacity to receive any client request - WRITE in particular, while FUSE client should not, in particular, send WRITE requests with > negotiated max_write payload. FUSE client in kernel and libfuse historically reserve 4K for request header. However an existing filesystem server - GlusterFS - was found which reserves only 80 bytes for header room (= `sizeof(fuse_in_header) + sizeof(fuse_write_in)`). https://lore.kernel.org/linux-fsdevel/20190611202738.GA22556 at deco.navytux.spb.ru/ https://github.com/gluster/glusterfs/blob/v3.8.15-0-gd174f021a/xlators/mount/fuse/src/fuse-bridge.c#L4894 Since `sizeof(fuse_in_header) + sizeof(fuse_write_in)` == `sizeof(fuse_in_header) + sizeof(fuse_read_in)` == `sizeof(fuse_in_header) + sizeof(fuse_notify_retrieve_in)` is the absolute minimum any sane filesystem should be using for header room, the contract is that filesystem server should queue sys_reads with `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + max_write buffer. If the filesystem server does not follow this contract, what can happen is that fuse_dev_do_read will see that request size is > buffer size, and then it will return EIO to client who issued the request but won't indicate in any way that there is a problem to filesystem server. This can be hard to diagnose because for some requests, e.g. for NOTIFY_REPLY which mimics WRITE, there is no client thread that is waiting for request completion and that EIO goes nowhere, while on filesystem server side things look like the kernel is not replying back after successful NOTIFY_RETRIEVE request made by the server. We can make the problem easy to diagnose if we indicate via error return to filesystem server when it is violating the contract. This should not practically cause problems because if a filesystem server is using shorter buffer, writes to it were already very likely to cause EIO, and if the filesystem is read-only it should be too following FUSE_MIN_READ_BUFFER minimum buffer size. 
Please see [1] for context where the problem of stuck filesystem was hit for real (because kernel client was incorrectly sending more than max_write data with NOTIFY_REPLY; see also previous patch), how the situation was traced and for more involving patch that did not make it into the tree. [1] https://marc.info/?l=linux-fsdevel&m=155057023600853&w=2 Signed-off-by: Kirill Smelkov Cc: Han-Wen Nienhuys Cc: Jakob Unterwurzacher --- fs/fuse/dev.c | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) diff --git a/fs/fuse/dev.c b/fs/fuse/dev.c index ea8237513dfa..b2b2344eadcf 100644 --- a/fs/fuse/dev.c +++ b/fs/fuse/dev.c @@ -1317,6 +1317,26 @@ static ssize_t fuse_dev_do_read(struct fuse_dev *fud, struct file *file, unsigned reqsize; unsigned int hash; + /* + * Require sane minimum read buffer - that has capacity for fixed part + * of any request header + negotiated max_write room for data. If the + * requirement is not satisfied return EINVAL to the filesystem server + * to indicate that it is not following FUSE server/client contract. + * Don't dequeue / abort any request. + * + * Historically libfuse reserves 4K for fixed header room, but e.g. + * GlusterFS reserves only 80 bytes + * + * = `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + * + * which is the absolute minimum any sane filesystem should be using + * for header room. + */ + if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, + sizeof(struct fuse_in_header) + sizeof(struct fuse_write_in) + + fc->max_write)) + return -EINVAL; + restart: spin_lock(&fiq->waitq.lock); err = -EAGAIN; -- 2.20.1 From ndevos at redhat.com Wed Jun 12 15:11:42 2019 From: ndevos at redhat.com (Niels de Vos) Date: Wed, 12 Jun 2019 17:11:42 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> Message-ID: <20190612151142.GL8725@ndevos-x270> On Wed, Jun 12, 2019 at 07:54:17PM +0530, Hari Gowtham wrote: > We haven't sent any patch to fix it. > Waiting for the decision to be made. > The bz: https://bugzilla.redhat.com/show_bug.cgi?id=1719778 > The link to the build log: > https://build.gluster.org/job/strfmt_errors/18888/artifact/RPMS/el6/i686/build.log > > The last few messages in the log: > > config.status: creating xlators/features/changelog/lib/src/Makefile > config.status: creating xlators/features/changetimerecorder/Makefile > config.status: creating xlators/features/changetimerecorder/src/Makefile > BUILDSTDERR: config.status: error: cannot find input file: > xlators/features/glupy/Makefile.in > RPM build errors: > BUILDSTDERR: error: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) > BUILDSTDERR: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) > Child return code was: 1 > EXCEPTION: [Error()] > Traceback (most recent call last): > File "/usr/lib/python3.6/site-packages/mockbuild/trace_decorator.py", > line 96, in trace > result = func(*args, **kw) > File "/usr/lib/python3.6/site-packages/mockbuild/util.py", line 736, > in do_with_status > raise exception.Error("Command failed: \n # %s\n%s" % (command, > output), child.returncode) > mockbuild.exception.Error: Command failed: > # bash --login -c /usr/bin/rpmbuild -bb --target i686 --nodeps > /builddir/build/SPECS/glusterfs.spec Those messages are caused by missing files. The 'make dist' that generates the tarball in the previous step did not included the glupy files. 
https://build.gluster.org/job/strfmt_errors/18888/console contains the following message: configure: WARNING: --------------------------------------------------------------------------------- cannot build glupy. python 3.6 and python-devel/python-dev package are required. --------------------------------------------------------------------------------- I am not sure if there have been any recent backports to release-5 that introduced this behaviour. Maybe it is related to the builder where the tarball is generated. The job seems to detect python-3.6.8, which is not included in CentOS-7 for all I know? Maybe someone else understands how this can happen? HTH, Niels > > On Wed, Jun 12, 2019 at 7:04 PM Niels de Vos wrote: > > > > On Wed, Jun 12, 2019 at 02:44:04PM +0530, Hari Gowtham wrote: > > > Hi, > > > > > > Due to the recent changes we made. we have a build issue because of glupy. > > > As glupy is already removed from master, we are thinking of removing > > > it in 5.7 as well rather than fixing the issue. > > > > > > The release of 5.7 will be delayed as we have send a patch to fix this issue. > > > And if anyone has any concerns, do let us know. > > > > Could you link to the BZ with the build error and patches that attempt > > fixing it? > > > > We normally do not remove features with minor updates. Fixing the build > > error would be the preferred approach. > > > > Thanks, > > Niels > > > > -- > Regards, > Hari Gowtham. From linux at eikelenboom.it Wed Jun 12 16:28:17 2019 From: linux at eikelenboom.it (Sander Eikelenboom) Date: Wed, 12 Jun 2019 18:28:17 +0200 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: <20190612141220.GA25389@deco.navytux.spb.ru> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> <20190612112544.GA21465@deco.navytux.spb.ru> <20190612141220.GA25389@deco.navytux.spb.ru> Message-ID: On 12/06/2019 16:12, Kirill Smelkov wrote: > On Wed, Jun 12, 2019 at 03:03:49PM +0200, Sander Eikelenboom wrote: >> On 12/06/2019 13:25, Kirill Smelkov wrote: >>> On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: >>>> On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: >>>> >>>>> Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for >>>>> header room change be accepted? >>>> >>>> Yes, next cycle. For 4.2 I'll just push the revert. >>> >>> Thanks Miklos. Please consider queuing the following patch for 5.3. >>> Sander, could you please confirm that glusterfs is not broken with this >>> version of the check? >>> >>> Thanks beforehand, >>> Kirill >> >> >> Hmm unfortunately it doesn't build, see below. >> [...] >> fs/fuse/dev.c:1336:14: error: ?fuse_in_header? undeclared (first use in this function) >> sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) > > Sorry, my bad, it was missing "struct" before fuse_in_header. I > originally compile-tested the patch with `make -j4`, was distracted onto > other topic and did not see the error after returning due to long tail > of successful CC lines. Apologize for the inconvenience. Below is a > fixed patch that was both compile-tested and runtime-tested with my FUSE > workloads (non-glusterfs). > > Kirill > Just tested and it works for me, thanks ! 
-- Sander From kirr at nexedi.com Wed Jun 12 17:03:04 2019 From: kirr at nexedi.com (Kirill Smelkov) Date: Wed, 12 Jun 2019 17:03:04 +0000 Subject: [Gluster-devel] [PATCH] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> <20190612112544.GA21465@deco.navytux.spb.ru> <20190612141220.GA25389@deco.navytux.spb.ru> Message-ID: <20190612170259.GA27637@deco.navytux.spb.ru> On Wed, Jun 12, 2019 at 06:28:17PM +0200, Sander Eikelenboom wrote: > On 12/06/2019 16:12, Kirill Smelkov wrote: > > On Wed, Jun 12, 2019 at 03:03:49PM +0200, Sander Eikelenboom wrote: > >> On 12/06/2019 13:25, Kirill Smelkov wrote: > >>> On Wed, Jun 12, 2019 at 09:44:49AM +0200, Miklos Szeredi wrote: > >>>> On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: > >>>> > >>>>> Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for > >>>>> header room change be accepted? > >>>> > >>>> Yes, next cycle. For 4.2 I'll just push the revert. > >>> > >>> Thanks Miklos. Please consider queuing the following patch for 5.3. > >>> Sander, could you please confirm that glusterfs is not broken with this > >>> version of the check? > >>> > >>> Thanks beforehand, > >>> Kirill > >> > >> > >> Hmm unfortunately it doesn't build, see below. > >> [...] > >> fs/fuse/dev.c:1336:14: error: ?fuse_in_header? undeclared (first use in this function) > >> sizeof(fuse_in_header) + sizeof(fuse_write_in) + fc->max_write)) > > > > Sorry, my bad, it was missing "struct" before fuse_in_header. I > > originally compile-tested the patch with `make -j4`, was distracted onto > > other topic and did not see the error after returning due to long tail > > of successful CC lines. Apologize for the inconvenience. Below is a > > fixed patch that was both compile-tested and runtime-tested with my FUSE > > workloads (non-glusterfs). > > > > Kirill > > > > Just tested and it works for me, thanks ! Thanks for feedback. Kirill From atumball at redhat.com Wed Jun 12 17:41:09 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Wed, 12 Jun 2019 23:11:09 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190612151142.GL8725@ndevos-x270> References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Wed, Jun 12, 2019 at 8:42 PM Niels de Vos wrote: > On Wed, Jun 12, 2019 at 07:54:17PM +0530, Hari Gowtham wrote: > > We haven't sent any patch to fix it. > > Waiting for the decision to be made. 
> > The bz: https://bugzilla.redhat.com/show_bug.cgi?id=1719778 > > The link to the build log: > > > https://build.gluster.org/job/strfmt_errors/18888/artifact/RPMS/el6/i686/build.log > > > > The last few messages in the log: > > > > config.status: creating xlators/features/changelog/lib/src/Makefile > > config.status: creating xlators/features/changetimerecorder/Makefile > > config.status: creating xlators/features/changetimerecorder/src/Makefile > > BUILDSTDERR: config.status: error: cannot find input file: > > xlators/features/glupy/Makefile.in > > RPM build errors: > > BUILDSTDERR: error: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) > > BUILDSTDERR: Bad exit status from /var/tmp/rpm-tmp.kGZI5V (%build) > > Child return code was: 1 > > EXCEPTION: [Error()] > > Traceback (most recent call last): > > File "/usr/lib/python3.6/site-packages/mockbuild/trace_decorator.py", > > line 96, in trace > > result = func(*args, **kw) > > File "/usr/lib/python3.6/site-packages/mockbuild/util.py", line 736, > > in do_with_status > > raise exception.Error("Command failed: \n # %s\n%s" % (command, > > output), child.returncode) > > mockbuild.exception.Error: Command failed: > > # bash --login -c /usr/bin/rpmbuild -bb --target i686 --nodeps > > /builddir/build/SPECS/glusterfs.spec > > Those messages are caused by missing files. The 'make dist' that > generates the tarball in the previous step did not included the glupy > files. > > https://build.gluster.org/job/strfmt_errors/18888/console contains the > following message: > > configure: WARNING: > > --------------------------------------------------------------------------------- > cannot build glupy. python 3.6 and python-devel/python-dev > package are required. > > --------------------------------------------------------------------------------- > > I am not sure if there have been any recent backports to release-5 that > introduced this behaviour. Maybe it is related to the builder where the > tarball is generated. The job seems to detect python-3.6.8, which is not > included in CentOS-7 for all I know? > > We recently noticed that in one of the package update on builder (ie, centos7.x machines), python3.6 got installed as a dependency. So, yes, it is possible to have python3 in centos7 now. -Amar > Maybe someone else understands how this can happen? > > HTH, > Niels > > > > > > On Wed, Jun 12, 2019 at 7:04 PM Niels de Vos wrote: > > > > > > On Wed, Jun 12, 2019 at 02:44:04PM +0530, Hari Gowtham wrote: > > > > Hi, > > > > > > > > Due to the recent changes we made. we have a build issue because of > glupy. > > > > As glupy is already removed from master, we are thinking of removing > > > > it in 5.7 as well rather than fixing the issue. > > > > > > > > The release of 5.7 will be delayed as we have send a patch to fix > this issue. > > > > And if anyone has any concerns, do let us know. > > > > > > Could you link to the BZ with the build error and patches that attempt > > > fixing it? > > > > > > We normally do not remove features with minor updates. Fixing the build > > > error would be the preferred approach. > > > > > > Thanks, > > > Niels > > > > > > > > -- > > Regards, > > Hari Gowtham. 
> _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From kkeithle at redhat.com Wed Jun 12 18:36:17 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Wed, 12 Jun 2019 11:36:17 -0700 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > > We recently noticed that in one of the package update on builder (ie, > centos7.x machines), python3.6 got installed as a dependency. So, yes, it > is possible to have python3 in centos7 now. > EPEL updated from python34 to python36 recently, but C7 doesn't have python3 in the base. I don't think we've ever used EPEL packages for building. And GlusterFS-5 isn't python3 ready. -- Kaleb -------------- next part -------------- An HTML attachment was scrubbed... URL: From kkeithle at redhat.com Wed Jun 12 23:09:55 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Wed, 12 Jun 2019 16:09:55 -0700 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley wrote: > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > atumball at redhat.com> wrote: > >> >> We recently noticed that in one of the package update on builder (ie, >> centos7.x machines), python3.6 got installed as a dependency. So, yes, it >> is possible to have python3 in centos7 now. >> > > EPEL updated from python34 to python36 recently, but C7 doesn't have > python3 in the base. I don't think we've ever used EPEL packages for > building. > > And GlusterFS-5 isn't python3 ready. > Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, python33 is available on both RHEL7 and CentOS7 from the Software Collection Library (SCL), and python34 and now python36 are available from EPEL. But packages built for the CentOS Storage SIG have never used the SCL or EPEL (EPEL not allowed) and the shebangs in the .py files are converted from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. All the python dependencies for the packages remain the python2 flavors. AFAIK the centos-regression machines ought to be building the same way. -- Kaleb -------------- next part -------------- An HTML attachment was scrubbed... URL: From atin.mukherjee83 at gmail.com Thu Jun 13 02:30:17 2019 From: atin.mukherjee83 at gmail.com (Atin Mukherjee) Date: Thu, 13 Jun 2019 08:00:17 +0530 Subject: [Gluster-devel] Fwd: Details on the Scan.coverity.com June 2019 Upgrade In-Reply-To: <656ea81285804a36a32a2e5aface86fc@1061282284> References: <656ea81285804a36a32a2e5aface86fc@1061282284> Message-ID: Fyi..no scan for 3-4 days starting from June 17th for the upgrade. Post that we may have to do some changes to use the new build tool? 
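For reference, a typical Coverity Scan submission with a refreshed build tool looks roughly like this (a sketch; the install path and archive name are assumptions, only the cov-build wrapper usage and the tarball upload step are standard):

    $ export PATH=/opt/cov-analysis-linux64/bin:$PATH   # wherever the new tool is unpacked
    $ cov-build --dir cov-int make -j4
    $ tar czf glusterfs-cov.tgz cov-int
    # then upload the archive through the project page on scan.coverity.com
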
---------- Forwarded message --------- From: Peter Degen-Portnoy Date: Thu, 13 Jun 2019 at 00:48 Subject: Details on the Scan.coverity.com June 2019 Upgrade June 17, 9 a.m. MDT View this email in a browser [image: Synopsys] Coverity Scan 2019 Upgrade Dear Atin Mukherjee, Thank you for being an active user of scan.coverity.com . We have some important news to share with you. As you know, the version of Coverity used by the Scan website is somewhat out of date. So we?re pleased to announce that we?re upgrading to the latest stable production version. We?re currently verifying the upgrade. Here?s what you can expect: We plan to start the upgrade *Monday, June 17, around 9 a.m. MDT*. We expect the process to last 3?4 days. During this time, scan.coverity.com may be offline and unavailable. If possible, we?ll provide access to scan.coverity.com in read-only mode. After the upgrade, you should use the new Build tool that matches the upgraded version of Coverity. Specifically, the build tool from Coverity 8.7 will no longer be supported. You can find details about the upgrade and the new build tool on the Scan Status Community page. You can also subscribe to scan.coverity.com status updates on this page by clicking the ?Follow? button and selecting ?Every Post.? Please take a look at the information on the Scan Status Community page. If you have any questions about the upgrade, post them on the Synopsys Software Integrity Community . We?ll answer as soon as we can. Sincerely yours, The Scan Administrators scan-admin at coverity.com Follow ? 2019 Synopsys, Inc. All Rights Reserved 690 E Middlefield Rd, Mountain View, CA 94043 About Privacy Unsubscribe -- --Atin -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Thu Jun 13 03:13:25 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 13 Jun 2019 08:43:25 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Thu, Jun 13, 2019 at 4:41 AM Kaleb Keithley wrote: > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley > wrote: > >> >> On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < >> atumball at redhat.com> wrote: >> >>> >>> We recently noticed that in one of the package update on builder (ie, >>> centos7.x machines), python3.6 got installed as a dependency. So, yes, it >>> is possible to have python3 in centos7 now. >>> >> >> EPEL updated from python34 to python36 recently, but C7 doesn't have >> python3 in the base. I don't think we've ever used EPEL packages for >> building. >> >> And GlusterFS-5 isn't python3 ready. >> > > Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, > python33 is available on both RHEL7 and CentOS7 from the Software > Collection Library (SCL), and python34 and now python36 are available from > EPEL. > > But packages built for the CentOS Storage SIG have never used the SCL or > EPEL (EPEL not allowed) and the shebangs in the .py files are converted > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. > All the python dependencies for the packages remain the python2 flavors. > AFAIK the centos-regression machines ought to be building the same way. > centos-regression machines have 'CentOS Linux release 7.6.1810 (Core)' and using python3.6. Looking at the tracebacks when compiling we confirmed that it is picking up python3.6 somehow. 
To resolve this issue either we can remove glupy from the release(which is dead anyways) or install glupy on the instances. > > -- > > Kaleb > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dkhandel at redhat.com Thu Jun 13 03:24:52 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 13 Jun 2019 08:54:52 +0530 Subject: [Gluster-devel] Fwd: Details on the Scan.coverity.com June 2019 Upgrade In-Reply-To: References: <656ea81285804a36a32a2e5aface86fc@1061282284> Message-ID: Thanks Atin for pointing this out. Yes, I'll upgrade the matching build tool on builders once coverity is upgraded on scan.coverity.com On Thu, Jun 13, 2019 at 8:01 AM Atin Mukherjee wrote: > Fyi..no scan for 3-4 days starting from June 17th for the upgrade. Post > that we may have to do some changes to use the new build tool? > > ---------- Forwarded message --------- > From: Peter Degen-Portnoy > Date: Thu, 13 Jun 2019 at 00:48 > Subject: Details on the Scan.coverity.com June 2019 Upgrade > > > June 17, 9 a.m. MDT > View this email in a browser > > > [image: Synopsys] > > Coverity Scan 2019 Upgrade > > > Dear Atin Mukherjee, > Thank you for being an active user of scan.coverity.com > . > We have some important news to share with you. > > As you know, the version of Coverity used by the Scan website is somewhat > out of date. So we?re pleased to announce that we?re upgrading to the > latest stable production version. > > We?re currently verifying the upgrade. Here?s what you can expect: > > We plan to start the upgrade *Monday, June 17, around 9 a.m. MDT*. We > expect the process to last 3?4 days. > > During this time, scan.coverity.com > > may be offline and unavailable. If possible, we?ll provide access to > scan.coverity.com > > in read-only mode. > > After the upgrade, you should use the new Build tool that matches the > upgraded version of Coverity. Specifically, the build tool from Coverity > 8.7 will no longer be supported. > > You can find details about the upgrade and the new build tool on the Scan > Status Community > > page. You can also subscribe to scan.coverity.com > > status updates on this page by clicking the ?Follow? button and selecting > ?Every Post.? > > Please take a look at the information on the Scan Status Community page. > If you have any questions about the upgrade, post them on the Synopsys > Software Integrity Community > . > We?ll answer as soon as we can. > > Sincerely yours, > The Scan Administrators > > scan-admin at coverity.com > > > Follow > > > > > > > > > > > ? 2019 Synopsys, Inc. 
All Rights Reserved > 690 E Middlefield Rd, Mountain View, CA 94043 > > > About > > Privacy > > Unsubscribe > > > > > -- > --Atin > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kkeithle at redhat.com Thu Jun 13 04:35:56 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Wed, 12 Jun 2019 21:35:56 -0700 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Wed, Jun 12, 2019 at 8:13 PM Deepshikha Khandelwal wrote: > > On Thu, Jun 13, 2019 at 4:41 AM Kaleb Keithley > wrote: > >> >> >> On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley >> wrote: >> >>> >>> On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < >>> atumball at redhat.com> wrote: >>> >>>> >>>> We recently noticed that in one of the package update on builder (ie, >>>> centos7.x machines), python3.6 got installed as a dependency. So, yes, it >>>> is possible to have python3 in centos7 now. >>>> >>> >>> EPEL updated from python34 to python36 recently, but C7 doesn't have >>> python3 in the base. I don't think we've ever used EPEL packages for >>> building. >>> >>> And GlusterFS-5 isn't python3 ready. >>> >> >> Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, >> python33 is available on both RHEL7 and CentOS7 from the Software >> Collection Library (SCL), and python34 and now python36 are available from >> EPEL. >> >> But packages built for the CentOS Storage SIG have never used the SCL or >> EPEL (EPEL not allowed) and the shebangs in the .py files are converted >> from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. >> All the python dependencies for the packages remain the python2 flavors. >> AFAIK the centos-regression machines ought to be building the same way. >> > > centos-regression machines have 'CentOS Linux release 7.6.1810 (Core)' and > using python3.6. Looking at the tracebacks when compiling we confirmed that > it is picking up python3.6 somehow. > We need to figure out why? BTW, my CentOS 7 box is up to date and does not have any version of python3. I would have to use the SCL or EPEL to get it. What changed on June 5th? Between https://build.gluster.org/job/centos7-regression/6309/consoleFull and https://build.gluster.org/job/centos7-regression/6310/consoleFull? 6309 was the last centos-regression with python2.7. 6310 and all subsequent centos-regressions have been built with python3.6. Somebody added EPEL! Do we not have a record of the changes made and who made them? And BTW, this affects more than just glusterfs-5, it's affecting all versions: glusterfs-4.1, glusterfs-5, glusterfs-6, and master. > To resolve this issue either we can remove glupy from the release(which is > dead anyways) or install glupy on the instances. > Or you can resolve where python36 came from and undo the change that introduced it. At the risk of being repetitious ? reiterating what Niels said ? it's highly unusual to remove features in a bug fix update. It's also unusual to have switched to python3 on rhel7 like this. 
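One quick way to compare the two regression runs mentioned above (6309 vs 6310) and spot where the interpreter changed (a sketch only; the grep pattern is just an example):

    $ curl -s https://build.gluster.org/job/centos7-regression/6309/consoleFull | grep -i python > 6309.txt
    $ curl -s https://build.gluster.org/job/centos7-regression/6310/consoleFull | grep -i python > 6310.txt
    $ diff -u 6309.txt 6310.txt
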
Was there any discussion of such a change? If there was I seem to have missed it. I suggest figuring out where python3.6 on rhel7 came from. Fix that first. Removing glupy is a bandaid over a unrelated problem. Once the real problem is fixed then there can be a separate discussion about removing the glupy feature in glusterfs-5. -- Kaleb -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndevos at redhat.com Thu Jun 13 09:08:25 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 13 Jun 2019 11:08:25 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: <20190613090825.GN8725@ndevos-x270> On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley wrote: > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > atumball at redhat.com> wrote: > > > >> > >> We recently noticed that in one of the package update on builder (ie, > >> centos7.x machines), python3.6 got installed as a dependency. So, yes, it > >> is possible to have python3 in centos7 now. > >> > > > > EPEL updated from python34 to python36 recently, but C7 doesn't have > > python3 in the base. I don't think we've ever used EPEL packages for > > building. > > > > And GlusterFS-5 isn't python3 ready. > > > > Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, > python33 is available on both RHEL7 and CentOS7 from the Software > Collection Library (SCL), and python34 and now python36 are available from > EPEL. > > But packages built for the CentOS Storage SIG have never used the SCL or > EPEL (EPEL not allowed) and the shebangs in the .py files are converted > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. > All the python dependencies for the packages remain the python2 flavors. > AFAIK the centos-regression machines ought to be building the same way. Indeed, there should not be a requirement on having EPEL enabled on the CentOS-7 builders. At least not for the building of the glusterfs tarball. We still need to do releases of glusterfs-4.1 and glusterfs-5, until then it is expected to have python2 as the (only?) version for the system. Is it possible to remove python3 from the CentOS-7 builders and run the jobs that require python3 on the Fedora builders instead? I guess we could force the release-4.1 and release-5 branches to use python2 only. This might be done by exporting PYTHON=/usr/bin/python2 in the environment where './configure' is run. That would likely require changes to multiple Jenkins jobs... Niels From dkhandel at redhat.com Thu Jun 13 09:22:06 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 13 Jun 2019 14:52:06 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Thu, Jun 13, 2019 at 10:06 AM Kaleb Keithley wrote: > > > On Wed, Jun 12, 2019 at 8:13 PM Deepshikha Khandelwal > wrote: > >> >> On Thu, Jun 13, 2019 at 4:41 AM Kaleb Keithley >> wrote: >> >>> >>> >>> On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley >>> wrote: >>> >>>> >>>> On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < >>>> atumball at redhat.com> wrote: >>>> >>>>> >>>>> We recently noticed that in one of the package update on builder (ie, >>>>> centos7.x machines), python3.6 got installed as a dependency. 
So, yes, it >>>>> is possible to have python3 in centos7 now. >>>>> >>>> >>>> EPEL updated from python34 to python36 recently, but C7 doesn't have >>>> python3 in the base. I don't think we've ever used EPEL packages for >>>> building. >>>> >>>> And GlusterFS-5 isn't python3 ready. >>>> >>> >>> Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, >>> python33 is available on both RHEL7 and CentOS7 from the Software >>> Collection Library (SCL), and python34 and now python36 are available from >>> EPEL. >>> >>> But packages built for the CentOS Storage SIG have never used the SCL or >>> EPEL (EPEL not allowed) and the shebangs in the .py files are converted >>> from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. >>> All the python dependencies for the packages remain the python2 flavors. >>> AFAIK the centos-regression machines ought to be building the same way. >>> >> >> centos-regression machines have 'CentOS Linux release 7.6.1810 (Core)' >> and using python3.6. Looking at the tracebacks when compiling we confirmed >> that it is picking up python3.6 somehow. >> > > We need to figure out why? BTW, my CentOS 7 box is up to date and does not > have any version of python3. I would have to use the SCL or EPEL to get it. > > What changed on June 5th? Between > https://build.gluster.org/job/centos7-regression/6309/consoleFull and > https://build.gluster.org/job/centos7-regression/6310/consoleFull? > Yes, you are right. OS got upgrade to newer version on 5th June. It has EPEL repo enabled for various other things. Can we make changes in configure.ac file (python version specific to the branches) rather than falling back to python2 for other branches too? > > 6309 was the last centos-regression with python2.7. 6310 and all > subsequent centos-regressions have been built with python3.6. > > Somebody added EPEL! Do we not have a record of the changes made and who > made them? > > And BTW, this affects more than just glusterfs-5, it's affecting all > versions: glusterfs-4.1, glusterfs-5, glusterfs-6, and master. > > >> To resolve this issue either we can remove glupy from the release(which >> is dead anyways) or install glupy on the instances. >> > > Or you can resolve where python36 came from and undo the change that > introduced it. > > At the risk of being repetitious ? reiterating what Niels said ? it's > highly unusual to remove features in a bug fix update. > > It's also unusual to have switched to python3 on rhel7 like this. Was > there any discussion of such a change? If there was I seem to have missed > it. > > I suggest figuring out where python3.6 on rhel7 came from. Fix that > first. Removing glupy is a bandaid over a unrelated problem. Once the real > problem is fixed then there can be a separate discussion about removing the > glupy feature in glusterfs-5. > > -- > > Kaleb > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ndevos at redhat.com Thu Jun 13 12:28:37 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 13 Jun 2019 14:28:37 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190613090825.GN8725@ndevos-x270> References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> Message-ID: <20190613122837.GS8725@ndevos-x270> On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley wrote: > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > atumball at redhat.com> wrote: > > > > > >> > > >> We recently noticed that in one of the package update on builder (ie, > > >> centos7.x machines), python3.6 got installed as a dependency. So, yes, it > > >> is possible to have python3 in centos7 now. > > >> > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't have > > > python3 in the base. I don't think we've ever used EPEL packages for > > > building. > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 ready. FWIW, > > python33 is available on both RHEL7 and CentOS7 from the Software > > Collection Library (SCL), and python34 and now python36 are available from > > EPEL. > > > > But packages built for the CentOS Storage SIG have never used the SCL or > > EPEL (EPEL not allowed) and the shebangs in the .py files are converted > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild %prep stage. > > All the python dependencies for the packages remain the python2 flavors. > > AFAIK the centos-regression machines ought to be building the same way. > > Indeed, there should not be a requirement on having EPEL enabled on the > CentOS-7 builders. At least not for the building of the glusterfs > tarball. We still need to do releases of glusterfs-4.1 and glusterfs-5, > until then it is expected to have python2 as the (only?) version for the > system. Is it possible to remove python3 from the CentOS-7 builders and > run the jobs that require python3 on the Fedora builders instead? Actually, if the python-devel package for python3 is installed on the CentOS-7 builders, things may work too. It still feels like some sort of Frankenstein deployment, and we don't expect to this see in production environments. But maybe this is a workaround in case something really, really, REALLY depends on python3 on the builders. Niels From kkeithle at redhat.com Thu Jun 13 12:55:22 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Thu, 13 Jun 2019 05:55:22 -0700 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: On Thu, Jun 13, 2019 at 2:22 AM Deepshikha Khandelwal wrote: > On Thu, Jun 13, 2019 at 10:06 AM Kaleb Keithley > wrote: > >> On Wed, Jun 12, 2019 at 8:13 PM Deepshikha Khandelwal < >> dkhandel at redhat.com> wrote: >> >>> On Thu, Jun 13, 2019 at 4:41 AM Kaleb Keithley >>> wrote: >>> >>>> On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley >>>> wrote: >>>> >>>>> >>>>> >>>>> >> We need to figure out why? BTW, my CentOS 7 box is up to date and does >> not have any version of python3. I would have to use the SCL or EPEL to get >> it. >> >> What changed on June 5th? 
Between >> https://build.gluster.org/job/centos7-regression/6309/consoleFull and >> https://build.gluster.org/job/centos7-regression/6310/consoleFull? >> > Yes, you are right. OS got upgrade to newer version on 5th June. It has > EPEL repo enabled for various other things. > What other things? Are there not python2 versions of these things? That work just as well as the pyton3 versions? Adding EPEL and installing python3 on the centos boxes seems like a mistake to me, if only because it has broken the builds there. Was there any discussion of adding EPEL and python3 ? I don't recall seeing any. But since EPEL was added, one possible work-around would be to do the build in mock. -- Kaleb -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Thu Jun 13 13:36:03 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 13 Jun 2019 15:36:03 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> Message-ID: Le jeudi 13 juin 2019 ? 05:55 -0700, Kaleb Keithley a ?crit : > On Thu, Jun 13, 2019 at 2:22 AM Deepshikha Khandelwal < > dkhandel at redhat.com> > wrote: > > > On Thu, Jun 13, 2019 at 10:06 AM Kaleb Keithley < > > kkeithle at redhat.com> > > wrote: > > > > > On Wed, Jun 12, 2019 at 8:13 PM Deepshikha Khandelwal < > > > dkhandel at redhat.com> wrote: > > > > > > > On Thu, Jun 13, 2019 at 4:41 AM Kaleb Keithley < > > > > kkeithle at redhat.com> > > > > wrote: > > > > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > kkeithle at redhat.com> > > > > > wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > We need to figure out why? BTW, my CentOS 7 box is up to date and > > > does > > > not have any version of python3. I would have to use the SCL or > > > EPEL to get > > > it. > > > > > > What changed on June 5th? Between > > > https://build.gluster.org/job/centos7-regression/6309/consoleFull > > > and > > > https://build.gluster.org/job/centos7-regression/6310/consoleFull > > > ? > > > > > > > Yes, you are right. OS got upgrade to newer version on 5th June. It > > has > > EPEL repo enabled for various other things. > > > > What other things? Are there not python2 versions of these things? > That work just as well as the pyton3 versions? Mock pull python3: [root at builder11 ~]# LC_ALL=C rpm -e --test python36-rpm error: Failed dependencies: python36-rpm is needed by (installed) mock-1.4.16-1.el7.noarch And I think it would be ill advised to backport a EOL version of mock with python 2 on the builders. > Adding EPEL and installing python3 on the centos boxes seems like a > mistake to me, if only because it has broken the builds there. Was > there any discussion of adding EPEL and python3 ? I don't recall > seeing any. We have EPEL for: - munin - nagios - golang - clang, cppcheck - mock - nginx nginx could be removed now (that's kinda legacy). The rest look like very much stuff we use, so I think EPEL is here to stay. > But since EPEL was added, one possible work-around would be to do the > build > in mock. The detection logic is hitting a corner case (granted, that's one that changed under people feet). There is a 2 step approach: - detect the most recent version of python - verify that there is headers for that python version But having python 3 do not mean we want to use that one for building. 
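A rough shell sketch of such a header-aware pick (illustrative only; the candidate list is an assumption and this is not the project's actual configure logic):

    # walk candidate interpreters from newest to oldest and keep the first one
    # whose development headers (Python.h) are actually installed
    PYTHON=
    for candidate in python3.6 python3 python2.7 python2; do
        command -v "$candidate" >/dev/null 2>&1 || continue
        inc=$("$candidate" -c 'import sysconfig; print(sysconfig.get_paths()["include"])' 2>/dev/null)
        if [ -n "$inc" ] && [ -f "$inc/Python.h" ]; then
            PYTHON=$(command -v "$candidate")
            break
        fi
    done
    echo "building against: ${PYTHON:-no python with headers found}"
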
So I think the right autodetection would be to - list all version of python - take the most recent one with -devel installed (eg, 1 loop that check 2 things, instead of 1 loop for version, and a check after). Or, as a work around, we should be explicit on the python version with a configure switch, so we can be sure we test and build the right one, since the autodetection hit a corner case. -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From mscherer at redhat.com Thu Jun 13 13:55:21 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 13 Jun 2019 15:55:21 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190613122837.GS8725@ndevos-x270> References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> Message-ID: <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > builder (ie, > > > > > centos7.x machines), python3.6 got installed as a dependency. > > > > > So, yes, it > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't > > > > have > > > > python3 in the base. I don't think we've ever used EPEL > > > > packages for > > > > building. > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > ready. FWIW, > > > python33 is available on both RHEL7 and CentOS7 from the Software > > > Collection Library (SCL), and python34 and now python36 are > > > available from > > > EPEL. > > > > > > But packages built for the CentOS Storage SIG have never used the > > > SCL or > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > converted > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > %prep stage. > > > All the python dependencies for the packages remain the python2 > > > flavors. > > > AFAIK the centos-regression machines ought to be building the > > > same way. > > > > Indeed, there should not be a requirement on having EPEL enabled on > > the > > CentOS-7 builders. At least not for the building of the glusterfs > > tarball. We still need to do releases of glusterfs-4.1 and > > glusterfs-5, > > until then it is expected to have python2 as the (only?) version > > for the > > system. Is it possible to remove python3 from the CentOS-7 builders > > and > > run the jobs that require python3 on the Fedora builders instead? > > Actually, if the python-devel package for python3 is installed on the > CentOS-7 builders, things may work too. It still feels like some sort > of > Frankenstein deployment, and we don't expect to this see in > production > environments. 
But maybe this is a workaround in case something really, really, REALLY depends on python3 on the builders.

To be honest, people would be surprised by what happens in production out there (sysadmins tend to talk among themselves; we all have horror stories of stuff that was supposed to be cleaned up and wasn't, etc.). After all, "frankenstein deployment now" is better than "perfect later", especially since lots of IT departments are under constant pressure (so it ends up being more "perfect never"). I can understand that we want clean and simple code (who doesn't), but real life is much messier than we want to admit, so we need something robust. -- Michael Scherer Sysadmin, Community Infrastructure
-------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL:
From dkhandel at redhat.com Fri Jun 14 06:45:40 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Fri, 14 Jun 2019 12:15:40 +0530 Subject: [Gluster-devel] Gerrit is down Message-ID: Hello, review.gluster.org has been down since morning. We are looking into the issue and will update once it is back.
-------------- next part -------------- An HTML attachment was scrubbed... URL:
From mscherer at redhat.com Fri Jun 14 08:10:42 2019 From: mscherer at redhat.com (Michael Scherer) Date: Fri, 14 Jun 2019 10:10:42 +0200 Subject: [Gluster-devel] DNS issue on review.gluster.org, causing outage Message-ID: <6177518854b017c8e7389f402bb61c11695677ee.camel@redhat.com> Hi, there is an ongoing issue with review.gluster.org: some people are being directed to the wrong server. A quick fix is to add: 8.43.85.171 review.gluster.org to /etc/hosts (on Linux). Adding an MX record yesterday (due to a RH IT request) resulted in the domain name having 2 IP addresses, one pointing to supercolony (the MX) and one to the gerrit server. That is neither the intention nor what is supposed to happen, so I kinda suspect that's a bug somewhere (or a corner case, or me misreading the RFC). Investigation is ongoing.
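A couple of generic checks for the symptom described above (standard tools only; nothing project-specific is assumed):

    $ dig +short review.gluster.org A     # 8.43.85.171 is the gerrit server mentioned above
    $ getent hosts review.gluster.org     # what the local resolver actually returns
    # temporary client-side pin, as described above:
    $ echo '8.43.85.171 review.gluster.org' | sudo tee -a /etc/hosts
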
> > Review.gluster.org may still work for some people (like, it work for > me > and still work for me), hence why it wasn't noticed while I tested, > apology for that. Ok so the issue should now be fixed. See https://bugzilla.redhat.com/show_bug.cgi?id=1720453 (sorry forgot to send the email about the fix, too focused on the post mortem) -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From atumball at redhat.com Fri Jun 14 11:46:00 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 14 Jun 2019 17:16:00 +0530 Subject: [Gluster-devel] Seems like Smoke job is not voting Message-ID: I see patches starting from 10:45 AM IST (7hrs before) are not getting smoke votes. For one of my patch, the smoke job is not triggered at all IMO. https://review.gluster.org/#/c/22863/ Would be good to check it. Regards, Amar -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Fri Jun 14 11:50:46 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 14 Jun 2019 17:20:46 +0530 Subject: [Gluster-devel] Seems like Smoke job is not voting In-Reply-To: References: Message-ID: Ok, guessed the possible cause. The same possible DNS issue with review.gluster.org could have prevented the patch fetching in smoke, and hence would have not triggered the job. Those of you who have patches not getting a smoke, please run 'recheck smoke' through comment. -Amar On Fri, Jun 14, 2019 at 5:16 PM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > I see patches starting from 10:45 AM IST (7hrs before) are not getting > smoke votes. > > For one of my patch, the smoke job is not triggered at all IMO. > > https://review.gluster.org/#/c/22863/ > > Would be good to check it. > > Regards, > Amar > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From avishwan at redhat.com Fri Jun 14 14:13:25 2019 From: avishwan at redhat.com (Aravinda) Date: Fri, 14 Jun 2019 19:43:25 +0530 Subject: [Gluster-devel] Project Update: Containers-based distributed tests runner Message-ID: <81e8277f799057f62b11b0cb353e83d408ed6028.camel@redhat.com> **gluster-tester** is a framework to run existing "*.t" test files in parallel using containers. Install and usage instructions are available in the following repository. https://github.com/aravindavk/gluster-tester ## Completed: - Create a base container image with all the dependencies installed. - Create a tester container image with requested refspec(or latest master) compiled and installed. - SSH setup in containers required to test Geo-replication - Take `--num-parallel` option and spawn the containers with ready infra for running tests - Split the tests based on the number of parallel jobs specified. - Execute the tests in parallel in each container and watch for the status. - Archive only failed tests(Optionally enable logs for successful tests using `--preserve-success-logs`) ## Pending: - NFS related tests are not running since the required changes are pending while creating the container image. 
(To know the failures run gluster-tester with `--include-nfs-tests` option) - Filter support while running the tests(To enable/disable tests on the run time) - Some Loop based tests are failing(I think due to shared `/dev/loop*`) - A few tests are timing out(Due to this overall test duration is more) - Once tests are started, showing real-time status is pending(Now status is checked in `/regression-.log` for example `/var/log/gluster-tester/regression-3.log` - If the base image is not built before running tests, it gives an error. Need to re-trigger the base container image step if not built. (Issue: https://github.com/aravindavk/gluster-tester/issues/11) - Creating an archive of core files - Creating a single archive from all jobs/containers - Testing `--ignore-from` feature to ignore the tests - Improvements to the status output - Cleanup(Stop test containers, and delete) I opened an issue to collect the details of failed tests. I will continue to update that issue as and when I capture failed tests in my setup. https://github.com/aravindavk/gluster-tester/issues/9 Feel free to suggest any feature improvements. Contributions are welcome. https://github.com/aravindavk/gluster-tester/issues -- Regards Aravinda http://aravindavk.in From rkothiya at redhat.com Fri Jun 14 15:38:10 2019 From: rkothiya at redhat.com (Rinku Kothiya) Date: Fri, 14 Jun 2019 21:08:10 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Release 7: Regression health for release-6.next and release-7 Message-ID: Hi Team, As part of branching preparation next week for release-7, please find test failures and respective test links here. The top tests that are failing are as below and need attention, ./tests/bugs/gfapi/bug-1319374-THIS-crash.t ./tests/basic/uss.t ./tests/basic/volfile-sanity.t ./tests/basic/quick-read-with-upcall.t ./tests/basic/afr/tarissue.t ./tests/features/subdir-mount.t ./tests/basic/ec/self-heal.t ./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t ./tests/bugs/glusterd/optimized-basic-testcases-in-cluster.t ./tests/basic/afr/split-brain-favorite-child-policy.t ./tests/basic/distribute/non-root-unlink-stale-linkto.t ./tests/bugs/protocol/bug-1433815-auth-allow.t ./tests/basic/afr/arbiter-mount.t ./tests/basic/all_squash.t ./tests/bugs/glusterd/mgmt-handshake-and-volume-sync-post-glusterd-restart.t ./tests/basic/volume-snapshot-clone.t ./tests/bugs/glusterd/serialize-shd-manager-glusterd-restart.t ./tests/basic/gfapi/upcall-register-api.t Nightly build for this month : https://build.gluster.org/job/nightly-master/ Gluster test failure tracker : https://fstat.gluster.org/summary?start_date=2019-05-15&end_date=2019-06-14 Please file a bug if needed against the test case and report the same here, in case a problem is already addressed, then do send back the patch details that addresses this issue as a response to this mail. Regards Rinku -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Fri Jun 14 17:15:29 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Fri, 14 Jun 2019 22:45:29 +0530 Subject: [Gluster-devel] Quick Update: a cleanup patch and subsequent build failures Message-ID: I just merged a larger cleanup patch, which I felt is good to get in, but due to the order of its parents when it passed the regression and smoke, and the other patches which got merged in same time, we hit a compile issue for 'undefined functions'. 
Below patch fixes it: ---- glfs: add syscall.h after header cleanup in one of the recent patches, we cleaned-up the unneccesary header file includes. In the order of merging the patches, there cropped up an compile error. updates: bz#1193929 Change-Id: I2ad52aa918f9c698d5273bb293838de6dd50ac31 Signed-off-by: Amar Tumballi diff --git a/api/src/glfs.c b/api/src/glfs.c index b0db866441..0771e074d6 100644 --- a/api/src/glfs.c +++ b/api/src/glfs.c @@ -45,6 +45,7 @@ #include #include "rpc-clnt.h" #include +#include #include "gfapi-messages.h" #include "glfs.h" ----- The patch has been pushed to repository, as it is causing critical compile error right now. and if you have a build error, please fetch the latest master to fix the the issue. Regards, Amar -------------- next part -------------- An HTML attachment was scrubbed... URL: From jenkins at build.gluster.org Mon Jun 17 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 17 Jun 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <690551110.11.1560735902548.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 7 lines...] https://bugzilla.redhat.com/1719778 / core: build fails for every patch on release 5 https://bugzilla.redhat.com/1714851 / core: issues with 'list.h' elements in clang-scan https://bugzilla.redhat.com/1718734 / core: Memory leak in glusterfsd process https://bugzilla.redhat.com/1719290 / glusterd: Glusterfs mount helper script not working with IPv6 because of regular expression or man is wrong https://bugzilla.redhat.com/1718741 / glusterfind: GlusterFS having high CPU https://bugzilla.redhat.com/1716875 / gluster-smb: Inode Unref Assertion failed: inode->ref https://bugzilla.redhat.com/1716455 / gluster-smb: OS X error -50 when creating sub-folder on Samba share when using Gluster VFS https://bugzilla.redhat.com/1716440 / gluster-smb: SMBD thread panics when connected to from OS X machine https://bugzilla.redhat.com/1720733 / libglusterfsclient: glusterfs 4.1.7 client crash https://bugzilla.redhat.com/1714895 / libglusterfsclient: Glusterfs(fuse) client crash https://bugzilla.redhat.com/1717824 / locks: Fencing: Added the tcmu-runner ALUA feature support but after one of node is rebooted the glfs_file_lock() get stucked https://bugzilla.redhat.com/1718562 / locks: flock failure (regression) https://bugzilla.redhat.com/1719174 / project-infrastructure: broken regression link? https://bugzilla.redhat.com/1719388 / project-infrastructure: infra: download.gluster.org /var/www/html/... is out of free space https://bugzilla.redhat.com/1720453 / project-infrastructure: Unable to access review.gluster.org https://bugzilla.redhat.com/1718227 / scripts: SELinux context labels are missing for newly added bricks using add-brick command [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... Name: build.log Type: application/octet-stream Size: 2050 bytes Desc: not available URL: From atumball at redhat.com Mon Jun 17 06:27:42 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Mon, 17 Jun 2019 11:57:42 +0530 Subject: [Gluster-devel] Project Update: Containers-based distributed tests runner In-Reply-To: <81e8277f799057f62b11b0cb353e83d408ed6028.camel@redhat.com> References: <81e8277f799057f62b11b0cb353e83d408ed6028.camel@redhat.com> Message-ID: This is a nice way to validate the patch for us. Question I have is did we measure the time benefit of running them in parallel with containers? 
Would be great to see the result in getting this tested in a cloud env, with 5 parallel threads and 10 parallel threads. -Amar On Fri, Jun 14, 2019 at 7:44 PM Aravinda wrote: > **gluster-tester** is a framework to run existing "*.t" test files in > parallel using containers. > > Install and usage instructions are available in the following > repository. > > https://github.com/aravindavk/gluster-tester > > ## Completed: > - Create a base container image with all the dependencies installed. > - Create a tester container image with requested refspec(or latest > master) compiled and installed. > - SSH setup in containers required to test Geo-replication > - Take `--num-parallel` option and spawn the containers with ready > infra for running tests > - Split the tests based on the number of parallel jobs specified. > - Execute the tests in parallel in each container and watch for the > status. > - Archive only failed tests(Optionally enable logs for successful tests > using `--preserve-success-logs`) > > ## Pending: > - NFS related tests are not running since the required changes are > pending while creating the container image. (To know the failures run > gluster-tester with `--include-nfs-tests` option) > - Filter support while running the tests(To enable/disable tests on the > run time) > - Some Loop based tests are failing(I think due to shared `/dev/loop*`) > - A few tests are timing out(Due to this overall test duration is more) > - Once tests are started, showing real-time status is pending(Now > status is checked in `/regression-.log` for example > `/var/log/gluster-tester/regression-3.log` > - If the base image is not built before running tests, it gives an > error. Need to re-trigger the base container image step if not built. > (Issue: https://github.com/aravindavk/gluster-tester/issues/11) > - Creating an archive of core files > - Creating a single archive from all jobs/containers > - Testing `--ignore-from` feature to ignore the tests > - Improvements to the status output > - Cleanup(Stop test containers, and delete) > > I opened an issue to collect the details of failed tests. I will > continue to update that issue as and when I capture failed tests in my > setup. > https://github.com/aravindavk/gluster-tester/issues/9 > > Feel free to suggest any feature improvements. Contributions are > welcome. > https://github.com/aravindavk/gluster-tester/issues > > -- > Regards > Aravinda > http://aravindavk.in > > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From khiremat at redhat.com Mon Jun 17 11:50:09 2019 From: khiremat at redhat.com (Kotresh Hiremath Ravishankar) Date: Mon, 17 Jun 2019 17:20:09 +0530 Subject: [Gluster-devel] Solving Ctime Issue with legacy files [BUG 1593542] Message-ID: Hi All, The ctime feature is enabled by default from release gluster-6. But as explained in bug [1] there is a known issue with legacy files i.e., the files which are created before ctime feature is enabled. These files would not have "trusted.glusterfs.mdata" xattr which maintain time attributes. 
So on, accessing those files, it gets created with latest time attributes. This is not correct because all the time attributes (atime, mtime, ctime) get updated instead of required time attributes. There are couple of approaches to solve this. 1. On accessing the files, let the posix update the time attributes from the back end file on respective replicas. This obviously results in inconsistent "trusted.glusterfs.mdata" xattr values with in replica set. AFR/EC should heal this xattr as part of metadata heal upon accessing this file. It can chose to replicate from any subvolume. Ideally we should consider the highest time from the replica and treat it as source but I think that should be fine as replica time attributes are mostly in sync with max difference in order of few seconds if am not wrong. But client side self heal is disabled by default because of performance reasons [2]. If we chose to go by this approach, we need to consider enabling at least client side metadata self heal by default. Please share your thoughts on enabling the same by default. 2. Don't let posix update the legacy files from the backend. On lookup cbk, let the utime xlator update the time attributes from statbuf received synchronously. Both approaches are similar as both results in updating the xattr during lookup. Please share your inputs on which approach is better. [1] https://bugzilla.redhat.com/show_bug.cgi?id=1593542 [2] https://github.com/gluster/glusterfs/issues/473 -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: From jahernan at redhat.com Mon Jun 17 12:08:20 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Mon, 17 Jun 2019 14:08:20 +0200 Subject: [Gluster-devel] Solving Ctime Issue with legacy files [BUG 1593542] In-Reply-To: References: Message-ID: Hi Kotresh, On Mon, Jun 17, 2019 at 1:50 PM Kotresh Hiremath Ravishankar < khiremat at redhat.com> wrote: > Hi All, > > The ctime feature is enabled by default from release gluster-6. But as > explained in bug [1] there is a known issue with legacy files i.e., the > files which are created before ctime feature is enabled. These files would > not have "trusted.glusterfs.mdata" xattr which maintain time attributes. So > on, accessing those files, it gets created with latest time attributes. > This is not correct because all the time attributes (atime, mtime, ctime) > get updated instead of required time attributes. > > There are couple of approaches to solve this. > > 1. On accessing the files, let the posix update the time attributes from > the back end file on respective replicas. This obviously results in > inconsistent "trusted.glusterfs.mdata" xattr values with in replica set. > AFR/EC should heal this xattr as part of metadata heal upon accessing this > file. It can chose to replicate from any subvolume. Ideally we should > consider the highest time from the replica and treat it as source but I > think that should be fine as replica time attributes are mostly in sync > with max difference in order of few seconds if am not wrong. > > But client side self heal is disabled by default because of performance > reasons [2]. If we chose to go by this approach, we need to consider > enabling at least client side metadata self heal by default. Please share > your thoughts on enabling the same by default. > > 2. Don't let posix update the legacy files from the backend. On lookup > cbk, let the utime xlator update the time attributes from statbuf received > synchronously. 
> > Both approaches are similar as both results in updating the xattr during > lookup. Please share your inputs on which approach is better. > I prefer second approach. First approach is not feasible for EC volumes because self-heal requires that k bricks (on a k+r configuration) agree on the value of this xattr, otherwise it considers the metadata damaged and needs manual intervention to fix it. During upgrade, first r bricks with be upgraded without problems, but trusted.glusterfs.mdata won't be healed because r < k. In fact this xattr will be removed from new bricks because the majority of bricks agree on xattr not being present. Once the r+1 brick is upgraded, it's possible that posix sets different values for trusted.glusterfs.mdata, which will cause self-heal to fail. Second approach seems better to me if guarded by a new option that enables this behavior. utime xlator should only update the mdata xattr if that option is set, and that option should only be settable once all nodes have been upgraded (controlled by op-version). In this situation the first lookup on a file where utime detects that mdata is not set, will require a synchronous update. I think this is good enough because it will only happen once per file. We'll need to consider cases where different clients do lookups at the same time, but I think this can be easily solved by ignoring the request if mdata is already present. Xavi > > > [1] https://bugzilla.redhat.com/show_bug.cgi?id=1593542 > [2] https://github.com/gluster/glusterfs/issues/473 > > -- > Thanks and Regards, > Kotresh H R > -------------- next part -------------- An HTML attachment was scrubbed... URL: From khiremat at redhat.com Tue Jun 18 06:33:43 2019 From: khiremat at redhat.com (Kotresh Hiremath Ravishankar) Date: Tue, 18 Jun 2019 12:03:43 +0530 Subject: [Gluster-devel] Solving Ctime Issue with legacy files [BUG 1593542] In-Reply-To: References: Message-ID: Hi Xavi, Reply inline. On Mon, Jun 17, 2019 at 5:38 PM Xavi Hernandez wrote: > Hi Kotresh, > > On Mon, Jun 17, 2019 at 1:50 PM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> Hi All, >> >> The ctime feature is enabled by default from release gluster-6. But as >> explained in bug [1] there is a known issue with legacy files i.e., the >> files which are created before ctime feature is enabled. These files would >> not have "trusted.glusterfs.mdata" xattr which maintain time attributes. So >> on, accessing those files, it gets created with latest time attributes. >> This is not correct because all the time attributes (atime, mtime, ctime) >> get updated instead of required time attributes. >> >> There are couple of approaches to solve this. >> >> 1. On accessing the files, let the posix update the time attributes from >> the back end file on respective replicas. This obviously results in >> inconsistent "trusted.glusterfs.mdata" xattr values with in replica set. >> AFR/EC should heal this xattr as part of metadata heal upon accessing this >> file. It can chose to replicate from any subvolume. Ideally we should >> consider the highest time from the replica and treat it as source but I >> think that should be fine as replica time attributes are mostly in sync >> with max difference in order of few seconds if am not wrong. >> >> But client side self heal is disabled by default because of >> performance reasons [2]. If we chose to go by this approach, we need to >> consider enabling at least client side metadata self heal by default. 
>> Please share your thoughts on enabling the same by default. >> >> 2. Don't let posix update the legacy files from the backend. On lookup >> cbk, let the utime xlator update the time attributes from statbuf received >> synchronously. >> >> Both approaches are similar as both results in updating the xattr during >> lookup. Please share your inputs on which approach is better. >> > > I prefer second approach. First approach is not feasible for EC volumes > because self-heal requires that k bricks (on a k+r configuration) agree on > the value of this xattr, otherwise it considers the metadata damaged and > needs manual intervention to fix it. During upgrade, first r bricks with be > upgraded without problems, but trusted.glusterfs.mdata won't be healed > because r < k. In fact this xattr will be removed from new bricks because > the majority of bricks agree on xattr not being present. Once the r+1 brick > is upgraded, it's possible that posix sets different values for > trusted.glusterfs.mdata, which will cause self-heal to fail. > > Second approach seems better to me if guarded by a new option that enables > this behavior. utime xlator should only update the mdata xattr if that > option is set, and that option should only be settable once all nodes have > been upgraded (controlled by op-version). In this situation the first > lookup on a file where utime detects that mdata is not set, will require a > synchronous update. I think this is good enough because it will only happen > once per file. We'll need to consider cases where different clients do > lookups at the same time, but I think this can be easily solved by ignoring > the request if mdata is already present. > Initially there were two issues. 1. Upgrade Issue with EC Volume as described by you. This is solved with the patch [1]. There was a bug in ctime posix where it was creating xattr even when ctime is not set on client (during utimes system call). With patch [1], the behavior is that utimes system call will only update the "trusted.glusterfs.mdata" xattr if present else it won't create. The new xattr creation should only happen during entry operations (i.e create, mknod and others). So there won't be any problems with upgrade. I think we don't need new option dependent on op version if I am not wrong. 2. After upgrade, how do we update "trusted.glusterfs.mdata" xattr. This mail thread was for this. Here which approach is better? I understand from EC point of view the second approach is the best one. The question I had was, Can't EC treat 'trusted.glusterfs.mdata' as special xattr and add the logic to heal it from one subvolume (i.e. to remove the requirement of having to have consistent data on k subvolumes in k+r configuration). Second approach is independent of AFR and EC. So if we chose this, do we need new option to guard? If the upgrade steps is to upgrade server first and then client, we don't need to guard I think? > > Xavi > > >> >> >> [1] https://bugzilla.redhat.com/show_bug.cgi?id=1593542 >> [2] https://github.com/gluster/glusterfs/issues/473 >> >> -- >> Thanks and Regards, >> Kotresh H R >> > -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From rkothiya at redhat.com Tue Jun 18 06:37:11 2019 From: rkothiya at redhat.com (Rinku Kothiya) Date: Tue, 18 Jun 2019 12:07:11 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Release 7: Gentle Reminder, Regression health for release-6.next and release-7 In-Reply-To: References: Message-ID: Hi Team, We need to branch for release-7, but nightly builds failures are blocking this activity. Please find test failures and respective test links below : The top tests that are failing are as below and need attention, ./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t ./tests/bugs/gfapi/bug-1319374-THIS-crash.t ./tests/basic/distribute/non-root-unlink-stale-linkto.t ./tests/bugs/posix/bug-1040275-brick-uid-reset-on-volume-restart.t ./tests/features/subdir-mount.t ./tests/basic/ec/self-heal.t ./tests/basic/afr/tarissue.t ./tests/basic/all_squash.t ./tests/basic/ec/nfs.t ./tests/00-geo-rep/00-georep-verify-setup.t ./tests/basic/quota-rename.t ./tests/basic/volume-snapshot-clone.t Nightly build for this month : https://build.gluster.org/job/nightly-master/ Gluster test failure tracker : https://fstat.gluster.org/summary?start_date=2019-06-15&end_date=2019-06-18 Please file a bug if needed against the test case and report the same here, in case a problem is already addressed, then do send back the patch details that addresses this issue as a response to this mail. Regards Rinku On Fri, Jun 14, 2019 at 9:08 PM Rinku Kothiya wrote: > Hi Team, > > As part of branching preparation next week for release-7, please find > test failures and respective test links here. > > The top tests that are failing are as below and need attention, > > ./tests/bugs/gfapi/bug-1319374-THIS-crash.t > ./tests/basic/uss.t > ./tests/basic/volfile-sanity.t > ./tests/basic/quick-read-with-upcall.t > ./tests/basic/afr/tarissue.t > ./tests/features/subdir-mount.t > ./tests/basic/ec/self-heal.t > > ./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t > ./tests/bugs/glusterd/optimized-basic-testcases-in-cluster.t > ./tests/basic/afr/split-brain-favorite-child-policy.t > ./tests/basic/distribute/non-root-unlink-stale-linkto.t > ./tests/bugs/protocol/bug-1433815-auth-allow.t > ./tests/basic/afr/arbiter-mount.t > ./tests/basic/all_squash.t > > ./tests/bugs/glusterd/mgmt-handshake-and-volume-sync-post-glusterd-restart.t > ./tests/basic/volume-snapshot-clone.t > ./tests/bugs/glusterd/serialize-shd-manager-glusterd-restart.t > ./tests/basic/gfapi/upcall-register-api.t > > > Nightly build for this month : > https://build.gluster.org/job/nightly-master/ > > Gluster test failure tracker : > https://fstat.gluster.org/summary?start_date=2019-05-15&end_date=2019-06-14 > > Please file a bug if needed against the test case and report the same > here, in case a problem is already addressed, then do send back the > patch details that addresses this issue as a response to this mail. > > Regards > Rinku > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Tue Jun 18 06:42:07 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Tue, 18 Jun 2019 12:12:07 +0530 Subject: [Gluster-devel] [Gluster-Maintainers] Release 7: Gentle Reminder, Regression health for release-6.next and release-7 In-Reply-To: References: Message-ID: On Tue, Jun 18, 2019 at 12:07 PM Rinku Kothiya wrote: > Hi Team, > > We need to branch for release-7, but nightly builds failures are blocking > this activity. 
Please find test failures and respective test links below : > > The top tests that are failing are as below and need attention, > > > ./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t > ./tests/bugs/gfapi/bug-1319374-THIS-crash.t > Still an issue with many tests. > ./tests/basic/distribute/non-root-unlink-stale-linkto.t > Looks like this got fixed after https://review.gluster.org/22847 > ./tests/bugs/posix/bug-1040275-brick-uid-reset-on-volume-restart.t > ./tests/features/subdir-mount.t > Got fixed with https://review.gluster.org/22877 > ./tests/basic/ec/self-heal.t > ./tests/basic/afr/tarissue.t > I see random failures on this, not yet sure if this is setup issue, or a actual regression issue. > ./tests/basic/all_squash.t > ./tests/basic/ec/nfs.t > ./tests/00-geo-rep/00-georep-verify-setup.t > Most of the times, it fails if 'setup' is not complete to run geo-rep. > ./tests/basic/quota-rename.t > ./tests/basic/volume-snapshot-clone.t > > Nightly build for this month : > https://build.gluster.org/job/nightly-master/ > > Gluster test failure tracker : > https://fstat.gluster.org/summary?start_date=2019-06-15&end_date=2019-06-18 > > Please file a bug if needed against the test case and report the same > here, in case a problem is already addressed, then do send back the > patch details that addresses this issue as a response to this mail. > > Thanks! > Regards > Rinku > > > On Fri, Jun 14, 2019 at 9:08 PM Rinku Kothiya wrote: > >> Hi Team, >> >> As part of branching preparation next week for release-7, please find >> test failures and respective test links here. >> >> The top tests that are failing are as below and need attention, >> >> ./tests/bugs/gfapi/bug-1319374-THIS-crash.t >> ./tests/basic/uss.t >> ./tests/basic/volfile-sanity.t >> ./tests/basic/quick-read-with-upcall.t >> ./tests/basic/afr/tarissue.t >> ./tests/features/subdir-mount.t >> ./tests/basic/ec/self-heal.t >> >> ./tests/bugs/snapshot/bug-1482023-snpashot-issue-with-other-processes-accessing-mounted-path.t >> ./tests/bugs/glusterd/optimized-basic-testcases-in-cluster.t >> ./tests/basic/afr/split-brain-favorite-child-policy.t >> ./tests/basic/distribute/non-root-unlink-stale-linkto.t >> ./tests/bugs/protocol/bug-1433815-auth-allow.t >> ./tests/basic/afr/arbiter-mount.t >> ./tests/basic/all_squash.t >> >> ./tests/bugs/glusterd/mgmt-handshake-and-volume-sync-post-glusterd-restart.t >> ./tests/basic/volume-snapshot-clone.t >> ./tests/bugs/glusterd/serialize-shd-manager-glusterd-restart.t >> ./tests/basic/gfapi/upcall-register-api.t >> >> >> Nightly build for this month : >> https://build.gluster.org/job/nightly-master/ >> >> Gluster test failure tracker : >> >> https://fstat.gluster.org/summary?start_date=2019-05-15&end_date=2019-06-14 >> >> Please file a bug if needed against the test case and report the same >> here, in case a problem is already addressed, then do send back the >> patch details that addresses this issue as a response to this mail. >> >> Regards >> Rinku >> > _______________________________________________ > maintainers mailing list > maintainers at gluster.org > https://lists.gluster.org/mailman/listinfo/maintainers > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... 
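For anyone following the failure-tracker links in this thread, the short sketch below just assembles the fstat summary URL for a recent date window. The start_date/end_date query format is taken from the links quoted above; no other fstat parameters are assumed.

#!/usr/bin/env python3
# Convenience sketch: build the fstat.gluster.org summary URL for the last
# N days, using the start_date/end_date query format seen in this thread.
from datetime import date, timedelta

def fstat_summary_url(days_back=7, today=None):
    today = today or date.today()
    start = today - timedelta(days=days_back)
    return ("https://fstat.gluster.org/summary?start_date=%s&end_date=%s"
            % (start.isoformat(), today.isoformat()))

if __name__ == "__main__":
    print(fstat_summary_url(days_back=3))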
URL: From jahernan at redhat.com Tue Jun 18 06:58:15 2019 From: jahernan at redhat.com (Xavi Hernandez) Date: Tue, 18 Jun 2019 08:58:15 +0200 Subject: [Gluster-devel] Solving Ctime Issue with legacy files [BUG 1593542] In-Reply-To: References: Message-ID: Hi Kotresh, On Tue, Jun 18, 2019 at 8:33 AM Kotresh Hiremath Ravishankar < khiremat at redhat.com> wrote: > Hi Xavi, > > Reply inline. > > On Mon, Jun 17, 2019 at 5:38 PM Xavi Hernandez > wrote: > >> Hi Kotresh, >> >> On Mon, Jun 17, 2019 at 1:50 PM Kotresh Hiremath Ravishankar < >> khiremat at redhat.com> wrote: >> >>> Hi All, >>> >>> The ctime feature is enabled by default from release gluster-6. But as >>> explained in bug [1] there is a known issue with legacy files i.e., the >>> files which are created before ctime feature is enabled. These files would >>> not have "trusted.glusterfs.mdata" xattr which maintain time attributes. So >>> on, accessing those files, it gets created with latest time attributes. >>> This is not correct because all the time attributes (atime, mtime, ctime) >>> get updated instead of required time attributes. >>> >>> There are couple of approaches to solve this. >>> >>> 1. On accessing the files, let the posix update the time attributes >>> from the back end file on respective replicas. This obviously results in >>> inconsistent "trusted.glusterfs.mdata" xattr values with in replica set. >>> AFR/EC should heal this xattr as part of metadata heal upon accessing this >>> file. It can chose to replicate from any subvolume. Ideally we should >>> consider the highest time from the replica and treat it as source but I >>> think that should be fine as replica time attributes are mostly in sync >>> with max difference in order of few seconds if am not wrong. >>> >>> But client side self heal is disabled by default because of >>> performance reasons [2]. If we chose to go by this approach, we need to >>> consider enabling at least client side metadata self heal by default. >>> Please share your thoughts on enabling the same by default. >>> >>> 2. Don't let posix update the legacy files from the backend. On lookup >>> cbk, let the utime xlator update the time attributes from statbuf received >>> synchronously. >>> >>> Both approaches are similar as both results in updating the xattr during >>> lookup. Please share your inputs on which approach is better. >>> >> >> I prefer second approach. First approach is not feasible for EC volumes >> because self-heal requires that k bricks (on a k+r configuration) agree on >> the value of this xattr, otherwise it considers the metadata damaged and >> needs manual intervention to fix it. During upgrade, first r bricks with be >> upgraded without problems, but trusted.glusterfs.mdata won't be healed >> because r < k. In fact this xattr will be removed from new bricks because >> the majority of bricks agree on xattr not being present. Once the r+1 brick >> is upgraded, it's possible that posix sets different values for >> trusted.glusterfs.mdata, which will cause self-heal to fail. >> >> Second approach seems better to me if guarded by a new option that >> enables this behavior. utime xlator should only update the mdata xattr if >> that option is set, and that option should only be settable once all nodes >> have been upgraded (controlled by op-version). In this situation the first >> lookup on a file where utime detects that mdata is not set, will require a >> synchronous update. I think this is good enough because it will only happen >> once per file. 
We'll need to consider cases where different clients do >> lookups at the same time, but I think this can be easily solved by ignoring >> the request if mdata is already present. >> > > Initially there were two issues. > 1. Upgrade Issue with EC Volume as described by you. > This is solved with the patch [1]. There was a bug in ctime posix > where it was creating xattr even when ctime is not set on client (during > utimes system call). With patch [1], the behavior > is that utimes system call will only update the > "trusted.glusterfs.mdata" xattr if present else it won't create. The new > xattr creation should only happen during entry operations (i.e create, > mknod and others). > So there won't be any problems with upgrade. I think we don't need new > option dependent on op version if I am not wrong. > If I'm not missing something, we cannot allow creation of mdata xattr even for create/mknod/setattr fops. Doing so could cause the same problem if some of the bricks are not upgraded and do not support mdata yet (or they have ctime disabled by default). > 2. After upgrade, how do we update "trusted.glusterfs.mdata" xattr. > This mail thread was for this. Here which approach is better? I > understand from EC point of view the second approach is the best one. The > question I had was, Can't EC treat 'trusted.glusterfs.mdata' > as special xattr and add the logic to heal it from one subvolume > (i.e. to remove the requirement of having to have consistent data on k > subvolumes in k+r configuration). > Yes, we can do that. But this would require a newer client with support for this new xattr, which won't be possible during an upgrade, where bricks are upgraded before the clients. So, even if we add this intelligence to the client, the upgrade process is still broken. Only consideration here is if we can rely on self-heal daemon being on the server side (and thus upgraded at the same time than the server) to ensure that files can really be healed even if other bricks/shd daemons are not yet updated. Not sure if it could work, but anyway I don't like it very much. > > Second approach is independent of AFR and EC. So if we chose this, > do we need new option to guard? If the upgrade steps is to upgrade server > first and then client, we don't need to guard I think? > I think you are right for regular clients. Is there any server-side daemon that acts as a client that could use utime xlator ? if not, I think we don't need an additional option here. >> Xavi >> >> >>> >>> >>> [1] https://bugzilla.redhat.com/show_bug.cgi?id=1593542 >>> [2] https://github.com/gluster/glusterfs/issues/473 >>> >>> -- >>> Thanks and Regards, >>> Kotresh H R >>> >> > > -- > Thanks and Regards, > Kotresh H R > -------------- next part -------------- An HTML attachment was scrubbed... URL: From khiremat at redhat.com Tue Jun 18 07:25:44 2019 From: khiremat at redhat.com (Kotresh Hiremath Ravishankar) Date: Tue, 18 Jun 2019 12:55:44 +0530 Subject: [Gluster-devel] Solving Ctime Issue with legacy files [BUG 1593542] In-Reply-To: References: Message-ID: Hi Xavi, On Tue, Jun 18, 2019 at 12:28 PM Xavi Hernandez wrote: > Hi Kotresh, > > On Tue, Jun 18, 2019 at 8:33 AM Kotresh Hiremath Ravishankar < > khiremat at redhat.com> wrote: > >> Hi Xavi, >> >> Reply inline. 
>> >> On Mon, Jun 17, 2019 at 5:38 PM Xavi Hernandez >> wrote: >> >>> Hi Kotresh, >>> >>> On Mon, Jun 17, 2019 at 1:50 PM Kotresh Hiremath Ravishankar < >>> khiremat at redhat.com> wrote: >>> >>>> Hi All, >>>> >>>> The ctime feature is enabled by default from release gluster-6. But as >>>> explained in bug [1] there is a known issue with legacy files i.e., the >>>> files which are created before ctime feature is enabled. These files would >>>> not have "trusted.glusterfs.mdata" xattr which maintain time attributes. So >>>> on, accessing those files, it gets created with latest time attributes. >>>> This is not correct because all the time attributes (atime, mtime, ctime) >>>> get updated instead of required time attributes. >>>> >>>> There are couple of approaches to solve this. >>>> >>>> 1. On accessing the files, let the posix update the time attributes >>>> from the back end file on respective replicas. This obviously results in >>>> inconsistent "trusted.glusterfs.mdata" xattr values with in replica set. >>>> AFR/EC should heal this xattr as part of metadata heal upon accessing this >>>> file. It can chose to replicate from any subvolume. Ideally we should >>>> consider the highest time from the replica and treat it as source but I >>>> think that should be fine as replica time attributes are mostly in sync >>>> with max difference in order of few seconds if am not wrong. >>>> >>>> But client side self heal is disabled by default because of >>>> performance reasons [2]. If we chose to go by this approach, we need to >>>> consider enabling at least client side metadata self heal by default. >>>> Please share your thoughts on enabling the same by default. >>>> >>>> 2. Don't let posix update the legacy files from the backend. On lookup >>>> cbk, let the utime xlator update the time attributes from statbuf received >>>> synchronously. >>>> >>>> Both approaches are similar as both results in updating the xattr >>>> during lookup. Please share your inputs on which approach is better. >>>> >>> >>> I prefer second approach. First approach is not feasible for EC volumes >>> because self-heal requires that k bricks (on a k+r configuration) agree on >>> the value of this xattr, otherwise it considers the metadata damaged and >>> needs manual intervention to fix it. During upgrade, first r bricks with be >>> upgraded without problems, but trusted.glusterfs.mdata won't be healed >>> because r < k. In fact this xattr will be removed from new bricks because >>> the majority of bricks agree on xattr not being present. Once the r+1 brick >>> is upgraded, it's possible that posix sets different values for >>> trusted.glusterfs.mdata, which will cause self-heal to fail. >>> >>> Second approach seems better to me if guarded by a new option that >>> enables this behavior. utime xlator should only update the mdata xattr if >>> that option is set, and that option should only be settable once all nodes >>> have been upgraded (controlled by op-version). In this situation the first >>> lookup on a file where utime detects that mdata is not set, will require a >>> synchronous update. I think this is good enough because it will only happen >>> once per file. We'll need to consider cases where different clients do >>> lookups at the same time, but I think this can be easily solved by ignoring >>> the request if mdata is already present. >>> >> >> Initially there were two issues. >> 1. Upgrade Issue with EC Volume as described by you. >> This is solved with the patch [1]. 
There was a bug in ctime >> posix where it was creating xattr even when ctime is not set on client >> (during utimes system call). With patch [1], the behavior >> is that utimes system call will only update the >> "trusted.glusterfs.mdata" xattr if present else it won't create. The new >> xattr creation should only happen during entry operations (i.e create, >> mknod and others). >> So there won't be any problems with upgrade. I think we don't need new >> option dependent on op version if I am not wrong. >> > > If I'm not missing something, we cannot allow creation of mdata xattr even > for create/mknod/setattr fops. Doing so could cause the same problem if > some of the bricks are not upgraded and do not support mdata yet (or they > have ctime disabled by default). > Yes, that's right, even create/mknod and other fops won't create xattr if client doesn't set ctime (holds good for older clients). I have commented in the patch [1]. All other fops where xattr gets created as the check that if ctime is not set, don't create. It was missed only in utime syscall. And hence caused upgrade issues. > > >> 2. After upgrade, how do we update "trusted.glusterfs.mdata" xattr. >> This mail thread was for this. Here which approach is better? I >> understand from EC point of view the second approach is the best one. The >> question I had was, Can't EC treat 'trusted.glusterfs.mdata' >> as special xattr and add the logic to heal it from one subvolume >> (i.e. to remove the requirement of having to have consistent data on k >> subvolumes in k+r configuration). >> > > Yes, we can do that. But this would require a newer client with support > for this new xattr, which won't be possible during an upgrade, where bricks > are upgraded before the clients. So, even if we add this intelligence to > the client, the upgrade process is still broken. Only consideration here is > if we can rely on self-heal daemon being on the server side (and thus > upgraded at the same time than the server) to ensure that files can really > be healed even if other bricks/shd daemons are not yet updated. Not sure if > it could work, but anyway I don't like it very much. > > >> >> Second approach is independent of AFR and EC. So if we chose >> this, do we need new option to guard? If the upgrade steps is to upgrade >> server first and then client, we don't need to guard I think? >> > > I think you are right for regular clients. Is there any server-side daemon > that acts as a client that could use utime xlator ? if not, I think we > don't need an additional option here. > No, no other server side daemon has utime xlator loaded. [1] https://review.gluster.org/#/c/glusterfs/+/22858/ > >>> Xavi >>> >>> >>>> >>>> >>>> [1] https://bugzilla.redhat.com/show_bug.cgi?id=1593542 >>>> [2] https://github.com/gluster/glusterfs/issues/473 >>>> >>>> -- >>>> Thanks and Regards, >>>> Kotresh H R >>>> >>> >> >> -- >> Thanks and Regards, >> Kotresh H R >> > -- Thanks and Regards, Kotresh H R -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Tue Jun 18 07:51:53 2019 From: mscherer at redhat.com (Michael Scherer) Date: Tue, 18 Jun 2019 09:51:53 +0200 Subject: [Gluster-devel] Upgrade of some builder nodes to F30 Message-ID: Hi, as per request (and since F28 is EOL or soon to be), I will give a try at adding a Fedora 30 node to the build cluster. So if you see anything suspicious on the builder49 node once it is back, please tell. 
The plan is to add the node, then replay some job on it to see if they are ok, then add a few more nodes, and switch the job for good, rince, repeat. For now, the system is blocked at step 1: "fix our playbook broken yet again by a ansible upgrade". -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From atumball at redhat.com Thu Jun 20 06:06:46 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Thu, 20 Jun 2019 11:36:46 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> Message-ID: Considering python3 is anyways the future, I vote for taking the patch we did in master for fixing regression tests with python3 into the release-6 and release-5 branch and getting over this deadlock. Patch in discussion here is https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone notices, it changes only the files inside 'tests/' directory, which is not packaged in a release anyways. Hari, can we get the backport of this patch to both the release branches? Regards, Amar On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer wrote: > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > builder (ie, > > > > > > centos7.x machines), python3.6 got installed as a dependency. > > > > > > So, yes, it > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't > > > > > have > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > packages for > > > > > building. > > > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > ready. FWIW, > > > > python33 is available on both RHEL7 and CentOS7 from the Software > > > > Collection Library (SCL), and python34 and now python36 are > > > > available from > > > > EPEL. > > > > > > > > But packages built for the CentOS Storage SIG have never used the > > > > SCL or > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > converted > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > %prep stage. > > > > All the python dependencies for the packages remain the python2 > > > > flavors. > > > > AFAIK the centos-regression machines ought to be building the > > > > same way. > > > > > > Indeed, there should not be a requirement on having EPEL enabled on > > > the > > > CentOS-7 builders. At least not for the building of the glusterfs > > > tarball. 
We still need to do releases of glusterfs-4.1 and > > > glusterfs-5, > > > until then it is expected to have python2 as the (only?) version > > > for the > > > system. Is it possible to remove python3 from the CentOS-7 builders > > > and > > > run the jobs that require python3 on the Fedora builders instead? > > > > Actually, if the python-devel package for python3 is installed on the > > CentOS-7 builders, things may work too. It still feels like some sort > > of > > Frankenstein deployment, and we don't expect to this see in > > production > > environments. But maybe this is a workaround in case something > > really, > > really, REALLY depends on python3 on the builders. > > To be honest, people would be surprised what happen in production > around (sysadmins tend to discuss around, we all have horrors stories, > stuff that were supposed to be cleaned and wasn't, etc) > > After all, "frankenstein deployment now" is better than "perfect > later", especially since lots of IT departements are under constant > pressure (so that's more "perfect never"). > > I can understand that we want clean and simple code (who doesn't), but > real life is much messier than we want to admit, so we need something > robust. > > -- > Michael Scherer > Sysadmin, Community Infrastructure > > > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From hgowtham at redhat.com Thu Jun 20 07:07:31 2019 From: hgowtham at redhat.com (Hari Gowtham) Date: Thu, 20 Jun 2019 12:37:31 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> Message-ID: Hi Amar, I have done the above request earlier with release 5 and still it fails. Patch: https://review.gluster.org/#/c/glusterfs/+/22855/ build log for failure: https://build.gluster.org/job/strfmt_errors/18889/artifact/RPMS/el6/i686/build.log The failure is related to building. So we need to fix the python 3 compatibility issues with release 5 as well. The build uses python3 and gluster relies on python2. I'm not sure if the patches to make gluster python3 compatible have made its way to release 5 and 6. If not then we have to work on that and make the build changes necessary to start consuming python3 for release branches. Or we have to make the build script smarter to use python 2 for release branches and python 3 for master. On Thu, Jun 20, 2019 at 11:38 AM Amar Tumballi Suryanarayan wrote: > > > Considering python3 is anyways the future, I vote for taking the patch we did in master for fixing regression tests with python3 into the release-6 and release-5 branch and getting over this deadlock. > > Patch in discussion here is https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone notices, it changes only the files inside 'tests/' directory, which is not packaged in a release anyways. > > Hari, can we get the backport of this patch to both the release branches? 
> > Regards, > Amar > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer wrote: >> >> Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : >> > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: >> > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: >> > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < >> > > > kkeithle at redhat.com> wrote: >> > > > >> > > > > >> > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < >> > > > > atumball at redhat.com> wrote: >> > > > > >> > > > > > >> > > > > > We recently noticed that in one of the package update on >> > > > > > builder (ie, >> > > > > > centos7.x machines), python3.6 got installed as a dependency. >> > > > > > So, yes, it >> > > > > > is possible to have python3 in centos7 now. >> > > > > > >> > > > > >> > > > > EPEL updated from python34 to python36 recently, but C7 doesn't >> > > > > have >> > > > > python3 in the base. I don't think we've ever used EPEL >> > > > > packages for >> > > > > building. >> > > > > >> > > > > And GlusterFS-5 isn't python3 ready. >> > > > > >> > > > >> > > > Correction: GlusterFS-5 is mostly or completely python3 >> > > > ready. FWIW, >> > > > python33 is available on both RHEL7 and CentOS7 from the Software >> > > > Collection Library (SCL), and python34 and now python36 are >> > > > available from >> > > > EPEL. >> > > > >> > > > But packages built for the CentOS Storage SIG have never used the >> > > > SCL or >> > > > EPEL (EPEL not allowed) and the shebangs in the .py files are >> > > > converted >> > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild >> > > > %prep stage. >> > > > All the python dependencies for the packages remain the python2 >> > > > flavors. >> > > > AFAIK the centos-regression machines ought to be building the >> > > > same way. >> > > >> > > Indeed, there should not be a requirement on having EPEL enabled on >> > > the >> > > CentOS-7 builders. At least not for the building of the glusterfs >> > > tarball. We still need to do releases of glusterfs-4.1 and >> > > glusterfs-5, >> > > until then it is expected to have python2 as the (only?) version >> > > for the >> > > system. Is it possible to remove python3 from the CentOS-7 builders >> > > and >> > > run the jobs that require python3 on the Fedora builders instead? >> > >> > Actually, if the python-devel package for python3 is installed on the >> > CentOS-7 builders, things may work too. It still feels like some sort >> > of >> > Frankenstein deployment, and we don't expect to this see in >> > production >> > environments. But maybe this is a workaround in case something >> > really, >> > really, REALLY depends on python3 on the builders. >> >> To be honest, people would be surprised what happen in production >> around (sysadmins tend to discuss around, we all have horrors stories, >> stuff that were supposed to be cleaned and wasn't, etc) >> >> After all, "frankenstein deployment now" is better than "perfect >> later", especially since lots of IT departements are under constant >> pressure (so that's more "perfect never"). >> >> I can understand that we want clean and simple code (who doesn't), but >> real life is much messier than we want to admit, so we need something >> robust. 
>> >> -- >> Michael Scherer >> Sysadmin, Community Infrastructure >> >> >> >> _______________________________________________ >> >> Community Meeting Calendar: >> >> APAC Schedule - >> Every 2nd and 4th Tuesday at 11:30 AM IST >> Bridge: https://bluejeans.com/836554017 >> >> NA/EMEA Schedule - >> Every 1st and 3rd Tuesday at 01:00 PM EDT >> Bridge: https://bluejeans.com/486278655 >> >> Gluster-devel mailing list >> Gluster-devel at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-devel >> > > > -- > Amar Tumballi (amarts) > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > -- Regards, Hari Gowtham. From ndevos at redhat.com Thu Jun 20 07:43:35 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 20 Jun 2019 09:43:35 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> Message-ID: <20190620074335.GA12566@ndevos-x270> On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi Suryanarayan wrote: > Considering python3 is anyways the future, I vote for taking the patch we > did in master for fixing regression tests with python3 into the release-6 > and release-5 branch and getting over this deadlock. > > Patch in discussion here is > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone notices, it > changes only the files inside 'tests/' directory, which is not packaged in > a release anyways. > > Hari, can we get the backport of this patch to both the release branches? When going this route, you still need to make sure that the python3-devel package is available on the CentOS-7 builders. And I don't know if installing that package is already sufficient, maybe the backport is not even needed in that case. Niels > > Regards, > Amar > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer wrote: > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > > builder (ie, > > > > > > > centos7.x machines), python3.6 got installed as a dependency. > > > > > > > So, yes, it > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't > > > > > > have > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > packages for > > > > > > building. > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > ready. 
FWIW, > > > > > python33 is available on both RHEL7 and CentOS7 from the Software > > > > > Collection Library (SCL), and python34 and now python36 are > > > > > available from > > > > > EPEL. > > > > > > > > > > But packages built for the CentOS Storage SIG have never used the > > > > > SCL or > > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > > converted > > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > > %prep stage. > > > > > All the python dependencies for the packages remain the python2 > > > > > flavors. > > > > > AFAIK the centos-regression machines ought to be building the > > > > > same way. > > > > > > > > Indeed, there should not be a requirement on having EPEL enabled on > > > > the > > > > CentOS-7 builders. At least not for the building of the glusterfs > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > glusterfs-5, > > > > until then it is expected to have python2 as the (only?) version > > > > for the > > > > system. Is it possible to remove python3 from the CentOS-7 builders > > > > and > > > > run the jobs that require python3 on the Fedora builders instead? > > > > > > Actually, if the python-devel package for python3 is installed on the > > > CentOS-7 builders, things may work too. It still feels like some sort > > > of > > > Frankenstein deployment, and we don't expect to this see in > > > production > > > environments. But maybe this is a workaround in case something > > > really, > > > really, REALLY depends on python3 on the builders. > > > > To be honest, people would be surprised what happen in production > > around (sysadmins tend to discuss around, we all have horrors stories, > > stuff that were supposed to be cleaned and wasn't, etc) > > > > After all, "frankenstein deployment now" is better than "perfect > > later", especially since lots of IT departements are under constant > > pressure (so that's more "perfect never"). > > > > I can understand that we want clean and simple code (who doesn't), but > > real life is much messier than we want to admit, so we need something > > robust. 
> > > > -- > > Michael Scherer > > Sysadmin, Community Infrastructure > > > > > > > > _______________________________________________ > > > > Community Meeting Calendar: > > > > APAC Schedule - > > Every 2nd and 4th Tuesday at 11:30 AM IST > > Bridge: https://bluejeans.com/836554017 > > > > NA/EMEA Schedule - > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > Bridge: https://bluejeans.com/486278655 > > > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > -- > Amar Tumballi (amarts) From atumball at redhat.com Thu Jun 20 08:41:21 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Thu, 20 Jun 2019 14:11:21 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190620074335.GA12566@ndevos-x270> References: <20190612133437.GK8725@ndevos-x270> <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> Message-ID: On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos wrote: > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi Suryanarayan wrote: > > Considering python3 is anyways the future, I vote for taking the patch we > > did in master for fixing regression tests with python3 into the release-6 > > and release-5 branch and getting over this deadlock. > > > > Patch in discussion here is > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > notices, it > > changes only the files inside 'tests/' directory, which is not packaged > in > > a release anyways. > > > > Hari, can we get the backport of this patch to both the release branches? > > When going this route, you still need to make sure that the > python3-devel package is available on the CentOS-7 builders. And I > don't know if installing that package is already sufficient, maybe the > backport is not even needed in that case. > > I was thinking, having this patch makes it compatible with both python2 and python3, so technically, it allows us to move to Fedora30 if we need to run regression there. (and CentOS7 with only python2). The above patch made it compatible, not mandatory to have python3. So, treating it as a bug fix. > Niels > > > > > > Regards, > > Amar > > > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer > wrote: > > > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > > > builder (ie, > > > > > > > > centos7.x machines), python3.6 got installed as a dependency. > > > > > > > > So, yes, it > > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't > > > > > > > have > > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > > packages for > > > > > > > building. > > > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. 
> > > > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > > ready. FWIW, > > > > > > python33 is available on both RHEL7 and CentOS7 from the Software > > > > > > Collection Library (SCL), and python34 and now python36 are > > > > > > available from > > > > > > EPEL. > > > > > > > > > > > > But packages built for the CentOS Storage SIG have never used the > > > > > > SCL or > > > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > > > converted > > > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > > > %prep stage. > > > > > > All the python dependencies for the packages remain the python2 > > > > > > flavors. > > > > > > AFAIK the centos-regression machines ought to be building the > > > > > > same way. > > > > > > > > > > Indeed, there should not be a requirement on having EPEL enabled on > > > > > the > > > > > CentOS-7 builders. At least not for the building of the glusterfs > > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > > glusterfs-5, > > > > > until then it is expected to have python2 as the (only?) version > > > > > for the > > > > > system. Is it possible to remove python3 from the CentOS-7 builders > > > > > and > > > > > run the jobs that require python3 on the Fedora builders instead? > > > > > > > > Actually, if the python-devel package for python3 is installed on the > > > > CentOS-7 builders, things may work too. It still feels like some sort > > > > of > > > > Frankenstein deployment, and we don't expect to this see in > > > > production > > > > environments. But maybe this is a workaround in case something > > > > really, > > > > really, REALLY depends on python3 on the builders. > > > > > > To be honest, people would be surprised what happen in production > > > around (sysadmins tend to discuss around, we all have horrors stories, > > > stuff that were supposed to be cleaned and wasn't, etc) > > > > > > After all, "frankenstein deployment now" is better than "perfect > > > later", especially since lots of IT departements are under constant > > > pressure (so that's more "perfect never"). > > > > > > I can understand that we want clean and simple code (who doesn't), but > > > real life is much messier than we want to admit, so we need something > > > robust. > > > > > > -- > > > Michael Scherer > > > Sysadmin, Community Infrastructure > > > > > > > > > > > > _______________________________________________ > > > > > > Community Meeting Calendar: > > > > > > APAC Schedule - > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > Bridge: https://bluejeans.com/836554017 > > > > > > NA/EMEA Schedule - > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > Bridge: https://bluejeans.com/486278655 > > > > > > Gluster-devel mailing list > > > Gluster-devel at gluster.org > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > -- > > Amar Tumballi (amarts) > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... 
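For readers wondering what "compatible with both python2 and python3" looks like in practice for scripts under tests/, the fragment below is a generic illustration of that style. It is not an excerpt of the patch itself, only an example of code that runs unchanged under either interpreter.

#!/usr/bin/env python
# Generic python2/python3-neutral style (illustration only, not the actual
# tests/ change): print as a function, plus version-agnostic introspection.
from __future__ import print_function

import sys

def main():
    print("running under python %d.%d" % sys.version_info[:2])
    return 0

if __name__ == "__main__":
    sys.exit(main())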
URL: From ndevos at redhat.com Thu Jun 20 09:05:08 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 20 Jun 2019 11:05:08 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> Message-ID: <20190620090508.GA13895@ndevos-x270> On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan wrote: > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos wrote: > > > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi Suryanarayan wrote: > > > Considering python3 is anyways the future, I vote for taking the patch we > > > did in master for fixing regression tests with python3 into the release-6 > > > and release-5 branch and getting over this deadlock. > > > > > > Patch in discussion here is > > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > > notices, it > > > changes only the files inside 'tests/' directory, which is not packaged > > in > > > a release anyways. > > > > > > Hari, can we get the backport of this patch to both the release branches? > > > > When going this route, you still need to make sure that the > > python3-devel package is available on the CentOS-7 builders. And I > > don't know if installing that package is already sufficient, maybe the > > backport is not even needed in that case. > > > > > I was thinking, having this patch makes it compatible with both python2 and > python3, so technically, it allows us to move to Fedora30 if we need to run > regression there. (and CentOS7 with only python2). > > The above patch made it compatible, not mandatory to have python3. So, > treating it as a bug fix. Well, whatever Python is detected (python3 has preference over python2), needs to have the -devel package available too. Detection is done by probing the python executable. The Matching header files from -devel need to be present in order to be able to build glupy (and others?). I do not think compatibility for python3/2 is the problem while building the tarball. The backport might become relevant while running tests on environments where there is no python2. Niels > > > > Niels > > > > > > > > > > Regards, > > > Amar > > > > > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer > > wrote: > > > > > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi Suryanarayan < > > > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > > > > builder (ie, > > > > > > > > > centos7.x machines), python3.6 got installed as a dependency. > > > > > > > > > So, yes, it > > > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 doesn't > > > > > > > > have > > > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > > > packages for > > > > > > > > building. > > > > > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. 
> > > > > > > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > > > ready. FWIW, > > > > > > > python33 is available on both RHEL7 and CentOS7 from the Software > > > > > > > Collection Library (SCL), and python34 and now python36 are > > > > > > > available from > > > > > > > EPEL. > > > > > > > > > > > > > > But packages built for the CentOS Storage SIG have never used the > > > > > > > SCL or > > > > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > > > > converted > > > > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > > > > %prep stage. > > > > > > > All the python dependencies for the packages remain the python2 > > > > > > > flavors. > > > > > > > AFAIK the centos-regression machines ought to be building the > > > > > > > same way. > > > > > > > > > > > > Indeed, there should not be a requirement on having EPEL enabled on > > > > > > the > > > > > > CentOS-7 builders. At least not for the building of the glusterfs > > > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > > > glusterfs-5, > > > > > > until then it is expected to have python2 as the (only?) version > > > > > > for the > > > > > > system. Is it possible to remove python3 from the CentOS-7 builders > > > > > > and > > > > > > run the jobs that require python3 on the Fedora builders instead? > > > > > > > > > > Actually, if the python-devel package for python3 is installed on the > > > > > CentOS-7 builders, things may work too. It still feels like some sort > > > > > of > > > > > Frankenstein deployment, and we don't expect to this see in > > > > > production > > > > > environments. But maybe this is a workaround in case something > > > > > really, > > > > > really, REALLY depends on python3 on the builders. > > > > > > > > To be honest, people would be surprised what happen in production > > > > around (sysadmins tend to discuss around, we all have horrors stories, > > > > stuff that were supposed to be cleaned and wasn't, etc) > > > > > > > > After all, "frankenstein deployment now" is better than "perfect > > > > later", especially since lots of IT departements are under constant > > > > pressure (so that's more "perfect never"). > > > > > > > > I can understand that we want clean and simple code (who doesn't), but > > > > real life is much messier than we want to admit, so we need something > > > > robust. 
> > > > > > > > -- > > > > Michael Scherer > > > > Sysadmin, Community Infrastructure > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > > > Community Meeting Calendar: > > > > > > > > APAC Schedule - > > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > > Bridge: https://bluejeans.com/836554017 > > > > > > > > NA/EMEA Schedule - > > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > > Bridge: https://bluejeans.com/486278655 > > > > > > > > Gluster-devel mailing list > > > > Gluster-devel at gluster.org > > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > > > > > -- > > > Amar Tumballi (amarts) > > > > > -- > Amar Tumballi (amarts) From atumball at redhat.com Thu Jun 20 09:26:51 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Thu, 20 Jun 2019 14:56:51 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190620090508.GA13895@ndevos-x270> References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos wrote: > On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan wrote: > > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos wrote: > > > > > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi Suryanarayan > wrote: > > > > Considering python3 is anyways the future, I vote for taking the > patch we > > > > did in master for fixing regression tests with python3 into the > release-6 > > > > and release-5 branch and getting over this deadlock. > > > > > > > > Patch in discussion here is > > > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > > > notices, it > > > > changes only the files inside 'tests/' directory, which is not > packaged > > > in > > > > a release anyways. > > > > > > > > Hari, can we get the backport of this patch to both the release > branches? > > > > > > When going this route, you still need to make sure that the > > > python3-devel package is available on the CentOS-7 builders. And I > > > don't know if installing that package is already sufficient, maybe the > > > backport is not even needed in that case. > > > > > > > > I was thinking, having this patch makes it compatible with both python2 > and > > python3, so technically, it allows us to move to Fedora30 if we need to > run > > regression there. (and CentOS7 with only python2). > > > > The above patch made it compatible, not mandatory to have python3. So, > > treating it as a bug fix. > > Well, whatever Python is detected (python3 has preference over python2), > needs to have the -devel package available too. Detection is done by > probing the python executable. The Matching header files from -devel > need to be present in order to be able to build glupy (and others?). > > I do not think compatibility for python3/2 is the problem while > building the tarball. Got it! True. Compatibility is not the problem to build the tarball. I noticed the issue of smoke is coming only from strfmt-errors job, which checks for 'epel-6-i386' mock, and fails right now. The backport might become relevant while running > tests on environments where there is no python2. > > Backport is very important if we are running in a system where we have only python3. Hence my proposal to include it in releases. 
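To make the detection order quoted above concrete ("python3 has preference over python2", probed via the executable), here is a rough stand-in written for python3. The real probing is done by the autoconf checks in configure, not by a script like this; the point is simply that whichever interpreter wins, its matching -devel headers must also be present.

#!/usr/bin/env python3
# Illustration only: mimic the "python3 preferred, python2 fallback" probe
# order described in the quoted reply. Real detection lives in the build
# system (configure), not here.
import shutil

def detect_python():
    for candidate in ("python3", "python2", "python"):
        path = shutil.which(candidate)
        if path:
            return candidate, path
    raise RuntimeError("no python interpreter found in PATH")

if __name__ == "__main__":
    name, path = detect_python()
    print("would build against %s at %s (install its -devel headers too)"
          % (name, path))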
But we are stuck with strfmt-errors job right now, and looking at what it was intended to catch in first place, mostly our https://build.gluster.org/job/32-bit-build-smoke/ would be doing same. If that is the case, we can remove the job altogether. Also note, this job is known to fail many smokes with 'Build root is locked by another process' errors. Would be great if disabling strfmt-errors is an option. Regards, > Niels > > > > > > > > > Niels > > > > > > > > > > > > > > Regards, > > > > Amar > > > > > > > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer > > > > wrote: > > > > > > > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi > Suryanarayan < > > > > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > > > > > builder (ie, > > > > > > > > > > centos7.x machines), python3.6 got installed as a > dependency. > > > > > > > > > > So, yes, it > > > > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 > doesn't > > > > > > > > > have > > > > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > > > > packages for > > > > > > > > > building. > > > > > > > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > > > > ready. FWIW, > > > > > > > > python33 is available on both RHEL7 and CentOS7 from the > Software > > > > > > > > Collection Library (SCL), and python34 and now python36 are > > > > > > > > available from > > > > > > > > EPEL. > > > > > > > > > > > > > > > > But packages built for the CentOS Storage SIG have never > used the > > > > > > > > SCL or > > > > > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > > > > > converted > > > > > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > > > > > %prep stage. > > > > > > > > All the python dependencies for the packages remain the > python2 > > > > > > > > flavors. > > > > > > > > AFAIK the centos-regression machines ought to be building the > > > > > > > > same way. > > > > > > > > > > > > > > Indeed, there should not be a requirement on having EPEL > enabled on > > > > > > > the > > > > > > > CentOS-7 builders. At least not for the building of the > glusterfs > > > > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > > > > glusterfs-5, > > > > > > > until then it is expected to have python2 as the (only?) > version > > > > > > > for the > > > > > > > system. Is it possible to remove python3 from the CentOS-7 > builders > > > > > > > and > > > > > > > run the jobs that require python3 on the Fedora builders > instead? > > > > > > > > > > > > Actually, if the python-devel package for python3 is installed > on the > > > > > > CentOS-7 builders, things may work too. 
It still feels like some > sort > > > > > > of > > > > > > Frankenstein deployment, and we don't expect to this see in > > > > > > production > > > > > > environments. But maybe this is a workaround in case something > > > > > > really, > > > > > > really, REALLY depends on python3 on the builders. > > > > > > > > > > To be honest, people would be surprised what happen in production > > > > > around (sysadmins tend to discuss around, we all have horrors > stories, > > > > > stuff that were supposed to be cleaned and wasn't, etc) > > > > > > > > > > After all, "frankenstein deployment now" is better than "perfect > > > > > later", especially since lots of IT departements are under constant > > > > > pressure (so that's more "perfect never"). > > > > > > > > > > I can understand that we want clean and simple code (who doesn't), > but > > > > > real life is much messier than we want to admit, so we need > something > > > > > robust. > > > > > > > > > > -- > > > > > Michael Scherer > > > > > Sysadmin, Community Infrastructure > > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > > > > > Community Meeting Calendar: > > > > > > > > > > APAC Schedule - > > > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > > > Bridge: https://bluejeans.com/836554017 > > > > > > > > > > NA/EMEA Schedule - > > > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > > > Bridge: https://bluejeans.com/486278655 > > > > > > > > > > Gluster-devel mailing list > > > > > Gluster-devel at gluster.org > > > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > > > > > > > > > -- > > > > Amar Tumballi (amarts) > > > > > > > > > -- > > Amar Tumballi (amarts) > -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndevos at redhat.com Thu Jun 20 09:49:01 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 20 Jun 2019 11:49:01 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: <20190620094901.GB13895@ndevos-x270> On Thu, Jun 20, 2019 at 02:56:51PM +0530, Amar Tumballi Suryanarayan wrote: > On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos wrote: > > > On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan wrote: > > > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos wrote: > > > > > > > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi Suryanarayan > > wrote: > > > > > Considering python3 is anyways the future, I vote for taking the > > patch we > > > > > did in master for fixing regression tests with python3 into the > > release-6 > > > > > and release-5 branch and getting over this deadlock. > > > > > > > > > > Patch in discussion here is > > > > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > > > > notices, it > > > > > changes only the files inside 'tests/' directory, which is not > > packaged > > > > in > > > > > a release anyways. > > > > > > > > > > Hari, can we get the backport of this patch to both the release > > branches? > > > > > > > > When going this route, you still need to make sure that the > > > > python3-devel package is available on the CentOS-7 builders. And I > > > > don't know if installing that package is already sufficient, maybe the > > > > backport is not even needed in that case. 
> > > > > > > > > > > I was thinking, having this patch makes it compatible with both python2 > > and > > > python3, so technically, it allows us to move to Fedora30 if we need to > > run > > > regression there. (and CentOS7 with only python2). > > > > > > The above patch made it compatible, not mandatory to have python3. So, > > > treating it as a bug fix. > > > > Well, whatever Python is detected (python3 has preference over python2), > > needs to have the -devel package available too. Detection is done by > > probing the python executable. The Matching header files from -devel > > need to be present in order to be able to build glupy (and others?). > > > > I do not think compatibility for python3/2 is the problem while > > building the tarball. > > > Got it! True. Compatibility is not the problem to build the tarball. > > I noticed the issue of smoke is coming only from strfmt-errors job, which > checks for 'epel-6-i386' mock, and fails right now. > > The backport might become relevant while running > > tests on environments where there is no python2. > > > > > Backport is very important if we are running in a system where we have only > python3. Hence my proposal to include it in releases. I am sure CentOS-7 still has python2. The newer python3 only gets pulled in by some additional packages that get installed from EPEL. > But we are stuck with strfmt-errors job right now, and looking at what it > was intended to catch in first place, mostly our > https://build.gluster.org/job/32-bit-build-smoke/ would be doing same. If > that is the case, we can remove the job altogether. Also note, this job is > known to fail many smokes with 'Build root is locked by another process' > errors. This error means that there are multiple concurrent jobs running 'mock' with this buildroot. That should not happen and is a configuration error in one or more Jenkins jobs. > Would be great if disabling strfmt-errors is an option. I think both jobs do different things. The smoke is functional, where as strfmt-errors catches incorrect string formatting (some maintainers assume always 64-bit, everywhere) that has been missed in reviews. Niels > > Regards, > > > Niels > > > > > > > > > > > > > > Niels > > > > > > > > > > > > > > > > > > Regards, > > > > > Amar > > > > > > > > > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer > > > > > > wrote: > > > > > > > > > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > > > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley wrote: > > > > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi > > Suryanarayan < > > > > > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package update on > > > > > > > > > > > builder (ie, > > > > > > > > > > > centos7.x machines), python3.6 got installed as a > > dependency. > > > > > > > > > > > So, yes, it > > > > > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 > > doesn't > > > > > > > > > > have > > > > > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > > > > > packages for > > > > > > > > > > building. 
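The string-formatting problem Niels describes above (format strings that are only correct when 'long' is 64 bits wide) is the kind of thing a 32-bit build trips over. A short, hypothetical C snippet, not taken from the gluster tree, shows the failure mode and the portable fix:

#include <stdio.h>
#include <stdint.h>
#include <inttypes.h>

int main(void)
{
    uint64_t written = 5368709120ULL;  /* 5 GiB, does not fit in 32 bits */
    char msg[64];

    /* Wrong on 32-bit: "%lu" expects 'unsigned long', which is only 32 bits
     * wide on i686/armv7hl, so gcc -Wformat flags the mismatch there:
     *
     *   snprintf(msg, sizeof(msg), "wrote %lu bytes", written);
     */

    /* Portable: PRIu64 expands to the right conversion on every arch. */
    snprintf(msg, sizeof(msg), "wrote %" PRIu64 " bytes", written);
    puts(msg);
    return 0;
}

Compiling such a file with 'gcc -m32 -Wall -Werror=format' is a rough local approximation of what the 32-bit jobs catch.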
> > > > > > > > > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > > > > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > > > > > ready. FWIW, > > > > > > > > > python33 is available on both RHEL7 and CentOS7 from the > > Software > > > > > > > > > Collection Library (SCL), and python34 and now python36 are > > > > > > > > > available from > > > > > > > > > EPEL. > > > > > > > > > > > > > > > > > > But packages built for the CentOS Storage SIG have never > > used the > > > > > > > > > SCL or > > > > > > > > > EPEL (EPEL not allowed) and the shebangs in the .py files are > > > > > > > > > converted > > > > > > > > > from /usr/bin/python3 to /usr/bin/python2 during the rpmbuild > > > > > > > > > %prep stage. > > > > > > > > > All the python dependencies for the packages remain the > > python2 > > > > > > > > > flavors. > > > > > > > > > AFAIK the centos-regression machines ought to be building the > > > > > > > > > same way. > > > > > > > > > > > > > > > > Indeed, there should not be a requirement on having EPEL > > enabled on > > > > > > > > the > > > > > > > > CentOS-7 builders. At least not for the building of the > > glusterfs > > > > > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > > > > > glusterfs-5, > > > > > > > > until then it is expected to have python2 as the (only?) > > version > > > > > > > > for the > > > > > > > > system. Is it possible to remove python3 from the CentOS-7 > > builders > > > > > > > > and > > > > > > > > run the jobs that require python3 on the Fedora builders > > instead? > > > > > > > > > > > > > > Actually, if the python-devel package for python3 is installed > > on the > > > > > > > CentOS-7 builders, things may work too. It still feels like some > > sort > > > > > > > of > > > > > > > Frankenstein deployment, and we don't expect to this see in > > > > > > > production > > > > > > > environments. But maybe this is a workaround in case something > > > > > > > really, > > > > > > > really, REALLY depends on python3 on the builders. > > > > > > > > > > > > To be honest, people would be surprised what happen in production > > > > > > around (sysadmins tend to discuss around, we all have horrors > > stories, > > > > > > stuff that were supposed to be cleaned and wasn't, etc) > > > > > > > > > > > > After all, "frankenstein deployment now" is better than "perfect > > > > > > later", especially since lots of IT departements are under constant > > > > > > pressure (so that's more "perfect never"). > > > > > > > > > > > > I can understand that we want clean and simple code (who doesn't), > > but > > > > > > real life is much messier than we want to admit, so we need > > something > > > > > > robust. 
> > > > > > > > > > > > -- > > > > > > Michael Scherer > > > > > > Sysadmin, Community Infrastructure > > > > > > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > > > > > > > Community Meeting Calendar: > > > > > > > > > > > > APAC Schedule - > > > > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > > > > Bridge: https://bluejeans.com/836554017 > > > > > > > > > > > > NA/EMEA Schedule - > > > > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > > > > Bridge: https://bluejeans.com/486278655 > > > > > > > > > > > > Gluster-devel mailing list > > > > > > Gluster-devel at gluster.org > > > > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > > > > > > > > > > > > > -- > > > > > Amar Tumballi (amarts) > > > > > > > > > > > > > -- > > > Amar Tumballi (amarts) > > > > > -- > Amar Tumballi (amarts) From dkhandel at redhat.com Thu Jun 20 10:12:06 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Thu, 20 Jun 2019 15:42:06 +0530 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: <20190620094901.GB13895@ndevos-x270> References: <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> <20190620094901.GB13895@ndevos-x270> Message-ID: On Thu, Jun 20, 2019 at 3:20 PM Niels de Vos wrote: > On Thu, Jun 20, 2019 at 02:56:51PM +0530, Amar Tumballi Suryanarayan wrote: > > On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos wrote: > > > > > On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan > wrote: > > > > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos > wrote: > > > > > > > > > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi > Suryanarayan > > > wrote: > > > > > > Considering python3 is anyways the future, I vote for taking the > > > patch we > > > > > > did in master for fixing regression tests with python3 into the > > > release-6 > > > > > > and release-5 branch and getting over this deadlock. > > > > > > > > > > > > Patch in discussion here is > > > > > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > > > > > notices, it > > > > > > changes only the files inside 'tests/' directory, which is not > > > packaged > > > > > in > > > > > > a release anyways. > > > > > > > > > > > > Hari, can we get the backport of this patch to both the release > > > branches? > > > > > > > > > > When going this route, you still need to make sure that the > > > > > python3-devel package is available on the CentOS-7 builders. And I > > > > > don't know if installing that package is already sufficient, maybe > the > > > > > backport is not even needed in that case. > > > > > > > > > > > > > > I was thinking, having this patch makes it compatible with both > python2 > > > and > > > > python3, so technically, it allows us to move to Fedora30 if we need > to > > > run > > > > regression there. (and CentOS7 with only python2). > > > > > > > > The above patch made it compatible, not mandatory to have python3. > So, > > > > treating it as a bug fix. > > > > > > Well, whatever Python is detected (python3 has preference over > python2), > > > needs to have the -devel package available too. Detection is done by > > > probing the python executable. The Matching header files from -devel > > > need to be present in order to be able to build glupy (and others?). 
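To make the -devel requirement just mentioned concrete: glupy (and anything else that embeds the interpreter) is C code compiled against Python.h, and that header ships only with python2-devel/python3-devel. A minimal, hypothetical embedding sketch, not the actual glupy source, which stops at the very first line when the headers are missing:

/* embed_check.c - hypothetical Python-embedding sketch.
 * Build: gcc embed_check.c $(python3-config --cflags) $(python3-config --ldflags) -o embed_check
 * (python 3.8 and later additionally need --embed for the ldflags)
 * Without the -devel package installed, Python.h is not found and the build fails here. */
#include <Python.h>

int main(void)
{
    Py_Initialize();                     /* start the embedded interpreter */
    PyRun_SimpleString("import sys; print(sys.version)");
    Py_Finalize();                       /* tear it down again */
    return 0;
}

Whichever interpreter the probe picks only decides whether python2-devel or python3-devel has to be present on the builder; the build breaks the same way in either case if the headers are absent.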
> > > > > > I do not think compatibility for python3/2 is the problem while > > > building the tarball. > > > > > > Got it! True. Compatibility is not the problem to build the tarball. > > > > I noticed the issue of smoke is coming only from strfmt-errors job, which > > checks for 'epel-6-i386' mock, and fails right now. > > > > The backport might become relevant while running > > > tests on environments where there is no python2. > > > > > > > > Backport is very important if we are running in a system where we have > only > > python3. Hence my proposal to include it in releases. > > I am sure CentOS-7 still has python2. The newer python3 only gets pulled > in by some additional packages that get installed from EPEL. > > > But we are stuck with strfmt-errors job right now, and looking at what it > > was intended to catch in first place, mostly our > > https://build.gluster.org/job/32-bit-build-smoke/ would be doing same. > If > > that is the case, we can remove the job altogether. Also note, this job > is > > known to fail many smokes with 'Build root is locked by another process' > > errors. > > This error means that there are multiple concurrent jobs running 'mock' > with this buildroot. That should not happen and is a configuration error > in one or more Jenkins jobs. Adding to this, this error occurs when the last running job using mock has been aborted and no proper cleaning/killing in the build root has happened. I'm planning to call up a cleanup function on abort. > > Would be great if disabling strfmt-errors is an option. > > I think both jobs do different things. The smoke is functional, where as > strfmt-errors catches incorrect string formatting (some maintainers > assume always 64-bit, everywhere) that has been missed in reviews. > Is there any specific reason to use 64-bit for strfmt-errors? Also I have a doubt here, if it needs python3-devel package to build glupy it should have failed for basic smoke testing where we do source build install? > > Niels > > > > > > Regards, > > > > > Niels > > > > > > > > > > > > > > > > > > > Niels > > > > > > > > > > > > > > > > > > > > > > Regards, > > > > > > Amar > > > > > > > > > > > > On Thu, Jun 13, 2019 at 7:26 PM Michael Scherer < > mscherer at redhat.com > > > > > > > > > wrote: > > > > > > > > > > > > > Le jeudi 13 juin 2019 ? 14:28 +0200, Niels de Vos a ?crit : > > > > > > > > On Thu, Jun 13, 2019 at 11:08:25AM +0200, Niels de Vos wrote: > > > > > > > > > On Wed, Jun 12, 2019 at 04:09:55PM -0700, Kaleb Keithley > wrote: > > > > > > > > > > On Wed, Jun 12, 2019 at 11:36 AM Kaleb Keithley < > > > > > > > > > > kkeithle at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > On Wed, Jun 12, 2019 at 10:43 AM Amar Tumballi > > > Suryanarayan < > > > > > > > > > > > atumball at redhat.com> wrote: > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > We recently noticed that in one of the package > update on > > > > > > > > > > > > builder (ie, > > > > > > > > > > > > centos7.x machines), python3.6 got installed as a > > > dependency. > > > > > > > > > > > > So, yes, it > > > > > > > > > > > > is possible to have python3 in centos7 now. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > EPEL updated from python34 to python36 recently, but C7 > > > doesn't > > > > > > > > > > > have > > > > > > > > > > > python3 in the base. I don't think we've ever used EPEL > > > > > > > > > > > packages for > > > > > > > > > > > building. 
> > > > > > > > > > > > > > > > > > > > > > And GlusterFS-5 isn't python3 ready. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > Correction: GlusterFS-5 is mostly or completely python3 > > > > > > > > > > ready. FWIW, > > > > > > > > > > python33 is available on both RHEL7 and CentOS7 from the > > > Software > > > > > > > > > > Collection Library (SCL), and python34 and now python36 > are > > > > > > > > > > available from > > > > > > > > > > EPEL. > > > > > > > > > > > > > > > > > > > > But packages built for the CentOS Storage SIG have never > > > used the > > > > > > > > > > SCL or > > > > > > > > > > EPEL (EPEL not allowed) and the shebangs in the .py > files are > > > > > > > > > > converted > > > > > > > > > > from /usr/bin/python3 to /usr/bin/python2 during the > rpmbuild > > > > > > > > > > %prep stage. > > > > > > > > > > All the python dependencies for the packages remain the > > > python2 > > > > > > > > > > flavors. > > > > > > > > > > AFAIK the centos-regression machines ought to be > building the > > > > > > > > > > same way. > > > > > > > > > > > > > > > > > > Indeed, there should not be a requirement on having EPEL > > > enabled on > > > > > > > > > the > > > > > > > > > CentOS-7 builders. At least not for the building of the > > > glusterfs > > > > > > > > > tarball. We still need to do releases of glusterfs-4.1 and > > > > > > > > > glusterfs-5, > > > > > > > > > until then it is expected to have python2 as the (only?) > > > version > > > > > > > > > for the > > > > > > > > > system. Is it possible to remove python3 from the CentOS-7 > > > builders > > > > > > > > > and > > > > > > > > > run the jobs that require python3 on the Fedora builders > > > instead? > > > > > > > > > > > > > > > > Actually, if the python-devel package for python3 is > installed > > > on the > > > > > > > > CentOS-7 builders, things may work too. It still feels like > some > > > sort > > > > > > > > of > > > > > > > > Frankenstein deployment, and we don't expect to this see in > > > > > > > > production > > > > > > > > environments. But maybe this is a workaround in case > something > > > > > > > > really, > > > > > > > > really, REALLY depends on python3 on the builders. > > > > > > > > > > > > > > To be honest, people would be surprised what happen in > production > > > > > > > around (sysadmins tend to discuss around, we all have horrors > > > stories, > > > > > > > stuff that were supposed to be cleaned and wasn't, etc) > > > > > > > > > > > > > > After all, "frankenstein deployment now" is better than > "perfect > > > > > > > later", especially since lots of IT departements are under > constant > > > > > > > pressure (so that's more "perfect never"). > > > > > > > > > > > > > > I can understand that we want clean and simple code (who > doesn't), > > > but > > > > > > > real life is much messier than we want to admit, so we need > > > something > > > > > > > robust. 
> > > > > > > > > > > > > > -- > > > > > > > Michael Scherer > > > > > > > Sysadmin, Community Infrastructure > > > > > > > > > > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > > > > > > > > > Community Meeting Calendar: > > > > > > > > > > > > > > APAC Schedule - > > > > > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > > > > > Bridge: https://bluejeans.com/836554017 > > > > > > > > > > > > > > NA/EMEA Schedule - > > > > > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > > > > > Bridge: https://bluejeans.com/486278655 > > > > > > > > > > > > > > Gluster-devel mailing list > > > > > > > Gluster-devel at gluster.org > > > > > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > > > Amar Tumballi (amarts) > > > > > > > > > > > > > > > > > -- > > > > Amar Tumballi (amarts) > > > > > > > > > -- > > Amar Tumballi (amarts) > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kkeithle at redhat.com Thu Jun 20 10:57:28 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Thu, 20 Jun 2019 06:57:28 -0400 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: On Thu, Jun 20, 2019 at 5:27 AM Amar Tumballi Suryanarayan < atumball at redhat.com> wrote: > On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos wrote: > >> On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan >> wrote: >> > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos wrote: > > > I noticed the issue of smoke is coming only from strfmt-errors job, which > checks for 'epel-6-i386' mock, and fails right now. > ... > But we are stuck with strfmt-errors job right now, and looking at what it > was intended to catch in first place, > ... > Would be great if disabling strfmt-errors is an option. > strfmt-errors checks that snprintf format stings are correct on both 32- and 64-bit platforms. Are you ready to drop support on 32-bit platforms? Some distributions are dropping 32-bit, but Fedora still supports i686 and armv7hl by default; it is possible to drop 32-bit on Fedora but there would be strong resistance to doing it I suspect. I also think it would be strange to drop it in the middle of a release stream. If you want to drop it for, say, release-7 that'd be a good time to do it. strfmt-errors isn't failing generally, AFAICT. The last failure is on a release-5 branch build. Since the strfmt-errors runs on a CentOS machine I suspect that it's the same issue we have with centos-regression, which really goes back to the (misguided IMO) decision to put EPEL and python3 on the centos builders. misc sent me a list of all the things that "need" python3. Some/Many/All of them are for things that run on fedora, e.g. clang-format. 
Everything was, AFAICT, working fine right up to when EPEL and python3 were installed on the centos builders. If it was my decision, I'd undo that change. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndevos at redhat.com Thu Jun 20 11:05:42 2019 From: ndevos at redhat.com (Niels de Vos) Date: Thu, 20 Jun 2019 13:05:42 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> <20190620094901.GB13895@ndevos-x270> Message-ID: <20190620110542.GC13895@ndevos-x270> On Thu, Jun 20, 2019 at 03:42:06PM +0530, Deepshikha Khandelwal wrote: > On Thu, Jun 20, 2019 at 3:20 PM Niels de Vos wrote: > > > On Thu, Jun 20, 2019 at 02:56:51PM +0530, Amar Tumballi Suryanarayan wrote: > > > On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos wrote: > > > > > > > On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi Suryanarayan > > wrote: > > > > > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos > > wrote: > > > > > > > > > > > On Thu, Jun 20, 2019 at 11:36:46AM +0530, Amar Tumballi > > Suryanarayan > > > > wrote: > > > > > > > Considering python3 is anyways the future, I vote for taking the > > > > patch we > > > > > > > did in master for fixing regression tests with python3 into the > > > > release-6 > > > > > > > and release-5 branch and getting over this deadlock. > > > > > > > > > > > > > > Patch in discussion here is > > > > > > > https://review.gluster.org/#/c/glusterfs/+/22829/ and if anyone > > > > > > notices, it > > > > > > > changes only the files inside 'tests/' directory, which is not > > > > packaged > > > > > > in > > > > > > > a release anyways. > > > > > > > > > > > > > > Hari, can we get the backport of this patch to both the release > > > > branches? > > > > > > > > > > > > When going this route, you still need to make sure that the > > > > > > python3-devel package is available on the CentOS-7 builders. And I > > > > > > don't know if installing that package is already sufficient, maybe > > the > > > > > > backport is not even needed in that case. > > > > > > > > > > > > > > > > > I was thinking, having this patch makes it compatible with both > > python2 > > > > and > > > > > python3, so technically, it allows us to move to Fedora30 if we need > > to > > > > run > > > > > regression there. (and CentOS7 with only python2). > > > > > > > > > > The above patch made it compatible, not mandatory to have python3. > > So, > > > > > treating it as a bug fix. > > > > > > > > Well, whatever Python is detected (python3 has preference over > > python2), > > > > needs to have the -devel package available too. Detection is done by > > > > probing the python executable. The Matching header files from -devel > > > > need to be present in order to be able to build glupy (and others?). > > > > > > > > I do not think compatibility for python3/2 is the problem while > > > > building the tarball. > > > > > > > > > Got it! True. Compatibility is not the problem to build the tarball. > > > > > > I noticed the issue of smoke is coming only from strfmt-errors job, which > > > checks for 'epel-6-i386' mock, and fails right now. > > > > > > The backport might become relevant while running > > > > tests on environments where there is no python2. 
> > > > > > > > > > > Backport is very important if we are running in a system where we have > > only > > > python3. Hence my proposal to include it in releases. > > > > I am sure CentOS-7 still has python2. The newer python3 only gets pulled > > in by some additional packages that get installed from EPEL. > > > > > But we are stuck with strfmt-errors job right now, and looking at what it > > > was intended to catch in first place, mostly our > > > https://build.gluster.org/job/32-bit-build-smoke/ would be doing same. > > If > > > that is the case, we can remove the job altogether. Also note, this job > > is > > > known to fail many smokes with 'Build root is locked by another process' > > > errors. > > > > This error means that there are multiple concurrent jobs running 'mock' > > with this buildroot. That should not happen and is a configuration error > > in one or more Jenkins jobs. > > Adding to this, this error occurs when the last running job using mock has > been aborted and no proper cleaning/killing in the build root has happened. > I'm planning to call up a cleanup function on abort. Ah, right, that is a possibility too. Jobs should cleanup after themselves and if that is not happening, it is a bug in the job (or missing cleanup on boot). > > > Would be great if disabling strfmt-errors is an option. > > > > I think both jobs do different things. The smoke is functional, where as > > strfmt-errors catches incorrect string formatting (some maintainers > > assume always 64-bit, everywhere) that has been missed in reviews. > > > Is there any specific reason to use 64-bit for strfmt-errors? No, we still support several 32-bit architectures. > Also I have a doubt here, if it needs python3-devel package to build glupy > it should have failed for basic smoke testing where we do source build > install? I do not know if smoke enables/disables building of glupy. Niels From mscherer at redhat.com Thu Jun 20 11:39:06 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 20 Jun 2019 13:39:06 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: Le jeudi 20 juin 2019 ? 06:57 -0400, Kaleb Keithley a ?crit : > On Thu, Jun 20, 2019 at 5:27 AM Amar Tumballi Suryanarayan < > atumball at redhat.com> wrote: > > > On Thu, Jun 20, 2019 at 2:35 PM Niels de Vos > > wrote: > > > > > On Thu, Jun 20, 2019 at 02:11:21PM +0530, Amar Tumballi > > > Suryanarayan > > > wrote: > > > > On Thu, Jun 20, 2019 at 1:13 PM Niels de Vos > > > > wrote: > > > > > > I noticed the issue of smoke is coming only from strfmt-errors job, > > which > > checks for 'epel-6-i386' mock, and fails right now. > > ... > > But we are stuck with strfmt-errors job right now, and looking at > > what it > > was intended to catch in first place, > > ... > > Would be great if disabling strfmt-errors is an option. > > > > strfmt-errors checks that snprintf format stings are correct on both > 32- > and 64-bit platforms. > > Are you ready to drop support on 32-bit platforms? Some distributions > are > dropping 32-bit, but Fedora still supports i686 and armv7hl by > default; it > is possible to drop 32-bit on Fedora but there would be strong > resistance > to doing it I suspect. 
I also think it would be strange to drop it > in the > middle of a release stream. If you want to drop it for, say, release- > 7 > that'd be a good time to do it. > > strfmt-errors isn't failing generally, AFAICT. The last failure is on > a > release-5 branch build. Since the strfmt-errors runs on a CentOS > machine I > suspect that it's the same issue we have with centos-regression, > which > really goes back to the (misguided IMO) decision to put EPEL and > python3 on > the centos builders. > > misc sent me a list of all the things that "need" python3. > Some/Many/All of > them are for things that run on fedora, e.g. clang- > format. Everything was, > AFAICT, working fine right up to when EPEL and python3 were installed > on > the centos builders. If it was my decision, I'd undo that change. The biggest problem is that mock do pull python3. -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From kkeithle at redhat.com Thu Jun 20 12:38:27 2019 From: kkeithle at redhat.com (Kaleb Keithley) Date: Thu, 20 Jun 2019 08:38:27 -0400 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: On Thu, Jun 20, 2019 at 7:39 AM Michael Scherer wrote: > Le jeudi 20 juin 2019 ? 06:57 -0400, Kaleb Keithley a ?crit : > > AFAICT, working fine right up to when EPEL and python3 were installed > > on > > the centos builders. If it was my decision, I'd undo that change. > > The biggest problem is that mock do pull python3. > > That's mock on Fedora ? to run a build in a centos-i386 chroot. Fedora already has python3. I don't see how that can affect what's running in the mock chroot. Is the build inside mock also installing EPEL and python3 somehow? Now? If so, why? And maybe the solution for centos regressions is to run those in mock, with a centos-x86_64 chroot. Without EPEL or python3. -- Kaleb -------------- next part -------------- An HTML attachment was scrubbed... URL: From mscherer at redhat.com Thu Jun 20 13:05:56 2019 From: mscherer at redhat.com (Michael Scherer) Date: Thu, 20 Jun 2019 15:05:56 +0200 Subject: [Gluster-devel] Removing glupy from release 5.7 In-Reply-To: References: <20190612151142.GL8725@ndevos-x270> <20190613090825.GN8725@ndevos-x270> <20190613122837.GS8725@ndevos-x270> <61c99ac170cc004a7f90897ff9f47cf7facdbc12.camel@redhat.com> <20190620074335.GA12566@ndevos-x270> <20190620090508.GA13895@ndevos-x270> Message-ID: Le jeudi 20 juin 2019 ? 08:38 -0400, Kaleb Keithley a ?crit : > On Thu, Jun 20, 2019 at 7:39 AM Michael Scherer > wrote: > > > Le jeudi 20 juin 2019 ? 06:57 -0400, Kaleb Keithley a ?crit : > > > AFAICT, working fine right up to when EPEL and python3 were > > > installed > > > on > > > the centos builders. If it was my decision, I'd undo that > > > change. > > > > The biggest problem is that mock do pull python3. > > > > > > That's mock on Fedora ? to run a build in a centos-i386 chroot. > Fedora > already has python3. I don't see how that can affect what's running > in the > mock chroot. 
I am not sure we are talking about the same thing, but mock, the rpm package from EPEL 7, do pull python 3: $ cat /etc/redhat-release; rpm -q --requires mock |grep 'python(abi' Red Hat Enterprise Linux Server release 7.6 (Maipo) python(abi) = 3.6 So we do have python3 installed on the Centos 7 builders (and was after a upgrade), and we are not going to remove it, because we use mock for a lot of stuff. And again, if the configure script is detecting the wrong version of python, the fix is not to remove the version of python for the builders, the fix is to detect the right version of python, or at least, permit to people to bypass the detection. > Is the build inside mock also installing EPEL and python3 somehow? > Now? If so, why? No, I doubt but then, if we are using a chroot, the package installed on the builders shouldn't matter, since that's a chroot. So I am kinda being lost. > And maybe the solution for centos regressions is to run those in > mock, with a centos-x86_64 chroot. Without EPEL or python3. That would likely requires a big refactor of the setup, since we have to get the data out of specific place, etc. We would also need to reinstall the builders to set partitions in a different way, with a bigger / and/or give more space for /var/lib/mock. I do not see that happening fast, and if my hypothesis of a issue in configure is right, then fixing seems the faster way to avoid the issue. -- Michael Scherer Sysadmin, Community Infrastructure -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 836 bytes Desc: This is a digitally signed message part URL: From jthottan at redhat.com Fri Jun 21 03:58:52 2019 From: jthottan at redhat.com (Jiffin Thottan) Date: Thu, 20 Jun 2019 23:58:52 -0400 (EDT) Subject: [Gluster-devel] Quick question about the latest glusterfs and client side selinux support In-Reply-To: <9D9C2802-539E-4153-81FF-4A0F8E934E27@gtri.gatech.edu> References: <6D60E9DD-C4E1-4B95-9277-C4F746DB228C@gtri.gatech.edu> <62539ffb-4b65-1733-6151-fc9b2c604254@redhat.com> <9D9C2802-539E-4153-81FF-4A0F8E934E27@gtri.gatech.edu> Message-ID: <1430982045.13733997.1561089532953.JavaMail.zimbra@redhat.com> Hi Janak, Currently, it is supported in glusterfs(from 2.8 onwards) and cephfs(already there in 2.7) for nfs-ganesha. -- Jiffin ----- Original Message ----- From: "Janak Desai" To: "Jiffin Tony Thottan" Sent: Thursday, June 20, 2019 9:29:09 PM Subject: Re: Quick question about the latest glusterfs and client side selinux support Hi Jiffin, I came across your presentation ?NFS-Ganesha Weather Report? that you gave at the FOSDEM?19 in early Feb this year. In that you mentioned that ongoing developments in v2.8 include ?labelled NFS? support. I see that v2.8 is now out.? Do you know if labelled NFS support made it in?? If it did, is it only supported in CEPHFS FSAL or any other FSALs also include the support for it? I took a cursory look at the release documents and didn?t see Labelled NFS in it, so thought I would bug you directly. Thanks. -Janak From: Jiffin Tony Thottan Date: Tuesday, August 28, 2018 at 12:50 AM To: Janak Desai , "ndevos at redhat.com" , "mselvaga at redhat.com" Cc: "paul at paul-moore.com" Subject: Re: Quick question about the latest glusterfs and client side selinux support Hi Janak, Thanks for the interest. Basic selinux xlator is present at gluster server stack. It stores selinux context at the backend as a xattr. 
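SELinux labels travel through the VFS as the security.selinux extended attribute, so 'stores the context as a xattr' really is just ordinary xattr traffic. The stand-alone C sketch below is a hypothetical illustration of reading such a label through the xattr interface; it is not gluster code, and the on-disk key the xlator uses on the brick may differ:

/* getcon.c - print the SELinux label stored in the security.selinux xattr.
 * Hypothetical illustration; build with: gcc getcon.c -o getcon */
#include <stdio.h>
#include <sys/types.h>
#include <sys/xattr.h>

int main(int argc, char **argv)
{
    const char *path = (argc > 1) ? argv[1] : ".";
    char label[256];

    /* lgetxattr() does not follow symlinks; it returns the raw label bytes,
     * e.g. "system_u:object_r:etc_t:s0" (termination is not guaranteed). */
    ssize_t len = lgetxattr(path, "security.selinux", label, sizeof(label) - 1);
    if (len < 0) {
        perror("lgetxattr(security.selinux)");
        return 1;
    }
    label[len] = '\0';
    printf("%s: %s\n", path, label);
    return 0;
}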
When we developed that xlator, at that point they were no client to test the functionality. Don't know whether required change in fuse got merged or not. As you mentioned ,here first we need to figure out whether issue is related to server. Can collect the packet trace using tcpdump from client and sent with mail during setting/getting selinux context. Regards, Jiffin On Tuesday 28 August 2018 04:14 AM, Desai, Janak wrote: Hi Niels, Manikandan, Jiffin, I work for Georgia Tech Research Institute?s CIPHER Lab and am investigating suitability of glusterfs for a couple of large upcoming projects. My ?google research? is yielding confusing and inconclusive results, so I thought I would try and reach out to some of the core developers to get some clarity. We use SELinux extensively in our software solution. I am trying to find out if, with the latest version 4.1 of glusterfs running on the latest version of rhel, I should be able to associate and enforce selinux contexts from glusterfs clients. I see in the 3.11 release notes that the selinux feature was implemented but then I also see references to kernel work that is not done yet. I also could not find any documentation/examples on how to add/integrate this selinux translator to setup and enforce selinux labels from the client side. In my simple test setup, which I mounted using the ?selinux? option (which gluster does seem to recognize), I am getting the ?operation not supported? error. I guess either I am not pulling in the selinux translator or I am running up against other missing functionality in the kernel. I would really appreciate if you could clear this up for me. If I am not configuring my mount correctly, I would appreciate if you could point me to a document or an example. Our other option is lustre filesystem since it does have a working client side association and enforcement of selinux contexts. However, lustre appears to be lot difficult to setup and maintain and I would rather use glusterfs. We need a distributed (or parallel) filesystem that can work with Hadoop. If glusterfs doesn?t pan out then I will look at labelled nfs 4.2 that is now available in rhel7. However, my google research shows much more Hadoop affinity for glusterfs than nfs v4. I am also copying Paul Moore, with whom I collaborated a few years ago as part of the team that took Linux through its common criteria evaluation, and who I haven?t bugged lately ?, to see if he can shed some light any missing kernel dependencies. I am currently testing with rhel7.5, but would be willing to try upstream kernel if have to get this proof of concept going. I know the underlying problem in the kernel is supporting extended attrs on FUSE file systems, but was wondering (and hoping) that at least setup/enforcement of selinux contexts from client side for glusterfs is possible. Thanks. -Janak From kirr at nexedi.com Sun Jun 23 07:26:37 2019 From: kirr at nexedi.com (Kirill Smelkov) Date: Sun, 23 Jun 2019 07:26:37 +0000 Subject: [Gluster-devel] [PATCH, RESEND] fuse: require /dev/fuse reads to have enough buffer capacity (take 2) In-Reply-To: Message-ID: <20190623072619.31037-1-kirr@nexedi.com> [ This retries commit d4b13963f217 which was reverted in 766741fcaa1f. In this version we require only `sizeof(fuse_in_header) + sizeof(fuse_write_in)` instead of 4K for FUSE request header room, because, contrary to libfuse and kernel client behaviour, GlusterFS actually provides only so much room for request header. 
] A FUSE filesystem server queues /dev/fuse sys_read calls to get filesystem requests to handle. It does not know in advance what would be that request as it can be anything that client issues - LOOKUP, READ, WRITE, ... Many requests are short and retrieve data from the filesystem. However WRITE and NOTIFY_REPLY write data into filesystem. Before getting into operation phase, FUSE filesystem server and kernel client negotiate what should be the maximum write size the client will ever issue. After negotiation the contract in between server/client is that the filesystem server then should queue /dev/fuse sys_read calls with enough buffer capacity to receive any client request - WRITE in particular, while FUSE client should not, in particular, send WRITE requests with > negotiated max_write payload. FUSE client in kernel and libfuse historically reserve 4K for request header. However an existing filesystem server - GlusterFS - was found which reserves only 80 bytes for header room (= `sizeof(fuse_in_header) + sizeof(fuse_write_in)`). https://lore.kernel.org/linux-fsdevel/20190611202738.GA22556 at deco.navytux.spb.ru/ https://github.com/gluster/glusterfs/blob/v3.8.15-0-gd174f021a/xlators/mount/fuse/src/fuse-bridge.c#L4894 Since `sizeof(fuse_in_header) + sizeof(fuse_write_in)` == `sizeof(fuse_in_header) + sizeof(fuse_read_in)` == `sizeof(fuse_in_header) + sizeof(fuse_notify_retrieve_in)` is the absolute minimum any sane filesystem should be using for header room, the contract is that filesystem server should queue sys_reads with `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + max_write buffer. If the filesystem server does not follow this contract, what can happen is that fuse_dev_do_read will see that request size is > buffer size, and then it will return EIO to client who issued the request but won't indicate in any way that there is a problem to filesystem server. This can be hard to diagnose because for some requests, e.g. for NOTIFY_REPLY which mimics WRITE, there is no client thread that is waiting for request completion and that EIO goes nowhere, while on filesystem server side things look like the kernel is not replying back after successful NOTIFY_RETRIEVE request made by the server. We can make the problem easy to diagnose if we indicate via error return to filesystem server when it is violating the contract. This should not practically cause problems because if a filesystem server is using shorter buffer, writes to it were already very likely to cause EIO, and if the filesystem is read-only it should be too following FUSE_MIN_READ_BUFFER minimum buffer size. Please see [1] for context where the problem of stuck filesystem was hit for real (because kernel client was incorrectly sending more than max_write data with NOTIFY_REPLY; see also previous patch), how the situation was traced and for more involving patch that did not make it into the tree. [1] https://marc.info/?l=linux-fsdevel&m=155057023600853&w=2 Signed-off-by: Kirill Smelkov Tested-by: Sander Eikelenboom Cc: Han-Wen Nienhuys Cc: Jakob Unterwurzacher --- fs/fuse/dev.c | 20 ++++++++++++++++++++ 1 file changed, 20 insertions(+) diff --git a/fs/fuse/dev.c b/fs/fuse/dev.c index ea8237513dfa..b2b2344eadcf 100644 --- a/fs/fuse/dev.c +++ b/fs/fuse/dev.c @@ -1317,6 +1317,26 @@ static ssize_t fuse_dev_do_read(struct fuse_dev *fud, struct file *file, unsigned reqsize; unsigned int hash; + /* + * Require sane minimum read buffer - that has capacity for fixed part + * of any request header + negotiated max_write room for data. 
If the + * requirement is not satisfied return EINVAL to the filesystem server + * to indicate that it is not following FUSE server/client contract. + * Don't dequeue / abort any request. + * + * Historically libfuse reserves 4K for fixed header room, but e.g. + * GlusterFS reserves only 80 bytes + * + * = `sizeof(fuse_in_header) + sizeof(fuse_write_in)` + * + * which is the absolute minimum any sane filesystem should be using + * for header room. + */ + if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, + sizeof(struct fuse_in_header) + sizeof(struct fuse_write_in) + + fc->max_write)) + return -EINVAL; + restart: spin_lock(&fiq->waitq.lock); err = -EAGAIN; -- 2.20.1 From jenkins at build.gluster.org Mon Jun 24 01:45:02 2019 From: jenkins at build.gluster.org (jenkins at build.gluster.org) Date: Mon, 24 Jun 2019 01:45:02 +0000 (UTC) Subject: [Gluster-devel] Weekly Untriaged Bugs Message-ID: <1022785431.22.1561340702849.JavaMail.jenkins@jenkins-el7.rht.gluster.org> [...truncated 7 lines...] https://bugzilla.redhat.com/1722708 / bitrot: WORM: Segmentation Fault if bitrot stub do signature https://bugzilla.redhat.com/1722709 / bitrot: WORM: Segmentation Fault if bitrot stub do signature https://bugzilla.redhat.com/1719778 / core: build fails for every patch on release 5 https://bugzilla.redhat.com/1714851 / core: issues with 'list.h' elements in clang-scan https://bugzilla.redhat.com/1721842 / core: Spelling errors in 6.3 https://bugzilla.redhat.com/1722390 / glusterd: "All subvolumes are down" when all bricks are online https://bugzilla.redhat.com/1722187 / glusterd: Glusterd Seg faults (sig 11) when RDMA used with MLNX_OFED https://bugzilla.redhat.com/1718741 / glusterfind: GlusterFS having high CPU https://bugzilla.redhat.com/1716875 / gluster-smb: Inode Unref Assertion failed: inode->ref https://bugzilla.redhat.com/1716455 / gluster-smb: OS X error -50 when creating sub-folder on Samba share when using Gluster VFS https://bugzilla.redhat.com/1716440 / gluster-smb: SMBD thread panics when connected to from OS X machine https://bugzilla.redhat.com/1720733 / libglusterfsclient: glusterfs 4.1.7 client crash https://bugzilla.redhat.com/1714895 / libglusterfsclient: Glusterfs(fuse) client crash https://bugzilla.redhat.com/1717824 / locks: Fencing: Added the tcmu-runner ALUA feature support but after one of node is rebooted the glfs_file_lock() get stucked https://bugzilla.redhat.com/1718562 / locks: flock failure (regression) https://bugzilla.redhat.com/1719174 / project-infrastructure: broken regression link? https://bugzilla.redhat.com/1719388 / project-infrastructure: infra: download.gluster.org /var/www/html/... is out of free space https://bugzilla.redhat.com/1721353 / project-infrastructure: Run 'line-coverage' regression runs on a latest fedora machine (say fedora30). https://bugzilla.redhat.com/1720453 / project-infrastructure: Unable to access review.gluster.org https://bugzilla.redhat.com/1721462 / quota: Quota limits not honored writes allowed past quota limit. [...truncated 2 lines...] -------------- next part -------------- A non-text attachment was scrubbed... 
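Stepping back to the FUSE patch above: the contract it enforces is easiest to see from the server side of /dev/fuse. Below is a hypothetical, minimal sketch (not libfuse and not the GlusterFS fuse bridge) of a request loop sized the way the commit message describes, i.e. fixed header room plus the negotiated max_write:

/* fuse_bufsize.c - hypothetical illustration of the read-buffer contract.
 * Build: gcc fuse_bufsize.c -o fuse_bufsize */
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <linux/fuse.h>

/* Minimum buffer a server must pass to read(2) on /dev/fuse:
 * fixed header room + the max_write negotiated during FUSE_INIT. */
static size_t min_request_buf(size_t max_write)
{
    return sizeof(struct fuse_in_header) +
           sizeof(struct fuse_write_in) + max_write;
}

/* Sketch of a request loop; 'fd' would be an already-mounted /dev/fuse. */
static void serve(int fd, size_t max_write)
{
    size_t bufsize = min_request_buf(max_write);
    char *buf = malloc(bufsize);

    while (buf) {
        ssize_t len = read(fd, buf, bufsize);  /* one complete request per read */
        if (len <= 0)
            break;                             /* e.g. ENODEV after unmount */
        struct fuse_in_header *in = (struct fuse_in_header *)buf;
        (void)in;  /* dispatch on in->opcode: FUSE_LOOKUP, FUSE_WRITE, ... */
    }
    free(buf);
}

int main(void)
{
    /* 40 + 40 = 80 bytes with current uapi headers, the figure quoted above. */
    printf("fixed header room: %zu bytes\n",
           sizeof(struct fuse_in_header) + sizeof(struct fuse_write_in));
    printf("with 128 KiB max_write: %zu byte read buffer\n",
           min_request_buf(128 * 1024));
    (void)serve;  /* the loop above is shown for illustration only */
    return 0;
}

A server that hands read(2) anything smaller is exactly the case the patch now answers with an explicit EINVAL instead of the old silent EIO towards the client.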
Name: build.log Type: application/octet-stream Size: 2413 bytes Desc: not available URL: From hgowtham at redhat.com Mon Jun 24 09:34:28 2019 From: hgowtham at redhat.com (hgowtham at redhat.com) Date: Mon, 24 Jun 2019 09:34:28 +0000 Subject: [Gluster-devel] Invitation: Gluster Community Meeting (APAC friendly hours) @ Tue Jun 25, 2019 11:30am - 12:30pm (IST) (gluster-devel@gluster.org) Message-ID: <000000000000c553f0058c0e848b@google.com> You have been invited to the following event. Title: Gluster Community Meeting (APAC friendly hours) Hi all, This is the biweekly Gluster community meeting that is hosted to collaborate and make the community better. Please do join the discussion. Bridge: https://bluejeans.com/836554017 Minutes meeting: https://hackmd.io/PEnYhQziQsyBwhMksbRWUw Previous Meeting notes: https://github.com/gluster/community Regards, Hari. When: Tue Jun 25, 2019 11:30am ? 12:30pm India Standard Time - Kolkata Calendar: gluster-devel at gluster.org Who: * hgowtham at redhat.com - organizer * gluster-users * gluster-devel Event details: https://www.google.com/calendar/event?action=VIEW&eid=N3IzZ3FtanYyaHIwNWhqaDhuaW5nN3ZuMHEgZ2x1c3Rlci1kZXZlbEBnbHVzdGVyLm9yZw&tok=MTkjaGdvd3RoYW1AcmVkaGF0LmNvbWU1ZWM5ZDUzZjBlOWMwMDA3NDkyMWMzN2YxZjY2ZmY1MDU2NmRjNGU&ctz=Asia%2FKolkata&hl=en&es=0 Invitation from Google Calendar: https://www.google.com/calendar/ You are receiving this courtesy email at the account gluster-devel at gluster.org because you are an attendee of this event. To stop receiving future updates for this event, decline this event. Alternatively you can sign up for a Google account at https://www.google.com/calendar/ and control your notification settings for your entire calendar. Forwarding this invitation could allow any recipient to send a response to the organizer and be added to the guest list, or invite others regardless of their own invitation status, or to modify your RSVP. Learn more at https://support.google.com/calendar/answer/37135#forwarding -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/calendar Size: 1842 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: invite.ics Type: application/ics Size: 1880 bytes Desc: not available URL: From anoopcs at cryptolab.net Mon Jun 24 12:34:40 2019 From: anoopcs at cryptolab.net (Anoop C S) Date: Mon, 24 Jun 2019 18:04:40 +0530 Subject: [Gluster-devel] CI failure - NameError: name 'unicode' is not defined (related to changelogparser.py) In-Reply-To: References: Message-ID: On Fri, 2019-06-07 at 10:24 +0530, Deepshikha Khandelwal wrote: > Hi Yaniv, > > We are working on this. The builders are picking up python3.6 which > is leading to modules missing and such undefined errors. > > Kotresh has sent a patch > https://review.gluster.org/#/c/glusterfs/+/22829/ to fix the issue. Can we have this backported to release-6 branch? As of now all patches posted against release-6 branch are failing[1] on tests/basic/changelog/changelog-rename.t [1] https://review.gluster.org/q/project:glusterfs+branch:release-6+status:open+label:%2522CentOS-regression-1%2522 > On Thu, Jun 6, 2019 at 11:49 AM Yaniv Kaul wrote: > > From [1]. > > > > I think it's a Python2/3 thing, so perhaps a CI issue additionally > > (though if our code is not Python 3 ready, let's ensure we use > > Python 2 explicitly until we fix this). 
> > > > 00:47:05.207 ok 14 [ 13/ 386] < 34> 'gluster --mode=script > > --wignore volume start patchy' > > 00:47:05.207 ok 15 [ 13/ 70] < 36> '_GFS --attribute- > > timeout=0 --entry-timeout=0 --volfile-id=patchy --volfile- > > server=builder208.int.aws.gluster.org /mnt/glusterfs/0' > > 00:47:05.207 Traceback (most recent call last): > > 00:47:05.207 File > > "./tests/basic/changelog/../../utils/changelogparser.py", line 233, > > in > > 00:47:05.207 parse(sys.argv[1]) > > 00:47:05.207 File > > "./tests/basic/changelog/../../utils/changelogparser.py", line 221, > > in parse > > 00:47:05.207 process_record(data, tokens, changelog_ts, > > callback) > > 00:47:05.207 File > > "./tests/basic/changelog/../../utils/changelogparser.py", line 178, > > in process_record > > 00:47:05.207 callback(record) > > 00:47:05.207 File > > "./tests/basic/changelog/../../utils/changelogparser.py", line 182, > > in default_callback > > 00:47:05.207 sys.stdout.write(u"{0}\n".format(record)) > > 00:47:05.207 File > > "./tests/basic/changelog/../../utils/changelogparser.py", line 128, > > in __str__ > > 00:47:05.207 return unicode(self).encode('utf-8') > > 00:47:05.207 NameError: name 'unicode' is not defined > > 00:47:05.207 not ok 16 [ 53/ 39] < 42> '2 > > check_changelog_op /d/backends/patchy0/.glusterfs/changelogs > > RENAME' -> 'Got "0" instead of "2"' > > > > > > Y. > > > > [1] https://build.gluster.org/job/centos7-regression/6318/console > > _______________________________________________ > > > > Community Meeting Calendar: > > > > APAC Schedule - > > Every 2nd and 4th Tuesday at 11:30 AM IST > > Bridge: https://bluejeans.com/836554017 > > > > NA/EMEA Schedule - > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > Bridge: https://bluejeans.com/486278655 > > > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > From dkhandel at redhat.com Tue Jun 25 05:10:57 2019 From: dkhandel at redhat.com (Deepshikha Khandelwal) Date: Tue, 25 Jun 2019 10:40:57 +0530 Subject: [Gluster-devel] CI failure - NameError: name 'unicode' is not defined (related to changelogparser.py) In-Reply-To: References: Message-ID: Regression on release-5 branch is also failing because of this. Can we have backport Kotresh's patch https://review.gluster.org/#/c/glusterfs/+/22829/ to these branches. On Mon, Jun 24, 2019 at 6:06 PM Anoop C S wrote: > On Fri, 2019-06-07 at 10:24 +0530, Deepshikha Khandelwal wrote: > > Hi Yaniv, > > > > We are working on this. The builders are picking up python3.6 which > > is leading to modules missing and such undefined errors. > > > > Kotresh has sent a patch > > https://review.gluster.org/#/c/glusterfs/+/22829/ to fix the issue. > > Can we have this backported to release-6 branch? As of now all patches > posted against release-6 branch are failing[1] on > tests/basic/changelog/changelog-rename.t > [1] > > https://review.gluster.org/q/project:glusterfs+branch:release-6+status:open+label:%2522CentOS-regression-1%2522 > > > On Thu, Jun 6, 2019 at 11:49 AM Yaniv Kaul wrote: > > > From [1]. 
> > > > > > I think it's a Python2/3 thing, so perhaps a CI issue additionally > > > (though if our code is not Python 3 ready, let's ensure we use > > > Python 2 explicitly until we fix this). > > > > > > 00:47:05.207 ok 14 [ 13/ 386] < 34> 'gluster --mode=script > > > --wignore volume start patchy' > > > 00:47:05.207 ok 15 [ 13/ 70] < 36> '_GFS --attribute- > > > timeout=0 --entry-timeout=0 --volfile-id=patchy --volfile- > > > server=builder208.int.aws.gluster.org /mnt/glusterfs/0' > > > 00:47:05.207 Traceback (most recent call last): > > > 00:47:05.207 File > > > "./tests/basic/changelog/../../utils/changelogparser.py", line 233, > > > in > > > 00:47:05.207 parse(sys.argv[1]) > > > 00:47:05.207 File > > > "./tests/basic/changelog/../../utils/changelogparser.py", line 221, > > > in parse > > > 00:47:05.207 process_record(data, tokens, changelog_ts, > > > callback) > > > 00:47:05.207 File > > > "./tests/basic/changelog/../../utils/changelogparser.py", line 178, > > > in process_record > > > 00:47:05.207 callback(record) > > > 00:47:05.207 File > > > "./tests/basic/changelog/../../utils/changelogparser.py", line 182, > > > in default_callback > > > 00:47:05.207 sys.stdout.write(u"{0}\n".format(record)) > > > 00:47:05.207 File > > > "./tests/basic/changelog/../../utils/changelogparser.py", line 128, > > > in __str__ > > > 00:47:05.207 return unicode(self).encode('utf-8') > > > 00:47:05.207 NameError: name 'unicode' is not defined > > > 00:47:05.207 not ok 16 [ 53/ 39] < 42> '2 > > > check_changelog_op /d/backends/patchy0/.glusterfs/changelogs > > > RENAME' -> 'Got "0" instead of "2"' > > > > > > > > > Y. > > > > > > [1] https://build.gluster.org/job/centos7-regression/6318/console > > > _______________________________________________ > > > > > > Community Meeting Calendar: > > > > > > APAC Schedule - > > > Every 2nd and 4th Tuesday at 11:30 AM IST > > > Bridge: https://bluejeans.com/836554017 > > > > > > NA/EMEA Schedule - > > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > > Bridge: https://bluejeans.com/486278655 > > > > > > Gluster-devel mailing list > > > Gluster-devel at gluster.org > > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > > > > _______________________________________________ > > > > Community Meeting Calendar: > > > > APAC Schedule - > > Every 2nd and 4th Tuesday at 11:30 AM IST > > Bridge: https://bluejeans.com/836554017 > > > > NA/EMEA Schedule - > > Every 1st and 3rd Tuesday at 01:00 PM EDT > > Bridge: https://bluejeans.com/486278655 > > > > Gluster-devel mailing list > > Gluster-devel at gluster.org > > https://lists.gluster.org/mailman/listinfo/gluster-devel > > > > _______________________________________________ > > Community Meeting Calendar: > > APAC Schedule - > Every 2nd and 4th Tuesday at 11:30 AM IST > Bridge: https://bluejeans.com/836554017 > > NA/EMEA Schedule - > Every 1st and 3rd Tuesday at 01:00 PM EDT > Bridge: https://bluejeans.com/486278655 > > Gluster-devel mailing list > Gluster-devel at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From atumball at redhat.com Tue Jun 25 05:49:54 2019 From: atumball at redhat.com (Amar Tumballi Suryanarayan) Date: Tue, 25 Jun 2019 11:19:54 +0530 Subject: [Gluster-devel] [Gluster-infra] New workflow proposal for glusterfs repo In-Reply-To: References: Message-ID: Adding gluster-devel ML. 
Only concern to my earlier proposal was not making regression runs wait for reviews, but to be triggered automatically after successful smoke. The ask was to put burden on machines than on developers, which I agree to start with. Lets watch the expenses due to this change for a month once it gets implemented, and then take stock of the situation. For now, lets reduce one more extra work for developers, ie, marking Verified flag. On Tue, Jun 25, 2019 at 11:01 AM Sankarshan Mukhopadhyay < sankarshan.mukhopadhyay at gmail.com> wrote: > Amar, can you bring about an agreement/decision on this so that we can > make progress? > > So, My take is: Lets make serialized smoke + regression a reality. It may add to overall time, but if there are failures, this has potential to reduce overall machine usage... for a successful patch, the extra few minutes at present doesn't harm as many of our review avg time is around a week. > On Tue, Jun 25, 2019 at 10:55 AM Deepshikha Khandelwal > wrote: > > > > > > > > On Mon, Jun 24, 2019 at 5:30 PM Sankarshan Mukhopadhyay < > sankarshan.mukhopadhyay at gmail.com> wrote: > >> > >> Checking back on this - do we need more voices or, amendments to > >> Amar's original proposal before we scope the implementation? > >> > >> I read Amar's proposal as desiring an outcome where the journey of a > >> valid/good patch through the test flows is fast and efficient. > Absolutely! This is critical for us to be inclusive community. > >> > >> On Wed, Jun 12, 2019 at 11:58 PM Raghavendra Talur > wrote: > >> > > >> > > >> > > >> > On Wed, Jun 12, 2019, 1:56 PM Atin Mukherjee > wrote: > >> >> > >> >> > >> >> > >> >> On Wed, 12 Jun 2019 at 18:04, Amar Tumballi Suryanarayan < > atumball at redhat.com> wrote: > >> >>> > >> >>> > >> >>> Few bullet points: > >> >>> > >> >>> * Let smoke job sequentially for below, and if successful, in > parallel for others. > >> >>> - Sequential: > >> >>> -- clang-format check > >> >>> -- compare-bugzilla-version-git-branch > >> >>> -- bugzilla-post > >> >>> -- comment-on-issue > >> >>> -- fedora-smoke (mainly don't want warning). > >> >> > >> >> > >> >> +1 > >> >> > >> >>> - Parallel > >> >>> -- all devrpm jobs > >> >>> -- 32bit smoke > >> >>> -- freebsd-smoke > >> >>> -- smoke > >> >>> -- strfmt_errors > >> >>> -- python-lint, and shellcheck. > >> >> > >> >> > >> >> I?m sure there must be a reason but would like to know that why do > they need to be parallel? Can?t we have them sequentially to have similar > benefits of the resource utilisation like above? Or are all these > individual jobs are time consuming such that having them sequentially will > lead the overall smoke job to consume much longer? > Most of these are doing the same thing, make dist, make install, make rpms. but on different arch and with different flags. To start with, we can do these also sequentially. That way, infra team needn't worry about some parallel, some sequential jobs. > >> >> > >> >>> > >> >>> * Remove Verified flag. No point in one more extra button which > users need to click, anyways CentOS regression is considered as > 'Verification'. > >> > > >> > > >> > The requirement of verified flag by patch owner for regression to run > was added because the number of Jenkins machines we had were few and > patches being uploaded were many. > >> > >> However, do we consider that at present time the situation has > >> improved to consider the change Amar asks for? 
> >> > >> > > >> >>> > >> >>> * In a normal flow, let the CentOS regression, which currently runs after the > 'Verified' vote, be triggered on the first 'successful' +1 review vote. > >> >> > >> >> I believe some reviewers/maintainers (including me) would like to > see the regression vote before putting a +1/+2 on most of the patches until and > unless they are straightforward ones. So while with this you're > reducing the burden of one extra click for the patch owner, on the > other hand you're introducing the same burden on the reviewers who would > like to check the regression vote. IMHO, I don't see much benefit in > implementing this. > >> > > >> > > >> > Agree with Atin here. The burden should be on machines before people. > Reviewers prefer to look at patches that have passed regression. > >> > > >> > In github heketi, we have configured regression to run on all patches > that are submitted by the heketi developer group. If such configuration is > possible in gerrit+Jenkins, we should definitely do it that way. > >> > > >> > For patches that are submitted by someone outside of the developer > group, a maintainer should verify that the patch doesn't do anything > harmful and mark the regression to run. > >> > > >> > >> Deepshikha, is the above change feasible in the summation of Amar's > proposal? > > > > Yes, I'm planning to implement the regression & flag related changes > initially if everyone agrees. > >> > >> > I would say, let's get started on these changes. Regards, Amar > >> >>> > >> >>> * For those patches which got pushed to the system just to 'validate' > behavior, to run sample tests, or as WIP patches, continue to support the 'recheck > centos' comment message, so we can run without any vote. Let it not be the > norm. > >> >>> > >> >>> > >> >>> With this, I see that we can reduce smoke failures and utilize 90% less > resources for a patch which would fail smoke anyway (i.e., 95% of the smoke > failures would be caught in the first 10% of the resources, and time). > >> >>> > >> >>> Also we can reduce the number of regression runs, as review is > mandatory to run regression. > >> >>> > >> >>> These are just suggestions, happy to discuss more on these. > >> _______________________________________________ > >> Gluster-infra mailing list > >> Gluster-infra at gluster.org > >> https://lists.gluster.org/mailman/listinfo/gluster-infra > > > > -- > sankarshan mukhopadhyay > > _______________________________________________ > Gluster-infra mailing list > Gluster-infra at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-infra -- Amar Tumballi (amarts) -------------- next part -------------- An HTML attachment was scrubbed... URL: From snowmailer at gmail.com Mon Jun 3 16:58:06 2019 From: snowmailer at gmail.com (Martin) Date: Mon, 03 Jun 2019 16:58:06 -0000 Subject: [Gluster-devel] No healing on peer disconnect - is it correct? Message-ID: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> Hi all, I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have simple Replica 3 - Number of Bricks: 1 x 3 = 3. When one of my hypervisors is disconnected as peer, i.e. gluster process is down but bricks running, other two healthy nodes start signalling that they lost one peer. This is correct. Next, I restart gluster process on node where gluster process failed and I thought it should trigger healing of files on failed node but nothing is happening. I run VMs disks on this gluster volume.
No healing is triggered after gluster restart, remaining two nodes get peer back after restart of gluster and everything is running without down time. Even VMs that are running on 'failed' node where gluster process was down (bricks were up) are running without down time. Is this behaviour correct? I mean No healing is triggered after peer is reconnected back and VMs. Thanks for explanation. BR! Martin From hunter86_bg at yahoo.com Mon Jun 3 17:39:53 2019 From: hunter86_bg at yahoo.com (Strahil) Date: Mon, 03 Jun 2019 17:39:53 -0000 Subject: [Gluster-devel] [Gluster-users] No healing on peer disconnect - is it correct? Message-ID: Hi Martin, By default gluster will proactively start to heal every 10 min - so this is not OK. Usually, I do not wait for that to get triggered and I run gluster volume heal full (using replica 3 with sharding of 4 MB -> oVirt default). Best Regards, Strahil Nikolov On Jun 3, 2019 19:58, Martin wrote: > > Hi all, > > I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have simple Replica 3 - Number of Bricks: 1 x 3 = 3. > > When one of my hypervisors is disconnected as peer, i.e. gluster process is down but bricks running, other two healthy nodes start signalling that they lost one peer. This is correct. > Next, I restart gluster process on node where gluster process failed and I thought it should trigger healing of files on failed node but nothing is happening. > > I run VMs disks on this gluster volume. No healing is triggered after gluster restart, remaining two nodes get peer back after restart of gluster and everything is running without down time. > Even VMs that are running on 'failed' node where gluster process was down (bricks were up) are running without down time. > > Is this behaviour correct? I mean No healing is triggered after peer is reconnected back and VMs. > > Thanks for explanation. > > BR! > Martin > > _______________________________________________ > Gluster-users mailing list > Gluster-users at gluster.org > https://lists.gluster.org/mailman/listinfo/gluster-users From snowmailer at gmail.com Mon Jun 10 13:50:11 2019 From: snowmailer at gmail.com (snowmailer) Date: Mon, 10 Jun 2019 13:50:11 -0000 Subject: [Gluster-devel] No healing on peer disconnect - is it correct? In-Reply-To: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> Message-ID: <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com> Can someone advise on this, please? BR! On 3. 6. 2019 at 18:58, Martin wrote: > Hi all, > > I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have simple Replica 3 - Number of Bricks: 1 x 3 = 3. > > When one of my hypervisors is disconnected as peer, i.e. gluster process is down but bricks running, other two healthy nodes start signalling that they lost one peer. This is correct. > Next, I restart gluster process on node where gluster process failed and I thought it should trigger healing of files on failed node but nothing is happening. > > I run VMs disks on this gluster volume. No healing is triggered after gluster restart, remaining two nodes get peer back after restart of gluster and everything is running without down time.
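As a rough diagnostic for the behaviour described in this thread (peer back, bricks up, but no visible heal activity), the standard heal CLI can be queried and, if nothing is progressing, a full heal requested explicitly, which is what is suggested above. The volume name below is a placeholder.

import subprocess

VOLUME = "myvol"   # placeholder volume name


def gluster(*args):
    # Run a gluster CLI command and return its output as text.
    return subprocess.run(
        ["gluster", *args], check=True, capture_output=True, text=True
    ).stdout


# Entries listed here are files the self-heal daemon still has to process.
print(gluster("volume", "heal", VOLUME, "info"))

# If the list stays non-empty and no heal activity shows up, a full heal
# can be requested explicitly.
print(gluster("volume", "heal", VOLUME, "full"))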
> > Thanks for explanation. > > BR! > Martin > > From snowmailer at gmail.com Mon Jun 10 14:23:50 2019 From: snowmailer at gmail.com (Martin) Date: Mon, 10 Jun 2019 14:23:50 -0000 Subject: [Gluster-devel] [Gluster-users] No healing on peer disconnect - is it correct? In-Reply-To: References: <10D708D0-E523-46A0-91BF-FFC41886E316@gmail.com> <3B1EE351-5F82-4D05-947A-4960BBAC885A@gmail.com> Message-ID: My VMs use Gluster as storage through libgfapi support in Qemu. But I don't see any healing of the reconnected brick. Thanks Karthik / Ravishankar in advance! > On 10 Jun 2019, at 16:07, Hari Gowtham wrote: > > On Mon, Jun 10, 2019 at 7:21 PM snowmailer > wrote: >> >> Can someone advise on this, please? >> >> BR! >> >> On 3. 6. 2019 at 18:58, Martin wrote: >> >>> Hi all, >>> >>> I need someone to explain if my gluster behaviour is correct. I am not sure if my gluster works as it should. I have simple Replica 3 - Number of Bricks: 1 x 3 = 3. >>> >>> When one of my hypervisors is disconnected as peer, i.e. gluster process is down but bricks running, other two healthy nodes start signalling that they lost one peer. This is correct. >>> Next, I restart gluster process on node where gluster process failed and I thought it should trigger healing of files on failed node but nothing is happening. >>> >>> I run VMs disks on this gluster volume. No healing is triggered after gluster restart, remaining two nodes get peer back after restart of gluster and everything is running without down time. >>> Even VMs that are running on 'failed' node where gluster process was down (bricks were up) are running without down time. > > I assume your VMs use gluster as the storage. In that case, the > gluster volume might be mounted on all the hypervisors. > The mount/client is smart enough to give the correct data from the > other two machines which were always up. > This is the reason things are working fine. > > Gluster should heal the brick. > Adding people who can help you better with the heal part. > @Karthik Subrahmanya @Ravishankar N do take a look and answer this part. > >>> >>> Is this behaviour correct? I mean No healing is triggered after peer is reconnected back and VMs. >>> >>> Thanks for explanation. >>> >>> BR! >>> Martin >>> >>> >> _______________________________________________ >> Gluster-users mailing list >> Gluster-users at gluster.org >> https://lists.gluster.org/mailman/listinfo/gluster-users > > > > -- > Regards, > Hari Gowtham. -------------- next part -------------- An HTML attachment was scrubbed... URL: From miklos at szeredi.hu Tue Jun 11 11:52:26 2019 From: miklos at szeredi.hu (Miklos Szeredi) Date: Tue, 11 Jun 2019 11:52:26 -0000 Subject: [Gluster-devel] Linux 5.2-RC regression bisected, mounting glusterfs volumes fails after commit: fuse: require /dev/fuse reads to have enough buffer capacity In-Reply-To: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> Message-ID: On Tue, Jun 11, 2019 at 1:03 PM Sander Eikelenboom wrote: > > L.S., > > While testing a linux 5.2 kernel I noticed it fails to mount my glusterfs volumes.
> > It repeatedly fails with: > [2019-06-11 09:15:27.106946] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106955] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106963] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > [2019-06-11 09:15:27.106971] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > etc. > etc. > > Bisecting turned up as culprit: > commit d4b13963f217dd947da5c0cabd1569e914d21699: fuse: require /dev/fuse reads to have enough buffer capacity > > The glusterfs version i'm using is from Debian stable: > ii glusterfs-client 3.8.8-1 amd64 clustered file-system (client package) > ii glusterfs-common 3.8.8-1 amd64 GlusterFS common libraries and translator modules > > > A 5.1.* kernel works fine, as does a 5.2-rc4 kernel with said commit reverted. Thanks for the report, reverted the bad commit. Thanks, Miklos From kirr at nexedi.com Tue Jun 11 20:42:46 2019 From: kirr at nexedi.com (Kirill Smelkov) Date: Tue, 11 Jun 2019 20:42:46 -0000 Subject: [Gluster-devel] Linux 5.2-RC regression bisected, mounting glusterfs volumes fails after commit: fuse: require /dev/fuse reads to have enough buffer capacity In-Reply-To: References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> Message-ID: <20190611202738.GA22556@deco.navytux.spb.ru> On Tue, Jun 11, 2019 at 01:52:14PM +0200, Miklos Szeredi wrote: > On Tue, Jun 11, 2019 at 1:03 PM Sander Eikelenboom wrote: > > > > L.S., > > > > While testing a linux 5.2 kernel I noticed it fails to mount my glusterfs volumes. > > > > It repeatedly fails with: > > [2019-06-11 09:15:27.106946] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > > [2019-06-11 09:15:27.106955] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > > [2019-06-11 09:15:27.106963] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > > [2019-06-11 09:15:27.106971] W [fuse-bridge.c:4993:fuse_thread_proc] 0-glusterfs-fuse: read from /dev/fuse returned -1 (Invalid argument) > > etc. > > etc. > > > > Bisecting turned up as culprit: > > commit d4b13963f217dd947da5c0cabd1569e914d21699: fuse: require /dev/fuse reads to have enough buffer capacity > > > > The glusterfs version i'm using is from Debian stable: > > ii glusterfs-client 3.8.8-1 amd64 clustered file-system (client package) > > ii glusterfs-common 3.8.8-1 amd64 GlusterFS common libraries and translator modules > > > > > > A 5.1.* kernel works fine, as does a 5.2-rc4 kernel with said commit reverted. > > Thanks for the report, reverted the bad commit. First of all I'm sorry for breaking things here. The diff of the guilty commit is --- a/fs/fuse/dev.c +++ b/fs/fuse/dev.c @@ -1317,6 +1317,16 @@ static ssize_t fuse_dev_do_read(struct fuse_dev *fud, struct file *file, unsigned reqsize; unsigned int hash; + /* + * Require sane minimum read buffer - that has capacity for fixed part + * of any request header + negotated max_write room for data. If the + * requirement is not satisfied return EINVAL to the filesystem server + * to indicate that it is not following FUSE server/client contract. + * Don't dequeue / abort any request. 
+ */ + if (nbytes < max_t(size_t, FUSE_MIN_READ_BUFFER, 4096 + fc->max_write)) + return -EINVAL; + restart: spin_lock(&fiq->waitq.lock); err = -EAGAIN; and it was essentially requesting that the filesystem server provide 4K+ buffer for reads from /dev/fuse. That 4K was meant as space for FUSE request header, citing commit: Before getting into operation phase, FUSE filesystem server and kernel client negotiate what should be the maximum write size the client will ever issue. After negotiation the contract in between server/client is that the filesystem server then should queue /dev/fuse sys_read calls with enough buffer capacity to receive any client request - WRITE in particular, while FUSE client should not, in particular, send WRITE requests with > negotiated max_write payload. FUSE client in kernel and libfuse historically reserve 4K for request header. This way the contract is that filesystem server should queue sys_reads with 4K+max_write buffer. I could reproduce the problem and as it turns out what broke here is that glusterfs is using not 4K but a smaller room for header - 80 bytes for gluster-3.8 being `sizeof(fuse_in_header) + sizeof(fuse_write_in)`: https://github.com/gluster/glusterfs/blob/v3.8.15-0-gd174f021a/xlators/mount/fuse/src/fuse-bridge.c#L4894 Since `sizeof(fuse_in_header) + sizeof(fuse_write_in)` == `sizeof(fuse_in_header) + sizeof(fuse_read_in)` is the absolute minimum any sane filesystem should be using for header room, can we please restore the patch with that value instead of 4K? That patch was there in the first place to help diagnose stuck fuse servers much more easier, citing commit: If the filesystem server does not follow this contract, what can happen is that fuse_dev_do_read will see that request size is > buffer size, and then it will return EIO to client who issued the request but won't indicate in any way that there is a problem to filesystem server. This can be hard to diagnose because for some requests, e.g. for NOTIFY_REPLY which mimics WRITE, there is no client thread that is waiting for request completion and that EIO goes nowhere, while on filesystem server side things look like the kernel is not replying back after successful NOTIFY_RETRIEVE request made by the server. We can make the problem easy to diagnose if we indicate via error return to filesystem server when it is violating the contract. This should not practically cause problems because if a filesystem server is using shorter buffer, writes to it were already very likely to cause EIO, and if the filesystem is read-only it should be too following FUSE_MIN_READ_BUFFER minimum buffer size. Please see [1] for context where the problem of stuck filesystem was hit for real (because kernel client was incorrectly sending more than max_write data with NOTIFY_REPLY; see also previous patch), how the situation was traced and for more involving patch that did not make it into the tree. [1] https://marc.info/?l=linux-fsdevel&m=155057023600853&w=2 so it would be a pity to loose that property. Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for header room change be accepted? 
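To put numbers on the header-room question, here is an illustrative recalculation of the sizes involved; the struct layouts are assumed to match include/uapi/linux/fuse.h of this era, and 128 KiB is only a typical negotiated max_write, not a given.

import ctypes


class fuse_in_header(ctypes.Structure):
    _fields_ = [("len", ctypes.c_uint32), ("opcode", ctypes.c_uint32),
                ("unique", ctypes.c_uint64), ("nodeid", ctypes.c_uint64),
                ("uid", ctypes.c_uint32), ("gid", ctypes.c_uint32),
                ("pid", ctypes.c_uint32), ("padding", ctypes.c_uint32)]


class fuse_write_in(ctypes.Structure):
    _fields_ = [("fh", ctypes.c_uint64), ("offset", ctypes.c_uint64),
                ("size", ctypes.c_uint32), ("write_flags", ctypes.c_uint32),
                ("lock_owner", ctypes.c_uint64), ("flags", ctypes.c_uint32),
                ("padding", ctypes.c_uint32)]


FUSE_MIN_READ_BUFFER = 8192
max_write = 128 * 1024                       # assumed negotiated value
header_room = ctypes.sizeof(fuse_in_header) + ctypes.sizeof(fuse_write_in)

# glusterfs 3.8 queues /dev/fuse reads with only header_room + max_write.
gluster_buffer = header_room + max_write

# Minimum demanded by the reverted commit (4K header assumption) versus
# the relaxed minimum proposed here (exact header size instead of 4K).
reverted_minimum = max(FUSE_MIN_READ_BUFFER, 4096 + max_write)
proposed_minimum = max(FUSE_MIN_READ_BUFFER, header_room + max_write)

print("header room         :", header_room)        # 80 bytes
print("gluster read buffer :", gluster_buffer)      # 131152
print("reverted commit min :", reverted_minimum)    # 135168 -> EINVAL
print("proposed minimum    :", proposed_minimum)    # 131152 -> accepted

Under those assumptions the buffer glusterfs queues is 4016 bytes short of what the reverted check demanded, but would satisfy a check based on the exact header size, which is the change being asked about here.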
Kirill From mszeredi at redhat.com Wed Jun 12 07:45:01 2019 From: mszeredi at redhat.com (Miklos Szeredi) Date: Wed, 12 Jun 2019 07:45:01 -0000 Subject: [Gluster-devel] Linux 5.2-RC regression bisected, mounting glusterfs volumes fails after commit: fuse: require /dev/fuse reads to have enough buffer capacity In-Reply-To: <20190611202738.GA22556@deco.navytux.spb.ru> References: <876aefd0-808a-bb4b-0897-191f0a8d9e12@eikelenboom.it> <20190611202738.GA22556@deco.navytux.spb.ru> Message-ID: On Tue, Jun 11, 2019 at 10:28 PM Kirill Smelkov wrote: > Miklos, would 4K -> `sizeof(fuse_in_header) + sizeof(fuse_write_in)` for > header room change be accepted? Yes, next cycle. For 5.2 I'll just push the revert. Thanks, Miklos From Janak.Desai at gtri.gatech.edu Fri Jun 21 12:04:58 2019 From: Janak.Desai at gtri.gatech.edu (Desai, Janak) Date: Fri, 21 Jun 2019 12:04:58 -0000 Subject: [Gluster-devel] Quick question about the latest glusterfs and client side selinux support In-Reply-To: <1430982045.13733997.1561089532953.JavaMail.zimbra@redhat.com> References: <6D60E9DD-C4E1-4B95-9277-C4F746DB228C@gtri.gatech.edu> <62539ffb-4b65-1733-6151-fc9b2c604254@redhat.com> <9D9C2802-539E-4153-81FF-4A0F8E934E27@gtri.gatech.edu>, <1430982045.13733997.1561089532953.JavaMail.zimbra@redhat.com> Message-ID: <38719e2da1254b4092b1b72133425fe0@gtri.gatech.edu> Thank you so much Jiffin for the quick response! -Janak ________________________________ From: Jiffin Thottan Sent: Thursday, June 20, 2019 11:58:52 PM To: Desai, Janak Cc: Gluster Devel; nfs-ganesha-devel Subject: Re: Quick question about the latest glusterfs and client side selinux support Hi Janak, Currently, it is supported in glusterfs (from 2.8 onwards) and cephfs (already there in 2.7) for nfs-ganesha. -- Jiffin ----- Original Message ----- From: "Janak Desai" To: "Jiffin Tony Thottan" Sent: Thursday, June 20, 2019 9:29:09 PM Subject: Re: Quick question about the latest glusterfs and client side selinux support Hi Jiffin, I came across your presentation "NFS-Ganesha Weather Report" that you gave at FOSDEM '19 in early Feb this year. In that you mentioned that ongoing developments in v2.8 include 'labelled NFS' support. I see that v2.8 is now out. Do you know if labelled NFS support made it in? If it did, is it only supported in the CEPHFS FSAL, or do other FSALs also include support for it? I took a cursory look at the release documents and didn't see Labelled NFS in them, so I thought I would bug you directly. Thanks. -Janak From: Jiffin Tony Thottan Date: Tuesday, August 28, 2018 at 12:50 AM To: Janak Desai , "ndevos at redhat.com" , "mselvaga at redhat.com" Cc: "paul at paul-moore.com" Subject: Re: Quick question about the latest glusterfs and client side selinux support Hi Janak, Thanks for the interest. A basic selinux xlator is present in the gluster server stack. It stores the selinux context on the backend as an xattr. When we developed that xlator, at that point there was no client to test the functionality. Don't know whether the required change in fuse got merged or not. As you mentioned, here first we need to figure out whether the issue is related to the server. You can collect a packet trace using tcpdump from the client and send it with the mail while setting/getting the selinux context. Regards, Jiffin On Tuesday 28 August 2018 04:14 AM, Desai, Janak wrote: Hi Niels, Manikandan, Jiffin, I work for Georgia Tech Research Institute's CIPHER Lab and am investigating suitability of glusterfs for a couple of large upcoming projects. My "google research" is yielding confusing and inconclusive results, so I thought I would try and reach out to some of the core developers to get some clarity. We use SELinux extensively in our software solution. I am trying to find out if, with the latest version 4.1 of glusterfs running on the latest version of rhel, I should be able to associate and enforce selinux contexts from glusterfs clients. I see in the 3.11 release notes that the selinux feature was implemented, but then I also see references to kernel work that is not done yet. I also could not find any documentation/examples on how to add/integrate this selinux translator to set up and enforce selinux labels from the client side. In my simple test setup, which I mounted using the 'selinux' option (which gluster does seem to recognize), I am getting the 'operation not supported' error. I guess either I am not pulling in the selinux translator or I am running up against other missing functionality in the kernel. I would really appreciate it if you could clear this up for me. If I am not configuring my mount correctly, I would appreciate it if you could point me to a document or an example. Our other option is the lustre filesystem, since it does have working client-side association and enforcement of selinux contexts. However, lustre appears to be a lot more difficult to set up and maintain, and I would rather use glusterfs. We need a distributed (or parallel) filesystem that can work with Hadoop. If glusterfs doesn't pan out then I will look at labelled nfs 4.2 that is now available in rhel7. However, my google research shows much more Hadoop affinity for glusterfs than nfs v4. I am also copying Paul Moore, with whom I collaborated a few years ago as part of the team that took Linux through its common criteria evaluation, and whom I haven't bugged lately, to see if he can shed some light on any missing kernel dependencies. I am currently testing with rhel7.5, but would be willing to try an upstream kernel if I have to in order to get this proof of concept going. I know the underlying problem in the kernel is supporting extended attrs on FUSE file systems, but was wondering (and hoping) that at least setup/enforcement of selinux contexts from the client side for glusterfs is possible. Thanks. -Janak -------------- next part -------------- An HTML attachment was scrubbed... URL:
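One quick way to probe whether a given mount honours client-side SELinux labels at all is to read and set the security.selinux extended attribute directly. The sketch below is only a diagnostic aid (the mount path and context value are placeholders) and says nothing about what any particular glusterfs or NFS version actually supports.

import errno
import os

path = "/mnt/glusterfs/testfile"            # placeholder path on the mount under test
label = b"system_u:object_r:fusefs_t:s0"    # example context only

try:
    print("current label:", os.getxattr(path, "security.selinux"))
except OSError as e:
    print("getxattr failed:", errno.errorcode.get(e.errno, e.errno))

try:
    os.setxattr(path, "security.selinux", label)
    print("setxattr succeeded; client-side labelling appears to work")
except OSError as e:
    # EOPNOTSUPP here corresponds to the 'operation not supported' error
    # mentioned in the thread above.
    print("setxattr failed:", errno.errorcode.get(e.errno, e.errno))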