[heketi-devel] Unable to use Heketi setup to install Gluster for Kubernetes

Gaurav Chhabra varuag.chhabra at gmail.com
Mon Sep 4 13:52:03 UTC 2017


I am just running this bash script which will start the
rancher-glusterfs-client container:

[root@node-a ~]# cat rancher-glusterfs-client.sh
#!/bin/bash
sudo docker run --privileged \
        --name=gluster-client \
        -d \
        -v /sys/fs/cgroup:/sys/fs/cgroup \
        -v /var/log/glusterfs:/var/log/glusterfs \
        --env GLUSTER_PEER=10.128.0.12,10.128.0.15,10.128.0.16 \
        nixel/rancher-glusterfs-client
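
In case it helps with debugging, here is how I plan to poke around once the
container is up (assuming the image ships the usual GlusterFS client tools;
the exact commands are my guess and untested on this image):

[root@node-a ~]# docker exec -it gluster-client /bin/bash
# inside the container:
which mount.glusterfs                                   # is the mount helper present?
mkdir -p /mnt/test && mount -t glusterfs 10.128.0.12:/ranchervol /mnt/test   # retry the mount by hand
tail -n 50 /var/log/glusterfs/mnt-ranchervol.log        # read the client log for the real failure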



On Mon, Sep 4, 2017 at 6:26 PM, Jose A. Rivera <jarrpa at redhat.com> wrote:

> What is the exact command you're running?
>
> On Mon, Sep 4, 2017 at 4:26 AM, Gaurav Chhabra <varuag.chhabra at gmail.com>
> wrote:
> > Hi Jose,
> >
> >
> > From this link, it seems mount.glusterfs might actually be present in the
> > container that launched and quickly terminated.
> >
> > https://unix.stackexchange.com/questions/312178/glusterfs-replicated-volume-mounting-issue
> >
> > If you check the question that the user posted, it reports the same
> > 'Mount failed' error that I sent you in my last email.
> >
> > After seeing the above, I checked /var/log/glusterfs on my host
> > (RancherOS) but it was empty. I ran the container again, but with an
> > explicit volume mount as shown below:
> >
> > [root@node-a ~]# cat rancher-glusterfs-client.sh
> > #!/bin/bash
> > sudo docker run --privileged \
> >         --name=gluster-client \
> >         -d \
> >         -v /sys/fs/cgroup:/sys/fs/cgroup \
> >         -v /var/log/glusterfs:/var/log/glusterfs \
> >         --env GLUSTER_PEER=10.128.0.12,10.128.0.15,10.128.0.16 \
> >         nixel/rancher-glusterfs-client
> > This time, I could see a log file (/var/log/glusterfs/mnt-ranchervol.log)
> > present. I have attached its contents. Also attached are logs from the
> > Heketi client/server (both on one node) and the Gluster cluster.
> >
> >
> > Regards,
> > Gaurav
> >
> >
> > On Mon, Sep 4, 2017 at 2:29 PM, Gaurav Chhabra <varuag.chhabra at gmail.com
> >
> > wrote:
> >>
> >> Hi Jose,
> >>
> >>
> >> I tried setting up things using the link you provided and I was able to
> >> get all steps working for a 3-node Gluster cluster, all running on
> >> CentOS 7, without any issue. However, as expected, when I tried
> >> configuring Kubernetes by installing the nixel/rancher-glusterfs-client
> >> container, I got an error:
> >>
> >> [root@node-a ~]# cat rancher-glusterfs-client.sh
> >> #!/bin/bash
> >> sudo docker run --privileged \
> >>         --name=gluster-client \
> >>         -d \
> >>         -v /sys/fs/cgroup:/sys/fs/cgroup \
> >>         --env GLUSTER_PEER=10.128.0.12,10.128.0.15,10.128.0.16 \
> >>         nixel/rancher-glusterfs-client
> >>
> >> [root@node-a ~]# ./rancher-glusterfs-client.sh
> >> ac069caccdce147d6f423fc566166345191dbc1b11f3416c66207a1fd11fda6b
> >>
> >> [root@node-a ~]# docker logs gluster-client
> >> => Checking if I can reach GlusterFS node 10.128.0.12 ...
> >> => GlusterFS node 10.128.0.12 is alive
> >> => Mounting GlusterFS volume ranchervol from GlusterFS node 10.128.0.12
> >> ...
> >> Mount failed. Please check the log file for more details.
> >>
> >> If I try running the next step as described in your link, I get the
> >> following:
> >>
> >> [root@node-a ~]# modprobe fuse
> >> modprobe: module fuse not found in modules.dep
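> >>
> >> (As an aside, fuse is sometimes built into the kernel rather than shipped
> >> as a module, so the modprobe failure may not be fatal by itself. Checks I
> >> intend to try on the host, assuming the usual tools exist on RancherOS:
> >>
> >> [root@node-a ~]# lsmod | grep fuse            # loaded as a module?
> >> [root@node-a ~]# ls -l /dev/fuse              # device node present?
> >> [root@node-a ~]# grep fuse /proc/filesystems  # support compiled in?)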
> >>
> >> Since the container failed to start, I could only check on the host
> >> (RancherOS), and I could only find two mount-related commands: mount &
> >> mountpoint.
> >>
> >> Any pointers?
> >>
> >>
> >> Regards,
> >> Gaurav
> >>
> >> On Sun, Sep 3, 2017 at 11:18 PM, Jose A. Rivera <jarrpa at redhat.com>
> wrote:
> >>>
> >>> Installing the glusterfs-client container should be fine. :) The main
> >>> thing that's needed is that all your Kubernetes nodes need to have the
> >>> "mount.glusterfs" command available so Kube can mount the GlusterFS
> >>> volumes and present them to the pods.
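> >>>
> >>> (A quick way to verify that on each node is something like the check
> >>> below; this is a generic sketch, not RancherOS-specific:
> >>>
> >>> command -v mount.glusterfs || echo "glusterfs fuse client is missing on this node")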
> >>>
> >>> On Sun, Sep 3, 2017 at 12:14 PM, Gaurav Chhabra
> >>> <varuag.chhabra at gmail.com> wrote:
> >>> > Thanks Jose. The link you've suggested looks good, but again it expects
> >>> > me to install gluster-client on the Kubernetes nodes, and I fall into
> >>> > the same issue of installing a container for glusterfs. The only
> >>> > difference is that this time it's glusterfs-client and not
> >>> > glusterfs-server. :)
> >>> >
> >>> > I will try this out and let you know tomorrow.
> >>> >
> >>> >
> >>> > Regards,
> >>> > Gaurav
> >>> >
> >>> >
> >>> > On Sun, Sep 3, 2017 at 2:11 AM, Jose A. Rivera <jarrpa at redhat.com>
> >>> > wrote:
> >>> >>
> >>> >> Hey, no problem! I'm eager to learn more about different flavors of
> >>> >> Linux, I just apologize for my relative inexperience with them. :)
> >>> >>
> >>> >> To that end, I will also admit I'm not very experienced with direct
> >>> >> Docker myself. I understand the basic workflow and know some of the
> >>> >> run options, but not having deep experience keeps me from having a
> >>> >> better understanding of the patterns and consequences.
> >>> >>
> >>> >> Thus, I'd like to guide you in a direction I'm more apt to help you in
> >>> >> right now. I know that you can't have multiple GlusterFS servers
> >>> >> running on the same nodes, and I know that we have been successfully
> >>> >> running several configurations using our gluster/gluster-centos image.
> >>> >> If you follow the Kubernetes configuration on gluster-kubernetes, the
> >>> >> pod/container is run privileged and with host networking, and we
> >>> >> require that the node has all listed ports open, not just 2222. The
> >>> >> sshd running in the container is listening on 2222, not 22, but it is
> >>> >> also not really required if you're not doing geo-replication.
> >>> >>
> >>> >> Alternatively, you can indeed run GlusterFS outside of Kubernetes but
> >>> >> still have Kubernetes apps access GlusterFS storage. The nodes can be
> >>> >> anything you want, they just need to be running GlusterFS and you need
> >>> >> a heketi service managing them. Here is an example of how to set this
> >>> >> up using CentOS:
> >>> >>
> >>> >> https://github.com/gluster/gluster-kubernetes/tree/master/docs/examples/dynamic_provisioning_external_gluster
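> >>> >>
> >>> >> In rough outline (see the doc for the exact steps and flags): you load
> >>> >> a topology file describing the external nodes into heketi, e.g.
> >>> >>
> >>> >>     heketi-cli -s http://<heketi-host>:8080 topology load --json=topology.json
> >>> >>
> >>> >> and then Kubernetes consumes that storage through a StorageClass whose
> >>> >> provisioner is kubernetes.io/glusterfs and whose resturl parameter
> >>> >> points at that heketi endpoint.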
> >>> >>
> >>> >> Hope this is at least leading you in a useful direction. :)
> >>> >>
> >>> >> --Jose
> >>> >>
> >>> >> On Sat, Sep 2, 2017 at 3:16 PM, Gaurav Chhabra
> >>> >> <varuag.chhabra at gmail.com>
> >>> >> wrote:
> >>> >> > Hi Jose,
> >>> >> >
> >>> >> > webcenter/rancher-glusterfs-server is actually a container provided by
> >>> >> > Sebastien, its maintainer. It's a Docker container which has the
> >>> >> > GlusterFS server running within it. On the host, i.e. RancherOS, there
> >>> >> > is no separate GlusterFS server running because we cannot install
> >>> >> > anything that way. Running it as a container is the only way, so I
> >>> >> > started the rancher-glusterfs-server container with the following
> >>> >> > parameters:
> >>> >> >
> >>> >> > [root@node-1 rancher]# cat gluster-server.sh
> >>> >> > #!/bin/bash
> >>> >> > sudo docker run --name=gluster-server -d \
> >>> >> >         --env 'SERVICE_NAME=gluster' \
> >>> >> >         --restart always \
> >>> >> >         --publish 2222:22 \
> >>> >> >         webcenter/rancher-glusterfs-server
> >>> >> >
> >>> >> > Here's the link to the dockerfile:
> >>> >> >
> >>> >> > https://hub.docker.com/r/webcenter/rancher-glusterfs-server/~/dockerfile/
> >>> >> >
> >>> >> > It's similar to other GlusterFS containers provided by other
> >>> >> > maintainers for different OSes. For example, for CentOS, we have
> >>> >> > https://hub.docker.com/r/gluster/gluster-centos/~/dockerfile/
> >>> >> >
> >>> >> > From what I understand, Heketi does support a container-based
> >>> >> > GlusterFS server, as mentioned in the prerequisite where it says:
> >>> >> >
> >>> >> > "Each node must have the following ports opened for GlusterFS
> >>> >> > communications:
> >>> >> >  2222 - GlusterFS pod's sshd"
> >>> >> >
> >>> >> > That's the reason I've exposed port 2222 for 22 as shown above. Please
> >>> >> > correct me if I misunderstood it.
> >>> >> >
> >>> >> > As soon as I run the above script (gluster-server.sh), it
> >>> >> > automatically creates the following directories on the host. These
> >>> >> > should ideally not have been empty, as you mentioned:
> >>> >> >
> >>> >> > /etc/glusterfs    /var/lib/glusterd    /var/log/glusterfs
> >>> >> >
> >>> >> > I just wanted to know under which circumstances we get this specific
> >>> >> > error (Failed to get D-Bus connection: Operation not permitted)
> >>> >> > related to the readiness probe failing. Searching online took me to
> >>> >> > discussions around running the container in privileged mode and
> >>> >> > mounting certain directories. Based on that, I also modified my
> >>> >> > container startup script to the following:
> >>> >> >
> >>> >> > #!/bin/bash
> >>> >> > sudo docker run --privileged \
> >>> >> >         --name=gluster-server \
> >>> >> >         -d \
> >>> >> >         -v /sys/fs/cgroup:/sys/fs/cgroup \
> >>> >> >         -v /etc/glusterfs:/etc/glusterfs \
> >>> >> >         -v /var/lib/glusterd:/var/lib/glusterd \
> >>> >> >         -v /var/log/glusterfs:/var/log/glusterfs \
> >>> >> >         --env 'SERVICE_NAME=gluster' \
> >>> >> >         --restart always \
> >>> >> >         --publish 2222:22 \
> >>> >> >         webcenter/rancher-glusterfs-server
> >>> >> >
> >>> >> > Still, the issue persists.
> >>> >> >
> >>> >> > I also logged into the container and checked whether the systemctl
> >>> >> > command is present. It was there, but manually running the command
> >>> >> > also doesn't work:
> >>> >> >
> >>> >> > [root@node-c ~]# docker exec -it gluster-server /bin/bash
> >>> >> > root@42150f203f80:/app# systemctl status glusterd.service
> >>> >> > WARNING: terminal is not fully functional
> >>> >> > Failed to connect to bus: No such file or directory
> >>> >> >
> >>> >> > Under the section 'ADVANCED OPTIONS - Security/Host' in this link, it
> >>> >> > talks about the SYS_ADMIN setting. Any idea how I can try this?
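> >>> >> >
> >>> >> > My understanding is that it would just be an extra flag on the docker
> >>> >> > run command in my script above, roughly as sketched below (untested on
> >>> >> > my side; --cap-add is a standard Docker option, other options trimmed
> >>> >> > here for brevity):
> >>> >> >
> >>> >> > sudo docker run --cap-add SYS_ADMIN \
> >>> >> >         --name=gluster-server -d \
> >>> >> >         --env 'SERVICE_NAME=gluster' \
> >>> >> >         --restart always \
> >>> >> >         --publish 2222:22 \
> >>> >> >         webcenter/rancher-glusterfs-server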
> >>> >> >
> >>> >> > Also, there was this mentioned in the Heketi setup page:
> >>> >> > "If you are not able to deploy a hyper-converged GlusterFS cluster,
> >>> >> > you must have one running somewhere that the Kubernetes nodes can
> >>> >> > access"
> >>> >> >
> >>> >> >>>> Does it mean running the three-node Gluster cluster outside
> >>> >> >>>> Kubernetes, maybe on some VM running RHEL/CentOS etc.? If yes, then
> >>> >> >>>> how will I be able to tell Gluster which volume from the Kubernetes
> >>> >> >>>> cluster pod to sync? Any references?
> >>> >> >
> >>> >> >
> >>> >> > I really appreciate your responses; despite not having used RancherOS,
> >>> >> > you're still trying to help.
> >>> >> >
> >>> >> >
> >>> >> > Thanks,
> >>> >> > Gaurav
> >>> >> >
> >>> >> >
> >>> >> > On Sat, Sep 2, 2017 at 7:35 PM, Jose A. Rivera <jarrpa at redhat.com
> >
> >>> >> > wrote:
> >>> >> >>
> >>> >> >> I'm afraid I have no experience with RancherOS, so I may be missing
> >>> >> >> some things about how it works. My primary experience is with Fedora,
> >>> >> >> CentOS, and Ubuntu.
> >>> >> >>
> >>> >> >> What is webcenter/rancher-glusterfs-server? If it's running another
> >>> >> >> glusterd then you probably don't want to be running it and should
> >>> >> >> remove it from your systems.
> >>> >> >>
> >>> >> >> The glusterfs pods mount hostpath volumes from the host they're
> >>> >> >> running on to persist their configuration. Thus anything they write
> >>> >> >> to those directories should land on the host. If that's not happening
> >>> >> >> then that's an additional problem.
> >>> >> >>
> >>> >> >> --Jose
> >>> >> >>
> >>> >> >> On Fri, Sep 1, 2017 at 11:17 PM, Gaurav Chhabra
> >>> >> >> <varuag.chhabra at gmail.com> wrote:
> >>> >> >> > Hi Jose,
> >>> >> >> >
> >>> >> >> >
> >>> >> >> > I tried your suggestion, but there is one confusion regarding point
> >>> >> >> > #3. Since RancherOS has everything running as containers, I am
> >>> >> >> > running the webcenter/rancher-glusterfs-server container on all
> >>> >> >> > three nodes. Now, as far as removing the directories is concerned, I
> >>> >> >> > hope you meant removing them on the host and _not_ from within the
> >>> >> >> > container. After completing steps 1 and 2, I checked the contents of
> >>> >> >> > all the directories that you specified in point #3. All were empty,
> >>> >> >> > as you can see in the attached other_logs.txt. So I got confused. I
> >>> >> >> > ran the deploy again but the issue persists. Two pods show a
> >>> >> >> > Liveness error and the third one a Readiness error.
> >>> >> >> >
> >>> >> >> > I then tried removing those directories (step #3) from within the
> >>> >> >> > container, but I am getting the following error:
> >>> >> >> >
> >>> >> >> > root@c0f8ab4d92a2:/app# rm -rf /var/lib/heketi /etc/glusterfs
> >>> >> >> > /var/lib/glusterd /var/log/glusterfs
> >>> >> >> > rm: cannot remove '/var/lib/glusterd': Device or resource busy
> >>> >> >> >
> >>> >> >> >
> >>> >> >> >
> >>> >> >> > On Fri, Sep 1, 2017 at 8:21 PM, Jose A. Rivera
> >>> >> >> > <jarrpa at redhat.com>
> >>> >> >> > wrote:
> >>> >> >> >>
> >>> >> >> >> 1. Add a line to the ssh-exec portion of heketi.json of the
> >>> >> >> >> sort (see the sketch after these steps):
> >>> >> >> >>
> >>> >> >> >> "sudo": true,
> >>> >> >> >>
> >>> >> >> >> 2. Run
> >>> >> >> >>
> >>> >> >> >> gk-deploy -g --abort
> >>> >> >> >>
> >>> >> >> >> 3. On the nodes that were/will be running GlusterFS pods, run:
> >>> >> >> >>
> >>> >> >> >> rm -rf /var/lib/heketi /etc/glusterfs /var/lib/glusterd
> >>> >> >> >> /var/log/glusterfs
> >>> >> >> >>
> >>> >> >> >> Then try the deploy again.
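> >>> >> >> >>
> >>> >> >> >> For step 1, the sshexec section would end up looking roughly like
> >>> >> >> >> the sketch below (based on the heketi.json you posted; only the
> >>> >> >> >> "sudo" line is new):
> >>> >> >> >>
> >>> >> >> >>     "sshexec": {
> >>> >> >> >>       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
> >>> >> >> >>       "user": "rancher",
> >>> >> >> >>       "port": "22",
> >>> >> >> >>       "fstab": "/etc/fstab",
> >>> >> >> >>       "sudo": true
> >>> >> >> >>     },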
> >>> >> >> >>
> >>> >> >> >> On Fri, Sep 1, 2017 at 6:05 AM, Gaurav Chhabra
> >>> >> >> >> <varuag.chhabra at gmail.com>
> >>> >> >> >> wrote:
> >>> >> >> >> > Hi Jose,
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >> > Thanks for the reply. It seems the three gluster pods might have
> >>> >> >> >> > been a copy-paste from another cluster where I was trying to set
> >>> >> >> >> > up the same thing using CentOS. Sorry for that. By the way, I did
> >>> >> >> >> > check for the kernel modules and it seems they're already there.
> >>> >> >> >> > Also, I am attaching a fresh set of files because I created a new
> >>> >> >> >> > cluster and thought of giving it a try again. The issue still
> >>> >> >> >> > persists. :(
> >>> >> >> >> >
> >>> >> >> >> > In heketi.json, there is a slight change w.r.t. the user which
> >>> >> >> >> > connects to the glusterfs nodes using SSH. I am not sure how Heketi
> >>> >> >> >> > was using the root user to log in, because I wasn't able to use
> >>> >> >> >> > root for manual SSH. With the rancher user, I can log in
> >>> >> >> >> > successfully, so I think this should be fine.
> >>> >> >> >> >
> >>> >> >> >> > /etc/heketi/heketi.json:
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >     "executor": "ssh",
> >>> >> >> >> >
> >>> >> >> >> >     "_sshexec_comment": "SSH username and private key file
> >>> >> >> >> > information",
> >>> >> >> >> >     "sshexec": {
> >>> >> >> >> >       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
> >>> >> >> >> >       "user": "rancher",
> >>> >> >> >> >       "port": "22",
> >>> >> >> >> >       "fstab": "/etc/fstab"
> >>> >> >> >> >     },
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
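> >>> >> >> >> >
> >>> >> >> >> > (To convince myself the key and user work the way Heketi expects, I
> >>> >> >> >> > am using a quick manual test like the one below, run from where
> >>> >> >> >> > Heketi can read that keyfile, against one of the gluster nodes:
> >>> >> >> >> >
> >>> >> >> >> > ssh -i /var/lib/heketi/.ssh/id_rsa -p 22 rancher@<gluster-node-ip> sudo gluster --version
> >>> >> >> >> >
> >>> >> >> >> > If that prompts for a sudo password, I assume Heketi's ssh executor
> >>> >> >> >> > would hit the same problem.)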
> >>> >> >> >> >
> >>> >> >> >> > Before running gk-deploy:
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> > [root@workstation deploy]# kubectl get nodes,pods,daemonset,deployments,services
> >>> >> >> >> > NAME                                     STATUS    AGE       VERSION
> >>> >> >> >> > no/node-a.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> > no/node-b.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> > no/node-c.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> >
> >>> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
> >>> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   3h
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >
> >>> >> >> >> > After running gk-deploy:
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> > [root@workstation messagegc]# kubectl get nodes,pods,daemonset,deployments,services
> >>> >> >> >> > NAME                                     STATUS    AGE       VERSION
> >>> >> >> >> > no/node-a.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> > no/node-b.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> > no/node-c.c.kubernetes-174104.internal   Ready     3h        v1.7.2-rancher1
> >>> >> >> >> >
> >>> >> >> >> > NAME                 READY     STATUS    RESTARTS   AGE
> >>> >> >> >> > po/glusterfs-0j9l5   0/1       Running   0          2m
> >>> >> >> >> > po/glusterfs-gqz4c   0/1       Running   0          2m
> >>> >> >> >> > po/glusterfs-gxvcb   0/1       Running   0          2m
> >>> >> >> >> >
> >>> >> >> >> > NAME           DESIRED   CURRENT   READY     UP-TO-DATE   AVAILABLE   NODE-SELECTOR           AGE
> >>> >> >> >> > ds/glusterfs   3         3         0         3            0           storagenode=glusterfs   2m
> >>> >> >> >> >
> >>> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
> >>> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   3h
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >
> >>> >> >> >> > Kernel module check on all three nodes:
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> > [root@node-a ~]# find /lib*/modules/$(uname -r) -name *.ko | grep 'thin-pool\|snapshot\|mirror' | xargs ls -ltr
> >>> >> >> >> > -rw-r--r--    1 root     root         92310 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-thin-pool.ko
> >>> >> >> >> > -rw-r--r--    1 root     root         56982 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-snapshot.ko
> >>> >> >> >> > -rw-r--r--    1 root     root         27070 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-mirror.ko
> >>> >> >> >> > -rw-r--r--    1 root     root         92310 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-thin-pool.ko
> >>> >> >> >> > -rw-r--r--    1 root     root         56982 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-snapshot.ko
> >>> >> >> >> > -rw-r--r--    1 root     root         27070 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-mirror.ko
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >
> >>> >> >> >> > Error snapshot attached.
> >>> >> >> >> >
> >>> >> >> >> > In my first mail, I noted that the Readiness Probe failure check
> >>> >> >> >> > has this code in the kube-templates/glusterfs-daemonset.yaml file:
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >         readinessProbe:
> >>> >> >> >> >           timeoutSeconds: 3
> >>> >> >> >> >           initialDelaySeconds: 40
> >>> >> >> >> >           exec:
> >>> >> >> >> >             command:
> >>> >> >> >> >             - "/bin/bash"
> >>> >> >> >> >             - "-c"
> >>> >> >> >> >             - systemctl status glusterd.service
> >>> >> >> >> >           periodSeconds: 25
> >>> >> >> >> >           successThreshold: 1
> >>> >> >> >> >           failureThreshold: 15
> >>> >> >> >> >
> >>> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >
> >>> >> >> >> > I tried logging into the glusterfs container on one of the nodes
> >>> >> >> >> > and ran the above command:
> >>> >> >> >> >
> >>> >> >> >> > [root@node-a ~]# docker exec -it c0f8ab4d92a23b6df2 /bin/bash
> >>> >> >> >> > root@c0f8ab4d92a2:/app# systemctl status glusterd.service
> >>> >> >> >> > WARNING: terminal is not fully functional
> >>> >> >> >> > Failed to connect to bus: No such file or directory
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >> > Is there any check that I can do manually on the nodes to debug
> >>> >> >> >> > further? Any suggestions?
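> >>> >> >> >> >
> >>> >> >> >> > (Since the image does not appear to run systemd as PID 1, the only
> >>> >> >> >> > systemd-free checks I can think of to run inside the container are
> >>> >> >> >> > along these lines -- just guesses at useful alternatives:
> >>> >> >> >> >
> >>> >> >> >> > pgrep -l glusterd       # is the daemon actually running?
> >>> >> >> >> > glusterd --version      # is the binary present at all?
> >>> >> >> >> > gluster peer status     # can the CLI reach the local daemon?)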
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >> > On Thu, Aug 31, 2017 at 6:53 PM, Jose A. Rivera
> >>> >> >> >> > <jarrpa at redhat.com>
> >>> >> >> >> > wrote:
> >>> >> >> >> >>
> >>> >> >> >> >> Hey Gaurav,
> >>> >> >> >> >>
> >>> >> >> >> The kernel modules must be loaded on all nodes that will run heketi
> >>> >> >> >> pods. Additionally, you must have at least three nodes specified in
> >>> >> >> >> your topology file. I'm not sure how you're getting three gluster
> >>> >> >> >> pods when you only have two nodes defined... :)
> >>> >> >> >> >>
> >>> >> >> >> >> --Jose
> >>> >> >> >> >>
> >>> >> >> >> >> On Wed, Aug 30, 2017 at 5:27 AM, Gaurav Chhabra
> >>> >> >> >> >> <varuag.chhabra at gmail.com> wrote:
> >>> >> >> >> >> > Hi,
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > I have the following setup in place:
> >>> >> >> >> >> >
> >>> >> >> >> >> > 1 node    : RancherOS having the Rancher application for Kubernetes setup
> >>> >> >> >> >> > 2 nodes  : RancherOS having the Rancher agent
> >>> >> >> >> >> > 1 node   : CentOS 7 workstation having kubectl installed and the folder
> >>> >> >> >> >> >            cloned/downloaded from https://github.com/gluster/gluster-kubernetes,
> >>> >> >> >> >> >            using which I run the Heketi setup (gk-deploy -g)
> >>> >> >> >> >> >
> >>> >> >> >> >> > I also have the rancher-glusterfs-server container running with
> >>> >> >> >> >> > the following configuration:
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> > [root@node-1 rancher]# cat gluster-server.sh
> >>> >> >> >> >> > #!/bin/bash
> >>> >> >> >> >> >
> >>> >> >> >> >> > sudo docker run --name=gluster-server -d \
> >>> >> >> >> >> >         --env 'SERVICE_NAME=gluster' \
> >>> >> >> >> >> >         --restart always \
> >>> >> >> >> >> >         --env 'GLUSTER_DATA=/srv/docker/gitlab' \
> >>> >> >> >> >> >         --publish 2222:22 \
> >>> >> >> >> >> >         webcenter/rancher-glusterfs-server
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> >
> >>> >> >> >> >> > In /etc/heketi/heketi.json, the following is the only modified
> >>> >> >> >> >> > portion:
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> >     "executor": "ssh",
> >>> >> >> >> >> >
> >>> >> >> >> >> >     "_sshexec_comment": "SSH username and private key
> file
> >>> >> >> >> >> > information",
> >>> >> >> >> >> >     "sshexec": {
> >>> >> >> >> >> >       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
> >>> >> >> >> >> >       "user": "root",
> >>> >> >> >> >> >       "port": "22",
> >>> >> >> >> >> >       "fstab": "/etc/fstab"
> >>> >> >> >> >> >     },
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> >
> >>> >> >> >> >> > Status before running gk-deploy:
> >>> >> >> >> >> >
> >>> >> >> >> >> > [root@workstation deploy]# kubectl get nodes,pods,services,deployments
> >>> >> >> >> >> > NAME                                     STATUS    AGE       VERSION
> >>> >> >> >> >> > no/node-1.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> > no/node-2.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> > no/node-3.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> >
> >>> >> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
> >>> >> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   2d
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > Now when I run 'gk-deploy -g', I see the following error in the
> >>> >> >> >> >> > Rancher console:
> >>> >> >> >> >> > Readiness probe failed: Failed to get D-Bus connection:
> >>> >> >> >> >> > Operation not permitted
> >>> >> >> >> >> >
> >>> >> >> >> >> > From the attached gk-deploy_log I see that it failed at:
> >>> >> >> >> >> > Waiting for GlusterFS pods to start ... pods not found.
> >>> >> >> >> >> >
> >>> >> >> >> >> > In the kube-templates/glusterfs-daemonset.yaml file, I see this
> >>> >> >> >> >> > for the readiness probe section:
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> >         readinessProbe:
> >>> >> >> >> >> >           timeoutSeconds: 3
> >>> >> >> >> >> >           initialDelaySeconds: 40
> >>> >> >> >> >> >           exec:
> >>> >> >> >> >> >             command:
> >>> >> >> >> >> >             - "/bin/bash"
> >>> >> >> >> >> >             - "-c"
> >>> >> >> >> >> >             - systemctl status glusterd.service
> >>> >> >> >> >> >           periodSeconds: 25
> >>> >> >> >> >> >           successThreshold: 1
> >>> >> >> >> >> >           failureThreshold: 15
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > ------------------------------------------------------------------
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > Status after running gk-deploy:
> >>> >> >> >> >> >
> >>> >> >> >> >> > [root@workstation deploy]# kubectl get nodes,pods,deployments,services
> >>> >> >> >> >> > NAME                                     STATUS    AGE       VERSION
> >>> >> >> >> >> > no/node-1.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> > no/node-2.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> > no/node-3.c.kubernetes-174104.internal   Ready     2d        v1.7.2-rancher1
> >>> >> >> >> >> >
> >>> >> >> >> >> > NAME                 READY     STATUS    RESTARTS   AGE
> >>> >> >> >> >> > po/glusterfs-0s440   0/1       Running   0          1m
> >>> >> >> >> >> > po/glusterfs-j7dgr   0/1       Running   0          1m
> >>> >> >> >> >> > po/glusterfs-p6jl3   0/1       Running   0          1m
> >>> >> >> >> >> >
> >>> >> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
> >>> >> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   2d
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > Also, from a prerequisites perspective, I saw this mentioned:
> >>> >> >> >> >> >
> >>> >> >> >> >> > The following kernel modules must be loaded:
> >>> >> >> >> >> >  * dm_snapshot
> >>> >> >> >> >> >  * dm_mirror
> >>> >> >> >> >> >  * dm_thin_pool
> >>> >> >> >> >> >
> >>> >> >> >> >> > Where exactly is this to be checked? On all Gluster server nodes?
> >>> >> >> >> >> > How can I check whether they're there?
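> >>> >> >> >> >> >
> >>> >> >> >> >> > (My guess is something along these lines on each Gluster node,
> >>> >> >> >> >> > though please correct me if that's wrong:
> >>> >> >> >> >> >
> >>> >> >> >> >> > lsmod | grep -E 'dm_snapshot|dm_mirror|dm_thin_pool'   # are they loaded?
> >>> >> >> >> >> > modprobe dm_snapshot; modprobe dm_mirror; modprobe dm_thin_pool)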
> >>> >> >> >> >> >
> >>> >> >> >> >> > I have attached topology.json and gk-deploy log for
> >>> >> >> >> >> > reference.
> >>> >> >> >> >> >
> >>> >> >> >> >> > Does this issue have anything to do with the host OS (RancherOS)
> >>> >> >> >> >> > that I am using for the Gluster nodes? Any idea how I can fix
> >>> >> >> >> >> > this? Any help will really be appreciated.
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > Thanks.
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> >
> >>> >> >> >> >> > _______________________________________________
> >>> >> >> >> >> > heketi-devel mailing list
> >>> >> >> >> >> > heketi-devel at gluster.org
> >>> >> >> >> >> > http://lists.gluster.org/mailman/listinfo/heketi-devel
> >>> >> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >
> >>> >> >> >
> >>> >> >
> >>> >> >
> >>> >
> >>> >
> >>
> >>
> >
>

