[heketi-devel] Unable to use Heketi setup to install Gluster for Kubernetes

Jose A. Rivera jarrpa at redhat.com
Mon Sep 4 12:56:55 UTC 2017


What is the exact command you're running?

On Mon, Sep 4, 2017 at 4:26 AM, Gaurav Chhabra <varuag.chhabra at gmail.com> wrote:
> Hi Jose,
>
>
> From this link, it seems mount.glusterfs might actually be present in the
> container that was launched and quickly terminated.
>
> https://unix.stackexchange.com/questions/312178/glusterfs-replicated-volume-mounting-issue
>
> If you check the question that the user posted, it reports the same error
> (Mount failed) that I sent you in the last email.
>
> After seeing the above, I checked /var/log/glusterfs on my host (RancherOS),
> but it was empty. I ran the container again, this time with an explicit volume
> mount as shown below:
>
> [root at node-a ~]# cat rancher-glusterfs-client.sh
> #!/bin/bash
> sudo docker run --privileged \
>         --name=gluster-client \
>         -d \
>         -v /sys/fs/cgroup:/sys/fs/cgroup \
>         -v /var/log/glusterfs:/var/log/glusterfs \
>         --env GLUSTER_PEER=10.128.0.12,10.128.0.15,10.128.0.16 \
>         nixel/rancher-glusterfs-client
>
> This time, I could see a log file (/var/log/glusterfs/mnt-ranchervol.log)
> present. I have attached its contents. Also attached are logs from the Heketi
> client/server (both on one node) and the Gluster cluster.
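>
> (To double-check that the mount.glusterfs binary really is inside the image, I
> suppose something like this would confirm it, assuming the image has sh and
> which available:)
>
> sudo docker run --rm --entrypoint sh nixel/rancher-glusterfs-client -c 'which mount.glusterfs'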
>
>
> Regards,
> Gaurav
>
>
> On Mon, Sep 4, 2017 at 2:29 PM, Gaurav Chhabra <varuag.chhabra at gmail.com>
> wrote:
>>
>> Hi Jose,
>>
>>
>> I tried setting things up using the link you provided, and I was able to get
>> all the steps working for a 3-node Gluster cluster, all running on CentOS 7,
>> without any issue. However, as expected, when I tried configuring Kubernetes
>> by installing the nixel/rancher-glusterfs-client container, I got an error:
>>
>> [root at node-a ~]# cat rancher-glusterfs-client.sh
>> #!/bin/bash
>> sudo docker run --privileged \
>>         --name=gluster-client \
>>         -d \
>>         -v /sys/fs/cgroup:/sys/fs/cgroup \
>>         --env GLUSTER_PEER=10.128.0.12,10.128.0.15,10.128.0.16 \
>>         nixel/rancher-glusterfs-client
>>
>> [root at node-a ~]# ./rancher-glusterfs-client.sh
>> ac069caccdce147d6f423fc566166345191dbc1b11f3416c66207a1fd11fda6b
>>
>> [root at node-a ~]# docker logs gluster-client
>> => Checking if I can reach GlusterFS node 10.128.0.12 ...
>> => GlusterFS node 10.128.0.12 is alive
>> => Mounting GlusterFS volume ranchervol from GlusterFS node 10.128.0.12 ...
>> Mount failed. Please check the log file for more details.
>>
>> If I try running the next step as described in your link, I get the
>> following:
>>
>> [root at node-a ~]# modprobe fuse
>> modprobe: module fuse not found in modules.dep
>>
>> Since the container failed to start, I could only check on the host
>> (RancherOS), where I found just two mount-related commands: mount and
>> mountpoint.
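>>
>> (I guess the next thing to check is whether fuse is available at all on
>> RancherOS; something like the following should show whether the module is
>> loaded, built into the kernel, or at least present on disk:)
>>
>> lsmod | grep fuse
>> grep -i fuse /proc/filesystems
>> find /lib*/modules/$(uname -r) -name 'fuse.ko*'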
>>
>> Any pointers?
>>
>>
>> Regards,
>> Gaurav
>>
>> On Sun, Sep 3, 2017 at 11:18 PM, Jose A. Rivera <jarrpa at redhat.com> wrote:
>>>
>>> Installing the glusterfs-client container should be fine. :) The main
>>> thing that's needed is that all your Kubernetes nodes need to have the
>>> "mount.glusterfs" command available so Kube can mount the GlusterFS
>>> volumes and present them to the pods.
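>>>
>>> (A quick sanity check on each node would be something along these lines; the
>>> server address and volume name below are placeholders:)
>>>
>>> which mount.glusterfs
>>> mount -t glusterfs <gluster-node>:/<volname> /mnt && umount /mnt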
>>>
>>> On Sun, Sep 3, 2017 at 12:14 PM, Gaurav Chhabra
>>> <varuag.chhabra at gmail.com> wrote:
>>> > Thanks Jose. The link you've suggested looks good, but again it expects me
>>> > to install the gluster client on the Kubernetes nodes, and I fall into the
>>> > same issue of installing a container for GlusterFS. The only difference is
>>> > that this time it's glusterfs-client and not glusterfs-server. :)
>>> >
>>> > I will try this out and let you know tomorrow.
>>> >
>>> >
>>> > Regards,
>>> > Gaurav
>>> >
>>> >
>>> > On Sun, Sep 3, 2017 at 2:11 AM, Jose A. Rivera <jarrpa at redhat.com>
>>> > wrote:
>>> >>
>>> >> Hey, no problem! I'm eager to learn more about different flavors of
>>> >> Linux, I just apologize for my relative inexperience with them. :)
>>> >>
>>> >> To that end, I will also admit I'm not very experienced with direct
>>> >> Docker myself. I understand the basic workflow and know some of the
>>> >> run options, but not having deep experience keeps me from having a
>>> >> better understanding of the patterns and consequences.
>>> >>
>>> >> Thus, I'd like to guide you in a direction I'm more apt to help you with
>>> >> right now. I know that you can't have multiple GlusterFS servers
>>> >> running on the same nodes, and I know that we have been successfully
>>> >> running several configurations using our gluster/gluster-centos image.
>>> >> If you follow the Kubernetes configuration on gluster-kubernetes, the
>>> >> pod/container is run privileged and with host networking, and we
>>> >> require that the node has all listed ports open, not just 2222. The
>>> >> sshd running in the container is listening on 2222, not 22, but it is
>>> >> also not really required if you're not doing geo-replication.
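>>> >>
>>> >> (For reference, the port list in the setup guide is roughly: 2222 for the
>>> >> pod's sshd, 24007/24008 for the GlusterFS daemon and management, and
>>> >> 49152 and up for bricks. On a firewalld-based host that would look
>>> >> something like the following; RancherOS will need its own equivalent:)
>>> >>
>>> >> firewall-cmd --permanent --add-port=2222/tcp --add-port=24007/tcp --add-port=24008/tcp --add-port=49152-49251/tcp
>>> >> firewall-cmd --reload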
>>> >>
>>> >> Alternatively, you can indeed run GlusterFS outside of Kubernetes but
>>> >> still have Kubernetes apps access GlusterFS storage. The nodes can be
>>> >> anything you want, they just need to be running GlusterFS and you need
>>> >> a heketi service managing them. Here is an example of how to set this
>>> >> up using CentOS:
>>> >>
>>> >>
>>> >>
>>> >> https://github.com/gluster/gluster-kubernetes/tree/master/docs/examples/dynamic_provisioning_external_gluster
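>>> >>
>>> >> (Once heketi is reachable, wiring Kubernetes to it is essentially just a
>>> >> StorageClass. A minimal sketch, with the name and resturl as placeholders
>>> >> for whatever your heketi service ends up being:)
>>> >>
>>> >> cat <<EOF | kubectl create -f -
>>> >> apiVersion: storage.k8s.io/v1
>>> >> kind: StorageClass
>>> >> metadata:
>>> >>   name: gluster-external
>>> >> provisioner: kubernetes.io/glusterfs
>>> >> parameters:
>>> >>   resturl: "http://<heketi-address>:8080"
>>> >>   restauthenabled: "false"
>>> >> EOF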
>>> >>
>>> >> Hope this is at least leading you in a useful direction. :)
>>> >>
>>> >> --Jose
>>> >>
>>> >> On Sat, Sep 2, 2017 at 3:16 PM, Gaurav Chhabra
>>> >> <varuag.chhabra at gmail.com>
>>> >> wrote:
>>> >> > Hi Jose,
>>> >> >
>>> >> > webcenter/rancher-glusterfs-server is actually a container provided by
>>> >> > Sebastien, its maintainer. It's a Docker container with a GlusterFS
>>> >> > server running inside it. On the host, i.e. RancherOS, there is no
>>> >> > separate GlusterFS server running, because we cannot install anything
>>> >> > that way. Running it as a container is the only option, so I started the
>>> >> > rancher-glusterfs-server container with the following parameters:
>>> >> >
>>> >> > [root at node-1 rancher]# cat gluster-server.sh
>>> >> > #!/bin/bash
>>> >> > sudo docker run --name=gluster-server -d \
>>> >> >         --env 'SERVICE_NAME=gluster' \
>>> >> >         --restart always \
>>> >> >         --publish 2222:22 \
>>> >> >         webcenter/rancher-glusterfs-server
>>> >> >
>>> >> > Here's the link to the dockerfile:
>>> >> >
>>> >> >
>>> >> > https://hub.docker.com/r/webcenter/rancher-glusterfs-server/~/dockerfile/
>>> >> >
>>> >> > It's similar to GlusterFS containers provided by other maintainers for
>>> >> > different OSes. For example, for CentOS, we have
>>> >> > https://hub.docker.com/r/gluster/gluster-centos/~/dockerfile/
>>> >> >
>>> >> > From what I understand, Heketi does support a container-based GlusterFS
>>> >> > server, as mentioned in the prerequisite where it says:
>>> >> >
>>> >> > "Each node must have the following ports opened for GlusterFS
>>> >> > communications:
>>> >> >  2222 - GlusterFS pod's sshd"
>>> >> >
>>> >> > That's the reason I've exposed port 2222 for 22 as shown above. Please
>>> >> > correct me if I misunderstood it.
>>> >> >
>>> >> > As soon as I run the above script (gluster-server.sh), it automatically
>>> >> > creates the following directories on the host. These should ideally not
>>> >> > have been empty, as you mentioned.
>>> >> >
>>> >> > /etc/glusterfs    /var/lib/glusterd    /var/log/glusterfs
>>> >> >
>>> >> > I just wanted to know under which circumstances we get this specific
>>> >> > error (Failed to get D-Bus connection: Operation not permitted) related
>>> >> > to the readiness probe failing. Searching online took me to discussions
>>> >> > about running the container in privileged mode and mounting certain
>>> >> > directories. Based on that, I also modified my container startup script
>>> >> > to the following:
>>> >> >
>>> >> > #!/bin/bash
>>> >> > sudo docker run --privileged \
>>> >> >         --name=gluster-server \
>>> >> >         -d \
>>> >> >         -v /sys/fs/cgroup:/sys/fs/cgroup \
>>> >> >         -v /etc/glusterfs:/etc/glusterfs \
>>> >> >         -v /var/lib/glusterd:/var/lib/glusterd \
>>> >> >         -v /var/log/glusterfs:/var/log/glusterfs \
>>> >> >         --env 'SERVICE_NAME=gluster' \
>>> >> >         --restart always \
>>> >> >         --publish 2222:22 \
>>> >> >         webcenter/rancher-glusterfs-server
>>> >> >
>>> >> > Still, the issue persists.
>>> >> >
>>> >> > I also logged into the container and checked whether the systemctl
>>> >> > command is present. It was there, but manually running the command also
>>> >> > doesn't work:
>>> >> >
>>> >> > [root at node-c ~]# docker exec -it gluster-server /bin/bash
>>> >> > root at 42150f203f80:/app# systemctl status glusterd.service
>>> >> > WARNING: terminal is not fully functional
>>> >> > Failed to connect to bus: No such file or directory
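>>> >> >
>>> >> > (Since systemctl apparently cannot reach a systemd instance inside this
>>> >> > image, a plain process check is probably more telling for whether
>>> >> > glusterd itself is running; assuming ps is available in the image:)
>>> >> >
>>> >> > docker exec gluster-server ps aux | grep '[g]lusterd'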
>>> >> >
>>> >> > Under the 'ADVANCED OPTIONS - Security/Host' section in this link, it
>>> >> > talks about the SYS_ADMIN setting. Any idea how I can try this?
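>>> >> >
>>> >> > (With plain docker, I assume that would mean --cap-add=SYS_ADMIN in
>>> >> > place of --privileged, although --privileged should already grant that
>>> >> > capability; a sketch:)
>>> >> >
>>> >> > sudo docker run --cap-add=SYS_ADMIN \
>>> >> >         --name=gluster-server \
>>> >> >         -d \
>>> >> >         -v /sys/fs/cgroup:/sys/fs/cgroup \
>>> >> >         -v /etc/glusterfs:/etc/glusterfs \
>>> >> >         -v /var/lib/glusterd:/var/lib/glusterd \
>>> >> >         -v /var/log/glusterfs:/var/log/glusterfs \
>>> >> >         --env 'SERVICE_NAME=gluster' \
>>> >> >         --restart always \
>>> >> >         --publish 2222:22 \
>>> >> >         webcenter/rancher-glusterfs-server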
>>> >> >
>>> >> > Also, this was mentioned on the Heketi setup page:
>>> >> > "If you are not able to deploy a hyper-converged GlusterFS cluster, you
>>> >> > must have one running somewhere that the Kubernetes nodes can access"
>>> >> >
>>> >> >>>> Does it mean running the three-node Gluster cluster outside of
>>> >> >>>> Kubernetes, maybe on some VM running RHEL/CentOS etc.? If yes, then
>>> >> >>>> how will I be able to tell Gluster which volume from the Kubernetes
>>> >> >>>> cluster pod to sync? Any references?
>>> >> >
>>> >> >
>>> >> > I really appreciate your responses, given that you've never used
>>> >> > RancherOS but are still trying to help.
>>> >> >
>>> >> >
>>> >> > Thanks,
>>> >> > Gaurav
>>> >> >
>>> >> >
>>> >> > On Sat, Sep 2, 2017 at 7:35 PM, Jose A. Rivera <jarrpa at redhat.com>
>>> >> > wrote:
>>> >> >>
>>> >> >> I'm afraid I have no experience with RancherOS, so I may be missing
>>> >> >> some things about how it works. My primary experience is with
>>> >> >> Fedora,
>>> >> >> CentOS, and Ubuntu.
>>> >> >>
>>> >> >> What is webcenter/rancher-glusterfs-server? If it's running another
>>> >> >> glusterd then you probably don't want to be running it and should
>>> >> >> remove it from your systems.
>>> >> >>
>>> >> >> The glusterfs pods mount hostpath volumes from the host they're
>>> >> >> running on to persist their configuration. Thus anything they write
>>> >> >> to
>>> >> >> those directories should land on the host. If that's not happening
>>> >> >> then that's an additional problem.
>>> >> >>
>>> >> >> --Jose
>>> >> >>
>>> >> >> On Fri, Sep 1, 2017 at 11:17 PM, Gaurav Chhabra
>>> >> >> <varuag.chhabra at gmail.com> wrote:
>>> >> >> > Hi Jose,
>>> >> >> >
>>> >> >> >
>>> >> >> > I tried your suggestion, but there is one point of confusion
>>> >> >> > regarding point #3. Since RancherOS has everything running as a
>>> >> >> > container, I am running the webcenter/rancher-glusterfs-server
>>> >> >> > container on all three nodes. As far as removing the directories is
>>> >> >> > concerned, I hope you meant removing them on the host and _not_ from
>>> >> >> > within the container. After completing steps 1 and 2, I checked the
>>> >> >> > contents of all the directories that you specified in point #3. All
>>> >> >> > were empty, as you can see in the attached other_logs.txt, so I got
>>> >> >> > confused. I ran the deploy again but the issue persists. Two pods
>>> >> >> > show a Liveness error and the third one a Readiness error.
>>> >> >> >
>>> >> >> > I then tried removing those directories (step #3) from within the
>>> >> >> > container, but I get the following error:
>>> >> >> >
>>> >> >> > root at c0f8ab4d92a2:/app# rm -rf /var/lib/heketi /etc/glusterfs
>>> >> >> > /var/lib/glusterd /var/log/glusterfs
>>> >> >> > rm: cannot remove '/var/lib/glusterd': Device or resource busy
>>> >> >> >
>>> >> >> >
>>> >> >> >
>>> >> >> > On Fri, Sep 1, 2017 at 8:21 PM, Jose A. Rivera
>>> >> >> > <jarrpa at redhat.com>
>>> >> >> > wrote:
>>> >> >> >>
>>> >> >> >> 1. Add a line to the ssh-exec portion of heketi.json of the
>>> >> >> >> sort:
>>> >> >> >>
>>> >> >> >> "sudo": true,
>>> >> >> >>
>>> >> >> >> 2. Run
>>> >> >> >>
>>> >> >> >> gk-deploy -g --abort
>>> >> >> >>
>>> >> >> >> 3. On the nodes that were/will be running GlusterFS pods, run:
>>> >> >> >>
>>> >> >> >> rm -rf /var/lib/heketi /etc/glusterfs /var/lib/glusterd
>>> >> >> >> /var/log/glusterfs
>>> >> >> >>
>>> >> >> >> Then try the deploy again.
>>> >> >> >>
>>> >> >> >> On Fri, Sep 1, 2017 at 6:05 AM, Gaurav Chhabra
>>> >> >> >> <varuag.chhabra at gmail.com>
>>> >> >> >> wrote:
>>> >> >> >> > Hi Jose,
>>> >> >> >> >
>>> >> >> >> >
>>> >> >> >> > Thanks for the reply. It seems the three gluster pods might have
>>> >> >> >> > been a copy-paste from another cluster where I was trying to set
>>> >> >> >> > up the same thing using CentOS. Sorry for that. By the way, I did
>>> >> >> >> > check for the kernel modules and it seems they're already there.
>>> >> >> >> > Also, I am attaching a fresh set of files because I created a new
>>> >> >> >> > cluster and thought of giving it another try. The issue still
>>> >> >> >> > persists. :(
>>> >> >> >> >
>>> >> >> >> > In heketi.json, there is a slight change w.r.t. the user that
>>> >> >> >> > connects to the glusterfs nodes over SSH. I am not sure how Heketi
>>> >> >> >> > was using the root user to log in, because I wasn't able to use
>>> >> >> >> > root for a manual SSH. With the rancher user I can log in
>>> >> >> >> > successfully, so I think this should be fine.
>>> >> >> >> >
>>> >> >> >> > /etc/heketi/heketi.json:
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >     "executor": "ssh",
>>> >> >> >> >
>>> >> >> >> >     "_sshexec_comment": "SSH username and private key file
>>> >> >> >> > information",
>>> >> >> >> >     "sshexec": {
>>> >> >> >> >       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
>>> >> >> >> >       "user": "rancher",
>>> >> >> >> >       "port": "22",
>>> >> >> >> >       "fstab": "/etc/fstab"
>>> >> >> >> >     },
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
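>>> >> >> >> >
>>> >> >> >> > (One thing I should probably double-check: since this is now a
>>> >> >> >> > non-root user, heketi presumably also needs "sudo": true in this
>>> >> >> >> > block, plus passwordless sudo for rancher on every node; roughly:)
>>> >> >> >> >
>>> >> >> >> >     "sshexec": {
>>> >> >> >> >       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
>>> >> >> >> >       "user": "rancher",
>>> >> >> >> >       "port": "22",
>>> >> >> >> >       "fstab": "/etc/fstab",
>>> >> >> >> >       "sudo": true
>>> >> >> >> >     },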
>>> >> >> >> >
>>> >> >> >> > Before running gk-deploy:
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> > [root at workstation deploy]# kubectl get
>>> >> >> >> > nodes,pods,daemonset,deployments,services
>>> >> >> >> > NAME                                     STATUS    AGE
>>> >> >> >> > VERSION
>>> >> >> >> > no/node-a.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> > no/node-b.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> > no/node-c.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >
>>> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
>>> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   3h
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >
>>> >> >> >> > After running gk-deploy:
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> > [root at workstation messagegc]# kubectl get
>>> >> >> >> > nodes,pods,daemonset,deployments,services
>>> >> >> >> > NAME                                     STATUS    AGE
>>> >> >> >> > VERSION
>>> >> >> >> > no/node-a.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> > no/node-b.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> > no/node-c.c.kubernetes-174104.internal   Ready     3h
>>> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >
>>> >> >> >> > NAME                 READY     STATUS    RESTARTS   AGE
>>> >> >> >> > po/glusterfs-0j9l5   0/1       Running   0          2m
>>> >> >> >> > po/glusterfs-gqz4c   0/1       Running   0          2m
>>> >> >> >> > po/glusterfs-gxvcb   0/1       Running   0          2m
>>> >> >> >> >
>>> >> >> >> > NAME           DESIRED   CURRENT   READY     UP-TO-DATE   AVAILABLE   NODE-SELECTOR           AGE
>>> >> >> >> > ds/glusterfs   3         3         0         3            0           storagenode=glusterfs   2m
>>> >> >> >> >
>>> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
>>> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   3h
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >
>>> >> >> >> > Kernel module check on all three nodes:
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> > [root at node-a ~]# find /lib*/modules/$(uname -r) -name *.ko | grep 'thin-pool\|snapshot\|mirror' | xargs ls -ltr
>>> >> >> >> > -rw-r--r--    1 root     root         92310 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-thin-pool.ko
>>> >> >> >> > -rw-r--r--    1 root     root         56982 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-snapshot.ko
>>> >> >> >> > -rw-r--r--    1 root     root         27070 Jun 26 04:13 /lib64/modules/4.9.34-rancher/kernel/drivers/md/dm-mirror.ko
>>> >> >> >> > -rw-r--r--    1 root     root         92310 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-thin-pool.ko
>>> >> >> >> > -rw-r--r--    1 root     root         56982 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-snapshot.ko
>>> >> >> >> > -rw-r--r--    1 root     root         27070 Jun 26 04:13 /lib/modules/4.9.34-rancher/kernel/drivers/md/dm-mirror.ko
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
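>>> >> >> >> >
>>> >> >> >> > (Note that the find above only shows the module files exist; the
>>> >> >> >> > prerequisite talks about them being loaded, which is a separate
>>> >> >> >> > check. Something like this on each node, assuming lsmod/modprobe
>>> >> >> >> > are available on RancherOS:)
>>> >> >> >> >
>>> >> >> >> > lsmod | grep -E 'dm_thin_pool|dm_snapshot|dm_mirror'
>>> >> >> >> > modprobe -a dm_thin_pool dm_snapshot dm_mirror   # load them if the grep comes back empty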
>>> >> >> >> >
>>> >> >> >> > Error snapshot attached.
>>> >> >> >> >
>>> >> >> >> > In my first mail, I noted that the readiness probe failure check
>>> >> >> >> > has this code in the kube-templates/glusterfs-daemonset.yaml file:
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >         readinessProbe:
>>> >> >> >> >           timeoutSeconds: 3
>>> >> >> >> >           initialDelaySeconds: 40
>>> >> >> >> >           exec:
>>> >> >> >> >             command:
>>> >> >> >> >             - "/bin/bash"
>>> >> >> >> >             - "-c"
>>> >> >> >> >             - systemctl status glusterd.service
>>> >> >> >> >           periodSeconds: 25
>>> >> >> >> >           successThreshold: 1
>>> >> >> >> >           failureThreshold: 15
>>> >> >> >> >
>>> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >
>>> >> >> >> > I tried logging into the glusterfs container on one of the nodes
>>> >> >> >> > and ran the above command:
>>> >> >> >> >
>>> >> >> >> > [root at node-a ~]# docker exec -it c0f8ab4d92a23b6df2 /bin/bash
>>> >> >> >> > root at c0f8ab4d92a2:/app# systemctl status glusterd.service
>>> >> >> >> > WARNING: terminal is not fully functional
>>> >> >> >> > Failed to connect to bus: No such file or directory
>>> >> >> >> >
>>> >> >> >> >
>>> >> >> >> > Is there any check I can do manually on the nodes to debug
>>> >> >> >> > further? Any suggestions?
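>>> >> >> >> >
>>> >> >> >> > (From the workstation, the probe failure events and container
>>> >> >> >> > output should also be visible with something like the following,
>>> >> >> >> > using one of the pod names from the listing above:)
>>> >> >> >> >
>>> >> >> >> > kubectl describe pod glusterfs-0j9l5
>>> >> >> >> > kubectl logs glusterfs-0j9l5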
>>> >> >> >> >
>>> >> >> >> >
>>> >> >> >> > On Thu, Aug 31, 2017 at 6:53 PM, Jose A. Rivera
>>> >> >> >> > <jarrpa at redhat.com>
>>> >> >> >> > wrote:
>>> >> >> >> >>
>>> >> >> >> >> Hey Gaurav,
>>> >> >> >> >>
>>> >> >> >> >> The kernel modules must be loaded on all nodes that will run
>>> >> >> >> >> heketi
>>> >> >> >> >> pods. Additionally, you must have at least three nodes
>>> >> >> >> >> specified
>>> >> >> >> >> in
>>> >> >> >> >> your topology file. I'm not sure how you're getting three
>>> >> >> >> >> gluster
>>> >> >> >> >> pods
>>> >> >> >> >> when you only have two nodes defined... :)
>>> >> >> >> >>
>>> >> >> >> >> --Jose
>>> >> >> >> >>
>>> >> >> >> >> On Wed, Aug 30, 2017 at 5:27 AM, Gaurav Chhabra
>>> >> >> >> >> <varuag.chhabra at gmail.com> wrote:
>>> >> >> >> >> > Hi,
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > I have the following setup in place:
>>> >> >> >> >> >
>>> >> >> >> >> > 1 node  : RancherOS running the Rancher application for the Kubernetes setup
>>> >> >> >> >> > 2 nodes : RancherOS running the Rancher agent
>>> >> >> >> >> > 1 node  : CentOS 7 workstation with kubectl installed and the folder
>>> >> >> >> >> > cloned/downloaded from https://github.com/gluster/gluster-kubernetes,
>>> >> >> >> >> > from which I run the Heketi setup (gk-deploy -g)
>>> >> >> >> >> >
>>> >> >> >> >> > I also have the rancher-glusterfs-server container running with
>>> >> >> >> >> > the following configuration:
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> > [root at node-1 rancher]# cat gluster-server.sh
>>> >> >> >> >> > #!/bin/bash
>>> >> >> >> >> >
>>> >> >> >> >> > sudo docker run --name=gluster-server -d \
>>> >> >> >> >> >         --env 'SERVICE_NAME=gluster' \
>>> >> >> >> >> >         --restart always \
>>> >> >> >> >> >         --env 'GLUSTER_DATA=/srv/docker/gitlab' \
>>> >> >> >> >> >         --publish 2222:22 \
>>> >> >> >> >> >         webcenter/rancher-glusterfs-server
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> >
>>> >> >> >> >> > In /etc/heketi/heketi.json, following is the only modified
>>> >> >> >> >> > portion:
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> >     "executor": "ssh",
>>> >> >> >> >> >
>>> >> >> >> >> >     "_sshexec_comment": "SSH username and private key file
>>> >> >> >> >> > information",
>>> >> >> >> >> >     "sshexec": {
>>> >> >> >> >> >       "keyfile": "/var/lib/heketi/.ssh/id_rsa",
>>> >> >> >> >> >       "user": "root",
>>> >> >> >> >> >       "port": "22",
>>> >> >> >> >> >       "fstab": "/etc/fstab"
>>> >> >> >> >> >     },
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> >
>>> >> >> >> >> > Status before running gk-deploy:
>>> >> >> >> >> >
>>> >> >> >> >> > [root at workstation deploy]# kubectl get
>>> >> >> >> >> > nodes,pods,services,deployments
>>> >> >> >> >> > NAME                                     STATUS    AGE
>>> >> >> >> >> > VERSION
>>> >> >> >> >> > no/node-1.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> > no/node-2.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> > no/node-3.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> >
>>> >> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
>>> >> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   2d
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > Now when I run 'gk-deploy -g', I see the following error in the
>>> >> >> >> >> > Rancher console:
>>> >> >> >> >> > Readiness probe failed: Failed to get D-Bus connection:
>>> >> >> >> >> > Operation not permitted
>>> >> >> >> >> >
>>> >> >> >> >> > From the attached gk-deploy_log I see that it failed at:
>>> >> >> >> >> > Waiting for GlusterFS pods to start ... pods not found.
>>> >> >> >> >> >
>>> >> >> >> >> > In the kube-templates/glusterfs-daemonset.yaml file, I see this
>>> >> >> >> >> > for the readiness probe section:
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> >         readinessProbe:
>>> >> >> >> >> >           timeoutSeconds: 3
>>> >> >> >> >> >           initialDelaySeconds: 40
>>> >> >> >> >> >           exec:
>>> >> >> >> >> >             command:
>>> >> >> >> >> >             - "/bin/bash"
>>> >> >> >> >> >             - "-c"
>>> >> >> >> >> >             - systemctl status glusterd.service
>>> >> >> >> >> >           periodSeconds: 25
>>> >> >> >> >> >           successThreshold: 1
>>> >> >> >> >> >           failureThreshold: 15
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > ------------------------------------------------------------------
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > Status after running gk-deploy:
>>> >> >> >> >> >
>>> >> >> >> >> > [root at workstation deploy]# kubectl get
>>> >> >> >> >> > nodes,pods,deployments,services
>>> >> >> >> >> > NAME                                     STATUS    AGE
>>> >> >> >> >> > VERSION
>>> >> >> >> >> > no/node-1.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> > no/node-2.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> > no/node-3.c.kubernetes-174104.internal   Ready     2d
>>> >> >> >> >> > v1.7.2-rancher1
>>> >> >> >> >> >
>>> >> >> >> >> > NAME                 READY     STATUS    RESTARTS   AGE
>>> >> >> >> >> > po/glusterfs-0s440   0/1       Running   0          1m
>>> >> >> >> >> > po/glusterfs-j7dgr   0/1       Running   0          1m
>>> >> >> >> >> > po/glusterfs-p6jl3   0/1       Running   0          1m
>>> >> >> >> >> >
>>> >> >> >> >> > NAME             CLUSTER-IP   EXTERNAL-IP   PORT(S)   AGE
>>> >> >> >> >> > svc/kubernetes   10.43.0.1    <none>        443/TCP   2d
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > Also, from a prerequisites perspective, I saw this mentioned:
>>> >> >> >> >> >
>>> >> >> >> >> > The following kernel modules must be loaded:
>>> >> >> >> >> >  * dm_snapshot
>>> >> >> >> >> >  * dm_mirror
>>> >> >> >> >> >  * dm_thin_pool
>>> >> >> >> >> >
>>> >> >> >> >> > Where exactly is this to be checked? On all Gluster server
>>> >> >> >> >> > nodes? How can I check whether it's there?
>>> >> >> >> >> >
>>> >> >> >> >> > I have attached topology.json and gk-deploy log for
>>> >> >> >> >> > reference.
>>> >> >> >> >> >
>>> >> >> >> >> > Does this issue have anything to do with the host OS
>>> >> >> >> >> > (RancherOS) that I am using for the Gluster nodes? Any idea how
>>> >> >> >> >> > I can fix this? Any help would really be appreciated.
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > Thanks.
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> >
>>> >> >> >> >> > _______________________________________________
>>> >> >> >> >> > heketi-devel mailing list
>>> >> >> >> >> > heketi-devel at gluster.org
>>> >> >> >> >> > http://lists.gluster.org/mailman/listinfo/heketi-devel
>>> >> >> >> >> >
>>> >> >> >> >
>>> >> >> >> >
>>> >> >> >
>>> >> >> >
>>> >> >
>>> >> >
>>> >
>>> >
>>
>>
>

