[Magnum] TroubleShooting
All trouble shooting for Magnum that I have encountered
1. Failed to create certificates for Cluster
Failed to create certificates for Cluster
This occurs due to Magnum unable to generate SSL/TLS certificate. In Magnum, there are three options to generate and store certificates:
Barbican
Magnum’s own database
Local store
Check /etc/magnum/magnum.conf
and look for [certificates]
section. Make the cert_manager_type
as x509keypair
instead of barbican
and local
like below:
Then apply the change by:
2. unexpected keystone client error occurred: publicURL endpoint for orchestration service in RegionOne region not found
unexpected keystone client error occurred: publicURL endpoint for orchestration service in RegionOne region not found
This occurs due to Magnum not having Heat installed. In order for Magnum to work, we need Heat orchestration service running.
Try
This might or might not work. If the command failed with Parameter CONFIG_CONTROLLER_HOST failed validation: Given host does not listen on port 22:
, reinstall whole OpenStack using PackStack.
Seriously. I have tried
--os-controller-host
arguments withsshd
running in the background, but it seems not to recognize the host.
If you know the exact value for
PW_PLACEHOLDER
you can just install it using packstack with creating anotheranswers.txt
file with the same value, but with skipping compute node and network node.
3. ERROR: The Parameter (octavia_provider) was not defined in template.
ERROR: The Parameter (octavia_provider) was not defined in template.
4. I don’t see any images in templates!
The name or UUID of the base image in Glance to boot the servers for the cluster. The image must have the attribute ‘os_distro’ defined as appropriate for the cluster driver. For the currently supported images, the os_distro names are:
Important
The downloade file is
.xz
file which needs to be extracted before being added into Glance as an image. Usexz -d filename.qcow2.xz
to retrieve theqcow2
file. Glance will NOT warn us whether if this was a proper image file or not.Using
qcow2.xz
instead ofqcow2
will generate an instance but will not run. It will be markedActive
and has been assigned IPs, however will NOT be able to access it viassh
or check status byping
5. The state is stuck at CREATE_IN_PROGRESS
CREATE_IN_PROGRESS
CREATE_IN_PROGRESS
takes a bit of time since it intializes all Kubernetes and CNI as well. However, if it seems like it is stuck for a long time, something might be running awkward.
Case 1) You have instances
In instances tab from Horizon, you can actually see your instances running properly. For example, if you see new-new-k8s-5-4g3hmm2jqalj-master-0
from your instances, this means that Magnum actually generated an instance for the cluster.
However, since something went on dead wrong, it could not continue Kubernetes installation. There were some cases that I had encountered:
Check if those newly created instances are actually running. Meaning that check if we can actually
ping
, andssh
into the instances manually. If not, there is high chance of that instance not being created properly. Magnum will just wait for that instance to actually run properly.Check if you can actually make an instance manually with the same flavor and image that you would like the cluster to have. Once you can create and access the instance using
ssh
andping
, it means the image is okay.Check if you can actually access the cluster instance manually with
ssh
. If you can access it, checkkubectl
command's existence. Or checkps -ef | grep kube
for checking if the installation process is still running.
Case 2) You do not have instance
When you do not see an instance running in the instance tab from Horizon, it means that something went wrong prior to Magnum launching an instance. Normally in this case, the status will not be marked asCREATE_IN_PROGRESS
. It will be marked as something like CREATE_FAILED
. In this case, check other trouble shooting methods in this page
Last updated