Skip to content

BUGS: CloudStack 4.20 VXLAN-EVPN - Management VXLAN generates error - Public VXLAN routes not advertised - is Bug LINKED TO #9920? - Initial VMs running but Grey State for Agent State #10659

@tatay188

Description

@tatay188

problem

Using [Network 1] Only for all three [Management], [guest],[public] Tagged as VXLAN and with cloudbr1

  • **Should I use a separate Network for each ? ? **
    

What should I check, in addition to the official manual: I reread again the instructions and saw videos
https://www.youtube.com/watch?v=9gXEmWbgX2o <<Actually this video shown both VMs are down'
https://www.youtube.com/watch?v=DlJg3LYvIIs << this shows both on green.
https://www.youtube.com/watch?v=vZfHfoYEMdY << this one shows the [Agent State] green while the Vms are in starting mode with a bue dot.
I have Read the https://rohityadav.cloud/blog/cloudstack-kvm/

  • Management servers are two physical separated servers. all Ubuntu 22.04
  • DB is mysql running on a separate server. Mysql 8
  • Hypervisor is running on a separate server. Ubuntu22.0 KVM
  • Cloudstack 4.20.0.0

Issue 1) I can't add a second Management server, No idea why !!
Thank you Wei, as soon i shut the second Management server the initial VMs proxy and secondary storage come up
even when I enable and configure the "MultipleServer" = true and have the servers there.

Issue 2) Management Network is Tagged as VXLAN, the UI shows vlan://untagged - "no typo vlan" every-time i use the UI to change to vxlan://untagged
I did try vxlan://untagged parameter Got the message: Vlan parameter : vxlan://untagged is not in valid format
I did try vxlan:// Got the message: Unable to convert to broadcast URI: vxlan://
I did try vxlan:// Got the message: Vlan parameter : vxlan://1000 is not in valid format'

Image

Issue 3) s-VM and Proxy come up but Public IP addresses are not being routed Advertised.
s-1-VM <control-ip-169.x.x.x> [State-Running] [Agent State-Gray]
v-2-VM <control-ip-169.x.x.y> [State-Running] [Agent State-Gray]
If both [State] and [Agent State] need to be green, What other debug can I enable ??

Is grey radio button a normal state as shown in the picture ?

Image

NOTE The VXLAN parameter for the Public network was taken vxlan://1000, But IP addresses are not routed.

Issue 4) The VNI's I Choose for Guest are showing as VLANs,
I try to use the format of vxlan://1024 for vxlan the start and vxlan://2048 for the end and got the same error messages, not valid format

Issue 5) VXLAN 1000 No route to the public IP addresses
The Public IPs are assigned, but there is no network assigned to it: - Is there maybe a secret button on the global configuration ?

Image

I have to use a Single management server as everything messes UP when I run a second Management server with the Same Database.

I reiterate The initial VMs are running:
s-1-VM <control-ip-169.x.x.x>
v-2-VM <control-ip-169.x.x.y>

I do not see the Public IP anywhere, not even advertised on vni1000 or any other, the public VNI is there was created by the system, but nothing is there - No IP addresses advertised

I used VLAN/VNI vxlan://1000

vxlan1000: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9050 master brvx-1000 state forwarding priority
There are 2 VNETS assigned

virsh domiflist s-1-VM
Interface Type Source Model MAC

vnet3 bridge cloud0 virtio 0e:00:a9:fe:a1:14
vnet4 bridge cloudbr1 virtio 1e:00:3f:00:03:33
vnet5 bridge brvx-1000 virtio 1e:00:b8:00:0c:fe

virsh domifaddr s-1-VM
Name MAC address Protocol Address

----- Nothing here -----

the command arp -n, does not show Public IP addresses neither the management IP addresses, just the private IP addresses and the <control-ip-169.x.x.x>

sh bgp evpn route vni 1000
BGP table version is 1, local router ID is 10.1.1.14
Status codes: s suppressed, d damped, h history, * valid, > best, i - internal
Origin codes: i - IGP, e - EGP, ? - incomplete
EVPN type-1 prefix: [1]:[EthTag]:[ESI]:[IPlen]:[VTEP-IP]:[Frag-id]
EVPN type-2 prefix: [2]:[EthTag]:[MAClen]:[MAC]:[IPlen]:[IP]
EVPN type-3 prefix: [3]:[EthTag]:[IPlen]:[OrigIP]
EVPN type-4 prefix: [4]:[ESI]:[IPlen]:[OrigIP]
EVPN type-5 prefix: [5]:[EthTag]:[IPlen]:[IP]

Network Next Hop Metric LocPrf Weight Path
*> [3]:[0]:[32]:[10.1.1.14]
10.1.1.14(hv1)
32768 i
ET:8 RT:5641:1000

Displayed 1 prefixes (1 paths)

At the Agent side only WARNINGS I am not using uefi.

2025-04-03 00:01:44,744 ERROR [kvm.resource.LibvirtComputingResource] (main:[]) (logid:) uefi properties file not found due to: Unable to find file uefi.properties.
2025-04-03 00:01:46,815 WARN [utils.script.Script] (Agent-Handler-1:[]) (logid:) Process [15539] for command [/bin/bash -c virt-v2v --version ] encountered the error: [127].
2025-04-03 00:01:47,921 ERROR [cloud.agent.Agent] (agentRequest-Handler-5:[]) (logid:09929bd6) Unexpected arch null, expected x86_64
2025-04-03 00:01:46,815 WARN [utils.script.Script] (Agent-Handler-1:[]) (logid:) Execution of process [15539] for command [/bin/bash -c virt-v2v --version ] failed.
2025-04-03 00:01:46,815 WARN [utils.script.Script] (Agent-Handler-1:[]) (logid:) Process [15539] for command [/bin/bash -c virt-v2v --version ] encountered the error: [127].

AT the management the rror is:
2025-04-03 00:00:28,115 DEBUG [c.c.c.ClusterServiceServletImpl] (Cluster-Worker-5:[ctx-c96bca9e]) (logid:3fbff637) Executing ClusterServicePdu with service URL: https://10.1.1.1:9090/clusterservice
2025-04-03 00:21:28,122 ERROR [c.c.c.ClusterServiceServletImpl] (Cluster-Worker-5:[ctx-c96bca9e]) (logid:3fbff637) Exception from : https://10.1.1.1:9090/clusterservice, method : null, exception : javax.net.ssl.SSLPeerUnverifiedException: Certificate for <10.1.1.1> doesn't match any of the subject alternative names: [fde0:f:2897:1:1:0:0:1, 172.1.1.1, fe80:0:0:0:e643:4bff:fe81:9660, cs1.myinternaldomain.int, cloudstack.internal]

versions

  • Management servers are two physical separated servers. all Ubuntu 22.04
  • DB is mysql running on a separate server. Mysql 8
  • Hypervisor is running on a separate server. Ubuntu22.0 KVM
  • Cloudstack 4.20.0.0
  • Primary Storage CEPH 19.2.1.1 RBD
  • Secondary Storage EMC NFS

The steps to reproduce the bug

  1. As described above

...

What to do about it?

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions