Replies: 4 comments 15 replies
-
|
@zidz Can you share the managment-server.log from all the management servers covering the time of your tests. Also, share the name of the VM ( which was manually stopped) |
Beta Was this translation helpful? Give feedback.
-
|
@zidz, can you try to hard power down your KVM host and see if those VMs running on this host will failover? Also ensure that your VM is created with "HA enabled=true". In my experience, if you shutdown gracefully, these VMs will not failover. |
Beta Was this translation helpful? Give feedback.
-
|
A note, the Event log for the VM in question looks like this after it's killed (-9), and it's keep doing that in the same frequence until it's stopped in the UI: |
Beta Was this translation helpful? Give feedback.
-
|
Currently there is a issue with HOST HA and VM HA sync, investigation is going to find the root cause I would recommend to use only VM HA feature for now |
Beta Was this translation helpful? Give feedback.


Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I do not know if I have unrealistic expectations of Cloudstack 4.22 in regards to VM HA.
I'm running Cloudstack on hardware with no IPMI and expect some failover when a KVM instance is manually stopped on the host, is unexpectedly killed or dies. Cloudstack lists the VM as running and nothing happens, the VM will not be restarted to any extent. What I expect is that the VM should be detected as not running when it's expected to and should be restarted, preferably on another host. Note that I've waited over an hour to give all checks time to trigger a restart of a killed VM.
The current setup is running on NFS for primary and secondary right now. I expect the next cluster to have its primary on Ceph RBD and secondary on NFS hosted by Ceph.
I've read the HA documentation and I've deactivated all Host HA, as this else will be taken into account before handling the VM HA (As far as I understand). Checked that the files in the KVMHA are updated with timestamps correctly. NTP is running correctly. All VM's do have HA activated, all compute offerings I use have HA enabled, I even forced HA for VM's in the Global configuration.
Please tell me how VM HA is intended to work, if my expectations are off and I would be super grateful for VM HA test examples I can do to trigger the VM HA functionality to get the correct expectations of the VM HA functionality.
I've read earlier discussions here in the Q&A section and other sources to try to solve this HA puzzle, but now I'm here asking for assistance..
Beta Was this translation helpful? Give feedback.
All reactions