cloudstack

Commit Graph

Author	SHA1	Message	Date
Suresh Kumar Anaparti	983f164c57	Fixed src datastore on copy check for PowerFlex/ScaleIO storage driver (#9310 )	2024-06-28 18:46:06 +05:30
Abhisar Sinha	644f3a3f48	Add, Delete Storage Pool commands should be able execute on a host in maintenance (#9301 ) * Restart agent when host comes out of maintenance * Don't send CreateStoragePoolCommand to hosts in maintenance mode * CreateStoragePoolCommand can run when host in maintenance. Reverted the change to restart agent when host was already up and in maintenance * Reverted changes done to ResourceManagerImplTest	2024-06-28 18:18:08 +05:30
Suresh Kumar Anaparti	46f672563e	Improve migration of external VMware VMs into KVM cluster (#8815 ) * Create/Export OVA file of the VM on external vCenter host, to temporary conversion location (NFS) * Fixed ova issue on untar/extract ovf from ova file "tar -xf" cmd on ova fails with "ovf: Not found in archive" while extracting ovf file * Updated VMware to KVM instance migration using OVA * Refactoring and cleanup * test fixes * Consider zone wide pools in the destination cluster for instance conversion * Remove local storage pool support as temporary conversion location - OVA export not possible as the pool is not accessible outside host, NFS pools are supported. * cleanup unused code * some improvements, and refactoring * import nic unit tests * vmware guru unit tests * Separate clone VM and create template file for VMware migration - Export OVA (of the cloned VM) to the conversion location takes time. - Do any validations with cloned VM before creating the template (and fail early). - Updated unit tests. 
* Check conversion support on host before clone vm / create template on vmware (and fail early) * minor code improvements * Auto select the host with instance conversion capability * Skip instance conversion supported response param for non-KVM hosts * Show supported conversion hosts in the UI * Skip persistence map update if network doesn't exist * Added support to export OVA from KVM host, through ovftool (when installed in KVM host) * Updated importvm api param 'usemsforovaexport' to 'forcemstodownloadvmfiles', to be generic * Updated hardcoded UI messages with message labels * Updated UI to support importvm api param - forcemstodownloadvmfiles * Improved instance conversion support checks on ubuntu hosts, and for windows guest vms * Use OVF template (VM disks and spec files) for instance conversion from VMware, instead of OVA file - this would further increase the migration performance (as it reduces the time for OVA preparation / archiving of the VM files into a single file) * OVF export tool parallel threads code improvements * Updated 'convert.vmware.instance.to.kvm.timeout' config default value to 3 hrs * Config values check & code improvements * Updated import log, with time taken and vm details * Support for parallel downloads of VMware VM disk files while exporting OVF from MS, and other changes below. - Skip clone for powered off VMs - Fixes to support standalone host (with its default datacenter) - Some code improvements * rebase fixes * rebase fixes * minor improvement * code improvements - threads configuration, and api parameter changes to import vm files * typo fix in error msg	2024-06-27 21:14:13 +05:30
Abhishek Kumar	53faf0f66a	xenserver: attach regular iso with configdrive (#9216 ) * xenserver: attach regular iso with configdrive Fixes #7902 This PR allows attaching a regular ISO to a VM when it already has the config drive ISO attached. The config-drive ISO is now attached with the SR name-label <VM-NAME>-CONFIGDRIVE-ISO, while regular ISOs continue to attach with SR name-label <VM-NAME>-ISO. VMs that already had the configdrive ISO attached before this fix will return an appropriate error and will need to be stopped and started.	2024-06-27 16:10:33 +05:30
Wei Zhou	22cd00ffb1	veeam: fix issues with PreSetup and DVS and Solidfire (#9256 ) * Veeam: find storage pool by path for PreSetup and VMFS * Veeam: support VMware distributed virtual switch * Veeam: sync volumes on Solidfire after backup restoration user faced the issue that backup is restored but the DATA disk is gone (ROOT disk is ok) ``` 2024-05-03 12:00:32,868 ERROR [o.a.c.b.BackupManagerImpl] (API-Job-Executor-13:ctx-aa8a1d85 job-149661 ctx-73328567) (logid:6510cf06) Failed to import VM [vmInternalName: i-169-9679-VM] from backup restoration [{"backupType":"Full","externalId":"821ca400-a5da-4282-bf3f-7c7e38a6cdb4","id":257,"uuid":"69399101-5cbd-461c-8a48-f0c70eac0b24","vmId":9679}] with hypervisor [type: VMware] due to: [Couldn't find storage pool -iqn.2010-01.com.solidfire:3p53.data-9679.221-0]. ``` On managed storage, the datastore name of DATA disk is determined by the iscsi_name of the volume. * Veeam: set correct path for DATA disks on solidfire	2024-06-26 18:02:25 +05:30
slavkap	6c06e85c80	Temporarily backup StorPool volume before expunge (#8843 ) * Temporarily backup StorPool volume before expunge Sometimes users delete volumes by mistake. This enhancement provides a way to back up the volume before it is deleted. The user will be able to see the snapshot in CloudStack UI/CLI and create only a volume from it. A task will check (by default every 5 minutes) if the snapshots are deleted from StorPool. Global settings to enable the delayed delete option: `storpool.delete.after.interval` - The interval (in seconds) after which the StorPool snapshot will be deleted `storpool.list.snapshots.delete.after.interval` - The interval (in seconds) to fetch the StorPool snapshots with deleteAfter flag Minor fix when deleting snapshots * added Apache licence * addressed comments	2024-06-26 13:58:04 +05:30
Wei Zhou	ae4b6d3b6c	CKS/calico: set arp_ignore and arp_announce to 0 in k8s controller/nodes (#9186 )	2024-06-26 12:18:50 +05:30
dahn	6d7c042bc1	Accept a role ID on linking an account to LDAP (#8236 ) * accept role on link account to ldap * reformat tests * validation * Update plugins/user-authenticators/ldap/src/main/java/org/apache/cloudstack/api/command/LinkAccountToLdapCmd.java Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2024-06-26 01:26:28 +05:30
SadiJr	7f0d9a0304	[Veeam] Check for failures in the restore process (#7224 ) * Validate failure state in Veeam restore process * Address Daan review, and properly call method * Address bryan's reviews * remove return Co-authored-by: SadiJr <sadi@scclouds.com.br> Co-authored-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-06-26 00:41:38 +05:30
Abhisar Sinha	4eb43651e2	Ability to specify NFS mount options while adding a primary storage and modify them on a pre-existing primary storage (#8947 ) * Ability to specify NFS mount options while adding a primary storage and modify it later * Pull 8947: Rename all occurrences of nfsopt to nfsMountOpt and added nfsMountOpts to ApiConstants * Pull 8947: Refactor code - move into separate methods * Pull 8947: CollectionsUtils.isNotEmpty and switch statement in LibvirtStoragePoolDef.java * Pull 8947: UI - cancel maintenance will remount the storage pool and apply the options * Pull 8947: UI - moved edit NFS mount options to edit Primary Storage form * Pull 8947: UI - moved 'NFS Mount Options' to below 'Type' in dataview * Pull 8947: Fixed message in AddPrimaryStorage.vue * Pull 8947: Convert _nfsmountOpts to Set in libvirtStoragePoolDef * Pull 8947: Throw exception and log error if mount fails due to incorrect mount option * Pull 8947: Added UT and moved integration test to component/maint * Pull 8947: Review comments * Pull 8947: Removed password from integration test * Pull 8947: move details allocation to inside the if loop in getStoragePoolNFSMountOpts * Pull 8947: Fixed a bug in AddPrimaryStorage.vue * Pull 8947: Pool should remain in maintenance mode if mount fails * Pull 8947: Removed password from integration test * Pull 8947: Added UT * Pull 8875: Fixed a bug in CloudStackPrimaryDataStoreLifeCycleImplTest * Pull 8875: Fixed a bug in LibvirtStoragePoolDefTest * Pull 8947: minor code restructuring * Pull 8947 : added some ut for coverage * Fix LibvirtStorageAdapterTest UT	2024-06-25 23:45:35 +05:30
Suresh Kumar Anaparti	620ed164d8	VMware: Improve error messaging / logs when starting non-user VMs, and secondary storage not available or doesn't have enough capacity (#9207 )	2024-06-25 12:25:42 +05:30
Rene Glover	6ee6603359	Updates to HPE-Primera and Pure FlashArray Drivers to use Host-based VLUN Assignments (#8889 ) * Updates to change PUre and Primera to host-centric vlun assignments; various small bug fixes * update to add timestamp when deleting pure volumes to avoid future conflicts * update to migrate to properly check disk offering is valid for the target storage pool * Updates to change PUre and Primera to host-centric vlun assignments; various small bug fixes * update to add timestamp when deleting pure volumes to avoid future conflicts * update to migrate to properly check disk offering is valid for the target storage pool * improve error handling when copying volumes to add precision to which step failed * rename pure volume before delete to avoid conflicts if the same name is used before its expunged on the array * remove dead code in AdaptiveDataStoreLifeCycleImpl.java * Fix issues found in PR checks * fix session refresh TTL logic * updates from PR comments * logic to delete by path ONLY on supported OUI * fix to StorageSystemDataMotionStrategy compile error * change noisy debug message to trace message * fix double callback call in handleVolumeMigrationFromNonManagedStorageToManagedStorage * fix for flash array delete error * fix typo in StorageSystemDataMotionStrategy * change copyVolume to use writeback to speed up copy ops * remove returning PrimaryStorageDownloadAnswer when connectPhysicalDisk returns false during KVMStorageProcessor template copy * remove change to only set UUID on snapshot if it is a vmSnapshot * reverting change to UserVmManagerImpl.configureCustomRootDiskSize * add error checking/simplification per comments from @slavkap * Update engine/storage/datamotion/src/main/java/org/apache/cloudstack/storage/motion/StorageSystemDataMotionStrategy.java Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com> * address PR comments from @sureshanaparti --------- Co-authored-by: GLOVER RENE 
<rg9975@cs419-mgmtserver.rg9975nprd.app.ecp.att.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2024-06-25 10:35:39 +05:30
slavkap	8b07b66f14	Fix volume snapshot of encrypted NFS/StorPool volume (#8873 ) * Fix volume snapshot of encrypted NFS/StorPool volume * remove comments * removed invoking the real qemu convert command * fix UnsatisfiedLink error in unit tests * addressed comments extracted method	2024-06-24 13:09:21 +05:30
Suresh Kumar Anaparti	c17aa0d9ad	Import Remote KVM VM logging improvements (#9284 )	2024-06-24 11:34:37 +05:30
Vishesh	6a518e29b7	Allow deletion of external managed cks nodes (#9183 ) * Allow deletion of external managed cks nodes * Fix unit tests * Update plugins/integrations/kubernetes-service/src/main/java/com/cloud/kubernetes/cluster/KubernetesClusterHelperImpl.java Co-authored-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-23 22:08:13 +05:30
Rene Peinthor	f4612c51ec	libvirtstorage: Make sure netfs storage was really mounted (#8887 )	2024-06-23 19:41:02 +05:30
Vishesh	674495b162	Fixup startVM on simulator (#9199 )	2024-06-21 15:53:45 +05:30
Suresh Kumar Anaparti	5ab23cd9c9	Timeout config to copy the disks of remote KVM instance while importing the instance from an external host (#9213 ) * Added timeout config to copy the disks of remote KVM instance while importing the instance from an external host * Updated copy config units to mins * Cleanup remote converted file and local file when copy failed	2024-06-21 10:28:18 +05:30
Abhishek Kumar	097359bef9	plugins/shutdown: fix triggerShutdown scheduling and response (#9276 ) Earlier the triggerShutdown API would immediately shutdown the MS and if it is the same MS on which API is called it would lead to error in the API call. This change adds a delay to the process so the MS would be able to send response to the API. Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-21 10:12:16 +05:30
Michael Wodniok	7dce3d87d4	[linstor] Fix revertSnapshot (#9271 ) Signed-off-by: Michael Wodniok (WorNet AG) <michael.wodniok@wor.net> Co-authored-by: Michael Wodniok <michael.wodniok@wor.net>	2024-06-20 10:52:49 +02:00
Daan Hoogland	3997e59678	Merge release branch 4.18 to 4.19 * 4.18: Update extraconfig for platform param in xen/xcpng (#9248)	2024-06-19 18:55:29 +02:00
Harikrishna	2315a73a20	User friendly name of Downloaded Templates Volumes and ISOs (#9252 )	2024-06-19 12:47:43 +02:00
Wei Zhou	227c15624d	vxlan: do not create duplicated network for private gateway (#9232 )	2024-06-19 09:44:49 +03:00
Suresh Kumar Anaparti	cc52b38e54	Update extraconfig for platform param in xen/xcpng (#9248 ) * Update extraconfig for platform param in xen/xcpng * Fix map param key, not to replace '-' with '_' (replace only applicable to param / map-param) * Added unit tests * Add license for tests file	2024-06-18 23:39:50 +05:30
Abhisar Sinha	591cc4f002	Add action button to enable/disable Oauth provider (#9242 )	2024-06-18 08:32:13 +02:00
Wei Zhou	f360f7048d	vmware: do not tear down vm disks if deploy-as-is vm has vm snapshots (#9243 )	2024-06-18 08:28:20 +02:00
Bryan Lima	00fe25ab01	Fix allocation of VMs with multiple clusters (#8611 ) * Fix allocation of VMs with multiple clusters * Readd debug guard	2024-06-14 13:54:01 +03:00
Abhishek Kumar	ce9b2c52f3	cks: fix events (#9070 ) Fixes #8043 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-14 12:22:39 +05:30
Vishesh	74f5e52e6e	Fix unit test failure (#9238 )	2024-06-13 16:06:35 +05:30
Rene Peinthor	37f4398c80	linstor: Support VM-Instance Disk snapshots (#8796 ) * linstor: update to java-linstor 0.5.1 * linstor: Support VM-Instance Disk snapshots This adds VM-Instance disk snapshot support for Linstor primary storage. Instance snapshots are stored on the used Linstor storage pool backend and can be converted into regular volume snapshots and also reverted. Instance VM snapshots are not fully atomic but with the create multi snapshot feature as good as it gets. Snapshots are done over multiple volumes in the same devicemanager run.	2024-06-13 15:26:33 +05:30
Abhishek Kumar	2fef0a32bc	cks: fix list apis response count (#8701 ) * cks: fix list apis count Fixes count value in listKubernetesClusters and listSupportedKubernetesVersions APIs response.	2024-06-13 13:08:19 +05:30
Rohit Yadav	78ace3a750	saml: introduce saml2.check.signature (#9219 ) Administrators should ensure that the IDP configuration has a signing certificate for the actual signature check to be performed. In addition to this, this change introduces a new global setting `saml2.check.signature` which can deliberately fail a SAML login attempt when the SAML response is missing a signature. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-06-13 11:30:33 +05:30
Wei Zhou	b2ef53b8a2	kvm: replace ISO path in vm XML configuration during vm migration (#9212 ) * kvm: replace ISO path in vm XML configuration during vm migration * Update 9212: address comments * kvm: fix vm migration if there are multiple image stores	2024-06-12 16:01:23 +02:00
Suresh Kumar Anaparti	4ec0f823cf	ScaleIO volume live migration - use usable bytes from source disk to format the destination disk (#9174 )	2024-06-12 13:58:10 +05:30
Suresh Kumar Anaparti	2e3f76ec03	Improve error messaging / logs when listing VMs on the remote KVM host (for import) (#9204 )	2024-06-11 14:48:21 +02:00
Harikrishna	acae5c5b9e	kvm: Update the java doc for the method disconnectPhysicalDiskByPath (#9210 ) This PR addresses the issue #8789 The original issue is that the disconnectPhysicalDiskByPath() implementation in FibreChannelAdaptor always returns true irrespective of the success of the operation. This was already fixed in the PR #8889 . Ideally this method has to be called after choosing the right adapter based on the storage pool type of the volume path, but currently it is just called in a loop. `05b9b6e2e7/plugins/hypervisors/kvm/src/main/java/com/cloud/hypervisor/kvm/storage/KVMStoragePoolManager.java (L200-L212)` I tried to avoid looping over all adapters by passing the storage pool type to the calling cleanup() method, but that touches code all over (which I fear would create other regressions). Instead, I feel we can keep the current approach, since the Fibrechannel adapter has already been fixed. In this PR I've added the java doc explaining the method and situation.	2024-06-11 14:44:46 +05:30
Abhishek Kumar	43ab8a9367	cks,ui: fix npe and check for disable zone (#9105 ) Fixes #8962	2024-06-11 14:36:11 +05:30
Abhishek Kumar	7aacbcb559	api: listApis should return params based on caller (#8973 )	2024-06-11 11:28:08 +05:30
Abhishek Kumar	10f4de0318	kvm: consider provisioning type for local data volumes (#9141 ) Fixes #8644 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-10 11:38:31 +03:00
Daan Hoogland	c779b1c616	Merge branch '4.18' into 4.19	2024-06-06 11:24:09 +02:00
Abhishek Kumar	91c7bc722f	server,cks: check if vm is cks node during vm destroy (#9057 ) Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-06 10:54:02 +02:00
Rene Peinthor	2339412f73	linstor: make getDevicePath more robust (#9143 )	2024-06-06 09:49:03 +02:00
João Jandre	631d6ad09b	Do not retrieve VM's stats on normal VM listing (#8782 ) * Do not retrieve VM's stats on normal VM listing * Add config to control the behavior * address reviews	2024-06-05 17:45:28 +05:30
Vishesh	87b55af197	Fixup response code on incorrect credentials (#8671 )	2024-05-30 08:48:53 +02:00
Rene Peinthor	f80d205284	linstor: Fix volume format and make resource available on copy target (#8811 ) Linstor primary storage forgot to make sure the volume download/copy target has a Linstor resource available.	2024-05-06 11:00:22 +02:00
Daan Hoogland	92ba476593	Merge release branch 4.18 to 4.19 * 4.18: linstor: disconnect-disk also search for resource name in Linstor (#9035)	2024-05-06 10:35:27 +02:00
Rene Peinthor	ea11128cb3	linstor: disconnect-disk also search for resource name in Linstor (#9035 ) disconnectPhysicalDisk(String, KVMStoragePool) seems to call the plugin with the resource name instead of the device path, so we also have to search for resource names while cleaning up.	2024-05-06 09:05:31 +02:00
Rohit Yadav	3de1f8b4ba	Merge remote-tracking branch 'origin/4.18' into 4.19 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-29 13:44:34 +05:30
Rene Peinthor	9d5d4e5564	linstor: cleanup diskless nodes on disconnect (#8790 )	2024-04-26 14:25:07 +02:00
João Jandre	cec6ade257	change live migration API used on kvm (#8952 )	2024-04-25 09:35:25 +02:00
Daan Hoogland	0514caedd6	Merge release branch 4.18 to 4.19 * 4.18: packaging: move contrail network plugin to noredist (#8932)	2024-04-24 11:10:00 +02:00
Wei Zhou	5f6acca049	packaging: move contrail network plugin to noredist (#8932 )	2024-04-24 10:28:59 +02:00
Wei Zhou	0b857def68	New feature: Import/Unmanage DATA volume from storage pool (#8808 )	2024-04-23 16:05:59 +02:00
Rohit Yadav	0fa71f5696	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-04-23 15:21:44 +05:30
Rene Peinthor	405aac38bc	linstor: Only set allow-two-primaries if resource is already in use (#8802 ) For live migrate we need the allow-two-primaries option, but we don't know exactly if we are called for a migration operation. Now also check if at least any of the resources is in use somewhere and only then set the option.	2024-04-22 10:04:05 +02:00
Rohit Yadav	5a52ca78ae	kvm: export sysinfo for arm64 domains for cloud-init to work (#8940 ) This fixes a limitation for arm64/aarch64 KVM hosts to correctly export the product name via sysconfig attribute. Without this `cloud-init` doesn't function correctly on arm64 platforms. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-19 21:23:49 +02:00
Daan Hoogland	78e07cff62	Merge release branch 4.18 to 4.19 * 4.18: protect against null-path (#8915) UI: Fix missing locale strings for Status widget (#8792) Add a shutdownhook to remove jobs owned by the process (#8896)	2024-04-19 12:43:34 +02:00
dahn	7affbb1dac	protect against null-path (#8915 ) Co-authored-by: Vladimir Dombrovski <vladimir.dombrovski@bso.co> Co-authored-by: Vishesh <vishesh92@gmail.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2024-04-19 12:23:31 +02:00
João Jandre	8a101fbbc1	Updating pom.xml version numbers for release 4.18.3.0-SNAPSHOT Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-04-17 11:11:57 -03:00
Rohit Yadav	a55ba96a08	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-04-16 16:10:33 +05:30
João Jandre	154566f914	Updating pom.xml version numbers for release 4.18.2.0 Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-04-12 08:25:04 -03:00
Rene Peinthor	6cd5c6a1d0	linstor: Do not pretend handling disconnect paths that are non Linstor (#8897 )	2024-04-12 08:23:15 -03:00
Suresh Kumar Anaparti	d3e020a545	Mark libvirt events experimental, add properties flag (#8825 ) * Mark libvirt events experimental, add properties flag * unit test fixes --------- Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-04-11 17:06:33 +05:30
Vishesh	730cc5d5b8	Change iops on offering change (#8872 ) * Change IOPS on disk offering change * Remove iops & bandwidth limits before copying template * minor refactor * Handle diskOfferingDetails * Fixup	2024-04-11 17:01:55 +05:30
Abhishek Kumar	ff3e9bd821	engine-storage: control download redirection Add a global setting to control whether redirection is allowed while downloading templates and volumes Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-04-04 14:11:05 +05:30
Wei Zhou	939d0b9011	engine-storage: control download redirection Add a global setting to control whether redirection is allowed while downloading templates and volumes core: some changes on SimpleHttpMultiFileDownloader similar as HttpTemplateDownloader Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> (cherry picked from commit `b1642bc3bf`) Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-04 11:19:20 +05:30
Marcus Sorensen	2e88eb45a3	Update mysql-connector version (#8753 ) Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-03-21 18:09:06 +05:30
Vishesh	0043540fa3	Use join instead of views (#8321 )	2024-03-18 18:08:19 +01:00
Abhishek Kumar	ffd59720dd	storage,plugins: delegate allow zone-wide volume migration check and access grant check to storage drivers (#8762 ) * storage,plugins: delegate allow zone-wide volume migration check and access grant to storage drivers The following checks have been delegated to storage drivers: - For volumes on zone-wide storage, whether they need storage migration when the VM is migrated - Whether the volume requires grant access Apply fixes in resolving PrimaryDataStore * add tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * unused import Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * Update engine/orchestration/src/test/java/org/apache/cloudstack/engine/orchestration/VolumeOrchestratorTest.java --------- Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-03-18 17:28:14 +05:30
Rene Peinthor	001c769054	Linstor 4.19 fix selecting non enabled hosts (#8653 ) * linstor: cleanup resource if copy from template failed * linstor: do not use non enabled hosts for copy operations	2024-03-08 13:52:49 +05:30
Daan Hoogland	d99b1b9c2d	Merge branch '4.18' into 4.19	2024-03-08 08:19:49 +01:00
Henrique Sato	223a9b8031	Quota tariff events (#8030 ) Co-authored-by: Henrique Sato <henrique.sato@scclouds.com.br>	2024-03-06 17:33:39 +01:00
Wei Zhou	a7ec8738a2	kvm: fix NPE while import KVM VMs from other hosts (#8720 )	2024-03-04 09:46:28 +01:00
Abhishek Kumar	9fd410be36	Merge remote-tracking branch 'apache/4.18' into 4.19	2024-03-01 17:34:27 +05:30
Harikrishna	c462be1412	New API "checkVolume" to check and repair any leaks or issues reported by qemu-img check (#8577 ) * Introduced a new API checkVolumeAndRepair that allows users or admins to check and repair if any leaks observed. Currently this is supported only for KVM * some fixes * Added unit tests * addressed review comments * add repair volume while granting access * Changed repair parameter to accept both leaks/all * Introduced new global setting volume.check.and.repair.before.use to do volume check and repair before VM start or volume attach operations * Added volume check and repair changes only during VM start and volume attach operations * Refactored the names to look similar across the code * Some code fixes * remove unused code * Renamed repair values * Fixed unit tests * changed version * Address review comments * Code refactored * used volume name in logs * Changed the API to Async and the setting scope to storage pool * Fixed exit value handling with check volume command * Fixed storage scope to the setting * Fix volume format issues * Refactored the log messages * Fix formatting	2024-02-29 14:41:49 +05:30
dahn	56e0450526	Logging improvements on migration in the VmwareResource (#8300 )	2024-02-28 15:29:35 +05:30
Daan Hoogland	f4987bf8ee	Merge release branch 4.18 to 4.19 * 4.18: Storage plugin support to check if volume on datastore requires access for migration (#8655) CKS: fix /opt/bin/deploy-cloudstack-secret in CKS control nodes (#8697)	2024-02-26 15:53:11 +01:00
Suresh Kumar Anaparti	f731fe882c	Storage plugin support to check if volume on datastore requires access for migration (#8655 ) * Check if volume on datastore requires access for migration, and grant/revoke volume access if requires * Updated default implementation for requiresAccessForMigration method in PrimaryDataStoreDriver	2024-02-26 20:16:31 +05:30
Wei Zhou	18c3d470c6	CKS: fix /opt/bin/deploy-cloudstack-secret in CKS control nodes (#8697 )	2024-02-26 14:21:26 +01:00
Abhishek Kumar	2a56c61ade	Merge remote-tracking branch 'apache/4.18' into 4.19	2024-02-26 12:01:26 +05:30
Wei Zhou	8d4b4dcec4	CKS: add kube config path in extra control nodes (#8658 )	2024-02-16 15:01:27 +01:00
GaOrtiga	6f3e4e6302	fix_filter_and_pagination (#8306 ) Co-authored-by: Gabriel <gabriel.fernandes@scclouds.com.br>	2024-02-16 11:15:55 +01:00
Rohit Yadav	bda49ab08f	Merge remote-tracking branch 'shapeblue/merged-4-18' into 4.19	2024-02-13 12:54:24 +05:30
Vishesh	a8028eecbd	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-02-13 11:44:20 +05:30
Vishesh	1955d8f3db	Add advance settings to fine tune DRS imbalance calculation (#8521 ) * Use free/total instead of free metric to calculate imbalance * Filter out hosts for condensed while checking imbalance * Make DRS more configurable * code refactor * Add unit tests * fixup * Fix validation for drs.imbalance.condensed.skip.threshold * Add logging and other minor changes for drs * Add some logging for drs * Change format for drs imbalance to string * Show drs imbalance as percentage * Fixup label for memorytotal in en.json	2024-02-13 11:18:53 +05:30
Rene Peinthor	70b634fff2	Linstor: add HA support and small cleanups (#8407 ) * linstor: Outline get storagepools from resourcegroup into function * linstor: move getHostname() to kvm/Pool and reimplement * linstor: implement CloudStack HA support	2024-02-13 11:16:12 +05:30
dahn	672206c312	kvm: ITCO watchdog added (#8282 ) * ITCO watchdog added * add inject-nmi action * Update plugins/hypervisors/kvm/src/main/java/com/cloud/hypervisor/kvm/resource/LibvirtVMDef.java Co-authored-by: Wei Zhou <weizhou@apache.org> --------- Co-authored-by: Wei Zhou <weizhou@apache.org>	2024-02-12 08:54:39 +01:00
Wei Zhou	af2e277999	Merge remote-tracking branch 'apache/4.18' into 4.19	2024-02-09 11:53:39 +01:00
Rene Peinthor	393f3d7727	linstor: use relative hostname path (#8633 ) As described in issue #8310 some older distributions don't have hostname in /usr/bin so rely on PATH resolving	2024-02-09 11:49:20 +01:00
Rohit Yadav	a1f547a011	Merge remote-tracking branch 'origin/4.18' into 4.19 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com> Conflicts: plugins/storage/volume/linstor/src/main/java/org/apache/cloudstack/storage/datastore/util/LinstorUtil.java	2024-02-09 00:10:34 +05:30
slavkap	1d1b332141	remove StorPool tags from detached volumes (#8377 ) * remove tags from detached volumes * Address comments * address comments * Address comments	2024-02-09 00:05:34 +05:30
Rene Peinthor	56f0448f0d	Linstor fix migration while node offline (#8610 ) * linstor: Add util method getBestErrorMessage from main * linstor: failed remove of allow-two-primaries is no fatal error * linstor: Fix failure if a Linstor node is down while migrating If a Linstor node is down while migrating resource, allow-two-primaries setting will fail because we can't reach the downed node. But it will still set the property on the other nodes and migration should work. We now just report an error instead of completely failing.	2024-02-08 23:57:38 +05:30
Rohit Yadav	0d36098c76	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-02-07 14:20:39 +05:30
Wei Zhou	69e8ebc03f	CKS: retry if unable to drain node or unable to upgrade k8s node (#8402 ) * CKS: retry if unable to drain node or unable to upgrade k8s node I tried CKS upgrade 16 times, 11 of 16 upgrades succeeded. 2 of 16 upgrades failed due to ``` error: unable to drain node "testcluster-of7974-node-18c8c33c2c3" due to error:[error when evicting pods/"cloud-controller-manager-5b8fc87665-5nwlh" -n "kube-system": Post "https://10.0.66.18:6443/api/v1/namespaces/kube-system/pods/cloud-controller-manager-5b8fc87665-5nwlh/eviction": unexpected EOF, error when evicting pods/"coredns-5d78c9869d-h5nkz" -n "kube-system": Post "https://10.0.66.18:6443/api/v1/namespaces/kube-system/pods/coredns-5d78c9869d-h5nkz/eviction": unexpected EOF], continuing command... ``` 3 of 16 upgrades failed due to ``` Error from server: error when retrieving current configuration of: Resource: "rbac.authorization.k8s.io/v1, Resource=roles", GroupVersionKind: "rbac.authorization.k8s.io/v1, Kind=Role" Name: "kubernetes-dashboard", Namespace: "kubernetes-dashboard" from server for: "/mnt/k8sdisk//dashboard.yaml": etcdserver: leader changed ``` * CKS: remove tests of creating/deleting HA clusters as they are covered by the upgrade test * Update PR 8402 as suggested * test: remove CKS cluster if fail to create or verify	2024-02-06 11:14:10 +01:00
Wei Zhou	54225ecd15	Veeam: fix incompatible types: String cannot be converted to Date	2024-02-05 10:50:16 +01:00
Wei Zhou	b8904f75dd	Merge remote-tracking branch 'apache/4.18' into 4.19	2024-02-05 10:08:31 +01:00
slavkap	94c8b1da5c	Option to create StorPool primary storage with a valid URL (#8356 ) * Option to create primary storage with a valid URL * check if the scheme is valid	2024-02-05 14:21:13 +05:30
Marcus Sorensen	9f1b34aeb2	Fix libvirt domain event listener by properly processing events (#8437 ) * Fix libvirt domain event listener by properly processing events * Add javadoc for setupEventListener --------- Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-02-05 13:30:10 +05:30
Abhishek Kumar	a7b97ff3b0	Updating pom.xml version numbers for release 4.19.1.0-SNAPSHOT Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-02-02 18:06:04 +05:30
Lucas Martins	1c98b5a4e5	Change Cryptsetup validation (#8482 ) Co-authored-by: lucas.martins.scclouds <lucas.martins@scclouds.com.br>	2024-02-01 09:43:28 +01:00
Wei Zhou	b34f093137	veeam: fix some issues with restoring volume from backup and attaching it to VM (#8570 ) * veeam: detach only the restored volume during backup restore Steps to reproduce the issue 1. create a VM (A) with ROOT and DATA disk 2. assign to a backup offering 3. create backup 4. create another VM (B) 5. restore the DATA disk of VM A, and attach to VM B 6. When operation is done, check the datastore Without this change, the ROOT image is not removed and left over on the datastore. ``` [root@ref-trl-5933-v-Mr8-wei-zhou-esxi2:/vmfs/volumes/5f60667d-18d828eb] ls -l /vmfs/volumes/5f60667d-18d828eb/CS-RSTR-dfb6f21c-a941-49db-9963-4f0286a17dac total 1784840 -rw------- 1 root root 5242880000 Jan 24 09:23 ROOT-722_2-flat.vmdk -rw------- 1 root root 499 Jan 24 09:23 ROOT-722_2.vmdk ``` With this change, the whole temporary vm has been destroyed. ``` [root@ref-trl-5933-v-Mr8-wei-zhou-esxi2:/vmfs/volumes/5f60667d-18d828eb] ls -l /vmfs/volumes/5f60667d-18d828eb/CS-RSTR-734bee3b-640c-4ff0-a34b-bc45358565b2 ls: /vmfs/volumes/5f60667d-18d828eb/CS-RSTR-734bee3b-640c-4ff0-a34b-bc45358565b2: No such file or directory ``` * veeam: fix wrong disk size in debug message * veeam: sync backup repository after operations are done got exception of some operations which succeeds due to the following error ``` 2024-01-19 10:59:52,846 DEBUG [o.a.c.b.v.VeeamClient] (API-Job-Executor-42:ctx-716501bb job-4373 ctx-2359b76d) (logid:b5e19a17) Veeam response for PowerShell commands [PowerShell Import-Module Veeam.Backup.PowerShell -WarningAction SilentlyContinue;$restorePoint = Get-VBRRestorePoint ^\| Where-Object { $_.Id -eq '1d99106a-b5c8-4a1e-958d-066a987caa5f' };if ($restorePoint) { Remove-VBRRestorePoint -Oib $restorePoint -Confirm:$false;$repo = Get-VBRBackupRepository;Sync-VBRBackupRepository -Repository $repo;} else { ; Write-Output 'Failed to delete'; Exit 1;}] is: [^M Restore Type Job Name State Start Time End Time Description ^M ------------ -------- ----- ---------- -------- 
----------- ^M ConfResynchronize Configuration Dat... Starting 19/01/2024 10:59:52 01/01/1900 00:00:00 ^M ^M ^M Remove-VBRRestorePoint : Win32 internal error "Access is denied" 0x5 occurred while reading the console output buffer. ^M Contact Microsoft Customer Support Services.^M At line:1 char:196^M + ... orePoint) { Remove-VBRRestorePoint -Oib $restorePoint -Confirm:$false ...^M + ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^M + CategoryInfo : ReadError: (:) [Remove-VBRRestorePoint], HostException^M + FullyQualifiedErrorId : ReadConsoleOutput,Veeam.Backup.PowerShell.Cmdlets.RemoveVBRRestorePoint^M ^M ]. ``` * veeam: fix unable to detach volume when restore backup and attach to vm then detach the volume It also happened when destroy the original or backup VM ``` 2024-01-24 10:10:03,401 ERROR [c.c.s.r.VmwareStorageProcessor] (DirectAgent-74:ctx-95b24ac7 10.0.35.53, job-25995/job-25996, cmd: DettachCommand) (logid:7260ffb8) Failed to detach volume! java.lang.RuntimeException: Unable to access file [de52fdd3386b3d67b27b3960ecdb08f4] i-2-723-VM/7c2197c129464035bab062edec536a09-flat.vmdk at com.cloud.hypervisor.vmware.util.VmwareClient.waitForTask(VmwareClient.java:426) at com.cloud.hypervisor.vmware.mo.DatastoreMO.moveDatastoreFile(DatastoreMO.java:290) at com.cloud.storage.resource.VmwareStorageLayoutHelper.syncVolumeToRootFolder(VmwareStorageLayoutHelper.java:241) at com.cloud.storage.resource.VmwareStorageProcessor.attachVolume(VmwareStorageProcessor.java:2150) at com.cloud.storage.resource.VmwareStorageProcessor.dettachVolume(VmwareStorageProcessor.java:2408) at com.cloud.storage.resource.StorageSubsystemCommandHandlerBase.execute(StorageSubsystemCommandHandlerBase.java:174) at com.cloud.storage.resource.StorageSubsystemCommandHandlerBase.handleStorageCommands(StorageSubsystemCommandHandlerBase.java:71) at com.cloud.hypervisor.vmware.resource.VmwareResource.executeRequest(VmwareResource.java:589) at 
com.cloud.agent.manager.DirectAgentAttache$Task.runInContext(DirectAgentAttache.java:315) at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) at java.base/java.lang.Thread.run(Thread.java:829) 2024-01-24 10:10:03,402 INFO [c.c.h.v.u.VmwareHelper] (DirectAgent-74:ctx-95b24ac7 10.0.35.53, job-25995/job-25996, cmd: DettachCommand) (logid:7260ffb8) [ignored]failed to get message for exception: Unable to access file [de52fdd3386b3d67b27b3960ecdb08f4] i-2-723-VM/7c2197c129464035bab062edec536a09-flat.vmdk ``` * vmware: create restored volume with new UUID and attach to VM	2024-01-29 11:40:43 +01:00
Abhishek Kumar	2746225b99	Updating pom.xml version numbers for release 4.19.0.0 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-01-29 10:21:52 +05:30
Vishesh	fedcf66de0	Externalise a few timeouts & fix timeout for hostSupportsUefi in libvirt ready command wrapper (#8547 ) This PR fixes a bug introduced in #8502. The timeout for script execution was set to 60 ms instead of 60 s, which resulted in the host not getting UEFI enabled. This is a blocker for the 4.19 release. We fix this by introducing a new agent parameter `agent.script.timeout` (default: 60 seconds) to use as the timeout for the script checking the host's UEFI status. We also externalize the timeout for the ReadyCommand by introducing a new global setting `ready.command.wait` (default: 60 seconds). For ModifyStoragePoolCommand, we don't externalize the timeout, to avoid confusing the user, since the required timeout can vary depending on the provider in use and we are only setting the wait for the default host listener for now. Instead, we reuse the global `wait` setting divided by `5`, which gives a default of 6 minutes (1800/5 = 360s) for ModifyStoragePoolCommand. Note: the actual time the MS waits is twice the wait set for a Command. Check the reference code: `19250403e6/engine/orchestration/src/main/java/com/cloud/agent/manager/AgentAttache.java (L406-L442)`	2024-01-27 23:36:13 +05:30
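The timeout arithmetic described in the commit above can be sketched as follows. This is a minimal illustration and not CloudStack code: the function names are hypothetical, the 1800 s default comes from the "1800/5 = 360s" example in the message, and the factor of two follows the message's note that the MS waits twice the wait set for a Command.

```python
# Hypothetical helper names; only the numbers come from the commit message.

def modify_storage_pool_wait(global_wait_s: int) -> int:
    """Wait in seconds for ModifyStoragePoolCommand: the global `wait` divided by 5."""
    return global_wait_s // 5


def effective_ms_wait(command_wait_s: int) -> int:
    """Per the commit's note, the MS waits twice the wait set on a command."""
    return 2 * command_wait_s


def to_millis(seconds: int) -> int:
    """Convert seconds to milliseconds. Passing a bare 60 where milliseconds
    are expected is the kind of unit mix-up the commit describes."""
    return seconds * 1000


wait = modify_storage_pool_wait(1800)
print(wait, effective_ms_wait(wait), to_millis(60))
```

Keeping the unit in the name (`_s`, `to_millis`) is one way to make a seconds-versus-milliseconds mistake easier to spot in review.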
Wei Zhou	33bb92acce	Veeam: Support Veeam 11 and 12 (#8241 ) This PR fixes several issues found in the testing of Veeam 11 and Veeam 12 - Import Veeam.Backup.PowerShell and silently ignore the warning messages - Fix issue when assigning a VM to backup offerings, which was caused by the separator (\r\n) - Fix authorization failure in Veeam 12a, which occurs because v1_4 is no longer supported in Veeam 12a - Fix exception if the backup name has a space - Fix backup metrics in Veeam 12, which failed because the PowerShell command does not return the values needed - Fix incorrect datetime value, which occurs because the PowerShell command returns a datetime that is not supported in Java - Fix issue during backup restoration if the VM has both ROOT and DATA disks. This PR also has the following updates - Add integration test test/integration/smoke/test_backup_recovery_veeam.py - Make some UI changes - Add zone setting backup.plugin.veeam.version. If it is not set, CloudStack will get the Veeam version via PowerShell commands. - Add zone settings backup.plugin.veeam.task.poll.interval and backup.plugin.veeam.task.poll.max.retry	2024-01-19 18:42:01 +01:00
Nicolas Vazquez	8d42ca8ccf	Use project version on pom dependencies (#8529 ) This PR fixes the POM dependencies from a hardcoded value to the project.version property on dependencies	2024-01-18 20:16:06 +05:30
Vishesh	c3b77cb7b8	Fix host stuck in connecting state (#8502 ) There are a lot of test failures due to test_vm_life_cycle.py in multiple PRs due to host not available for migration of VMs. #8438 (comment) #8433 (comment) #7344 (comment) While debugging I noticed that the hosts get stuck in Connecting state because MS is waiting for a response of the ReadyCommand from the agent. Since we take a lock on connection and disconnection, restarting the agent doesn't work. To fix this, we have to restart the MS or wait for ~1 hour (default timeout). On the agent side, it gets stuck waiting for a response from the Script execution. To reproduce, run smoke/test_vm_life_cycle.py (TestSecuredVmMigration test class to be specific). Once the tests are complete, you will notice that some hosts are stuck in Connecting state. And restarting the agent fails due to the named lock. Locks on DB can be checked using the below query. SELECT * FROM performance_schema.metadata_locks INNER JOIN performance_schema.threads ON THREAD_ID = OWNER_THREAD_ID WHERE PROCESSLIST_ID <> CONNECTION_ID() \G; This PR adds a wait for the ready command and a timeout to the Script execution to ensure that the thread doesn't get stuck and the named lock from database is released.	2024-01-15 13:56:34 +05:30
Nicolas Vazquez	a3a4833c3e	Fixes for KVM unmanaged instances import on advanced network and VNC password (#8492 ) This PR fixes a regression caused by #8465 on advanced zones, import fails with: 2024-01-10 12:13:33,234 DEBUG [o.a.c.e.o.NetworkOrchestrator] (API-Job-Executor-3:ctx-991bbe9f job-128 ctx-f49517d4) (logid:d7b8e716) Allocating nic for vm 142272e8-9e2e-407b-9d7e-e9a03b81653c in network Network {"id": 204, "name": "Isolated", "uuid": "9679fac5-e3ac-4694-a57b-beb635340f39", "networkofferingid": 10} during import 2024-01-10 12:13:33,239 ERROR [o.a.c.v.UnmanagedVMsManagerImpl] (API-Job-Executor-3:ctx-991bbe9f job-128 ctx-f49517d4) (logid:d7b8e716) Failed to import NICs while importing vm: i-2-31-VM com.cloud.exception.InsufficientVirtualNetworkCapacityException: Unable to acquire Guest IP address for network Network {"id": 204, "name": "Isolated", "uuid": "9679fac5-e3ac-4694-a57b-beb635340f39", "networkofferingid": 10}Scope=interface com.cloud.dc.DataCenter; id=1 at org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.importNic(NetworkOrchestrator.java:4582) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importNic(UnmanagedVMsManagerImpl.java:859) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importVirtualMachineInternal(UnmanagedVMsManagerImpl.java:1198) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importUnmanagedInstanceFromHypervisor(UnmanagedVMsManagerImpl.java:1511) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.baseImportInstance(UnmanagedVMsManagerImpl.java:1342) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importUnmanagedInstance(UnmanagedVMsManagerImpl.java:1282) at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method) Also, addresses the VNC password field set instead of a fixed string	2024-01-12 14:14:01 +05:30
Nicolas Vazquez	59e78cbc45	Fix KVM unmanage disks path (#8483 ) This PR fixes the volumes path on KVM import unmanaged instances Fixes: #8479	2024-01-11 14:45:57 +05:30
Vishesh	4f40eae1c4	DRS: Use free metrics instead of used for computation (#8458 ) This PR makes changes to use the cluster's free metrics instead of used metrics while computing imbalance for the cluster. This allows DRS to run for clusters where hosts don't have the same amount of resources.	2024-01-10 17:52:46 +05:30
slavkap	c569fe9119	Fix KVM import and list unmanaged VMs (#8445 ) VM import fixes 1 - Fix of VM insert for VMs with StorPool volumes 2 - Fix of list/insert unmanaged VMs with RBD volumes	2024-01-10 13:12:07 +05:30
Abhishek Kumar	d6ac91f2df	minio: fix store user creation (#8425 ) To prevent errors during multi-user access, use account UUID to create/access user on the provider side. Also, update the existing secret key for a user that already exists.	2024-01-09 17:44:11 +05:30
Abhishek Kumar	2253a33c1e	Merge remote-tracking branch 'apache/4.18'	2023-12-20 08:58:30 +05:30
Wei Zhou	ab70108f15	CKS: create Security Groups for CKS clusters of each account (#8316 ) This PR fixes #7684 The security groups contain the same rules for port 22 and 6443, no need to recreate for each CKS cluster.	2023-12-20 08:57:27 +05:30
John Bampton	dda672503f	Remove unneeded duplicate words (#8358 ) This PR removes some unneeded duplicate words.	2023-12-15 17:13:32 +05:30
kishankavala	ab20b1220f	KVM Ingestion - Import Instance (#7976 ) This PR adds new functionality to import KVM instances from an external host or from disk images in local or shared storage. Doc PR: https://github.com/apache/cloudstack-documentation/pull/356	2023-12-14 13:08:56 +05:30
Abhishek Kumar	82f7abddb3	Merge remote-tracking branch 'apache/4.18'	2023-12-13 11:24:15 +05:30
Bryan Lima	3bb318bab9	kvm: Add support for cgroupv2 (#8252 ) 1. Problem description In Apache CloudStack (ACS), when a VM is deployed in a host with the KVM hypervisor, an XML file is created in the assigned host, which has a property shares that defines the weight of the VM to access the host CPU. The value of this property has no unit, and it is a relative measure to calculate how much CPU a given VM will have in the host. However, this value has a limit, which depends on the version of cgroup utilized by the host's kernel. The problem lies in the range of shares, which differs between the two versions: [2, 262144] for cgroups version 1 and [1, 10000] for cgroups version 2. Currently, ACS calculates the value of shares using Equation 1, presented below, where CPU is the number of cores and speed is the CPU frequency in MHz, both specified in the VM's compute offering. Therefore, if a compute offering has, for example, 6 cores at 2 GHz, the shares value will be 12000 and an exception will be thrown by libvirt if the host utilizes cgroup v2. The second version is becoming the default one in current Linux distributions; thus, it is necessary to address this limitation. Equation 1 shares = CPU * speed Fixes: #6744 2. Proposed changes To address the problem described, we propose to apply a scale conversion considering the max shares of the host. Using the same formula currently utilized by ACS, it is possible to calculate the maximum shares of a VM for a given host. In other words, the number of cores and the nominal speed of the host's CPU give the upper limit of shares allowed to a VM. Then, this value will be scaled to the allowed interval of [1, 10000] of cgroup v2 by using a linear scale conversion. 
The VM shares would be calculated as Equation 2, presented below, where VM requested shares is the requested shares value calculated using Equation 1, cgroup upper limit is fixed with a value of 10000 (cgroups v2 upper limit), and host max shares is the maximum shares value of the host, calculated using Equation 1. Using Equation 2, the only case where a VM passes the cgroup v2 limit is when the user requests more resources than the host has, which is not possible with the current implementation of ACS. Equation 2 shares = (VM requested shares * cgroup upper limit)/host max shares To implement the proposal, the following APIs will be updated: deployVirtualMachine, migrateVirtualMachine and scaleVirtualMachine. When a VM is being deployed, a new verification will be added to find a suitable host. The max shares of each host will be calculated, and the VM calculated shares will be verified if it does not surpass the host's value. Likewise, the migration of VMs will have a similar new verification. Lastly, the scale of VMs will also have the same verification for the VM's host. To determine the max shares of a given host, we will use the same equation currently used in ACS for calculating the shares of VMs, presented in Section 1. When Equation 1 is used to determine the maximum shares of a host, CPU is the number of cores of the host, and speed is the nominal CPU speed, i.e., considering the CPU's base frequency. It is important to note that these changes are only for hosts with the KVM hypervisor using cgroup v2 for now.	2023-12-13 10:51:24 +05:30
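Equations 1 and 2 from the commit above can be sketched as follows. This is an illustration under stated assumptions and not the ACS implementation: the function names are hypothetical, speed is taken in MHz (consistent with the 6 cores at 2 GHz = 12000 example), and the integer division and floor of 1 are choices of this sketch.

```python
CGROUP_V2_MAX_SHARES = 10000  # cgroup v2 upper limit quoted in the commit message


def requested_shares(cores: int, speed_mhz: int) -> int:
    """Equation 1: shares = CPU * speed."""
    return cores * speed_mhz


def scaled_shares(vm_cores: int, vm_speed_mhz: int,
                  host_cores: int, host_speed_mhz: int) -> int:
    """Equation 2: (VM requested shares * cgroup upper limit) / host max shares."""
    vm = requested_shares(vm_cores, vm_speed_mhz)
    host_max = requested_shares(host_cores, host_speed_mhz)
    if vm > host_max:
        # The commit notes this case is not possible in ACS; the sketch rejects it.
        raise ValueError("VM requests more CPU than the host provides")
    # Integer division and the lower bound of 1 are assumptions of this sketch.
    return max(1, vm * CGROUP_V2_MAX_SHARES // host_max)


# The commit's example offering: 6 cores at 2 GHz gives 12000 unscaled shares.
print(requested_shares(6, 2000))
# On a hypothetical host with 16 cores at 3 GHz (48000 max shares):
print(scaled_shares(6, 2000, 16, 3000))
```

Because the host's own maximum maps to 10000, any VM that fits on the host lands inside the cgroup v2 interval.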
Nicolas Vazquez	27a3d61729	Fix unmanage VM marvin tests and small UI fixes for import (#8338 ) This PR fixes the failing smoke test for test_vm_lifecycle_unmanage_import.py for Vmware and adds a small UI fix on the import wizard	2023-12-13 10:25:05 +05:30
Abhishek Kumar	080a5aee00	Merge remote-tracking branch 'apache/4.18'	2023-12-12 17:01:52 +05:30
Harikrishna	3ce7c39bef	cks: handle errors while scaling cluster (#8107 ) This PR fixes the issue #7920	2023-12-12 16:57:28 +05:30
Abhishek Kumar	4bdf35b7b0	Merge remote-tracking branch 'apache/4.18'	2023-12-09 12:04:21 +05:30
Wei Zhou	fc44df7c95	CKS: create HA cluster with 3 control VMs instead of 2 (#8297 ) This PR fixes the test failures with CKS HA-cluster upgrade. In production, the CKS HA cluster should have at least 3 control VMs as well. The etcd cluster requires 3 members to achieve reliable HA. The etcd daemon in the control VMs uses the Raft protocol to determine the roles of nodes. During an upgrade of CKS with HA, etcd becomes unreliable if there are only 2 control VMs.	2023-12-09 11:33:05 +05:30
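The reason 2 control VMs are not enough follows from majority-quorum arithmetic, which Raft-based systems such as etcd rely on. The sketch below is a generic illustration, not CKS code, and the function names are hypothetical.

```python
def quorum(members: int) -> int:
    """Smallest majority of a cluster with the given number of members."""
    return members // 2 + 1


def tolerated_failures(members: int) -> int:
    """How many members can be lost while a majority is still reachable."""
    return members - quorum(members)


for n in (2, 3):
    print(n, quorum(n), tolerated_failures(n))
```

With 2 members the quorum is 2, so losing either one (for example while a control VM is being upgraded) leaves the cluster without a majority; with 3 members one can be lost.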
Rene Glover	1031c31e6a	FiberChannel Multipath for KVM + Pure Flash Array and HPE-Primera Support (#7889 ) This PR provides a new primary storage volume type called "FiberChannel" that allows access to volumes connected to hosts over fiber channel connections. It requires Multipath to provide path discovery and failover. Second, the PR adds an AdaptivePrimaryDatastoreProvider that abstracts how volumes are managed/orchestrated from the connector to communicate with the primary storage provider, using a ProviderAdapter interface, allowing the code interacting with the primary storage provider API's to be simpler and have no direct dependencies on Cloudstack code. Lastly, the PR provides an implementation of the ProviderAdapter classes for the HP Enterprise Primera line of storage solutions and the Pure Flash Array line of storage solutions.	2023-12-09 11:31:33 +05:30
Sina Kashipazha	2993c99363	Add missing hosts info to the prometheus exporter output. (#8328 ) Sometimes the hostStats object of the agents becomes null in the management server. It is a rare situation, and we haven't found the root cause yet, but it occurs occasionally in our CloudStack deployments with many hosts. The hostStat is null, even though the agent is UP and hosting multiple VMs. It is possible to access the VM consoles and execute tasks on them. This pull request doesn't address the issue directly; rather it displays those hosts in Prometheus so we can restart the agent and get the necessary information.	2023-12-08 19:51:06 +05:30
Abhishek Kumar	c599011ef5	Merge remote-tracking branch 'apache/4.18'	2023-12-08 18:06:15 +05:30
Peinthor Rene	bba554bcc4	linstor: Fix possible NPE if Linstor storage-pool data missing (#8319 ) If Linstor doesn't return storage pool info, certain values are null. Now we assume the values are 0 if we get null values.	2023-12-08 17:02:18 +05:30
Vishesh	4e9c4a5895	Fix intermittent build failures (#8312 )	2023-12-07 14:03:26 +01:00
Wei Zhou	7ea068c4dc	kvm: fix error 'Failed to find passphrase for keystore: cloud.jks' when enable SSL for kvm agent (#7923 )	2023-12-07 09:10:11 +01:00
Nicolas Vazquez	371ad9f55b	New Feature: Import VMware VMs into KVM (#7881 ) This PR adds the capability in CloudStack to convert VMware Instances disk(s) to KVM using virt-v2v and import them as CloudStack instances. It enables CloudStack operators to import VMware instances from vSphere into a KVM cluster managed by CloudStack. vSphere/VMware setup might be managed by CloudStack or be a standalone setup. CloudStack will let the administrator select a VM from an existing VMware vCenter in the CloudStack environment or external vCenter requesting vCenter IP, Datacenter name and credentials. The migrated VM will be imported as a KVM instance The migration is done through virt-v2v: https://access.redhat.com/articles/1351473, https://www.ovirt.org/develop/release-management/features/virt/virt-v2v-integration.html The migration process timeout can be set by the setting convert.instance.process.timeout Before attempting the virt-v2v migration, CloudStack will create a clone of the source VM on VMware. The clone VM will be removed after the registration process finishes. CloudStack will delegate the migration action to a KVM host and the host will attempt to migrate the VM invoking virt-v2v. In case the guest OS is not supported then CloudStack will handle the error operation as a failure The migration process using virt-v2v may not be a fast process CloudStack will not perform any check about the guest OS compatibility for the virt-v2v library as indicated on: https://access.redhat.com/articles/1351473.	2023-12-07 12:59:56 +05:30
sato03	fdfbb4fad1	Prioritize hypervisor.uri configuration (#8254 ) Co-authored-by: Henrique Sato <henrique.sato@scclouds.com.br>	2023-12-06 16:43:04 -03:00
Daan Hoogland	14376ce298	Merge release branch 4.18 to main * 4.18: kvm: fix ide controller for rocky/alma vms (#8247)	2023-12-06 16:06:09 +01:00
Wei Zhou	db6dd52f44	kvm: fix ide controller for rocky/alma vms (#8247 )	2023-12-06 15:05:49 +01:00
Peinthor Rene	a15b706fbe	Linstor: Allow snapshot backup also to work on non hyperconverged setups (#8271 ) When there is no access to the storage nodes, we now create a temporary resource from the snapshot and copy that data into the secondary storage. Revert works the same, except that we now additionally look for any Linstor agent node. Backup snapshot is now also enabled by default. This whole BackupSnapshot functionality was introduced in 4.19, so I would be happy if this still could be merged.	2023-12-05 12:59:52 +05:30
kishankavala	5651eab49c	ObjectStore Framework with MinIO and Simulator plugins (#7752 ) This PR adds Object Storage feature to CloudStack. FS: https://cwiki.apache.org/confluence/display/CLOUDSTACK/%5BDRAFT%5D+CloudStack+Object+Store	2023-12-01 17:51:00 +05:30
João Jandre	26b01f6f3b	Flexible tags for hosts and storage pools (#7489 ) Co-authored-by: João Jandre <joao@scclouds.com.br>	2023-11-30 09:36:47 +01:00
Daan Hoogland	98d643efe6	Merge release branch 4.18 to main * 4.18: Fixed spelling and added missing states to response (#8248) Let Prometheus exporter plugin support utf8 characters (#8228)	2023-11-18 18:41:31 +01:00
DK101010	6001772335	multi local storage handling for kvm (#6699 ) Co-authored-by: DK101010 <dirk.klahre@itelligence.de> Co-authored-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2023-11-16 16:43:42 +01:00
Stephan Krug	267a457efc	Externalize KVM HA heartbeat frequency (#6892 ) Co-authored-by: Stephan Krug <stephan.krug@scclouds.com.br> Co-authored-by: GaOrtiga <49285692+GaOrtiga@users.noreply.github.com> Co-authored-by: dahn <daan.hoogland@gmail.com>	2023-11-16 09:17:17 +01:00
GaOrtiga	be4a648f5a	Create global configuration to allow changing the default nic adapter for user VMs in VMware (#7954 ) Co-authored-by: Gabriel <gabriel.fernandes@scclouds.com.br>	2023-11-15 11:18:26 +01:00
dahn	1a2dbebe48	Let Prometheus exporter plugin support utf8 characters (#8228 )	2023-11-15 09:48:11 +01:00
rRajivramachandran	96b07d797b	Fix flaky tungsten test using comparator (#8232 )	2023-11-14 10:17:32 +01:00
Daan Hoogland	05b9b6e2e7	Merge branch '4.18' into main	2023-11-13 11:36:51 +01:00
Abhishek Kumar	d0f3233fda	edge-zone,kvm,iso,cks: allow k8s deployment with direct-download iso (#8142 ) Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2023-11-10 13:56:05 +01:00
Peinthor Rene	68e504aff9	Linstor backup snapshots (#8067 ) This PR adds a config option for the Linstor primary storage driver that allows you to automatically back up volume snapshots to the secondary storage. Additionally, it will not mangle the needed java-linstor dependency into the client.jar, but instead just copies the java-linstor.jar into lib. The config option is called lin.backup.snapshots and defaults to false. The scope of this change should be limited, as it only touches the Linstor driver, and a part of copyAsync was implemented with 2 new Linstor-specific commands.	2023-11-09 09:38:10 +05:30
Wei Zhou	861107fa5b	CKS: make clustertype optional to keep backwards compatibility (#8180 ) This PR fixes the issue that 4.18 cmk/api to create CKS cluster does not work in 4.19	2023-11-08 00:31:38 +05:30
rRajivramachandran	e9b24b6c32	Make authentication request parameter order to be deterministic (#8185 )	2023-11-06 09:53:49 +01:00
slavkap	2bb182c3e1	KVM Host HA enhancement for StorPool storage (#8045 ) Extends the current functionality of KVM Host HA to the StorPool storage plugin and makes it easy for the rest of the storage plugins to integrate Host HA support. This extension works like the current NFS storage implementation. It can be used simultaneously with NFS and StorPool storage, or only with StorPool primary storage. If it is used with different primary storages like NFS and StorPool, and one of the storage health checks fails, there is an option to report the failure to the management server with the global config kvm.ha.fence.on.storage.heartbeat.failure. By default this option is disabled; when enabled, the Host HA service will continue with the checks on the host and eventually will fence the host.	2023-11-04 12:35:37 +05:30
Codegass	b2938c0528	Refactor testCRUDAcl into Separate Test Cases (#7705 ) - Extracted shared ACL setup logic into a private helper method, setupAcl(). - Split original testCRUDAcl into two separate tests: testCRUDAclReadAll and testCRUDAclReadOne. - Each test case now represents a unique scenario for better readability and maintainability. - Replaced assertTrue(false) with fail() in catch blocks for better test failure indication. These changes aim to enhance the clarity and maintainability of the test suite, and ensure each test case checks only one scenario.	2023-11-03 18:08:15 +05:30
gzhao9	9e8f591ace	Refactoring org.apache.cloudstack.network.tungsten.service (#8098 ) * Refactoring reduces mock cloning of TungstenAnswer * Apply suggestions from code review Great suggestions, thanks a lot! Co-authored-by: dahn <daan.hoogland@gmail.com> * Rename CreateMockTungstenAnswer to MockTungstenAnswerFactory * Updated parameter to camel case. * Revised in accordance with the latest update * Replace all `\r` with `\n`. * Replace all \r with \n. * temp for re-uploading * reupdate * update line ending * update ling ending * Add static methods to avoid duplicate creation of new --------- Co-authored-by: dahn <daan.hoogland@gmail.com>	2023-11-03 17:19:59 +05:30
gzhao9	2f97e3bd83	refactor MockNetworkVO (#8137 ) * refactor MockNetworkVO * Apply suggestions from code review Co-authored-by: dahn <daan.hoogland@gmail.com> * adding static adding a static method to the MockNetworkVO class that generates a MockNetworkVO rather than using new everytime. --------- Co-authored-by: dahn <daan.hoogland@gmail.com>	2023-11-03 17:19:32 +05:30

4625 Commits