cloudstack

Commit Graph

Author	SHA1	Message	Date
Abhishek Kumar	83bccead3d	schema, refactor: rename cloud.user_vm_details to cloud.vm_instance_details (#10736 ) Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com> Co-authored-by: dahn <daan@onecht.net>	2025-07-24 12:08:29 +02:00
Abhishek Kumar	0b5a5e8043	api,agent,server,engine-schema: scalability improvements (#9840 ) * api,agent,server,engine-schema: scalability improvements Following changes and improvements have been added: - Improvements in handling of PingRoutingCommand 1. Added global config - `vm.sync.power.state.transitioning`, default value: true, to control syncing of power states for transitioning VMs. This can be set to false to prevent computation of transitioning state VMs. 2. Improved VirtualMachinePowerStateSync to allow power state sync for host VMs in a batch 3. Optimized scanning stalled VMs - Added option to set worker threads for capacity calculation using config - `capacity.calculate.workers` - Added caching framework based on Caffeine in-memory caching library, https://github.com/ben-manes/caffeine - Added caching for account/use role API access with expiration after write can be configured using config - `dynamic.apichecker.cache.period`. If set to zero then there will be no caching. Default is 0. - Added caching for account/use role API access with expiration after write set to 60 seconds. - Added caching for some recurring DB retrievals 1. CapacityManager - listing service offerings - beneficial in host capacity calculation 2. LibvirtServerDiscoverer existing host for the cluster - beneficial for host joins 3. DownloadListener - hypervisors for zone - beneficial for host joins 5. VirtualMachineManagerImpl - VMs in progress- beneficial for processing stalled VMs during PingRoutingCommands - Optimized MS list retrieval for agent connect - Optimize finding ready systemvm template for zone - Database retrieval optimisations - fix and refactor for cases where only IDs or counts are used mainly for hosts and other infra entities. Also similar cases for VMs and other entities related to host concerning background tasks - Changes in agent-agentmanager connection with NIO client-server classes 1. Optimized the use of the executor service 2. 
Refactored Agent class to better handle connections. 3. Do SSL handshakes within worker threads 5. Added global configs to control the behaviour depending on the infra. SSL handshake could be a bottleneck during agent connections. Configs - `agent.ssl.handshake.min.workers` and `agent.ssl.handshake.max.workers` can be used to control the number of new connections the management server handles at a time. `agent.ssl.handshake.timeout` can be used to set the number of seconds after which the SSL handshake times out at the MS end. 6. On the agent side, backoff and SSL handshake timeout can be controlled by agent properties. `backoff.seconds` and `ssl.handshake.timeout` properties can be used. - Improvements in StatsCollection - minimize DB retrievals. - Improvements in DeploymentPlanner allow for the retrieval of only desired host fields and fewer retrievals. - Improvements in hosts connection for a storage pool. Added config - `storage.pool.host.connect.workers` to control the number of worker threads that can be used to connect hosts to a storage pool. Worker thread approach is followed currently only for NFS and ScaleIO pools. 
- Minor improvements in resource limit calculations wrt DB retrievals Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Rohit Yadav <rohit.yadav@shapeblue.com> * test1, domaindetails, capacitymanager fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * test2 - agent tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * capacitymanagertest fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * change Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix missing changes Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address comments Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * revert marvin/setup.py Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix indent Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * use space in sql Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address duplicate Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * update host logs Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * revert `e36c6a5d07` Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix npe in capacity calculation Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * move schema changes to 4.20.1 upgrade Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * build fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address comments Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix build Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add some more tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * checkstyle fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * remove unnecessary mocks Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * build fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * replace statics Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * engine/orchestration,utils: 
limit number of concurrent new agent connections Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * refactor - remove unused Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * unregister closed connections, monitor & cleanup Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add check for outdated vm filter in power sync Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * agent: synchronize sendRequest wait Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> --------- Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2025-02-01 12:28:41 +05:30
Vishesh	a4224e58cc	Improve logging to include more identifiable information (#9873 ) * Improve logging to include more identifiable information for kvm plugin * Update logging for scaleio plugin * Improve logging to include more identifiable information for default volume storage plugin * Improve logging to include more identifiable information for agent managers * Improve logging to include more identifiable information for Listeners * Replace ids with objects or uuids * Improve logging to include more identifiable information for engine * Improve logging to include more identifiable information for server * Fixups in engine * Improve logging to include more identifiable information for plugins * Improve logging to include more identifiable information for Cmd classes * Fix toString method for StorageFilterTO.java	2025-01-06 16:42:37 +05:30
Wei Zhou	8a1da3804c	Resize volume: add pool capacity disablethreshold for resize and allow volume auto migration (#9761 ) * server: add global settings for volume resize * resizeVolume: support automigrate * Address Suresh's comments * Update api/src/main/java/org/apache/cloudstack/api/command/user/volume/ResizeVolumeCmd.java Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com> * address Suresh's comments * UI: add autoMigrate to resizeVolume * resizevolume: add unit tests * resizevolume: add unit test for Allocated volume --------- Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com>	2024-12-02 10:28:14 +05:30
mprokopchuk	68f459b334	CapacityManagementImpl.updateCapacityForHost(..) use VM update time in capacity calculation. (#9662 ) VM update time is nullable in DB and can cause NullPointerException if record in vm_instance has defined last_host_id and undefined update_time.	2024-09-11 09:45:50 -03:00
Rohit Yadav	cea4801be1	Merge remote-tracking branch 'origin/4.19' Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-07-10 15:57:06 +05:30
Suresh Kumar Anaparti	37c91abd3d	NPE fix, for test_hostha_kvm_host_fencing (#9355 )	2024-07-09 12:20:10 +05:30
João Jandre	49cecaed06	Normalize loggers and upgrade log4j 1.2 to log4j 2.19 (#7131 ) * Normalize logs All classes that could inherit their loggers from their parents had their own loggers deleted; Most loggers didn't have to be static, so most of them were normalized so that they wouldn't be; All loggers are protected now; Static loggers are now named 'LOGGER'; Non-static loggers are now named 'logger'; New class DbUpgradeAbstractImpl created so that all Upgraders extend it and inherit its logger * Upgrade log4j * fix errors caused by the merge * Refactor cglibThrowableRenderer functionality to log4j2 and upgrade the last configuration files * fix sonarcloud bug * Fix errors caused by merge, remove some unused loggers, and rename a variable that was mistakenly renamed on the normalization commit * Readd snmpTrapAppender, remove TestAppender * Regenerate changes * regenerate changes * refactor last custom appender * fix systemvm configuration xml * Regenerate changes * Regenerate changes * regenerate changes * Regenerate changes * regenerate changes * regenerate changes * regenerate changes * Fix utils pom * fix some tests * regenerate changes * Fix jar being printed on exception * fix logging in system VMs, fix commands not having log4j2 classpath. 
* regenerate changes * Fix some unwanted renomeations * fix end of file * regenerate changes * regenerate changes * fix merge error * regenerate changes * fix tests * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * readd reload4j to tungsten as juniper depends on it * Regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * re-add reload4j dependency to network-contrail, as juniper depends on it * regenerate changes * regenerate changes * regenerate changes * fix typo * regenerate changes * regenerate changes * Fix end of files * regenerate changes * add logj42 to cloud-utils-SHADED.jar * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * regenerate changes * Regenerate changes * Regenerate changes * Regenerate changes * regenerate changes * Regenerate changes * regenerate changes * Regenerate changes * Regenerate changes * Regenerate changes * regenerate changes * Regenerate changes * Regenerate changes * fix some tests * Regenerate changes * Regenerate changes * fix test * Regenerate changes * Regenerate changes	2024-02-08 09:55:41 -03:00
SadiJr	1e253401b0	[Veeam] Block operations in restoring VMs (#7238 ) Co-authored-by: SadiJr <sadi@scclouds.com.br>	2023-04-04 08:49:21 +02:00
John Bampton	f9347ecf2c	Fix spelling (#6597 )	2022-08-03 15:43:47 +05:30
dahn	731a83babf	add global setting to allow parallel execution on vmware (#6413 ) * add global setting to allow parallel execution on vmware * cleanup setting distribution for vmware.create.full.clone * query setting in vmware guru * don't touch other hypervisor's commands * guru hierarchy cleanup	2022-07-15 10:01:35 +02:00
Abhishek Kumar	fb8d40de54	server: skip max guest limit check for KVM host (#5417 ) Addresses #3015 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2021-09-14 03:12:53 -03:00
Daniel Augusto Veronezi Salvador	cbe380a068	Externalize secondary storage capacity threshold (#4790 ) * Externalize secondary storage capacity threshold * Use default value as threshold when config value is lower than 0.0 * Move config to CapacityManager * Validate config in CapacityManagerImpl * Use config in StorageOrchestrator * Change config description * Remove unused import Co-authored-by: Daniel Augusto Veronezi Salvador <daniel@scclouds.com.br>	2021-07-16 08:38:36 +02:00
Gabriel Beims Bräscher	ca78f5b386	Enhance log messages with host name (#4575 ) * Enhance log messages with hostName * Use host.toString() on most of host logs. * Remove redundant "Host" in logs and enhance logs * duplicated "for" * Adopt String.format, and enhance code * Address reviews enhancing log messages Update server/src/main/java/com/cloud/resource/ResourceManagerImpl.java -- server/src/main/java/com/cloud/vm/UserVmManagerImpl.java -- server/src/main/java/com/cloud/resource/RollingMaintenanceManagerImpl.java Co-authored-by: Daniel Augusto Veronezi Salvador <38945620+GutoVeronezi@users.noreply.github.com> * Fix String.format issue and change log message from debug to warn * Fix checkstyle issue * Fix string.format log * Address review: enhance logs * Enhance log of hosts in maintenance avoid list * Remove "VM" on logs as vm.toString() already appends VM-<details> * Add more details of the VM when postStateTransitionEvent * Address reviewer and enhance VMInstanceVO.toString() Co-authored-by: Daniel Augusto Veronezi Salvador <38945620+GutoVeronezi@users.noreply.github.com>	2021-07-13 17:35:59 -03:00
Rohit Yadav	22f6c19248	Merge remote-tracking branch 'origin/4.15'	2021-04-09 13:21:07 +05:30
Rohit Yadav	ca8920dd36	Merge remote-tracking branch 'origin/4.14' into 4.15	2021-04-09 13:17:39 +05:30
Abhishek Kumar	cd60b8d97d	host-allocator: check capacity for suitable hosts (#4884 ) Fixes #4517 Adds capacity checks for RandomAllocator (host allocator) Factors out host cpu capability and capacity check wrt serviceoffering code into CapacityManager. Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2021-04-09 12:35:58 +05:30
Rohit Yadav	775de36688	Merge remote-tracking branch 'origin/4.15'	2021-03-17 17:46:16 +05:30
Rakesh	e2664197ec	server: Fix NPE while cloudstack agent failed to connect to mgt server (#4779 ) * Fix NPE while cloudstack agent failed to connect to mgt server If the `ramOvercommitRatio` field is missing in the user_vm_details table, the agent throws an NPE after restarting. This happens because user_vm_details contains 'cpuOvercommitRatio' for all vms, but the vms' 'ramOvercommitRatio' field is missing from the table. * code feedback	2021-03-17 17:42:02 +05:30
sureshanaparti	eba186aa40	storage: New Dell EMC PowerFlex Plugin (formerly ScaleIO, VxFlexOS) (#4304 ) Added support for PowerFlex/ScaleIO (v3.5 onwards) storage pool as a primary storage in CloudStack (for KVM hypervisor) and enabled VM/Volume operations on that pool (using pool tag). Please find more details in the FS here: https://cwiki.apache.org/confluence/x/cDl4CQ Documentation PR: apache/cloudstack-documentation#169 This enables support for PowerFlex/ScaleIO (v3.5 onwards) storage pool as a primary storage in CloudStack Other improvements addressed in addition to PowerFlex/ScaleIO support: - Added support for config drives in host cache for KVM => Changed configuration "vm.configdrive.primarypool.enabled" scope from Global to Zone level => Introduced new zone level configuration "vm.configdrive.force.host.cache.use" (default: false) to force host cache for config drives => Introduced new zone level configuration "vm.configdrive.use.host.cache.on.unsupported.pool" (default: true) to use host cache for config drives when storage pool doesn't support config drive => Added new parameter "host.cache.location" (default: /var/cache/cloud) in KVM agent.properties for specifying the host cache path and create config drives on the "/config" directory on the host cache path => Maintain the config drive location and use it when required on any config drive operation (migrate, delete) - Detect virtual size from the template URL while registering direct download qcow2 (of KVM hypervisor) templates - Updated full deployment destination for preparing the network(s) on VM start - Propagate the direct download certificates uploaded to the newly added KVM hosts - Discover the template size for direct download templates using any available host from the zones specified on template registration => When zones are not specified while registering template, template size discovery is performed using any available host, which is picked up randomly from one of the available zones - 
Release the VM resources when VM is sync-ed to Stopped state on PowerReportMissing (after graceful period) - Retry VM deployment/start when the host cannot grant access to volume/template - Mark never-used or downloaded templates as Destroyed on deletion, without sending any DeleteCommand => Do not trigger any DeleteCommand for never-used or downloaded templates as these doesn't exist and cannot be deleted from the datastore - Check the router filesystem is writable or not, before performing health checks => Introduce a new test "filesystem.writable.test" to check the filesystem is writable or not => The router health checks keeps the config info at "/var/cache/cloud" and updates the monitor results at "/root" for health checks, both are different partitions. So, test at both the locations. => Added new script: "filesystem_writable_check.py" at /opt/cloud/bin/ to check the filesystem is writable or not - Fixed NPE issue, template is null for DATA disks. Copy template to target storage for ROOT disk (with template id), skip DATA disk(s) * Addressed some issues for few operations on PowerFlex storage pool. - Updated migration volume operation to sync the status and wait for migration to complete. - Updated VM Snapshot naming, for uniqueness in ScaleIO volume name when more than one volume exists in the VM. - Added sync lock while spooling managed storage template before volume creation from the template (non-direct download). - Updated resize volume error message string. - Blocked the below operations on PowerFlex storage pool: -> Extract Volume -> Create Snapshot for VMSnapshot * Added the PowerFlex/ScaleIO client connection pool to manage the ScaleIO gateway clients, which uses a single gateway client per Powerflex/ScaleIO storage pool and renews it when the session token expires. - The token is valid for 8 hours from the time it was created, unless there has been no activity for 10 minutes. 
Reference: https://cpsdocs.dellemc.com/bundle/PF_REST_API_RG/page/GUID-92430F19-9F44-42B6-B898-87D5307AE59B.html Other fixes included: - Fail the VM deployment when the host specified in the deployVirtualMachine cmd is not in the right state (i.e. either Resource State is not Enabled or Status is not Up) - Use the physical file size of the template to check the free space availability on the host, while downloading the direct download templates. - Perform basic tests (for connectivity and file system) on router before updating the health check config data => Validate the basic tests (connectivity and file system check) on router => Cleanup the health check results when router is destroyed * Updated PowerFlex/ScaleIO storage plugin version to 4.16.0.0 * UI Changes to support storage plugin for PowerFlex/ScaleIO storage pool. - PowerFlex pool URL generated from the UI inputs(Gateway, Username, Password, Storage Pool) when adding "PowerFlex" Primary Storage - Updated protocol to "custom" for PowerFlex provider - Allow VM Snapshot for stopped VM on KVM hypervisor and PowerFlex/ScaleIO storage pool and Minor improvements in PowerFlex/ScaleIO storage plugin code * Added support for PowerFlex/ScaleIO volume migration across different PowerFlex storage instances. - findStoragePoolsForMigration API returns PowerFlex pool(s) of different instance as suitable pool(s), for volume(s) on PowerFlex storage pool. - Volume(s) with snapshots are not allowed to migrate to different PowerFlex instance. - Volume(s) of running VM are not allowed to migrate to other PowerFlex storage pools. - Volume migration from PowerFlex pool to Non-PowerFlex pool, and vice versa are not supported. 
* Fixed change service offering smoke tests in test_service_offerings.py, test_vm_snapshots.py * Added the PowerFlex/ScaleIO volume/snapshot name to the paths of respective CloudStack resources (Templates, Volumes, Snapshots and VM Snapshots) * Added new response parameter “supportsStorageSnapshot” (true/false) to volume response, and Updated UI to hide the async backup option while taking snapshot for volume(s) with storage snapshot support. * Fix to remove the duplicate zone wide pools listed while finding storage pools for migration * Updated PowerFlex/ScaleIO volume migration checks and rollback migration on failure * Fixed the PowerFlex/ScaleIO volume name inconsistency issue in the volume path after migration, due to rename failure	2021-02-24 14:58:33 +05:30
Rohit Yadav	6bde1384ff	Merge remote-tracking branch 'origin/4.14' into 4.15 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2021-02-05 16:01:01 +05:30
Wei Zhou	78f73c1bc6	server: Fix update capacity for hosts take long time if there are many service offerings (#4623 ) Steps to reproduce the issue: (1) Create 10000 service offerings (by db changes below or cloudmonkey). ``` DROP PROCEDURE IF EXISTS cloud.insert_service_offering; DELIMITER $$ CREATE PROCEDURE cloud.insert_service_offering() BEGIN DECLARE count INT DEFAULT 10000; SET @offeringid = (select max(id)+1 from disk_offering); WHILE count > 0 DO INSERT INTO disk_offering (id,name,uuid,display_text,disk_size,type,created) values (@offeringid,'test-offering-wei',uuid(), 'test-offering-wei',0,'Service',now()); INSERT INTO service_offering (id,cpu,speed,ram_size) values (@offeringid, 1, 500,256); SET @offeringid = @offeringid + 1; SET count = count - 1; END WHILE; END $$ DELIMITER ; CALL cloud.insert_service_offering(); mysql> CALL cloud.insert_service_offering(); Query OK, 0 rows affected (2 min 30.85 sec) ``` (2) Check the total time of the periodic capacity check in cloudstack. Without this patch, it takes 2.5 seconds (2 hosts) ``` 2021-01-15 16:10:12,793 DEBUG [c.c.a.AlertManagerImpl] (CapacityChecker:ctx-5d5f3b3b) (logid:f5eb68ba) Running Capacity Checker ... 2021-01-15 16:10:15,287 DEBUG [c.c.a.AlertManagerImpl] (CapacityChecker:ctx-5d5f3b3b) (logid:f5eb68ba) Done running Capacity Checker ... ``` With this patch, it takes 1.3 seconds (2 hosts) ``` 2021-01-15 16:12:43,604 DEBUG [c.c.a.AlertManagerImpl] (CapacityChecker:ctx-a2a7f3f1) (logid:f7e0a4c5) Running Capacity Checker ... 2021-01-15 16:12:44,927 DEBUG [c.c.a.AlertManagerImpl] (CapacityChecker:ctx-a2a7f3f1) (logid:f7e0a4c5) Done running Capacity Checker ... ``` If there are 100 hosts, the total time will be reduced from 100+ seconds to around 10 seconds.	2021-02-04 14:43:57 +05:30
Daan Hoogland	c1fb6b4cb9	Merge branch '4.14'	2020-10-28 10:28:10 +01:00
Rakesh	b9f15fd159	Remove cpu core from op_host_capacity when host is deleted (#4367 ) When a host is put into maintenance mode or is deleted from cloudstack, delete its entries from the op_host_capacity table	2020-10-28 09:41:14 +01:00
Spaceman1984	b586eb22f1	Human readable sizes in logs (#4207 ) This PR adds outputting human readable byte sizes in the management server logs, agent logs, and usage records. A non-dynamic global variable is added (display.human.readable.sizes) to control switching this feature on and off. This setting is sent to the agent on connection and is only read from the database when the management server is started up. The setting is kept in memory by the use of a static field on the NumbersUtil class and is available throughout the codebase. Instead of seeing things like: 2020-07-23 15:31:58,593 DEBUG [c.c.a.t.Request] (AgentManager-Handler-12:null) (logid:) Seq 8-1863645820801253428: Processing: { Ans: , MgmtId: 52238089807, via: 8, Ver: v1, Flags: 10, [{"com.cloud.agent.api.NetworkUsageAnswer":{"routerName":"r-224-VM","bytesSent":"106496","bytesReceived":"0","result":"true","details":"","wait":"0",}}] } The KB MB and GB values will be printed out: 2020-07-23 15:31:58,593 DEBUG [c.c.a.t.Request] (AgentManager-Handler-12:null) (logid:) Seq 8-1863645820801253428: Processing: { Ans: , MgmtId: 52238089807, via: 8, Ver: v1, Flags: 10, [{"com.cloud.agent.api.NetworkUsageAnswer":{"routerName":"r-224-VM","bytesSent":"(104.00 KB) 106496","bytesReceived":"(0 bytes) 0","result":"true","details":"","wait":"0",}}] } FS: https://cwiki.apache.org/confluence/display/CLOUDSTACK/Human+Readable+Byte+sizes	2020-08-13 15:55:16 +05:30
Wei Zhou	136505b22c	server: double check host capacity when start/migrate a vm (#3728 ) When starting or migrating a vm (away from a host in host maintenance), cloudstack checks the capacity of all hosts and chooses one. If there are hundreds of hosts on the platform, this takes some seconds. By the time cloudstack has chosen a host and starts/migrates the vm to it, the resource consumption of that host might have changed. This normally happens when we start/migrate multiple vms. It is better to double check the host capacity when starting a vm on a host. This PR includes the fix for cpucore capacity when starting/migrating a vm.	2020-01-28 10:55:11 +05:30
Wei Zhou	71e53ab01d	server: Capacity check should take vms in Migrating state into calculation (#3727 ) When we calculate the resource consumption of a host, we need to take the vms in the following states into calculation: Running, Starting, Stopping, Migrating (to the host), and Migrating from the host. This is because, when a vm is being stopped, the resource on the host is released only once the vm is stopped. When a vm is migrated, the resource on the destination host is increased before the migration starts, and the resource on the source host is decreased after the migration succeeds. In cloudstack, there is a task named CapacityChecker which runs every 5 minutes (capacity.check.period = 300000 ms by default). It recalculates the capacity of all hosts. However, it takes only vms in Running and Starting into consideration. We have faced some issues in host maintenance due to it. Steps to reproduce the issue (1) migrate N vms from host A to host B, cpu/ram resource increases before the migration. (2) capacity check recalculates the capacity of hosts. The used capacity of Host B will be reset to the original value (not including the vms in Migrating). (3) migrate some more vms from another host to host B, the migrations are allowed by cloudstack (because the used capacity is incorrect). If the actual used memory exceeds the physical memory on the host, there might be some critical issues (for example, libvirt dies)	2020-01-28 10:54:32 +05:30
Rohit Yadav	9f4f2c5348	api: instance and template details are free text (#3240 ) Problem: Users don't know what keys/values to enter for template and VM details. Root Cause: The feature does not exist that can list possible details and options. Solution: Based on the possible VM and template details handled by the codebase, those details were refactored and a list API is introduced that can return users those details along with possible values. When users add details now, they will be presented with a list of key details and their possible options if any. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2019-06-27 09:14:47 +05:30
Abhishek Kumar	2020bfb6a3	server: allows compute offering with or without constraints (#3245 ) Problem: Custom compute offering does not allow setting min and max values for CPU and VRAM for custom VMs. Root Cause: Custom compute offerings cannot be created with a given range of CPU number and memory; instead, only fixed values are allowed. Solution: createServiceOffering API has been modified to allow setting a defined range for CPU number and memory. Also, UI form for compute offering creation is provided with a new field named 'compute offering type' with values - Fixed, Custom Constrained, Custom Unconstrained. It will allow the creation of compute offerings either with a fixed CPU speed and memory for fixed compute offering, or with a range of CPU number and memory for custom constrained compute offering or without predefined CPU number, CPU speed and memory for custom unconstrained compute offering. To allow the user to set CPU number, CPU speed and memory during VM deployment, UI form for VM deployment has been modified to provide controls to change these values. These controls are depicted in screenshots below for custom constrained and custom unconstrained compute offering types. Sample API calls using cmk to create a constrained service offering and deploying a VM using it, create serviceoffering name=Constrained displaytext=Constrained customized=true mincpunumber=2 maxcpunumber=4 cpuspeed=400 minmemory=256 maxmemory=1024 deploy virtualmachine displayname=ConstrainedVM serviceofferingid=60f3e500-6559-40b2-9a61-2192891c2bd6 templateid=8e0f4a3e-601b-11e9-9df4-a0afbd4a2d60 zoneid=9612a0c6-ed28-4fae-9a48-6eb207af29e3 details[0].cpuNumber=3 details[0].memory=800 Signed-off-by: Abhishek Kumar <abhishek.kumar@shapeblue.com>	2019-05-23 11:47:53 +05:30
Wido den Hollander	44c080da11	server: print log on INFO if Host reached Max Guests Limit (#3013 ) This should not be in DEBUG as people would want to know that the host was skipped because it didn't have enough slots available to run the VM. Signed-off-by: Wido den Hollander <wido@widodh.nl>	2018-11-12 11:39:17 +05:30
Rohit Yadav	76a4e56ef3	Merge branch '4.11'	2018-05-23 20:42:10 +05:30
Rohit Yadav	528e6c6dff	Merge branch '4.11'	2018-04-20 00:54:41 +05:30
Marc-Aurèle Brothier	893a88d225	CLOUDSTACK-10105: Use maven standard project structure in all projects (#2283 ) Remove maven standard module (which only a few were using) and get rid of maven customization for the projects structure. - moved all directories to src/main/java, src/main/resources, src/main/scripts, src/test/java, src/test/resources - grep scan to search for src/com and src/org left over - grep for <project>/scripts to fix pom.xml configuration - remove custom <build> configuration in pom.xml Signed-off-by: Marc-Aurèle Brothier <m@brothier.org>	2018-01-20 03:19:27 +05:30

33 Commits