cloudstack

Commit Graph

Author	SHA1	Message	Date
James Peru	b8d069e127	feat(backup): cascade-delete + chain repair for NAS incrementals Adds the delete-with-chain-repair semantics agreed in the RFC review: scripts/vm/hypervisor/kvm/nasbackup.sh - New '-o rebase' operation: rebases an existing on-NAS qcow2 onto a new backing parent. Uses a SAFE rebase (no -u) so the target absorbs blocks of the about-to-be-deleted parent before the backing pointer is moved up to the grandparent. Writes the new backing reference relative to the target's directory so it survives mount-point changes. - New CLI flags --rebase-target, --rebase-new-backing (both passed mount-relative). RebaseBackupCommand + LibvirtRebaseBackupCommandWrapper - New agent command that wraps the script's rebase operation. The provider sends one of these per child that needs re-pointing. NASBackupProvider.deleteBackup - Now plans the chain repair before touching files via computeChainRepair(): * No chain metadata -> single-file delete (legacy behaviour) * Tail incremental -> single delete, no rebase * Middle incremental -> rebase immediate child onto our parent, then delete; shift chain_position of all later descendants by -1 * Full with descendants -> refuse unless forced=true; with forced=true delete full + every descendant newest-first - Updates parent_backup_id, chain_position metadata in backup_details after each rebase so the model in the DB matches the on-disk chain. This implements the cascade-delete behaviour requested in @abh1sar's review point #7. Refs: apache/cloudstack#12899	2026-04-27 19:24:02 +03:00
James Peru	43e2f7504a	feat(backup): on-demand bitmap recreation for incremental NAS backup CloudStack rebuilds the libvirt domain XML on every VM start, which means persistent QEMU dirty bitmaps don't survive a stop/start cycle. Rather than hooking into the VM start lifecycle (intrusive across the orchestration layer), this commit handles the missing bitmap lazily at the next backup attempt: nasbackup.sh - When -M incremental is requested, the script first checks `virsh checkpoint-list` for the parent bitmap. If absent, it recreates the checkpoint on the running domain so libvirt accepts the <incremental> reference. The next incremental will be larger than usual (it captures all writes since recreate, not since the previous incremental) but is correct; subsequent ones return to normal size. - On recreation, emits BITMAP_RECREATED=<name> on stdout for the orchestrator to record. BackupAnswer + bitmapRecreated field surfaced from the agent. LibvirtTakeBackupCommandWrapper - Strips BITMAP_RECREATED= line from stdout before size parsing. - Sets answer.setBitmapRecreated(...). NASBackupChainKeys + BITMAP_RECREATED key for backup_details. NASBackupProvider - When the agent reports a recreated bitmap, persists it under backup_details and logs an info-level message so operators can correlate larger-than-usual incrementals with VM restarts. This satisfies the bitmap-loss-on-VM-restart concern from the RFC review without touching VirtualMachineManager / StartCommand / agent lifecycle. Refs: apache/cloudstack#12899	2026-04-27 19:10:46 +03:00
James Peru	1f2aebca36	feat(backup): orchestrate full vs incremental in NAS provider Adds the Java side of the incremental NAS backup feature: TakeBackupCommand + mode, bitmapNew, bitmapParent, parentPath fields (null for legacy callers — script preserves its existing behaviour when these are omitted). BackupAnswer + bitmapCreated (echoed by the agent on success) + incrementalFallback (true when an incremental was requested but the agent had to fall back to full because the VM was stopped). LibvirtTakeBackupCommandWrapper - Forwards the new fields to nasbackup.sh. - Strips the new BITMAP_CREATED= / INCREMENTAL_FALLBACK= marker lines out of stdout before the existing numeric-suffix size parser runs, so the script can keep the same "size as last line(s)" contract. - Surfaces both markers on the BackupAnswer. NASBackupProvider - decideChain(vm) walks backup_details (chain_id, chain_position, bitmap_name) for the latest BackedUp backup of the VM and decides: * Stopped VM -> full (libvirt backup-begin needs running QEMU) * No prior chain -> full (chain_position=0) * chain_position+1 >= nas.backup.full.every -> new full * otherwise -> incremental, parent=last bitmap - Generates timestamp-based bitmap names ("backup-<epoch>") matching what the script then registers as the libvirt checkpoint name. - persistChainMetadata() writes parent_backup_id, bitmap_name, chain_id, chain_position, type into the existing backup_details key/value table (per the RFC review — no new columns on backups). - Honours the agent's INCREMENTAL_FALLBACK= signal: re-records the backup as a full and starts a fresh chain. - createBackupObject() now takes a type argument so the BackupVO reflects the actual decision instead of always being "FULL". Refs: apache/cloudstack#12899	2026-04-27 19:07:24 +03:00
James Peru	1981469099	feat(backup): add chain-metadata keys + nas.backup.full.every config NASBackupChainKeys defines the keys this provider stores under the existing backup_details kv table (parent_backup_id, bitmap_name, chain_id, chain_position, type). This keeps the backups table provider-agnostic per the RFC review. nas.backup.full.every is a zone-scoped ConfigKey that controls how often a full backup is taken; the remaining backups in the cycle are incremental. Counts backups (not days), so it works for hourly, daily, and ad-hoc schedules. Default 10. Set to 1 to disable incrementals (every backup is full). Refs: apache/cloudstack#12899	2026-04-27 18:49:38 +03:00
Suresh Kumar Anaparti	c3614098da	Merge branch '4.22'	2026-04-08 18:09:43 +05:30
Abhisar Sinha	03de62bf38	Support Linstor Primary Storage for NAS BnR (#12796 )	2026-04-08 15:14:20 +05:30
Suresh Kumar Anaparti	e2d18c0748	Merge branch '4.22'	2026-04-02 10:30:18 +05:30
James Peru Mmbono	6ca6aa1c3f	Fix NPE in NASBackupProvider when no running KVM host is available (#12805 ) * Fix NPE in NASBackupProvider when no running KVM host is available ResourceManager.findOneRandomRunningHostByHypervisor() can return null when no KVM host in the zone has status=Up (e.g. during management server startup, brief agent disconnections, or host state transitions). NASBackupProvider.syncBackupStorageStats() and deleteBackup() call host.getId() without a null check, causing a NullPointerException that crashes the entire BackupSyncTask background job every sync interval. This adds null checks in both methods: - syncBackupStorageStats: log a warning and return early - deleteBackup: throw CloudRuntimeException with a descriptive message	2026-03-27 21:32:13 +05:30
Daan Hoogland	a7dc5efad5	Merge release branch 4.22 to main * 4.22: Fix issue when restoring backup after migration of volume (#12549) Usage: Heartbeat should not schedule usage job when a job is already running (#12616) Allow limit queries without random ordering (#12598) engine/schema: fix cluster/zone settings with encrypted values (#12626) Fix injection of preset variables into the JS interpreter (#12515) Fix issue with multiple KVM Host entries in host table (#12589) Add a Prometheus metric to track host certificate expiry (#12613) ssvm: delete temp directory while deleting entity download url (#12562)	2026-02-17 11:55:04 +01:00
Daan Hoogland	11df71e55c	Merge branch '4.20' into '4.22'	2026-02-17 11:53:08 +01:00
Pearl Dsilva	ae5308bdd2	Fix issue when restoring backup after migration of volume (#12549 )	2026-02-13 15:14:58 +01:00
João Jandre	8171d9568c	Block use of internal and external snapshots on KVM (#11039 )	2025-11-24 11:39:19 +01:00
Harikrishna Patnala	dbda673e1f	Updating pom.xml version numbers for release 4.23.0.0-SNAPSHOT Signed-off-by: Harikrishna Patnala <harikrishna.patnala@gmail.com>	2025-11-05 16:54:39 +05:30
Harikrishna Patnala	d160731b9f	Updating pom.xml version numbers for release 4.22.1.0-SNAPSHOT Signed-off-by: Harikrishna Patnala <harikrishna.patnala@gmail.com>	2025-11-05 16:07:07 +05:30
Harikrishna Patnala	71f47d6130	Updating pom.xml version numbers for release 4.22.0.0 Signed-off-by: Harikrishna Patnala <harikrishna.patnala@gmail.com>	2025-10-30 19:23:56 +05:30
Abhisar Sinha	3d6cafe193	Handle null mountTimeout in RestoreBackupCommand (#11944 )	2025-10-30 19:21:39 +05:30
Wei Zhou	e333ce9782	Updating pom.xml version numbers for release 4.20.3.0-SNAPSHOT	2025-10-24 09:13:19 +02:00
Wei Zhou	4dc3931233	Updating pom.xml version numbers for release 4.20.2.0 Signed-off-by: Wei Zhou <weizhou@apache.org>	2025-10-16 11:42:56 +02:00
Suresh Kumar Anaparti	b7a11cb203	NAS backup provider: Support restore from backup to volumes on Ceph storage pool(s), and take backup for stopped instances with volumes on Ceph storage pool(s) (#11684 ) Co-authored-by: Abhisar Sinha <63767682+abh1sar@users.noreply.github.com>	2025-10-06 09:13:28 +02:00
Abhisar Sinha	23c9e83047	Create Instance from backup on another Zone (DRaaS use case) (#11560 ) * draas initial changes * Added option to enable disaster recovery on a backup respository. Added UpdateBackupRepositoryCmd api. * Added timeout for mount operation in backup restore configurable via global setting * Addressed review comments * fix for simulator test failures * Added UT for coverage * Fix create instance from backup ui for other providers * Added events to add/update backup repository * Fix race in fetchZones * One more fix in fetchZones in DeployVMFromBackup.vue * Fix zone selection in createNetwork via Create Instance from backup form. * Allow template/iso selection in create instance from backup ui * rename draasenabled to crosszoneinstancecreation * Added Cross-zone instance creation in test_backup_recovery_nas.py * Added UT in BackupManagerTest and UserVmManagerImplTest * Integration test added for Cross-zone instance creation in test_backup_recovery_nas.py	2025-09-25 13:28:29 +05:30
Suresh Kumar Anaparti	1033be4b31	Updating pom.xml version numbers for release 4.22.0.0-SNAPSHOT Signed-off-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2025-08-28 12:00:42 +05:30
Suresh Kumar Anaparti	f9513b47bf	Updating pom.xml version numbers for release 4.21.0.0 Signed-off-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2025-08-22 11:42:37 +05:30
Abhisar Sinha	a87c5c2b3a	Create new Instance from VM backup (#10140 ) This feature adds the ability to create a new instance from a VM backup for dummy, NAS and Veeam backup providers. It works even if the original instance used to create the backup was expunged or unmanaged. There are two parts to this functionality: Saving all configuration details that the VM had at the time of taking the backup. And using them to create an instance from backup. Enabling a user to expunge/unmanage an instance that has backups.	2025-07-31 15:47:22 +05:30
Daan Hoogland	0b3959221b	Merge branch '4.20'	2025-07-29 16:50:55 +02:00
Abhisar Sinha	1b74c2dd3f	Fix restore from NAS backup when datadisk is older than the root disk. (#11258 )	2025-07-23 12:45:47 +02:00
Abhisar Sinha	0ebf72df0f	Handle nas backup and restore on Shared mount point. (#11204 )	2025-07-23 12:45:33 +02:00
João Jandre	53eb2c5b9b	File-based disk-only VM snapshot with KVM as hypervisor (#10632 ) Co-authored-by: João Jandre <joao@scclouds.com.br> Co-authored-by: Fabricio Duarte <fabricio.duarte.jr@gmail.com>	2025-07-16 08:54:07 +02:00
Pearl Dsilva	b5e2c181f9	Updating pom.xml version numbers for release 4.20.2.0-SNAPSHOT Signed-off-by: Pearl Dsilva <pearl1594@gmail.com>	2025-06-06 15:38:12 +05:30
Pearl Dsilva	c61a5eb430	Updating pom.xml version numbers for release 4.20.1.0 Signed-off-by: Pearl Dsilva <pearl1594@gmail.com>	2025-05-30 12:43:00 +05:30
Wei Zhou	842b2f8c24	Merge remote-tracking branch 'apache/4.20'	2025-05-19 21:25:37 +02:00
Abhisar Sinha	d55aa70f7e	Restore single backed-up volume on a live instance attaches the volume as a Raw image making it unreadable (#10844 )	2025-05-14 15:23:07 +02:00
Abhishek Kumar	1c1dad977e	Merge remote-tracking branch 'apache/4.20'	2025-03-06 09:55:27 +05:30
Pearl Dsilva	7f4e6a9d51	NAS B&R Plugin enhancements (#9666 ) * NAS B&R Plugin enhancements * Prevent printing mount opts which may include password by removing from response * revert marvin change * add sanity checks to validate minimum qemu and libvirt versions * check is user running script is part of libvirt group * revert changes of retore expunged VM * add code coverage ignore file * remove check * issue with listing schedules and add defensive checks * redirect logs to agent log file * add some more debugging * remove test file * prevent deletion of cks cluster when vms associated to a backup offering * delete all snapshot policies when bkp offering is disassociated from a VM * Fix `updateTemplatePermission` when the UI is set to a language other than English (#9766) * Fix updateTemplatePermission UI in non-english language * Improve fix --------- * Add nobrl in the mountopts for cifs file system * Fix restoration of VM / volumes with cifs * add cifs utils for el8 * add cifs-utils for ubuntu cloudstack-agent * syntax error * remove required constraint on both vmid and id params for the delete bkp schedule command	2025-03-04 11:32:09 -05:00
dahn	48f890a693	resolve merge problems in the backup framework (#10457 ) Co-authored-by: Wei Zhou <weizhou@apache.org>	2025-02-26 14:12:40 +01:00
Abhisar Sinha	a7beaaf73b	Add Resource Limits to Backups and Object Storage (#10017 ) Doc PR : https://github.com/apache/cloudstack-documentation/pull/461 This PR fixes https://github.com/apache/cloudstack/issues/8638 == Description Four new Resource Types have been added. Admin can configure corresponding resource limits for the tenants at different levels (domain, account, project) User dashboard's Storage section will show the new resources, their limits and current usage. 1. backup - No. of backups used by the account 2. backup_storage - Backup storage allocated for the account 3. bucket - No. of buckets used by the accounts 4. object_storage - Object storage allocated for the account. Some other related changes done to BnR framework: 1. Maximum number of Backups to retain can be specified while creating Backup schedules, similar to Scheduled snapshots. 2. Oldest Scheduled backup of the same interval type will be deleted once the number reaches the configured max Backups value. 3. Code refactor: Moved syncBackups method from BackupProvider to the framework BackupManagerImpl, as it is a common functionality and all providers were using duplicated code. Changes done to the Object Storage Framework 1. Quota parameter is made mandatory while creating a bucket. Bucket quota is considered to be the allocated space and will be used to enforce Resource limits. == Schema Changes: 1. New Column `max_backups` added to `backup_schedule` table 4. New Column `backup_interval_type` added to `backups` table == Api Changes: 1. createBackup: new Parameter `scheduleid`. It should be specified whenever a scheduled backup is created. This will translate to the `backup_interval_type` in the `backups` table. 3. createBackupScheduke: new Parameter `max_backups`. To specify maximum number of backups to retain for the given schedule. == Configurations: \|Setting \|Scope \|Default Value \|Description\| \|-------\|--------\|--------------\|-----------\| \|backup.max.hourly \|Global \|8 \|Maximum recurring hourly backups to be retained for an instance\| \|backup.max.daily \|Global \|8 \|Maximum recurring daily backups to be retained for an instance\| \|backup.max.weekly \|Global \|8 \|Maximum recurring weekly backups to be retained for an instance\| \|backup.max.monthly \|Global \|8 \|Maximum recurring monthly backups to be retained for an instance\| \|max.account.backups\| Global\| 20 \| The default maximum number of backups that can be created for an account\| \|max.account.backup.storage\| Global\| 400 \| The default maximum backup storage space (in GiB) that can be used for an account\| \|max.domain.backups\| Global\| 40 \| The default maximum number of backups that can be created for an domain\| \|max.domain.backup.storage\| Global\| 800 \| The default maximum backup storage space (in GiB) that can be used for an domain\| \|max.project.backups\| Global\| 20 \| The default maximum number of backups that can be created for an project\| \|max.project.backup.storage\| Global\| 400 \| The default maximum backup storage space (in GiB) that can be used for an project\| \|Setting \|Scope \|Default Value \|Description\| \|-------\|--------\|--------------\|-----------\| \|max.account.buckets\| Global\| 20 \| The default maximum number of buckets that can be created for an account\| \|max.account.object.storage\| Global\| 400 \| The default maximum object storage space (in GiB) that can be used for an account\| \|max.domain.buckets\| Global\| 40 \| The default maximum number of buckets that can be created for an domain\| \|max.domain.object.storage\| Global\| 800 \| The default maximum object storage space (in GiB) that can be used for an domain\| \|max.project.buckets\| Global\| 20 \| The default maximum number of buckets that can be created for an project\| \|max.project.object.storage\| Global\| 400 \| The default maximum object storage space (in GiB) that can be used for an project\| Co-authored-by: Daan Hoogland <daan@onecht.net> Co-authored-by: Lucas Martins <56271185+lucas-a-martins@users.noreply.github.com> Co-authored-by: Lucas Martins <lucas.martins@scclouds.com.br> Co-authored-by: Pearl Dsilva <pearl1594@gmail.com> Co-authored-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2025-02-07 16:56:20 +05:30
Daan Hoogland	fadb39ece7	Merge release branch 4.20 to main * 4.20: merge errors fixed Restrict the migration of volumes attached to VMs in Starting state (#9725) server, plugin: enhance storage stats for IOPS (#10034) Introducing granular command timeouts global setting (#9659) Improve logging to include more identifiable information (#9873)	2025-01-08 14:01:19 +01:00
Vishesh	a4224e58cc	Improve logging to include more identifiable information (#9873 ) * Improve logging to include more identifiable information for kvm plugin * Update logging for scaleio plugin * Improve logging to include more identifiable information for default volume storage plugin * Improve logging to include more identifiable information for agent managers * Improve logging to include more identifiable information for Listeners * Replace ids with objects or uuids * Improve logging to include more identifiable information for engine * Improve logging to include more identifiable information for server * Fixups in engine * Improve logging to include more identifiable information for plugins * Improve logging to include more identifiable information for Cmd classes * Fix toString method for StorageFilterTO.java	2025-01-06 16:42:37 +05:30
João Jandre	d9774a8462	Updating pom.xml version numbers for release 4.21.0.0-SNAPSHOT Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-11-27 11:47:06 -03:00
João Jandre	c63c7ee63e	Updating pom.xml version numbers for release 4.20.1.0-SNAPSHOT Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-11-27 11:40:45 -03:00
João Jandre	2fe3fcef7c	Updating pom.xml version numbers for release 4.20.0.0 Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-11-19 08:54:07 -03:00
Rohit Yadav	85765c3125	backup: simple NAS backup plugin for KVM (#9451 ) This is a simple NAS backup plugin for KVM which may be later expanded for other hypervisors. This backup plugin aims to use shared NAS storage on KVM hosts such as NFS (or CephFS and others in future), which is used to backup fully cloned VMs for backup & restore operations. This may NOT be as efficient and performant as some of the other B&R providers, but maybe useful for some KVM environments who are okay to only have full-instance backups and limited functionality. Design & Implementation follows the `networker` B&R plugin, which is simply: - Implement B&R plugin interfaces - Use cmd-answer pattern to execute backup and restore operations on KVM host when VM is running (or needs to be restored) - instead of a B&R API client, relies on answers from KVM agent which executes the operations - Backups are full VM domain snapshots, copied to a VM-specific folders on a NAS target (NFS) along with a domain XML - Backup uses libvirt feature: https://libvirt.org/kbase/live_full_disk_backup.html orchestrated via virsh/bash script (nasbackup.sh) as the libvirt-java lacks the bindings - Supported instance volume storage for restore operations: NFS & local storage Refer the doc PR for feature limitations and usage details: https://github.com/apache/cloudstack-documentation/pull/429 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com> Co-authored-by: Pearl Dsilva <pearl1594@gmail.com> Co-authored-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2024-09-05 22:19:13 +05:30

41 Commits