3783 Commits

Author SHA1 Message Date
alena b6f58b77b8 bug 11462: 1) when delete PF rule, revoke corresponding firewall first (if exists) 2) never remove PF rule from the table when corresponding firewall rule wasn't removed yet
status 11462: resolved fixed

Reviewed-by: edison@cloud.com

Conflicts:

	server/src/com/cloud/network/lb/LoadBalancingRulesManagerImpl.java
2011-09-20 10:32:57 -07:00
alena 459b32cc6a bug 11493: no need to do cross domain checks for the templates as domain level templates are not supported. We support only public (everybody can see and use) and private (only owner can see and use unless changed with updateTemplatePermissions call)
status 11493: resolved fixed

Reviewed-by: will@cloud.com
2011-09-20 10:31:57 -07:00
Nitin 7bcbdeb834 bug 11051: ListPods - Introduce a flag 'showCapacitites'. When it's true it will start displaying all the capacities (as objects in the response) for the respective cluster. 2011-09-20 15:59:45 +05:30
Nitin 4a5f6faca2 bug 11051: ListClusters - Introduce a flag 'showCapacitites'. When it's true it will start displaying all the capacities (as objects in the response) for the respective cluster. 2011-09-20 14:33:46 +05:30
Edison Su a02ae53417 bug 11495: set bootable as user selected for iso/templates
status 11495: resolved fixed
Reviewed-by:alena@citrix.com
2011-09-19 17:48:55 -07:00
alena 03bd7d0c96 bug 11480: verify account/user state when do single signon verification
status 11480: resolved fixed

Reviewed-by: kelven@cloud.com
2011-09-19 10:57:18 -07:00
alena ba64010cb5 bug 11470: 1) fixed listResourceLimits 2) Fixed -1 to be treated as unlimited for all the resources
status 11470: resolved fixed
2011-09-19 09:47:07 -07:00
prachi 0eea1cb733 Bug 11404 - VM was in Running state, had null for a pod_id, basically didn't allow creation of subsequent VMs
Reviewed-by: Alex

Changes:
- When management server starts, it goes through all the pending work items from op_it_work table and schedules HA work for each. It used to mark each item as done. Instead we should keep the item as pending and let it get marked as Done after the HA work is done.
- Changes in VirtualMachineMgr::advanceStop() :
a) if we find a VM with null hostId, we stop the VM only if it is forced stopped.
b) if VM state transition to Stopping fails, for states Starting and Migrating we try to find the pending work item and then clean up the VM. In case the state is Stopping we can clean up directly.
c) We proceed releasing all resources only if state transitioned to 'Stopping'.
- Changes in HA:
a) Depend on VirtualMachineMgr::advanceStop() in case host is not found to do VM cleanup
- When Vm state between mgmt server and agent syncs from starting -> running, mark any pending work item as done.

Conflicts:

	server/src/com/cloud/vm/VirtualMachineManagerImpl.java
2011-09-15 19:06:19 -07:00
alena 117de2a61a Do agent disconnect when agent rebalance fails
Reviewed-by: Alex Huang
2011-09-15 18:37:06 -07:00
prachi 6b82ba3ff9 Bug 11457 - NullPointerException while synching VM state from Starting -> Running between management server and agent
Reviewed-by: Kelven and Alex

Changes:
- Added log statements about the podId during deploying a VM.
2011-09-15 16:16:24 -07:00
prachi df17ce5648 Bug 11457 - NullPointerException while synching VM state from Starting -> Running between management server and agent
reviewed-by: Alex/Kelven

Changes:
1.	UserVmManagerImpl :: finalizeStart()
Added null check for the cmds.getAnswers() object. Return ‘true’ if null.

2.	VirtualMachineManagerImpl :: advanceStart()
Move the line to set PodId to the vm being started above the state transition where hostId gets set, so that podId is not null in case management server goes down when vm starts on the agent. On restart, podId is not updated during fullsync. So this will prevent podId remaining null.

vm.setPodId(dest.getPod().getId());
2011-09-15 16:16:15 -07:00
keshav 60d69aabb2 In inline SRX/F5 mode, collect outbound usage for LB rules from the SRX. 2011-09-15 14:45:06 -07:00
Sheng Yang c67594ef50 bug 11300: Make restartNetwork destroy and recreate DHCP server
status 11300: resolved fixed
2011-09-15 11:56:31 -07:00
alena 2138f86bb3 bug 11447: don't do security group check if both groups belong to the same domain
status 11447: resolved fixed
2011-09-15 10:08:00 -07:00
Nitin d53c3f0798 bug 11051: Listing Secondary Storage - also show the total and used storage. Listing Storage pools show the realtime storage stats and correctly populate the allocated storage. 2011-09-15 15:53:31 +05:30
Naredula Janardhana Reddy 10fb507609 bug 11372:
Added two new values "all" and "default" to global config "network.loadbalancer.haproxy.stats.visibility". With this change, it can take six possible values:
            global - stats visible from public network.
            guest-network - stats visible only to guestnetwork.
            link-local - stats visible only to link local network(for xen and kvm).
            disabled - stats disabled.
            all - stats available on public, guest and link-local. (Newly added)
            default - stats available on the serving http port; this does not need any specific http port. (Newly added)
   Except for "default" and "disabled", the remaining 4 values require the stats port to be configured.
2011-09-15 11:46:34 +05:30
Murali Reddy bb7b6d5eee fixing build break due to commit edbd3a862ce16cce1b22f54c843d33f8f6cf809c 2011-09-15 11:40:46 +05:30
anthony 6ecc981e25 bug 9777: introduce Error status for snapshot, if snapshot creation fails, it will be in Error status
status 9777: resolved fixed
2011-09-14 18:14:02 -07:00
Kelven Yang 1b9552ea74 Let VmwareInvestigator return fake but meaningful investigation result 2011-09-14 17:04:28 -07:00
Sheng Yang c85902b04c bug 11307: Using latest query result rather than set bumpup to false
After the host is disconnected, we can't get the latest result, but it's not safe to
assume its priority bump up flag changed.
2011-09-14 16:26:48 -07:00
Sheng Yang f19fca6ec6 Fix restartNetwork wrongly report stop failure 2011-09-14 16:24:43 -07:00
Sheng Yang 474d1a6034 bug 11307: Add destroyRouter command 2011-09-14 16:23:42 -07:00
Sheng Yang 684a603a6e bug 11307: Add cleanup parameter to restartNetwork command
Defaults to true. When cleanup=false, restartNetwork only
re-implements the network and doesn't destroy the existing routers.
2011-09-14 16:23:34 -07:00
Sheng Yang e330e97f4b bug 11307: Mark router as to-be-stopped, rather than force stop it.
Force-stopping the router would release all the resources it used, but the router may
still be running. Add a column "stop_pending" in the database, and stop it when the
router comes back.

The admin can choose to force destroy such a router, then recover the
network using the restartNetwork command with cleanup=false.
2011-09-14 16:23:17 -07:00
Sheng Yang ebd67feae7 bug 11307: Rewrite the redundant router's priority
Now the very first router's priority would be 100, and the second one would be
100 - 2 + 1 = 99.
2011-09-14 16:22:33 -07:00
Sheng Yang dfd36e4ae2 bug 11307: Add Bumped field for domain_router
Only used for redundant virtual router.
2011-09-14 16:22:11 -07:00
Sheng Yang b987123daa bug 11307: Let CheckRouterAnswer() parse the return result 2011-09-14 16:21:33 -07:00
alena bb9fda3435 Fixed the problem in addDhcpEntry code when _dnsBasicZoneUpdates was never reset once set to false.
Conflicts:

	server/src/com/cloud/network/router/VirtualNetworkApplianceManagerImpl.java
2011-09-14 16:11:01 -07:00
Kelven Yang 7a64d8fda4 add VmwareInvestigator and VmwareFencer, use short worker VM name to avoid vCenter truncation 2011-09-14 15:15:26 -07:00
anthony 313b624373 bug 9777: only allow to delete snapshot in BackedUp status
status 9777: resolved fixed
2011-09-14 14:16:19 -07:00
alena 5f9246114b bug 10435: use "hostName" instead of "name" as a search field in listSystemVms
status 10435: resolved fixed
2011-09-14 11:01:37 -07:00
alena b7f45cfbd6 bug 11434: don't execute destroy call when vm is already in Destroyed/Expunging state
status 11434: resolved fixed
2011-09-14 10:33:45 -07:00
prachi ea97ce2a93 Bug 11430 - add host > Host Tags field missing from DB + UI even when host tags specified in add host 2.2.y -> 2.2.12 KVM
Changes:

A KVM agent always connects to the management server itself, so we don't have to do a direct connect. This part of the code was not updating the DB host entry with hosttags.
Corrected the code to save the hosttags while adding a KVM host.
2011-09-14 10:27:07 -07:00
Murali Reddy 78a5b651ac bug 11425: don't count direct IP in resource count while updating resource count through API
status 11425: resolved fixed
2011-09-14 22:52:30 +05:30
Nitin 20f0707755 bug 11190: My changes got reverted. Putting them back - Treating the max account limits with -1 values as infinite 2011-09-14 19:35:48 +05:30
anthony d6cbd2b6bc bug 11275:
remove the heartbeat entry for this Primary Storage when putting it into maintenance mode
create the heartbeat entry for this Primary Storage when canceling maintenance for it

status 11275: resolved fixed
2011-09-13 22:15:21 -07:00
frank 71385f7cc0 check hypervisorType and StoragePoolType in OCFS2 manager when delete host
don't take lock in processResourceEvent()
2011-09-13 17:35:58 -07:00
alena 45c595d558 Don't update resource count for system account 2011-09-13 16:58:54 -07:00
alena 569427ba1f bug 11036: if resource_count record is missing for account or domain, insert it during the management server startup (with count=0)
status 11036: resolved fixed
2011-09-13 15:23:27 -07:00
anthony f071ce571c bug 10949: if a cluster doesn't have any hosts, set cluster guid to new one when add host to this cluster.
status 10949: resolved fixed
2011-09-13 13:53:23 -07:00
alena b05c69f3b3 bug 11246: return lb rule Id in the async response for createLoadBalancer api
status 11246: resolved fixed
2011-09-13 12:08:12 -07:00
anthony c5bc16ef42 XcpServerDiscoverer is only for XenServer 2011-09-13 11:11:55 -07:00
kishan d2293137c7 bug 11199: Moved DB upgrade changes to schema-2211to2212-premium
status 11199: resolved fixed
2011-09-13 23:28:54 +05:30
alena f6a79c603f bug 11036: resource count update is refactored
status 11036: resolved fixed

1) Use row locks instead of a global lock when updating the resource_count table. When updating resource_count for an account, make sure that we lock the account and all related domains
2) Insert resource_count records for account/domain at the moment when account/domain is created.
3) As a part of DB upgrade, insert missing resource_count records for all non-removed accounts/domains

Conflicts:

	core/src/com/cloud/alert/AlertManager.java
	server/test/com/cloud/agent/MockAgentManagerImpl.java
2011-09-13 10:47:36 -07:00
Chiradeep Vittal 9b424a762d fix compilation of mocks 2011-09-13 09:22:52 -07:00
anthony ae64c56c12 bug 11413: when marking a host as disconnected, set lastping to now - pingtimeout
status 11413: resolved fixed
2011-09-12 18:50:07 -07:00
Chiradeep Vittal 3cb4ad4934 Add more detail when unable to schedule cleanup
clean up tests for security group manager v2

move interval to listener -- allows it to be configurable if needed

fix mocks

Enhanced logging for security group manager (from zucchini)

fix merge issues

merge issues
2011-09-12 18:22:55 -07:00
Chiradeep Vittal 1f3c6efb81 Retry only a few times 2011-09-12 18:20:24 -07:00
Chiradeep Vittal fae5e84699 if the xenserver host cannot do bridge firewalling do not attempt to retry the security rule update
change some logs to debug level
2011-09-12 18:20:24 -07:00
Chiradeep Vittal 65fb83035f Enable monitoring over JMX
Add the ability to disable some vms from being scheduled

Add ability to trigger ruleset updates from JMX

added a few more dangerous JMX operations
2011-09-12 18:20:24 -07:00
Chiradeep Vittal d81bc5a375 Dequeue the vm with the largest id first -- this ensures that newly started vms get updated early 2011-09-12 18:20:23 -07:00
Chiradeep Vittal 925f5f2f09 Retry 3 times on deadlock: if not updated, then throw a runtime exception to roll back the vm start 2011-09-12 18:20:23 -07:00
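
The retry-on-deadlock behaviour described in the commit above can be illustrated with a minimal sketch. This is not the code from the commit: the class name `DeadlockRetry`, the `SqlWork` interface and the `withRetry` helper are hypothetical, and the sketch assumes the JDBC driver reports a deadlock as `java.sql.SQLTransactionRollbackException`.

```java
import java.sql.SQLException;
import java.sql.SQLTransactionRollbackException;

public class DeadlockRetry {
    /** Hypothetical unit of work that runs one DB transaction. */
    public interface SqlWork<T> {
        T run() throws SQLException;
    }

    /**
     * Runs the work up to maxAttempts times. A transaction rollback
     * (assumed here to signal a deadlock) triggers another attempt;
     * any other SQL error is rethrown immediately.
     */
    public static <T> T withRetry(int maxAttempts, SqlWork<T> work) {
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return work.run();
            } catch (SQLTransactionRollbackException e) {
                // Deadlock victim: fall through and try again.
            } catch (SQLException e) {
                throw new RuntimeException(e);
            }
        }
        // Not updated after all attempts: a runtime exception lets the
        // caller roll back the enclosing operation (e.g. the VM start).
        throw new RuntimeException("still deadlocked after " + maxAttempts + " attempts");
    }
}
```

A caller would wrap the update as `DeadlockRetry.withRetry(3, () -> updateRow())`, where `updateRow` is whatever performs the transaction.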
Chiradeep Vittal 82fcfe8edc make signature calculation consistent across management servers 2011-09-12 18:20:23 -07:00
Chiradeep Vittal 37b4384990 retry a transaction if it deadlocks 2011-09-12 18:20:23 -07:00
Chiradeep Vittal cfbbe4c031 fix perf. issue in deploy vm. also remove dead code in dao 2011-09-12 18:20:23 -07:00
Chiradeep Vittal c21a214372 if true, this could lead to the dog chasing its own tail 2011-09-12 18:20:23 -07:00
Chiradeep Vittal 5dba8cf430 only update sequence number when asked to. full sync SHOULD NOT request sequence number updates since this will lead to the dog chasing its own tail 2011-09-12 18:20:23 -07:00
Chiradeep Vittal 6252d605c1 Avoid going back to the nics table for information we already have 2011-09-12 18:20:23 -07:00
Chiradeep Vittal d03c5af720 make instance_id unique to allow usage of ON DUPLICATE KEY
useful logging

Reduce batch size to 16 to avoid sql errors

try using multi inserts to overcome deadlock during batch update
2011-09-12 18:20:23 -07:00
Chiradeep Vittal 7e2c2c627f unit test for SG mgr v2 2011-09-12 18:20:22 -07:00
Chiradeep Vittal b52d1157e9 use the new implementation of SG manager 2011-09-12 18:20:22 -07:00
Chiradeep Vittal 4cd6a9217c Use hashset since it is 50% faster. we can use treeset when we support priority 2011-09-12 18:20:22 -07:00
Chiradeep Vittal 2fc782b8ba Add basic Mocks for some managers 2011-09-12 18:20:22 -07:00
Chiradeep Vittal abf4754503 New security group mgr WIP : memory based queueing 2011-09-12 18:20:22 -07:00
Chiradeep Vittal f41b352d36 revert egress rules implementation pending review
Reverts a19212703b9734ebd44ebf55cfdd81ebdc9d7fe4
Reverts 24e4e44b8f0712a37147a3777833de3f9e24829e
2011-09-12 14:45:58 -07:00
Chiradeep Vittal d817f3c364 Revert "bug 10617: Added Egress rules to Security groups."
revert pending review

This reverts commit a19212703b9734ebd44ebf55cfdd81ebdc9d7fe4.
2011-09-12 14:45:58 -07:00
Chiradeep Vittal e216686029 Revert "bug 10617: Intermediate checking"
revert pending review

This reverts commit 561b27565512d438db3d8896df6f97b644b0b498.
2011-09-12 14:45:58 -07:00
Nitin cbe996e1ce bug 11403: Making hypervisor_type of xs-tools.iso (id=200) as XenServer. Made the corresponding DB changes as well. 2011-09-13 03:07:32 +05:30
Kelven Yang 41926cb038 bug 11377: remove the restriction on VMware in API layer to let it support VM manual migration 2011-09-12 14:28:32 -07:00
frank d828f4e434 Bug 11350 - Baremetal destroy instance doesnt power off machine
status 11350: resolved fixed
2011-09-12 14:17:43 -07:00
anthony 299ae3ed91 fixed typo 2011-09-12 12:38:56 -07:00
kishan b30827a0ec bug 9778: Log snapshot create event only if the snapshot is in not removed state after backingUp to secondary. Log snapshot delete event for backed up snapshots only
status 9778: resolved fixed
2011-09-12 20:22:30 +05:30
kishan 5928a9b370 bug 9778: Log snapshot create event only if the snapshot is in not removed state after backingUp to secondary. Log snapshot delete event for backed up snapshots only
status 9778: resolved fixed
2011-09-12 20:01:19 +05:30
Nitin 3e096afba6 bug 10974: Fixed a bug in the search for listRouters API. The bug was introduced due to a change in the VmInstancVO where the name field was changed to hostName. This went unnoticed in the DAO layer as we don't throw an exception but an assert error. 2011-09-12 16:51:17 +05:30
Nitin 002d0fda5a bug 11190: Treating the max account limits with -1 values as infinite. When acquiring a lock to check for resourceLimitExceeded if the lock times out then return as limit exceeded. 2011-09-12 14:26:15 +05:30
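
The two rules in the commit above (a limit of -1 means infinite; a lock timeout counts as "limit exceeded") can be sketched as follows. The class and method names are hypothetical and the lock type is an assumption; the actual code in the repository may use a different locking mechanism.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Lock;

public class ResourceLimitCheck {
    /** A max of -1 is treated as "no limit". */
    public static final long UNLIMITED = -1L;

    /** Returns true if adding 'requested' to 'currentCount' would pass 'max'. */
    public static boolean exceeded(long max, long currentCount, long requested) {
        if (max == UNLIMITED) {
            return false; // infinite limit can never be exceeded
        }
        return currentCount + requested > max;
    }

    /**
     * Same check, guarded by a lock. If the lock cannot be acquired in
     * time, report the limit as exceeded rather than allocating blindly.
     */
    public static boolean exceededWithLock(Lock lock, long timeoutMs,
            long max, long currentCount, long requested) throws InterruptedException {
        if (!lock.tryLock(timeoutMs, TimeUnit.MILLISECONDS)) {
            return true; // lock timed out: fail closed
        }
        try {
            return exceeded(max, currentCount, requested);
        } finally {
            lock.unlock();
        }
    }
}
```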
Kelven Yang 3c41775184 1) Increase working buffer size to 1M when downloading/uploading VMDK, hopefully can improve OVF exporting/importing performance. 2) use atomic SQL operation to get rid of global lock usage 2011-09-09 17:07:09 -07:00
Naredula Janardhana Reddy f9b0962ad9 bug 10617: Intermediate checking
Changes :
         - Fixing API doc +response name + errorMessage
         - Adding separate events to Egress rules
         - Egress rules Using the same database table as that of ingress with new column type.

   Pending Tasks:
         - db upgrade
         - database table rename from security_ingress_rule to generic name, renaming some of the Java classes from ingress to generic name.
         - Retesting on kvm
2011-09-09 18:14:19 +05:30
Nitin 50efe56335 bug 11051: listHostsCmd - Fixed the capacity and stats response. Also removed the unwarranted code. 2011-09-09 14:23:13 +05:30
kishan 4e663bbef9 bug 8338: Added usage.aggregation.range parameter. Also included in 2211 to 2212 upgrade 2011-09-09 13:43:13 +05:30
prachi d854923ac5 Bug 11200 - maximum number of guests per host
Manual Migration should fail if the destination host chosen by Admin already has max guest VMs running.
2011-09-08 18:33:55 -07:00
prachi 4ad9ac5e71 Bug 11200 - maximum number of guests per host
Changes:

To make sure migration does not attempt to pick a host that has running VMs more than the max guest VM's limit:

- Changed manual migration to call host allocators to return a list of hosts suitable for migration. Host allocators check for the max guest VM limit.
- Earlier we returned hosts with enough capacity but now Host Allocators make other checks along with capacity. So the list of hosts returned are hosts that have enough capacity AND satisfy all other conditions like host tags, max guests limit etc. In other words, Allocators don't return hosts that don't satisfy all conditions even if they have capacity.
- Therefore, we now mark the list of hosts returned for manual migration as 'suitable' hosts instead of 'hasenoughCapacity' in the HostResponse.
- HA migration already calls allocators, so no change is needed there.
2011-09-08 18:08:31 -07:00
keshav e3252e5e66 Excluded external firewall/LB from host stats collection, and included them in ping checks 2011-09-08 16:43:37 -07:00
Kelven Yang aa7f06d7eb bug 11384: deprecate vmware.guest.nic.device.type global configuration variable 2011-09-08 16:11:17 -07:00
alena 6e6b6ab2ab Fixed the bug in AgentLB: consider only hosts of type=Routing when calculate average load 2011-09-08 14:57:04 -07:00
Kelven Yang a118880836 bug 10874: don't invalidate the run session of another management server even if it is detected as unreachable, as the event may just be a temporary network condition 2011-09-08 14:22:52 -07:00
anthony 4423da06a2 1. added timeout in Command class so that each command can configure its own timeout; if no timeout is configured, the default timeout of 30 minutes is used
2. added the following configurable timeouts
       PrimaryStorageDownloadWait("Storage", TemplateManager.class, Integer.class, "primary.storage.download.wait", "10800", "In second, timeout for download template to primary storage", null),
       CreateVolumeFromSnapshotWait("Storage", StorageManager.class, Integer.class, "create.volume.from.snapshot.wait", "10800", "In second, timeout for create template from snapshot", null),
       CopyVolumeWait("Storage", StorageManager.class, Integer.class, "copy.volume.wait", "10800", "In second, timeout for copy volume command", null),
       CreatePrivateTemplateFromVolumeWait("Storage", UserVmManager.class, Integer.class, "create.private.template.from.volume.wait", "10800", "In second, timeout for CreatePrivateTemplateFromVolumeCommand", null),
       CreatePrivateTemplateFromSnapshotWait("Storage", UserVmManager.class, Integer.class, "create.private.template.from.snapshot.wait", "10800", "In second, timeout for CreatePrivateTemplateFromSnapshotCommand", null),
       BackupSnapshotWait("Storage", StorageManager.class, Integer.class, "backup.snapshot.wait", "10800", "In second, timeout for BackupSnapshotCommand", null),
2011-09-07 19:17:54 -07:00
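
The fallback described in item 1 of the commit above can be sketched in a few lines. The class name, the method name and the convention that a non-positive value means "not configured" are all assumptions made for illustration; only the 30-minute default and the use of seconds come from the commit message.

```java
public class CommandWait {
    /** Default timeout from the commit message: 30 minutes, in seconds. */
    public static final int DEFAULT_WAIT_SECONDS = 30 * 60;

    /**
     * Returns the timeout to apply to a command. A non-positive value is
     * assumed to mean that the command did not configure its own timeout.
     */
    public static int effectiveWait(int configuredSeconds) {
        return configuredSeconds > 0 ? configuredSeconds : DEFAULT_WAIT_SECONDS;
    }
}
```

With the values listed above, a command configured through `copy.volume.wait` would get 10800 seconds, while a command with no configured timeout would get 1800.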
Kelven Yang 905df61aac bug 11362: Make VM network adapter device type be selectable based on VM-specific details 2011-09-07 17:58:42 -07:00
Kelven Yang f70958809b bug 11360: make user_vm_details generic to all VMs instead of user-only VMs, VM details info will be carried down to hypervisor resource 2011-09-07 15:42:40 -07:00
prachi 84868b7f9c Bug 11200 - maximum number of guests per host
Changes:
- Adding a new table 'hypervisor_capabilities' that will record capabilities for each hypervisor version. Added db schema changes for this.
- Currently a few capabilities have been added, namely, 'max_guests_limit' and 'security_group_enabled'
- Added a new column 'hypervisor_version' to host table. StartupRouting command now takes in this parameter. It should be set when a host connects.
- If a host's hypervisor version is not present, we find all the capabilities rows for that hypervisor type and use the first record.
- 'max_guests_limit' is the maximum number of running guest Vms that a host can have for the given hypervisor.
- Host Allocators use this limit and skip a host if the number of running VMs on that host exceeds this limit.
2011-09-07 14:53:05 -07:00
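
The allocator check described in the commit above can be sketched as a simple filter. The `Host` class and `suitableHosts` method are hypothetical stand-ins, not the allocator classes in the repository. The sketch treats a host that has already reached the limit as full, which is an assumption based on the related manual-migration commit.

```java
import java.util.ArrayList;
import java.util.List;

public class MaxGuestsFilter {
    /** Hypothetical, minimal view of a host for this example. */
    public static final class Host {
        final String name;
        final int runningVms;
        final int maxGuestsLimit; // from the hypervisor capabilities record

        public Host(String name, int runningVms, int maxGuestsLimit) {
            this.name = name;
            this.runningVms = runningVms;
            this.maxGuestsLimit = maxGuestsLimit;
        }
    }

    /** Returns the names of hosts that can still accept another guest VM. */
    public static List<String> suitableHosts(List<Host> candidates) {
        List<String> suitable = new ArrayList<>();
        for (Host h : candidates) {
            if (h.runningVms >= h.maxGuestsLimit) {
                continue; // host is at its guest limit: skip it
            }
            suitable.add(h.name);
        }
        return suitable;
    }
}
```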
alena b883b25a12 bug 11348: make content type for "json" files configurable
status 11348: resolved fixed
2011-09-07 14:22:01 -07:00
alena ba2c6695cc bug 11326: don't try to transfer the agent if it's a forward agent
status 11326: resolved fixed

Also added more logging to the agent rebalance code.

Conflicts:

	server/src/com/cloud/agent/manager/ClusteredAgentManagerImpl.java
2011-09-07 12:47:25 -07:00
anthony 13ee22d421 bug 10078:
1. introduce migratewait in global configuration, the default value is 1 hour
 2. use async xapi VM migration API

status 10078: resolved fixed
2011-09-07 12:36:17 -07:00
kishan 066537a565 bug 4282: Removed caching of signatures. Added detailed error message for expires parameter parsing 2011-09-07 17:51:09 +05:30
alena 47d9cbd2ee bug 11331: allow extending vnet range as a part of updateZone call
status 11331: resolved fixed
2011-09-06 17:40:28 -07:00
kishan f9d5a46871 Compare current and previous network stats before updating. Ignore stats if the current stats are not same as the prev stats. Set NetworkUsageAnswer log level to debug 2011-09-05 18:41:42 +05:30
Naredula Janardhana Reddy 3c1f6fd24b bug 10467: Incorrect display of domain router linklocalmac address and linklocal netmask fixed 2011-09-05 13:42:12 +05:30
alena 3c0396bb20 bug 10751: don't remove already removed templates during the account cleanup
status 10751: resolved fixed
2011-09-02 16:06:55 -07:00
alena b93318cb73 bug 11002: userVmDao - create searchBuilder as a part of dao init process (otherwise it's not threadsafe)
status 11002: resolved fixed
2011-09-02 15:51:52 -07:00
Kelven Yang 4d3d47f01c bug 11304: restore host status after initialization failure 2011-09-02 15:19:36 -07:00
anthony c683fda236 set timeout for CheckOnHostCommand to 50 s 2011-09-02 15:01:32 -07:00