Commit Graph

289 Commits

Author SHA1 Message Date
alena 14cdc7de14 bug 9127: covered failure scenarios for agent LB.
status 9127: resolved fixed

The feature is completed; please file separate bugs if any issue arises during the testing.
Wiki link describing how agentLB works: http://intranet.lab.vmops.com/engineering/release-2.2-features/agent-load-balancing
2011-06-05 17:35:30 -07:00
Alex Huang 019cc78976 Fixes problems in routing between management servers 2011-06-05 16:06:54 -07:00
Alex Huang a0ce3da191 fixed problem with header 2011-06-05 13:51:29 -07:00
Sheng Yang 7cf312674b bug 10135: Fix clustered agent manager's version bug
We are using v1 now, instead of v3. This bug result in all the commands from
the other clustered management server would be sent to agentmanager, and then dropped.

status 10135: resolved fixed
2011-06-04 21:28:03 -07:00
Sheng Yang ff86c865e2 bug 10135: Add SSL encryption for non-NIO link
Now Link.write() support SSL encryption. And since there is no user of
Link.read(), we comment it out.
2011-06-04 21:27:18 -07:00
Alex Huang d9e0bcfa1e bug 10126: Renamed getPodId() to getPodIdToDeployIn() 2011-06-03 22:17:08 -07:00
alena 635a1075de bug 9127: intermediate checkin for agent load balancer 2011-06-02 21:47:54 -07:00
Alex Huang 0ab12edd6c network tagging changes 2011-05-31 09:49:27 -07:00
Edison Su 59a6e2157a fix user data, and template sync 2011-05-27 19:44:26 -04:00
alena cd6d933e48 Better logging for cluster agent rebalancing 2011-05-25 18:59:26 -07:00
Frank 44a0fff2ec baremetal able to enter maintenance mode, delete host 2011-05-25 12:13:19 -07:00
alena 55692fd7cf bug 9127: intermediate checkin for agent load balancer 2011-05-24 18:45:00 -07:00
Alex Huang d7f527f8ac get rid of compile error 2011-05-21 15:41:12 -07:00
Alex Huang 567d946ab0 template listing fix 2011-05-19 15:26:24 -07:00
Abhinandan Prateek 9de370be61 do not propogate password changes if Attache is ConnectedAgentattache 2011-05-19 19:18:33 +05:30
Alex Huang 9c69a983ba session info within sequence numbers 2011-05-17 18:01:11 -07:00
Alex Huang 0ba6091ca7 better way to log the agent request and response. Introduced the idea of via agents in the request header 2011-05-17 16:02:44 -07:00
anthony 1501503656 multiple secondary storages fix 2011-05-17 16:01:35 -07:00
Alex Huang ea9703997b proxy agent 2011-05-17 07:12:58 -07:00
anthony 9c4964bc1b fixed merge 2011-05-16 16:54:00 -07:00
anthony d835067f40 multiple secondary storage support
template download works
creating snapshot works
creating volume/template from snapshot works
2011-05-16 16:37:15 -07:00
anthony ba6b2ca670 initial checkin for multiple secondary storage 2011-05-16 16:37:15 -07:00
Alex Huang 5ce631e9d7 Separated resource management and agent management code. It's not all done but at least we make a first step 2011-05-16 10:55:18 -07:00
Alex Huang ab49c70add Added some annotations for logging commands 2011-05-16 10:55:18 -07:00
Abhinandan Prateek 733a892aa4 return true on success 2011-05-16 16:02:35 +05:30
Abhinandan Prateek 542e3d6f53 bug 6451: host username/password should be changable through API
Propogate update password to other managment server nodes in a cluster.
2011-05-16 15:57:35 +05:30
Abhinandan Prateek d161337710 bug 6451: host username/password should be changable through API
Password is queued, on error next password is used.
2011-05-16 12:08:05 +05:30
Abhinandan Prateek f73ed92090 bug 8115: Source IP filtering in Virtual Router
We don't want to send an empty cidr param
2011-05-13 15:41:02 +05:30
Alex Huang 4ee5f1ca42 Better logging and better finalize 2011-05-12 17:02:35 -07:00
Alex Huang 753b6a00a6 faster agent connection 2011-05-12 16:57:09 -07:00
Alex Huang 028f12f1b9 Removed the check for statuses in marking hosts as disconnected 2011-05-12 16:48:02 -07:00
kishan d456f89095 bug 8866: Direct Network Usage, TrafficSentinel support added, work in progress 2011-05-12 19:43:47 +05:30
Alex Huang efedf018c8 propagate b3aea1878395af343e18382b7f1c376b5be04567 2011-05-10 05:48:29 -07:00
Alex Huang 6805788f7d added ping back 2011-05-10 05:45:47 -07:00
Alex Huang 08bb454241 Eliminate the initialize call if we know someone else is connecting to a host 2011-05-10 05:45:30 -07:00
Alex Huang 80b6df17b9 make agent load size configurable 2011-05-10 05:45:06 -07:00
Alex Huang da0fdfcfed Propagate db2fc1713c79e931cfa0f1580943ed80d2b12c8c 2011-05-10 05:45:04 -07:00
anthony 330f1815fa 1. in 2.1.x some hosts don't have cluster.guid in host_details
2. loading one host failure should not affect loading other hosts

Conflicts:

	server/src/com/cloud/agent/manager/ClusteredAgentManagerImpl.java
2011-05-06 12:35:51 -07:00
Alex Huang 43c3c775f2 bug 9741: somehow some of the removed code got back into the code path. Must have hit one undo too many. 2011-05-05 17:57:06 -07:00
Alex Huang 7bc25643a9 propagating fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang b2cf4e39ea propagate fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang 117a81acf6 cleanup agent manager 2011-05-05 17:57:06 -07:00
Alex Huang 66f19c6318 white space checks 2011-05-05 16:15:33 -07:00
Alex Huang 0ee7e7678a Propagate fixes from 2.2.4 2011-05-05 16:13:46 -07:00
Alex Huang fba1c95512 bug 9615: Part of the HA cleanup 2011-05-03 16:34:53 -07:00
Chiradeep Vittal 948df86f5d accidental commit 2011-05-03 10:46:13 -07:00
Chiradeep Vittal 7a29f8dc48 more fixes prior to pushing the merge 2011-05-03 10:41:31 -07:00
Chiradeep Vittal b42ebf4872 Build fixes after first pass at merge 2011-05-03 10:41:31 -07:00
Chiradeep Vittal 303e2a7481 Start of zonesfeature / mycloud/cloudkit 2011-05-03 10:40:17 -07:00
Abhinandan Prateek db916401a0 bug 6451: XenServer username/password should be changable through API
status 6451: resolved fixed
2011-05-03 09:45:49 +05:30
Jessica Wang 747ab40f48 fix a bug - adding a cluster of type XenServer returned an error. 2011-05-02 16:54:53 -07:00
prachi 1d4a59ce73 Bug 9666 - hostId and spoolId overlap in op_host_capacity table
Changes:
- When a host connects, we check if it has a CPU and RAM entry in capacity table. If the entry is found, the values are updated if possible. If the entry is not found a new one is inserted.
- The searchCriteria used to check if CPU entry is present was wrong. We were passing in a criteria which did not specify capacityType. So for hostId >= 200, the serach would return capacity entries of storage pools also since poolIDs start from 200 onwards.
- Since an entry was found (although the wrong one), we tried to update it. But update does not happen since the capacity ranges dont match.
And a new insert for CPU also does not happen since an entry is found.
- So as a result CPU entries are never inserted in the table for hostIds >=200

- As a fix, corrected the search criteria.
- During VM deployment, when the entry is not found, we get a NPE. Added a null check to avoid that.
2011-05-02 15:17:30 -07:00
Kelven Yang b6364627a3 Decode the url content sent from UI first as URL has encoded url as a whole 2011-04-29 14:17:43 -07:00
anthony 7c105681b2 cluster guid is set up when old host reconnect happens after upgrade, prohibit adding host to the cluster before guid is etup 2011-04-29 14:13:25 -07:00
alena 671ec62358 bug 9623: set ha_enable to false for consoleProxy vms and service_offering.
status 9623: resolved fixed

Also set ram_size to 1024 for console proxy offering during the upgrade

Conflicts:

	core/src/com/cloud/vm/SecondaryStorageVmVO.java
	server/src/com/cloud/agent/manager/allocator/impl/UserConcentratedAllocator.java
	server/src/com/cloud/consoleproxy/ConsoleProxyManagerImpl.java
	server/src/com/cloud/storage/allocator/LocalStoragePoolAllocator.java
	server/src/com/cloud/storage/secondary/SecondaryStorageManagerImpl.java
2011-04-29 11:53:07 -07:00
Alex Huang 9909371eaa bug 9445: Signal alert for the host if a primary storage pool was unavailable on that host 2011-04-26 15:09:36 -07:00
alena b749c64d4e bug 9550: get storagePool to Host mappings before doing processDisconnect because these references are being deleted as a part of processDisconnect call.
status 9550: resolved fixed

Conflicts:

	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/storage/StorageManagerImpl.java
2011-04-23 20:43:24 -07:00
prachi b84a7477f0 Bug 9539 - cpu.overprovisioning.factor does not work
Changes:
- Changed host allocators/planner  to use cpu.overprovisioning.factor
- Removed following: while adding a new host, we were setting the total_cpu in op_host_capacity to be actual_cpu * cpu.overprovisioning.factor. Now we set it to actual_cpu.
- ListCapacities response now calculates the total CPU as actual * cpu.overprovisioning.factor (This change does not add anything new - listCapacities was pulling total CPU from op_host_capacity DB earlier which had the cpu.overprovisioning.factor applied already. Now we need to apply it over the DB entry.)
- HostResponse has a new field: 'cpuWithOverprovisioning' that returns the cpu after applying the cpu.overprovisioning.factor

- Db Upgrade 222 to 224 now updates the total_cpu in op_host_capacity to be the actual_cpu for each Routing host.
2011-04-22 18:09:31 -07:00
alena 7255d68875 HA: no need to investigate why vm was stopped on host when host is being Dicsonnected with investigate=false option 2011-04-22 13:38:25 -07:00
alena 1a6d78eae4 Code cleanup. No need to declare runtime exceptions (CloudRuntimeException, InvalidParameterException, PermissionDenied exceptions) 2011-04-21 16:26:53 -07:00
alena f881d394e2 bug 9415: deleteHost - cleanup vms running on the host to be removed
status 9415: resolved fixed
2011-04-20 15:50:10 -07:00
Kelven Yang 7103e1c9a7 global replacement to avoid using getClass().getName() for unique names 2011-04-19 17:26:41 -07:00
Abhinandan Prateek efa0417881 bug 5242: adding a host does not return useful error
status 5242: closed fixed
2011-04-15 17:28:46 +05:30
nit 1d55d2fc05 bug 8703: delete op_host_capacity entries for host when host is successfully deleted.
status 8703: resolved fixed
2011-04-15 15:37:55 +05:30
nit 8567935561 bug 9221: Parse storage.overprovisioning.factor as a float rather than as an int.
Status 9221: resolved fixed
2011-04-15 13:46:51 +05:30
Kelven Yang 1b9cbd9166 bug 9223, 9224: persist runid to form cluster session, based on cluster session and DB condition to issue isolation notification for self-fencing 2011-04-13 15:13:54 -07:00
Frank 105db3b15a Merge branch 'baremetal' to master
modifies:
	api/src/com/cloud/api/ApiConstants.java
	api/src/com/cloud/api/commands/AddHostCmd.java
	api/src/com/cloud/api/commands/CreatePodCmd.java
	api/src/com/cloud/api/commands/DeployVMCmd.java
	api/src/com/cloud/dc/Pod.java
	api/src/com/cloud/network/NetworkService.java
	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/configuration/ConfigurationManagerImpl.java
	server/src/com/cloud/dc/HostPodVO.java
	server/src/com/cloud/network/NetworkManager.java
	server/src/com/cloud/network/NetworkManagerImpl.java
	server/src/com/cloud/vm/UserVmManagerImpl.java
	setup/db/create-schema.sql
	utils/src/com/cloud/utils/SerialVersionUID.java
2011-04-11 14:21:41 -07:00
nit debe236a8d bug 8710: CONTD....Introducing a new user role in cloudstack called RESOURCE_DOMAIN_ADMIN. The role would have all the domain_admin rights and the rights to list zone,pods,clusters and so on. More info in the bug 2011-04-11 19:40:37 +05:30
Alex Huang b86828bbbe some formatting changes for agentmanager 2011-04-05 11:04:35 -07:00
Alex Huang 075fba5899 stackmaid is now taskmanager 2011-04-05 10:17:22 -07:00
Kelven Yang 1692caabe8 Make sure hooking of answer command callback be called for both sync/Async commands 2011-04-04 14:46:19 -07:00
Kelven Yang 918df1a838 missing one more place to notify answer commands 2011-04-04 14:44:24 -07:00
Kelven Yang 2759eefcf5 Add weight to command to help future load management, fix issues found in system vm pooling 2011-04-04 09:33:42 -07:00
alena bf588166ed bug 7704: network limits cleanup.
status 7704: resolved fixed

For user vm:
* for default network, take limit from the corresponding service offering
* for all additional networks, take limit from the network offerings

For domainRouter/SSVM/CPVM:
* get info from the network offering

Added new config parameter: "vm.network.throttling.rate". If nw_rate is NULL for serviceOffering, this parameter would be used for default vm's network
2011-04-01 15:48:32 -07:00
Kelven Yang ea95ac1808 bug 8713 : finalize the implementation of supporting system vm pooling 2011-03-31 17:14:09 -07:00
Prasanna Santhanam 41e7642825 don't do a cidr check for the simulator 2011-03-30 19:41:13 +05:30
Murali Reddy 1b251a5fc8 Bug 4061: support for Microsoft Hyper-V changes- part 1 2011-03-30 18:48:51 +05:30
Prasanna Santhanam d431f70447 moving simulator classes to premium 2011-03-30 14:00:50 +05:30
Prasanna Santhanam 29ef62f795 refactoring the simulator for 2.2 2011-03-30 14:00:50 +05:30
alena bb895a7b75 bug 8103: don't allow to remove host if it has Running/Stopping/Starting vms
status 8103: resolved fixed
2011-03-25 17:29:54 -07:00
Alex Huang 9d158dc060 Removed the async create status for volume now that our customers don't use it 2011-03-24 20:04:23 -07:00
prachi 514321cadf Bug 6873: disable/enable mode for clusters (and pods and zones and hosts)
Fixed problems in setting default 'allocation_state'. It is now set to 'Enabled' by default if not provided in the API command.
2011-03-24 12:54:51 -07:00
prachi 923f562aa8 Bug 6873: disable/enable mode for clusters (and pods and zones and hosts)
- Added a new flag 'allocation_state' to zone,pod,cluster and host
- The possible values for this flag are 'Enabled' or 'Disabled'
- When a new zone,pod,cluster or host is added, allocation_state is 'Disabled' by default.
- For existing zone,pod,cluster or host, the state is 'Enabled'.
- All Add/Update/List  commands for each of zone,pod,cluster or host can now take a new parameter 'allocationstate'
- If 'allocation_state' is 'Disabled', Allocators skip that zone or pod or cluster or pod.
- For a root admin, ListZones lists all zones including the 'Disabled' zones. But for any other user, the 'Disabled' zones are not included in the response.
- For any usecase that creates/deploys/adds/registers a resource and takes in zone as parameter, now we check if the Zone is 'Disabled'. If yes then the operation cannot be performed by a user other than root-admin. Add volume, snapshot, templates are examples of this usecase.
- To enable the root admin to test a particular pod/cluster/host, deployVM command takes in 'host_id' parameter that can be passed in only by root admin.
If this parameter is passed in by the admin, allocators do not search for hosts and use that host only. StoragePools are searched in the cluster of that host.
If VM cannot be deployed to that host, allocators and deployVM fails without retrying
2011-03-23 22:15:35 -07:00
Alex Huang b75a23b285 added support for 2.1.8 and 2.2.2 upgrade 2011-03-23 15:29:17 -07:00
prachi 8da874ca0b Fixed the NPE at RecreateHostAllocator found when a Pod with no cluster was present in the system. 2011-03-18 18:08:47 -07:00
Frank b4e18b7fbe Bug 8208 - bare metal provisioning
use hostTags in addHostCmd
2011-03-18 18:04:34 -07:00
anthony 8e741fb80d bug 9024:
two issues here:
1. in some case, two sequent commands are sent out at the same time.
2. before starting a user VM , make sure domr is up

status 9024: resolved fixed
2011-03-17 17:30:19 -07:00
will ae549ed4d1 bug 8208: Added first addHost dialog to take into account baremetal requirements in the resource page.
- Standardized the addHost optional parameters for baremetal.
2011-03-17 16:29:31 -07:00
prachi 2076eec42b Changes for Bug 8997 - Fail to create volume from snapshot: NPE at "allocateToPool(FirstFitStoragePoolAllocator.java:63)
- ClusterId passed in could be null
2011-03-15 18:40:33 -07:00
anthony ebba4d9420 bug 8480: allow multiple hosts in maintenance mode
status 8480: resolved fixed
2011-03-15 16:35:01 -07:00
nit f88fb1e505 bug 8887 : Stats Calculation Improvement - Storage stats wont update DB anymore and would be kept "in memory" just like other stats. For the listCapacityCmd which consumes it (sec. storage used and primary storage used) this would be constructed using the in memory maps rather than DB which wont have sec storage and primary storage used in the DB anymore. 2011-03-14 18:45:00 -07:00
prachi ed9a11c6b6 API changes for Bug 3316 - Manual live migration of Virtual Machines
This is a Root admin only functionality
---------------------
Service API changes:
---------------------
- ManagementServer  will expose new API:
   Pair<List<HostVO>, List<Long>> listHostsForMigrationOfVM(UserVm vm, Long
startIndex, Long pageSize)
The API returns list of all hosts in the VM's cluster minus the current host and also a list of hostIds that seem to have enough CPU and RAM capacity to host this VM.
- ListHostsCmd will call this service API if virtualmachineid is present in the request.
- MigrateVmCmd is the new command added that takes in virtualmachineid and destination hostid
- UserVmService  will expose a new API:  UserVm migrateVirtualMachine(UserVm vm, Host destinationHost)

------------------------------------
API throws error in following cases:
------------------------------------
- User is not a root Admin. (‘Permission denied’)
- A VM uses local storage, we cannot migrate it, so ‘listHosts’ will throw error.
- We fail to migrate the VM on the chosen host.
- API will support migration for XenServer only currently. So error is thrown
if hypervisor is not XenServer (e.g KVM, vSphere etc)
- Destination host is not in same cluster as source host.
- VM is not in running state
2011-03-11 17:49:53 -08:00
anthony 7d7dee791a bug 7877: prevent two mgmt servers connecting to one same host
status 7877: resolved fixed
2011-03-09 18:29:42 -08:00
Kelven Yang 6203ba6bfe bug 8558: improve cluster management 2011-03-09 17:18:48 -08:00
Frank 469280866d Bug 8208 - bare metal provisioning
able to add external DHCP server
2011-03-08 17:29:33 -08:00
Frank 6c819c1491 Merge branch 'bareMetal'
Conflicts:
	api/src/com/cloud/api/ApiConstants.java
	api/src/com/cloud/api/commands/DeployVMCmd.java
	api/src/com/cloud/offering/ServiceOffering.java
	api/src/com/cloud/vm/UserVmService.java
	client/tomcatconf/components.xml.in
	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/configuration/DefaultComponentLibrary.java
	server/src/com/cloud/deploy/FirstFitPlanner.java
	server/src/com/cloud/service/ServiceOfferingVO.java
	server/src/com/cloud/vm/UserVmManagerImpl.java
	server/src/com/cloud/vm/VirtualMachineManagerImpl.java
2011-03-08 14:18:11 -08:00
Frank 5517046a66 Bug 8208 - bare metal provisioning
throw exception when pxe server is not per pod
2011-03-08 10:05:58 -08:00
nit 7c127bf096 bug 8824: remove capacity entry when deleting secondary storage.
status 8824: resolved fixed
2011-03-04 17:40:43 -08:00
Alex Huang 40db230095 more changes for db upgrade 2011-03-04 11:09:24 -08:00
Frank 7fa053370e Bug 8208 - bare metal provisioning
Add bare metal planner
2011-03-01 17:47:37 -08:00