Commit Graph

306 Commits

Author SHA1 Message Date
alena 41f12eb642 Pass isForRebalance parameter to processConnect method of all the listeners - some listeners don't have to be notified when connection happens as a part of Agent Rebalance process (VirtualMachineManagerImpl listener for instance) 2011-06-27 10:20:41 -07:00
alena 0bf34f3612 bug 10447: don't notify VirtualMachineManager listener when do host rebalance - vm sync is not needed in this case.
status 10447: resolved fixed
2011-06-27 10:20:40 -07:00
alena fce33bcd76 bug 10445: Submit rebalancing task in a separate thread for each host
status 10445: resolved fixed
2011-06-27 10:20:40 -07:00
Edison Su 3642aef4c6 bug 10423: agent in ssvm needs to add default keystore, as we are copying templates through https://**realhostip.**
status 10423: resolved fixed
2011-06-24 14:45:47 -04:00
Edison Su 28f0068151 add new option to force destroy vm when delete host, if the VMs are created on local storage 2011-06-23 20:36:13 -04:00
anthony 62249f3eae 1. return message to UI if adding primary storage failed
2. delete primary storage entry if adding primary storage failed
2011-06-22 18:44:33 -07:00
alena 979e3a2d56 bug 10406: fixed concurrency problem happening during Agent Transfer Monitor task
status 10406: resolved fixed
2011-06-22 14:12:22 -07:00
Abhinandan Prateek 976aab77bb bug 10305: for a systemvm only applicable system vm offering should be displayed
status 10305: resolved fixed
While creating a system vm offering, specify the type. If no type is specified, it defaults to domainrouter.
While requesting a set of system offerings, specify the parameter systemvmtype.
2011-06-20 20:01:40 +05:30
Abhinandan Prateek cf15573d06 bug 10313: marking default system offering as default so that they should not be deleted
status 10313: resolved fixed
2011-06-19 12:30:07 +05:30
Edison Su ad5162ef86 fix ebtable cleanup issue: on ubuntu, it's not deleted if vm is stopped 2011-06-16 19:26:24 -04:00
Edison Su 2e8d1bbd6c bug 10190: add log if failed to delete host when host is in UP state 2011-06-15 12:02:31 -04:00
Kelven Yang 24c87c306b merge adding host fix from 2.2.4 2011-06-14 17:16:19 -07:00
alena 78852376e2 Reverted commit made by mistake
This reverts commit bd40212c3b5e885f89d7b43a6e8ccc14fb6d19fb.
2011-06-13 22:53:05 -07:00
alena 769ec409d7 Added findRequest() method to ClusteredAgentAttache - needed for cancelling requests in transfer queue 2011-06-13 22:43:34 -07:00
alena b2b4a63fd3 Some fixes to agent lb code:
* Schedule agent LB as a TimerTask as it can take time for the management server to start; and it can accept rebalance requests only when it's up and running
* Removed Starting state from mshost as it's not being used anywhere
* Fixed the bug where requests weren't routed properly from the old host owner to the new one.
2011-06-13 22:31:49 -07:00
Frank 379cbc1d55 Store all parameters of url call to BaseCmd.fullUrlParams so there will be no
changes in future API because all parameters can be retrieved from API command itself
2011-06-08 10:25:15 -07:00
alena 14cdc7de14 bug 9127: covered failure scenarios for agent LB.
status 9127: resolved fixed

The feature is completed; please file separate bugs if any issue arises during the testing.
Wiki link describing how agentLB works: http://intranet.lab.vmops.com/engineering/release-2.2-features/agent-load-balancing
2011-06-05 17:35:30 -07:00
Alex Huang 019cc78976 Fixes problems in routing between management servers 2011-06-05 16:06:54 -07:00
Alex Huang a0ce3da191 fixed problem with header 2011-06-05 13:51:29 -07:00
Sheng Yang 7cf312674b bug 10135: Fix clustered agent manager's version bug
We are using v1 now, instead of v3. This bug resulted in all the commands from
the other clustered management server being sent to agentmanager and then dropped.

status 10135: resolved fixed
2011-06-04 21:28:03 -07:00
Sheng Yang ff86c865e2 bug 10135: Add SSL encryption for non-NIO link
Now Link.write() supports SSL encryption. And since there is no user of
Link.read(), we comment it out.
2011-06-04 21:27:18 -07:00
Alex Huang d9e0bcfa1e bug 10126: Renamed getPodId() to getPodIdToDeployIn() 2011-06-03 22:17:08 -07:00
alena 635a1075de bug 9127: intermediate checkin for agent load balancer 2011-06-02 21:47:54 -07:00
Alex Huang 0ab12edd6c network tagging changes 2011-05-31 09:49:27 -07:00
Edison Su 59a6e2157a fix user data, and template sync 2011-05-27 19:44:26 -04:00
alena cd6d933e48 Better logging for cluster agent rebalancing 2011-05-25 18:59:26 -07:00
Frank 44a0fff2ec baremetal able to enter maintenance mode, delete host 2011-05-25 12:13:19 -07:00
alena 55692fd7cf bug 9127: intermediate checkin for agent load balancer 2011-05-24 18:45:00 -07:00
Alex Huang d7f527f8ac get rid of compile error 2011-05-21 15:41:12 -07:00
Alex Huang 567d946ab0 template listing fix 2011-05-19 15:26:24 -07:00
Abhinandan Prateek 9de370be61 do not propagate password changes if Attache is ConnectedAgentattache 2011-05-19 19:18:33 +05:30
Alex Huang 9c69a983ba session info within sequence numbers 2011-05-17 18:01:11 -07:00
Alex Huang 0ba6091ca7 better way to log the agent request and response. Introduced the idea of via agents in the request header 2011-05-17 16:02:44 -07:00
anthony 1501503656 multiple secondary storages fix 2011-05-17 16:01:35 -07:00
Alex Huang ea9703997b proxy agent 2011-05-17 07:12:58 -07:00
anthony 9c4964bc1b fixed merge 2011-05-16 16:54:00 -07:00
anthony d835067f40 multiple secondary storage support
template download works
creating snapshot works
creating volume/template from snapshot works
2011-05-16 16:37:15 -07:00
anthony ba6b2ca670 initial checkin for multiple secondary storage 2011-05-16 16:37:15 -07:00
Alex Huang 5ce631e9d7 Separated resource management and agent management code. It's not all done but at least we make a first step 2011-05-16 10:55:18 -07:00
Alex Huang ab49c70add Added some annotations for logging commands 2011-05-16 10:55:18 -07:00
Abhinandan Prateek 733a892aa4 return true on success 2011-05-16 16:02:35 +05:30
Abhinandan Prateek 542e3d6f53 bug 6451: host username/password should be changeable through API
Propagate update password to other management server nodes in a cluster.
2011-05-16 15:57:35 +05:30
Abhinandan Prateek d161337710 bug 6451: host username/password should be changeable through API
Password is queued, on error next password is used.
2011-05-16 12:08:05 +05:30
Abhinandan Prateek f73ed92090 bug 8115: Source IP filtering in Virtual Router
We don't want to send an empty cidr param
2011-05-13 15:41:02 +05:30
Alex Huang 4ee5f1ca42 Better logging and better finalize 2011-05-12 17:02:35 -07:00
Alex Huang 753b6a00a6 faster agent connection 2011-05-12 16:57:09 -07:00
Alex Huang 028f12f1b9 Removed the check for statuses in marking hosts as disconnected 2011-05-12 16:48:02 -07:00
kishan d456f89095 bug 8866: Direct Network Usage, TrafficSentinel support added, work in progress 2011-05-12 19:43:47 +05:30
Alex Huang efedf018c8 propagate b3aea1878395af343e18382b7f1c376b5be04567 2011-05-10 05:48:29 -07:00
Alex Huang 6805788f7d added ping back 2011-05-10 05:45:47 -07:00
Alex Huang 08bb454241 Eliminate the initialize call if we know someone else is connecting to a host 2011-05-10 05:45:30 -07:00
Alex Huang 80b6df17b9 make agent load size configurable 2011-05-10 05:45:06 -07:00
Alex Huang da0fdfcfed Propagate db2fc1713c79e931cfa0f1580943ed80d2b12c8c 2011-05-10 05:45:04 -07:00
anthony 330f1815fa 1. in 2.1.x some hosts don't have cluster.guid in host_details
2. loading one host failure should not affect loading other hosts

Conflicts:

	server/src/com/cloud/agent/manager/ClusteredAgentManagerImpl.java
2011-05-06 12:35:51 -07:00
Alex Huang 43c3c775f2 bug 9741: somehow some of the removed code got back into the code path. Must have hit one undo too many. 2011-05-05 17:57:06 -07:00
Alex Huang 7bc25643a9 propagating fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang b2cf4e39ea propagate fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang 117a81acf6 cleanup agent manager 2011-05-05 17:57:06 -07:00
Alex Huang 66f19c6318 white space checks 2011-05-05 16:15:33 -07:00
Alex Huang 0ee7e7678a Propagate fixes from 2.2.4 2011-05-05 16:13:46 -07:00
Frank 3b0fb655a9 New hypervisor type 2011-05-04 18:58:57 -07:00
Alex Huang fba1c95512 bug 9615: Part of the HA cleanup 2011-05-03 16:34:53 -07:00
Chiradeep Vittal 948df86f5d accidental commit 2011-05-03 10:46:13 -07:00
Chiradeep Vittal 7a29f8dc48 more fixes prior to pushing the merge 2011-05-03 10:41:31 -07:00
Chiradeep Vittal b42ebf4872 Build fixes after first pass at merge 2011-05-03 10:41:31 -07:00
Chiradeep Vittal 303e2a7481 Start of zonesfeature / mycloud/cloudkit 2011-05-03 10:40:17 -07:00
Abhinandan Prateek db916401a0 bug 6451: XenServer username/password should be changeable through API
status 6451: resolved fixed
2011-05-03 09:45:49 +05:30
Jessica Wang 747ab40f48 fix a bug - adding a cluster of type XenServer returned an error. 2011-05-02 16:54:53 -07:00
prachi 1d4a59ce73 Bug 9666 - hostId and spoolId overlap in op_host_capacity table
Changes:
- When a host connects, we check if it has a CPU and RAM entry in capacity table. If the entry is found, the values are updated if possible. If the entry is not found a new one is inserted.
- The searchCriteria used to check if CPU entry is present was wrong. We were passing in a criteria which did not specify capacityType. So for hostId >= 200, the search would return capacity entries of storage pools also since poolIDs start from 200 onwards.
- Since an entry was found (although the wrong one), we tried to update it. But update does not happen since the capacity ranges don't match.
And a new insert for CPU also does not happen since an entry is found.
- So as a result CPU entries are never inserted in the table for hostIds >=200

- As a fix, corrected the search criteria.
- During VM deployment, when the entry is not found, we get a NPE. Added a null check to avoid that.
2011-05-02 15:17:30 -07:00
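The lookup problem described in bug 9666 can be illustrated with a minimal sketch. The classes `CapacityEntry` and `CapacityLookup`, their methods, and the type strings are hypothetical names chosen for illustration, not CloudStack's actual code. The only details taken from the commit message are that host and storage pool entries share one capacity table and that pool IDs start from 200.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a row in op_host_capacity (not the real CloudStack class).
class CapacityEntry {
    final long hostOrPoolId;
    final String capacityType; // "CPU", "MEMORY" or "STORAGE" (illustrative values)

    CapacityEntry(long hostOrPoolId, String capacityType) {
        this.hostOrPoolId = hostOrPoolId;
        this.capacityType = capacityType;
    }
}

class CapacityLookup {
    // Buggy lookup: matches on ID only, so a storage pool row can be returned for a host.
    static CapacityEntry findById(List<CapacityEntry> table, long id) {
        for (CapacityEntry e : table) {
            if (e.hostOrPoolId == id) {
                return e;
            }
        }
        return null;
    }

    // Corrected lookup: the capacity type is part of the search criteria.
    static CapacityEntry findByIdAndType(List<CapacityEntry> table, long id, String type) {
        for (CapacityEntry e : table) {
            if (e.hostOrPoolId == id && e.capacityType.equals(type)) {
                return e;
            }
        }
        return null;
    }

    // A table holding only a storage pool row whose ID collides with host 200.
    static List<CapacityEntry> sampleTable() {
        List<CapacityEntry> table = new ArrayList<>();
        table.add(new CapacityEntry(200L, "STORAGE"));
        return table;
    }
}
```

With the sample table, the ID-only lookup for host 200 returns the storage pool row, so no CPU row would ever be inserted. The type-aware lookup returns `null`, which is the signal to insert a new CPU entry.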
Kelven Yang b6364627a3 Decode the url content sent from UI first as URL has encoded url as a whole 2011-04-29 14:17:43 -07:00
anthony 7c105681b2 cluster guid is set up when old host reconnect happens after upgrade, prohibit adding host to the cluster before guid is set up 2011-04-29 14:13:25 -07:00
alena 671ec62358 bug 9623: set ha_enable to false for consoleProxy vms and service_offering.
status 9623: resolved fixed

Also set ram_size to 1024 for console proxy offering during the upgrade

Conflicts:

	core/src/com/cloud/vm/SecondaryStorageVmVO.java
	server/src/com/cloud/agent/manager/allocator/impl/UserConcentratedAllocator.java
	server/src/com/cloud/consoleproxy/ConsoleProxyManagerImpl.java
	server/src/com/cloud/storage/allocator/LocalStoragePoolAllocator.java
	server/src/com/cloud/storage/secondary/SecondaryStorageManagerImpl.java
2011-04-29 11:53:07 -07:00
Alex Huang 9909371eaa bug 9445: Signal alert for the host if a primary storage pool was unavailable on that host 2011-04-26 15:09:36 -07:00
alena b749c64d4e bug 9550: get storagePool to Host mappings before doing processDisconnect because these references are being deleted as a part of processDisconnect call.
status 9550: resolved fixed

Conflicts:

	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/storage/StorageManagerImpl.java
2011-04-23 20:43:24 -07:00
prachi b84a7477f0 Bug 9539 - cpu.overprovisioning.factor does not work
Changes:
- Changed host allocators/planner  to use cpu.overprovisioning.factor
- Removed following: while adding a new host, we were setting the total_cpu in op_host_capacity to be actual_cpu * cpu.overprovisioning.factor. Now we set it to actual_cpu.
- ListCapacities response now calculates the total CPU as actual * cpu.overprovisioning.factor (This change does not add anything new - listCapacities was pulling total CPU from op_host_capacity DB earlier which had the cpu.overprovisioning.factor applied already. Now we need to apply it over the DB entry.)
- HostResponse has a new field: 'cpuWithOverprovisioning' that returns the cpu after applying the cpu.overprovisioning.factor

- Db Upgrade 222 to 224 now updates the total_cpu in op_host_capacity to be the actual_cpu for each Routing host.
2011-04-22 18:09:31 -07:00
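The change in bug 9539 amounts to storing the actual CPU and applying the factor when the value is read. A minimal sketch of that arithmetic follows. The class `CpuCapacity` and its method names are assumptions made for illustration, not CloudStack's API, and the capacity check is a simplified stand-in for what an allocator might do.

```java
// Hypothetical helper; names are illustrative, not CloudStack's API.
class CpuCapacity {
    // The capacity table stores the actual CPU; the factor is applied at read time.
    static long cpuWithOverprovisioning(long actualCpuMhz, float overprovisioningFactor) {
        return (long) (actualCpuMhz * overprovisioningFactor);
    }

    // Simplified allocator-style check: does the request fit under the overprovisioned total?
    static boolean hasCapacity(long actualCpuMhz, long usedCpuMhz, long requestedCpuMhz, float factor) {
        return usedCpuMhz + requestedCpuMhz <= cpuWithOverprovisioning(actualCpuMhz, factor);
    }
}
```

For example, a host with 8000 MHz actual CPU and 7000 MHz in use cannot take a 2000 MHz request at a factor of 1.0, but can at a factor of 2.0.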
alena 7255d68875 HA: no need to investigate why vm was stopped on host when host is being Disconnected with investigate=false option 2011-04-22 13:38:25 -07:00
alena 1a6d78eae4 Code cleanup. No need to declare runtime exceptions (CloudRuntimeException, InvalidParameterException, PermissionDenied exceptions) 2011-04-21 16:26:53 -07:00
alena f881d394e2 bug 9415: deleteHost - cleanup vms running on the host to be removed
status 9415: resolved fixed
2011-04-20 15:50:10 -07:00
Kelven Yang 7103e1c9a7 global replacement to avoid using getClass().getName() for unique names 2011-04-19 17:26:41 -07:00
Abhinandan Prateek efa0417881 bug 5242: adding a host does not return useful error
status 5242: closed fixed
2011-04-15 17:28:46 +05:30
nit 1d55d2fc05 bug 8703: delete op_host_capacity entries for host when host is successfully deleted.
status 8703: resolved fixed
2011-04-15 15:37:55 +05:30
nit 8567935561 bug 9221: Parse storage.overprovisioning.factor as a float rather than as an int.
Status 9221: resolved fixed
2011-04-15 13:46:51 +05:30
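Why the int parse in bug 9221 matters can be shown with standard Java parsing calls. The class `FactorParser` and the sample value "1.5" are assumptions for illustration. The commit message does not show the original parsing code.

```java
// Hypothetical helper showing int vs. float parsing of a configuration value.
class FactorParser {
    // Integer.parseInt rejects a fractional value with NumberFormatException.
    static boolean parsesAsInt(String value) {
        try {
            Integer.parseInt(value);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    // Float.parseFloat accepts both whole and fractional values.
    static float parseFactor(String value) {
        return Float.parseFloat(value);
    }
}
```

A factor such as "1.5" fails the int parse but is read correctly as a float.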
Kelven Yang 1b9cbd9166 bug 9223, 9224: persist runid to form cluster session, based on cluster session and DB condition to issue isolation notification for self-fencing 2011-04-13 15:13:54 -07:00
Frank 105db3b15a Merge branch 'baremetal' to master
modifies:
	api/src/com/cloud/api/ApiConstants.java
	api/src/com/cloud/api/commands/AddHostCmd.java
	api/src/com/cloud/api/commands/CreatePodCmd.java
	api/src/com/cloud/api/commands/DeployVMCmd.java
	api/src/com/cloud/dc/Pod.java
	api/src/com/cloud/network/NetworkService.java
	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/configuration/ConfigurationManagerImpl.java
	server/src/com/cloud/dc/HostPodVO.java
	server/src/com/cloud/network/NetworkManager.java
	server/src/com/cloud/network/NetworkManagerImpl.java
	server/src/com/cloud/vm/UserVmManagerImpl.java
	setup/db/create-schema.sql
	utils/src/com/cloud/utils/SerialVersionUID.java
2011-04-11 14:21:41 -07:00
nit debe236a8d bug 8710: CONTD....Introducing a new user role in cloudstack called RESOURCE_DOMAIN_ADMIN. The role would have all the domain_admin rights and the rights to list zone,pods,clusters and so on. More info in the bug 2011-04-11 19:40:37 +05:30
Alex Huang b86828bbbe some formatting changes for agentmanager 2011-04-05 11:04:35 -07:00
Alex Huang 075fba5899 stackmaid is now taskmanager 2011-04-05 10:17:22 -07:00
Kelven Yang 1692caabe8 Make sure hooking of answer command callback be called for both sync/Async commands 2011-04-04 14:46:19 -07:00
Kelven Yang 918df1a838 missing one more place to notify answer commands 2011-04-04 14:44:24 -07:00
Kelven Yang 2759eefcf5 Add weight to command to help future load management, fix issues found in system vm pooling 2011-04-04 09:33:42 -07:00
alena bf588166ed bug 7704: network limits cleanup.
status 7704: resolved fixed

For user vm:
* for default network, take limit from the corresponding service offering
* for all additional networks, take limit from the network offerings

For domainRouter/SSVM/CPVM:
* get info from the network offering

Added new config parameter: "vm.network.throttling.rate". If nw_rate is NULL for serviceOffering, this parameter would be used for default vm's network
2011-04-01 15:48:32 -07:00
Kelven Yang ea95ac1808 bug 8713 : finalize the implementation of supporting system vm pooling 2011-03-31 17:14:09 -07:00
Prasanna Santhanam 41e7642825 don't do a cidr check for the simulator 2011-03-30 19:41:13 +05:30
Murali Reddy 1b251a5fc8 Bug 4061: support for Microsoft Hyper-V changes- part 1 2011-03-30 18:48:51 +05:30
Prasanna Santhanam d431f70447 moving simulator classes to premium 2011-03-30 14:00:50 +05:30
Prasanna Santhanam 29ef62f795 refactoring the simulator for 2.2 2011-03-30 14:00:50 +05:30
alena bb895a7b75 bug 8103: don't allow to remove host if it has Running/Stopping/Starting vms
status 8103: resolved fixed
2011-03-25 17:29:54 -07:00
Alex Huang 9d158dc060 Removed the async create status for volume now that our customers don't use it 2011-03-24 20:04:23 -07:00
prachi 514321cadf Bug 6873: disable/enable mode for clusters (and pods and zones and hosts)
Fixed problems in setting default 'allocation_state'. It is now set to 'Enabled' by default if not provided in the API command.
2011-03-24 12:54:51 -07:00
prachi 923f562aa8 Bug 6873: disable/enable mode for clusters (and pods and zones and hosts)
- Added a new flag 'allocation_state' to zone,pod,cluster and host
- The possible values for this flag are 'Enabled' or 'Disabled'
- When a new zone,pod,cluster or host is added, allocation_state is 'Disabled' by default.
- For existing zone,pod,cluster or host, the state is 'Enabled'.
- All Add/Update/List  commands for each of zone,pod,cluster or host can now take a new parameter 'allocationstate'
- If 'allocation_state' is 'Disabled', Allocators skip that zone or pod or cluster or host.
- For a root admin, ListZones lists all zones including the 'Disabled' zones. But for any other user, the 'Disabled' zones are not included in the response.
- For any usecase that creates/deploys/adds/registers a resource and takes in zone as parameter, now we check if the Zone is 'Disabled'. If yes then the operation cannot be performed by a user other than root-admin. Add volume, snapshot, templates are examples of this usecase.
- To enable the root admin to test a particular pod/cluster/host, deployVM command takes in 'host_id' parameter that can be passed in only by root admin.
If this parameter is passed in by the admin, allocators do not search for hosts and use that host only. StoragePools are searched in the cluster of that host.
If VM cannot be deployed to that host, allocators and deployVM fail without retrying
2011-03-23 22:15:35 -07:00
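The visibility rules listed for bug 6873 can be sketched as follows. The classes `ZoneInfo` and `ZoneVisibility` and their methods are hypothetical, and a boolean stands in for the 'Enabled'/'Disabled' value of allocation_state. This is an illustration of the described behaviour, not the actual CloudStack implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical zone record; 'enabled' stands in for allocation_state == 'Enabled'.
class ZoneInfo {
    final String name;
    final boolean enabled;

    ZoneInfo(String name, boolean enabled) {
        this.name = name;
        this.enabled = enabled;
    }
}

class ZoneVisibility {
    // Root admin sees every zone; any other caller sees only enabled zones.
    static List<String> listZones(List<ZoneInfo> zones, boolean callerIsRootAdmin) {
        List<String> visible = new ArrayList<>();
        for (ZoneInfo z : zones) {
            if (z.enabled || callerIsRootAdmin) {
                visible.add(z.name);
            }
        }
        return visible;
    }

    // Operations that take a zone are refused in a disabled zone unless the caller is root admin.
    static boolean canOperateIn(ZoneInfo zone, boolean callerIsRootAdmin) {
        return zone.enabled || callerIsRootAdmin;
    }
}
```

The same check would apply at pod, cluster, and host level when allocators decide which resources to skip.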