Commit Graph

280 Commits

Author SHA1 Message Date
Sheng Yang 6c493bfb82 Add exception message for AgentManagerImpl.investigate() 2011-07-27 10:53:06 -07:00
Sheng Yang 3a8e13f968 Add exception message for AgentManagerImpl.investigate() 2011-07-27 10:52:48 -07:00
Sheng Yang 96568df702 Add exception message for AgentManagerImpl.investigate() 2011-07-27 10:51:44 -07:00
Alex Huang c610925304 moved agent ping to in memory rather than db based 2011-07-25 15:21:06 -07:00
Alex Huang 10ac7753ed Switched ping to use the same db connection so that running out of db connections won't affect basic operations 2011-07-25 10:36:00 -07:00
Kelven Yang 3a6f3b71e0 bug 10791: add data integrity check upon management server startup 2011-07-21 17:08:29 -07:00
Kelven Yang 95b48ba222 bug 10791: add data integrity check upon management server startup 2011-07-21 17:07:21 -07:00
alena c21273d23a bug 10734: removed global lock in "DirectAgentScanTimerTask". This lock was used to prevent the task from executing on multiple management servers simultaneously.
status 10734: resolved fixed
2011-07-21 16:18:43 -07:00
anthony bef3b480cd bug 10197:
Steps to upgrade XenServer:

1. put cluster in Unmanaged state through UI, then MS will not talk to hosts in the cluster
2. upgrade XenServer according to the XenServer upgrade guide.
3. put cluster in Managed state through UI, then MS will reconnect hosts

TODO:

1. UI
2. vm pool sync, leveraged from kelven's work
2011-07-19 15:44:37 -07:00
anthony 3881e13387 bug 10197:
Steps to upgrade XenServer:

1. put cluster in Unmanaged state through UI, then MS will not talk to hosts in the cluster
2. upgrade XenServer according to the XenServer upgrade guide.
3. put cluster in Managed state through UI, then MS will reconnect hosts

TODO:

1. UI
2. vm pool sync, leveraged from kelven's work
2011-07-19 15:26:25 -07:00
alena c48c3edfbc bug 10271: don't include removed records when searching for local storage pools
status 10217: resolved fixed
2011-07-19 11:10:53 -07:00
Alex Huang d54f6d536a propagating transaction isolation fix for merovingian2 2011-07-18 16:48:49 -07:00
Alex Huang 5590cd93ce bug 10716: Changed the connection's isolation level to allow for more concurrency 2011-07-18 16:32:12 -07:00
alena 7a04334b60 bug 10734: removed global lock in "DirectAgentScanTimerTask". This lock was used to prevent the task from executing on multiple management servers simultaneously.
status 10734: resolved fixed
2011-07-18 15:00:13 -07:00
alena 62270fe62e bug 10734: removed global lock in "DirectAgentScanTimerTask". This lock was used to prevent the task from executing on multiple management servers simultaneously.
status 10734: resolved fixed
2011-07-18 14:32:57 -07:00
Alex Huang e52a97b969 Switched ping to use the same db connection so that running out of db connections won't affect basic operations 2011-07-18 14:22:49 -07:00
anthony 0c53bddb16 bug 10628: root cause is that CheckHealthCommand returns false, so XenServerInvestigator is not called
status 10628: resolved fixed
2011-07-15 10:12:54 -07:00
anthony 18003deedf bug 10628: root cause is that CheckHealthCommand returns false, so XenServerInvestigator is not called
status 10628: resolved fixed
2011-07-14 20:42:26 -07:00
Alex Huang 161b7f1cad Switched ping to use the same db connection so that running out of db connections won't affect basic operations 2011-07-14 10:48:07 -07:00
Edison Su 0dacd3913a bug 10423: agent in ssvm needs to add default keystore, as we are copying templates through https://**realhostip.**
status 10423: resolved fixed
2011-07-08 13:49:43 -07:00
Edison Su a24eef5c81 add new option to force destroy vm when delete host, if the VMs are created on local storage 2011-07-08 13:49:05 -07:00
Edison Su 8306bf7ee5 fix ebtable cleanup issue: on Ubuntu, it is not deleted if the vm is stopped 2011-07-08 13:31:34 -07:00
Edison Su 76d4636823 bug 10190: add log if failed to delete host when host is in UP state 2011-07-08 13:19:16 -07:00
anthony 761af73f24 bug 9855: two fixes.
1. cannot cancel maintenance mode.
2. maintenance related modes are preserved through MS restart

status 9855: resolved fixed
2011-06-27 13:47:43 -07:00
anthony 468136be74 bug 9855: two fixes.
1. cannot cancel maintenance mode.
2. maintenance related modes are preserved through MS restart

status 9855: resolved fixed
2011-06-27 13:48:12 -07:00
alena 18dc85c765 Pass isForRebalance parameter to processConnect method of all the listeners - some listeners don't have to be notified when the connection happens as part of the Agent Rebalance process (VirtualMachineManagerImpl listener, for instance) 2011-06-27 10:37:00 -07:00
alena 826f5eedf1 bug 10447: don't notify VirtualMachineManager listener when doing host rebalance - vm sync is not needed in this case.
status 10447: resolved fixed
2011-06-27 10:37:00 -07:00
alena 41f12eb642 Pass isForRebalance parameter to processConnect method of all the listeners - some listeners don't have to be notified when the connection happens as part of the Agent Rebalance process (VirtualMachineManagerImpl listener, for instance) 2011-06-27 10:20:41 -07:00
alena 0bf34f3612 bug 10447: don't notify VirtualMachineManager listener when doing host rebalance - vm sync is not needed in this case.
status 10447: resolved fixed
2011-06-27 10:20:40 -07:00
Edison Su 3642aef4c6 bug 10423: agent in ssvm needs to add default keystore, as we are copying templates through https://**realhostip.**
status 10423: resolved fixed
2011-06-24 14:45:47 -04:00
Edison Su 28f0068151 add new option to force destroy vm when delete host, if the VMs are created on local storage 2011-06-23 20:36:13 -04:00
anthony 62249f3eae 1. return message to UI if adding primary storage failed
2. delete primary storage entry if adding primary storage failed
2011-06-22 18:44:33 -07:00
anthony 4bbed17d02 1. return message to UI if adding primary storage failed
2. delete primary storage entry if adding primary storage failed
2011-06-22 18:39:21 -07:00
Edison Su ad5162ef86 fix ebtable cleanup issue: on Ubuntu, it is not deleted if the vm is stopped 2011-06-16 19:26:24 -04:00
alena fc28a332c5 bug 10271: don't include removed records when searching for local storage pools
status 10217: resolved fixed
2011-06-16 16:00:45 -07:00
Edison Su 2e8d1bbd6c bug 10190: add log if failed to delete host when host is in UP state 2011-06-15 12:02:31 -04:00
Kelven Yang f7e4fc742f merge adding host fix from 2.2.4 2011-06-14 17:54:42 -07:00
Kelven Yang 24c87c306b merge adding host fix from 2.2.4 2011-06-14 17:16:19 -07:00
Frank 379cbc1d55 Store all parameters of url call to BaseCmd.fullUrlParams so there will be no
changes in future API because all parameters can be retrieved from the API command itself
2011-06-08 10:25:15 -07:00
alena 14cdc7de14 bug 9127: covered failure scenarios for agent LB.
status 9127: resolved fixed

The feature is completed; please file separate bugs if any issue arises during the testing.
Wiki link describing how agentLB works: http://intranet.lab.vmops.com/engineering/release-2.2-features/agent-load-balancing
2011-06-05 17:35:30 -07:00
Alex Huang 019cc78976 Fixes problems in routing between management servers 2011-06-05 16:06:54 -07:00
Alex Huang d9e0bcfa1e bug 10126: Renamed getPodId() to getPodIdToDeployIn() 2011-06-03 22:17:08 -07:00
Alex Huang 0ab12edd6c network tagging changes 2011-05-31 09:49:27 -07:00
Edison Su 59a6e2157a fix user data, and template sync 2011-05-27 19:44:26 -04:00
Frank 44a0fff2ec baremetal able to enter maintenance mode, delete host 2011-05-25 12:13:19 -07:00
Alex Huang d7f527f8ac get rid of compile error 2011-05-21 15:41:12 -07:00
Abhinandan Prateek 9de370be61 do not propagate password changes if Attache is ConnectedAgentattache 2011-05-19 19:18:33 +05:30
Alex Huang 9c69a983ba session info within sequence numbers 2011-05-17 18:01:11 -07:00
Alex Huang 0ba6091ca7 better way to log the agent request and response. Introduced the idea of via agents in the request header 2011-05-17 16:02:44 -07:00
anthony 1501503656 multiple secondary storages fix 2011-05-17 16:01:35 -07:00
Alex Huang ea9703997b proxy agent 2011-05-17 07:12:58 -07:00
anthony 9c4964bc1b fixed merge 2011-05-16 16:54:00 -07:00
anthony d835067f40 multiple secondary storage support
template download works
creating snapshot works
creating volume/template from snapshot works
2011-05-16 16:37:15 -07:00
anthony ba6b2ca670 initial checkin for multiple secondary storage 2011-05-16 16:37:15 -07:00
Alex Huang 5ce631e9d7 Separated resource management and agent management code. It's not all done but at least we make a first step 2011-05-16 10:55:18 -07:00
Alex Huang ab49c70add Added some annotations for logging commands 2011-05-16 10:55:18 -07:00
Abhinandan Prateek 733a892aa4 return true on success 2011-05-16 16:02:35 +05:30
Abhinandan Prateek 542e3d6f53 bug 6451: host username/password should be changeable through API
Propagate the updated password to other management server nodes in a cluster.
2011-05-16 15:57:35 +05:30
Abhinandan Prateek d161337710 bug 6451: host username/password should be changeable through API
Password is queued, on error next password is used.
2011-05-16 12:08:05 +05:30
Abhinandan Prateek f73ed92090 bug 8115: Source IP filtering in Virtual Router
We don't want to send an empty cidr param
2011-05-13 15:41:02 +05:30
Alex Huang 753b6a00a6 faster agent connection 2011-05-12 16:57:09 -07:00
Alex Huang 028f12f1b9 Removed the check for statuses in marking hosts as disconnected 2011-05-12 16:48:02 -07:00
kishan d456f89095 bug 8866: Direct Network Usage, TrafficSentinel support added, work in progress 2011-05-12 19:43:47 +05:30
Alex Huang efedf018c8 propagate b3aea1878395af343e18382b7f1c376b5be04567 2011-05-10 05:48:29 -07:00
Alex Huang 6805788f7d added ping back 2011-05-10 05:45:47 -07:00
Alex Huang 08bb454241 Eliminate the initialize call if we know someone else is connecting to a host 2011-05-10 05:45:30 -07:00
Alex Huang 80b6df17b9 make agent load size configurable 2011-05-10 05:45:06 -07:00
anthony 330f1815fa 1. in 2.1.x some hosts don't have cluster.guid in host_details
2. loading one host failure should not affect loading other hosts

Conflicts:

	server/src/com/cloud/agent/manager/ClusteredAgentManagerImpl.java
2011-05-06 12:35:51 -07:00
Alex Huang 43c3c775f2 bug 9741: somehow some of the removed code got back into the code path. Must have hit one undo too many. 2011-05-05 17:57:06 -07:00
Alex Huang 7bc25643a9 propagating fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang b2cf4e39ea propagate fix from 2.2.4 2011-05-05 17:57:06 -07:00
Alex Huang 117a81acf6 cleanup agent manager 2011-05-05 17:57:06 -07:00
Alex Huang 66f19c6318 white space checks 2011-05-05 16:15:33 -07:00
Alex Huang 0ee7e7678a Propagate fixes from 2.2.4 2011-05-05 16:13:46 -07:00
Frank 3b0fb655a9 New hypervisor type 2011-05-04 18:58:57 -07:00
Chiradeep Vittal 7a29f8dc48 more fixes prior to pushing the merge 2011-05-03 10:41:31 -07:00
Chiradeep Vittal 303e2a7481 Start of zonesfeature / mycloud/cloudkit 2011-05-03 10:40:17 -07:00
Abhinandan Prateek db916401a0 bug 6451: XenServer username/password should be changeable through API
status 6451: resolved fixed
2011-05-03 09:45:49 +05:30
Jessica Wang 747ab40f48 fix a bug - adding a cluster of type XenServer returned an error. 2011-05-02 16:54:53 -07:00
prachi 1d4a59ce73 Bug 9666 - hostId and spoolId overlap in op_host_capacity table
Changes:
- When a host connects, we check if it has a CPU and RAM entry in capacity table. If the entry is found, the values are updated if possible. If the entry is not found a new one is inserted.
- The searchCriteria used to check if the CPU entry is present was wrong. We were passing in a criteria which did not specify capacityType. So for hostId >= 200, the search would also return capacity entries of storage pools, since poolIDs start from 200 onwards.
- Since an entry was found (although the wrong one), we tried to update it. But the update does not happen since the capacity ranges don't match.
And a new insert for CPU also does not happen since an entry is found.
- So as a result CPU entries are never inserted in the table for hostIds >=200

- As a fix, corrected the search criteria.
- During VM deployment, when the entry is not found, we get a NPE. Added a null check to avoid that.
2011-05-02 15:17:30 -07:00
Kelven Yang b6364627a3 Decode the url content sent from UI first as URL has encoded url as a whole 2011-04-29 14:17:43 -07:00
anthony 7c105681b2 cluster guid is set up when old host reconnect happens after upgrade, prohibit adding host to the cluster before guid is set up 2011-04-29 14:13:25 -07:00
Alex Huang 9909371eaa bug 9445: Signal alert for the host if a primary storage pool was unavailable on that host 2011-04-26 15:09:36 -07:00
alena b749c64d4e bug 9550: get storagePool to Host mappings before doing processDisconnect because these references are being deleted as a part of processDisconnect call.
status 9550: resolved fixed

Conflicts:

	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/storage/StorageManagerImpl.java
2011-04-23 20:43:24 -07:00
prachi b84a7477f0 Bug 9539 - cpu.overprovisioning.factor does not work
Changes:
- Changed host allocators/planner  to use cpu.overprovisioning.factor
- Removed following: while adding a new host, we were setting the total_cpu in op_host_capacity to be actual_cpu * cpu.overprovisioning.factor. Now we set it to actual_cpu.
- ListCapacities response now calculates the total CPU as actual * cpu.overprovisioning.factor (This change does not add anything new - listCapacities was pulling total CPU from op_host_capacity DB earlier which had the cpu.overprovisioning.factor applied already. Now we need to apply it over the DB entry.)
- HostResponse has a new field: 'cpuWithOverprovisioning' that returns the cpu after applying the cpu.overprovisioning.factor

- Db Upgrade 222 to 224 now updates the total_cpu in op_host_capacity to be the actual_cpu for each Routing host.
2011-04-22 18:09:31 -07:00
alena 7255d68875 HA: no need to investigate why vm was stopped on host when host is being Disconnected with investigate=false option 2011-04-22 13:38:25 -07:00
alena 1a6d78eae4 Code cleanup. No need to declare runtime exceptions (CloudRuntimeException, InvalidParameterException, PermissionDenied exceptions) 2011-04-21 16:26:53 -07:00
alena f881d394e2 bug 9415: deleteHost - cleanup vms running on the host to be removed
status 9415: resolved fixed
2011-04-20 15:50:10 -07:00
Abhinandan Prateek efa0417881 bug 5242: adding a host does not return useful error
status 5242: closed fixed
2011-04-15 17:28:46 +05:30
nit 1d55d2fc05 bug 8703: delete op_host_capacity entries for host when host is successfully deleted.
status 8703: resolved fixed
2011-04-15 15:37:55 +05:30
nit 8567935561 bug 9221: Parse storage.overprovisioning.factor as a float rather than as an int.
Status 9221: resolved fixed
2011-04-15 13:46:51 +05:30
Frank 105db3b15a Merge branch 'baremetal' to master
modifies:
	api/src/com/cloud/api/ApiConstants.java
	api/src/com/cloud/api/commands/AddHostCmd.java
	api/src/com/cloud/api/commands/CreatePodCmd.java
	api/src/com/cloud/api/commands/DeployVMCmd.java
	api/src/com/cloud/dc/Pod.java
	api/src/com/cloud/network/NetworkService.java
	server/src/com/cloud/agent/manager/AgentManagerImpl.java
	server/src/com/cloud/configuration/ConfigurationManagerImpl.java
	server/src/com/cloud/dc/HostPodVO.java
	server/src/com/cloud/network/NetworkManager.java
	server/src/com/cloud/network/NetworkManagerImpl.java
	server/src/com/cloud/vm/UserVmManagerImpl.java
	setup/db/create-schema.sql
	utils/src/com/cloud/utils/SerialVersionUID.java
2011-04-11 14:21:41 -07:00
nit debe236a8d bug 8710: CONTD....Introducing a new user role in cloudstack called RESOURCE_DOMAIN_ADMIN. The role would have all the domain_admin rights and the rights to list zone,pods,clusters and so on. More info in the bug 2011-04-11 19:40:37 +05:30
Alex Huang b86828bbbe some formatting changes for agentmanager 2011-04-05 11:04:35 -07:00
Alex Huang 075fba5899 stackmaid is now taskmanager 2011-04-05 10:17:22 -07:00
Kelven Yang 1692caabe8 Make sure hooking of answer command callback be called for both sync/Async commands 2011-04-04 14:46:19 -07:00
Kelven Yang 918df1a838 missing one more place to notify answer commands 2011-04-04 14:44:24 -07:00
Kelven Yang 2759eefcf5 Add weight to command to help future load management, fix issues found in system vm pooling 2011-04-04 09:33:42 -07:00
Kelven Yang ea95ac1808 bug 8713 : finalize the implementation of supporting system vm pooling 2011-03-31 17:14:09 -07:00
Prasanna Santhanam 41e7642825 don't do a cidr check for the simulator 2011-03-30 19:41:13 +05:30