Commit Graph

286 Commits

Author SHA1 Message Date
anthony 5f9884d97a Bug 10197:
1. don't try HA vms if host hypervisor version changes
2. fixed a bug related to VM full sync with hosttrack enabled
2011-08-02 16:48:27 -07:00
Alex Huang f043f63eaa Merged changes from 2.2.8.zucchini 2011-08-02 15:33:48 -07:00
anthony 7d02ed344e Bug 10197: do not check timeout against cluster which is not managed 2011-08-01 17:00:58 -07:00
frank b0b3f16dae Two things:
Load non-routing resource in ClusteredAgentManager includes External DHCP, PxeServer, ExternalFirewall, ExternalLoadBalancer

Bug 9887 - baremetal: support for image operation (create template from guest disk)

changes in line with UI
2011-07-29 11:28:09 -07:00
Alex Huang 6fea146903 more index. moved op_lock to memory table to try it 2011-07-27 14:06:40 -07:00
Sheng Yang 6c493bfb82 Add exception message for AgentManagerImpl.investigate() 2011-07-27 10:53:06 -07:00
Sheng Yang 3a8e13f968 Add exception message for AgentManagerImpl.investigate() 2011-07-27 10:52:48 -07:00
Alex Huang 9c627a15f3 Inaccurate clock now gets an mbean to control it 2011-07-25 16:01:31 -07:00
Alex Huang c610925304 moved agent ping to in memory rather than db based 2011-07-25 15:21:06 -07:00
Alex Huang 1b56808be5 brought over agent ping uses the same db connection 2011-07-25 10:57:00 -07:00
Alex Huang 10ac7753ed Switched ping to use the same db connection so that running out of db connections won't affect basic operations 2011-07-25 10:36:00 -07:00
Alex Huang 3f18192df8 Make all connections READ COMMITTED isolation level instead of setting it every time we get the db connection, which caused useless round trips 2011-07-23 14:58:32 -07:00
Alex Huang 12cd5db620 deleted a file by mistake 2011-07-22 11:39:16 -07:00
Alex Huang b59c6b4ab6 propagate lock table fix 2011-07-22 11:35:47 -07:00
Alex Huang 44ce9488a6 propagate lock table fixes 2011-07-22 11:30:23 -07:00
Kelven Yang 3a6f3b71e0 bug 10791: add data integrity check upon management server startup 2011-07-21 17:08:29 -07:00
alena c21273d23a bug 10734: removed global lock in "DirectAgentScanTimerTask". This lock was used to prevent the task from executing on multiple management servers simultaneously.
status 10734: resolved fixed
2011-07-21 16:18:43 -07:00
alena ee98887176 2 fixes for Agent Load Balancer:
* when a management server dies and notifies other management servers about this, the running management server has to clean up host_transfer records belonging to the dead management server
* issue agent load balancing task only when agent load (number of connected agents in the system) exceeds "agent.load.threshold" - 70% by default

Conflicts:

	server/src/com/cloud/configuration/Config.java
	server/src/com/cloud/host/dao/HostDaoImpl.java
	setup/db/db/schema-228to229.sql
2011-07-21 15:28:11 -07:00
alena 307741edcd 2 fixes for Agent Load Balancer:
* when a management server dies and notifies other management servers about this, the running management server has to clean up host_transfer records belonging to the dead management server
* issue agent load balancing task only when agent load (number of connected agents in the system) exceeds "agent.load.threshold" - 70% by default

Conflicts:

	server/src/com/cloud/configuration/Config.java
	setup/db/db/schema-228to229.sql
2011-07-21 15:27:50 -07:00
anthony 3881e13387 bug 10197:
Steps to upgrade XenServer:

1. put the cluster in Unmanaged state through the UI; the MS will then stop talking to hosts in the cluster
2. upgrade XenServer according to the XenServer upgrade guide
3. put the cluster in Managed state through the UI; the MS will then reconnect to the hosts

TODO:

1. UI
2. vm pool sync, leveraged from kelven's work
2011-07-19 15:26:25 -07:00
alena c48c3edfbc bug 10271: don't include removed records when search for local storage pool
status 10217: resolved fixed
2011-07-19 11:10:53 -07:00
Alex Huang d54f6d536a propagating transaction isolation fix for merovingian2 2011-07-18 16:48:49 -07:00
alena 7a04334b60 bug 10734: removed global lock in "DirectAgentScanTimerTask". This lock was used to prevent the task from executing on multiple management servers simultaneously.
status 10734: resolved fixed
2011-07-18 15:00:13 -07:00
Alex Huang 392c03ce6f propagated db connection savings from master 2011-07-18 14:22:49 -07:00
Alex Huang e52a97b969 Switched ping to use the same db connection so that running out of db connections won't affect basic operations 2011-07-18 14:22:49 -07:00
anthony 18003deedf bug 10628: root cause is that CheckHealthCommand returns false, so XenServerInvestigator is not called
status 10628: resolved fixed
2011-07-14 20:42:26 -07:00
alena 98b728bf70 Fixed typo 2011-07-11 10:31:25 -07:00
alena edb68eb6b7 More logging to agent lb 2011-07-11 10:29:57 -07:00
alena e16e0597c9 bug 10497: always execute next command in sequence, even when the current one fails to complete; otherwise all the remaining commands will be stuck in the queue forever
status 10497: resolved fixed
2011-06-29 15:17:29 -07:00
alena 5134b44ff5 bug 10496: fixed various problems in scanDirectAgentToLoad code: (1) the case when loadSize=clusterSize wasn't working; (2) Long value comparison was done incorrectly
status 10496: resolved fixed
2011-06-29 11:27:01 -07:00
alena 1581828dda Added more logging to DirectAgentsLoad task 2011-06-29 09:29:46 -07:00
anthony 468136be74 bug 9855: two fixes.
1. cannot cancel maintenance mode.
2. maintenance related modes are preserved through MS restart

status 9855: resolved fixed
2011-06-27 13:48:12 -07:00
alena 41f12eb642 Pass isForRebalance parameter to processConnect method of all the listeners - some listeners don't have to be notified when a connection happens as part of the Agent Rebalance process (VirtualMachineManagerImpl listener, for instance) 2011-06-27 10:20:41 -07:00
alena 0bf34f3612 bug 10447: don't notify VirtualMachineManager listener when doing host rebalance - vm sync is not needed in this case.
status 10447: resolved fixed
2011-06-27 10:20:40 -07:00
alena fce33bcd76 bug 10445: Submit rebalancing task in a separate thread for each host
status 10445: resolved fixed
2011-06-27 10:20:40 -07:00
Edison Su 3642aef4c6 bug 10423: agent in ssvm needs to add default keystore, as we are copying templates through https://**realhostip.**
status 10423: resolved fixed
2011-06-24 14:45:47 -04:00
Edison Su 28f0068151 add new option to force destroy vm when deleting a host, if the VMs are created on local storage 2011-06-23 20:36:13 -04:00
anthony 62249f3eae 1. return message to UI if adding primary storage failed
2. delete primary storage entry if adding primary storage failed
2011-06-22 18:44:33 -07:00
alena 979e3a2d56 bug 10406: fixed concurrency problem happening during Agent Transfer Monitor task
status 10406: resolved fixed
2011-06-22 14:12:22 -07:00
Abhinandan Prateek 976aab77bb bug 10305: for a systemvm only applicable system vm offering should be displayed
status 10305: resolved fixed
While creating a system vm offering, specify the type. If no type is specified, it defaults to domainrouter.
While requesting a set of system offerings, specify the parameter systemvmtype.
2011-06-20 20:01:40 +05:30
Abhinandan Prateek cf15573d06 bug 10313: marking default system offering as default so that they should not be deleted
status 10313: resolved fixed
2011-06-19 12:30:07 +05:30
Edison Su ad5162ef86 fix ebtable cleanup issue: on ubuntu, it's not deleted if vm is stopped 2011-06-16 19:26:24 -04:00
Edison Su 2e8d1bbd6c bug 10190: add log if failed to delete host when host is in UP state 2011-06-15 12:02:31 -04:00
Kelven Yang 24c87c306b merge adding host fix from 2.2.4 2011-06-14 17:16:19 -07:00
alena 78852376e2 Reverted commit made by mistake
This reverts commit bd40212c3b5e885f89d7b43a6e8ccc14fb6d19fb.
2011-06-13 22:53:05 -07:00
alena 769ec409d7 Added findRequest() method to ClusteredAgentAttache - needed for cancelling requests in transfer queue 2011-06-13 22:43:34 -07:00
alena b2b4a63fd3 Some fixes to agent lb code:
* Schedule agent LB as a TimerTask as it can take time for the management server to start; and it can accept rebalance requests only when it's up and running
* Removed Starting state from mshost as it's not being used anywhere
* Fixed the bug where requests weren't routed properly from the old host owner to the new one.
2011-06-13 22:31:49 -07:00
alena 14cdc7de14 bug 9127: covered failure scenarios for agent LB.
status 9127: resolved fixed

The feature is completed; please file separate bugs if any issue arises during the testing.
Wiki link describing how agentLB works: http://intranet.lab.vmops.com/engineering/release-2.2-features/agent-load-balancing
2011-06-05 17:35:30 -07:00
Alex Huang 019cc78976 Fixes problems in routing between management servers 2011-06-05 16:06:54 -07:00
Alex Huang a0ce3da191 fixed problem with header 2011-06-05 13:51:29 -07:00