Commit Graph

43 Commits

Author SHA1 Message Date
Alex Huang b59c6b4ab6 propagate lock table fix 2011-07-22 11:35:47 -07:00
Kelven Yang 3a6f3b71e0 bug 10791: add data integrity check upon management server startup 2011-07-21 17:08:29 -07:00
alena 307741edcd 2 fixes for Agent Load Balancer:
* when a management server dies and notifies the other management servers about this, the running management server has to clean up host_transfer records belonging to the dead management server
* issue the agent load balancing task only when the agent load (number of connected agents in the system) exceeds "agent.load.threshold" (70% by default)

Conflicts:

	server/src/com/cloud/configuration/Config.java
	setup/db/db/schema-228to229.sql
2011-07-21 15:27:50 -07:00
Alex Huang 392c03ce6f propagated db connection savings from master 2011-07-18 14:22:49 -07:00
Kelven Yang 11527ad914 bug 10699: log on listener registration 2011-07-12 16:58:55 -07:00
Kelven Yang 40145b0b8b add logging to clustering notification 2011-07-12 16:14:58 -07:00
Kelven Yang 72cbd65db2 bug 10668: add DB upgrade scripts for 2.2.7 to 2.2.8 upgrade 2011-07-08 18:12:13 -07:00
Alex Huang cfc25d01be bug 10501: This is really Kelven's bug but I'll fix it for him anyways. DAO code already has a way to extract the DB connection from a transaction that is stored in the TLS. There's no real reason for the DAO code to add special semantics to use a different DB connection. That can be done by simply switching the transaction before it even reaches the DAO code. Think about it. Why would anyone want to call one DAO function, switch transaction, and then switch back? The right thing is for the caller to switch transaction, call a series of DAO functions, and switch it back. That's the semantics I changed to. By doing this, it also reduces the number of debug messages in this bug. 2011-07-01 11:03:12 -07:00
Alex Huang 63f552995f bug 10322: integrated into cluster management 2011-06-29 12:01:43 -07:00
Alex Huang cbee5a6044 bug 10322: arrival of the next generation Merovingians 2011-06-29 12:01:43 -07:00
Kelven Yang 0be54ec4b6 bug 10029: update mshost state instead of removing it 2011-06-24 11:16:40 -07:00
Kelven Yang f91122654b bug 10176: add cluster listener registration back 2011-06-23 14:52:15 -07:00
alena b2b4a63fd3 Some fixes to agent lb code:
* Schedule agent LB as a TimerTask as it can take time for the management server to start; and it can accept rebalance requests only when it's up and running
* Removed Starting state from mshost as it's not being used anywhere
* Fixed the bug where requests weren't routed properly from the old host owner to the new one.
2011-06-13 22:31:49 -07:00
alena 158807c877 Fixed unittest 2011-06-06 16:52:11 -07:00
alena 14cdc7de14 bug 9127: covered failure scenarios for agent LB.
status 9127: resolved fixed

The feature is completed; please file separate bugs if any issue arises during the testing.
Wiki link describing how agentLB works: http://intranet.lab.vmops.com/engineering/release-2.2-features/agent-load-balancing
2011-06-05 17:35:30 -07:00
alena 635a1075de bug 9127: intermediate checkin for agent load balancer 2011-06-02 21:47:54 -07:00
alena cd6d933e48 Better logging for cluster agent rebalancing 2011-05-25 18:59:26 -07:00
alena 388aea7be3 bug 9127: intermediate checkin for agent load balancer 2011-05-24 19:04:50 -07:00
alena 55692fd7cf bug 9127: intermediate checkin for agent load balancer 2011-05-24 18:45:00 -07:00
Alex Huang 2b650acc17 more gson crap 2011-05-18 11:01:46 -07:00
Chiradeep Vittal 303e2a7481 Start of zonesfeature / mycloud/cloudkit 2011-05-03 10:40:17 -07:00
Frank 92155522f2 Add license header to files 2011-04-14 11:23:14 -07:00
Kelven Yang 1b9cbd9166 bug 9223, 9224: persist runid to form cluster session, based on cluster session and DB condition to issue isolation notification for self-fencing 2011-04-13 15:13:54 -07:00
Kelven Yang 2a45327a89 Missed one place that should use the queue instead; this decouples notification from heartbeat to avoid being potentially blocked in callbacks 2011-04-12 15:06:08 -07:00
Kelven Yang 03afe54084 Regression fixes for adding openNew() to Transaction.java 2011-04-11 17:44:31 -07:00
Kelven Yang 149eb0fade Use separate DB connection to handle cluster management 2011-04-11 16:37:58 -07:00
Alex Huang 2230c24ab3 check in changes from 2.2.4 2011-04-05 11:01:35 -07:00
Alex Huang 075fba5899 stackmaid is now taskmanager 2011-04-05 10:17:22 -07:00
Kelven Yang fa0dd8a228 Allow cluster management conflict detection to handle quick management server restarts 2011-03-28 15:14:51 -07:00
Kelven Yang 359dc18992 Fix possible duplicated JMX registration issue 2011-03-25 15:16:41 -07:00
Alex Huang 82f7504832 checking in the upgrade stuff for now 2011-03-19 09:27:10 -07:00
Kelven Yang 65d4cc98be Allow template re-deployment when a template has been deleted from the hypervisor and CloudStack still holds outdated status 2011-03-17 17:59:51 -07:00
Kelven Yang 14c0eb74a0 Use the correct one to find ManagementServerHostVO 2011-03-10 15:32:58 -08:00
Kelven Yang 6b8f073f99 Fix issues for management server ping support in cluster management 2011-03-10 10:32:17 -08:00
Kelven Yang 6203ba6bfe bug 8558: improve cluster management 2011-03-09 17:18:48 -08:00
Kelven Yang 33b3333d65 Add JMX support to async job management; active job details can be checked in JConsole 2011-03-08 16:34:17 -08:00
Kelven Yang d0892a964d Add JMX monitoring to cluster management 2011-03-07 18:40:33 -08:00
Kelven Yang cbb5aa0e89 Allow querying management server node alive status 2011-03-04 17:39:26 -08:00
Kelven Yang fd27af5d8f bug 8249 : save combined persistent and run-time id into worker to help worker VM GC 2011-02-25 11:23:32 -08:00
Kelven Yang 27f2b6cbb1 Let cluster host-heartbeat run in a separate context to avoid update delays caused by other activities 2011-02-01 17:38:12 -08:00
Kelven Yang 6ed8b20a76 add catch-all clause in cluster heartbeat 2011-02-01 12:40:32 -08:00
Alex Huang ade0097a0a wrap cluster heartbeat in a try-catch 2011-02-01 12:04:08 -08:00
Frank e9652e67ad migrate premium to oss 2011-01-28 16:07:46 -08:00