Commit Graph

62 Commits

Author SHA1 Message Date
Sheng Yang 407b45ee2e Fix ping in cloud-early-config
The old ping don't have -w option
2011-08-12 13:47:13 -07:00
Sheng Yang 2aa349f769 bug 11083: Fix broadcast address is 0.0.0.0
The reason is:
1. In redundant router, we won't enable eth2(public network interface) until
keepalived determine the router is MASTER.
2. ipassoc.sh normally kick in before keepalived process running. And it would
set eth2's IP address using "ip addr add $dev $ip"
3. "ip addr add $dev $ip" won't add mask for the device, then there is no way to
update broadcast address for eth2. Then broadcast address is 0.0.0.0.
4. As long as "ip addr add $dev $ip" executed, later executed "ifconfig $dev $ip
netmask $mask" won't calculated the broadcast address from $ip and $mask.

To fix this, we enable and configure eth2 temporaily when cloud-early-config
executed, then disable eth2 interface. By this way, broadcast address of should
be calculated and set correctly.

status 11083: resolved fixed
2011-08-12 00:11:52 -07:00
Sheng Yang 29cc88571f Redundant router script fix, also fix CheckRouterTask 2011-08-11 17:57:12 -07:00
Sheng Yang 4e4a44f2c4 Fix sometime system VM is up but mgmt server can't reach it 2011-08-11 11:01:14 -07:00
Abhinandan Prateek 16803387c6 bug 10812: changes to dhcp option 15 and 119 2011-08-11 10:23:20 +05:30
Sheng Yang 9f6dc993ab Fix dhcp_entry.sh fail on redundant router
And don't worry about "chkconfig dnsmasq off", because keepalived script should
take care of it.
2011-08-10 17:13:45 -07:00
Sheng Yang 34682b3bb0 Fix ping commandline parameter 2011-08-10 12:07:43 -07:00
Sheng Yang ab4c4de989 Fix redundant router start up
commit e4fe14a9ce19fbbdb15bbfaad586d80031ca9fbc break redundant router, because
at time of ping, the network is not up for redundant router.

Add timout for ping
2011-08-10 12:07:37 -07:00
Sheng Yang 5cf6feb2e5 Fix "RTNETLINK answers: No such process" when starting redundant router
The issue happened quite rare, but indeed can show.

And when the issue happen, the status of redundant router would be "Status:
FAULT".

It's due to ipassoc.sh wasn't executed before the system bring eth2 up and go to
master mode, then eth2 wasn't configured correctly. Then "ip route add default
xx" can't complete.

This commit should fixes the issue.
2011-08-10 12:06:53 -07:00
Kelven Yang adfb6a1f03 bug 11017: kick VMware vSwitch by pinging the default gateway to work around mysterous VMware network problem 2011-08-09 15:47:22 -07:00
Sheng Yang 55eb0e4b65 bug 11003: Fix HAproxy with new template
New debian package didn't create directory /var/lib/haproxy, which result in
haproxy fail to work.

status 11003: resolved fixed
2011-08-08 13:36:53 -07:00
Abhinandan Prateek dae2f4d958 bug 10748: fixed it when the VM acts as a dhcp server only
status 10748: resolved fixed
2011-08-05 12:07:59 +05:30
Abhinandan Prateek f6303c13fb bug 10812: putting key in ZoneConfig and other nomenclature changes 2011-08-03 14:42:29 +05:30
Abhinandan Prateek 2f804dd799 bug 10748: use external dns, configurable thru global param 2011-08-02 16:21:55 +05:30
Abhinandan Prateek 5cb3882349 bug 10812: adding domain suffixes to zone's router conf 2011-08-02 11:23:33 +05:30
Kelven Yang 1e9efb2f86 bug 10214: merge fixes from 2.2.8/2.2.4 2011-06-10 14:29:01 -07:00
Sheng Yang e92d00d3bf bug 9495: Fix DomR logging error on boot
status: resolved fixed
2011-06-09 15:04:35 -07:00
Sheng Yang 8542c9ca13 Fix keepalived start up issue
Sometime when keepalived start up(during system boot up period), it would fail
to(likely due to unable to receive the packet), and think itself is the only
router, then make itself master.

Add 10 seconds delay after start up to work around the issue.
2011-06-07 14:47:46 -07:00
Sheng Yang 58ade71a5b Enable only vrrp mode of keepalived 2011-06-07 14:47:46 -07:00
Sheng Yang 819e67b189 Add file lock for keepalived scripts
They are not blocked callings.
2011-06-07 14:47:46 -07:00
Sheng Yang 62ac899091 bug 9154: Initial check in for enabling redundant virtual router
This patch enable redundant virtual routers.

1. To enable this feature, db need to be updated using follow SQL by now(we
would get a UI way later):

UPDATE network_offerings SET redundant_router=1 WHERE guest_type="Virtual" AND
system_only=0;

2. System would try to start up two routers at different hosts. But if there is
only one host in the zone, system would start up two routers on it.

3. The failover part is using keepalived, and connection tracking part is using
conntrackd. There would be one master router and one backup router. The status
of router(master or backup) can be query from the database table domain_router
now. Management server would update the status every 30s by default.

4. The routers for the same zone would use same external NIC(same ip and mac).
The script used for fail-over would ensure only one external NIC present in the
network at any time.

5. Currently management server don't got the ability to stop one of router is
both of them reported as master. The feature is in the todo list.

After two routers start up, disconnect anyone of them, the guest network
shouldn't be affected, and established connection(http, ssh, etc.) should still
works. The fail-over on gateway part should be 3~4 seconds.

Currently the patch works with KVM. Would deal with vmware and XenServer soon.
2011-06-07 14:47:45 -07:00
anthony 37e6ded80c bug 10023: only set ip route for internal DNS
status 10023: resolved fixed
2011-05-25 17:39:20 -07:00
Edison Su a13be45f7b bug 8350:
haproxy tunning:
0. Test case:
   httpd running in 5 user VMs, all of them created on a xenserver host(16 core, 42G memroy, 10G network)
   domR running on an anther host with same hardware configuration.
   test application, ab, running on anther host behind an anther seperate switch
1.haproxy is not a memory intensive app. I can get 4625.96 connection/s with 1G memory. While it's really a CPU intensive app, domR always uses around 100% CPU on the host.
2.By default, you can't get better connection/s rate, because ip_conntrack_max and tw_bucket are too small, you will see the error in domR like:
   "TCP: time wait bucket table overflow" or "nf_conntrack: table full, dropping packet".
  So I increase these numbers to 1000000 from 65536, then I can steadly get around 4600 connection/s when memory is >= 1G.

  Here is the connection per second, tested by "ab -n 1000000 -c 100 http://192.168.170.152:880/test.html"
  domR memory   conn/s
  128M:         3545.55
  256M:         4081.38
  512M:         4318.18
  1G:           4625.96
  7G:           4745.53

3. If I enable notrack for both connections between domr/user vm, and public network, that tell iptable in domR don't track the connection during my test, then I can get better number, around
5800 connections/s. But we can't enable notrack, as iptables is used to track throughput in domR.

4. In a word, with this commit, the connection rate of haproxy can be increased from 1000-2000/s to 4700/s when domR's memory is larger than 1G.

5. How many CPU need to assign to domR to get this number?  Haven't finished yet, as CPU is shared by all the VMs on the host, if other VMs are busy, it will impact the performance of haproxy.
2011-05-21 21:05:28 -04:00
Frank 3b75abbde7 Fix DomR reply to windows's periodical DNS requirement 2011-04-21 11:23:23 -07:00
Frank 92155522f2 Add license header to files 2011-04-14 11:23:14 -07:00
Chiradeep Vittal 1e6024b83f bug 8959: 1. rp_filter prevents connections that span eth2 and eth3 because of default route
2. delete before add of default nat rules
status 8959: resolved fixed
2011-04-04 18:59:40 -07:00
Edison Su 8b3e22e84c enable serial console for system vm, to make debug easier 2011-03-31 19:11:09 -04:00
Murali Reddy 1b251a5fc8 Bug 4061: support for Microsoft Hyper-V changes- part 1 2011-03-30 18:48:51 +05:30
alena ec96a9afea Reverted Anthony's checkin for systemVmPatch which he added by mistake. 2011-03-18 18:12:53 -07:00
anthony 2b188b37dd fixed start management server 2011-03-18 16:43:51 -07:00
Chiradeep Vittal 20045b3664 bug 8892: accept nameserver configurations on non-default networks. this is because if dnsmasq sends no nameservers, dhclient deletes all entries from resolv.conf. the MS will send the eth0 ip of the dhcp server / virtual router on the default network as the nameserver on the non-default networks 2011-03-07 17:04:45 -08:00
Edison Su 88f3d04eb2 fix dns issue that in direct network mode, user vm can not get domr ip as dns server, then user vm can resolve its hostname, the consequence is that "starting sendmail" is tooo slow 2011-02-15 16:29:57 -05:00
Chiradeep Vittal 42f6e21d77 Detect hypervisor more reliably 2011-01-26 15:13:43 -08:00
Kelven Yang 8c77158bb9 fix domR setup issue when interface number is less than 3 2011-01-21 17:42:12 -08:00
Chiradeep Vittal ffdbab065b bug 7141: rotate standard logs more aggressively 2011-01-13 12:36:38 -08:00
Chiradeep Vittal 1327d6f914 bug 7141: make ppp(vpn) less verbose 2011-01-13 12:36:38 -08:00
Chiradeep Vittal a724d587bf bug 7141: make dnsmasq less verbose 2011-01-13 12:36:38 -08:00
Chiradeep Vittal 8032b2d62a bug 7141: limit apache logs to 5 days 2011-01-13 12:36:38 -08:00
Chiradeep Vittal 17451e1dae bug 7141: cleanup cloudstack logs older than 7 days 2011-01-13 12:36:38 -08:00
Chiradeep Vittal 1934e79900 bug 7141: rotate logs 2011-01-12 17:31:39 -08:00
Chiradeep Vittal 6ecbbe0b37 Add logrotate for dnsmasq and haproxy 2011-01-12 17:31:39 -08:00
Chiradeep Vittal ca2d32ad44 initial debian config for logging 2011-01-12 17:31:38 -08:00
nit 6850a0ae9a bug 7519 : Opening eth2 for public traffic on ssvm. Implementing the ip firewall for zone to zone copy in the Apache layer. Disabling directory listing for extract server root. 2011-01-11 20:05:35 +05:30
Edison Su d33d61b1b4 fix default gw one more time... 2011-01-07 08:40:22 -05:00
Edison Su a0ce28b769 one more fix for router gw, and rhel build 2011-01-07 08:27:24 -05:00
Edison Su 4a409200dc fix build for rhel6 and fix router default gw 2011-01-07 08:02:15 -05:00
Chiradeep Vittal dd3f184d0e bug 7790: do not set default route in some cases 2010-12-30 18:27:18 -08:00
Chiradeep Vittal c9c08a3ff8 bug 7790: do not set default route in some cases 2010-12-30 18:25:39 -08:00
edison b49503064a bug 7518: disable nfs-common for router/console
status 7518: resovled fixed
2010-12-15 11:35:52 -08:00
edison 97efc6671f bug 7294: disable rp_filter for cpvm&ssvm
status 7294: resolved fixed
2010-11-22 23:27:25 -08:00