Commit Graph

25665 Commits

Author SHA1 Message Date
nvazquez c0c32b4d9b Release connections fix 2018-03-01 16:36:18 -03:00
nvazquez 5fb5890ea3 Fix backport issue and retrying logic 2018-02-28 11:37:09 -03:00
nvazquez 12c1db7556 Fix null input stream issue 2018-02-28 08:39:45 -03:00
nvazquez c8628a4593 Fix retrying logic 2018-02-27 19:39:34 -03:00
nvazquez 880850b7e7 Retrieve clean metalink checksums 2018-02-27 19:35:16 -03:00
nvazquez e8e4cd70af Fix metalink urls issue 2018-02-27 19:34:07 -03:00
Daan Hoogland cd3a72c1f9 command name returned 2018-02-19 11:54:21 +01:00
Daan Hoogland ef72bd7c67 import 2018-02-15 09:38:36 +01:00
nvazquez cdf9ac4679 Fix metalink download, checksum retry logic and metalink SSVM downloader 2018-02-14 19:40:16 +01:00
nvazquez 1c71a9b867 Refactor and fix upload certificate error message even though operation is successful 2018-02-14 19:16:17 +01:00
nvazquez d39664a382 CLOUDSTACK-10238: Fix for metalink support on SSVM agents 2018-02-14 19:01:35 +01:00
nvazquez 8e4cec1d87 Metalink downloader rework, retrying logic and refactoring 2018-02-06 15:04:28 -03:00
nvazquez ced27374a8 HTTPS downloader fix 2018-01-23 16:02:00 -03:00
nvazquez 12fa8db128 Reinstall VM for bypassed templates 2018-01-23 11:47:19 -03:00
nvazquez e9f2d982b6 Fix for metalink support on SSVM agents 2018-01-18 01:40:45 -03:00
nvazquez 6c60a0fb59 Direct Download label issue 2018-01-15 10:43:47 -03:00
nvazquez e54071e674 Dettach ISO and direct download label fixes 2018-01-15 09:12:05 -03:00
Daan Hoogland e3ec10b826 fr16 checksum in java instead of script to be more platform independent, hopefully 2018-01-12 17:31:15 +01:00
Daan Hoogland 800af08db9 labels 2018-01-12 11:52:23 +01:00
Daan Hoogland 2381669c46 CLOUDSTACK-10146 direct download for admin only 2018-01-11 21:50:41 +01:00
Daan Hoogland ec05530ba1 unused import removed 2018-01-09 15:21:50 +01:00
Daan Hoogland 23b1c4ddf6 backport of master fixes 2018-01-09 15:10:09 +01:00
nvazquez 0a68b73ad0 Remove aria2 dependency 2018-01-05 14:48:27 -03:00
nvazquez a5c90400d3 CLOUDSTACK-10146: Bypass Secondary Storage 2018-01-05 03:36:51 -03:00
Rohit Yadav 260b523a4b APPLE-512: Use host table to iterate for zone/cluster metrics (CW-1592)
Per Ilya's reply, host_view may contain duplicate entries when hosts
have tags. Changing the host_view may cause unseen regressions so
to fix the issues we've modified the zone/cluster metrics code to use
the `host` table (hostdao) to iterate through the list of hosts in a
cluster during zone/cluster metrics listing.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-12-15 14:36:09 +05:30
Rohit Yadav 02dee6a029
CW1542: Fix thread leak in RemoteEndHostEndPoint (#54)
This fixes the following:
- Fixes thread leaks in RemoteEndHostEndPoint
- Fixes a potential NPE while finding EP for a storage/scope

Unbounded thread growth can be reproduced with following findings:
- Every unreachable template would produce 6 new threads (in a single
ScheduledExecutorService instance) spaced by 10 seconds
- Every reachable template url without the template would produce 1 new
thread (and one ScheduledExecutorService instance), it errors out quickly without
causing more thread growth.
- Every valid url will produce upto 10 threads as the same ep (endpoint
instance) will be reused to query upload/download (async callback)
progresses.

Every RemoteHostEndPoint instances creates its own
ScheduledExecutorService instance which is why in the jstack dump, we
see several threads that share the prefix RemoteHostEndPoint-{1..10}
(given poolsize is defined as 10, it uses suffixes 1-10).

This fixes the discovered thread leakage with following notes:
- Instead of ScheduledExecutorService instance, a cached pool could be
used instead and was implemented, and with `static` scope to be reused
among other future RemoteHostEndPoint instances.
- It was not clear why we would want to wait when we've Answers returned
from the remote EP, and therefore a scheduled/delayed Runnable was
not required at all for processing answers. ScheduledExecutorService
was therefore not really required, moved to ExecutorService instead.
- Another benefit of using a cached pool is that it will shutdown
threads if they are not used in 60 seconds, and they get re-used for
future runnable submissions.
- Caveat: the executor service is still unbounded, however, the use-case
that this method is used for short jobs to check upload/download
progresses fits the case here.
- Refactored CmdRunner to not use/reference objects from parent class.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-11-09 12:32:38 +05:30
Rohit Yadav 7df52405b0 FR3: Host-HA backported changes from master (#50)
- Improves job scheduling using state/event-driven logic
- Reduced database and cpu load, by reducing all background threads to one
- Improves Simulator and KVM host-ha integration tests
- Triggers VM HA on successful host (ipmi reboot) recovery
- Improves internal datastructures and checks around HA counter
- New FSM events to retry fencing and recovery
- Fixes KVM activity script to aggresively check against last update time

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-11-07 15:51:56 +05:30
Rohit Yadav 1f52cd4245 FR12: Have basic constraint in CA certificate (#52)
* FR12: Have basic constraint in CA certificate

- Refactors certificate generation to use V3
- Removes use of V1 based certificate generator
- Puts basic constraint and keyusage extentions in certificate generator
  when caCert is not provided, i.e. for building CA certificate
- For normal certificate generation, skips putting basic constraint
  instead puts authority key identifier (the ca cert)
- Fixes tests to use the V3 certificate generator

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>

* FR12: backup and restore cpvm/ssvm keystore during reboot

This is backported from:
https://github.com/apache/cloudstack/pull/2278

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-10-10 10:38:13 +05:30
Rohit Yadav 5cd57a5011 FR12: Stop SSL handshake when clients don't present valid certificates (#51)
Per communication with Marcus, and his test results, this enforces the
clients to provide certificates and on failure this stops SSL
negotiations when auth strictness is set to true.

This uses the `setNeedClientAuth`, where if the option is set and the
client chooses not to provide authentication information about itself,
the negotiations will stop and the engine will begin its closure
procedure:
https://docs.oracle.com/javase/7/docs/api/javax/net/ssl/SSLEngine.html#setNeedClientAuth(boolean)

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-10-03 16:45:05 +05:30
Rohit Yadav 366d82e292 FR12 (CLOUDSTACK-9993): Secure Agent Communications (#38)
This introduces a new certificate authority framework that allows
pluggable CA provider implementations to handle certificate operations
around issuance, revocation and propagation. The framework injects
itself to `NioServer` to handle agent connections securely. The
framework adds assumptions in `NioClient` that a keystore if available
with known name `cloud.jks` will be used for SSL negotiations and
handshake.

This includes a default 'root' CA provider plugin which creates its own
self-signed root certificate authority on first run and uses it for
issuance and provisioning of certificate to CloudStack agents such as
the KVM, CPVM and SSVM agents and also for the management server for
peer clustering.

Additional changes and notes:
- Comma separate list of management server IPs can be set to the 'host'
  global setting. Newly provisioned agents (KVM/CPVM/SSVM etc) will get
  radomized comma separated list to which they will attempt connection
  or reconnection in provided order. This removes need of a TCP LB on
  port 8250 (default) of the management server(s).
- All fresh deployment will enforce two-way SSL authentication where
  connecting agents will be required to present certificates issued
  by the 'root' CA plugin.
- Existing environment on upgrade will continue to use one-way SSL
  authentication and connecting agents will not be required to present
  certificates.
- A script `keystore-setup` is responsible for initial keystore setup
  and CSR generation on the agent/hosts.
- A script `keystore-cert-import` is responsible for import provided
  certificate payload to the java keystore file.
- Agent security (keystore, certificates etc) are setup initially using
  SSH, and later provisioning is handled via an existing agent connection
  using command-answers. The supported clients and agents are limited to
  CPVM, SSVM, and KVM agents, and clustered management server (peering).
- Certificate revocation does not revoke an existing agent-mgmt server
  connection, however rejects a revoked certificate used during SSL
  handshake.
- Older `cloudstackmanagement.keystore` is deprecated and will no longer
  be used by mgmt server(s) for SSL negotiations and handshake. New
  keystores will be named `cloud.jks`, any additional SSL certificates
  should not be imported in it for use with tomcat etc. The `cloud.jks`
  keystore is stricly used for agent-server communications.
- Management server keystore are validated and renewed on start up only,
  the validity of them are same as the CA certificates.

New APIs:
- listCaProviders: lists all available CA provider plugins
- listCaCertificate: lists the CA certificate(s)
- issueCertificate: issues X509 client certificate with/without a CSR
- provisionCertificate: provisions certificate to a host
- revokeCertificate: revokes a client certificate using its serial

Global settings for the CA framework:
- ca.framework.provider.plugin: The configured CA provider plugin
- ca.framework.cert.keysize: The key size for certificate generation
- ca.framework.cert.signature.algorithm: The certificate signature algorithm
- ca.framework.cert.validity.period: Certificate validity in days
- ca.framework.cert.automatic.renewal: Certificate auto-renewal setting
- ca.framework.background.task.delay: CA background task delay/interval
- ca.framework.cert.expiry.alert.period: Days to check and alert expiring certificates

Global settings for the default 'root' CA provider:
- ca.plugin.root.private.key: (hidden/encrypted) CA private key
- ca.plugin.root.public.key: (hidden/encrypted) CA public key
- ca.plugin.root.ca.certificate: (hidden/encrypted) CA certificate
- ca.plugin.root.issuer.dn: The CA issue distinguished name
- ca.plugin.root.auth.strictness: Are clients required to present certificates
- ca.plugin.root.allow.expired.cert: Are clients with expired certificates allowed

UI changes:
- Button to download/save the CA certificates.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-09-26 09:19:31 +05:30
Nicolas Vazquez c4f76a199b FR26: Update rule permission of a role permission (#48) 2017-08-23 18:18:58 +02:00
Abhinandan Prateek 327360279d Fr17b (#41)
* FR17: 1. Add timeout to the volume stats command
2. When a unknown command is received return a BadCommand from request processor

* FR17: Unit test for checking bad and a good command sent to the agent as json
2017-08-10 14:39:36 +02:00
Rohit Yadav a9f268b52b java: add java 1.7 version for jenv
Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-08-06 18:41:10 +02:00
dahn 576b4c7c27 FR23 plugable isolation methods
FR23 plugable isolation methods
This is brought to the public version as CLOUDSTACK-10007
2017-07-28 16:28:38 +02:00
Rohit Yadav c1118c2a4e FIX3: Consider overcommit ratios with total/threshold values for host metrics
Consider the CPU and memory overcommit ratios with total cpu/ram values
or thresholds for host metrics. This will fix incorrect notification
(cells turning yellow/red) in the metrics view.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-07-24 12:38:28 +02:00
Rohit Yadav fe88eecfd6 Revert "FIX3: Consider overcommit ratios with total/threshold values"
This reverts commit 9c82452b82.
2017-07-22 13:13:25 +02:00
dahn 3caef4487e Merge pull request #42 from shapeblue/fr13-annotations
annotations (on hosts)
2017-07-13 10:31:41 +02:00
Boris 5a229b369f Adding marvin tests 2017-07-13 10:30:33 +02:00
Daan Hoogland 09173a4466 annotations on hosts 2017-07-13 10:29:51 +02:00
Rohit Yadav b539b48a69 FIX2: Allow creation of roles with names of deleted roles
This allows admins to create roles with names of previously deleted
roles.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-07-11 14:06:24 +05:30
Rohit Yadav 9c82452b82 FIX3: Consider overcommit ratios with total/threshold values
Consider the CPU and memory overcommit ratios with total cpu/ram values
or thresholds for host and cluster metrics. This will fix incorrect
notification (cells turning yellow/red) in the metrics view.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-07-11 12:52:48 +05:30
Rohit Yadav 2590c2521a CLOUDSTACK-9983: Hide credentials in listClusters response
This removes username and passwords details from the listClusters
response. The details are usually seen in VMware environments only.
With dynamic roles features, the listClusters API may be provided
to a read-only root-admin user role/type which should not be able to get
the credentials.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-07-05 13:32:45 +05:30
Abhinandan Prateek c5e9e98ab5 FR17-b: Following enhancements are made to FR17
1. Add timeout to the volume stats command
2. When a unknown command is received return a BadCommand from request processor
3. Unit test for checking bad and a good command sent to the agent as json
2017-06-14 08:40:41 +05:30
Rohit Yadav 986497d891 FR20: Allow native CloudStack users to change password from the UI
This allows native CloudStack users to change password from the UI.
Overall changes:
- New 'usersource' key returned in the listUsers API
- Removed ldap specific check from the UI, added checks based on usersource
- Native CloudStack users will be allowed to change password from the UI

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-05-30 14:23:38 +05:30
Rohit Yadav bf7efaa98d cw1314: Fix high CPU deviation issues seen in metrics view
HostStats returns cpu usage in percentage while memory usage in bytes.
This fixes a regression in maximum CPU usage deviation that did not
assume the values to be in percentage and multiple the final ratios
with 100 which leads to 100x the actual deviation value.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-05-10 12:42:12 +05:30
dahn b70140c1bd Merge pull request #30 from shapeblue/nio-logging
logging improvements
2017-05-09 13:55:11 +01:00
Rohit Yadav 5d58f34a4f Merge pull request #36 from shapeblue/cw-1300
CW1300: Fix hyperv log4j transformation
2017-04-27 14:59:58 +05:30
Rohit Yadav 43a1c9e24a CW1300: Fix hyperv log4j transformation
Fixes log4j transformation using replace.properties to translate
@AGENTLOG@ to a valid value during rpm/mvn build.

Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>
2017-04-25 15:21:25 +05:30
Abhinandan Prateek 8c433b2307 Merge pull request #35 from shapeblue/9182
CLOUDSTACK-9182: Some running VMs turned off on manual migration when…
2017-04-25 11:18:01 +05:30
Rohit Yadav f89d06b0f6 Merge pull request #34 from shapeblue/fr19-oobm-plugin-cloudstack
APPLE-333: Oobm plugin for nested-cloudstack environments
2017-04-21 12:26:19 +05:30