C12s/c12s-kubespray

Author	SHA1	Message	Date
Matthew Mosesohn	70e122e7c2	Use async for slow long loop cert tasks Checking for certs and generating tokens takes up to 1.5s per node for each of three tasks. Async should parallelize this and reduce the time significantly.	2017-03-02 11:36:16 +04:00
Antoine Legrand	77e5171679	Merge pull request #1076 from VincentS/etcd_openssl_count_fix Fixed counter in ETCD Openssl.conf	2017-03-01 14:17:27 +01:00
Sergii Golovatiuk	f9ff93c606	Make etcd data dir configurable. Closes: #1073 Signed-off-by: Sergii Golovatiuk <sgolovatiuk@mirantis.com>	2017-02-27 21:35:51 +01:00
Vincent Schwarzer	0cbc3d8df6	Fixed counter in ETCD Openssl.conf When a apiserver_loadbalancer_domain_name is added to the Openssl.conf the counter gets not increased correctly. This didnt seem to have an effect at the current kargo version.	2017-02-27 12:01:09 +01:00
Sergii Golovatiuk	00cfead9bb	Increase SSL TTL to 3650 days In real scenarios 365 days is short period of time. 3650 days is good enough for long running k8s environments	2017-02-24 15:38:13 +01:00
Matthew Mosesohn	d821448e2f	Merge branch 'master' into synthscale	2017-02-21 22:17:43 +03:00
Matthew Mosesohn	0afadb9149	Merge pull request #1046 from skyscooby/pedantic-syntax-cleanup Cleanup legacy syntax, spacing, files all to yml	2017-02-21 17:03:16 +03:00
Matthew Mosesohn	d19e6dec7a	speed up etcd preupgrade check	2017-02-20 20:18:10 +03:00
Matthew Mosesohn	a21eb036ee	Add no_log to cert tar tasks This works around 4MB limit for gitlab CI runner.	2017-02-18 14:09:57 +04:00
Matthew Mosesohn	9c1701f2aa	Add synthetic scale deployment mode New deploy modes: scale, ha-scale, separate-scale Creates 200 fake hosts for deployment with fake hostvars. Useful for testing certificate generation and propagation to other master nodes. Updated test cases descriptions.	2017-02-18 14:09:55 +04:00
Andrew Greenwood	ca9ea097df	Cleanup legacy syntax, spacing, files all to yml Migrate older inline= syntax to pure yml syntax for module args as to be consistant with most of the rest of the tasks Cleanup some spacing in various files Rename some files named yaml to yml for consistancy	2017-02-17 16:22:34 -05:00
Antoine Legrand	e16ebcad6e	Merge pull request #1042 from holser/fix_facts Fix fact tags	2017-02-17 17:56:29 +01:00
Sergii Golovatiuk	e91e58aec9	Fix fact tags Ansible playbook fails when tags are limited to "facts,etcd" or to "facts". This patch allows to run ansible-playbook to gather facts only that don't require calico/flannel/weave components to be verified. This allows to run ansible with 'facts,bootstrap-os' or just 'facts' to gether facts that don't require specific components. Signed-off-by: Sergii Golovatiuk <sgolovatiuk@mirantis.com>	2017-02-17 12:32:33 +01:00
Matthew Mosesohn	80c0e747a7	Fix references to CoreOS and Container Linux by CoreOS Fixes #967	2017-02-16 19:25:17 +03:00
Vladimir Rutsky	09847567ae	set "check_mode: no" for read-only "shell" steps that registers result "shell" step doesn't support check mode, which currently leads to failures, when Ansible is being run in check mode (because Ansible doesn't run command, assuming that command might have effect, and no "rc" or "output" is registered). Setting "check_mode: no" allows to run those "shell" commands in check mode (which is safe, because those shell commands doesn't have side effects).	2017-02-13 18:53:41 +03:00
Josh Conant	245e05ce61	Vault security hardening and role isolation	2017-02-08 21:41:36 +00:00
Josh Conant	f4ec2d18e5	Adding the Vault role	2017-02-08 21:31:28 +00:00
Matthew Mosesohn	0180ad7f38	Merge pull request #990 from mattymo/fix_cert_upgrade Fix check for node-NODEID certs existence	2017-02-08 14:44:09 +03:00
Matthew Mosesohn	e5779ab786	Fix check for node-NODEID certs existence Fixes upgrade from pre-individual node cert envs.	2017-02-07 21:06:48 +03:00
Matthew Mosesohn	71e14a13b4	Re-tune ETCD performance params Reduce election timeout to 5000ms (was 10000ms) Raise heartbeat interval to 250ms (was 100ms) Remove etcd cpu share (was 300) Make etcd_cpu_limit and etcd_memory_limit optional.	2017-02-07 20:15:14 +03:00
Matthew Mosesohn	fd30131dc2	Revert "Drop linux capabilities and rework users/groups"	2017-02-06 15:58:54 +03:00
Bogdan Dobrelya	cb2e5ac776	Drop linux capabilities and rework users/groups * Drop linux capabilities for unprivileged containerized worlkoads Kargo configures for deployments. * Configure required securityContext/user/group/groups for kube components' static manifests, etcd, calico-rr and k8s apps, like dnsmasq daemonset. * Rework cloud-init (etcd) users creation for CoreOS. * Fix nologin paths, adjust defaults for addusers role and ensure supplementary groups membership added for users. * Add netplug user for network plugins (yet unused by privileged networking containers though). * Grant the kube and netplug users read access for etcd certs via the etcd certs group. * Grant group read access to kube certs via the kube cert group. * Remove priveleged mode for calico-rr and run it under its uid/gid and supplementary etcd_cert group. * Adjust docs. * Align cpu/memory limits and dropped caps with added rkt support for control plane. Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2017-01-20 08:50:42 +01:00
Matthew Mosesohn	8ce32eb3e1	Merge pull request #905 from galthaus/async-runs Add tasks to ensure that the first nodes have their directories for cert gen	2017-01-19 18:32:27 +03:00
Greg Althaus	0d44599a63	Add explicit name printing in task names for deletgated task during cert creation	2017-01-18 14:06:50 -06:00
Sergii Golovatiuk	43fa72b7b7	Flush handlers before etcd restart systemctl daemon-reload should be run before when task modifies/creates union for etcd. Otherwise etcd won't be able to start Closes #892 Signed-off-by: Sergii Golovatiuk <sgolovatiuk@mirantis.com>	2017-01-17 15:04:25 +01:00
Greg Althaus	6c69da1573	This PR adds/or modifies a few tasks to allow for the playbook to be run by limit on each node without regard for order. The changes make sure that all of the directories needed to do certificate management are on the master[0] or etcd[0] node regardless of when the playbook gets run on each node. This allows for separate ansible playbook runs in parallel that don't have to be synchronized.	2017-01-14 23:24:34 -06:00
Greg Althaus	95bf380d07	If the inventory name of the host exceeds 63 characters, the openssl tools will fail to create signing requests because the CN is too long. This is mainly a problem when FQDNs are used in the inventory file. THis will truncate the hostname for the CN field only at the first dot. This should handle the issue for most cases.	2017-01-13 10:02:23 -06:00
Aleksandr Didenko	d9539e0f27	Fix etcd cert generation for calico-rr role "etcd_node_cert_data" variable is undefinded for "calico-rr" role. This patch adds "calico-rr" nodes to task where "etcd_node_cert_data" variable is registered.	2017-01-09 12:06:25 +01:00
Bogdan Dobrelya	5af2c42bde	Better fix for different CoreOS os family facts Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2017-01-05 16:32:08 +01:00
Bogdan Dobrelya	f7447837c5	Rename CoreOS fact Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2017-01-05 14:02:29 +01:00
Brad Beam	4b6f29d5e1	Adding kubelet in rkt	2017-01-03 14:49:48 -06:00
Brad Beam	8dc19374cc	Allowing etcd to run via rkt	2017-01-03 10:10:38 -06:00
Bogdan Dobrelya	58062be2a3	Drop non systemd OS types support Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2017-01-02 12:14:03 +01:00
Matthew Mosesohn	1f9f885379	Fix etcd cert generation to support large deployments Due to bash max args limits, we should pass all node filenames and base64-encoded tar data through stdin/stdout instead. Fixes #832	2016-12-30 12:55:26 +03:00
Bogdan Dobrelya	a56d9de502	Systemd units, limits, and bin path fixes * Add restart for weave service unit * Reuse docker_bin_dir everythere * Limit systemd managed docker containers by CPU/RAM. Do not configure native systemd limits due to the lack of consensus in the kernel community requires out-of-tree kernel patches. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-28 15:49:42 +01:00
Matthew Mosesohn	f0c0390646	Fix creation and sync of etcd certs Admin certs only go to etcd nodes Only generate cert-data for nodes that need sync	2016-12-28 14:21:17 +04:00
Matthew Mosesohn	e7a1949d85	Merge pull request #818 from mattymo/calico-rr-certs Fix calico-rr to use etcd certs instead of kube certs	2016-12-28 08:47:16 +03:00
Matthew Mosesohn	6d9cd2d720	Fix calico-rr to use etcd certs instead of kube certs	2016-12-27 17:04:50 +03:00
Bogdan Dobrelya	79996b557b	Rework ignore_errors to report no reds Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2016-12-27 13:00:50 +01:00
Matthew Mosesohn	385f7f6e75	Update etcd.j2	2016-12-22 22:29:24 +03:00
Matthew Mosesohn	9f1e3db906	Adjust etcd server certificates ETCD doesn't need cert/key options set. It only requires peer cert options.	2016-12-22 23:05:17 +04:00
Spencer Smith	b63d900625	Workaround etcdctl not yet being installed (#797 ) workaround case for etcdctl not yet being installed, only allow for return code of 0 (no error)	2016-12-22 12:41:38 -05:00
Matthew Mosesohn	ad796d188d	Individual etcd ssl certs Includes hooks for triggering calico, kubelet, and kube-apiserver restarts if etcd certs changed.	2016-12-22 13:31:11 +03:00
Matthew Mosesohn	348fc5b109	Fix etcd to-SSL upgrade and task register vars	2016-12-19 15:05:49 +03:00
Matthew Mosesohn	9cc73bdf08	Fix etcd member list when upgrading ETCD from an old version	2016-12-15 12:00:45 +04:00
Bogdan Dobrelya	a15d626771	Preconfigure DNS stack and docker early In order to enable offline/intranet installation cases: * Move DNS/resolvconf configuration to preinstall role. Remove skip_dnsmasq_k8s var as not needed anymore. * Preconfigure DNS stack early, which may be the case when downloading artifacts from intranet repositories. Do not configure K8s DNS resolvers for hosts /etc/resolv.conf yet early (as they may be not existing). * Reconfigure K8s DNS resolvers for hosts only after kubedns/dnsmasq was set up and before K8s apps to be created. * Move docker install task to early stage as well and unbind it from the etcd role's specific install path. Fix external flannel dependency on docker role handlers. Also fix the docker restart handlers' steps ordering to match the expected sequence (the socket then the service). * Add default resolver fact, which is the cloud provider specific and remove hardcoded GCE resolver. * Reduce default ndots for hosts /etc/resolv.conf to 2. Multiple search domains combined with high ndots values lead to poor performance of DNS stack and make ansible workers to fail very often with the "Timeout (12s) waiting for privilege escalation prompt:" error. * Update docs. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-09 17:30:55 +01:00
Bogdan Dobrelya	8cc84e132a	Add tags Add tags to allow more granular tasks filtering. Add generator script for MD formatted tags found. Add docs for tags how-to. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-09 12:14:28 +01:00
Spencer Smith	8870178a2d	Merge pull request #627 from kubernetes-incubator/issue-626 add restart flag for docker run kubelet	2016-12-06 08:47:18 -08:00
Dan Bode	ff675d40f9	Ensure that etcd health checks always pass in the etcd handler, the reload etcd action was called after ansible waits for etcd to be up, this means that the health checks which are called immediately after fail (resulting in the etcd role always failing and never finishing) This patch changes the order to move the 'wait for etcd up' resource after the 'reload etcd resource', ensuring that the service is up before the health check is called.	2016-11-18 14:15:00 -08:00
Spencer Smith	0eebe43c08	updated all instances of restart always to restart on-failure with a max of 5 times	2016-11-18 14:33:22 -05:00

1 2 3

109 commits