C12s/c12s-kubespray

Author	SHA1	Message	Date
Bogdan Dobrelya	a15d626771	Preconfigure DNS stack and docker early In order to enable offline/intranet installation cases: * Move DNS/resolvconf configuration to preinstall role. Remove skip_dnsmasq_k8s var as not needed anymore. * Preconfigure DNS stack early, which may be the case when downloading artifacts from intranet repositories. Do not configure K8s DNS resolvers for hosts /etc/resolv.conf yet early (as they may be not existing). * Reconfigure K8s DNS resolvers for hosts only after kubedns/dnsmasq was set up and before K8s apps to be created. * Move docker install task to early stage as well and unbind it from the etcd role's specific install path. Fix external flannel dependency on docker role handlers. Also fix the docker restart handlers' steps ordering to match the expected sequence (the socket then the service). * Add default resolver fact, which is the cloud provider specific and remove hardcoded GCE resolver. * Reduce default ndots for hosts /etc/resolv.conf to 2. Multiple search domains combined with high ndots values lead to poor performance of DNS stack and make ansible workers to fail very often with the "Timeout (12s) waiting for privilege escalation prompt:" error. * Update docs. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-09 17:30:55 +01:00
Bogdan Dobrelya	7897c34ba3	Merge pull request #700 from bogdando/tags Add tags	2016-12-09 13:23:56 +01:00
Bogdan Dobrelya	8cc84e132a	Add tags Add tags to allow more granular tasks filtering. Add generator script for MD formatted tags found. Add docs for tags how-to. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-09 12:14:28 +01:00
Aleksandr Didenko	ee8d6ab4fc	Convert docker_versioned_pkg dict keys to string This will allow to use '-e docker_version=1.12' in ansible playbook execution. It's also backward-compatible and will work with floating docker_version format in custom yaml files. Closes #702	2016-12-09 09:17:36 +01:00
Matthew Mosesohn	a80745b5bd	Merge pull request #668 from bodepd/etcd_access_address Use etcd host ip instead of hostname to build etcd_access_addresses	2016-12-09 07:54:12 +03:00
Bogdan Dobrelya	710d5ae48e	Merge pull request #691 from adidenko/calico-old-cni-fix Fix possible problems with legacy calicoctl	2016-12-08 12:00:08 +01:00
Dan Bode	eec2ed5809	Allow etcd_access_addresses to be more flexible The variale etcd_access_addresses is used to determine how to address communication from other roles to the etcd cluster. It was set to the address that ansible uses to connect to instance ({{ item }})s and not the the variable: ip_access which had already been created and could already be overridden through the access_ip variable. This change allows ansible to connect to a machine using a different address than the one used to access etcd.	2016-12-07 10:33:15 -08:00
Matthew Mosesohn	bfc9bcb8c7	Force hardlink for calico/canal certs Fixes: #669	2016-12-07 19:03:22 +03:00
Bogdan Dobrelya	8eb26c21be	Merge pull request #692 from bogdando/gce_fixes Change GCE sysctls placement and docs	2016-12-07 16:17:30 +01:00
Bogdan Dobrelya	f0f2b81276	Change GCE sysctls placement and docs Override GCE sysctl in /etc/sysctl.d/99-sysctl.conf instead of the /etc/sysctl.d/11-gce-network-security.conf. It is recreated by GCE, f.e. if gcloud CLI invokes some security related changes, thus losing customizations we want to be persistent. Update cloud providers firewall requirements in calico docs. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-07 12:53:45 +01:00
Aleksandr Didenko	c9290182be	Fix possible problems with legacy calicoctl When running legacy calicoctl we do not specify calico hostname in calico-node container thus we should not specify it in CNI config. Also move 'legacy_calicoctl' set_fact task to the top.	2016-12-07 12:26:44 +01:00
fen4o	246c8209c1	add cluster-signing to kube-controller-manager kube-controller-manager's cluster signing cert and key points by default to not existing `/etc/kubernetes/ca/ca.pem` and `/etc/kubernetes/ca/ca.key` [docs][1] [1]: http://kubernetes.io/docs/admin/kube-controller-manager/#options	2016-12-07 11:20:18 +02:00
Bogdan Dobrelya	36fe2cb5ea	Merge pull request #584 from chadswen/docker-options-refactor Docker Options Refactor	2016-12-07 07:57:53 +01:00
Bogdan Dobrelya	9d6cc3a8d5	Merge pull request #684 from adidenko/fix-calico-peering Calico: fix peering with routers for new version	2016-12-06 22:42:02 +01:00
Spencer Smith	8870178a2d	Merge pull request #627 from kubernetes-incubator/issue-626 add restart flag for docker run kubelet	2016-12-06 08:47:18 -08:00
Aleksandr Didenko	b0079ccd77	Calico: fix peering with routers for new version In new `calicoctl` version nodes peering with routers is broken. We need to use predictable node names for calico-node and the same names in calico `bgpPeer` resources and CNI.	2016-12-06 17:17:39 +01:00
Bogdan Dobrelya	2c1db56213	Merge pull request #678 from adidenko/update-calico-unit Update calico-node systemd unit	2016-12-06 13:51:37 +01:00
Aleksandr Didenko	f1d7af11ee	Update calico-node systemd unit New calicoctl does not support --detach=false option, so we should use a recommended way to run calico-node service: http://docs.projectcalico.org/v2.0/usage/configuration/as-service Closes #674, #675	2016-12-06 11:34:12 +01:00
Bogdan Dobrelya	59a097b255	Merge pull request #679 from kubernetes-incubator/kube-proxy-dbus Add dbus socket dir to kube-proxy	2016-12-06 11:08:16 +01:00
Matthew Mosesohn	7a3a473ccf	Fix ipv4 forwarding on GCE ipv4 forwarding gets broken when restarting networking, which breaks all networking for all pods.	2016-12-06 11:57:57 +03:00
Matthew Mosesohn	2cdf752481	Add dbus socket dir to kube-proxy	2016-12-05 19:25:27 +03:00
Chad Swenson	8b5b27bb51	Docker Options Refactor	2016-12-02 15:07:51 -06:00
Bogdan Dobrelya	7328e0e1ac	Merge pull request #672 from kubernetes-incubator/fail_all_on_error Fail all nodes on error	2016-12-02 17:08:10 +01:00
Bogdan Dobrelya	c13d0db0cc	Merge pull request #656 from YorikSar/nginx-proxy-timeout Set proxy_timeout to 10m in nginx.conf	2016-12-02 12:48:18 +01:00
ant31	dba2026002	Fail all nodes on error	2016-12-02 12:37:22 +01:00
Sebastian Melchior	bb55f68f95	add basic azure support for kargo	2016-11-29 10:20:28 +01:00
Yuriy Taraday	658543c949	Set proxy_timeout to 10m in nginx.conf Fixes #655. This is a teporary solution for long-polling idle connections to apiserver. It will make Nginx not cut them for the duration of expected timeout. It will also make Nginx extremely slow in realizing that there is some issue with connectivity to apiserver as well, so it might not be perfect permanent solution.	2016-11-28 20:27:47 +03:00
Antoine Legrand	5b382668f5	Merge pull request #529 from bogdando/netcheck Add a k8s app for advanced e2e netcheck for DNS	2016-11-28 15:26:30 +01:00
Bogdan Dobrelya	b7692fad09	Add advanced net check for DNS K8s app * Add an option to deploy K8s app to test e2e network connectivity and cluster DNS resolve via Kubedns for nethost/simple pods (defaults to false). * Parametrize existing k8s apps templates with kube_namespace and kube_config_dir instead of hardcode. * For CoreOS, ensure nameservers from inventory to be put in the first place to allow hostnet pods connectivity via short names or FQDN and hostnet agents to pass as well, if netchecker deployed. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-28 13:23:25 +01:00
Bogdan Dobrelya	fbdda81515	Merge pull request #652 from kubernetes-incubator/debug_mode Tune dnsmasq/kubedns limits, replicas, logging	2016-11-25 16:57:15 +01:00
Bogdan Dobrelya	2d18e19263	Tune dnsmasq/kubedns limits, replicas, logging * Add dns_replicas, dns_memory/cpu_limit/requests vars for dns related apps. * When kube_log_level=4, log dnsmasq queries as well. * Add log level control for skydns (part of kubedns app). * Add limits/requests vars for dnsmasq (part of kubedns app) and dnsmasq daemon set. * Drop string defaults for kube_log_level as it is int and is defined in the global vars as well. * Add docs Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-25 12:49:17 +01:00
Aleksandr Didenko	ff7d489f2d	Update calico/ctl image tag We no longer need to use v0.22.0 for calicoctl since Kargo has support for new calicoctl CLI format. Also fixing condition logic for calico pool task.	2016-11-25 11:23:27 +01:00
Bogdan Dobrelya	6d29a5981c	Merge pull request #651 from bogdando/fix_docker_install Fix download dnsmasq image dependency on docker	2016-11-24 18:44:12 +01:00
Bogdan Dobrelya	10b75d1d51	Merge pull request #648 from artem-panchenko/fix_calicoctl_node_run Fix Calico jinja template (systemd)	2016-11-24 18:33:34 +01:00
Bogdan Dobrelya	aa447585c4	Fix download dnsmasq image dependency on docker When download_run_once with download_localhost is used, docker is expected to be running on the delegate localhost. That may be not the case for a non localhost delegate, which is the kube-master otherwise. Then the dnsmasq role, had it been invoked early before deployment starts, would fail because of the missing docker dependency. * Fix that dependency on docker and do not pre download dnsmasq image for the dnsmasq role, if download_localhost is disabled. * Remove become: false for docker CLI invocation because that's not the common pattern to allow users access docker CLI w/o sudo. * Fix opt bin path hack for localhost delegate to ignore errors when it fails with "sudo password required" otherwise. * Describe download_run_once with download_localhost use case in docs as well. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-24 18:31:26 +01:00
Bogdan Dobrelya	d208896c46	Ensure /etc/resolv.conf content for CoreOS Use cloud-init config to replace /etc/resolv.conf with the content for kubelet to properly configure hostnet pods. Do not use systemd-resolved yet, see https://coreos.com/os/docs/latest/configuring-dns.html "Only nss-aware applications can take advantage of the systemd-resolved cache. Notably, this means that statically linked Go programs and programs running within Docker/rkt will use /etc/resolv.conf only, and will not use the systemd-resolve cache." Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-23 16:51:49 +01:00
Artem Panchenko	2c4b11f321	Fix Calico jinja template (systemd)	2016-11-23 11:43:53 +02:00
Bogdan Dobrelya	d890d2f277	Fix nginx container download for download_run_once mode W/o this patch, the "Download containers" task may be skipped when running on the delegate node due to wrong "when" confition. Then it fails to upload nginx image to the nodes as well. Fix download nginx dependency so it always can be pushed to nodes when download_run_once is enabled. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-23 10:37:08 +01:00
Bogdan Dobrelya	793f3990a0	Merge pull request #642 from kubernetes-incubator/k8s_imgpull Allow pre-downloaded images to be used effectively	2016-11-22 18:09:38 +01:00
Aleksandr Didenko	db03f17486	Set defaults for ansible_ssh_user When setting permission for containers download/upload dir we're using `ansible_ssh_user`. But if playbook is executed without user being explicitly set `ansible_ssh_user` may be undefined. In such situations dir ownership will default to `ansible_user_id` Closes: #644	2016-11-22 18:00:56 +01:00
Bogdan Dobrelya	dff78f616e	Allow pre-downloaded images to be used effectively According to http://kubernetes.io/docs/user-guide/images/ : By default, the kubelet will try to pull each image from the specified registry. However, if the imagePullPolicy property of the container is set to IfNotPresent or Never, then a local\ image is used (preferentially or exclusively, respectively). Use IfNotPresent value to allow images prepared by the download role dependencies to be effectively used by kubelet without pull errors resulting apps to stay blocked in PullBackOff/Error state even when there are images on the localhost exist. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-22 16:16:04 +01:00
Antoine Legrand	d3a4d8dc24	Merge pull request #638 from pskrzyns/fix_setting_loadbalancer_apiserver_localhost Fix conditional when setting loadbalancer_apiserver_localhost	2016-11-22 15:15:38 +01:00
Bogdan Dobrelya	dc58159d16	Merge pull request #621 from xenolog/calico_network_backend Add ability to define network backend for Calico.	2016-11-22 14:55:47 +01:00
Antoine Legrand	b60d5647a2	Merge pull request #635 from kubernetes-incubator/download_images Download images as dependencies of roles	2016-11-22 14:53:12 +01:00
Bogdan Dobrelya	66f27ed1f3	Download images as dependencies of roles Pre download all required container images as roles' deps. Drop unused flannel-server-helper images pre download. Improve pods creation post-install test pre downloaded busybox. Improve logs collection script with kubectl describe, fix sudo/etcd/weave commands. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-22 11:13:57 +01:00
Paweł Skrzyński	32a5453473	Fix conditional when setting loadbalancer_apiserver_localhost	2016-11-21 19:36:05 +01:00
Bogdan Dobrelya	1bd1825ecb	Add missing liveness probe for apiserver static pod Fix unreliable waiting for the apiserver to become ready. Remove logfile mount to align with the rest of static pods and because containers shall write logs to stdout only. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-21 13:15:51 +01:00
Bogdan Dobrelya	20e36191bb	Merge pull request #629 from kubernetes-incubator/fix-download-once Fix download once	2016-11-21 10:55:54 +01:00
Bogdan Dobrelya	769566f36c	Merge pull request #633 from bodepd/etcd_fix Ensure that etcd health checks always pass	2016-11-21 10:29:35 +01:00
Dan Bode	ff675d40f9	Ensure that etcd health checks always pass in the etcd handler, the reload etcd action was called after ansible waits for etcd to be up, this means that the health checks which are called immediately after fail (resulting in the etcd role always failing and never finishing) This patch changes the order to move the 'wait for etcd up' resource after the 'reload etcd resource', ensuring that the service is up before the health check is called.	2016-11-18 14:15:00 -08:00

1 2 3 4 5 ...

615 commits