C12s/c12s-kubespray

Author	SHA1	Message	Date
Alexander Block	ddc399605a	Add playbook and role to reset the cluster This deletes everything related to the cluster and allows to start from scratch.	2016-12-09 11:15:36 +01:00
Aleksandr Didenko	63b655cd7b	Convert docker_versioned_pkg dict keys to string This will allow to use '-e docker_version=1.12' in ansible playbook execution. It's also backward-compatible and will work with floating docker_version format in custom yaml files. Closes #702	2016-12-09 09:17:36 +01:00
Matthew Mosesohn	be117265d9	Merge pull request #668 from bodepd/etcd_access_address Use etcd host ip instead of hostname to build etcd_access_addresses	2016-12-09 07:54:12 +03:00
Bogdan Dobrelya	aee21136ce	Merge pull request #691 from adidenko/calico-old-cni-fix Fix possible problems with legacy calicoctl	2016-12-08 12:00:08 +01:00
Dan Bode	f7a7e064b5	Allow etcd_access_addresses to be more flexible The variale etcd_access_addresses is used to determine how to address communication from other roles to the etcd cluster. It was set to the address that ansible uses to connect to instance ({{ item }})s and not the the variable: ip_access which had already been created and could already be overridden through the access_ip variable. This change allows ansible to connect to a machine using a different address than the one used to access etcd.	2016-12-07 10:33:15 -08:00
Matthew Mosesohn	75782aa262	Force hardlink for calico/canal certs Fixes: #669	2016-12-07 19:03:22 +03:00
Bogdan Dobrelya	1b17efee19	Merge pull request #692 from bogdando/gce_fixes Change GCE sysctls placement and docs	2016-12-07 16:17:30 +01:00
Bogdan Dobrelya	965b27e48e	Change GCE sysctls placement and docs Override GCE sysctl in /etc/sysctl.d/99-sysctl.conf instead of the /etc/sysctl.d/11-gce-network-security.conf. It is recreated by GCE, f.e. if gcloud CLI invokes some security related changes, thus losing customizations we want to be persistent. Update cloud providers firewall requirements in calico docs. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-12-07 12:53:45 +01:00
Aleksandr Didenko	611038e306	Fix possible problems with legacy calicoctl When running legacy calicoctl we do not specify calico hostname in calico-node container thus we should not specify it in CNI config. Also move 'legacy_calicoctl' set_fact task to the top.	2016-12-07 12:26:44 +01:00
fen4o	10ce760450	add cluster-signing to kube-controller-manager kube-controller-manager's cluster signing cert and key points by default to not existing `/etc/kubernetes/ca/ca.pem` and `/etc/kubernetes/ca/ca.key` [docs][1] [1]: http://kubernetes.io/docs/admin/kube-controller-manager/#options	2016-12-07 11:20:18 +02:00
Bogdan Dobrelya	8e28cb8095	Merge pull request #584 from chadswen/docker-options-refactor Docker Options Refactor	2016-12-07 07:57:53 +01:00
Bogdan Dobrelya	ae7769d832	Merge pull request #684 from adidenko/fix-calico-peering Calico: fix peering with routers for new version	2016-12-06 22:42:02 +01:00
Spencer Smith	08fb5b6c01	Merge pull request #627 from kubernetes-incubator/issue-626 add restart flag for docker run kubelet	2016-12-06 08:47:18 -08:00
Aleksandr Didenko	992fcd1680	Calico: fix peering with routers for new version In new `calicoctl` version nodes peering with routers is broken. We need to use predictable node names for calico-node and the same names in calico `bgpPeer` resources and CNI.	2016-12-06 17:17:39 +01:00
Bogdan Dobrelya	f63f99d774	Merge pull request #678 from adidenko/update-calico-unit Update calico-node systemd unit	2016-12-06 13:51:37 +01:00
Aleksandr Didenko	f3231b40e7	Update calico-node systemd unit New calicoctl does not support --detach=false option, so we should use a recommended way to run calico-node service: http://docs.projectcalico.org/v2.0/usage/configuration/as-service Closes #674, #675	2016-12-06 11:34:12 +01:00
Bogdan Dobrelya	567f5ac4c6	Merge pull request #679 from kubernetes-incubator/kube-proxy-dbus Add dbus socket dir to kube-proxy	2016-12-06 11:08:16 +01:00
Matthew Mosesohn	eeb3b9f7e1	Fix ipv4 forwarding on GCE ipv4 forwarding gets broken when restarting networking, which breaks all networking for all pods.	2016-12-06 11:57:57 +03:00
Matthew Mosesohn	224f5fae63	Add dbus socket dir to kube-proxy	2016-12-05 19:25:27 +03:00
Chad Swenson	b7959020c6	Docker Options Refactor	2016-12-02 15:07:51 -06:00
Bogdan Dobrelya	16a4b4f336	Merge pull request #672 from kubernetes-incubator/fail_all_on_error Fail all nodes on error	2016-12-02 17:08:10 +01:00
Bogdan Dobrelya	220a375cb9	Merge pull request #656 from YorikSar/nginx-proxy-timeout Set proxy_timeout to 10m in nginx.conf	2016-12-02 12:48:18 +01:00
ant31	e8e2c84ca4	Fail all nodes on error	2016-12-02 12:37:22 +01:00
Sebastian Melchior	254e02c69e	add basic azure support for kargo	2016-11-29 10:20:28 +01:00
Yuriy Taraday	d92124561d	Set proxy_timeout to 10m in nginx.conf Fixes #655. This is a teporary solution for long-polling idle connections to apiserver. It will make Nginx not cut them for the duration of expected timeout. It will also make Nginx extremely slow in realizing that there is some issue with connectivity to apiserver as well, so it might not be perfect permanent solution.	2016-11-28 20:27:47 +03:00
Antoine Legrand	f75e2c5119	Merge pull request #529 from bogdando/netcheck Add a k8s app for advanced e2e netcheck for DNS	2016-11-28 15:26:30 +01:00
Bogdan Dobrelya	d5b21b34c2	Add advanced net check for DNS K8s app * Add an option to deploy K8s app to test e2e network connectivity and cluster DNS resolve via Kubedns for nethost/simple pods (defaults to false). * Parametrize existing k8s apps templates with kube_namespace and kube_config_dir instead of hardcode. * For CoreOS, ensure nameservers from inventory to be put in the first place to allow hostnet pods connectivity via short names or FQDN and hostnet agents to pass as well, if netchecker deployed. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-28 13:23:25 +01:00
Bogdan Dobrelya	779d676414	Merge pull request #652 from kubernetes-incubator/debug_mode Tune dnsmasq/kubedns limits, replicas, logging	2016-11-25 16:57:15 +01:00
Bogdan Dobrelya	c34c49d4d9	Tune dnsmasq/kubedns limits, replicas, logging * Add dns_replicas, dns_memory/cpu_limit/requests vars for dns related apps. * When kube_log_level=4, log dnsmasq queries as well. * Add log level control for skydns (part of kubedns app). * Add limits/requests vars for dnsmasq (part of kubedns app) and dnsmasq daemon set. * Drop string defaults for kube_log_level as it is int and is defined in the global vars as well. * Add docs Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-25 12:49:17 +01:00
Aleksandr Didenko	0e49c5f240	Update calico/ctl image tag We no longer need to use v0.22.0 for calicoctl since Kargo has support for new calicoctl CLI format. Also fixing condition logic for calico pool task.	2016-11-25 11:23:27 +01:00
Bogdan Dobrelya	09a1a1a963	Merge pull request #651 from bogdando/fix_docker_install Fix download dnsmasq image dependency on docker	2016-11-24 18:44:12 +01:00
Bogdan Dobrelya	91cc141662	Merge pull request #648 from artem-panchenko/fix_calicoctl_node_run Fix Calico jinja template (systemd)	2016-11-24 18:33:34 +01:00
Bogdan Dobrelya	417a931f78	Fix download dnsmasq image dependency on docker When download_run_once with download_localhost is used, docker is expected to be running on the delegate localhost. That may be not the case for a non localhost delegate, which is the kube-master otherwise. Then the dnsmasq role, had it been invoked early before deployment starts, would fail because of the missing docker dependency. * Fix that dependency on docker and do not pre download dnsmasq image for the dnsmasq role, if download_localhost is disabled. * Remove become: false for docker CLI invocation because that's not the common pattern to allow users access docker CLI w/o sudo. * Fix opt bin path hack for localhost delegate to ignore errors when it fails with "sudo password required" otherwise. * Describe download_run_once with download_localhost use case in docs as well. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-24 18:31:26 +01:00
Bogdan Dobrelya	bbd57d5f5e	Ensure /etc/resolv.conf content for CoreOS Use cloud-init config to replace /etc/resolv.conf with the content for kubelet to properly configure hostnet pods. Do not use systemd-resolved yet, see https://coreos.com/os/docs/latest/configuring-dns.html "Only nss-aware applications can take advantage of the systemd-resolved cache. Notably, this means that statically linked Go programs and programs running within Docker/rkt will use /etc/resolv.conf only, and will not use the systemd-resolve cache." Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-23 16:51:49 +01:00
Artem Panchenko	0437f9584d	Fix Calico jinja template (systemd)	2016-11-23 11:43:53 +02:00
Bogdan Dobrelya	a4d5a14791	Fix nginx container download for download_run_once mode W/o this patch, the "Download containers" task may be skipped when running on the delegate node due to wrong "when" confition. Then it fails to upload nginx image to the nodes as well. Fix download nginx dependency so it always can be pushed to nodes when download_run_once is enabled. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-23 10:37:08 +01:00
Bogdan Dobrelya	bfa15cd8ea	Merge pull request #642 from kubernetes-incubator/k8s_imgpull Allow pre-downloaded images to be used effectively	2016-11-22 18:09:38 +01:00
Aleksandr Didenko	f0b1884104	Set defaults for ansible_ssh_user When setting permission for containers download/upload dir we're using `ansible_ssh_user`. But if playbook is executed without user being explicitly set `ansible_ssh_user` may be undefined. In such situations dir ownership will default to `ansible_user_id` Closes: #644	2016-11-22 18:00:56 +01:00
Bogdan Dobrelya	1bd3d3a080	Allow pre-downloaded images to be used effectively According to http://kubernetes.io/docs/user-guide/images/ : By default, the kubelet will try to pull each image from the specified registry. However, if the imagePullPolicy property of the container is set to IfNotPresent or Never, then a local\ image is used (preferentially or exclusively, respectively). Use IfNotPresent value to allow images prepared by the download role dependencies to be effectively used by kubelet without pull errors resulting apps to stay blocked in PullBackOff/Error state even when there are images on the localhost exist. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-22 16:16:04 +01:00
Antoine Legrand	bca996bf0b	Merge pull request #638 from pskrzyns/fix_setting_loadbalancer_apiserver_localhost Fix conditional when setting loadbalancer_apiserver_localhost	2016-11-22 15:15:38 +01:00
Bogdan Dobrelya	539a47b0fa	Merge pull request #621 from xenolog/calico_network_backend Add ability to define network backend for Calico.	2016-11-22 14:55:47 +01:00
Antoine Legrand	0016ba1759	Merge pull request #635 from kubernetes-incubator/download_images Download images as dependencies of roles	2016-11-22 14:53:12 +01:00
Bogdan Dobrelya	793cedc522	Download images as dependencies of roles Pre download all required container images as roles' deps. Drop unused flannel-server-helper images pre download. Improve pods creation post-install test pre downloaded busybox. Improve logs collection script with kubectl describe, fix sudo/etcd/weave commands. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-22 11:13:57 +01:00
Paweł Skrzyński	67b61c5c42	Fix conditional when setting loadbalancer_apiserver_localhost	2016-11-21 19:36:05 +01:00
Bogdan Dobrelya	523c9d77df	Add missing liveness probe for apiserver static pod Fix unreliable waiting for the apiserver to become ready. Remove logfile mount to align with the rest of static pods and because containers shall write logs to stdout only. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>	2016-11-21 13:15:51 +01:00
Bogdan Dobrelya	b44479d911	Merge pull request #629 from kubernetes-incubator/fix-download-once Fix download once	2016-11-21 10:55:54 +01:00
Bogdan Dobrelya	d2f9c11299	Merge pull request #633 from bodepd/etcd_fix Ensure that etcd health checks always pass	2016-11-21 10:29:35 +01:00
Dan Bode	aad73ea90e	Ensure that etcd health checks always pass in the etcd handler, the reload etcd action was called after ansible waits for etcd to be up, this means that the health checks which are called immediately after fail (resulting in the etcd role always failing and never finishing) This patch changes the order to move the 'wait for etcd up' resource after the 'reload etcd resource', ensuring that the service is up before the health check is called.	2016-11-18 14:15:00 -08:00
Spencer Smith	106dcc3898	updated all instances of restart always to restart on-failure with a max of 5 times	2016-11-18 14:33:22 -05:00
Bogdan Dobrelya	cf7c6ae859	Add download localhost and enable for CI * Add download_localhost for the download_run_once mode, which is use the ansible host (a travis node for CI case) to store and distribute containers across cluster nodes in inventory. Defaults to false. * Rework download_run_once logic to fix idempotency of uploading containers. * For Travis CI, enable docker images caching and run Travis workers with sudo enabled as a dependency * For Travis CI, deploy with download_localhost and download_run_once enabled to shourten dev path drastically. * Add compression for saved container images. Defaults to 'best'. Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com> Co-authored-by: Aleksandr Didenko <adidenko@mirantis.com>	2016-11-18 16:00:07 +01:00

1 2 3 4 5 ...

613 commits