Deploy a Production Ready Kubernetes Cluster on bare metal or raw VMs - This is a clone of https://github.com/kubernetes-sigs/kubespray.git with a kitten twist.
Find a file
Andreas Holmsten 7d8da8348e Cherry pick #4857 #4859 #4867 into release-2.10 (#4877)
* fix start CoreDNS when init secondary master (#4867)

* Update dns-autoscaler.yml.j2 (#4857)

Merge two tolerations.  because the latest tolerations will cover the first tolerations.

* Remove GCE tests and CNCF funding ended (#4859)
2019-06-13 05:22:17 -07:00
.github Add a PR template (#4491) 2019-04-11 03:04:14 -07:00
.gitlab-ci Run dns_late preinstall tasks on all k8s nodes (#4672) 2019-04-29 05:12:21 -07:00
contrib ansible-lint: Add exception for invocation of "rm" (#4609) 2019-04-29 01:34:20 -07:00
docs Documentation and playbook for recovering control plane from node failure (#4146) 2019-04-29 01:40:20 -07:00
extra_playbooks Yamllint fixes (#4410) 2019-04-01 02:38:33 -07:00
inventory k8s 1.14.3 (#4855) 2019-06-09 03:41:05 -07:00
library recursive option for kube ansible module (#4273) 2019-02-25 22:17:23 -08:00
logo Add logo folders (#4515) 2019-04-12 11:00:47 -07:00
roles Cherry pick #4857 #4859 #4867 into release-2.10 (#4877) 2019-06-13 05:22:17 -07:00
scripts Cleanup references to Travis CI (#4208) 2019-04-20 06:17:40 -07:00
test-infra/image-builder Fix typo in test-infra playbook (#4644) 2019-04-24 13:34:46 -07:00
tests updated pinning to prevent breaking changes (#4783) (#4873) 2019-06-13 02:36:19 -07:00
.ansible-lint ansible-lint: Add exception for invocation of "rm" (#4609) 2019-04-29 01:34:20 -07:00
.gitignore Tell git to ignore .terraform directory (#4428) 2019-04-05 01:27:18 -07:00
.gitlab-ci.yml Cherry pick #4857 #4859 #4867 into release-2.10 (#4877) 2019-06-13 05:22:17 -07:00
.gitmodules Remove submodules 2016-03-04 16:14:01 +01:00
.nojekyll Publish docs with docsify (#4193) 2019-02-07 04:52:08 -08:00
.yamllint Adding yamllinter to ci steps (#1556) 2017-08-24 12:09:52 +03:00
_config.yml Yamllint fixes (#4410) 2019-04-01 02:38:33 -07:00
ansible.cfg Add playbook to install mitogen (#3622) 2018-10-31 11:52:47 +01:00
cluster.yml Run dns_late preinstall tasks on all k8s nodes (#4672) 2019-04-29 05:12:21 -07:00
CNAME Update CNAME 2019-02-07 16:30:25 +03:00
code-of-conduct.md Update code-of-conduct.md 2017-12-20 14:12:38 -05:00
CONTRIBUTING.md Add CNCF CLA to the contributing document (#4281) 2019-02-20 06:47:17 -08:00
Dockerfile Update CI to use 2.9.0 release and update Dockerfile to now use 18.04 (#4472) 2019-04-09 05:57:06 -07:00
index.html Add logo folders (#4515) 2019-04-12 11:00:47 -07:00
LICENSE Create LICENSE 2016-03-01 15:37:01 +01:00
Makefile Add playbook to install mitogen (#3622) 2018-10-31 11:52:47 +01:00
mitogen.yaml Yamllint fixes (#4410) 2019-04-01 02:38:33 -07:00
OWNERS Updated OWNERS file pointing to docs (#4184) 2019-02-18 05:49:36 -08:00
OWNERS_ALIASES add miouge1 to reviewers (slack - maxguy) (#4108) 2019-01-28 00:42:22 -08:00
README.md k8s 1.14.3 (#4855) 2019-06-09 03:41:05 -07:00
recover-control-plane.yml Documentation and playbook for recovering control plane from node failure (#4146) 2019-04-29 01:40:20 -07:00
RELEASE.md fix typo 'on' > 'one' 2017-07-14 15:25:09 -04:00
remove-node.yml Upgrade to Ansible 2.7.8 (#4535) 2019-04-17 10:18:05 -07:00
requirements.txt updated pinning to prevent breaking changes (#4783) (#4873) 2019-06-13 02:36:19 -07:00
reset.yml Upgrade to Ansible 2.7.8 (#4535) 2019-04-17 10:18:05 -07:00
scale.yml Define and implement specs for bootstrap-os (#4455) 2019-04-23 15:46:02 -07:00
SECURITY_CONTACTS Adding SECURITY_CONTACTS fixes #2816 (#2833) 2018-05-31 10:48:49 +03:00
setup.cfg Update all kubernetes-incubator/kubespray refs to kubernetes-sigs/kubespray (#3780) 2018-11-28 09:15:25 +01:00
setup.py Add pbr build configuration 2017-08-18 12:56:01 +02:00
upgrade-cluster.yml Run dns_late preinstall tasks on all k8s nodes (#4672) 2019-04-29 05:12:21 -07:00
Vagrantfile Add missing comma (#4636) 2019-04-24 07:10:02 -07:00

Kubernetes Logo

Deploy a Production Ready Kubernetes Cluster

If you have questions, check the documentation and join us on the kubernetes slack, channel #kubespray. You can get your invite here

  • Can be deployed on AWS, GCE, Azure, OpenStack, vSphere, Packet (bare metal), Oracle Cloud Infrastructure (Experimental), or Baremetal
  • Highly available cluster
  • Composable (Choice of the network plugin for instance)
  • Supports most popular Linux distributions
  • Continuous integration tests

Quick Start

To deploy the cluster you can use :

Ansible

Usage

# Install dependencies from ``requirements.txt``
sudo pip install -r requirements.txt

# Copy ``inventory/sample`` as ``inventory/mycluster``
cp -rfp inventory/sample inventory/mycluster

# Update Ansible inventory file with inventory builder
declare -a IPS=(10.10.1.3 10.10.1.4 10.10.1.5)
CONFIG_FILE=inventory/mycluster/hosts.yml python3 contrib/inventory_builder/inventory.py ${IPS[@]}

# Review and change parameters under ``inventory/mycluster/group_vars``
cat inventory/mycluster/group_vars/all/all.yml
cat inventory/mycluster/group_vars/k8s-cluster/k8s-cluster.yml

# Deploy Kubespray with Ansible Playbook - run the playbook as root
# The option `-b` is required, as for example writing SSL keys in /etc/,
# installing packages and interacting with various systemd daemons.
# Without -b the playbook will fail to run!
ansible-playbook -i inventory/mycluster/hosts.yml --become --become-user=root cluster.yml

Note: When Ansible is already installed via system packages on the control machine, other python packages installed via sudo pip install -r requirements.txt will go to a different directory tree (e.g. /usr/local/lib/python2.7/dist-packages on Ubuntu) from Ansible's (e.g. /usr/lib/python2.7/dist-packages/ansible still on Ubuntu). As a consequence, ansible-playbook command will fail with:

ERROR! no action detected in task. This often indicates a misspelled module name, or incorrect module path.

probably pointing on a task depending on a module present in requirements.txt (i.e. "unseal vault").

One way of solving this would be to uninstall the Ansible package and then, to install it via pip but it is not always possible. A workaround consists of setting ANSIBLE_LIBRARY and ANSIBLE_MODULE_UTILS environment variables respectively to the ansible/modules and ansible/module_utils subdirectories of pip packages installation location, which can be found in the Location field of the output of pip show [package] before executing ansible-playbook.

Vagrant

For Vagrant we need to install python dependencies for provisioning tasks. Check if Python and pip are installed:

python -V && pip -V

If this returns the version of the software, you're good to go. If not, download and install Python from here https://www.python.org/downloads/source/ Install the necessary requirements

sudo pip install -r requirements.txt
vagrant up

Documents

Supported Linux Distributions

  • Container Linux by CoreOS
  • Debian Buster, Jessie, Stretch, Wheezy
  • Ubuntu 16.04, 18.04
  • CentOS/RHEL 7
  • Fedora 28
  • Fedora/CentOS Atomic
  • openSUSE Leap 42.3/Tumbleweed

Note: Upstart/SysV init based OS types are not supported.

Supported Components

Note: The list of validated docker versions was updated to 1.11.1, 1.12.1, 1.13.1, 17.03, 17.06, 17.09, 18.06. kubeadm now properly recognizes Docker 18.09.0 and newer, but still treats 18.06 as the default supported version. The kubelet might break on docker's non-standard version numbering (it no longer uses semantic versioning). To ensure auto-updates don't break your cluster look into e.g. yum versionlock plugin or apt pin).

Requirements

  • Ansible v2.7.8 (or newer) and python-netaddr is installed on the machine that will run Ansible commands
  • Jinja 2.9 (or newer) is required to run the Ansible Playbooks
  • The target servers must have access to the Internet in order to pull docker images. Otherwise, additional configuration is required (See Offline Environment)
  • The target servers are configured to allow IPv4 forwarding.
  • Your ssh key must be copied to all the servers part of your inventory.
  • The firewalls are not managed, you'll need to implement your own rules the way you used to. in order to avoid any issue during deployment you should disable your firewall.
  • If kubespray is ran from non-root user account, correct privilege escalation method should be configured in the target servers. Then the ansible_become flag or command parameters --become or -b should be specified.

Hardware:
These limits are safe guarded by Kubespray. Actual requirements for your workload can differ. For a sizing guide go to the Building Large Clusters guide.

  • Master
    • Memory: 1500 MB
  • Node
    • Memory: 1024 MB

Network Plugins

You can choose between 6 network plugins. (default: calico, except Vagrant uses flannel)

  • flannel: gre/vxlan (layer 2) networking.

  • calico: bgp (layer 3) networking.

  • canal: a composition of calico and flannel plugins.

  • cilium: layer 3/4 networking (as well as layer 7 to protect and secure application protocols), supports dynamic insertion of BPF bytecode into the Linux kernel to implement security services, networking and visibility logic.

  • contiv: supports vlan, vxlan, bgp and Cisco SDN networking. This plugin is able to apply firewall policies, segregate containers in multiple network and bridging pods onto physical networks.

  • weave: Weave is a lightweight container overlay network that doesn't require an external K/V database cluster. (Please refer to weave troubleshooting documentation).

  • kube-router: Kube-router is a L3 CNI for Kubernetes networking aiming to provide operational simplicity and high performance: it uses IPVS to provide Kube Services Proxy (if setup to replace kube-proxy), iptables for network policies, and BGP for ods L3 networking (with optionally BGP peering with out-of-cluster BGP peers). It can also optionally advertise routes to Kubernetes cluster Pods CIDRs, ClusterIPs, ExternalIPs and LoadBalancerIPs.

  • multus: Multus is a meta CNI plugin that provides multiple network interface support to pods. For each interface Multus delegates CNI calls to secondary CNI plugins such as Calico, macvlan, etc.

The choice is defined with the variable kube_network_plugin. There is also an option to leverage built-in cloud provider networking instead. See also Network checker.

Community docs and resources

Tools and projects on top of Kubespray

CI Tests

Build graphs

CI/end-to-end tests sponsored by Google (GCE) See the test matrix for details.