Bogdan Dobrelya 390764c2b4 Add retry_stagger var for failed download/pushes.

* Add the retry_stagger var to tweak push and retry time strategies.
* Add large deployments related docs.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

2016-09-15 16:43:58 +02:00


Large deployments of K8s

For large-scale deployments, consider the following configuration changes:

Tune the Ansible forks and timeout settings to fit the large number of nodes being deployed.
Override the containers' foo_image_repo vars to point to an intranet registry.
Set download_run_once: true to download binaries and container images only once, then push them to nodes in batches.
Adjust the retry_stagger global var as appropriate. It should keep the load on the delegate (the first K8s master node) reasonable when failed push or download operations are retried.

For example, when deploying 200 nodes, you may want to run Ansible with --forks=50 and --timeout=600, and define retry_stagger: 60.
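The 200-node example could be captured in a small vars file, sketched below. The file name large-deployment.yml is hypothetical, and the values are the ones from the example above rather than tuned recommendations. The image repo overrides are left as a comment because the actual variable names depend on the containers in use.

```yaml
# large-deployment.yml (hypothetical file name)
# Download binaries and container images once, then push to nodes in batches.
download_run_once: true

# Spread out retries of failed push/download operations so that the
# delegate (the first K8s master node) is not overloaded.
retry_stagger: 60

# Override the relevant foo_image_repo vars here to point to an
# intranet registry (names depend on the containers being deployed).
```

Such a file could then be passed with Ansible's extra-vars option alongside the fork and timeout flags, for example `--forks=50 --timeout=600 -e @large-deployment.yml`. The inventory and playbook arguments depend on the local setup.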
