Swarming Raspberry Pi: Docker Swarm Discovery Options

Docker Swarm supports a variety of methods for discovering swarm members, and each has arguments in its favor. In this post I discuss the various methods and my thoughts on each.

Background

I originally started with the idea of having a portable cluster, a “cloud in a box” if you will, so that I could give talks without having to worry about network dependencies and so forth. I was also intrigued by the idea of low-power, inexpensive devices that could be used for building a cloud. Two days after my initial purchase of five Pi B+ boards, the Pi 2 was released. Despite my initial grump, I realized that this opened up the possibility of distributing workloads across a heterogeneous environment, which is an interesting problem space: determining how best to distribute work across an infrastructure.

I still have the goal, for the present, of having a portable cloud. I’ve been challenged, however, to build a larger one than Britain’s GCHQ Pi cloud. It is tempting. Since they’re using all single-core Pis, it wouldn’t be terribly difficult to build a cloud with more oomph and far fewer nodes. Of course, if the workload is I/O intensive, then more members are needed.

At present, my cloud consists of the following:

5x Pi B+ Worker Nodes, 16GB micro SD
5x Pi 2B Worker Nodes, 16GB micro SD
1x Pi 2B Master Node (temporarily being used as a porting/compiling node), 16GB micro SD
2x Pi 2B Data Nodes (one of these will become a Docker registry, among other things)
a. One has 2x 240GB SSD
b. One has a 240GB SSD and a 160GB Spinning disk (for “slow” data)
16 Port 100Mbit Switch. This may shortly be swapped out for gigabit switch(es).

Criteria for Evaluation

I strongly believe that metrics, monitoring, and alerting are necessary in building any infrastructure.

I am seeking maximum portability; my Cloud in a Box™ should be able to do Real Work™ without depending on anything outside the cluster. Additionally, the less I need to know ahead of time, the better. Names trump numbers: if I can use a name, or use a name to look up a number, that is better than having to remember a number.

Given the limited resources of the Pi, lightweight solutions are preferred over heavyweight ones, except where a heavier solution can serve dual purposes.

The Contenders

The list of Discovery Services can be found in the Docker Documentation.

Hosted Discovery Service

The hosted discovery service presents an easy way to test and get started with Swarm. Swarm communicates with the Docker Hub in order to maintain a list of swarm members.
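As a rough illustration of that workflow, here is a minimal sketch using the token-based commands of the standalone swarm binary (create, join, manage, list). It assumes a swarm binary built for ARM is on the PATH of every node, that each Docker daemon listens on TCP port 2375, and that the manager listens on port 3456. The helper names (run, join_node, manage_cluster, list_nodes) and the DRY_RUN switch are made up for this example; by default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of hosted (token) discovery with the standalone `swarm` binary.
# Assumptions, not taken from the original setup: `swarm` is on PATH,
# Docker daemons listen on :2375, the manager listens on :3456.

# DRY_RUN=1 (the default) prints commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Run on each worker: register this node's Docker endpoint under the token.
# Usage: join_node <node_ip> <token>
join_node() {
    run swarm join --addr="$1:2375" "token://$2"
}

# Run on the manager: serve the Docker API for the whole cluster.
# Usage: manage_cluster <token>
manage_cluster() {
    run swarm manage -H tcp://0.0.0.0:3456 "token://$1"
}

# Ask the hosted service which nodes are registered under the token.
# Usage: list_nodes <token>
list_nodes() {
    run swarm list "token://$1"
}

# Typical order (join and manage stay in the foreground when really run):
#   TOKEN=$(swarm create)            # once, anywhere with network access
#   join_node 192.168.1.101 "$TOKEN" # on each worker
#   manage_cluster "$TOKEN"          # on the manager
#   list_nodes "$TOKEN"              # to see what the service knows about
```

Every one of these commands talks to the hosted service, so each step needs an outbound network connection.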

The Good

It’s easy, presented in the tutorial, and is supported by Docker.

The Bad

Unfortunately, the requirement of connecting to the Docker Hub means that it’s not self-contained; it needs a network connection in order to work.

As of today, there are a couple of issues with it:

There is no way to remove a host from the swarm.
docker -H $SWARM_MANAGER info returns what I believe is an incorrect count:

$ sudo docker -H 127.0.0.1:3456 info
Containers: 68
Nodes: 10
 apis-rpi-03: 192.168.1.103:2375
  └ Containers: 5
  └ Reserved CPUs: 0 / 4
  └ Reserved Memory: 0 B / 925.3 MiB
 apis-rpi-02: 192.168.1.102:2375
  └ Containers: 11
  └ Reserved CPUs: 0 / 4
  └ Reserved Memory: 0 B / 925.3 MiB
 apis-rpi-10: 192.168.1.110:2375
  └ Containers: 11
  └ Reserved CPUs: 0 / 1
  └ Reserved Memory: 0 B / 434.4 MiB
 apis-rpi-08: 192.168.1.108:2375
  └ Containers: 6
  └ Reserved CPUs: 0 / 1
  └ Reserved Memory: 0 B / 434.4 MiB
 apis-rpi-06: 192.168.1.106:2375
  └ Containers: 6
  └ Reserved CPUs: 0 / 1
  └ Reserved Memory: 0 B / 434.4 MiB
 apis-rpi-07: 192.168.1.107:2375
  └ Containers: 7
  └ Reserved CPUs: 0 / 1
  └ Reserved Memory: 0 B / 434.4 MiB
 apis-rpi-09: 192.168.1.109:2375
  └ Containers: 7
  └ Reserved CPUs: 0 / 1
  └ Reserved Memory: 0 B / 434.4 MiB
 apis-rpi-04: 192.168.1.104:2375
  └ Containers: 6
  └ Reserved CPUs: 0 / 4
  └ Reserved Memory: 0 B / 925.3 MiB
 apis-rpi-05: 192.168.1.105:2375
  └ Containers: 5
  └ Reserved CPUs: 0 / 4
  └ Reserved Memory: 0 B / 925.3 MiB
 apis-rpi-01: 192.168.1.101:2375
  └ Containers: 4
  └ Reserved CPUs: 0 / 4
  └ Reserved Memory: 0 B / 925.3 MiB
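
One way to cross-check the totals in a listing like this is to add up the per-node lines and compare them with the summary lines at the top. The following is only a sketch: check_swarm_counts is a made-up helper name, and it assumes the layout shown above (unindented Containers: and Nodes: summary lines, indented per-node Containers: lines). It matches on indentation, so it does not care which marker character precedes the per-node fields.

```shell
#!/bin/sh
# check_swarm_counts: read `docker info` output from a Swarm manager on
# stdin and compare the reported totals with what the node list adds up to.
# Hypothetical helper; assumes the output layout shown in this post.
check_swarm_counts() {
    awk '
        # Summary lines start in column one.
        /^Containers:/ { total = $2 }
        /^Nodes:/      { nodes = $2 }

        # Per-node container counts are indented; the count is the last field.
        /^[ \t]+.*Containers:/ { sum += $NF; listed++ }

        END {
            printf "reported containers: %d, summed per node: %d\n", total, sum
            printf "reported nodes: %d, listed nodes: %d\n", nodes, listed
        }
    '
}

# Usage against a live manager (address taken from the listing above):
#   sudo docker -H 127.0.0.1:3456 info | check_swarm_counts
```

If the two numbers on either line differ, the summary and the node list disagree; if they match, any discrepancy lies between the list itself and what is actually running on the nodes.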
