Docker Swarm 集群模式实操 | 隔叶黄莺 Yanbin Blog

2020-03-22 | 阅读(735)

在正式进入 Kubernetes 之前希望能对早先的 Docker Swarm 有所了解，虽然它目前基本上没什么用处。Swarm 提供的是 Docker 宿主机的集群，集群中至少有一个 Manager(或多个), 再加上 0 或多个 Worker。它的大意是有个 Docker 服务(容器)通过 Manager 告诉用所管理的 Swarm 集群来执行，它就会在该集群中找到相应的宿主机来执行。Manager 和 Workder 都可用来运行 Docker 容器，但只有 Manager 才能执行部分管理命令。

下面两张图很好的体现了我们平时在一个宿主机上和 Swarm 集群宿主机上运行 Docker 容器的区别

所有容器自己扛

容器大家一起扛

Docker 在 1.12 之后自带了 Swarm 模式。本文实质上对使用Docker Swarm模式搭建Swarm集群的一次实际操作与体验。只是自己方便使用了三个 Vagrant 虚拟机进行实践，其中一个 Swarm Manager 节点，两个 worker 节点

ubuntu-master: 172.28.128.6
ubuntu-worker1: 172.28.128.7
ubuntu-worker2: 172.28.128.7

所用的 Vagrantfile 是

Vagrant.configure("2") do |config|
  config.vm.box = "ubuntu/disco64"
  config.vm.hostname = "ubuntu-master"    #相应的 worker 为 ubuntu-worker1 和 ubuntu-worker2
  config.vm.network "private_network", type: "dhcp"
end

Vagrant.configure("2") do |config|

config.vm.box = "ubuntu/disco64"

config.vm.hostname = "ubuntu-master" #相应的 worker 为 ubuntu-worker1 和 ubuntu-worker2

config.vm.network "private_network", type: "dhcp"

end

vagrant up, vagrant ssh 进入机器后，sudo su - 切换到 root 用户，用 snap install docker 或 apt install docker.io 安装 docker 都行，然后就可用 docker swarm 命令了。

创建 Swarm 集群

一个 Swarm 集群中至少要有一个 Manager 节点

创建 Manager 节点

在 ubuntu-master 机器上执行

root@ubuntu-master:~# docker swarm init --advertise-addr 172.28.128.6
Swarm initialized: current node (sjfg7nljuyt54yapffvosrzmh) is now a manager.

To add a worker to this swarm, run the following command:

    docker swarm join --token SWMTKN-1-3wnd3z2xp49j6vyf6nt8kiuz38bjgjbbs90kz8x48z6dhyr7rc-c2cwfzg2jzhi3qz12wicn89a0 172.28.128.6:2377

To add a manager to this swarm, run 'docker swarm join-token manager' and follow the instructions.

root@ubuntu-master:~# docker swarm init --advertise-addr 172.28.128.6

Swarm initialized: current node (sjfg7nljuyt54yapffvosrzmh) is now a manager.

To add a worker to this swarm, run the following command:

docker swarm join --token SWMTKN-1-3wnd3z2xp49j6vyf6nt8kiuz38bjgjbbs90kz8x48z6dhyr7rc-c2cwfzg2jzhi3qz12wicn89a0 172.28.128.6:2377

To add a manager to this swarm, run 'docker swarm join-token manager' and follow the instructions.

--addvertise-addr 表示 worker 节点与 manager 的联系地址与端口，所以 master 的 2377 端口要打开。上面同时显示了在 worker 中如何加入到 Swarm 集群中的命令，在 master 上用以下两命令可随时查看加入 worker 或新 manager 的完整命令

root@ubuntu-master: ~# docker swarm join-token worker
......
root@ubuntu-master: ~# docker swarm join-token manager
....

root@ubuntu-master: ~# docker swarm join-token worker

......

root@ubuntu-master: ~# docker swarm join-token manager

....

这时可使用 docker info 和 docker node ls 查看 Swarm 集群的信息

root@ubuntu-master:~# docker info
......
Swarm: active
 NodeID: sjfg7nljuyt54yapffvosrzmh
 Is Manager: true
 ClusterID: r2cr7km655esw58olbc6gyncn
 Managers: 1
 Nodes: 1
 Default Address Pool: 10.0.0.0/8
 SubnetSize: 24
 Orchestration:
  Task History Retention Limit: 5
......

root@ubuntu-master:~# docker node ls
ID                            HOSTNAME            STATUS              AVAILABILITY        MANAGER STATUS      ENGINE VERSION
sjfg7nljuyt54yapffvosrzmh *   ubuntu-master       Ready               Active              Leader              18.09.9

root@ubuntu-master:~# docker info

......

Swarm: active

NodeID: sjfg7nljuyt54yapffvosrzmh

Is Manager: true

ClusterID: r2cr7km655esw58olbc6gyncn

Managers: 1

Nodes: 1

Default Address Pool: 10.0.0.0/8

SubnetSize: 24

Orchestration:

Task History Retention Limit: 5

......

root@ubuntu-master:~# docker node ls

ID HOSTNAME STATUS AVAILABILITY MANAGER STATUS ENGINE VERSION

sjfg7nljuyt54yapffvosrzmh * ubuntu-master Ready Active Leader 18.09.9

加入 Worker 节点到 Swarm 集群

在 ubuntu-worker1 和 ubuntu-worker2 两个节点上执行相同的命令

root@ubuntu-worker1:~# docker swarm join --token SWMTKN-1-3wnd3z2xp49j6vyf6nt8kiuz38bjgjbbs90kz8x48z6dhyr7rc-c2cwfzg2jzhi3qz12wicn89a0 172.28.128.6:2377
This node joined a swarm as a worker.

1 2	root@ubuntu-worker1:~# docker swarm join --token SWMTKN-1-3wnd3z2xp49j6vyf6nt8kiuz38bjgjbbs90kz8x48z6dhyr7rc-c2cwfzg2jzhi3qz12wicn89a0 172.28.128.6:2377 This node joined a swarm as a worker.

完后在 manager 机器上再用 docker info 和 docker node ls 查看下

root@ubuntu-master:~# docker info
......
Swarm: active
 NodeID: sjfg7nljuyt54yapffvosrzmh
 Is Manager: true
 ClusterID: r2cr7km655esw58olbc6gyncn
 Managers: 1
 Nodes: 3
 Default Address Pool: 10.0.0.0/8
 SubnetSize: 24
 Orchestration:
  Task History Retention Limit: 5
......
root@ubuntu-master:~# docker node ls
ID                            HOSTNAME            STATUS              AVAILABILITY        MANAGER STATUS      ENGINE VERSION
sjfg7nljuyt54yapffvosrzmh *   ubuntu-master       Ready               Active              Leader              18.09.9
q82ndq11h1mie6lwmog5pfymu     ubuntu-worker1      Ready               Active                                  18.09.7
xuzc43qrrdirkmabivdzg8ruc     ubuntu-worker2      Ready               Active                                  18.09.9

root@ubuntu-master:~# docker info

......

Swarm: active

NodeID: sjfg7nljuyt54yapffvosrzmh

Is Manager: true

ClusterID: r2cr7km655esw58olbc6gyncn

Managers: 1

Nodes: 3

Default Address Pool: 10.0.0.0/8

SubnetSize: 24

Orchestration:

Task History Retention Limit: 5

......

root@ubuntu-master:~# docker node ls

ID HOSTNAME STATUS AVAILABILITY MANAGER STATUS ENGINE VERSION

sjfg7nljuyt54yapffvosrzmh * ubuntu-master Ready Active Leader 18.09.9

q82ndq11h1mie6lwmog5pfymu ubuntu-worker1 Ready Active 18.09.7

xuzc43qrrdirkmabivdzg8ruc ubuntu-worker2 Ready Active 18.09.9

在 worker 上看到的 docker info 信息略有不同，且 docker node 命令是不能在 worker 上执行的。

在 Swarm 集群中部署服务

现在我们有了一个由一个 manager 和两个 worker 组成的 Swarm 集群了，可以开始在这个集群中运行我们的 docker 容器(服务) 了。

有别于我们通常用 docker run 的方式在当前机器为宿主机上运行容器，在 Swarm 集群中部署服务要在 manager 节点上用 docker service 命令。

root@ubuntu-master:~# docker service create --replicas 2 --name helloworld alpine ping docker.com
15aptyokhcp42qa47rfmhgx82
overall progress: 2 out of 2 tasks
1/2: running   [==================================================>]
2/2: running   [==================================================>]
verify: Service converged

root@ubuntu-master:~# docker service create --replicas 2 --name helloworld alpine ping docker.com

15aptyokhcp42qa47rfmhgx82

overall progress: 2 out of 2 tasks

1/2: running [==================================================>]

2/2: running [==================================================>]

verify: Service converged

--replicas 指定启动服务由几个实例组成
--name 与 docker run 时的 --name 一样，指定服务名
alpine 为 docker 镜像名称
ping docker.com 服务执行的命令，由于 Linux 下的 ping 命令是不会停的，因此后面可以观察运行状态

查看 Swarm 中服务信息

docker service ls 查看服务列表

root@ubuntu-master:~# docker service ls
ID                  NAME                MODE                REPLICAS            IMAGE               PORTS
15aptyokhcp4        helloworld          replicated          2/2                 alpine:latest

root@ubuntu-master:~# docker service ls

ID NAME MODE REPLICAS IMAGE PORTS

15aptyokhcp4 helloworld replicated 2/2 alpine:latest

docker service inspect 查看服务信息，与 docker inspect 查看容器信息类似

root@ubuntu-master:~# docker service inspect --pretty helloworld

ID:     15aptyokhcp42qa47rfmhgx82
Name:       helloworld
Service Mode:   Replicated
 Replicas:  2
Placement:
UpdateConfig:
 Parallelism:   1
......
ContainerSpec:
 Image:     alpine:latest@sha256:ab00606a42621fb68f2ed6ad3c88be54397f981a7b70a79db3d1172b11c4367d
 Args:      ping docker.com
 Init:      false

root@ubuntu-master:~# docker service inspect --pretty helloworld

ID: 15aptyokhcp42qa47rfmhgx82

Name: helloworld

Service Mode: Replicated

Replicas: 2

Placement:

UpdateConfig:

Parallelism: 1

......

ContainerSpec:

Image: alpine:latest@sha256:ab00606a42621fb68f2ed6ad3c88be54397f981a7b70a79db3d1172b11c4367d

Args: ping docker.com

Init: false

类似于 docker ps 的 docker service ps <SERVICE-ID> 命令可查询哪个(些) 节点正在运行该服务

root@ubuntu-master:~# docker service ps 15
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE           ERROR               PORTS
ay8qybtmtpqo        helloworld.1        alpine:latest       ubuntu-worker1      Running             Running 6 minutes ago
ou1gltvexpkc        helloworld.2        alpine:latest       ubuntu-master       Running             Running 6 minutes ago

root@ubuntu-master:~# docker service ps 15

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

ay8qybtmtpqo helloworld.1 alpine:latest ubuntu-worker1 Running Running 6 minutes ago

ou1gltvexpkc helloworld.2 alpine:latest ubuntu-master Running Running 6 minutes ago

我们注意到前面指定了 --replicas=2 运行的两份服务分别在 ubuntu-worker1 和 ubuntu-master 两个节点上。所以我们可以跑到所有节点上用 docker ps 验证一下

root@ubuntu-master:~# docker ps
CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS              PORTS               NAMES
16eb4dbfc88b        alpine:latest       "ping docker.com"   7 minutes ago       Up 7 minutes                            helloworld.2.ou1gltvexpkcbna6rttbjbosl

root@ubuntu-master:~# docker ps

CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES

16eb4dbfc88b alpine:latest "ping docker.com" 7 minutes ago Up 7 minutes helloworld.2.ou1gltvexpkcbna6rttbjbosl

root@ubuntu-worker1:~# docker ps
CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS              PORTS               NAMES
5b4367423853        alpine:latest       "ping docker.com"   7 minutes ago       Up 7 minutes                            helloworld.1.ay8qybtmtpqotpokauyde7ob0

root@ubuntu-worker1:~# docker ps

CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES

5b4367423853 alpine:latest "ping docker.com" 7 minutes ago Up 7 minutes helloworld.1.ay8qybtmtpqotpokauyde7ob0

root@ubuntu-worker2:~# docker ps
CONTAINER ID        IMAGE               COMMAND             CREATED             STATUS              PORTS               NAMES

1 2	root@ubuntu-worker2:~# docker ps CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES

没问题，Docker Swarm 的集群就相当于我们跑到每个独立结节上按需运行 docker run --name helloworld.x alpine ping docker.com 的效果。

在 Swarm 集群中服务的动态伸缩

有集群的好处就是我们可以在群集中对服务进行动态伸缩，规模超过集群节点数也不是问题，无外乎在一个节点上会运行多个服务。

root@ubuntu-master:~# docker service scale helloworld=5
helloworld scaled to 5
overall progress: 5 out of 5 tasks
1/5: running   [==================================================>]
2/5: running   [==================================================>]
3/5: running   [==================================================>]
4/5: running   [==================================================>]
5/5: running   [==================================================>]
verify: Service converged

root@ubuntu-master:~# docker service scale helloworld=5

helloworld scaled to 5

overall progress: 5 out of 5 tasks

1/5: running [==================================================>]

2/5: running [==================================================>]

3/5: running [==================================================>]

4/5: running [==================================================>]

5/5: running [==================================================>]

verify: Service converged

helloworld=5 相当于创建服务时指定的 --replicas=5 一样

再查看一下服务节点分配的列表

root@ubuntu-master:~# docker service ps helloworld
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE            ERROR               PORTS
ay8qybtmtpqo        helloworld.1        alpine:latest       ubuntu-worker1      Running             Running 14 minutes ago
ou1gltvexpkc        helloworld.2        alpine:latest       ubuntu-master       Running             Running 14 minutes ago
8xhmp5ehboo6        helloworld.3        alpine:latest       ubuntu-worker2      Running             Running 47 seconds ago
kagiwq7sh105        helloworld.4        alpine:latest       ubuntu-worker2      Running             Running 47 seconds ago
ijmgo4udbe1d        helloworld.5        alpine:latest       ubuntu-worker1      Running             Running 47 seconds ago

root@ubuntu-master:~# docker service ps helloworld

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

ay8qybtmtpqo helloworld.1 alpine:latest ubuntu-worker1 Running Running 14 minutes ago

ou1gltvexpkc helloworld.2 alpine:latest ubuntu-master Running Running 14 minutes ago

8xhmp5ehboo6 helloworld.3 alpine:latest ubuntu-worker2 Running Running 47 seconds ago

kagiwq7sh105 helloworld.4 alpine:latest ubuntu-worker2 Running Running 47 seconds ago

ijmgo4udbe1d helloworld.5 alpine:latest ubuntu-worker1 Running Running 47 seconds ago

其中有两个节点运行了两个服务。不妨往小里试一番

root@ubuntu-master:~# docker service scale helloworld=1
helloworld scaled to 1
overall progress: 1 out of 1 tasks
1/1: running   [==================================================>]
verify: Service converged

root@ubuntu-master:~# docker service scale helloworld=1

helloworld scaled to 1

overall progress: 1 out of 1 tasks

1/1: running [==================================================>]

verify: Service converged

再看服务列表

root@ubuntu-master:~# docker service ps helloworld
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE            ERROR               PORTS
ay8qybtmtpqo        helloworld.1        alpine:latest       ubuntu-worker1      Running             Running 17 minutes ago

root@ubuntu-master:~# docker service ps helloworld

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

ay8qybtmtpqo helloworld.1 alpine:latest ubuntu-worker1 Running Running 17 minutes ago

在每一次用 docker service scale helloworld=<NUM> 之后可以到具体节点上去用 docker ps 查看当前节点所运行的容器来验证。

到这时候我们真切的体验到了 Swarm 集群一些好处，但实际应用场景好像不是这么一回事。实践中节点都是动态增减的，除非首先启动一个 manager 节点，其后新启的节点自动加为 worker, 节点消亡后自动退出 Swarm 集群。

下面是 Swarm 集群中服务的管理

删除 Swarm 集群中的服务

用 docker service rmi helloworld 命令删除服务。

更新 Swarm 集群中的服务

比如要更新服务中的镜像版本，基本命令是 docker service update --image alpine:3.10 helloworld, 以下是完整演示把镜像从 alpine:latest 更新到 alpine:3.10

root@ubuntu-master:~# docker service create --replicas 3 --name helloworld alpine ping docker.com
pz8ifs41o90fren4hc6dgc1gz
overall progress: 3 out of 3 tasks
1/3: running   [==================================================>]
2/3: running   [==================================================>]
3/3: running   [==================================================>]
verify: Service converged
root@ubuntu-master:~# docker service ps helloworld
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE                ERROR               PORTS
qhmkkife291l        helloworld.1        alpine:latest       ubuntu-worker1      Running             Running about a minute ago
ia6bpvky5fdd        helloworld.2        alpine:latest       ubuntu-worker2      Running             Running about a minute ago
i6ux7j6jpwqi        helloworld.3        alpine:latest       ubuntu-master       Running             Running about a minute ago
root@ubuntu-master:~# docker service update --image alpine:3.10 helloworld
helloworld
overall progress: 3 out of 3 tasks                                     //这里可以看到怎么动态的更新过程
1/3: running   [==================================================>]
2/3: running   [==================================================>]
3/3: running   [==================================================>]
verify: Service converged
root@ubuntu-master:~# docker service ps helloworld                     //显示了由  latest 更新到了 3.10
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE             ERROR               PORTS
lgqlxpkdjyl8        helloworld.1        alpine:3.10         ubuntu-worker1      Running             Running 14 seconds ago
qhmkkife291l         \_ helloworld.1    alpine:latest       ubuntu-worker1      Shutdown            Shutdown 15 seconds ago
5qj7hf6dm75x        helloworld.2        alpine:3.10         ubuntu-worker2      Running             Running 26 seconds ago
ia6bpvky5fdd         \_ helloworld.2    alpine:latest       ubuntu-worker2      Shutdown            Shutdown 27 seconds ago
p1aibrthhs4k        helloworld.3        alpine:3.10         ubuntu-master       Running             Running 38 seconds ago
i6ux7j6jpwqi         \_ helloworld.3    alpine:latest       ubuntu-master       Shutdown            Shutdown 39 seconds ago

root@ubuntu-master:~# docker service create --replicas 3 --name helloworld alpine ping docker.com

pz8ifs41o90fren4hc6dgc1gz

overall progress: 3 out of 3 tasks

1/3: running [==================================================>]

2/3: running [==================================================>]

3/3: running [==================================================>]

verify: Service converged

root@ubuntu-master:~# docker service ps helloworld

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

qhmkkife291l helloworld.1 alpine:latest ubuntu-worker1 Running Running about a minute ago

ia6bpvky5fdd helloworld.2 alpine:latest ubuntu-worker2 Running Running about a minute ago

i6ux7j6jpwqi helloworld.3 alpine:latest ubuntu-master Running Running about a minute ago

root@ubuntu-master:~# docker service update --image alpine:3.10 helloworld

helloworld

overall progress: 3 out of 3 tasks //这里可以看到怎么动态的更新过程

1/3: running [==================================================>]

2/3: running [==================================================>]

3/3: running [==================================================>]

verify: Service converged

root@ubuntu-master:~# docker service ps helloworld //显示了由 latest 更新到了 3.10

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

lgqlxpkdjyl8 helloworld.1 alpine:3.10 ubuntu-worker1 Running Running 14 seconds ago

qhmkkife291l \_ helloworld.1 alpine:latest ubuntu-worker1 Shutdown Shutdown 15 seconds ago

5qj7hf6dm75x helloworld.2 alpine:3.10 ubuntu-worker2 Running Running 26 seconds ago

ia6bpvky5fdd \_ helloworld.2 alpine:latest ubuntu-worker2 Shutdown Shutdown 27 seconds ago

p1aibrthhs4k helloworld.3 alpine:3.10 ubuntu-master Running Running 38 seconds ago

i6ux7j6jpwqi \_ helloworld.3 alpine:latest ubuntu-master Shutdown Shutdown 39 seconds ago

更新时为了尽量不中断服务，Swarm 会进行逐个服务停止，更新，再启动。

Swarm 中节点的离线与上线

对运行服务中的 Swarm 集群，我们可以让某个点节离线，那么其上的服务就会转移到另一个可用的节点上。离线的节点可以重新上线，但先前上面的服务不会重新转移过来，只要在进行服务伸缩或创建新服务时才会分配到新上线的节点上。看下面的一个完整演示

root@ubuntu-master:~# docker service create --replicas 3 --name helloworld alpine ping docker.com
zn19c80zdpehq40dwz25lgton
overall progress: 3 out of 3 tasks
1/3: running   [==================================================>]
2/3: running   [==================================================>]
3/3: running   [==================================================>]
verify: Service converged
root@ubuntu-master:~# docker service ps helloworld          //每个节点上都运行一个容器
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE            ERROR               PORTS
r36yh4yfyrjj        helloworld.1        alpine:latest       ubuntu-worker1      Running             Running 13 seconds ago
2fyzydfs2n8w        helloworld.2        alpine:latest       ubuntu-worker2      Running             Running 13 seconds ago
smqv1mfh1lba        helloworld.3        alpine:latest       ubuntu-master       Running             Running 13 seconds ago
root@ubuntu-master:~# docker node update --availability drain ubuntu-worker1
ubuntu-worker1
root@ubuntu-master:~# docker service ps helloworld          // ubuntu-worker1 离线后，其上的任务转移到了 ubuntu-master 节点上
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE            ERROR               PORTS
5z10pj4z9ldq        helloworld.1        alpine:latest       ubuntu-master       Running             Running 1 second ago
r36yh4yfyrjj         \_ helloworld.1    alpine:latest       ubuntu-worker1      Shutdown            Shutdown 2 seconds ago
2fyzydfs2n8w        helloworld.2        alpine:latest       ubuntu-worker2      Running             Running 50 seconds ago
smqv1mfh1lba        helloworld.3        alpine:latest       ubuntu-master       Running             Running 50 seconds ago
root@ubuntu-master:~# docker node update --availability active ubuntu-worker1
ubuntu-worker1
root@ubuntu-master:~# docker service ps helloworld         // ubuntu-worker1 重新上线后，任务分配情况保持不变
ID                  NAME                IMAGE               NODE                DESIRED STATE       CURRENT STATE                ERROR               PORTS
5z10pj4z9ldq        helloworld.1        alpine:latest       ubuntu-master       Running             Running 56 seconds ago
r36yh4yfyrjj         \_ helloworld.1    alpine:latest       ubuntu-worker1      Shutdown            Shutdown 56 seconds ago
2fyzydfs2n8w        helloworld.2        alpine:latest       ubuntu-worker2      Running             Running about a minute ago
smqv1mfh1lba        helloworld.3        alpine:latest       ubuntu-master       Running             Running about a minute ago

root@ubuntu-master:~# docker service create --replicas 3 --name helloworld alpine ping docker.com

zn19c80zdpehq40dwz25lgton

overall progress: 3 out of 3 tasks

1/3: running [==================================================>]

2/3: running [==================================================>]

3/3: running [==================================================>]

verify: Service converged

root@ubuntu-master:~# docker service ps helloworld //每个节点上都运行一个容器

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

r36yh4yfyrjj helloworld.1 alpine:latest ubuntu-worker1 Running Running 13 seconds ago

2fyzydfs2n8w helloworld.2 alpine:latest ubuntu-worker2 Running Running 13 seconds ago

smqv1mfh1lba helloworld.3 alpine:latest ubuntu-master Running Running 13 seconds ago

root@ubuntu-master:~# docker node update --availability drain ubuntu-worker1

ubuntu-worker1

root@ubuntu-master:~# docker service ps helloworld // ubuntu-worker1 离线后，其上的任务转移到了 ubuntu-master 节点上

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

5z10pj4z9ldq helloworld.1 alpine:latest ubuntu-master Running Running 1 second ago

r36yh4yfyrjj \_ helloworld.1 alpine:latest ubuntu-worker1 Shutdown Shutdown 2 seconds ago

2fyzydfs2n8w helloworld.2 alpine:latest ubuntu-worker2 Running Running 50 seconds ago

smqv1mfh1lba helloworld.3 alpine:latest ubuntu-master Running Running 50 seconds ago

root@ubuntu-master:~# docker node update --availability active ubuntu-worker1

ubuntu-worker1

root@ubuntu-master:~# docker service ps helloworld // ubuntu-worker1 重新上线后，任务分配情况保持不变

ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS

5z10pj4z9ldq helloworld.1 alpine:latest ubuntu-master Running Running 56 seconds ago

r36yh4yfyrjj \_ helloworld.1 alpine:latest ubuntu-worker1 Shutdown Shutdown 56 seconds ago

2fyzydfs2n8w helloworld.2 alpine:latest ubuntu-worker2 Running Running about a minute ago

smqv1mfh1lba helloworld.3 alpine:latest ubuntu-master Running Running about a minute ago

管理 Swarm 集群

最后是几个管理 Swarm 集群的命令

worker 离开 Swarm 集群

root@ubuntu-worker1:~# docker swarm leave
Node left the swarm.

1 2	root@ubuntu-worker1:~# docker swarm leave Node left the swarm.

离开后用 docker node ls 看到的节点状态是 down

root@ubuntu-master:~# docker node ls
ID                            HOSTNAME            STATUS              AVAILABILITY        MANAGER STATUS      ENGINE VERSION
sjfg7nljuyt54yapffvosrzmh *   ubuntu-master       Ready               Active              Leader              18.09.9
q82ndq11h1mie6lwmog5pfymu     ubuntu-worker1      Down                Active                                  18.09.7
xuzc43qrrdirkmabivdzg8ruc     ubuntu-worker2      Down                Active                                  18.09.9

root@ubuntu-master:~# docker node ls

ID HOSTNAME STATUS AVAILABILITY MANAGER STATUS ENGINE VERSION

sjfg7nljuyt54yapffvosrzmh * ubuntu-master Ready Active Leader 18.09.9

q82ndq11h1mie6lwmog5pfymu ubuntu-worker1 Down Active 18.09.7

xuzc43qrrdirkmabivdzg8ruc ubuntu-worker2 Down Active 18.09.9

注意 worker 离开后，在 Swarm 集群中运行的服务仍然会依赖 --replicas=<NUM> 进行重新分配。如果所有 worker 都删除后，服务全都会部署到 Manager 节点上。如果需要从 docker node ls 中删除节点，需要用

root@ubuntu-master:~# docker node rm ubuntu-worker1
ubuntu-worker1
root@ubuntu-master:~# docker node rm --force ubuntu-worker2
ubuntu-worker2

root@ubuntu-master:~# docker node rm ubuntu-worker1

ubuntu-worker1

root@ubuntu-master:~# docker node rm --force ubuntu-worker2

ubuntu-worker2

删不掉就加上 --force 参数

最后 manager 节点的退出相当于 Swarm 集群的消失

root@ubuntu-master:~# docker swarm leave --force
Node left the swarm.
oot@ubuntu-master:~# docker node ls
Error response from daemon: This node is not a swarm manager. Use "docker swarm init" or "docker swarm join" to connect this node to swarm and try again.

root@ubuntu-master:~# docker swarm leave --force

Node left the swarm.

oot@ubuntu-master:~# docker node ls

Error response from daemon: This node is not a swarm manager. Use "docker swarm init" or "docker swarm join" to connect this node to swarm and try again.

Swarm 集群最络也被删除了。

这差不多是 Docker Swarm 的基本的内容了，如果 Swarm 集群创建有问题，请留意以下两个端口号没被防火墙挡住

7946 端口，用于集群节点之间的通信
4789 端口，用于 overlay 网络流量

接下来就会移步到 Docker compose, 甚至 compose 可以部署到 Swarm 集群中，终极目标还是会去了解 Kubernetes。

本文链接 https://yanbin.blog/docker-swarm-cluster-in-action/, 来自隔叶黄莺 Yanbin Blog

1 Comment

Inline Feedbacks

View all comments

Docker Compose 实践 | 隔叶黄莺 Yanbin Blog - 软件编程实践

5 years ago

[…] Kubernetes 进发，上一篇 Docker Swarm 集群模式实操了解完 Swarm 后，有必要对 Docker Compose 了解一番。Docker Swarm 是把 Docker […]

CTO on 让 Putty 保存密码，自动登陆的三种方法用了-load 载入对应的session就不要-ssh来连服务器了,不然seesion里面存的东西都没用,后面带-l 和-pw配用户名与密码...
天天下载 on Python 3.12 新特性很棒的网站，感谢分享
gidroizolyaciya cena_gwsl on Java, Python 两种形式的 base64encodeгидроизоляция цена http://gidroizolyaciya-cena-1.ru/ .
PKSpin Game on 精彩开发网站链接PKSpin Game is a free Android gaming application and is free to u...
Polo on 想选一种动态语言＋跨平台界面组件的组合，希望大家给点意见Perl + Tkx