docker集装箱不会自动启动泊坞群

我的系统像昨天一样崩溃,我一直无法恢复。 我没有设置这个,所以我不知道所有的细节,但请要求任何细节。 它曾经是在任何虚拟机重启系统会自动开始工作,但在这次崩溃后,它不再是:

这是我的docker info输出

 Containers: 168 Running: 0 Paused: 0 Stopped: 168 Images: 241 Server Version: 1.12.2 Storage Driver: aufs Root Dir: /var/lib/docker/aufs Backing Filesystem: extfs Dirs: 1228 Dirperm1 Supported: true Logging Driver: json-file Cgroup Driver: cgroupfs Plugins: Volume: local Network: overlay bridge host null Swarm: pending NodeID: bg8sh8m6zm5llezlmcw00nqx6 Is Manager: true ClusterID: 1wfvx3ze7tm1bb56a5zyk9xqs Managers: 1 Nodes: 2 Orchestration: Task History Retention Limit: 5 Raft: Snapshot Interval: 10000 Heartbeat Tick: 1 Election Tick: 3 Dispatcher: Heartbeat Period: 5 seconds CA Configuration: Expiry Duration: 3 months Node Address: ADDRESS //hidden for security reasons Runtimes: runc Default Runtime: runc Security Options: apparmor seccomp Kernel Version: 4.4.0-91-generic Operating System: Ubuntu 16.04.1 LTS OSType: linux Architecture: x86_64 CPUs: 2 Total Memory: 6.804 GiB Name: swarm-manager-1 ID: AXPO:VFSV:TDT3:6X7Y:QNAO:OZJN:U23R:V5S2:FU33:WUNI:CRPK:2E2C Docker Root Dir: /var/lib/docker Debug Mode (client): false Debug Mode (server): false Registry: https://index.docker.io/v1/ WARNING: No swap limit support Insecure Registries: 127.0.0.0/8 

我的docker节点ls输出:

 ID HOSTNAME STATUS AVAILABILITY MANAGER STATUS bg8sh8m6zm5llezlmcw00nqx6 * swarm-manager-1 Ready Active Leader c21j8nzzy3151vv06m54yyd1u swarm-worker-1 Unknown Active 

这里是docker node inspect swarm-manager-1

 [ { "ID": "bg8sh8m6zm5llezlmcw00nqx6", "Version": { "Index": 67823 }, "CreatedAt": "2016-10-13T23:46:00.580142516Z", "UpdatedAt": "2017-08-29T19:48:35.4197366Z", "Spec": { "Role": "manager", "Availability": "active" }, "Description": { "Hostname": "swarm-manager-1", "Platform": { "Architecture": "x86_64", "OS": "linux" }, "Resources": { "NanoCPUs": 2000000000, "MemoryBytes": 7305609216 }, "Engine": { "EngineVersion": "1.12.2", "Plugins": [ { "Type": "Network", "Name": "bridge" }, { "Type": "Network", "Name": "host" }, { "Type": "Network", "Name": "null" }, { "Type": "Network", "Name": "overlay" }, { "Type": "Volume", "Name": "local" } ] } }, "Status": { "State": "ready" }, "ManagerStatus": { "Leader": true, "Reachability": "reachable", "Addr": "ADDRESS" //hidden } } ] 

这里是docker node inspect swarm-worker-1

 [ { "ID": "c21j8nzzy3151vv06m54yyd1u", "Version": { "Index": 67824 }, "CreatedAt": "2017-02-21T05:42:31.467777741Z", "UpdatedAt": "2017-08-29T19:48:35.4252027Z", "Spec": { "Role": "worker", "Availability": "active" }, "Description": { "Hostname": "swarm-worker-1", "Platform": { "Architecture": "x86_64", "OS": "linux" }, "Resources": { "NanoCPUs": 2000000000, "MemoryBytes": 7305609216 }, "Engine": { "EngineVersion": "1.12.2", "Plugins": [ { "Type": "Network", "Name": "bridge" }, { "Type": "Network", "Name": "host" }, { "Type": "Network", "Name": "null" }, { "Type": "Network", "Name": "overlay" }, { "Type": "Volume", "Name": "local" } ] } }, "Status": { "State": "unknown", "Message": "Node moved to \"unknown\" state due to leadership change in cluster" } } ] 

任何想法如何恢复工作?

在swarm worker上执行docker swarm leave ,然后在master上run docker swarm join-token worker并在run docker swarm join-token worker执行结果令牌命令。 它应该开始工作。

崩溃可能导致了一个问题。 这破坏了群体的状态,因此你需要重新创build它