[Investigating] 12:18 Received alarm about disruption of JuiceFS metadata service in UCloud beijing region. Investigation started.
12:20 Saw that two follower nodes can't replay change log from leader, kept restarting.
12:23 Restarted the previous leader.
12:26 Service is fully recovered, back to normal.
13:30 Find the bug that cause the failure of applying.
20:30 Deployed the bugfix.
November 16, 2017 12:26 PST
[Resolved] This is caused by a bug introduced in 4.2.2, which have a slightly different logic to apply change log, that introduce different internal result and fail the checksum. The bug was fixed and deployed.