You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
So, when the total downtime exceeds 480s, BR may fail. The possibility is related to how long between two TiKVs down. The longer the outage happens, the lower the possibility of BR failure.
Personally I guess when two outages happen in more than 15 minutes, the total downtime counter for failure can be reset.
Note: restore requires all nodes are online. When one of TiKV was down, it starts retry. And, once this TiKV goes back, the request may still not finish, hence a continuous outage of different TiKV may consume the same retry counter. Once all retry chances consumed, BR will fail.
Bug Report
Please answer these questions before submitting your issue. Thanks!
1. Minimal reproduce step (Required)
1、br restore failed
2、injection tikv failure 5 minutes every 5 minutes and total injection fault twice
br.log.2024-09-10T18.03.34Z.zip
2. What did you expect to see? (Required)
br restore success
3. What did you see instead (Required)
br restore
start time: 2024-09-11 02:03:34, failed time: 2024-09-11 02:19:48
stdout:
Detail BR log in /tmp/br.log.2024-09-10T18.03.34Z
[2024/09/10 18:19:48.220 +00:00] [INFO] [collector.go:73] [DataBase Restore failed summary] [total-ranges=129] [ranges-succeed=128] [ranges-failed=1] [split-region=9m5.021030894s] [restore-ranges=7897] [unit-name=file] [error=rpc error: code = Unavailable desc = Cancelling all calls; rpc error: code = Unavailable desc = connection error: desc = \transport: error while dialing: dial tcp 10.200.68.65:20160: connect: connection refused; rpc error: code = Unavailable desc = connection error
4. What is your TiDB version? (Required)
./tidb-server -V
Release Version: v6.5.11
Edition: Community
Git Commit Hash: 305cf42
Git Branch: HEAD
UTC Build Time: 2024-09-10 08:34:23
GoVersion: go1.19.13
Race Enabled: false
TiKV Min Version: 6.2.0-alpha
Check Table Before Drop: false
Store: unistore
2024-09-11T02:03:30.451+0800
./br -V
Release Version: v6.5.11
Git Commit Hash: 305cf42
Git Branch: HEAD
Go Version: go1.19.13
UTC Build Time: 2024-09-10 08:35:44
Race Enabled: false
The text was updated successfully, but these errors were encountered: