Tracking issue for reducing high latency #4757

Closed
36 of 37 tasks
overvenus opened this issue Mar 3, 2022 · 2 comments · Fixed by #5820 or #6613
Labels
area/ticdc Issues or PRs related to TiCDC. type/enhancement The issue or PR belongs to an enhancement.

Comments

overvenus commented Mar 3, 2022

The status quo and the goal of the tracking issue.

| Scenario | Status quo (2022/03/02) | Goal |
| --- | --- | --- |
| **Planned outage** | | |
| TiCDC rolling upgrade | 2-8 mins | <= 10s |
| TiKV/TiDB rolling upgrade | <= 1 min | <= 30s |
| PD rolling upgrade | <= 1 min | <= 1 min |
| TiKV scale out/off | <= 1 min | <= 30s |
| TiCDC scale out/off | 2-8 mins | <= 5s |
| PD scale out/off | <= 1 min | <= 1 min |
| TiDB scale out/off | <= 1 min | <= 30s |
| **Unplanned outage** (fewer than ⅓ of the total nodes, outage lasts 5 mins) | | |
| TiKV/TiDB outage (power down, disk failure, etc.) | 2-4 mins | <= 2 min |
| TiCDC outage (power down, disk failure, etc.) | 2-8 mins | 2-8 mins |
| PD outage (power down, disk failure, etc.) | 2-8 mins | 2-8 mins |
| **No outage** | | |
| Delay | <= 10s | <= 2s |
| Delay spikes (99%) | <= 1 min | <= 5s |

Extreme high latency (>30 min)

Latency spike

Two-phase scheduling

Two-phase scheduling aims to eliminate the high latency spikes (up to minutes) caused by moving a table from one TiCDC node to another.
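
To illustrate why splitting a table move into two phases can shrink the spike, here is a toy model. It is only a sketch under stated assumptions, not TiCDC's actual scheduler: the phase names ("prepare", "commit"), the `MoveTimings` type, and the timing numbers are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class MoveTimings:
    """Hypothetical costs, in seconds, of moving one table between nodes."""
    stop_source: float   # time for the source node to stop replicating the table
    warm_up_dest: float  # time for the destination node to start and catch up


def one_phase_gap(t: MoveTimings) -> float:
    """Stop the source first, then start the destination.

    Nothing replicates the table during either step, so the
    replication gap is the sum of both costs.
    """
    return t.stop_source + t.warm_up_dest


def two_phase_gap(t: MoveTimings) -> float:
    """Assumed two-phase move.

    Phase 1 (prepare): the destination warms up while the source
    keeps replicating, so this step adds no gap.
    Phase 2 (commit): the source stops and the already-warm
    destination takes over, so only the stop cost remains.
    """
    return t.stop_source


# Made-up numbers: a slow warm-up dominates the one-phase move.
timings = MoveTimings(stop_source=1.0, warm_up_dest=60.0)
print(one_phase_gap(timings))
print(two_phase_gap(timings))
```

In this model the destination's warm-up cost moves out of the window in which nobody replicates the table, which is the kind of minutes-long spike this section targets.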

Graceful shutdown and upgrade

Cross-version graceful upgrade

Reduce p99 latency in normal cases

@overvenus overvenus added the area/ticdc Issues or PRs related to TiCDC. label Mar 3, 2022
@overvenus overvenus self-assigned this Mar 3, 2022
@overvenus overvenus added the type/enhancement The issue or PR belongs to an enhancement. label May 17, 2022
ti-chi-bot pushed a commit that referenced this issue Aug 3, 2022
@overvenus overvenus reopened this Aug 4, 2022
ti-chi-bot pushed a commit that referenced this issue Sep 16, 2022
ti-chi-bot pushed a commit that referenced this issue Oct 13, 2022
@overvenus
Member Author

With the default settings in v6.5.0, changefeed replication lag is less than 2s in both the normal scenario and planned rolling restarts/upgrades. Except for the large-table scenario, I think we have met the requirements described in this issue.
Let's track the large-table scenario in #7720.
