-
Notifications
You must be signed in to change notification settings - Fork 36
logpuller: fix puller resolvedTs stuck #4088
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
logpuller: fix puller resolvedTs stuck #4088
Conversation
Signed-off-by: dongmen <[email protected]>
Summary of ChangesHello @asddongmen, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request addresses a critical issue where the Highlights
🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console. Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
📝 WalkthroughWalkthroughRefactors handleResolvedTs to use a shouldAdvance guard: zero-interval forces heap-min TS and immediate advancement; non-zero intervals use time-based checks and RangeLock.ResolvedTs() as candidate. Also changes default KV client AdvanceIntervalInMs from 0 to 100 and adds a test for throttled behavior. Changes
Sequence Diagram(s)sequenceDiagram
participant TiKV
participant LogPuller
participant RangeLock
participant Heap
participant KVClient
TiKV->>LogPuller: send batched resolvedTs events
LogPuller->>RangeLock: update locked-range state
alt advanceInterval == 0
LogPuller->>Heap: GetHeapMinTs()
Heap-->>LogPuller: heapMinTs
LogPuller->>LogPuller: force advancement (use heapMinTs)
LogPuller->>KVClient: push new resolved TS / update state
else advanceInterval > 0
LogPuller->>LogPuller: check time since lastAdvance
LogPuller->>RangeLock: ResolvedTs()
RangeLock-->>LogPuller: candidateResolvedTs
alt shouldAdvance
LogPuller->>KVClient: push new resolved TS / update state
end
end
LogPuller-->>TiKV: continue processing / ack
Estimated code review effort🎯 4 (Complex) | ⏱️ ~45 minutes Poem
🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing touches
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request addresses an issue where the logpuller's resolved timestamp could get stuck. The fix involves changing the default AdvanceIntervalInMs to 100ms and modifying the logic in handleResolvedTs. The new logic correctly uses rangeLock.ResolvedTs() when advanceInterval is greater than 0, which avoids dependency on a potentially partially updated heap and correctly calculates the minimum resolved timestamp. This is a good fix for the default case.
However, I've noticed that the code path for advanceInterval == 0 still uses GetHeapMinTs(), which can lead to the same 'stuck resolved-ts' issue under certain conditions. I've left a specific comment with a suggestion to make this path robust as well.
Signed-off-by: dongmen <[email protected]>
|
/test all |
Signed-off-by: dongmen <[email protected]>
|
/test all |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: flowbehappy, lidezhu, wk989898 The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What problem does this PR solve?
Issue Number: close #4084
What is changed and how it works?
This PR fixes a case where the logpuller’s resolved-ts could stop advancing when there are large amount (eg. 400k) regions in a single span(or table).
handleResolvedTsnow advances:It also changes the default KVClient
advance-interval-in-msfrom 0 to 100 to throttle resolved-ts advancement and reduce per-event overhead.Check List
Tests
Questions
Will it cause performance regression or break compatibility?
Do you need to update user documentation, design documentation or monitoring documentation?
Release note
Summary by CodeRabbit
Chores
Tests
✏️ Tip: You can customize this high-level summary in your review settings.