feat(ts): add the support of TWA aggregator to Range and MRange #3262

var-nan · 2025-11-15T05:09:49Z

It closes #3217 .
rangeCommon() in redis_timeseries.cc will add two samples, prev_sample and next_sample to the res vector when the aggregator is twa. These two samples are used to compute the area of the polygons that are at the front of the first bucket and at the end of last bucket.

prev_sample is the biggest sample in the data with timestamp less than or equal to first sample of filtered range and next_sample is the smallest sample in the data with timestamp greater than or equal to the last sample of filtered range.

TODO: test TWA with FILTER_BY_TS/FILTER_BY_VALUE option.

rangeCommon() in redis_timeseries.cc will add two samples, 'prev_sample' and 'next_sample' to the 'res' vector when the aggregator is twa. These two samples are used to compute the area of the polygons that are at the front of the first bucket and at the end of last bucket. prev_sample is the biggest sample in the data with timestamp less than or equal to first sample of filtered range and next_sample is the smallest sample in the data with timestamp greater than or equal to the last sample of filtered range. TODO: test TWA with FILTER_BY_TS/FILTER_BY_VALUE option.

When filtered with FILTER_BY_TS/FILTER_BY_VALUE, the `next` and `prev` samples are discarded while computing the area.

var-nan · 2025-11-17T18:46:02Z

279c9bd will correctly calculate TWA when samples are filtered with FILTER_BY_TS/FITLER_BY_VALUE.

PragmaTwice · 2025-11-18T03:07:34Z

/home/runner/work/kvrocks/kvrocks/src/types/redis_timeseries.cc:91:70: error: variable 'prev_available' is not initialized [cppcoreguidelines-init-variables,-warnings-as-errors]
Suppressed 8485 warnings (8464 in non-user code, 21 NOLINT).
   91 |   bool is_twa_aggregator = aggregator.type == TSAggregatorType::TWA, prev_available, next_available;
Use -header-filter=.* to display errors from all non-system headers. Use -system-headers to display errors from system headers as well.
2 warnings treated as errors
      |                                                                      ^             
      |                                                                                     = false
/home/runner/work/kvrocks/kvrocks/src/types/redis_timeseries.cc:91:86: error: variable 'next_available' is not initialized [cppcoreguidelines-init-variables,-warnings-as-errors]
   91 |   bool is_twa_aggregator = aggregator.type == TSAggregatorType::TWA, prev_available, next_available;
      |                                                                                      ^             
      |                                                                                                     = false

The CI failed due to a clang-tidy report in the changes. Could you fix it to pass the CI?

var-nan · 2025-11-18T03:30:40Z

Some of these errors didn't show up when I ran ./x.py check tidy locally; I guess it's due to a version change.

yezhizi · 2025-11-18T07:05:08Z

Sorry for the wait! Been a bit busy these days. I'll review this later today. : )

src/types/redis_timeseries.cc

some code is refactored for readability.

var-nan · 2025-11-26T14:54:35Z

Thanks @yezhizi . I was about to push the clang-tidy fixes.

src/types/redis_timeseries.cc

sonarqubecloud · 2025-11-28T05:01:36Z

Quality Gate passed

Issues
10 New issues
0 Accepted issues

Measures
0 Security Hotspots
65.0% Coverage on New Code
0.9% Duplication on New Code

See analysis details on SonarQube Cloud

yezhizi · 2025-11-28T04:19:42Z

src/types/redis_timeseries.cc

+  TSSample prev_sample, next_sample;
+  bool is_twa_aggregator = aggregator.type == TSAggregatorType::TWA, prev_available = false, next_available = false;
+  if (is_twa_aggregator) {
+    const bool discard_boundaries = !option.filter_by_ts.empty() || option.filter_by_value.has_value();
+    next_sample = samples.back();
+    samples.pop_back();
+    prev_sample = samples.back();
+    samples.pop_back();
+    // When FILTER_BY_TS/FILTER_BY_VALUE is enabled, discard out-of-boundary samples.
+    prev_available = discard_boundaries ? false : !samples.empty() && (samples.front().ts != prev_sample.ts);
+    next_available = discard_boundaries ? false : !samples.empty() && (samples.back().ts != next_sample.ts);
+  }


I think we can pass next_sample, next_available, etc. as function parameters instead of calculating them inside the AggregateSamplesByRangeOption function. For example, we can add a struct:

struct TWABounds { std::optional<TSSample> prev_sample; std::optional<TSSample> next_sample; };

And modify the function interface to:

AggregateSamplesByRangeOption(std::vector<TSSample> samples, const TSRangeOption &option, const TWABounds&)

yezhizi · 2025-11-28T04:36:59Z

src/types/redis_timeseries.cc

+  auto non_empty_left_bucket_idx = [&spans](size_t curr) {
+    while (--curr && spans[curr].empty());
+    return curr;
+  };
+  auto non_empty_right_bucket_idx = [&spans](size_t curr) {
+    while (++curr < spans.size() && spans[curr].empty());
+    return curr;
+  };
+
+  std::vector<std::pair<TSSample, TSSample>> neighbors;
+  neighbors.reserve(spans.size());
+  for (size_t i = 0; i < spans.size(); i++) {
+    TSSample prev = (i != 0) ? spans[non_empty_left_bucket_idx(i)].back() : prev_sample;
+    TSSample next = (i != (spans.size() - 1)) ? spans[non_empty_right_bucket_idx(i)].front() : next_sample;
+    neighbors.emplace_back(prev, next);
+  }
+


The nested while loops inside the for loop result in O(N^2) time complexity, which can be optimized to O(N). We could:

Iterate from 0 to N to resolve all prev neighbors by maintaining a "last seen non-empty" variable.

Iterate from N to 0 to resolve all next neighbors similarly.

var-nan and others added 2 commits November 14, 2025 23:04

Merge branch 'unstable' into twa_agg

e83a348

PragmaTwice requested a review from yezhizi November 17, 2025 15:25

fix TWA aggregator with FILTER_BY_TS/FILTER_BY_VALUE

279c9bd

When filtered with FILTER_BY_TS/FILTER_BY_VALUE, the `next` and `prev` samples are discarded while computing the area.

fix Clang-tidy errors

9d8e076

yezhizi reviewed Nov 18, 2025

View reviewed changes

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

var-nan and others added 3 commits November 25, 2025 17:55

fix: correct results when EMPTY flag is specified.

9174002

some code is refactored for readability.

Merge branch 'unstable' into twa_agg

d4b814b

fix clang-tidy

ae2268f

yezhizi reviewed Nov 27, 2025

View reviewed changes

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

yezhizi reviewed Nov 27, 2025

View reviewed changes

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

src/types/redis_timeseries.cc Outdated Show resolved Hide resolved

var-nan and others added 2 commits November 27, 2025 09:28

TWA code refactor

021bae7

Merge branch 'unstable' into twa_agg

7e2e5e9

yezhizi reviewed Nov 28, 2025

View reviewed changes

feat(ts): add the support of TWA aggregator to Range and MRange #3262

Are you sure you want to change the base?

feat(ts): add the support of TWA aggregator to Range and MRange #3262

Conversation

var-nan commented Nov 15, 2025 • edited by yezhizi Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

var-nan commented Nov 17, 2025

Uh oh!

PragmaTwice commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

var-nan commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yezhizi commented Nov 18, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

var-nan commented Nov 26, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented Nov 28, 2025

Quality Gate passed

Uh oh!

yezhizi Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

yezhizi Nov 28, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

var-nan commented Nov 15, 2025 •

edited by yezhizi

Loading

PragmaTwice commented Nov 18, 2025 •

edited

Loading

var-nan commented Nov 18, 2025 •

edited

Loading