Skip to content

Conversation

@joseph-isaacs
Copy link
Contributor

No description provided.

@joseph-isaacs joseph-isaacs requested a review from a10y January 23, 2026 19:20
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
@joseph-isaacs joseph-isaacs force-pushed the ji/cuda-host-canonical branch from 3fcdf83 to 53c146a Compare January 23, 2026 19:20
@joseph-isaacs joseph-isaacs added the changelog/feature A new feature label Jan 23, 2026
Signed-off-by: Joe Isaacs <joe.isaacs@live.co.uk>
@codspeed-hq
Copy link

codspeed-hq bot commented Jan 23, 2026

CodSpeed Performance Report

Merging this PR will degrade performance by 29.75%

Comparing ji/cuda-host-canonical (4abf4bd) with develop (c6161f6)

⚠️ Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

Summary

⚡ 3 improved benchmarks
❌ 7 regressed benchmarks
✅ 1252 untouched benchmarks
⏩ 1290 skipped benchmarks1

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Performance Changes

Mode Benchmark BASE HEAD Efficiency
WallTime u8_FoR[10M] 6.3 µs 5.6 µs +12.34%
Simulation canonical_into_non_nullable[(10000, 100, 0.01)] 2.1 ms 2.9 ms -27.39%
Simulation canonical_into_non_nullable[(10000, 100, 0.0)] 1.9 ms 2.7 ms -29.75%
Simulation canonical_into_non_nullable[(10000, 100, 0.1)] 3.7 ms 4.5 ms -18.15%
Simulation canonical_into_nullable[(10000, 10, 0.0)] 528.5 µs 444.1 µs +19.03%
Simulation into_canonical_non_nullable[(10000, 100, 0.01)] 2.2 ms 3 ms -27.05%
Simulation into_canonical_non_nullable[(10000, 100, 0.1)] 3.8 ms 4.6 ms -17.7%
Simulation canonical_into_nullable[(10000, 100, 0.0)] 4.9 ms 4.1 ms +19.79%
Simulation into_canonical_non_nullable[(10000, 100, 0.0)] 1.9 ms 2.7 ms -29.42%
Simulation into_canonical_nullable[(10000, 100, 0.0)] 4.4 ms 5.2 ms -15.62%

Footnotes

  1. 1290 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.

@joseph-isaacs joseph-isaacs changed the title feat[cuda]: dict decimal value feat[cuda]: canonical to_host Jan 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

changelog/feature A new feature

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants