perf: aggregate min/max #8061
perf: aggregate min/max #8061
1 benchmark regressed
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
⚠️ Different runtime environments detected
Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.
⚡ 10 improved benchmarks
❌ 1 regressed benchmark
✅ 1240 untouched benchmarks
🆕 10 new benchmarks
Warning
Please fix the performance issues or acknowledge them on CodSpeed.
Performance Changes
| Mode | Benchmark | BASE |
HEAD |
Efficiency | |
|---|---|---|---|---|---|
| ⚡ | WallTime | cuda/bitpacked_u8/unpack/3bw[100M] |
353 µs | 299.9 µs | +17.69% |
| 🆕 | Simulation | max_i32 |
N/A | 222.6 µs | N/A |
| 🆕 | Simulation | max_i32_nulls_clustered |
N/A | 254.3 µs | N/A |
| 🆕 | Simulation | max_i32_nulls_scattered |
N/A | 1.6 ms | N/A |
| 🆕 | Simulation | max_i64 |
N/A | 436.3 µs | N/A |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(100, 100)] |
358.3 µs | 323.3 µs | +10.84% |
| 🆕 | Simulation | sum_i32 |
N/A | 222.1 µs | N/A |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(1000, 10)] |
211.8 µs | 176.2 µs | +20.17% |
| 🆕 | Simulation | sum_i32_nulls_clustered |
N/A | 236.2 µs | N/A |
| 🆕 | Simulation | sum_i32_nulls_scattered |
N/A | 1.6 ms | N/A |
| 🆕 | Simulation | sum_i64 |
N/A | 600.4 µs | N/A |
| ❌ | Simulation | chunked_varbinview_opt_canonical_into[(1000, 10)] |
187.5 µs | 224.7 µs | -16.54% |
| 🆕 | Simulation | sum_u32 |
N/A | 222.1 µs | N/A |
| 🆕 | Simulation | max_f64 |
N/A | 1.1 ms | N/A |
| ⚡ | Simulation | encode_primitives[u8, (10000, 2)] |
313.9 µs | 278.2 µs | +12.84% |
| ⚡ | Simulation | encode_primitives[u8, (10000, 32)] |
317.7 µs | 282.4 µs | +12.51% |
| ⚡ | Simulation | encode_primitives[u8, (10000, 4)] |
314.2 µs | 278.4 µs | +12.88% |
| ⚡ | Simulation | encode_primitives[u8, (10000, 512)] |
334.9 µs | 299.1 µs | +11.96% |
| ⚡ | Simulation | encode_primitives[u8, (10000, 8)] |
315.2 µs | 279.2 µs | +12.9% |
| ⚡ | Simulation | for_compress_i32 |
753.2 µs | 444.1 µs | +69.61% |
| ... | ... | ... | ... | ... | ... |
ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
Tip
Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.
Comparing claude/great-edison-jrGY0 (6a44f51) with develop (495f30e)