feat: enable rvr execution of benchmarks by gdmlcjs · Pull Request #2753 · openvm-org/openvm

gdmlcjs · 2026-04-24T22:04:10Z

Enables running benchmarks through rvr extension. Benchmark tests do not check execution correctness and currently execution involving extensions other than RV32IM fail because VmRvrExtension trait implementation is not properly wired.

closes INT-7480

github-actions · 2026-04-24T22:04:30Z

Claude finished @gdmlcjs's task in 3m 3s —— View job

Code Review: PR #2753 — feat: enable rvr execution of benchmarks

Gather context and read changed files
Review benchmark execution logic
Review RVR metered changes
Post review feedback

Overall this is a clean, well-structured change that follows the existing patterns (AOT caching, feature-gated branches) closely. A few observations:

Feature flag structure looks correct

The priority chain aot > rvr > interpreted is consistent across benchmark_execute and benchmark_execute_metered. For benchmark_execute_metered_cost, the simpler rvr / not(rvr) split is fine because the underlying metered_cost_instance() API in vm.rs is itself gated #[cfg(not(feature = "rvr"))] (covering both aot and default modes). No issue here.

The rvr feature in Cargo.toml correctly propagates to all extension circuit crates — this was the missing piece that the PR description mentions.

Nits / suggestions

rvr-openvm-lift dependency may be unnecessary (Cargo.toml:30, Cargo.toml:68)
The dep:rvr-openvm-lift is listed in the rvr feature and added as an optional dependency, but it's never directly imported anywhere in the benchmark crate (no use rvr_openvm_lift::... in any .rs file). It appears to be purely transitive through openvm-circuit/rvr. If it's only needed to ensure the dependency is pulled in, the sub-crate feature flags (openvm-circuit/rvr, etc.) should already handle that. Consider removing it unless there's a reason it needs to be a direct dependency.
Duplicated VM setup boilerplate (execute.rs:386-394 vs execute.rs:414-423 vs execute.rs:443-446)
The three create_*_rvr_instance functions each re-derive executor_idx_to_air_idx by constructing a VirtualMachine. create_rvr_instance and create_metered_rvr_instance create a full VirtualMachine just for this mapping, while create_metered_cost_rvr_instance pulls it from metered_cost_setup(). Consider extracting the executor_idx_to_air_idx computation into a shared OnceLock (similar to how metered_cost_setup already caches it) so you don't spin up redundant VMs. This follows the existing pattern — metered_cost_setup() already caches (MeteredCostCtx, Vec<usize>) in a OnceLock.
TODO comments in core crate are a nice callout (metered.rs:759-760, metered_cost.rs:231-232)
The TODOs about hoisting compilation to instance construction are good breadcrumbs. These are performance-relevant for benchmarks since compile_metered / compile_metered_cost are called on every execution rather than once at instance creation time. Worth tracking as a follow-up if this becomes a bottleneck in benchmark measurements.
Minor: benchmark_execute_metered_cost is ignore = true (execute.rs:552)
This benchmark is annotated with ignore = true in the #[divan::bench] attribute, meaning the newly added RVR path won't actually run unless explicitly un-ignored. This is fine if intentional (the original was also ignored), but worth noting that the new RVR path hasn't been exercised via CI benchmarks yet.

No issues found

Feature gate logic is sound and consistent with the existing AOT pattern
The #[cfg] gates don't create dead code or unreachable paths
Cache + Arc pattern for instance reuse across benchmark iterations is correct
The MeteredCtx.clone() in bench closures matches how the AOT variant handles it

Looks good to merge.

github-actions · 2026-04-24T22:09:23Z

group	app.proof_time_ms	app.cycles	leaf.proof_time_ms
fibonacci	3,870	12,000,265	960
keccak	18,580	18,655,329	3,307
sha2_bench	9,029	14,793,960	1,397
regex	1,433	4,137,067	376
ecrecover	642	123,583	271
pairing	902	1,745,757	285
kitchen_sink	2,090	2,579,903	436

Note: cells_used metrics omitted because CUDA tracegen does not expose unpadded trace heights.

Commit: 0c3ad8c

Benchmark Workflow

shuklaayush

it's okay for now since we're going to remove aot later but i wonder if we can share more code between executor types. maybe we should have all executors conform to some trait interface

gdmlcjs · 2026-04-28T22:42:25Z

Yes, that would be better. Should I add a ticket and merge?

shuklaayush · 2026-04-29T14:24:40Z

Yes, that would be better. Should I add a ticket and merge?

yes, let's do that

github-actions · 2026-04-29T21:03:13Z

Claude finished @gdmlcjs's task in 1m 27s —— View job

Code Review: PR #2753 — feat: enable rvr execution of benchmarks

Gather context and read changed files
Review Cargo.toml changes and feature flags
Review benchmark execution logic (execute.rs)
Review VM crate changes (metered.rs, metered_cost.rs)
Post review feedback

Overall this is a solid, well-structured addition that mirrors the existing AOT benchmark patterns. The feature flag propagation to all extension circuit crates (the key fix) is correct, and the caching via OnceLock<Mutex<HashMap<...>>> is consistent with the existing approach. A few observations:

Feature gate inconsistency in `benchmark_execute_metered_cost`

In benchmark_execute and benchmark_execute_metered, the priority chain is aot > rvr > interpreted using the pattern:

#[cfg(feature = "aot")]          // aot wins
#[cfg(all(feature = "rvr", not(feature = "aot")))]  // rvr if no aot
#[cfg(all(not(feature = "aot"), not(feature = "rvr")))]  // fallback

But benchmark_execute_metered_cost (execute.rs:553-592) uses a simpler split:

#[cfg(feature = "rvr")]       // rvr wins even over aot
#[cfg(not(feature = "rvr"))]  // everything else

This means if both aot and rvr features are enabled simultaneously, the first two benchmarks use AOT while the metered-cost benchmark uses RVR — an inconsistent mix. This is likely fine in practice (both features probably aren't enabled together), but it's worth either (a) adding a comment that this is intentional, or (b) aligning the gates with the three-tier pattern for consistency.

`rvr-openvm-lift` dependency appears unused

rvr-openvm-lift is added as an optional dependency (Cargo.toml:30) and listed in the rvr feature (Cargo.toml:68), but it is never directly imported in any .rs file in this crate. The sub-crate feature flags (openvm-circuit/rvr, openvm-rv32im-circuit/rvr, etc.) each independently pull in rvr-openvm-lift via their own dep:rvr-openvm-lift. Unless this is needed for some linking reason, it could be removed to keep the dependency list clean.

Fix this →

Redundant `VirtualMachine` construction for `executor_idx_to_air_idx`

create_rvr_instance (execute.rs:388-395) and create_metered_rvr_instance (execute.rs:418-423) each spin up a full VirtualMachine just to extract executor_idx_to_air_idx. Meanwhile, metered_cost_setup() already caches this mapping in a OnceLock. Consider caching this mapping once (or reusing metered_cost_setup().1) so you don't construct redundant VMs. The TODO at line 307 acknowledges this broader duplication — good to see it tracked.

TODO comments in VM crate are well-placed

The TODOs in metered.rs:759-760 and metered_cost.rs:231-232 about hoisting compilation to instance construction are good breadcrumbs. Since compile_metered / compile_metered_cost are called on every execution rather than once at instance creation, this is a meaningful performance optimization for benchmarks. Worth tracking as a follow-up.

`benchmark_execute_metered_cost` is `ignore = true`

The #[divan::bench(ignore = true, ...)] on line 553 means the new RVR path won't run in CI benchmarks unless explicitly un-ignored. The original was also ignored, so this is consistent — just noting that the new path hasn't been exercised via CI yet.

Summary

The PR achieves its goal — enabling RVR execution for benchmarks — and follows established patterns well. The main actionable items are the potentially unnecessary rvr-openvm-lift dependency and the minor feature-gate inconsistency in benchmark_execute_metered_cost. The duplicate boilerplate is acknowledged via the TODO and tracked for follow-up. Looks good overall.

gdmlcjs requested a review from shuklaayush April 24, 2026 22:04

gdmlcjs self-assigned this Apr 24, 2026

shuklaayush approved these changes Apr 28, 2026

View reviewed changes

gdmlcjs mentioned this pull request Apr 29, 2026

feat: implement vm execution instance that uses rvr #2747

Merged

gdmlcjs force-pushed the feat/rvr-instance branch from 8902898 to e3c53f9 Compare April 29, 2026 19:45

Base automatically changed from feat/rvr-instance to develop-v2.1.0-rvr April 29, 2026 20:30

gdmlcjs added 2 commits April 29, 2026 16:45

feat: enable rvr execution of benchmarks

323ddfa

chore: add todo for removing duplicate code in benchmark execution

e55b826

gdmlcjs force-pushed the feat/rvr-benchmarks branch from 0c3ad8c to e55b826 Compare April 29, 2026 21:02

gdmlcjs merged commit af0cc91 into develop-v2.1.0-rvr Apr 29, 2026
32 of 46 checks passed

gdmlcjs deleted the feat/rvr-benchmarks branch April 29, 2026 22:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: enable rvr execution of benchmarks#2753

feat: enable rvr execution of benchmarks#2753
gdmlcjs merged 2 commits intodevelop-v2.1.0-rvrfrom
feat/rvr-benchmarks

gdmlcjs commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 24, 2026

Uh oh!

shuklaayush left a comment

Uh oh!

gdmlcjs commented Apr 28, 2026

Uh oh!

shuklaayush commented Apr 29, 2026

Uh oh!

github-actions Bot commented Apr 29, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gdmlcjs commented Apr 24, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: PR #2753 — feat: enable rvr execution of benchmarks

Feature flag structure looks correct

Nits / suggestions

No issues found

Uh oh!

github-actions Bot commented Apr 24, 2026

Uh oh!

shuklaayush left a comment

Choose a reason for hiding this comment

Uh oh!

gdmlcjs commented Apr 28, 2026

Uh oh!

shuklaayush commented Apr 29, 2026

Uh oh!

github-actions Bot commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: PR #2753 — feat: enable rvr execution of benchmarks

Feature gate inconsistency in benchmark_execute_metered_cost

rvr-openvm-lift dependency appears unused

Redundant VirtualMachine construction for executor_idx_to_air_idx

TODO comments in VM crate are well-placed

benchmark_execute_metered_cost is ignore = true

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions Bot commented Apr 24, 2026 •

edited

Loading

github-actions Bot commented Apr 29, 2026 •

edited

Loading

Feature gate inconsistency in `benchmark_execute_metered_cost`

`rvr-openvm-lift` dependency appears unused

Redundant `VirtualMachine` construction for `executor_idx_to_air_idx`

`benchmark_execute_metered_cost` is `ignore = true`