feat(fill, execute): track execution & setup testing phase #2157

LouisTsai-Csie · 2025-09-16T09:04:08Z

🗒️ Description

Background for reviewers:

In some benchmark tests, there are not only benchmark transactions but also setup transactions that create the scenario for the benchmark target. For example, in the SSTORE benchmark we first need a setup transaction that deploys as many contracts as possible, and then a separate transaction to perform the storage updates. This feature helps gas-limit testing distinguish between the setup and execution phases.

This PR extends transaction metadata, which labeling phases such as setup, testing, and cleanup. Currently, operations like deploy_contract or fund_eoa are tagged as setup phase, while transactions/blocks included in state_test or blockchain_test are tagged as testing. However, the current labeling is not always precise enough.

Take test_worst_blockhash as an example. The first 256 blocks should be classified as setup, while only the final block should count as the testing phase. Without this distinction, we might mistakenly include the setup blocks in benchmark accounting.

def test_worst_blockhash(
    blockchain_test: BlockchainTestFiller,
    pre: Alloc,
    gas_benchmark_value: int,
):
    """Test running a block with as many blockhash accesses to the oldest allowed block as possible."""
    # Create 256 dummy blocks to fill the blockhash window.
    blocks = [Block()] * 256

    # Always ask for the oldest allowed BLOCKHASH block.
    execution_code = Op.PUSH1(1) + While(
        body=Op.POP(Op.BLOCKHASH(Op.DUP1)),
    )
    execution_code_address = pre.deploy_contract(code=execution_code)
    op_tx = Transaction(
        to=execution_code_address,
        gas_limit=gas_benchmark_value,
        sender=pre.fund_eoa(),
    )
    blocks.append(Block(txs=[op_tx]))

    blockchain_test(
        pre=pre,
        post={},
        blocks=blocks,
    )

The design introduces a test_phase attribute at both the block and transaction level (currently supporting execution and setup phases). This attribute is used during execution (see transaction_post.py for details), and the metadata is updated accordingly.

🔗 Related Issues or PRs

This is the follow-up PR for #1945, more description could be found in issue #2137.
Related discussion: 1, 2

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.
Tests: For PRs implementing a missed test case, update the post-mortem document to add an entry the list.
Ported Tests: All converted JSON/YML tests from ethereum/tests or tests/static have been assigned @ported_from marker.

LouisTsai-Csie · 2025-09-17T09:51:18Z

Note: I have not yet tested against execute command. I will provide some result later

fselmo

Hey @LouisTsai-Csie. I don't have much to add on the implementation. This looks great. I'm going to test this early next week but just had some minor observations at first pass and left as comments.

I wanted to just get a bit of clarity on the scope on how the metadata of each phase will be used. I think we're capturing the data in a good way and I will continue my review next week, but I'd like to understand all the ways we currently plan to use this kind of metadata to get a better understanding for the design.

I think we mentioned being able to call execute on a network with a setup phase first, and later call it with the execution phase. Is there any other case I'm missing so I can understand the end goal on capturing this split? Thanks!

cc: @marioevz

fselmo · 2025-10-03T23:00:44Z

src/ethereum_test_types/tests/test_phase_manager.py

+        assert len(manager.setup_blocks) == 0
+        assert len(manager.execution_transactions) == 0
+        assert len(manager.execution_blocks) == 0
+        assert manager.get_current_phase() == TestPhase.EXECUTION


Perhaps we should have a test with two managers which tests that changing the phase of one does not impact the other (no context muddying). This will be the case for sure because the phase is set on the instance, but just to sanity check against point 2 in this comment in the old PR in case the phase logic ever changes.

It could be simple... something like entering a setup context block with manager1, checking that its context changed to setup, and making sure manager2.get_current_phase() is still execution (default).

Thoughts?

fselmo · 2025-10-03T23:02:45Z

src/ethereum_test_specs/blockchain.py

    block_access_list: Bytes | None = Field(None)
-    """EIP-7928: Block-level access lists (serialized)."""
+    """
+        EIP-7928: Block-level access lists (serialized).


nit: Any reason this was changed? There's an empty space here but we can probably put it back to one line? 👀

LouisTsai-Csie self-assigned this Sep 16, 2025

LouisTsai-Csie added scope:fw Scope: Framework (evm|tools|forks|pytest) scope:fill Scope: fill command scope:execute Scope: Changes to the execute command labels Sep 16, 2025

LouisTsai-Csie mentioned this pull request Sep 16, 2025

feat(benchmark): add benchmark_test test type #1945

Merged

5 tasks

LouisTsai-Csie force-pushed the feat/add-phase-manager branch from ef9acbe to ec37e3a Compare September 17, 2025 09:23

danceratopz self-requested a review September 25, 2025 13:41

LouisTsai-Csie added 4 commits September 26, 2025 17:14

feat(tests): add phase manager to track testing phase

42a1e8c

refactor: update test phase manager instance model

2e8ee6a

fix: resolve linting issue

2dab80d

test: add case for TestPhaseManager functionality

9f85ffc

LouisTsai-Csie force-pushed the feat/add-phase-manager branch from ec37e3a to 9f85ffc Compare September 26, 2025 11:20

LouisTsai-Csie mentioned this pull request Sep 29, 2025

eest_tests sends blocks with too much gas NethermindEth/gas-benchmarks#60

Open

fselmo reviewed Oct 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(fill, execute): track execution & setup testing phase #2157

feat(fill, execute): track execution & setup testing phase #2157

Uh oh!

LouisTsai-Csie commented Sep 16, 2025 •

edited

Loading

Uh oh!

LouisTsai-Csie commented Sep 17, 2025

Uh oh!

fselmo left a comment •

edited

Loading

Uh oh!

fselmo Oct 3, 2025 •

edited

Loading

Uh oh!

fselmo Oct 3, 2025

Uh oh!

Uh oh!

feat(fill, execute): track execution & setup testing phase #2157

Are you sure you want to change the base?

feat(fill, execute): track execution & setup testing phase #2157

Uh oh!

Conversation

LouisTsai-Csie commented Sep 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🗒️ Description

🔗 Related Issues or PRs

✅ Checklist

Uh oh!

LouisTsai-Csie commented Sep 17, 2025

Uh oh!

fselmo left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fselmo Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fselmo Oct 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

LouisTsai-Csie commented Sep 16, 2025 •

edited

Loading

fselmo left a comment •

edited

Loading

fselmo Oct 3, 2025 •

edited

Loading