Skip to content

Conversation

wgzintel
Copy link
Contributor

@wgzintel wgzintel commented Jun 23, 2025

  1. Support to collect latency for transformers V4.52.0.
  2. move hook sample files to dir llm_hook_sample.
  3. move hook beam search files to dir llm_hook_beam_search.

@github-actions github-actions bot added the category: llm_bench Label for tool/llm_bench folder label Jun 23, 2025
@wgzintel wgzintel changed the title Support to collect lantency for transfomers V4.52.0 Support to collect latency for transfomers V4.52.0 Jun 23, 2025
@wgzintel wgzintel changed the title Support to collect latency for transfomers V4.52.0 Support to collect latency for transformers V4.52.0 Jun 23, 2025
@wgzintel wgzintel requested review from sbalandi and Wovchena June 24, 2025 01:17
@Wovchena Wovchena requested a review from Copilot June 24, 2025 06:50
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds support for latency collection when using transformers v4.52.0 along with reorganizing hook sample and beam search files into dedicated directories. Key changes include replacing the hook sample implementation for v4.52.0, updating beam search logic for both v4.52.0 and v4.51.0 variants, and modifying the import paths in the greedy search hook.

Reviewed Changes

Copilot reviewed 4 out of 12 changed files in this pull request and generated 1 comment.

File Description
tools/llm_bench/llm_bench_utils/llm_hook_sample/hook_sample_v52.py New sample-generation implementation for transformers v4.52.0 with latency collection
tools/llm_bench/llm_bench_utils/llm_hook_beam_search/hook_beam_search_v52.py New beam search implementation for transformers v4.52.0 with latency collection
tools/llm_bench/llm_bench_utils/llm_hook_beam_search/hook_beam_search_v51.py Retained beam search implementation for transformers v4.51.0 with latency collection
tools/llm_bench/llm_bench_utils/hook_greedy_search.py Updated import paths and version checks to use the appropriate hook sample module

@Wovchena Wovchena added this pull request to the merge queue Jun 25, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jun 25, 2025
@Wovchena Wovchena added this pull request to the merge queue Jun 25, 2025
Merged via the queue into openvinotoolkit:master with commit 04fe9ce Jun 25, 2025
68 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: llm_bench Label for tool/llm_bench folder
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants