Skip to content

Conversation

yatarkan
Copy link
Contributor

@yatarkan yatarkan commented Jun 20, 2025

@github-actions github-actions bot added the category: GH Pages Docs Github Pages documentation label Jun 20, 2025
@yatarkan yatarkan requested a review from Wovchena June 20, 2025 11:08
<li>Control output with different generation parameters (sampling, temperature, etc.)</li>
<li>Optimize for conversational scenarios by using chat mode</li>
<li>Apply LoRA adapters and dynamically switch between them without recompilation</li>
<li>Accelerate generation using draft models via Speculative Decoding</li>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
<li>Accelerate generation using draft models via Speculative Decoding</li>
<li>Accelerate generation using prompt lookup decoding or draft models via Speculative Decoding</li>

By the way, it seems that prompt lookup isn't covered in docs

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added task to the ticket with docs leftovers

@yatarkan yatarkan marked this pull request as ready for review June 23, 2025 17:07
@yatarkan yatarkan requested a review from Wovchena June 23, 2025 17:23
@Wovchena Wovchena added this pull request to the merge queue Jun 24, 2025
Merged via the queue into openvinotoolkit:master with commit cb3f1de Jun 24, 2025
68 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GH Pages Docs Github Pages documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants