Allow running `ramalama` without a GPU #1909

kpouget · 2025-09-08T08:55:02Z

This PR adds the ability to run `ramalama run/serve --device none ...`, which passes no device to the Podman command line.

This will be useful for nightly testing of RamaLama CPU-only performance. Before this PR, it was not possible to run CPU-only on a system with a GPU.

Summary by Sourcery

Allow using '--device none' to skip passing any device to the container, enabling CPU-only runs.

New Features:

Add support for a 'none' device option to run without GPU by omitting device flags

Documentation:

Update the CLI --device help text to mention the 'none' option

sourcery-ai · 2025-09-08T08:55:12Z

Reviewer's Guide

This PR enables CPU-only execution by allowing users to specify --device none, which causes the engine to skip adding any --device flags to the Podman command. The implementation introduces a check for the 'none' device and updates the CLI help text accordingly.

Class diagram for updated device handling in RamaLama engine

classDiagram
    class Engine {
        +add_device_options()
        -exec_args
        -args
    }
    class Args {
        +device: list[str]
    }
    Engine --> Args: uses
    Args : device can be ['none']
    Engine : add_device_options() skips device flags if device == ['none']

File-Level Changes

| Change | Details | Files |
| --- | --- | --- |
| Detect and handle 'none' device to skip device flags | Add logic to interpret a device argument equal to 'none' as a request for no device; return early in `add_device_options` to omit `--device` flags when 'none' is requested | `ramalama/engine.py` |
| Update CLI help to document 'none' option | Extend the `--device` help text to mention "(or 'none' to pass no device)" | `ramalama/cli.py` |
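
For readers who want to see the shape of the change, the early return could look roughly like the sketch below. This is a self-contained illustration and not the code in `ramalama/engine.py`: the `Engine` class is a stand-in, the `/dev/dri` path is only an example value, and any device auto-detection the real engine performs is left out.

```python
from types import SimpleNamespace


class Engine:
    """Stand-in for the real engine; only device handling is modeled."""

    def __init__(self, args):
        self.args = args
        self.exec_args = ["podman", "run"]

    def add_device_options(self):
        devices = getattr(self.args, "device", None) or []
        # 'none' requests that no --device flag be passed at all.
        if "none" in devices:
            return
        for dev in devices:
            self.exec_args += ["--device", dev]


# CPU-only request: the command line is left without any --device flag.
cpu_only = Engine(SimpleNamespace(device=["none"]))
cpu_only.add_device_options()

# Explicit device: the flag is forwarded as usual.
with_gpu = Engine(SimpleNamespace(device=["/dev/dri"]))
with_gpu.add_device_options()
```

With `device=["none"]` the command list stays untouched, which is the behavior the table above describes.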

Possibly linked issues

Configure Renovate #1: The PR adds a --device none option to prevent the /dev/dri error, allowing ramalama to run without a GPU.

gemini-code-assist

Summary of Changes

Hello @kpouget, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request enhances the ramalama tool by allowing users to explicitly opt out of GPU usage, even when a GPU is present on the system. This is achieved through a new --device none option, which prevents any device arguments from being passed to the underlying container runtime. This capability is essential for accurately testing and benchmarking CPU-only performance of ramalama in environments where GPUs might otherwise be automatically detected and utilized.

Highlights

CPU-only Execution: Introduced the ability to run ramalama run/serve without a GPU by specifying --device none.
Enhanced Testing Capabilities: This feature is crucial for nightly testing of RamaLama's CPU-only performance on systems that are equipped with GPUs, ensuring accurate benchmarks.

sourcery-ai

Hey there - I've reviewed your changes - here's some feedback:

The check for ['none'] via str(args.device) is brittle—compare the list directly (e.g. args.device == ['none']) or leverage argparse choices to handle a special none value.
You may want to explicitly error or warn if users pass multiple devices including 'none' to prevent ambiguous behavior.
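
One way to act on both points is to validate the option at parse time. The sketch below is an illustration only: it assumes `--device` is collected with `action="append"` (consistent with `args.device` being a list), and `parse_devices` is a hypothetical helper, not a function in RamaLama.

```python
import argparse


def parse_devices(argv):
    """Parse --device options and reject 'none' mixed with real devices."""
    parser = argparse.ArgumentParser(prog="demo")
    parser.add_argument("--device", action="append", default=None)
    args = parser.parse_args(argv)
    devices = args.device or []
    # Compare list contents directly instead of a string representation.
    if "none" in devices and any(d != "none" for d in devices):
        # parser.error prints a usage message and exits with status 2.
        parser.error("--device none cannot be combined with other devices")
    return devices
```

Erroring out is stricter than warning; which one fits better depends on how forgiving the CLI is meant to be.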

sourcery-ai · 2025-09-08T08:55:44Z

ramalama/engine.py

+        request_no_device = str(getattr(self.args, "device", None)) == "['none']"
+        if request_no_device:


issue: The check for 'none' as device is fragile and may not handle all input cases.

Directly comparing to the string "['none']" is brittle. Instead, check if the device argument is a list containing 'none', or if 'none' appears in the list, to ensure the logic works for all input formats.

gemini-code-assist

Code Review

This pull request introduces the ability to run ramalama without a GPU by using the --device none flag. This is a useful feature for CPU-only testing. The implementation involves updating the CLI help text and adding logic to the engine to handle this new option. While the intent is clear, the implementation for checking the --device none option in ramalama/engine.py is brittle and can lead to errors in certain edge cases. I have provided a suggestion to make this check more robust and user-friendly.

gemini-code-assist · 2025-09-08T08:56:21Z

ramalama/engine.py

+        request_no_device = str(getattr(self.args, "device", None)) == "['none']"
+        if request_no_device:
+            return


The check for request_no_device is brittle because it relies on the string representation of a list (str(...) == "['none']"). This approach has a few issues:

Fragility: It depends on Python's internal list.__str__ implementation, which could theoretically change.

Incorrectness: It fails to handle cases where --device is used multiple times. For example, ramalama run --device none --device none ... would result in self.args.device being ['none', 'none'], and str(['none', 'none']) is not "['none']". The same issue occurs with ... --device none --device /dev/kvm. In these cases, invalid arguments would be passed to the container engine, causing it to fail.
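
The failure cases listed above are easy to reproduce in isolation. The snippet below is a small standalone comparison; `brittle_check` and `membership_check` are illustrative names, not functions from the PR.

```python
def brittle_check(devices):
    # Original approach: compare the printed form of the list.
    return str(devices) == "['none']"


def membership_check(devices):
    # Alternative: inspect the list contents directly.
    return bool(devices) and "none" in devices


cases = [["none"], ["none", "none"], ["none", "/dev/kvm"], None]
results = [(brittle_check(c), membership_check(c)) for c in cases]
# results == [(True, True), (False, True), (False, True), (False, False)]
```

Only the single-element case is caught by the string comparison; the repeated and mixed cases slip through.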

I suggest a more robust check that directly inspects the list of devices. This also provides a helpful warning to the user if they specify --device none along with other devices.

Suggested change

-        request_no_device = str(getattr(self.args, "device", None)) == "['none']"
-        if request_no_device:
-            return
+        devices = getattr(self.args, "device", None)
+        if devices and "none" in devices:
+            if len(set(devices)) > 1:
+                logger.warning(
+                    "`--device=none` was specified with other device arguments, which will be ignored."
+                )
+            return

kpouget

Fixed the "['none']" fragility.

rhatdan · 2025-09-08T10:22:08Z

Please document this change in man pages.

rhatdan · 2025-09-08T12:34:12Z

LGTM

rhatdan · 2025-09-08T12:35:16Z

I probably should have made you add a test for this, so could you add a follow up with a test using --dryrun to make sure there are no --device options in the podman line.
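
A check of that kind could be built around a small helper that scans a command line for device options. The sketch below is only an illustration: `device_flags` is a hypothetical helper, and the two command lines are invented stand-ins for dry-run output, not real RamaLama output.

```python
import shlex


def device_flags(command_line):
    """Return the values of every --device option in a podman command line."""
    tokens = shlex.split(command_line)
    found = []
    for i, tok in enumerate(tokens):
        if tok == "--device" and i + 1 < len(tokens):
            found.append(tokens[i + 1])
        elif tok.startswith("--device="):
            found.append(tok.split("=", 1)[1])
    return found


# Hypothetical dry-run output lines, invented for illustration.
cpu_only_line = "podman run --rm -p 8080:8080 quay.io/example/image:latest serve"
gpu_line = "podman run --rm --device /dev/dri quay.io/example/image:latest serve"
```

A test would then assert that `device_flags(...)` is empty for the line produced with `--device none`.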

kpouget · 2025-09-08T13:19:54Z

sure, done in #1911 👍🏻

Needs containers/ramalama#1909 to work.

## Summary by CodeRabbit

* **New Features**
  * Added a Ramalama “no‑GPU” platform option, available in presets and platform flavors for easier selection on systems without GPU access.
* **Bug Fixes**
  * Non‑GPU runs now use a safe “none” device configuration for improved stability.
  * Automatically disables GPU when the selected inference server flavor is set to no‑GPU, preventing unintended GPU usage.

kpouget requested review from rhatdan, bmahabirbu, maxamillion, swarajpande5, jhjaggars, cgruver and engelmi as code owners September 8, 2025 08:55

gemini-code-assist bot reviewed Sep 8, 2025

sourcery-ai bot approved these changes Sep 8, 2025

gemini-code-assist bot reviewed Sep 8, 2025

kpouget force-pushed the no-device branch from 6500dbf to 2fcee78 Compare September 8, 2025 08:59

kpouget mentioned this pull request Sep 8, 2025

[mac-ai] Enable RamaLama CPU nightly testing openshift-psap/topsail#823

Merged

kpouget force-pushed the no-device branch from 2fcee78 to 5a9a679 Compare September 8, 2025 10:09

kpouget force-pushed the no-device branch from f7b9c01 to f0ca708 Compare September 8, 2025 11:32

kpouget added 2 commits September 8, 2025 13:37

Allow running a model without any device

1bfefa0

Useful for testing the CPU performance Signed-off-by: Kevin Pouget <[email protected]>

docs: document the --device=none flag

28a776a

Signed-off-by: Kevin Pouget <[email protected]>

kpouget force-pushed the no-device branch from f0ca708 to 28a776a Compare September 8, 2025 11:37

rhatdan merged commit 2f9e143 into containers:main Sep 8, 2025
9 of 46 checks passed

kpouget deleted the no-device branch September 8, 2025 12:36

kpouget mentioned this pull request Sep 8, 2025

Add tests for --device none #1911

Merged
