Skip to content

kernc/sandbox-venv

Repository files navigation

sandbox-venv: secure container virtualenv wrapper

Build status Language: shell / Bash Source lines of code Script size Issues Sponsors

Problem statement

Python virtual environments (package virtualenv or built-in module venv) isolate your project’s interpreter and dependencies, but they offer no security or execution sandboxing like a virtual machine or a Docker container would. Therefore, running virtualenv Python programs as-is (unsecured), any rogue dependency* 🎯 or hacked library code 🏴‍☠️ (et cet. ⚠️) can wreak havoc, including access all your private parts ‼️—think current user's credentials and personal bits like:

  • ~/.ssh/id_ed25519,
  • ~/.pki/nssdb/,
  • ~/.mozilla/firefox/<profile>/key4.db,
  • ~/.mozilla/firefox/<profile>/formhistory.sqlite ...

✱ Installing something as seemingly harmless as the popular package poetry pulls in nearly a hundred dependencies or over 70 MB of Python sources! 😬

In someone else's words:

Using virtualenv is more secure?

No. Not in the slightest.

Solution

In order to execute installed Python programs in secure virtual environments, one is better advised to either look to OS VM primitives like those provided by Docker and containers, e.g.:

podman run -it -v .:/src python:3 bash  # ...

The simpler alternative is automatic lightweight container wrapping with bubblewrap (of Flatpak fame) using sandbox-venv script from this repo.

Installation

There are no dependencies other than a POSIX shell with its standard set of utilities and bubblewrap. The installation process, as well as the script runtime, should behave similarly on all relevant compute platforms, including GNU/Linux and even Windos/WSL. 🤞

# Install the few, unlikely to be missing dependencies, e.g.
sudo apt install coreutils binutils bubblewrap libseccomp2 python3
# A working XDG Desktop Portal is recommended to xdg-open hyperlinks
sudo apt install xdg-dbus-proxy xdg-desktop-portal*  # Note: only need one

# Download the script and put it somewhere on PATH
curl -vL 'https://bit.ly/sandbox-venv' | sudo tee /usr/local/bin/sandbox-venv
sudo chmod +x /usr/local/bin/sandbox-venv  # Mark executable

sandbox-venv --help
# Usage: sandbox-venv [VENV_DIR] [BWRAP_OPTS]
sandbox-venv path/to/my-project/.venv

Usage

Whenever you create a new virtual environment, simply invoke sandbox-venv on it afterwards, e.g.:

cd project
python -m venv .venv  # Create a new project virtualenv
sandbox-venv .venv    # Passing virtualenv dir is optional; defaults to ".venv"

From now on, directory .venv and everything under it (in particular, everything in the bin folder, e.g. .venv/bin/python, .venv/bin/pip etc.) sets up and transparently runs in a secure container sandbox.

Extra Bubblewrap arguments

Other than the optional virtualenv dir, all arguments initially passed to sandbox-venv are forwarded to bubblewrap. See bubblewrap --help or man 1 bwrap. You can also pass additional bubblewrap arguments to individual process invocations via $BWRAP_ARGS environment variable. E.g.:

BWRAP_ARGS='--bind /lib /lib' \
    python -c 'import os; print(os.listdir("/lib"))'

To run the sandboxed process as superuser (while still retaining all the security functionality of the container sandbox), e.g. to open privileged ports, use args:

--uid 0 --cap-add cap_net_bind_service

Filesystem mounts

The directory that contains your venv dir, i.e. .venv/.. or the project directory, is mounted with read-write permissions, while everything else (including project/.git) is mounted read-only. In addition:

  • "$venv/cache" is bind-mounted as "$HOME/.cache"
  • "$HOME/.cache/pip" is bind-mounted as "$HOME/.cache/pip" (only if environment variable SANDBOX_USE_PIP_CACHE= is set as this may enable cache poisoning attachs).

To mount extra endpoints, use Bubblewrap switches --bind or --bind-ro. Anything else not explicitly mounted by an extra CLI switch is lost upon container termination.

Linux Seccomp

Linux kernel seccomp facility for restricting syscalls is automatically enabled when the appropriate package is availableapt install libseccomp2 python3-seccomp (requires virtualenv with --system-site-packages) or pip install pyseccomp (also requires libseccomp2). The initializing module sitecustomize.py installs a filter that thereafter only allows syscalls listed in the environment variable SANDBOX_SECCOMP_ALLOW= (or, by default, some 200 syscalls that should cover all non-special cases). You can populate the variable at runtime with a custom, stricter syscalls list or set it to blank (i.e. export SANDBOX_SECCOMP_ALLOW=) to force-disable seccomp completely.

Runtime monitoring

If environment variable VERBOSE= is set to a non-empty value, the full bwrap command line is emitted to stderr before execution.

You can list bubblewraped processes using the command lsns or the following shell function:

list_bwrap () { lsns -u -W | { IFS= read header; echo "$header"; grep bwrap; }; }

list_bwrap  # Function call

You can run $venv/bin/shell to spawn interactive shell inside the sandbox.

Environment variables

  • BWRAP_ARGS=– Extra arguments passed to bwrap process; space or line-delimited (if arguments such as paths themselves contain spaces).
  • SANDBOX_SECCOMP_ALLOW=– Space-separated list of system calls allowed by the installed Linux seccomp filter. Requires libseccomp2 and python3-seccomp / Python package pyseccomp
  • SANDBOX_USE_PIP_CACHE=– Mount current user's $HOME/.cache/pip inside the sandbox. Insecure, but can be used to cache large, trusted package downloads. Otherwise, pip is always invoked with a clean cache as if pip --no-cache-dir ... were used.
  • VERBOSE=– Print full exec bwrap command line right before execution.

Examples

To install a heavy package that requires a compiler, it is easiest to supply it with full /usr and /lib:

BWRAP_ARGS='--ro-bind /usr /usr --ro-bind /lib /lib'  pip install ...

To pass extra environment variables, other than those filtered by default, use bwrap --setenv, e.g.:

BWRAP_ARGS='--setenv OPENAI_API_KEY c4f3b4b3'  my-ai-prog

To run GUI (X11) apps, some prior success was achieved using e.g.:

BWRAP_ARGS='--bind /tmp/.X11-unix/X0 /tmp/.X11-unix/X8 --setenv DISPLAY :8'

See more examples on the ArchWiki.

Security Model

Entrypoints in $venv/bin are wrapped to exec bwrap so that every invocation runs inside a fresh Bubblewrap container.

$venv/bin/pip wrapper re-wraps any newly created executables in $venv/bin, ensuring they always use the wrapped $venv/bin/python.

Rather than giving the sandbox full filesystem access, minimal shared library (*.so) dependencies are collected and made available inside the container, as well as specific host binaries (e.g. /usr/bin/python3, /usr/bin/git, /bin/sh etc.)—actual runtime behavior depends on this list and which paths exist on the host. Most paths are mounted read-only, while the project directory (sans its .venv, .git) is mounted read-write.

Optionally, a seccomp filter is installed at Python startup using the sitecustomize mechanism. The BWRAP_ARGS= environment variable lets you extend or relax the sandbox at runtime.

Paths inside the sandbox mirror the host paths, potentially exposing your username, directory layout etc. This was done for simplicity—pull requests appreciated!

flowchart TB
  Attacker[Malicious package /
           rogue dependency]
  Attacker -->|filesystem access| TryFS
  Attacker -->|use Linux syscalls| TrySys
  Attacker -->|network bind| TryNet

  subgraph Threats
    TryFS[try to read ~/.ssh, /etc,
          or other host secrets]
    TrySys[call a forbidden /
           privileged syscall]
    TryNet[bind to a privileged port]
  end

  TryFS -->|failure| FS
  TrySys -->|blocked| Seccomp[seccomp sandbox]
  TryNet -->|optional| Caps

  subgraph Mitigations
    FS[**Mount policy**: project dir is RW;
       everything else is RO or absent]
    Seccomp[**seccomp** syscall whitelist
            <code>SANDBOX_SECCOMP_ALLOW=</code>]
    Caps[capabilities / UID mapping
         controlled by <code><b>BWRAP_ARGS=</b></code>]
  end
Loading

Viable alternatives

  1. A popular alternative are the aforementioned Docker/OCI containers and manual management of their runtime. This comes free when the worked on project itself deals in Continerfiles.
  2. On Linux, AppArmor, even with apparmor.d applied, doesn't ship a generic python profile, so one would go through direct aa-exec --profile my-custom-env, but writing custom AppArmor profiles is less common than simply using containers.
  3. Firejail. An indie C project with virtually no dependencies (which Red HatIBM has a reasonable position on) that sets up its own sandbox. I guess it's a matter of trust. Similarly to AppArmor, requires writing a custom profile.
  4. A custom seccomp initialization script, executed at interpreter startup using PYTHONSTARTUP= sitecustomize startup hook.
  5. On macOS, sandbox-exec or Apple Containerization®.

In comparison to the above, sandbox-venv is like chroot on steroids. It uses the same isolation primitives that containers use (process sandbox via Linux namespaces, isolated filesystem view), but without all of the container runtime baggage—YMMV.