Python virtual environments (package
virtualenv
or built-in module
venv)
isolate your project’s interpreter and dependencies, but they offer
no security or execution sandboxing like a virtual machine or a Docker
container would. Therefore, running virtualenv Python programs as-is (unsecured),
any rogue dependency*
🎯 or hacked library code
🏴☠️ (et cet.
~/.ssh/id_ed25519
,~/.pki/nssdb/
,~/.mozilla/firefox/<profile>/key4.db
,~/.mozilla/firefox/<profile>/formhistory.sqlite
...
✱ Installing something as seemingly harmless as the popular package poetry pulls in nearly a hundred dependencies or over 70 MB of Python sources! 😬
In someone else's words:
Using virtualenv is more secure?
In order to execute installed Python programs in secure virtual environments, one is better advised to either look to OS VM primitives like those provided by Docker and containers, e.g.:
podman run -it -v .:/src python:3 bash # ...
The simpler alternative is automatic lightweight container wrapping with
bubblewrap (of
Flatpak fame)
using sandbox-venv
script from this repo.
There are no dependencies other than a POSIX shell with
its standard set of utilities
and bubblewrap
.
The installation process, as well as the script runtime,
should behave similarly on all relevant compute platforms,
including GNU/Linux and even
Windos/WSL. 🤞
# Install the few, unlikely to be missing dependencies, e.g.
sudo apt install coreutils binutils bubblewrap libseccomp2 python3
# A working XDG Desktop Portal is recommended to xdg-open hyperlinks
sudo apt install xdg-dbus-proxy xdg-desktop-portal* # Note: only need one
# Download the script and put it somewhere on PATH
curl -vL 'https://bit.ly/sandbox-venv' | sudo tee /usr/local/bin/sandbox-venv
sudo chmod +x /usr/local/bin/sandbox-venv # Mark executable
sandbox-venv --help
# Usage: sandbox-venv [VENV_DIR] [BWRAP_OPTS]
sandbox-venv path/to/my-project/.venv
Whenever you create a new virtual environment,
simply invoke sandbox-venv
on it afterwards, e.g.:
cd project
python -m venv .venv # Create a new project virtualenv
sandbox-venv .venv # Passing virtualenv dir is optional; defaults to ".venv"
From now on, directory .venv and everything under it
(in particular, everything in the bin folder,
e.g. .venv/bin/python
, .venv/bin/pip
etc.)
sets up and transparently runs in a secure container sandbox.
Other than the optional virtualenv dir, all arguments initially passed to
sandbox-venv
are forwarded to bubblewrap. See bubblewrap --help
or
man 1 bwrap
. You can also pass additional bubblewrap arguments to individual
process invocations via $BWRAP_ARGS
environment variable. E.g.:
BWRAP_ARGS='--bind /lib /lib' \
python -c 'import os; print(os.listdir("/lib"))'
To run the sandboxed process as superuser (while still retaining all the security functionality of the container sandbox), e.g. to open privileged ports, use args:
--uid 0 --cap-add cap_net_bind_service
The directory that contains your venv dir, i.e. .venv/..
or
the project directory, is mounted with read-write permissions,
while everything else (including project/.git
)
is mounted read-only. In addition:
"$venv/cache"
is bind-mounted as"$HOME/.cache"
"$HOME/.cache/pip"
is bind-mounted as"$HOME/.cache/pip"
(only if environment variableSANDBOX_USE_PIP_CACHE=
is set as this may enable cache poisoning attachs).
To mount extra endpoints, use Bubblewrap switches --bind
or --bind-ro
.
Anything else not explicitly mounted by an extra CLI switch
is lost upon container termination.
Linux kernel seccomp
facility for restricting syscalls is
automatically enabled when the appropriate package is
available—apt install libseccomp2 python3-seccomp
(requires virtualenv with --system-site-packages
)
or pip install pyseccomp
(also requires libseccomp2
).
The initializing module sitecustomize.py
installs a filter
that thereafter only allows syscalls listed in the environment variable
SANDBOX_SECCOMP_ALLOW=
(or, by default, some 200 syscalls that should cover all non-special cases).
You can populate the variable at runtime with a custom, stricter syscalls list
or set it to blank
(i.e. export SANDBOX_SECCOMP_ALLOW=
) to force-disable seccomp completely.
If environment variable VERBOSE=
is set to a non-empty value,
the full bwrap
command line is emitted to stderr before execution.
You can list bubblewraped processes using the
command lsns
or the following shell function:
list_bwrap () { lsns -u -W | { IFS= read header; echo "$header"; grep bwrap; }; }
list_bwrap # Function call
You can run $venv/bin/shell
to spawn interactive shell inside the sandbox.
BWRAP_ARGS=
– Extra arguments passed tobwrap
process; space or line-delimited (if arguments such as paths themselves contain spaces).SANDBOX_SECCOMP_ALLOW=
– Space-separated list of system calls allowed by the installed Linux seccomp filter. Requireslibseccomp2
andpython3-seccomp
/ Python packagepyseccomp
SANDBOX_USE_PIP_CACHE=
– Mount current user's$HOME/.cache/pip
inside the sandbox. Insecure, but can be used to cache large, trusted package downloads. Otherwise, pip is always invoked with a clean cache as ifpip --no-cache-dir ...
were used.VERBOSE=
– Print fullexec bwrap
command line right before execution.
To install a heavy package that requires a compiler, it is easiest to supply it with full /usr and /lib:
BWRAP_ARGS='--ro-bind /usr /usr --ro-bind /lib /lib' pip install ...
To pass extra environment variables, other than those filtered by default,
use bwrap --setenv
, e.g.:
BWRAP_ARGS='--setenv OPENAI_API_KEY c4f3b4b3' my-ai-prog
To run GUI (X11) apps, some prior success was achieved using e.g.:
BWRAP_ARGS='--bind /tmp/.X11-unix/X0 /tmp/.X11-unix/X8 --setenv DISPLAY :8'
See more examples on the ArchWiki.
Entrypoints in $venv/bin
are wrapped to exec bwrap
so that every invocation runs inside a fresh Bubblewrap container.
$venv/bin/pip
wrapper re-wraps any newly created executables in $venv/bin
,
ensuring they always use the wrapped $venv/bin/python
.
Rather than giving the sandbox full filesystem access,
minimal shared library (*.so
) dependencies are collected and made available inside the container,
as well as specific host binaries (e.g. /usr/bin/python3
, /usr/bin/git
, /bin/sh
etc.)—actual
runtime behavior depends on this list and which paths exist on the host.
Most paths are mounted read-only, while the project directory
(sans its .venv
, .git
) is mounted read-write.
Optionally, a seccomp filter is installed at Python startup using the sitecustomize
mechanism.
The BWRAP_ARGS=
environment variable lets you extend or relax the sandbox at runtime.
Paths inside the sandbox mirror the host paths, potentially exposing your username, directory layout etc. This was done for simplicity—pull requests appreciated!
flowchart TB
Attacker[Malicious package /
rogue dependency]
Attacker -->|filesystem access| TryFS
Attacker -->|use Linux syscalls| TrySys
Attacker -->|network bind| TryNet
subgraph Threats
TryFS[try to read ~/.ssh, /etc,
or other host secrets]
TrySys[call a forbidden /
privileged syscall]
TryNet[bind to a privileged port]
end
TryFS -->|failure| FS
TrySys -->|blocked| Seccomp[seccomp sandbox]
TryNet -->|optional| Caps
subgraph Mitigations
FS[**Mount policy**: project dir is RW;
everything else is RO or absent]
Seccomp[**seccomp** syscall whitelist
<code>SANDBOX_SECCOMP_ALLOW=</code>]
Caps[capabilities / UID mapping
controlled by <code><b>BWRAP_ARGS=</b></code>]
end
- A popular alternative are the aforementioned Docker/OCI containers and manual management of their runtime. This comes free when the worked on project itself deals in Continerfiles.
- On Linux, AppArmor, even with
apparmor.d
applied, doesn't ship a generic
python
profile, so one would go through directaa-exec --profile my-custom-env
, but writing custom AppArmor profiles is less common than simply using containers. - Firejail.
An indie C project with virtually no dependencies (which
Red HatIBM has a reasonable position on) that sets up its own sandbox. I guess it's a matter of trust. Similarly to AppArmor, requires writing a custom profile. - A custom
seccomp
initialization script, executed at interpreter startup usingPYTHONSTARTUP=
sitecustomize
startup hook. - On macOS,
sandbox-exec
or Apple Containerization®.
In comparison to the above, sandbox-venv
is like chroot
on steroids.
It uses the same isolation primitives that containers use
(process sandbox via Linux namespaces, isolated filesystem view),
but without all of the container runtime baggage—YMMV.