[Bug]: building docker from Dockerfile #15872

@surak

Your current environment

The output of `python collect_env.py`
None available, because PyTorch is not installed on the host. The system is macOS, with Python installed via uv.

🐛 Describe the bug

The image fails to build from the Dockerfile.

DOCKER_BUILDKIT=1 docker build . -t registry.jsc.fz-juelich.de/sdlaml/blablador/vllm --file Dockerfile

This results in:

...
#31 67.58 -- Found Python: /usr/bin/python3 (found version "3.12.9") found components: Interpreter Development.Module Development.SABIModule
#31 71.29 CMake Warning at .deps/vllm-flash-attn-src/CMakeLists.txt:75 (message):
#31 71.29   Pytorch version 2.4.0 expected for CUDA build, saw 2.6.0 instead.
#31 71.29 
#31 71.29 
#31 71.29 -- CUDA target architectures: 7.0;7.5;8.0;8.6;8.9;9.0
#31 71.29 -- CUDA supported target architectures: 8.0;8.6;8.9;9.0
#31 75.39 -- FA2_ARCHS: 8.0;9.0
#31 75.40 -- FA3_ARCHS: 9.0a;8.0
#31 75.42 -- vllm-flash-attn is available at /workspace/.deps/vllm-flash-attn-src
#31 75.42 -- Configuring done (58.3s)
#31 75.57 -- Generating done (0.2s)
#31 75.57 -- Build files have been written to: /workspace/build/temp.linux-x86_64-cpython-312
#31 75.60 Using MAX_JOBS=2 as the number of jobs.
#31 75.60 /bin/sh: 1: lsmod: not found
#31 75.62 Using NVCC_THREADS=8 as the number of nvcc threads.
#31 78.64 [1/308] Building CXX object CMakeFiles/_moe_C.dir/csrc/moe/torch_bindings.cpp.o
#31 78.69 [2/308] Building CXX object CMakeFiles/cumem_allocator.dir/csrc/cumem_allocator.cpp.o
#31 79.51 [3/308] Linking CXX shared module cumem_allocator.abi3.so
#31 212.1 [4/308] Building CUDA object CMakeFiles/_C.dir/csrc/cache_kernels.cu.o
#31 212.1 FAILED: CMakeFiles/_C.dir/csrc/cache_kernels.cu.o 
#31 212.1 ccache /usr/local/cuda/bin/nvcc -forward-unknown-to-host-compiler -DCUTLASS_ENABLE_DIRECT_CUDA_DRIVER_CALL=1 -DPy_LIMITED_API=3 -DTORCH_EXTENSION_NAME=_C -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -D_C_EXPORTS -I/workspace/csrc -I/workspace/.deps/cutlass-src/include -I/workspace/.deps/cutlass-src/tools/util/include -isystem /usr/include/python3.12 -isystem /usr/local/lib/python3.12/dist-packages/torch/include -isystem /usr/local/lib/python3.12/dist-packages/torch/include/torch/csrc/api/include -isystem /usr/local/cuda/include -DONNX_NAMESPACE=onnx_c2 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O2 -g -DNDEBUG -std=c++17 -Xcompiler=-fPIC --expt-relaxed-constexpr -DENABLE_FP8 --threads=8 -DENABLE_SCALED_MM_SM90=1 -DENABLE_SCALED_MM_C2X=1 -DENABLE_SPARSE_SCALED_MM_C3X=1 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_90,code=sm_90 -MD -MT CMakeFiles/_C.dir/csrc/cache_kernels.cu.o -MF CMakeFiles/_C.dir/csrc/cache_kernels.cu.o.d -x cu -c /workspace/csrc/cache_kernels.cu -o CMakeFiles/_C.dir/csrc/cache_kernels.cu.o
#31 212.1 Killed
#31 212.1 Killed
#31 212.1 Killed
#31 212.1 ninja: build stopped: subcommand failed.
#31 212.4 Traceback (most recent call last):
#31 212.4   File "/workspace/setup.py", line 676, in <module>
#31 212.5     setup(
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/__init__.py", line 117, in setup
#31 212.5     return distutils.core.setup(**attrs)
#31 212.5            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/core.py", line 186, in setup
#31 212.5     return run_commands(dist)
#31 212.5            ^^^^^^^^^^^^^^^^^^
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/core.py", line 202, in run_commands
#31 212.5     dist.run_commands()
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
#31 212.5     self.run_command(cmd)
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/dist.py", line 1104, in run_command
#31 212.5     super().run_command(command)
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/dist.py", line 1021, in run_command
#31 212.5     cmd_obj.run()
#31 212.5   File "/usr/local/lib/python3.12/dist-packages/setuptools/command/bdist_wheel.py", line 370, in run
#31 212.6     self.run_command("build")
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/cmd.py", line 357, in run_command
#31 212.6     self.distribution.run_command(command)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/dist.py", line 1104, in run_command
#31 212.6     super().run_command(command)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/dist.py", line 1021, in run_command
#31 212.6     cmd_obj.run()
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/command/build.py", line 135, in run
#31 212.6     self.run_command(cmd_name)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/cmd.py", line 357, in run_command
#31 212.6     self.distribution.run_command(command)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/dist.py", line 1104, in run_command
#31 212.6     super().run_command(command)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/dist.py", line 1021, in run_command
#31 212.6     cmd_obj.run()
#31 212.6   File "/workspace/setup.py", line 267, in run
#31 212.6     super().run()
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/command/build_ext.py", line 99, in run
#31 212.6     _build_ext.run(self)
#31 212.6   File "/usr/local/lib/python3.12/dist-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
#31 212.6     self.build_extensions()
#31 212.6   File "/workspace/setup.py", line 238, in build_extensions
#31 212.6     subprocess.check_call(["cmake", *build_args], cwd=self.build_temp)
#31 212.6   File "/usr/lib/python3.12/subprocess.py", line 415, in check_call
#31 212.6     raise CalledProcessError(retcode, cmd)
#31 212.6 subprocess.CalledProcessError: Command '['cmake', '--build', '.', '-j=1', '--target=_moe_C', '--target=_vllm_fa2_C', '--target=_vllm_fa3_C', '--target=_flashmla_C', '--target=cumem_allocator', '--target=_C']' returned non-zero exit status 1.
------
executor failed running [/bin/sh -c if [ "$USE_SCCACHE" != "1" ]; then         rm -rf .deps &&         mkdir -p .deps &&         python3 setup.py bdist_wheel --dist-dir=dist --py-limited-api=cp38;     fi]: exit code: 1

This fails with vllm==0.7.4, 0.8.2, and main.
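
The bare `Killed` lines in the log are what a shell usually prints when a child process receives SIGKILL, which during a container build often (though not always) means the kernel's out-of-memory killer stopped the compiler. As a rough aid for triaging this kind of log, the sketch below counts those lines and pulls out the parallelism settings the build reported. `summarize_build_log` is a hypothetical helper written for this illustration, not part of vLLM or Docker, and it assumes the build output was saved to a file.

```shell
#!/bin/sh
# Hypothetical helper (not part of vLLM or Docker): summarize hints of an
# out-of-memory kill in a saved `docker build` log.
summarize_build_log() {
    log_file="$1"

    # Count lines that end in a bare "Killed", as printed when a compiler
    # process is terminated with SIGKILL. grep -c still prints 0 on no match.
    killed=$(grep -c -E '(^|[[:space:]])Killed[[:space:]]*$' "$log_file" || true)

    # Pull out the parallelism settings that the build itself reported.
    max_jobs=$(sed -n 's/.*Using MAX_JOBS=\([0-9][0-9]*\).*/\1/p' "$log_file" | head -n 1)
    nvcc_threads=$(sed -n 's/.*Using NVCC_THREADS=\([0-9][0-9]*\).*/\1/p' "$log_file" | head -n 1)

    echo "killed=${killed} max_jobs=${max_jobs:-unknown} nvcc_threads=${nvcc_threads:-unknown}"
}
```

Saving the build output with `docker build ... 2>&1 | tee build.log` and then running `summarize_build_log build.log` gives a one-line summary. A non-zero `killed` count together with several jobs and nvcc threads suggests trying more memory for the Docker VM or lower parallelism. Whether this Dockerfile exposes build arguments for those two values is an assumption to verify against its `ARG` lines.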

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.


Labels: bug (Something isn't working), stale (Over 90 days of inactivity)
