Pull requests: ggml-org/llama.cpp
tests: override test_set_rows::max_nmse_err to allow for occasional rounding differences
testing
Everything test related
#16295
opened Sep 28, 2025 by
jeffbolznv
ci: update vulkan ci
devops
improvements to build systems and github actions
#16294
opened Sep 27, 2025 by
netrunnereve
vulkan: Fix validation failure in quantized flash attention
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16292
opened Sep 27, 2025 by
jeffbolznv
hip : substituted bpermute ops with swizzle ops (gfx906, maybe all AMD)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16291
opened Sep 27, 2025 by
iacopPBK
webui : added download action (#13552)
examples
server
#16282
opened Sep 26, 2025 by
srogmann
Update convert_hf_to_gguf_update.py
python
python script changes
#16280
opened Sep 26, 2025 by
cpumaxx
Support FP16 as intermediate results in graph computation
ggml
changes relating to the ggml tensor library for machine learning
musa: update compile flags
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16265
opened Sep 26, 2025 by
yeahdongcn
common : fix reasoning before forced tool call via tool_choice = required
#16264
opened Sep 26, 2025 by
crat0z
CANN: Update several operators to support FP16 data format
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#16251
opened Sep 25, 2025 by
hipudding
ci : add AMD runners and workflows
devops
improvements to build systems and github actions
#16249
opened Sep 25, 2025 by
ggerganov
kleidiai: fix work size and threads sync for fp16
ggml
changes relating to the ggml tensor library for machine learning
#16246
opened Sep 25, 2025 by
chaxu01
ggml-cpu: detect correct cpu flags for arm64 (#16229)
ggml
changes relating to the ggml tensor library for machine learning
#16239
opened Sep 25, 2025 by
lizhenneng
Extend CI for i8mm kernels as well
testing
Everything test related
#16234
opened Sep 24, 2025 by
Rohanjames1997
metal : extend mat-mat multiplication support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16225
opened Sep 24, 2025 by
ggerganov
HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16221
opened Sep 24, 2025 by
IMbackK
Model: Granite docling + Idefics3 preprocessing (SmolVLM)
examples
python
python script changes
#16206
opened Sep 23, 2025 by
gabe-l-hart
vulkan: Add ACC_TYPE_VEC2 implementation
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16203
opened Sep 23, 2025 by
SavicStefan
tools/main: llama-cli: prevent spurious assistant token (#13402)
examples
#16202
opened Sep 23, 2025 by
vinkal-chudgar