ggml-org / llama.cpp Public

Fork 13.2k
Star 87k

Pull requests: ggml-org/llama.cpp

549 Open 7,119 Closed

Pull requests list

tests: override test_set_rows::max_nmse_err to allow for occasional rounding differences
Labels: testing
#16295 opened Sep 28, 2025 by jeffbolznv

ci: update vulkan ci
Labels: devops
#16294 opened Sep 27, 2025 by netrunnereve

vulkan: Fix validation failure in quantized flash attention
Labels: ggml, Vulkan
#16292 opened Sep 27, 2025 by jeffbolznv

hip : substituted bpermute ops with swizzle ops (gfx906, maybe all AMD)
Labels: ggml, Nvidia GPU
#16291 opened Sep 27, 2025 by iacopPBK

webui : added download action (#13552)
Labels: examples, server
#16282 opened Sep 26, 2025 by srogmann

Update convert_hf_to_gguf_update.py
Labels: python
#16280 opened Sep 26, 2025 by cpumaxx

rpc : add support for multiple devices
Labels: examples, ggml
#16276 opened Sep 26, 2025 by rgerganov • Draft

Support FP16 as intermediate results in graph computation
Labels: ggml
#16270 opened Sep 26, 2025 by hipudding • Draft

musa: update compile flags
Labels: ggml, Nvidia GPU
#16265 opened Sep 26, 2025 by yeahdongcn

common : fix reasoning before forced tool call via tool_choice = required
#16264 opened Sep 26, 2025 by crat0z

Correct XTC threshold args documentation
#16260 opened Sep 25, 2025 by Volko61

Refactor llama-model.cpp
#16252 opened Sep 25, 2025 by pwilkin

CANN: Update several operators to support FP16 data format
Labels: Ascend NPU, ggml
#16251 opened Sep 25, 2025 by hipudding

ci : add AMD runners and workflows
Labels: devops
#16249 opened Sep 25, 2025 by ggerganov

kleidiai: fix work size and threads sync for fp16
Labels: ggml
#16246 opened Sep 25, 2025 by chaxu01

llama : merge logit and p fields in llama_token_data
Labels: examples, server, testing
#16241 opened Sep 25, 2025 by danbev • Draft

ggml-cpu: detect correct cpu flags for arm64 (#16229)
Labels: ggml
#16239 opened Sep 25, 2025 by lizhenneng

Update the docs on -t --threads
Labels: examples, server
#16236 opened Sep 24, 2025 by takasurazeem

Extend CI for i8mm kernels as well
Labels: testing
#16234 opened Sep 24, 2025 by Rohanjames1997

metal : extend mat-mat multiplication support
Labels: Apple Metal, ggml, testing
#16225 opened Sep 24, 2025 by ggerganov

Improve Mobile UI for dialogs and action dropdowns
Labels: examples, server
#16222 opened Sep 24, 2025 by allozaur

HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0
Labels: ggml, Nvidia GPU
#16221 opened Sep 24, 2025 by IMbackK

Model: Granite docling + Idefics3 preprocessing (SmolVLM)
Labels: examples, python
#16206 opened Sep 23, 2025 by gabe-l-hart

vulkan: Add ACC_TYPE_VEC2 implementation
Labels: ggml, Vulkan
#16203 opened Sep 23, 2025 by SavicStefan

tools/main: llama-cli: prevent spurious assistant token (#13402)
Labels: examples
#16202 opened Sep 23, 2025 by vinkal-chudgar
