Pull requests: ggml-org/llama.cpp

Pull requests list

batch : add optional for sequential equal split
#14511 opened Jul 3, 2025 by ggerganov
opencl: broadcast for soft_max (labels: ggml, OpenCL)
#14510 opened Jul 3, 2025 by lhez
vulkan: support mixed/deepseekR1 FA head sizes (labels: ggml, Vulkan)
#14509 opened Jul 2, 2025 by jeffbolznv
sycl: Fix conditional enabling following arch checks for ggml-sycl (labels: ggml, SYCL)
#14504 opened Jul 2, 2025 by s-Nick
MUSA: upgrade musa sdk to <<TBD>> (labels: ggml, Nvidia GPU)
#14498 opened Jul 2, 2025 by yeahdongcn Draft
Allow truncation when embedding (labels: examples, server)
#14493 opened Jul 2, 2025 by huydt84
vulkan: unpack more values at a time for iquants mat mul (labels: ggml, Vulkan)
#14485 opened Jul 1, 2025 by netrunnereve
ggml: backward pass for split swiglu (labels: ggml, testing)
#14483 opened Jul 1, 2025 by JohannesGaessler
llama : reuse compute graphs
#14482 opened Jul 1, 2025 by ggerganov
3 of 7 tasks
opencl : add GELU_ERF (labels: ggml, OpenCL)
#14476 opened Jul 1, 2025 by CISC
Pr/7191 (labels: build, devops, python)
#14447 opened Jun 29, 2025 by esrakorkmz
ggml : implement GEGLU_ERF and GEGLU_QUICK ops (labels: Apple Metal, examples, ggml, Nvidia GPU, OpenCL, SYCL, Vulkan)
#14445 opened Jun 29, 2025 by CISC
Added CI with RISC-V RVV1.0 Hardware (labels: devops)
#14439 opened Jun 29, 2025 by alitariq4589
model : add hunyuan moe (labels: python)
#14425 opened Jun 27, 2025 by ngxson
4 tasks done
ggml : add ggml_scale_bias (labels: Apple Metal, ggml, testing)
#14417 opened Jun 27, 2025 by ngxson Draft
[CANN] weight format to nz for Ascend310P3 (labels: Ascend NPU, ggml)
#14407 opened Jun 27, 2025 by tqgy6
OpenCL: add conv2d kernel (labels: ggml, OpenCL)
#14403 opened Jun 26, 2025 by rmatif
ggml : add pointer to attach user data (labels: ggml)
#14397 opened Jun 26, 2025 by koush
compare-commits.sh: support both llama-bench and test-backend-ops (labels: python, script)
#14392 opened Jun 26, 2025 by yeahdongcn
ggml-cpu: Build variant targeting Neoverse-V2 (labels: ggml)
#14380 opened Jun 25, 2025 by ckastner