Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
chraac
/
llama.cpp
Public
forked from
ggml-org/llama.cpp
Notifications
You must be signed in to change notification settings
Fork
5
Star
47
Code
Issues
9
Pull requests
0
Discussions
Actions
Projects
1
Wiki
Security
Uh oh!
There was an error while loading.
Please reload this page
.
Insights
Additional navigation options
Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights
Commits
Branch selector
dev-refactoring
User selector
All users
Datepicker
All time
Commit History
Commits on Nov 19, 2025
Merge branch 'master' into dev-refactoring
chraac
committed
3b818dd
Copy full SHA for 3b818dd
Commits on Nov 17, 2025
ggml : add missing AVX512 feature checks (#17270)
Show description for cb623de
angt
authored
cb623de
Copy full SHA for cb623de
metal : support I32 -> I32 copy (#17317)
ggerganov
authored
7aaeedc
Copy full SHA for 7aaeedc
metal : faster argsort (#17315)
Show description for 3347e6d
ggerganov
authored
3347e6d
Copy full SHA for 3347e6d
metal : add cumsum (#17305)
ggerganov
authored
1a13964
Copy full SHA for 1a13964
CANN: Use smart pointers to manage ACL objects (#17238)
Show description for 2376b77
hipudding
authored
2376b77
Copy full SHA for 2376b77
Commits on Nov 16, 2025
vulkan: add LOG operation support for F32 and F16 (#17183)
Show description for dbed612
zayac
authored
dbed612
Copy full SHA for dbed612
vulkan: fix MMQ quantize_y condition (#17301)
0cc4m
authored
80deff3
Copy full SHA for 80deff3
ci : revert #16249 (#17303)
Show description for 8b1c339
netrunnereve
authored
8b1c339
Copy full SHA for 8b1c339
metal : remove obosolete asserts (#17295)
ggerganov
authored
416e7c7
Copy full SHA for 416e7c7
server : handle context overflow during decode (#17267)
Show description for 5b2093b
ggerganov
authored
5b2093b
Copy full SHA for 5b2093b
opencl: fix rms_norm_mul (#17250)
Show description for 52e5d42
lhez
authored
52e5d42
Copy full SHA for 52e5d42
opencl: add kernel to handle mat mul in attention to improve encoding speed (#17181)
Show description for 4db5641
shaofeiqi
authored
4db5641
Copy full SHA for 4db5641
Commits on Nov 15, 2025
sycl : unify unary kernels with a generic implementation and enable wide operator support (#17213)
Show description for 72bd732
shani-f
authored
72bd732
Copy full SHA for 72bd732
webui: Fix clickability around chat processing statistics UI (#17278)
Show description for 22e1ce2
allozaur
authored
22e1ce2
Copy full SHA for 22e1ce2
webui: add OAI-Compat Harmony tool-call streaming visualization and persistence in chat UI (#16618)
Show description for 1411d92
ServeurpersoCom
and
allozaur
authored
1411d92
Copy full SHA for 1411d92
convert : remove unnecessary chat template patching (#17289)
CISC
authored
662192e
Copy full SHA for 662192e
vulkan: Fuse mul_mat_id+add_id+mul and mul_mat+add+add. (#17287)
Show description for 24dc769
jeffbolznv
authored
24dc769
Copy full SHA for 24dc769
vulkan: Replace 16-bit unpack8 calls to work around legacy Windows AMD driver bug (#17285)
0cc4m
authored
4dca015
Copy full SHA for 4dca015
convert : use all parts in safetensors index (#17286)
CISC
authored
9a8860c
Copy full SHA for 9a8860c
convert : set expert gating func in base class (#17279)
CISC
authored
9d3ef48
Copy full SHA for 9d3ef48
mtmd-cli: Avoid logging to stdout for model loading messages in mtmd-cli (#17277)
ankurvdev
authored
c7b7db0
Copy full SHA for c7b7db0
vulkan: implement ABS and NEG (#17245)
Show description for 1568d13
giuseppe
authored
1568d13
Copy full SHA for 1568d13
vulkan: Use ggml_vk_tensor_subbuffer in mul_mat_vec(id) paths (#17244)
Show description for 439342e
jeffbolznv
authored
439342e
Copy full SHA for 439342e
vulkan: skip all-negative-inf blocks in FA (#17186)
jeffbolznv
authored
234ae7d
Copy full SHA for 234ae7d
vulkan: change graph_compute to be async and enable get_tensor_async (#17158)
Show description for 38eaf32
jeffbolznv
authored
38eaf32
Copy full SHA for 38eaf32
Commits on Nov 14, 2025
mtmd: add mtmd_log_set (#17268)
ngxson
authored
9b17d74
Copy full SHA for 9b17d74
model : add AfmoeForCausalLM support (#16477)
Show description for e1fcf8b
bartowski1182
and
CISC
authored
e1fcf8b
Copy full SHA for e1fcf8b
fix : Dangling pointer for non-empty trigger words in lazy grammar construction (#17048)
Show description for 6cd0cf7
marek-hradil
authored
6cd0cf7
Copy full SHA for 6cd0cf7
server : fix "can batch with" bug (#17263)
ggerganov
authored
d396b43
Copy full SHA for d396b43
metal : support argsort for ne00 > 1024 (#17247)
Show description for 45c6ef7
ggerganov
authored
45c6ef7
Copy full SHA for 45c6ef7
metal : make the FA extra sizes consistent (#17143)
ggerganov
authored
2606b0a
Copy full SHA for 2606b0a
readme : add RVV,ZVFH,ZFH,ZICBOP support for RISC-V (#17259)
Show description for 307772f
ixgbe
authored
307772f
Copy full SHA for 307772f
Better UX for handling multiple attachments in WebUI (#17246)
allozaur
authored
f1bad23
Copy full SHA for f1bad23
Commits on Nov 13, 2025
ggml-cpu: handle 3d tensors in repack mat_mul (#17241)
Show description for becc481
Alcpz
authored
becc481
Copy full SHA for becc481
Pagination
Previous
Next
You can’t perform that action at this time.