Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
vllm-project
/
vllm
Public
Uh oh!
There was an error while loading.
Please reload this page
.
Notifications
You must be signed in to change notification settings
Fork
12.3k
Star
66.6k
Code
Issues
1.8k
Pull requests
1.3k
Discussions
Actions
Projects
20
Security
Uh oh!
There was an error while loading.
Please reload this page
.
Insights
Additional navigation options
Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights
Actions: vllm-project/vllm
Actions
All workflows
All workflows
Actions
Loading...
Loading
Sorry, something went wrong.
Uh oh!
There was an error while loading.
Please reload this page
.
Showing runs from all workflows
260,720 workflow runs
260,720 workflow runs
Event
Filter by Event
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching events.
Status
Filter by Status
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching statuses.
Branch
Filter by Branch
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching branches.
Actor
Filter by Actor
Sorry, something went wrong.
Filter
Loading
Sorry, something went wrong.
No matching users.
[Feature] limit thinking tokens (hard limit)
pre-commit
#77108:
Pull request
#20859
synchronize by
llsj14
3m 43s
llsj14:feat/thinking-budget
llsj14:feat/thinking-budget
3m 43s
View #20859
View workflow file
[Feature] limit thinking tokens (hard limit)
BC Lint
#40340:
Pull request
#20859
synchronize by
llsj14
19s
llsj14:feat/thinking-budget
llsj14:feat/thinking-budget
19s
View #20859
View workflow file
cwm tool parser
pre-commit
#77107:
Pull request
#31340
synchronize by
ErezSC42
3m 29s
ErezSC42:cwm1-tool-parser
ErezSC42:cwm1-tool-parser
3m 29s
View #31340
View workflow file
cwm tool parser
BC Lint
#40339:
Pull request
#31340
synchronize by
ErezSC42
19s
ErezSC42:cwm1-tool-parser
ErezSC42:cwm1-tool-parser
19s
View #31340
View workflow file
[Bugfix]: update global_rank when adjusting rpc_rank to fix layer key error
pre-commit
#77106:
Pull request
#31580
synchronize by
zhaoninge
3m 30s
zhaoninge:dev/zhaoning/adjust_rank
zhaoninge:dev/zhaoning/adjust_rank
3m 30s
View #31580
View workflow file
[Bugfix]: update global_rank when adjusting rpc_rank to fix layer key error
BC Lint
#40338:
Pull request
#31580
synchronize by
zhaoninge
22s
zhaoninge:dev/zhaoning/adjust_rank
zhaoninge:dev/zhaoning/adjust_rank
22s
View #31580
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5195:
Issue
#31602
edited by
xpzwzwz
6s
6s
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5194:
Issue
#31602
edited by
xpzwzwz
8s
8s
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5193:
Issue
#31602
edited by
xpzwzwz
8s
8s
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5192:
Issue
#31602
edited by
xpzwzwz
8s
8s
View workflow file
[Bugfix]: update global_rank when adjusting rpc_rank to fix layer key error
Cleanup PR Body
#35867:
Pull request
#31580
edited by
zhaoninge
15s
15s
View #31580
View workflow file
[Bugfix]: update global_rank when adjusting rpc_rank to fix layer key error
Cleanup PR Body
#35866:
Pull request
#31580
edited by
zhaoninge
12s
12s
View #31580
View workflow file
[Bugfix]: update global_rank when adjusting rpc_rank to fix layer key error
Cleanup PR Body
#35865:
Pull request
#31580
edited by
zhaoninge
17s
17s
View #31580
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5191:
Issue
#31602
edited by
xpzwzwz
7s
7s
View workflow file
[Performance]: If the next request is sent immediately after the previous one finishes, its TTFT will be relatively small; if the next request is sent 10 seconds after the previous one ends, its TTFT will be relatively large.
Label issues based on keywords
#5190:
Issue
#31602
opened by
xpzwzwz
8s
8s
View workflow file
[Core] NGram GPU Implementation compatible with Async Scheduler
BC Lint
#40337:
Pull request
#29184
synchronize by
PatchouliTIS
19s
PatchouliTIS:patchy/async_ngram
PatchouliTIS:patchy/async_ngram
19s
View #29184
View workflow file
[Core] NGram GPU Implementation compatible with Async Scheduler
pre-commit
#77105:
Pull request
#29184
synchronize by
PatchouliTIS
3m 33s
PatchouliTIS:patchy/async_ngram
PatchouliTIS:patchy/async_ngram
3m 33s
View #29184
View workflow file
[Bugfix] Fix byte fallback handling when using outlines
pre-commit
#77104:
Pull request
#31391
synchronize by
Alnusjaponica
3m 13s
pfnet:fix-byte-fallback
pfnet:fix-byte-fallback
3m 13s
View #31391
View workflow file
[Bugfix] Fix byte fallback handling when using outlines
BC Lint
#40336:
Pull request
#31391
synchronize by
Alnusjaponica
19s
pfnet:fix-byte-fallback
pfnet:fix-byte-fallback
19s
View #31391
View workflow file
Add multimodal input method in the documentation
pre-commit
#77103:
Pull request
#31601
synchronize by
labAxiaoming
3m 29s
labAxiaoming:main_multimodal_input_method
labAxiaoming:main_multimodal_input_method
3m 29s
View #31601
View workflow file
Add multimodal input method in the documentation
BC Lint
#40335:
Pull request
#31601
synchronize by
labAxiaoming
21s
labAxiaoming:main_multimodal_input_method
labAxiaoming:main_multimodal_input_method
21s
View #31601
View workflow file
Add multimodal input method in the documentation
BC Lint
#40334:
Pull request
#31601
labeled by
mergify
bot
20s
labAxiaoming:main_multimodal_input_method
labAxiaoming:main_multimodal_input_method
20s
View #31601
View workflow file
Add multimodal input method in the documentation
BC Lint
#40333:
Pull request
#31601
opened by
labAxiaoming
20s
labAxiaoming:main_multimodal_input_method
labAxiaoming:main_multimodal_input_method
20s
View #31601
View workflow file
Add multimodal input method in the documentation
pre-commit
#77102:
Pull request
#31601
opened by
labAxiaoming
3m 35s
labAxiaoming:main_multimodal_input_method
labAxiaoming:main_multimodal_input_method
3m 35s
View #31601
View workflow file
Add multimodal input method in the documentation
PR Reminder Comment Bot
#15581:
Pull request
#31601
opened by
labAxiaoming
8s
8s
View #31601
View workflow file
Previous
1
2
3
4
5
…
10428
10429
Next
You can’t perform that action at this time.