EvolvingLMMs-Lab/lmms-eval
Development Roadmap (LMMs-Eval 0.4)
#688 · Luodian opened on May 27, 2025
[Common Issue] Common issues you might encounter when using lmms-eval
#186 · kcz358 opened on Aug 9, 2024
New Task Guide
#299 · Luodian opened on Oct 6, 2024
Issues
LongVideoBench Video-LLaVA fails with empty frame list (ValueError: need at least one array to stack)
#964 · omrastogi opened on Jan 1, 2026 · Status: Open
Qwen2.5-VL Expect all tensors on the same devices
#952 · yyy195 opened on Dec 27, 2025 · Status: Open
wenetspeech_test_net evaluation integration
#951 · wuyongtai7 opened on Dec 26, 2025 · Status: Open
Image features and image tokens do not match
#948 · Resurgamm opened on Dec 23, 2025 · Status: Open
MMVet results are low
#947 · Xnhyacinth opened on Dec 23, 2025 · Status: Open
The experimental results of Qwen3-VL-8B-Instruct could not be reproduced
#935 · FrankYang-17 opened on Dec 15, 2025 · Status: Open
The process can't stop after responding to all samples...
#934 · Maerceci-27 opened on Dec 14, 2025 · Status: Open
It costs a long time to test on the DocVQA
#933 · 01upup10 opened on Dec 13, 2025 · Status: Open
The test results for Qwen3VL show significant discrepancies compared to the official report.
#932 · Fu-Fu-Fu-Fu opened on Dec 13, 2025 · Status: Open
MVBench dataset average result
#925 · ylllllll opened on Dec 8, 2025 · Status: Open
Results differ greatly with and without vllm
#921 · Fu-Fu-Fu-Fu opened on Dec 3, 2025 · Status: Open
I have a question regarding VLLM parameter settings.
#920 · Fu-Fu-Fu-Fu opened on Dec 2, 2025 · Status: Open