GitHub · Where software is built

Development Roadmap (LMMs-Eval 0.4)
#688 · Luodian opened on May 27, 2025
1
[Common Issue] Common issues you might encounter when using lmms-eval
#186 · kcz358 opened on Aug 9, 2024
New Task Guide
#299 · Luodian opened on Oct 6, 2024

Labels Milestones New issue

Issue: LongVideoBench Video-LLaVA fails with empty frame list (ValueError: need at least one array to stack)

#964

· omrastogi opened

on Jan 1, 2026

Qwen2.5-VL Expect all tensors on the same devices

#952

· yyy195 opened

on Dec 27, 2025

wenetspeech_test_net ��单评测接入

#951

· wuyongtai7 opened

on Dec 26, 2025

Image features and image tokens do not match

#948

· Resurgamm opened

on Dec 23, 2025

MMVet results are low

#947

· Xnhyacinth opened

on Dec 23, 2025

The experimental results of Qwen3-VL-8B-Instruct could not be reproduced

#935

· FrankYang-17 opened

on Dec 15, 2025

The process can't stop after responding to all samples...

#934

· Maerceci-27 opened

on Dec 14, 2025

It costs a long time to test on the DocVQA

#933

· 01upup10 opened

on Dec 13, 2025

The test results for Qwen3VL show significant discrepancies compared to the official report.

#932

· Fu-Fu-Fu-Fu opened

on Dec 13, 2025

MVBench dataset average result

#925

· ylllllll opened

on Dec 8, 2025

用vllm和不用vllm结果差的很多

#921

· Fu-Fu-Fu-Fu opened

on Dec 3, 2025

I have a question regarding VLLM parameter settings.

#920

· Fu-Fu-Fu-Fu opened

on Dec 2, 2025