Best model for air-gapped usage? #4081
Unanswered
MyndSpaceAI asked this question in Q&A
Replies: 0
Hi everyone,
First of all, thanks for the wonderful repo, great work!
I’m looking for advice on the best model to use with LMDeploy for information extraction (RAG) from a wide variety of multilingual documents in an air-gapped setup (no external network access).
My environment is limited to a single NVIDIA RTX 5090 GPU, so VRAM and compute efficiency also matter.
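For context, this is roughly how I plan to run whichever model gets recommended, fully offline with the pipeline API. The model path, context length, and cache fraction below are just placeholders I'm assuming for now, not a final choice:

```python
import os

# Air-gapped: make sure nothing tries to reach the Hugging Face Hub
os.environ["HF_HUB_OFFLINE"] = "1"
os.environ["TRANSFORMERS_OFFLINE"] = "1"

from lmdeploy import pipeline, TurbomindEngineConfig, GenerationConfig

# "/models/candidate-llm" is a placeholder for a checkpoint copied onto the
# air-gapped machine; the config values are rough guesses for a single GPU
engine_cfg = TurbomindEngineConfig(
    tp=1,                        # single GPU, no tensor parallelism
    session_len=32768,           # long enough for RAG context chunks
    cache_max_entry_count=0.8,   # fraction of free VRAM reserved for KV cache
)

pipe = pipeline("/models/candidate-llm", backend_config=engine_cfg)

resp = pipe(
    ["Extract the invoice number and total amount from the following document:\n..."],
    gen_config=GenerationConfig(max_new_tokens=512, temperature=0.1),
)
print(resp[0].text)
```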
Does anyone have experience or recommendations for models that perform well under these conditions, e.g. the Qwen3, GLM-4.x, or InternVL 3.5 series?
Thanks in advance for any insights or benchmarks you can share!
• Peter