Unsloth AI reposted this
You can now run FP8 reinforcement learning on consumer GPUs! ⚡ DeepSeek-R1 demonstrated the power of FP8 GRPO. Now you can try it at home on just a 5GB GPU with Unsloth AI. • Qwen3-14B FP8 GRPO works on 24GB VRAM. Qwen3-1.7B works on 5GB. • We collabed with PyTorch TorchAO to make Unsloth FP8 RL inference via vLLM ~1.4× faster than FP16 • Unsloth uses 60% less VRAM and enables 12× longer context vs. other implementations • Works on any NVIDIA GeForce RTX 40, 50 series and H100, B200 etc. GPUs ⭐ Blog: https://lnkd.in/gC7-fpx8 Qwen3-8B FP8 GRPO Colab notebook: https://lnkd.in/gn7rpUp6