🚀Introducing DeepSeek-V3.2-Exp — our latest experimental model! ✨Built on V3.1-Terminus, it debuts DeepSeek Sparse Attention(DSA) for faster, more efficient training & inference on long context. 👉Now live on App, Web, and API. 💰API prices cut by 50%+! 🔨Open Source Release 🔗Model: https://lnkd.in/dH8zRB4C 🔗Tech report: https://lnkd.in/djQBKncM 🔗Key GPU kernels in TileLang & CUDA (use TileLang for rapid research prototyping!) ⚡️Efficiency Gains 🤖DSA achieves fine-grained sparse attention with minimal impact on output quality — boosting long-context performance & reducing compute cost. 📈Benchmarks show V3.2-Exp performs on par with V3.1-Terminus.
关于我们
DeepSeek (深度求索), founded in 2023, is a Chinese company dedicated to making AGI a reality. Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism. 🐋
- 网站
-
https://www.deepseek.com/
DeepSeek AI的外部链接
- 所属行业
- 科技、信息和网络
- 规模
- 51-200 人
- 总部
- Hangzhou,CN
- 类型
- 非营利机构
- 创立
- 2023