📣 NVIDIA Blackwell sets a new STAC-AI LANG6 record for LLM inference in quantitative research and algorithmic trading, delivering the highest compute-per-watt and lowest token cost. We tested Llama 3.1 8B and 70B with NVIDIA TensorRT-LLM across multiple NVIDIA platforms. Systems tested: ✅ NVIDIA HGX B200 on Lambda ✅ NVIDIA RTX PRO 6000 Blackwell Server Edition system from Supermicro ✅ NVIDIA Grace Hopper-based server from Hewlett Packard Enterprise See the results 👉 https://nvda.ws/4fFM5ww
great technical blog. u can connect the developed systems to external APIs
Proud to set new records together with NVIDIA! 🏆NVIDIA AI Infrastructure