Currently working on a challenging project , "AI - Judge Assistant" Focusing on Arabic AI reasoning , Whenever a new "buzz" on a release of a new LLM I don't really get dragged by the public benchmarks . I compared DeepSeek R1 , with the new openAI o3-mini on some complex Arabic Judicial Agentic Tasks . Both are equally good on reasoning and analysis , with a slight plus for o3-mini on cultural alignment and more verbose . o3-mini was much faster in inference compared to DeepSeek R1 on nvidia API platform https://lnkd.in/d-9Bn3-s https://lnkd.in/dg9EdzyS
I think that the”buzz” behind deepseek is because introduced a new path for the AI game , before as we were discussing in your lectures , the problem was that chatgpt was trained on almost all of the data in the world so the we thought that the progression might hit a plateau but what i find truly fascinating about deepseek is that they introduced a whole new approach which is offering a high performance model with substantial lower cost, encouraging the increase of commercial use of AI models. I dont think it would have made that buzz if it wasn’t substantially cheaper
Thanks, Dr. Hazem for this very helpful note. Any idea how you were able to create Deepseek R1 key? The service is currently down for me.