I’m currently going deeper into AI, and what I picked up recently completely shifted how I think about building intelligent systems. Here are some key insights I picked up 👇 🔹 Small Language Models (SLMs) vs LLMs Not every problem needs a massive model. I learned that Small Language Models (SLMs) are: 📍More efficient 📍Require less data 📍Cheaper to train and run 📍Better suited for specific, focused problems There’s even a growing trend of running SLMs directly on mobile devices. 🔹 Why RAG Matters One major limitation of LLMs: 👉 They are only as good as their last training data. This is where Retrieval-Augmented Generation (RAG) comes in. RAG allows models to: 📍Pull in real time, up-to-date data 📍Reduce hallucinations (confident but wrong responses) 🔹 MCP (Model Context Protocol) This was a big one for me. MCP is essentially a standard that allows AI models to interact with external tools, like: 📍Local machines 📍Databases 📍Emails, calendars, third party apps It’s what makes AI systems practical and connected to real world workflows. 🔹 Building AI is About Systems, Not Just Models Choosing the right approach means thinking holistically: LLMs SLMs RAG MCP Not one in isolation but how they work together👍🏽. 🔹 Infrastructure Decisions Matter You can: 📍Run models locally 📍Use cloud providers 📍Or build on premises systems Each comes with tradeoffs in cost, scalability, and control. 🔹 Data is Everything Before any model works well, you need to define: 📍Data sources 📍Quantity of data 📍Processing pipelines And most importantly: 👉 Annotation is what turns raw data into ground truth. 🔹 Orchestration: The Real “Intelligence Layer” This part changed how I think about AI systems. Orchestration involves: 📍 Thinking How does the AI approach the problem? 📍 Execution What tools does it use, and in what order? 📍 Review How does it verify its response and avoid hallucinations? This is where systems like MCP and RAG operate behind the scenes. 🔹 Other Concepts I Explored Accuracy scores → Measuring model performance Activation functions → Deciding if a neuron should “fire” AI Agents → Systems that execute tasks autonomously Apache Spark → For large scale, parallel data processing The deeper I go, the clearer it becomes: AI isn’t just about training models. It’s about designing systems that think, retrieve, verify, and act. Still learning. Still building. #ArtificialIntelligence #AIEngineering #MachineLearning #BuildInPublic
Well done Peter 👏
Comrade 🫡