How to Apply Deep Reasoning Agents in AI Solutions

Explore top LinkedIn content from expert professionals.

Summary

Deep reasoning agents in AI solutions are intelligent systems that can think, plan, adapt, and make decisions to achieve specific goals—moving beyond simple chatbot responses to more autonomous problem-solving. These agents use memory, tools, and logic to carry out complex tasks and interact in real-world environments.

  • Structure agent workflow: Break down tasks into planning, reasoning, and execution steps to help agents perform reliably and handle multi-step challenges.
  • Integrate tools and memory: Equip agents with access to external tools and robust memory systems so they can recall past interactions and take meaningful actions.
  • Set safety controls: Build in guardrails and escalation paths to ensure agents act responsibly and can defer to humans when faced with uncertainty or critical decisions.
Summarized by AI based on LinkedIn member posts
  • View profile for Brij Kishore Pandey
    Brij Kishore Pandey Brij Kishore Pandey is an Influencer

    AI Architect & AI Engineer | Building Agentic Systems & Scalable AI Solutions

    727,410 followers

    I created this Agentic AI Learning Roadmap to help developers, architects, and innovators understand how to go from basic LLM usage → fully autonomous multi-agent systems. This roadmap breaks down everything you need to master: 1. What Agentic AI Actually Is Beyond text generation — agents reason, plan, self-evaluate, use tools, and interact with environments. 2. Core Concepts: Reasoning Loops, Memory, Planning, Autonomy Controls The shift from “responding to prompts” → “achieving goals.” 3. Frameworks Powering the Agentic Era LangGraph, CrewAI, Google A2A, Anthropics MCP, OpenAI Agents, AutoGen, FalkorDB, Vertex AI Agents, and more. 4. Full Agentic AI Development Stack LLMs → Tooling Layer → Knowledge Layer → Execution Layer. A true systems-engineering approach, not just prompt engineering. 5. Agent Design Patterns ReAct Agents, Planner–Executor, Self-Reflective Agents, Tool-Use Agents, Social Agents, Environment-Aware Agents. 6–8. How to Build & Scale Agentic Systems From defining goals → enabling reasoning → using APIs → adding autonomy → orchestrating multi-agent workflows. 9. Evaluating Agent Performance Success rates, hallucination control, memory effectiveness, safety layers, cost/latency metrics. 10. Learning Resources I curated the best starting points from OpenAI, Google, MCP docs, LangGraph, NVIDIA, Kaggle, Stanford/MIT, and more. Why I built this: Most people know what agents are. Very few know how to design, test, scale, and productionize real agentic systems. This roadmap gives you a complete mental model — from fundamentals → frameworks → deployment → multi-agent orchestration.

  • View profile for Amit Rawal

    Google Applied AI Director | Former Apple AI/ML Product Leader | Stanford | AI Educator & Keynote Speaker

    60,308 followers

    I build AI agents for a living and after auditing 100+ AI agent systems and studying the latest agent playbooks from OpenAI, Google, and Anthropic... Here’s the simplest, clearest guide I’ve found for building real agents — the kind that think, act, and adapt like a team member, not a chatbot. 🧠 What’s an AI Agent? An agent is a system that: ⨠ Uses an LLM/Reasoning model to understand and reason ⨠ Can take action (via tools/functions/APIs) ⨠ Maintains memory and multi-step context ⨠ Operates within goal-driven logic ⨠ And self-corrects when things go wrong Not just respond. Act. Decide. Adapt. The 5 Components of Any Real Agent (All 3 Playbooks Agree) 🧠 Model (LLM) → Powers reasoning and planning (OpenAI, Claude, Gemini) → Use different models for different steps (cost × latency × complexity) 🔧 Tools (or APIs) → Extend the agent beyond knowledge — into execution → Can be action APIs (send email), retrieval (RAG), or data access (SQL, PDFs) 🧭 Orchestration Layer → Loop that plans > acts > adjusts → Uses frameworks like ReAct, Chain-of-Thought, or Tree-of-Thoughts 🛡️ Guardrails → Input filtering, safety checks, escalation logic → Think: “When do we bring in a human?” 🧠 Memory / State → To handle multi-step workflows, learn over time, and recover from errors 🚀 Want to Build? Start Here: ⨠ Pick 1 task with high cognitive load (not high risk) ⨠ Define the goal, success condition, and edge cases ⨠ Give the agent 1 tool and 1 model ⨠ Add logic: “If [X], do [Y]. Else escalate.” ⨠ Test 10 cases. Break it. Refine. ⚡ Pro Tip: Use This Prompt Stack “You’re an expert AI architect. Design a simple agent that completes [goal] using only 1 model, 1 tool, and clear exit logic.” “Add fallback logic if the agent fails or gets stuck.” “Define 5 test cases to validate it.” “Now output this as a visual workflow + API schema.” We don’t need more copilots. We need real agents — that can reason, act, and learn in real time. This is how you build one. — 📥 Want the full Agent Playbook (Google x Anthropic x OpenAI)? ⨠ Comment “AGENT”, connect with me, and I’ll DM you the full playbook. Because in 2025, knowing how to talk to AI isn’t enough. You need to know how to hire, train, and deploy it. ______________________________________________________________ I’m Amit. I help ambitious thinkers and founders design their lives like systems — using AI to work smarter, live longer, and grow richer with clarity and calm. Missed my last drop? ⨠ How o3 is a game changer https://lnkd.in/dQ3Q8s7C? ♻️ Repost to help someone think better today. ➕ Follow Amit Rawal for AI tools, clarity rituals, and high-agency systems.

  • View profile for Bijit Ghosh

    CTO | CAIO | Leading AI/ML, Data & Digital Transformation

    10,744 followers

    Optimizing Any AI Agent Framework with Context Engineering what’s crippling AI agents today isn’t compute or model architecture, it’s how we architect thought. Teams are jamming context windows with oversized prompts, scattered memory, and irrelevant noise, expecting agents to perform with surgical precision. The result? Skyrocketing costs, inconsistent behavior, brittle chains, and agents that fall apart in production. This isn’t a LangChain quirk or an AutoGen edge case, it’s a systemic failure in context design. As AI agents take on multi-step reasoning, tool orchestration, and role-based collaboration, flat prompts and static memory just don’t cut it. The future demands agents that think modularly, prioritize relevance, adapt their cognitive load in real time, and optimize their own context like a self-tuning system. That’s where context engineering comes in not as a prompt hack, but as a discipline. My blog offers the blueprint. Start with Hierarchical Context Layering, breaking context into modular tiers: meta (identity), operational (task), domain (knowledge), historical (memory), and environmental (system state). Instead of dumping all layers into every interaction, we load them conditionally, scaling cognition to the task at hand. Next, we introduce Semantic Context Compression, distilling long documents into core insights, clustering embeddings to avoid redundancy, and progressively expanding context only when required. Less noise, more signal. I then unlock adaptability. Dynamic context windowing, pruning low-impact tokens, decaying stale information, and forking context paths across reasoning branches keeps agents lean, responsive, and state-aware. Most crucially, we inject meta-cognitive logic: agents don’t just respond, they assess ambiguity, estimate confidence, clarify intent, and defer when uncertain. These are the mechanics of scalable reasoning. I also shares concrete implementation tactics: LangChain memory filters and tool isolation, role-based context sync with contradiction guardrails, task-specific context injectors and validation gates, introduce Context State Machines, Virtual Context References, Attention Steering, and Context Health Checks, future-proof patterns for building agent cognition at scale. The value is clear: higher output fidelity, lower token waste, reduced hallucinations, and agents that evolve across workflows, not break. This isn’t about writing better prompts, it’s engineering thought systems. If you're serious about production-grade agents, it's time to move beyond prompt fiddling and embrace the architecture of intelligent context. This is how we scale minds. https://lnkd.in/eH7wDcsr

  • View profile for Armand Ruiz
    Armand Ruiz Armand Ruiz is an Influencer

    building AI systems @meta

    207,067 followers

    Guide to Building an AI Agent 1️⃣ 𝗖𝗵𝗼𝗼𝘀𝗲 𝘁𝗵𝗲 𝗥𝗶𝗴𝗵𝘁 𝗟𝗟𝗠 Not all LLMs are equal. Pick one that: - Excels in reasoning benchmarks - Supports chain-of-thought (CoT) prompting - Delivers consistent responses 📌 Tip: Experiment with models & fine-tune prompts to enhance reasoning. 2️⃣ 𝗗𝗲𝗳𝗶𝗻𝗲 𝘁𝗵𝗲 𝗔𝗴𝗲𝗻𝘁’𝘀 𝗖𝗼𝗻𝘁𝗿𝗼𝗹 𝗟𝗼𝗴𝗶𝗰 Your agent needs a strategy: - Tool Use: Call tools when needed; otherwise, respond directly. - Basic Reflection: Generate, critique, and refine responses. - ReAct: Plan, execute, observe, and iterate. - Plan-then-Execute: Outline all steps first, then execute. 📌 Choosing the right approach improves reasoning & reliability. 3️⃣ 𝗗𝗲𝗳𝗶𝗻𝗲 𝗖𝗼��𝗲 𝗜𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻𝘀 & 𝗙𝗲𝗮𝘁𝘂𝗿𝗲𝘀 Set operational rules: - How to handle unclear queries? (Ask clarifying questions) - When to use external tools? - Formatting rules? (Markdown, JSON, etc.) - Interaction style? 📌 Clear system prompts shape agent behavior. 4️⃣ 𝗜𝗺𝗽𝗹𝗲𝗺𝗲𝗻𝘁 𝗮 𝗠𝗲𝗺𝗼𝗿𝘆 𝗦𝘁𝗿𝗮𝘁𝗲𝗴𝘆 LLMs forget past interactions. Memory strategies: - Sliding Window: Retain recent turns, discard old ones. - Summarized Memory: Condense key points for recall. - Long-Term Memory: Store user preferences for personalization. 📌 Example: A financial AI recalls risk tolerance from past chats. 5️⃣ 𝗘𝗾𝘂𝗶𝗽 𝘁𝗵𝗲 𝗔𝗴𝗲𝗻𝘁 𝘄𝗶𝘁𝗵 𝗧𝗼𝗼𝗹𝘀 & 𝗔𝗣𝗜𝘀 Extend capabilities with external tools: - Name: Clear, intuitive (e.g., "StockPriceRetriever") - Description: What does it do? - Schemas: Define input/output formats - Error Handling: How to manage failures? 📌 Example: A support AI retrieves order details via CRM API. 6️⃣ 𝗗𝗲𝗳𝗶𝗻𝗲 𝘁𝗵𝗲 𝗔𝗴𝗲𝗻𝘁’𝘀 𝗥𝗼𝗹𝗲 & 𝗞𝗲𝘆 𝗧𝗮𝘀𝗸𝘀 Narrowly defined agents perform better. Clarify: - Mission: (e.g., "I analyze datasets for insights.") - Key Tasks: (Summarizing, visualizing, analyzing) - Limitations: ("I don’t offer legal advice.") 📌 Example: A financial AI focuses on finance, not general knowledge. 7️⃣ 𝗛𝗮𝗻𝗱𝗹𝗶𝗻𝗴 𝗥𝗮𝘄 𝗟𝗟𝗠 𝗢𝘂𝘁𝗽𝘂𝘁𝘀 Post-process responses for structure & accuracy: - Convert AI output to structured formats (JSON, tables) - Validate correctness before user delivery - Ensure correct tool execution 📌 Example: A financial AI converts extracted data into JSON. 8️⃣ 𝗦𝗰𝗮𝗹𝗶𝗻𝗴 𝘁𝗼 𝗠𝘂𝗹𝘁𝗶-𝗔𝗴𝗲𝗻𝘁 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 (𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱) For complex workflows: - Info Sharing: What context is passed between agents? - Error Handling: What if one agent fails? - State Management: How to pause/resume tasks? 📌 Example: 1️⃣ One agent fetches data 2️⃣ Another summarizes 3️⃣ A third generates a report Master the fundamentals, experiment, and refine and.. now go build something amazing! Happy agenting! 🤖

  • View profile for Aishwarya Srinivasan
    Aishwarya Srinivasan Aishwarya Srinivasan is an Influencer
    633,656 followers

    If you are building AI agents or learning about them, then you should keep these best practices in mind 👇 Building agentic systems isn’t just about chaining prompts anymore, it’s about designing robust, interpretable, and production-grade systems that interact with tools, humans, and other agents in complex environments. Here are 10 essential design principles you need to know: ➡️ Modular Architectures Separate planning, reasoning, perception, and actuation. This makes your agents more interpretable and easier to debug. Think planner-executor separation in LangGraph or CogAgent-style designs. ➡️ Tool-Use APIs via MCP or Open Function Calling Adopt the Model Context Protocol (MCP) or OpenAI’s Function Calling to interface safely with external tools. These standard interfaces provide strong typing, parameter validation, and consistent execution behavior. ➡️ Long-Term & Working Memory Memory is non-optional for non-trivial agents. Use hybrid memory stacks, vector search tools like MemGPT or Marqo for retrieval, combined with structured memory systems like LlamaIndex agents for factual consistency. ➡️ Reflection & Self-Critique Loops Implement agent self-evaluation using ReAct, Reflexion, or emerging techniques like Voyager-style curriculum refinement. Reflection improves reasoning and helps correct hallucinated chains of thought. ➡️ Planning with Hierarchies Use hierarchical planning: a high-level planner for task decomposition and a low-level executor to interact with tools. This improves reusability and modularity, especially in multi-step or multi-modal workflows. ➡️ Multi-Agent Collaboration Use protocols like AutoGen, A2A, or ChatDev to support agent-to-agent negotiation, subtask allocation, and cooperative planning. This is foundational for open-ended workflows and enterprise-scale orchestration. ➡️ Simulation + Eval Harnesses Always test in simulation. Use benchmarks like ToolBench, SWE-agent, or AgentBoard to validate agent performance before production. This minimizes surprises and surfaces regressions early. ➡️ Safety & Alignment Layers Don’t ship agents without guardrails. Use tools like Llama Guard v4, Prompt Shield, and role-based access controls. Add structured rate-limiting to prevent overuse or sensitive tool invocation. ➡️ Cost-Aware Agent Execution Implement token budgeting, step count tracking, and execution metrics. Especially in multi-agent settings, costs can grow exponentially if unbounded. ➡️ Human-in-the-Loop Orchestration Always have an escalation path. Add override triggers, fallback LLMs, or route to human-in-the-loop for edge cases and critical decision points. This protects quality and trust. PS: If you are interested to learn more about AI Agents and MCP, join the hands-on workshop, I am hosting on 31st May: https://lnkd.in/dWyiN89z If you found this insightful, share this with your network ♻️ Follow me (Aishwarya Srinivasan) for more AI insights and educational content.

  • View profile for Anurag(Anu) Karuparti

    Agentic AI Strategist @Microsoft (30k+) | Applied AI Architect | Author - Generative AI for Cloud Solutions | LinkedIn Learning Instructor | Responsible AI Advisor | Ex-PwC, EY | Marathon Runner

    32,673 followers

    𝐌𝐨𝐬𝐭 𝐀𝐈 𝐚𝐠𝐞𝐧𝐭𝐬 𝐟𝐚𝐢𝐥 𝐢𝐧 𝐏𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧 𝐛𝐞𝐜𝐚𝐮𝐬𝐞 𝐭𝐡𝐞𝐲 𝐜𝐚𝐧 𝐧𝐨𝐭 𝐫𝐞𝐦𝐞𝐦𝐛𝐞𝐫 𝐂𝐨𝐧𝐭𝐞𝐱𝐭.  Here is the 10-step Roadmap to build Agents that actually work. From my experience,  successful deployments follow this exact progression: 1. Scope the Cognitive Contract • Define task domain, decision authority, error tolerance • Specify I/O schemas and action boundaries • Establish non-functional requirements (latency, cost, compliance) 2. Data Ingestion & Governance Layer • Integrate SharePoint, Azure SQL, Blob Storage pipelines • Normalize, chunk, and version content artifacts • Enforce RBAC, PII redaction, policy tagging 3. Semantic Representation Pipeline • Generate embeddings via Azure OpenAI embedding models • Vectorize knowledge segments • Persist in Azure AI Search (vector + semantic index) 4. Retrieval Orchestration • Encode user intent into embedding space • Execute hybrid retrieval (BM25 + ANN search) • Re-rank using similarity scores and metadata constraints 5. Prompt Assembly & Grounding • System instruction + policy constraints + task schema • Inject top-K evidence passages dynamically • Enforce source-bounded generation 6. LLM Reasoning Layer • Invoke GPT (Azure OpenAI) or Claude (Anthropic) • Tune decoding parameters (temperature, top-p, max tokens) • Validate deterministic vs creative response modes 7. Context & State Management • Persist conversational state in Azure Cosmos DB • Apply rolling summarization and relevance pruning • Maintain short-term and long-term memory separation 8. Evaluation & Calibration • Run adversarial, regression, and grounding tests • Measure hallucination rate, retrieval precision, latency • Optimize chunking, ranking heuristics, prompts 9. Productionization & Observability • Deploy via Microsoft Foundry and AKS • Implement distributed tracing, token usage, cost telemetry • Enable human-in-the-loop escalation paths 10. Agentic Capability Expansion • Integrate tool invocation (search, workflow, DB execution) • Add feedback-driven self-correction loops • Implement personalization via behavioral signals The critical steps teams skip: • Step 3 (Semantic Representation): Without proper vectorization, retrieval fails • Step 7 (State Management): Without memory persistence, agents restart every conversation • Step 8 (Evaluation): Without testing, hallucinations go to production My Recommendation: Don't skip steps. Each builds on the previous: • Steps 1-3: Foundation (scope, data, embeddings) • Steps 4-6: Core agent (retrieval, prompts, reasoning) • Steps 7-9: Production readiness (memory, testing, deployment) • Step 10: Advanced capabilities (tools, self-correction) Which step are you currently stuck on? ♻️ Repost this to help your network get started ➕ Follow Anurag(Anu) for more PS: If you found this valuable, join my weekly newsletter where I document the real-world journey of AI transformation. ✉️ Free subscription: https://lnkd.in/exc4upeq

  • View profile for Andreas Sjostrom
    Andreas Sjostrom Andreas Sjostrom is an Influencer

    LinkedIn Top Voice | AI Agents | Robotics I Vice President at Capgemini’s Applied Innovation Exchange | Author | Speaker | San Francisco | Palo Alto

    14,815 followers

    I just finished reading three recent papers that every Agentic AI builder should read. As we push toward truly autonomous, reasoning-capable agents, these papers offer essential insights, not just new techniques, but new assumptions about how agents should think, remember, and improve. 1. MEM1: Learning to Synergize Memory and Reasoning Link: https://bit.ly/4lo35qJ Trains agents to consolidate memory and reasoning into a single learned internal state, updated step-by-step via reinforcement learning. The context doesn’t grow, the model learns to retain only what matters. Constant memory use, faster inference, and superior long-horizon reasoning. MEM1-7B outperforms models twice its size by learning what to forget. 2. ToT-Critic: Not All Thoughts Are Worth Sharing Link: https://bit.ly/3TEgMWC A value function over thoughts. Instead of assuming all intermediate reasoning steps are useful, ToT-Critic scores and filters them, enabling agents to self-prune low-quality or misleading reasoning in real time. Higher accuracy, fewer steps, and compatibility with existing agents (Tree-of-Thoughts, scratchpad, CoT). A direct upgrade path for LLM agent pipelines. 3. PAM: Prompt-Centric Augmented Memory Link: https://bit.ly/3TAOZq3 Stores and retrieves full reasoning traces from past successful tasks. Injects them into new prompts via embedding-based retrieval. No fine-tuning, no growing context, just useful memories reused. Enables reasoning, reuse, and generalization with minimal engineering. Lightweight and compatible with closed models like GPT-4 and Claude. Together, these papers offer a blueprint for the next phase of agent development: - Don’t just chain thoughts; score them. - Don’t just store everything; learn what to remember. - Don’t always reason from scratch; reuse success. If you're building agents today, the shift is clear: move from linear pipelines to adaptive, memory-efficient loops. Introduce a thought-level value filter (like ToT-Critic) into your reasoning agents. Replace naive context accumulation with learned memory state (a la MEM1). Storing and retrieving good trajectories, prompt-first memory (PAM) is easier than it sounds. Agents shouldn’t just think, they should think better over time.

  • View profile for Anthony Alcaraz

    GTM Agentic Engineering Lead @AWS | Author of Agentic Graph RAG (O’Reilly) | Business Angel

    47,046 followers

    We're building AI agents all wrong. ❌ We spend millions on the latest models, hoping more parameters will solve our problems. Yet, our most advanced agents still fail in production because we're focusing on the wrong thing. We blame the model's intelligence when the real flaw is the fragile architecture we build around it. We are building skyscrapers on quicksand to cite my friend Rémi Louf. Here’s how to build on bedrock instead: 👉 Stop hoping for good outputs, start guaranteeing them. An agent's output is a contract with your software. Instead of using parsing logic and prayer , enforce that contract with structured generation. By forcing the model's output to conform to a strict schema like JSON or YAML, you eliminate an entire class of runtime errors, making the agent a reliable and composable component. 🗺️ Give your agent a map, not just a library. Vector search finds similar text; it doesn't understand relationships. A knowledge graph provides the structured map that enables true reasoning. It allows an agent to traverse dependencies and perform the complex, multi-hop logic that separates a simple chatbot from an autonomous system. ⚖️ Enforce the rules of the world and your business world model. An agent's reasoning must be logically sound, not just plausible. Ontologies and type systems act as the laws of physics for your agent's knowledge graph, ensuring its actions are coherent with the rules of its domain. Types, enforces correctness and semantic validity on the agent's structured communications and actions. This operates at two levels of increasing sophistication. First, data modeling libraries like Pydantic ensure that the values within a structured output conform to expected data types, such as ensuring an age is an integer and not a string. This provides a crucial validation step that can even be used to create self-correction loops for the agent. For more advanced agents that generate code, a deeper level of enforcement through type-constrained decoding guarantees that the generated code is not only syntactically correct but also semantically valid according to the rules of the programming language, preventing the vast majority of compilation errors before they occur. Ontologies to ensure logical and conceptual reliability. . It defines a shared, unambiguous vocabulary and a set of rules about how concepts can relate to one another, preventing logical contradictions. This ensures that an agent's reasoning is not just plausible based on learned patterns but is provably coherent with its domain's fundamental principles, a prerequisite for creating collaborative systems where multiple agents and humans can operate on a shared understanding of reality. This prevents semantic failures, like a financial agent generating a report that claims revenue purchased margins. Reliability is an architectural choice, not just a model feature.

  • 𝗧𝗟;𝗗𝗥: Multi-agentic systems are considered the next frontier in GenAI, but most will fail. Why? They lack a crucial step: 𝗮𝗱𝗮𝗽𝘁𝗶𝘃𝗲 𝗽𝗹𝗮𝗻𝗻𝗶𝗻𝗴. Agents can't execute complex enterprise tasks with reasoning alone. They need a deep, structured understanding of your organization—a knowledge graph—to build effective plans and reliably take action. 𝗧𝗵𝗲 𝗣𝗹𝗮𝗻𝗻𝗶𝗻𝗴 𝗕𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸 𝗶𝗻 𝗔𝗴𝗲𝗻𝘁𝗶𝗰 𝗦𝘆𝘀𝘁𝗲𝗺𝘀 We're moving beyond single agent systems to multi-agent systems that promise to tackle complex work. But giving agents a goal and a set of tools isn't enough. When a task requires navigating intricate relationships—like finding which code commits relate to a specific project OKR or which sales deals are at risk due to P1 support tickets—the agents often fail. This is because LLMs, by themselves, struggle with the multi-hop reasoning required for these tasks. They can't reliably connect the dots across siloed enterprise data, leading to brittle agents that can't handle real-world complexity. 𝗧𝗵𝗲 "𝗲𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻" 𝘀𝘁𝗲𝗽 𝗯𝗿𝗲𝗮𝗸𝘀 𝗱𝗼𝘄𝗻 𝗯𝗲𝗰𝗮𝘂𝘀𝗲 𝘁𝗵𝗲 "𝗽𝗹𝗮𝗻𝗻𝗶𝗻𝗴" 𝘀𝘁𝗲𝗽 𝘄𝗮𝘀 𝗳𝗹𝗮𝘄𝗲𝗱 𝗳𝗿𝗼𝗺 𝘁𝗵𝗲 𝘀𝘁𝗮𝗿𝘁. The team at Glean led by Arvind Jain is doing some outstanding work in this area and we can learn from it. Read about their 1/Their agentic engine - https://bit.ly/3InEQv8 and 2/Their knowledge graph - https://bit.ly/4noPLUj 𝗧𝗵𝗲 𝗙𝗼𝘂𝗻𝗱𝗮𝘁𝗶𝗼𝗻: 𝗚𝗿𝗼𝘂𝗻𝗱𝗶𝗻𝗴 𝗔𝗴𝗲𝗻𝘁𝘀 𝗶𝗻 𝗮 𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗚𝗿𝗮𝗽𝗵 The most advanced agentic engines solve this with a robust planning phase grounded in an enterprise knowledge graph. This graph acts as a live, permission-aware map of your company's people, projects, processes, and data.   • 𝗔𝗱𝗮𝗽𝘁𝗶𝘃𝗲 𝗣𝗹𝗮𝗻𝗻𝗶𝗻𝗴: Instead of a fixed, linear plan, the agent continuously consults the graph to understand relationships, validate assumptions, and change course as it learns more.  • 𝗖𝗼𝗻𝘁𝗲𝘅𝘁𝘂𝗮𝗹 𝗘𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻: The agent knows precisely which tool to use (e.g., query Salesforce vs. Jira) because the graph provides the context about where authoritative data lives.  • 𝗥𝗲𝗹𝗶𝗮𝗯𝗹𝗲 𝗥𝗲𝘀𝘂𝗹𝘁𝘀: It transforms ambiguous requests ("How is Project X doing?") into precise, executable steps, dramatically improving task success for complex work like debugging, data analysis, and personalized writing. 𝗔𝗰𝘁𝗶𝗼𝗻 𝗳𝗼𝗿 𝗖𝗧𝗢𝘀 & 𝗔𝗜 𝗟𝗲𝗮𝗱𝗲𝗿𝘀 Building a true enterprise knowledge graph is hard. It requires deep integration, real-time updates, and navigating complex permissions. But it's non-negotiable for creating agents that deliver real value. Your focus shouldn't just be on the agent's reasoning loop, but on building its "brain"—the system of context it operates within. An agent is only as smart as its understanding of its world. In the enterprise, that world is the knowledge graph. If you need help with adaptive planning for multi agent systems, our team can help!  

  • View profile for Sohrab Rahimi

    Director, AI/ML Lead @ Google

    23,834 followers

    Most LLM agents today still behave like procedural systems. They follow a linear plan, call predefined tools, and lose their context after each interaction. The approach works for narrow tasks but fails in open environments where the number of possible actions grows exponentially. DeepAgent proposes a very different architecture that merges reasoning, tool discovery, and execution into a single continuous loop. It is not another workflow framework but a shift toward cognitive automation, where the model plans, acts, and learns within the same reasoning space. The core of the design lies in two mechanisms: 1. The first, called autonomous memory folding, creates a structured memory system that stores and compresses reasoning traces into episodic, working, and tool memories. The agent can recall earlier decisions, detect when its logic begins to diverge, and replan without restarting the entire process. It removes the blind spot that limits most current agents, which optimize locally without remembering why a previous path failed. 2. The second mechanism, Tool Policy Optimization or ToolPO, redefines how agents learn to use external tools. It replaces fragile, slow feedback from real APIs with a simulated tool environment and assigns credit to each intermediate decision, not just the final outcome. This allows the model to refine its tool use policy through reinforcement learning that is both faster and more stable. The results are significant. On complex reasoning benchmarks such as GAIA and ALFWorld, DeepAgent delivers 20 to 30 percent higher success rates than prior architectures like ReAct or Plan-and-Solve. It continues to improve as the reasoning chain lengthens and the number of tools increases, rather than collapsing when complexity grows. This scaling behavior is important because it hints at an emerging capability: agents that can generalize across tool ecosystems and adapt to previously unseen APIs. However, the trade-offs are real. DeepAgent is computationally heavy to train, and its autonomous behavior is more difficult to monitor or reproduce. Debugging a system that can rediscover and reprioritize tools mid-reasoning is fundamentally different from tracing a fixed workflow. Still, the architectural direction feels inevitable. Future agents will no longer separate planning, execution, and learning. Memory, reasoning, and action will operate in one continuous loop. For organizations, this means moving from process automation to policy design, defining how much autonomy to grant, how to constrain exploration, and how to measure reliability when reasoning is no longer step by step but self-evolving. DeepAgent is an early view of that future, where agents begin to reason through tools, not around them, and the boundary between cognition and execution starts to disappear.

Explore categories