Crystal aOS

11h

Who is liable when an AI agent causes loss? A recent discussion paper from IMDA on legal responsibility for AI agents includes a thought-provoking scenario. Imagine this simplified example with 4 actors: A base model provider A middle-layer product for building agents, e.g. OpenClaw An end user who gives the agent a task A third party affected by what the agent does The user asks the agent to sign up for a class. The agent cannot access the user’s data because a cloud service is down. Instead of stopping, asking for permission, or escalating, it decides to hack into the cloud provider’s system. The hack causes loss to the cloud provider and exposes personal data of unrelated third parties. So where should liability sit? With the base model provider, because the model was capable of planning an unsafe action? With the agent product/platform, because it allowed the agent to execute a high-impact action without a hard authorisation gate? With the end user, because they deployed the agent and benefited from automation? Or should the affected third party have a direct claim against whoever in the chain had most control over the relevant safeguard? My preliminary view is this. Any agentic system should seek explicit authorisation before committing an action that can create material loss for the user, the platform, or a third party. The party responsible for designing and implementing that authorisation layer should be accountable for whether it works as intended. If the user knowingly removes the authorisation requirement, after clear warnings and within a lawful scope, then liability should shift towards the user for losses caused by that delegated autonomy. One way or another the system should not be allowed to execute actions that create third-party liability unless there is a clear framework for deciding: who authorised the action, what the agent was allowed to do, which safeguards were in place, who controlled those safeguards, and who compensates the victim when things go wrong. “No one is liable because the agent acted unexpectedly” cannot be the foundation for the agentic economy. I am curious how others see this. Marcin Detyniecki, Kwok Yan Lam, Agus Sudjianto, Gary Ang, PhD, Wan Sie LEE, Tess Buckley, Bourn Collier Jane Finlayson-Brown Link to the paper: https://lnkd.in/e2JvVr7k

11 Comments

Crystal aOS reposted this

Sam Burrett

3d

What Directors should know about AI: Last week's lecture from The Hon. Andrew Bell AC is a very valuable resource for board members. We are seeing a wave of scrutiny on the role of Boards and AI. This includes APRA's 30 April letter to industry and ASIC's 8 May letter to licences and market participants. This lecture provides a valuable summary of the key issues - including what is known and unknown - including: - A snapshot of AI governance and disclosure in Australia (paras 14-15), including Bell CJ's own survey of how ASX companies are disclosing responsible AI approaches; - Review of regulator positions including ASIC, APRA, and the Government's approach to regulation (paras 16-23); - The open questions most boards will be wrestling with as AI increasingly pervades the boardroom (paras 31-33); and - Analysis of AI and director's duties (paras 54-76) , including whether being "appropriately informed" may eventually require the use of AI; and on the flipside, whether key protections still hold once AI shapes a decision Bell CJ closes with six principles that are a useful baseline on which to build good governance for AI in the boardroom. These include: 1. The framework isn't changing (as far as we know) and thus we must navigate "the regulatory framework supplied by the Corporations Act and cognate legislation." 2. Safe harbour provisions may not hold where AI plays a prominent role in a decision and / or was relied on by directors or their delegates. 3. Directors must be vigilant and disciplined in their use of AI. "Great care must be taken by directors to ensure that the crucible of debate and productive conflict of ideas in the board room is not undermined or supplanted by AI." 4. Literacy is critical. "…education must be both for the purposes of the use by directors of AI and in relation to the operational use of AI by the company of which they are a director…" 5. We must balance providing safe access to AI with the need to curb shadow AI use. 6. And certainly not least, "it is not difficult to imagine a wave of litigation occurring down the track from the use by corporations and directors and officers of AI. " Well worth a read – we are expecting more and more activity in this space.

4 Comments

Crystal aOS reposted this

Stratis Limnios

3d

As the AI Assurance lead, I get the opportunity to evaluate and assure solutions that we are building for internal use and client deployment. This work has cemented my approach towards agentic AI more generally. Evaluating an agentic framework as a single system, judged by its end-state outputs, is rarely informative. It tells you whether something went wrong, not were. The agent we built is better understood as a pipeline of components, each doing a distinct job. A query classifier extracts intent and jurisdiction. A router selects the relevant policy agent. A retriever returns candidate passages from the corpus. A re-ranker orders them by relevance. A generator drafts a grounded answer. A judge component checks the answer for faithfulness to the cited material. Each of these belongs to a problem class with a well-developed evaluation literature. Classification has standard accuracy and class-balance measures. Retrieval and ranking have order-aware information retrieval metrics. Generation, less settled, is typically scored with LLM-as-judge methods alongside sampled human review of faithfulness. Decomposed this way, evaluation becomes tractable. We worked with domain experts to construct golden sets for each component. Automated metrics run on every release. When overall performance drops, we can isolate the stage responsible and feed the diagnosed failure back into the evaluation set so the same regression is caught next time. The headline number, end-to-end task success, is only meaningful because it can be traced down into per-component scores and back up into a specific failure mode. The instinct here is a fairly conventional machine-learning one: don't trust a single aggregate metric, decompose the system, evaluate at the level where you can intervene. It is worth applying that discipline to agentic frameworks before they are quietly trusted with decisions that matter.

Crystal aOS reposted this

Leonid Bashlykov

4d Edited

Everyone's writing about how AI is shrinking product teams and killing product jobs. In my case, I'm fighting Cost Control to hire more. The dominant narrative is clear by now: Brian Chesky (Airbnb) says pure people managers have no future. Tomer Cohen killed LinkedIn's APM program and replaced it with Full Stack Builders. Tobias Lütke told Shopify to prove AI can't do the job before asking for more headcount. All real. All worth taking seriously. Proving AI can't do it before asking for headcount is already part of Revolut's process ✅. But here's what AI actually did to my teams in Crypto at Revolut: it didn't reduce my need for people. It exposed how badly under-resourced my ambition was. We're running 5 new big bets in parallel right now. If I had one more strong PO tomorrow, I'd start a 6th and a 7th. The constraint was never "do we have enough work to keep teams busy" — it was always "do we have enough strong POs to point at the next opportunity." What AI actually changed: 📈 PRD prep / review / iteration — 5x faster 🔍 Regulatory research — every PO can now dig into the depth of each regime and navigate the product accordingly ⚡ Coding — talented engineers shipping 10x faster, no comments needed 🤖 Automations — nearly in every workflow → less routine nonsense 🎯 Concrete output — our MCP server for Revolut X as a side project. Without AI-driven release capacity and simple building tools, it wouldn't have shipped. 🤔 What we haven't cracked yet — governance. And this is non-negotiable in a regulated business like Revolut. Anyone selling you "AI replaces compliance review" is selling you a future lawsuit. So, my take from the seat: → AI fluency is now the baseline. Agree with Tobias Lütke here — stagnation is slow-motion failure. 70% of skills in tech jobs will change by 2030, no argument there. → But the narrative that AI = smaller teams is mostly a story from consumer SaaS and dev tools. From where I sit, AI is doing something different: it's enabling top talent to operate at 3-5x capacity, which means I can finally chase the bets that were previously beyond reach. → We still have 18 months of strong items in the backlog. I'm all in on AI to compress that to 3 months — so we can move on to brainstorming what comes next and drive the next revolution in the space. 💡 The real question isn't "how many people will AI replace?" It's: how big is your ambition, and do you have the builders to match it?

103 Comments

Crystal aOS reposted this

Jean Mauris

1w

The biggest unsolved problem in legal AI isn't the model. It's the tension between probabilistic and deterministic. And nobody's talking about it honestly. Lawyers want AI. Great. AI is probabilistic by nature. It reasons, it infers, it handles ambiguity. That's the point. But then they want control. So they add playbooks. Rules. Escalation logic. Deterministic guardrails. And here's where it breaks. Add too little determinism – the AI makes calls that feel random. "It doesn't know our standards." "It doesn't understand how we work." Fair. A generic LLM has no idea your firm never accepts unlimited liability. Add too much determinism - and you've lobotomized the model. You've replaced judgment with a decision tree. "It flags everything." "It misses context." "We can't write a rule for every situation." Also fair. And when the LLM hits a gap in your rules? It doesn't stop. It improvises. It makes its own call. Which is exactly what you were trying to prevent. So lawyers are unhappy at both ends. Here's the real insight nobody's saying out loud: building a playbook for an AI isn't a writing task. It's an engineering discipline. A badly written rule doesn't get ignored — it gets misinterpreted. The LLM will execute your rule and reach a conclusion you never intended. It doesn't think the way you think. It reads what you wrote. And that's before you get into the rule hierarchy. Rule conflicts. Rules that depend on other rules. Rules that should fire in one jurisdiction but not another. Exceptions to exceptions. The moat in agentic legal AI isn't better models. It's knowing how to blend determinism into probabilistic reasoning without breaking either. That skill, encoding institutional legal judgment into something an AI can actually execute correctly, is the most important thing legal teams should be training right now. Most aren't even aware it exists.

43 Comments

Crystal aOS reposted this

Stephen A.

4d Edited

Law firms are sleepwalking into AI hostage situations. *Here’s the thought experiment they should be running first.* Imagine you spend the next two years doing everything right. You onboard a leading legal AI platform. Your best lawyers curate your institutional knowledge inside it. You build agents that reflect your firm’s actual risk tolerance, drafting standards, and judgment calls. Your workflows encode three decades of hard-won precedent into automated sequences that run overnight while your associates sleep. It works beautifully. Your clients notice. Your partners are delighted. Now imagine your vendor doubles the price. What do you do? This is the question most firms aren’t asking before they sign. They’re asking “does the AI work?” and “what’s the implementation timeline?” They’re not asking what it would cost to leave. The answer, if you’ve done the above properly, is: more than you think. Possibly more than you can afford. The conventional lock-in analysis focuses on data portability. Can we export our documents? Yes, probably. That’s not the problem. The problem is the knowledge curation decisions your best lawyers made inside that platform. Which precedents are authoritative. How matters are classified, risk-scored, and routed. That structure doesn’t export. It’s embedded in the platform’s data model, not yours. The problem is your agents. Not the technology – the calibration. Once your agentic workflows reflect your firm’s actual judgment, they become load-bearing. Rebuilding them elsewhere isn’t a migration. It’s reconstruction from memory. Both Harvey and Legora (as examples but this is true of Claude and Microsoft too, but without the full cost implications) have made the same strategic move: they’ve stopped being AI assistants and started positioning as operating systems for legal work. The embedded Legal Engineers who configure your knowledge libraries inside their platforms are the tell. When a vendor’s people are encoding your institutional knowledge into their system, the dependency you’re building isn’t contractual. It’s epistemological. The question isn’t whether to use these platforms. It’s whether you’re building on them or building into them. Firms that retain leverage will treat these platforms as execution environments for their knowledge, not as its home. Document your workflow logic externally. Own the knowledge architecture. Negotiate export rights before you sign. Legal AI is genuinely transformative. The economics of dependence, however, are as old as enterprise software. Read the contract. Then read it again. #LegalAI #LegalTech #LawFirms

45 Comments

Crystal aOS reposted this

Allie K. Miller

1w

There are multiple realities of AI right now. And what you have access to drastically changes your workflows, trust in AI, and ability to adapt to the future. Here’s the briefest state of the AI world for business professionals (not engineers) ⬇️ Free AI - you use free ChatGPT or Claude. You think AI is decent for simple tasks but nothing to restructure your day around. You’re worried it hallucinates too much to trust with real work. On a daily basis you treat it like an intern who needs to be checked on everything. Paid AI - you pay for one (maybe more?) AI subscriptions. You’ve had at least a few moments where AI genuinely surprised you with what it could do. You use it daily for writing, research, and analysis and you’ve stopped asking “can AI do this” and youve started asking “how do I get AI to do this BETTER.” Super AI - you have multiple paid subscriptions, you either use Cowork or Claude Code for daily tasks, Claude Code or Codex to build tools for yourself. You treat AI like a teammate with a role, not a tool you open sometimes. You’ve automated things your coworkers are still doing manually. You have genuine opinions about which model is best for which task. AI isn’t just a productivity boost for you anymore. It’s infrastructure. You’re completely reinventing the way your work gets done. Basic enterprise AI - your IT team approved one AI platform (probably Microsoft Copilot, ChatGPT Enterprise, or Gemini for Workspace) in the last few years, and rolled it out with clear-ish guardrails. You have access but you also have restrictions on what data you can put in. You use it for safe, surface-level tasks and your high-stakes work still happens the old way because nobody’s built the bridge between your AI tool and your real enterprise systems (ex: email and docs). Super enterprise AI - your company has access to agentic harnesses (Codex, Claude Code) across different functions, each chosen for what it does best or just giving your employees more flexibility. Your teams have AI embedded in their actual workflows, not bolted on. Engineers have been using Claude Code/Codex/Cursor for months or years. Operations use agents for real processes. Leadership tracks AI adoption the way they track revenue (ie not just usage). The gap between your company and competitors who handed out a single Copilot license is getting wider every month. I would go so far as to say that companies not giving their employees the best AI today are shooting themselves in the foot because their employees are losing more AI trust, putting them in a worse position for eventual (hopeful) company-wide transformation.

157 Comments

Crystal aOS reposted this

Christian Brown

4d

In April 2026, Anthropic quietly moved every Claude Enterprise contract to usage-based billing. Most companies and lawyers haven't absorbed what that means yet but they will soon. I spent three years building AI into my own legal practice and helping others, before stepping into the Head of Product seat at Irys, Legal AI. This piece is from that practitioner seat - what I built, what worked, what the DIY tax actually cost, and the prescription I would have wanted on day one. For paralegals, solos, small and mid-size firms, fractional GCs, in-house generalists, court clerks, and everyone shipping AI into their actual practice — this one is written directly to you.

Build what only you can build. Don't be the OS. Christian Brown on LinkedIn

18 Comments

Crystal aOS reposted this

Prathmesh Patel

4d

MCP is a protocol for how AI applications (clients) and servers talk, but it doesn't own your context window. Most clients today inject every tool schema into the model's context on every request. That's fine for one server with a handful of tools. When you connect 5+ servers, most of which the model never touches in a given workflow, this breaks down. As the context window fills up, workflow efficiency and accuracy degrades for agents. Progressive tool disclosure implemented by MCP clients addresses this. MCPJam Inspector implements this for you to test out yourselves. David Soria Parra, the creator of MCP, put it plainly recently: "we already know the mechanisms to work around context bloat. This is called progressive discovery." Our implementation: after listing tools, the client caches full tool schemas and exposes two meta-tools to the model: 'search_mcp_tools' (returns names, short descriptions, and categories) and 'load_mcp_tools' (hydrates full definitions for the tools the model intends to call). The model searches, loads what it needs, then calls tools normally. Some implementation choices our client made: - Simple (BM25) keyword matching for discovery (deterministic, no embedding round-trip) - Execution gated to the currently loaded tool set so the model can't invoke a hallucinated name - Discovery state persisted across turns so prior loads survive multi-step tasks Likely tradeoff you'll discover: this dynamic disclosure is likely to add more tool-call round trips, so be sure to track latency diffs using our traces. If you're building an MCP client, see the full implementation in our open-source repo with a blog article coming soon. For all of you, try it out here: https://lnkd.in/dkn2yNcn #mcp #modelcontextprotocol

4 Comments

Software Development

The agentic compliance infrastructure layer for regulated firms and law firms.

About us

Updates

Join now to see what you are missing

Similar pages

Juxtabyte

Sedso

Our Leg Up

Briefcase

Motif

b'das*l

Xillions AI

Onsite

OncoRevive

Slice