Building voice agents can come with tradeoffs.
💬 Better convos w/ speech to speech models
Vs
🥪 More reliable harnesses w/ sandwich architecture
How to build a voice research agent with both:
✅ Gemini Live: low-latency, natural-sounding convos
✅ Deep Agents: long-running
At Antimetal, we are building the autonomous system for production engineering.
We build a unified world model of your environments and run a suite of agents to improve your system over time.
If you are interested and want to learn more, do reach out!
LangSmith for Startups Spotlight: @antimetal
Antimetal is an autonomous system to manage your software in production. It builds a world model of your environment -- then its agents continuously monitor to prevent issues, triage alerts, and ship fixes.
They are currently in
LangSmith for Startups Spotlight: @antimetal
Antimetal is an autonomous system to manage your software in production. It builds a world model of your environment -- then its agents continuously monitor to prevent issues, triage alerts, and ship fixes.
They are currently in
Building voice agents can come with tradeoffs.
💬 Better convos w/ speech to speech models
Vs
🥪 More reliable harnesses w/ sandwich architecture
How to build a voice research agent with both:
✅ Gemini Live: low-latency, natural-sounding convos
✅ Deep Agents: long-running
agents that can write code can solve problems more reliably
but you need to make sure you execute that untrusted code in a safe environment
here’s how we enable that w a lightweight code interpreter!
a very cool Harbor x LangSmith flow I love to help you “look at the data”:
1. you do evals or rollouts for RL
2. all reward metrics and traces and rollouts get automatically propulates into Experiments & Tracing projects
3. we send agents (or use Engine) to understand how
At @aiDotEngineer World's Fair?
We're hosting a World Cup Watch Party at The Harlequin today from 11-7 with @baseten and @reductoai
One block from the venue!
we launched code interpreters for deep agents last month. Basic idea is to let agents plan, delegate, and organize context using code instead of chained tool calls
Code interpreters don't need a sandbox, but we still need a way to securely run that code! (and running untrusted
Giving agents the ability to write code makes them dramatically more capable.
It also makes security a lot harder.
At LangChain, we have spent a lot of time this year figuring out how to do both.
Memory has somehow consistently been the most exciting area of agent development over the last 3 years (imo), and it's still a largely unsolved problem!!
Wiki's are the biggest advancement I've seen in a long time, and I'm super excited to push more on research here. Great
.@harborframework can now integrate directly with Deep Agents, LangSmith Sandboxes, and LangSmith Observability.
You need to run agents in a real, reproducible, isolated environment, many times in parallel, with a deterministic check at the end.
Harbor solves this problem.