Voice AI is more than just plugging in an LLM. It's an orchestration challenge involving complex AI coordination across STT, TTS and LLMs, low-latency processing, and context & integration with external systems and tools. Let's start with the basics: ---- Real-time Transcription (STT) Low-latency transcription (<200ms) from providers like Deepgram ensures real-time responsiveness. ---- Voice Activity Detection (VAD) Essential for handling human interruptions smoothly, with tools such as WebRTC VAD or LiveKit Turn Detection ---- Language Model Integration (LLM) Select your reasoning engine carefully—GPT-4 for reliability, Claude for nuanced conversations, or Llama 3 for flexibility and open-source options. ---- Real-Time Text-to-Speech (TTS) Natural-sounding speech from providers like Eleven Labs, Cartesia or Play.ht enhances user experience. ---- Contextual Noise Filtering Implement custom noise-cancellation models to effectively isolate speech from real-world background noise (TV, traffic, family chatter). ---- Infrastructure & Scalability Deploy on infrastructure designed for low-latency, real-time scaling (WebSockets, Kubernetes, cloud infrastructure from AWS/Azure/GCP). ---- Observability & Iterative Improvement Continuous improvement through monitoring tools like Prometheus, Grafana, and OpenTelemetry ensures stable and reliable voice agents. 📍You can assemble this stack yourself or streamline the entire process using integrated API-first platforms like Vapi. Check it out here ➡️https://bit.ly/4bOgYLh What do you think? How will voice AI tech stacks evolve from here?
Software Development
Explore top LinkedIn content from expert professionals.
-
-
The open-source AI ecosystem for agents developers has exploded in the past few months. I've been testing dozens of new libraries, and honestly, it's becoming increasingly difficult to keep track of what actually works and what the state of the art is. So, I built an updated map of the tools that matter, the ones I'd actually reach for when building a new agent. The interesting pattern I'm seeing: we're moving past the "ChatGPT wrapper" phase into genuine infrastructure. The overview includes 40+ open-source packages across: → Agent orchestration frameworks that go beyond basic LLM wrappers: CrewAI for role-playing agents, AutoGPT for autonomous workflows, Langflow for visual agent building. → Tools for computer control and browser automation: Browser Use and Stagehand for LLM-friendly web navigation, Open Interpreter for local machine control, and Cua to control Mac environments. → Voice interaction capabilities beyond basic speech-to-text: Ultravox for real-time voice, Dia for natural TTS, Pipecat for complete voice agent stacks. → Memory systems that enable truly personalized experiences: Mem0 for self-improving memory, Letta for long-term context across sessions, LangMem for shared knowledge bases. → Testing and monitoring solutions for production-grade agents: AgentOps for benchmarking, Langfuse for LLM observability, VoiceLab for voice agent evaluation. Full breakdown with GitHub repos links https://lnkd.in/g3fntJVc
-
What happens when you open-source a tool you built for yourself? 10,000+ developers started using it. I created n8n-MCP - a tool that enables AI Agents such as Claude Desktop or Cursor to actually build working n8n workflows. I simply wanted to solve my own problem. The decision to share it publicly led to something I never expected. 🌍 The numbers tell an interesting story: • Docker images downloaded 10.1k times • npx installations: 4.6k • Repository cloned 2.6k times by 1.7k unique developers • Nearly 40k views, averaging 3k unique visitors daily • 1.7k GitHub stars ⭐ • 4 independent creators discovered the tool and created YT tutorials But beyond metrics, what truly amazes me is how the community adapted the tool for their own needs. From solo developers running local instances to entire teams deploying it remotely, each found their own use case. The tool is valued as "the first one that actually works" and "best on the market" The experience taught me a valuable, yet obvious lesson. When you have deep knowledge in certain domain and solve well particular pain point in this domain, we often solve problems we didn't know others had too. 💡 Open source isn't just about code - it's about creating tools that multiply possibilities across the community. Sometimes the most impactful contributions come from simply scratching your own itch and having the courage to share it. Have you tried it yet?
-
Not a joke, many Data Engineers don’t fully understand the Medallion architecture or their caveats. Here’s a simple, crisp breakdown of the Medallion Architecture and why each layer matters: 🔹 Bronze (Raw Ingestion) - All incoming data lands here-> logs, JSON, CSV, streaming events - Data stays in its original form (think Delta Lake tables) - Use schema-on-read to keep raw JSON/XML (no forced schema yet) - Partition by ingest date/hour for fast file pruning - Add audit columns (ingest_timestamp, source_file, batch_id) for full traceability Why care? Bronze is your “source of truth.” You can recover, reprocess, or track every record. 🔹 Silver (Cleansed & Curated) - Cleaned, standardized view of Bronze data - Enforce data types, drop nulls, fill defaults (schema-on-write) - Use joins and dedupe logic (window functions help remove duplicates) - Add data profiling and constraints (NOT NULL, CHECK) to stop bad data early Why care? Silver gives you reliable, consistent tables for analytics, reports, and ML models. 🔹 Gold (Business Aggregations) - Highly curated, aggregated tables or dimensional models - Pre-compute metrics (daily active users, revenue by region) - Use Slowly Changing Dimension (SCD) for customer data - Partition and Z-order in Delta for super-fast queries Why care? Gold delivers high-performance datasets for BI tools and ML feature stores. Key Benefits Across Layers 1. Modularity & Maintainability – Keep ingestion, cleaning, and aggregation logic separate 2. Data Quality – Catch issues step by step 3. Scalability – Stream and batch workloads scale on their own 4. Governance & Lineage – Track every change with audit columns and Delta logs What else you would like to add here ? 𝗖𝗼𝗻𝗻𝗲𝗰𝘁 𝟭:𝟭 𝗳𝗼𝗿 𝗰𝗮𝗿𝗲𝗲𝗿 𝗴𝘂𝗶𝗱𝗮𝗻𝗰𝗲 → https://lnkd.in/gH4DeYb4 𝗔𝗧𝗦 𝗢𝗽𝘁𝗶𝗺𝗶𝘀𝗲𝗱 𝗿𝗲𝘀𝘂𝗺𝗲 𝘁𝗲𝗺𝗽𝗹𝗮𝘁𝗲 → https://lnkd.in/g-iw7FaQ Gif -> Ilum ♻️ Found this useful? Repost it! ➕ Follow for more daily insights on building robust data solutions.
-
In 2011, the Amazon Appstore failed on launch and Jeff Bezos was furious. It was my fault, and I handled one aspect of recovery so poorly that one of my engineers quit. I still regret it 14 years later. Please learn from my mistake. The main lesson is that when you are leading through a crisis, it can feel like it is all about you. It isn’t. It is about: 1) Solving the problem 2) Guiding your team through it The product issue was that there were some pretty simple bugs, and we solved those problem well enough that I was eventually promoted. Where I failed was in guiding my team through the crisis. My leadership miss was that I neglected to encourage and support the engineer who had written the bad code. He did a great job stepping up and supporting the effort to fix the problem, but shortly afterward, he resigned. During the crisis, I failed to make clear to him that we did not blame him for the launch failure despite the bugs. I imagine that left room for him to think we blamed him or that he didn’t belong. It is also possible that others did blame him directly and that I was too caught up in the crisis to realize it. Both instances were my responsibility as the leader of the team. His resignation taught me a valuable lesson about leading through a crisis: No matter how bad the situation is, your team must be your first priority. If you make them feel safe, they will move heaven and earth to fix the problem. If you don’t, they may still fix the problem, but the team itself will never be the same. As a leader, here is how you can give them what they need: 1) Take the blame and do not allow others to be blamed. In some bug cases after this we did not release the name of the engineer outside the team in order to protect them from judgment or blame. 2) Separate fixing the problem from figuring out why it happened. Once the problem is fixed, you can focus on root-causing. This lowers the risk of searching for answers getting confused with searching for someone to blame. 3) Realize that anyone involved in the problem already feels bad. High performers know when they have fallen short and let their team down. As a leader you have to show them the path to growth and success after the crisis. They do not need to be beaten up on- they have taken care of that themselves. 4) See crises and problems as growth opportunities, not personal flaws. Your team comes with you in a crisis whether you like it or not, so you might as well come out stronger on the other side. As a leader, the responsibility for a crisis is yours in two ways: The problem itself and the effect it has on the future of the team. Don’t get too caught up in the first to think about the second. Readers- Has your team survived a crisis? How did you handle it?
-
Copy a project from GitHub or YouTube to land a job… That’s not how it works anymore. “What project should I add to my resume?” That’s one of the most common questions I get from juniors. And honestly, I’ve asked the same thing in the past. Back then, the default answer was simple: “Build an e-commerce website.” “Make a full-stack MERN app.” “Do something with ML “ So we did. Copied projects from GitHub. Followed 3-hour YouTube tutorials. Changed a few colors. Added our name to the footer. Done. It worked… kind of. But here’s what I’ve realised now, especially with how fast AI is evolving: Your resume doesn’t need another “portfolio project.” It needs a “problem solver.” These days, anyone can use Cursor or bolt to build a full-stack app in a few hours. What makes your project different? Not the tech. The problem it solves. Let me give you an example: You want to send 100+ personalized emails to students. Mail Merge? Already exists. But your emails are landing in spam. So you build a small Python script that sends emails like a human would. Slowly. With better delivery. Tiny project. Few lines of code. But real value. That’s what recruiters remember. Not how many lines of code you wrote. But how many problems you solved. You can use AI. You should use AI. But don’t be just a copy-paster. Understand what you’re building. Learn the stack. Because your project should say one thing.. “I don’t just code. I solve.”
-
Open source isn’t free. Someone else just paid the price. Behind every great library is often a tired, unpaid maintainer. Let’s talk about the tools we all rely on: ➡️ Java apps run on Spring Boot, Maven, Log4j. ➡️ Python apps lean on NumPy, Pandas, TensorFlow. ➡️ Web apps, APIs, analytics, and AI models depend on them. Most of these are maintained by developers working late nights, weekends, and holidays — without compensation. A handful of contributors are carrying the weight of global infrastructure. Big companies use these tools at massive scale… but rarely give back. ➡️ One bug in Log4j created a global security crisis. ➡️ A NumPy issue could break critical scientific apps. Meanwhile, maintainers handle security patches, bug reports, support questions, and community drama — often alone. Why does this matter? Because your code, models, and servers wouldn’t run without them. ⚠️ Burnout is real. ⚠️ When maintainers leave, critical software becomes vulnerable. How to support open source: ✅ Contribute — fix bugs, improve docs, add tests. ✅ Sponsor — via GitHub Sponsors, OpenCollective, or direct donations. ✅ Promote — highlight their work in talks, blogs, and posts. ✅ Be thoughtful — don’t treat open source like free labor. ✅ Say thanks — a kind message or tweet can mean the world. Next time you import or build, remember: There’s a human behind that code. Let’s support them. #OpenSource #DevCommunity #SoftwareEngineering #SupportMaintainers #TechResponsibility #BurnoutIsReal
-
Dear Software Engineers, If your app serves 10 users → a single server and REST API will do If you’re handling 10M requests a day → start thinking load balancers, autoscaling, and rate limits /— If one developer is building features → skip the ceremony, ship and test manually If 10 devs are pushing daily → invest in CI/CD, testing layers, and feature flags /— If your downtime just breaks one page → add a banner and move on If your downtime kills a business flow → redundancy, health checks, and graceful fallbacks are non-negotiable /— If you're just consuming APIs → learn how to handle 400s and 500s If you're building APIs for others → version them, document them, test them, and monitor them /— If your product can tolerate 3s of lag → pick clarity over performance If users are waiting on each click → profiling, caching, and edge delivery are part of your job /— If your data fits in RAM → store it in memory, use simple maps If your data spans terabytes → indexing, partitioning, and disk I/O patterns start to matter /— If you're solo coding → naming things poorly is just annoying If you're on a growing team → naming things poorly is a ticking time bomb /— If you're fixing bugs once a week → logs and console prints might do If you're running production → you need structured logs, tracing, alerts, and dashboards /— If your deadlines are tight → write the simplest code that works If your code is expected to last → design for readability, testability, and change /— If you work alone → "it works on my machine" might be fine If you're in a real team → reproducible builds and shared dev setups are your baseline /— If your app is new → move fast, clean up later If your app is in maintenance hell → you now pay interest on every rushed decision People think software engineering is just about building things. It’s really about: – Knowing when not to build – Being okay with deleting good code – Balancing tradeoffs without always having all the data The best engineers don’t just ship fast. They build systems that are safe to move fast on top of.
-
There’s an underrated superpower in tech (and life): knowing who’s worth listening to. Andrej Karpathy is one of those people. Ex-Director of AI at Tesla. Founding team at OpenAI. PhD under Fei-Fei Li. If these creds don't impress you, he also coined the term 'vibe-coding'. When he took the stage at YC AI Startup School in San Francisco this week, I paid attention. Here’s what I took away: 1️⃣ Software 3.0: English as Code. He reframes software’s evolution in three eras: Software 1.0: Hand-coded logic. Software 2.0: Trained models; neural net weights are the program. Software 3.0: You program in English. Prompts are the code. Everyone who can write a clear sentence is, in theory, a coder now. 2️⃣ LLMs aren’t Utilities - they’re Operating Systems. Karpathy’s most powerful framework: we’re in the ✨ mainframe era of AI ✨ In the 1960s OS world, there was ▪️Expensive, centralized compute. Few owned mainframes, many shared them. ▪️Time-sharing. Jobs batched, users were thin clients. ▪️Command-line interfaces. No GUI, just terminals. ▪️Remote access. The computer lived in a data center, users dialed in. In LLMs today? Same story. ▪️Massive, costly, cloud-native. Nobody runs GPT-4 locally. ▪️Thin clients. We pipe requests via browser or API. ▪️No AI GUI yet. We’re typing into terminals (ChatGPT). We’re pre-personal computer. Someone still has to build the AI equivalent of the desktop, the mouse, the spreadsheet. 3️⃣ Partial Autonomy + The Autonomy Slider. Karpathy’s Tesla experience taught him what happens between flashy demos and reliable autonomy: a decade of boring, hard work. In 2013, he rode in a Waymo car that handled 30 minutes of Palo Alto driving perfectly. The demo worked. It’s 2025. We’re still debugging self-driving at scale. The same is true for AI agents. The opportunity is augmenting people with AI “Iron Man suits,” not replacing them with Iron Man robots. Cursor, Perplexity are early examples of where this is going. ▪️They package context, orchestrate multiple LLM calls, and give users GUIs to audit AI output. ▪️They offer an autonomy slider - letting humans choose how much control to give up. The future is co-pilot software - where humans steer, AI assists, and the feedback loop is fast. 4️⃣ Docs and infra need to meet AI halfway. Today’s software is built for humans and APIs. Tomorrow’s needs to be legible to agents: ▪️Ditch “click here.” Use curl. ▪️Replace PDFs with agent-friendly Markdown. ▪️Build tooling that packages context so LLMs don’t fumble their way through HTML and menus. We need to design for a new consumer: not just people, not just code, but people-like machines. We’re in AI’s mainframe era. The personal computing revolution will come. The job now is to build what comes between. And in the meantime, I guess we’ll keep typing into our terminals and hoping the prompt does what we meant.
-
I pitched my software idea to 47 investors across Africa in 2022. Every single one said "NO." But here's what happened next that changed everything about how I view leadership in African tech... The brutal reality nobody talks about: Starting a tech company in Zimbabwe without seed capital isn't just hard—it's like trying to build a skyscraper with your bare hands while standing in quicksand. My co-founder and I had a revolutionary AI solution for recruitment and talent matching. The problem? We were operating from a 2-bedroom flat in Harare, coding on 5-year-old laptops that overheated every 3 hours. But here's the plot twist... Those rejections forced us to pivot multiple times and become the most resourceful leaders we could ever be. The breakthrough came when we stopped pitching investors and started talking to ONE user. We couldn't afford AWS, so we optimized our code to run on potato servers. We couldn't hire senior developers, so we mentored junior talent into world-class engineers. We couldn't afford marketing, so we built solutions so good that users became our evangelists. The real turning point was when I stopped trying to be Silicon Valley 2.0 Instead, I embraced what I call "Ubuntu Leadership"—leveraging our collective strength, community networks, and solving uniquely African problems with African ingenuity. We focused on solving the recruitment problem for one company perfectly. Then we scaled that solution to similar businesses. We created new features based on real user feedback, not investor demands. The uncomfortable truth about African tech leadership: You don't need venture capital to build something meaningful. You need vision, resilience, and the courage to solve real problems for real people first. Every "disadvantage" we faced became our competitive edge. Every "no" taught us to build something undeniably valuable. Every pivot forced us to innovate beyond what well-funded competitors could imagine. To every tech leader reading this from Harare, Lagos, Nairobi, or Cape Town: Your constraints are not your ceiling—they're your creativity catalyst. The next African software giant won't be built by copying Silicon Valley playbooks. It'll be built by leaders who understand that our greatest strength isn't in mimicking others, but in solving problems that only we truly understand. What's the biggest "disadvantage" in your market that you could turn into your competitive edge? Drop your thoughts below—let's rewrite the narrative about what's possible in African tech. 🚀 #AfricanTech #Leadership #Zimbabwe #TechLeadership #AI #SoftwareDevelopment #Entrepreneurship