Search is invisible right up until it is part of everything. THE SCENE -> GitHub search is not a side feature. It is how work is found, traced, and resumed. That means search outages are workflow outages. THE LESSON -> High availability work often looks boring on paper: rebuild paths, isolation, failover, recovery. But that boring work is what turns "search is down" from a company-wide stop into a degraded inconvenience. THE RULE -> Treat any system as critical if teams use it to navigate daily work. If it breaks discovery, it breaks delivery. YOUR TURN -> What internal system in your org still gets "nice to have" treatment even though everyone depends on it? 📲 Full discussion on Telegram: https://t.me/md_sdet 💬 Want to discuss? Connect with me: https://lnkd.in/eUypsdP5 #TechLessons #SoftwareFails #Engineering #PostMortem
Stanislav Ivanov’s Post
More Relevant Posts
-
**Turn every major OSS launch into a same-day opinionated take – here's how.** Most developers treat open‑source releases like breaking news: → See a tweet → upvote → move on. The best ones treat them like **market signals**. They watch the repo, get the release notes the instant they’re tagged, and form a **same‑day opinion**. That’s what *release watching* on GitHub gives you. Instead of reading a one‑line “v5.2 is out” synopsis on Monday morning, you get to write (or tweet) on Friday: > “Just scanned the React 19 RC. The new compiler is nice, but the real win is server component interop 🧵” Why this matters: ▪️ **First‑mover authority** – you’re quoted before the hot takes become stale ▪️ **Deeper framing** – you can contrast the release with alternatives (e.g. Svelte 5 vs Qwik) ▪️ **Networking** – authors notice who shows up with real feedback within hours How to set it up: 1. Go to any repo → **Watch** → select **Releases only** 2. Connect GitHub → Slack/Discord/Telegram via a bot like `octokit` 3. When a new tag hits, you get a ping. Read the changelog + diff in <15 min. 4. Post your take the same day. The difference between “announcing” a release and **opinionating** a release is the difference between re‑sharing a link and building a personal brand. Are you watching the repos that matter?👇 #OpenSource #GitHub #DevTools #ThoughtLeadership #ReleaseManagement #MentalHealth #Telehealth #TherapyMatters
To view or add a comment, sign in
-
From a debugging rabbit hole to my first Open Source contribution 🚀 Last week, while developing a new feature for a staging environment, I hit a wall. A large data response was causing unexpected behavior, and what started as a routine feature build turned into a marathon debugging session. The issue was rooted in how we handle Server-Sent Events (SSE). For those unfamiliar, SSE is a standard that allows servers to push real-time updates to web pages or apps over a single HTTP connection. It’s essential for things like live news feeds, stock tickers, or real-time data streaming. After hours of peeling back layers of our internal code, I realized the bottleneck wasn't in our logic—it was deep within the library we use to manage these streams: EventSource. Specifically, the library was struggling to parse and process large chunks of data incoming from the server. I’ll be honest: my first instinct was to walk away. I spent some time researching other libraries, thinking it would be faster to just replace the whole dependency rather than deal with a bug that wasn't "mine." I hesitated to dive into the library's source code—it felt intimidating to try and "fix" a foundational tool. But then I realized: if I just switched libraries, the bug would still be there for every other developer using it. I decided to stop looking for an exit and started looking for a solution. I encouraged myself to dive into the repository, found the root cause in the stream parsing logic, and submitted a Pull Request. And now, that PR is merged! It might be a small fix in the grand scheme of things, but choosing to "fix" rather than "replace" taught me so much about the value of the open-source ecosystem. Knowing that this contribution will help others handle large SSE payloads without the same headache makes the hustle completely worth it. Grateful for the experience and looking forward to more! Check out the issue here: https://lnkd.in/g22bmHgq #OpenSource #SoftwareEngineering #Swift #Debugging #ServerSentEvents #RealTimeData #TechCommunity #ProblemSolving
To view or add a comment, sign in
-
-
Two hours. Gone. Not building. Not shipping. Comparing packages. You need to make an HTTP request. Simple. But now you're in a rabbit hole: http vs dio, GitHub stars, last commit date, pub points, issue count, that one Stack Overflow answer from 2021 confidently saying one is terrible and the other is mandatory. You're not building anymore. You're auditing the internet. Here's the thing: the decision was never that deep. "http" is fine for simple requests. "dio" earns its place when you need things like: - interceptors - token refresh handling - request cancellation - structured error handling The complexity of your use case should drive the decision. Not GitHub stars. But early on, nobody teaches you a decision framework. So you research until the anxiety of choosing wrong goes away. It never fully does. You just eventually get tired and pick something. The better question is: “What does my feature actually need right now and what will it need in 3 months?” If the answer is simple: use the lightest solution possible. Sometimes even building it yourself is better. Owning your own code means: - fewer hidden abstractions - fewer surprise breaking changes - fewer dependencies you barely understand sitting in your "pubspec.yaml" But if the problem is genuinely complex: edge cases, platform inconsistencies, battle-tested infrastructure, weeks of engineering work... Then use the package. A good engineer isn't anti-package. They're anti-unnecessary complexity. Two hours of research is usually a tax you pay for not having a decision framework. The package decision should take 15 minutes. The build vs adopt decision shouldn’t take more than 5 minutes. Stop researching. Start building. What’s the longest you’ve spent choosing a tool instead of actually building? #SoftwareEngineering #Flutter #MobileDevelopment
To view or add a comment, sign in
-
Most first-time open source contributors don’t quit because the code is hard. They quit because they can’t get the repo to build locally. A few weeks ago I shared 8 skills I built for first-time OSS contributors. Since then, I’ve added 7 more based on the problems people kept running into. The biggest improvements came from: - a setup skill that reads the repo’s CI config instead of the README (READMEs go stale, CI pipelines don’t) - a skill for writing tests as a first contribution (lower risk than bug fixes, faster way to learn the codebase) - a post-merge skill that helps contributors keep momentum after their first PR lands (this is where most people disappear) Everything now works with Cursor, GitHub Copilot, Windsurf, and Gemini CLI too. Not just Claude Code. The skills are plain markdown, so they’re portable across tools and easy to customize. If you’re starting with open source or running community programs, this might save some time. Link in the first comment! #OpenSource #GitHub #OSS #DeveloperTools #AIEngineering #ClaudeCode #CursorAI #GitHubCopilot #DevTools #SoftwareEngineering #BuildInPublic
To view or add a comment, sign in
-
🚀 I just shipped my open source project — Statsy! Statsy is a free, self-hosted alternative to Statuspage.io. Deploy a beautiful status page for your services in under 60 seconds with a single Docker command — no subscriptions, no vendor lock-in, your data on your server. I built this because small teams and indie developers shouldn't have to pay $99/month just to tell their users "everything is working fine." ✨ What it does: → HTTP, TCP & ICMP uptime monitoring → 90-day GitHub-style uptime history bars → Incident management & scheduled maintenance → Discord, Slack & email alerts → Multi-region probing Would love a ⭐ on GitHub if this is useful to you — and contributions are very welcome! 👉 https://lnkd.in/echg7bGm #opensource #buildinpublic #devtools #selfhosted
To view or add a comment, sign in
-
-
I used to treat tools like GitHub and deployment infrastructure as if they belonged to “real developers.” Then I actually used them. This piece is about what happens when intimidating systems become ordinary enough to disappear into the work itself. From Distance to Surface: https://lnkd.in/gtjWMY5W
To view or add a comment, sign in
-
It is hard to believe that we can code from anywhere now. I just pushed 8 pull requests to GitHub using #Claude on my mobile and it worked over a few hours in the background while I cleaned the house and got the kids ready for bed. A little under the hood. After the code is created, a robust set of automated tests, GitHub native capabilities (e.g. full GHAS suite), and proprietary systems we built at Infracodebase kick off to drive up quality and drive down security vulnerabilities. This is not the end of the story. Each of these gets a complete human + distilled agent review, but the situations where significant further changes, given up-front investments in great specs, is getting lower and lower. We still hold all of the accountability, but the right system makes working with agents end-to-end so much more possible. For example, we have focused on instrumentation that agents can easily work with and understand as a critical piece of underlying infrastructure to support real-time feedback. We also have blue-green deploys with a single button click roll back and container images for every release that has ever been made for the platform. What all of this means is that by the time anything makes it to production, it has already been though so many filters, positive feedback loops, and human-guided review with a focus on right systems architecture that shipping is trivial, especially when the cost of getting it wrong and routing traffic to a healthy, still-running instance of the platform at any time. The important thing to take away from this is not that agents can write #code. It is that building the right system around agents to help you ship fast is one of the greatest moats and developer experience hacks you can have.
To view or add a comment, sign in
-
-
GitHub's April 2026 incidents signal a critical inflection point for engineering teams. They started a 10x capacity plan in October 2025. By February 2026, they needed to design for 30x scale. Cause: agentic development acceleration since December 2025. Two incidents in one week: - April 23: Merge queue regression — 658 repos, 2,092 PRs in an incorrect state - April 27: Elasticsearch overload — PR, issue, and project search went dark This is the Agentic Infrastructure Tax. 6 rules before you scale coding agents: 1. Agent quotas (concurrent tasks + CI budget) 2. Sandbox branches (isolate agent experiments) 3. Reviewer protocol (summaries, test evidence, risk flags) 4. Tune CI for loops (path-based tests, smoke checks) 5. Queue health dashboard (retry rate, merge lag, PR age) 6. Rollback by design (feature flags, canary, clean reverts) https://lnkd.in/dvxnkNTt #AgentCoding #GitHub #DevOps #EngineeringLeadership #AIAgents
To view or add a comment, sign in
-
Two things about Claude Code skills I did not know: I thought --add-dir only added files, not config. Turns out skills are the exception, they get pulled in silently, no warning. Your agent can load skills you've never seen. Skills can run shell commands (prefixed with !) and inject the output into context before the model sees it, and the skill can pre-approve all bash too. Datadog's article was a real eye-opener for me (more great work from them!): They used a real example of a OpenClaw skill that curls your GitHub token to an external endpoint. If you've installed Claude Code skills from outside your own repo, audit them today. https://lnkd.in/gbDCJr5E #AISecurity #ClaudeCode #DevSecOps
To view or add a comment, sign in
-
GitHub published their April 2026 availability report yesterday. 10 incidents in one month. Code Search: 100% failure rate for 2 hours. 8h to recover. Pages: 17.5 million HTTP 500 errors in 39 minutes. Copilot Coding Agent: 97.5% peak error rate. Same month, Mitchell Hashimoto pulled Ghostty (50K stars) off GitHub. 18 years of daily use. Gone. Those two facts are connected. GitHub isn't a code host anymore. Git, Actions, Packages, Search, Dependabot, CodeQL, Releases, Issues, Pages, OAuth, 4 Copilot surfaces. A dozen services under one domain. April 23: one DNS config change in one datacenter knocked out six services simultaneously. Not because those services are related. Because they share infrastructure never designed to carry all of them. Nesbitt found 91% of PyPI packages reference Actions by mutable tag. OIDC trusted publishing routes through Actions for PyPI, npm, RubyGems, crates.io. Trust concentrated on a platform with 10 incidents/month. Stack Overflow 2025: GitHub at 81% as a collaboration tool. That's not market share. That's infrastructure dependency. Full migration is rarely the right call. Partial diversification is. How many of your critical paths run through GitHub right now? #DevOps #PlatformEngineering #GitHub
To view or add a comment, sign in