Sign in to view Reynold’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Reynold’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
San Francisco, California, United States
Sign in to view Reynold’s full profile
Reynold can introduce you to 10+ people at Databricks
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
38K followers
500+ connections
Sign in to view Reynold’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Reynold
Reynold can introduce you to 10+ people at Databricks
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Reynold
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Reynold’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Activity
38K followers
-
Reynold Xin shared thisThe future of databases is being built directly on top of object stores. We call this the Lakebase architecture. For a long time, the industry treated data lakes strictly as analytical or offline storage. But the Lakebase architecture is changing that by enabling true operational databases directly on top of the lake. I believe this is the future of data infrastructure. It is how every database, whether it's an OLTP system or a vector database, should be built moving forward. Of course, delivering the stringent performance requirements for operational databases on top of object stores require some creative engineering. Really excited to see more real-world examples of this architecture emerging. The team at Zilliz just shared a piece on why they rebuilt their vector database using this exact approach, and it perfectly captures where the industry is heading. Check it out here: https://lnkd.in/gKxY3bHXWhy We Built Vector Lakebase: Rethinking Unstructured Data Architecture for AI - Zilliz blogWhy We Built Vector Lakebase: Rethinking Unstructured Data Architecture for AI - Zilliz blog
-
Reynold Xin shared thisOracle has spent the last two weeks writing articles comparing Oracle (and PDB) to Lakebase, and it highlights a massive philosophical divide in how we view databases in the agentic era. They are trying to retrofit heavy, traditional architectures for AI. We believe Lakebase are the future because agents need something entirely different: ⚡️ Super simple APIs: so agents don't have to read a giant manual and hallucinate a query. ⚡️ Sub-second provisioning & auto-scaling: so you aren't paying legacy-level prices for idle time. ⚡️ Branching: Git-style branching to create isolated, safe environments for agents on the fly. ⚡️ Automatic backup & restore: so you don't sweat it when an autonomous agent inevitably drops a table. The numbers speak for themselves. Lakebase is our fastest growing product. In the last few months alone, we've seen database start rate 30X, and now we are starting tens of millions of databases EVERY DAY. Some of these databases have 500 level deep branches and lifetime of just seconds due to how fast agents move. Go try it yourself in a few seconds on neon.com! The team has been cooking hard to push this gap even further. Come to Data and AI Summit next month to hear about some major new breakthrough capabilities. 🚀 (Links in comments so you can read their take)
-
Reynold Xin reposted thisReynold Xin reposted thisExciting news! �� Lovable now integrates with Databricks, providing a natural language interface that allows anyone, regardless of technical skills, to build live data apps can read and write data stored in Databricks. Bridge the gap between complex data engineering and beautiful, functional front-ends. I tested it with SEC filing data. You can go from search to analyzing detailed financial statements in minutes. Try it yourself! #databricks #lovable
-
Reynold Xin reposted thisReynold Xin reposted thisExcited to announce GPT-5.5, our smartest and most intuitive model yet! 5.5 now understands what you mean, not just what you type – perfect for handling ambiguity, planning multi-step work, and actually helping complete tasks. Earlier models already did this well, but 5.5 moves further toward understanding what you’re trying to accomplish, especially at work where things aren’t perfectly packaged. From my conversations with hundreds of enterprises, this is the type of model capability they look for as they make AI a core intelligence layer of their infrastructure at companies like NVIDIA, Lowe's Companies, Inc., Cisco, BNY, and more. To celebrate today’s launch, Patrick Wendell at Databricks and I had a chance to sit down and talk about GPT-5.5’s impact on joint customers, and how builders and non-technical teams can harness the power of this latest model directly in Databricks’ platform. Congrats to all the research and deployment teams for this amazing milestone!
-
Reynold Xin shared thisMatei just received the ACM Prize in Computing, one of the most prestigious awards in computer science. It's hard to think of anyone who deserves it more. Few people have had as much impact on how the world works with data and AI, and Matei has done it all with a focus on building open source tools that are accessible to researchers, nonprofits and enterprises across all industries. Watching his body of work compound over the years is a privilege. Congrats, Matei!Reynold Xin shared thisWe're incredibly proud to congratulate our co-founder and CTO, Matei Zaharia, on receiving the ACM Prize in Computing for his development of distributed data systems that have enabled large-scale machine learning, analytics, and AI. Matei's open-source contributions have fundamentally changed how organizations work with data and AI — including Apache Spark™, Delta Lake, and MLflow. Researchers, nonprofits, startups, and enterprises across every industry have built on the foundation he helped create. Now he's pushing the frontier further, focusing on building and scaling reliable AI agents through open-source research like DSPy and GEPA. Matei, this recognition is so well deserved. We're honored to build alongside you every day. https://lnkd.in/gZTw65kW
-
Reynold Xin reposted thisReynold Xin reposted thisWe're incredibly proud to congratulate our co-founder and CTO, Matei Zaharia, on receiving the ACM Prize in Computing for his development of distributed data systems that have enabled large-scale machine learning, analytics, and AI. Matei's open-source contributions have fundamentally changed how organizations work with data and AI — including Apache Spark™, Delta Lake, and MLflow. Researchers, nonprofits, startups, and enterprises across every industry have built on the foundation he helped create. Now he's pushing the frontier further, focusing on building and scaling reliable AI agents through open-source research like DSPy and GEPA. Matei, this recognition is so well deserved. We're honored to build alongside you every day. https://lnkd.in/gZTw65kW
-
Reynold Xin reposted thisReynold Xin reposted thisThe Databricks Lakebase engineering team just shipped a real step toward invisible maintenance. For Lakebase Postgres, a new compute node is brought up ahead of a scheduled update and prewarmed using the primary’s cache footprint and WAL stream. When it is ready, it takes over. No cold cache, no throughput drop, no disruption to the workload. This is fundamental systems work. Stateless compute, shared storage, and being precise about what to warm and when. It removes a failure mode most databases still expose during routine patching. Well done Hans Norheim and the entire Lakebase eng team who contributed to this milestone. 🧱 🔥 Check out how we did it: https://lnkd.in/e7KPYyWR
-
Reynold Xin shared thisThe dynamics of cybersecurity defense are changing rapidly. Attackers are using AI agents that don't sleep, can connect the dots faster than humans, and can exploit almost any surface area. Now, even someone without a technical background can deploy a team of agents to mount an attack. If defenders want to level the playing field, we have to change our approach: we need to collect and analyze ALL data rather than relying on selective filtering. We need an open ecosystem. And most importantly, we need to fight agents with agents. Today, we are taking a massive step in that direction by announcing Lakewatch, the open agentic SIEM. Unlike traditional SIEMs, Lakewatch empowers organizations to store and analyze ALL of their data at scale and at a low cost, running advanced detections and analysis with AI agents. Building this wouldn't have been possible without our amazing design partners. We’ve been working with National Australia Bank (NAB) since the very beginning. In our first collaboration meeting, I remember the NAB team bringing up the exact threats of the AI era and the necessity of an open lakehouse architecture. In fact, Patrick Wright, Sandro Bucchianeri, and Rob Smith practically pitched our own vision to us before we even told them what we were building! A huge thank you to these visionaries. We are so lucky to have you on the Lakewatch journey. Check out the blog to see how we are building the future of security together: https://lnkd.in/gZiZPb3EBuilding the future of security with NAB with LakewatchBuilding the future of security with NAB with Lakewatch
-
Reynold Xin reposted thisReynold Xin reposted thisDatabricks CEO and co-founder Ali Ghodsi is taking the stage at #RSAC2026 to address the most critical challenge in security: the shift from manual operations to machine-scale AI automation. Hear why today's security stack — built for yesterday's data volumes and human-speed threats — is no longer enough. Ali will break down what's coming next and why an open, agent-first architecture is the path forward for CISOs. https://lnkd.in/gC4E2wm7
-
Reynold Xin liked thisReynold Xin liked thisI'm excited to share that I've been elected as a Committer for Apache Spark. Spark has impressed me for as long as I can remember, both in its technical innovation and the community behind it. It's been a privilege to contribute back to a project that has had such a profound impact on the data ecosystem, and I hope I can help it continue to succeed for years to come. I'm especially grateful for the support, feedback, and collaboration from the Spark community and my teammates at Apple and then Databricks. A special thank you to everyone who reviewed my contributions, provided guidance, and helped me learn along the way! #ApacheSpark #OpenSource #ApacheSoftwareFoundation #BigData #DataEngineering
-
Reynold Xin liked thisReynold Xin liked thisSome exciting news from the mathematics chapter of my life! Our paper, “Guts in sutured decompositions and the Thurston norm”, has been accepted for publication in Geometry & Topology, one of the most prestigious journals in geometry and topology. I spent two years at UC Berkeley earning a PhD in mathematics, studying 3-manifolds, geometric topology, and hyperbolic geometry. This paper, written with my advisor Ian Agol, is one of the few papers that came out of that period. Research often moves on a very different timescale than startups. It’s been years since this work was done, but seeing it finally reach publication makes the journey especially meaningful. I’m deeply grateful to Ian for his support, mentorship, and countless mathematical conversations over the years. It’s nice to be reminded that even in the age of AI, beautiful mathematics remains timeless.
-
Reynold Xin liked thisReynold Xin liked thisIt is fantastic to see #uoft alumni Reynold Xin and Mike Murchison being featured during #TorontoTechWeek, via University of Toronto Entrepreneurship's Desjardins Speaker Series, to discuss their journeys building successful tech startups. What an incredible opportunity to hear from these brilliant founders about lessons learned and seizing the AI moment.
-
Reynold Xin liked thisReynold Xin liked thisEarlier today Eduardo Gomes, Gaurang Joshi, Karl Pullicino, and I had the privilege of sitting down with Reynold Xin, Co-founder and Chief Architect at Databricks. What stood out wasn't just the depth of technical insight, but how openly Reynold shared his perspective on where the industry is heading, and how thoughtfully he engaged with us on our priorities and use cases. Nearly five years ago, a similar conversation gave us the conviction to embrace the Lakehouse paradigm and begin our Data Modernization journey with Databricks. Today's discussion left us with the same clarity and confidence about what comes next, a clear sense of how the Databricks platform is constantly evolving, and the reassurance that we've chosen the right partner to get us there. A special shoutout also goes to Kasthuri Thambipillai and Mifrah K., not just for making conversations like this possible, but for their ongoing support with our initiatives. Looking forward to the upcoming Data + AI Summit, and the exciting announcements that come with it. 🚀 #AI #BallysIntralot #Data #Databricks #DAIS #Innovation
-
Reynold Xin liked thisReynold Xin liked thisThrilled to share I've started at Stanford University's Department of Pathology in addition to Arc Institute. Looking forward to a shorter commute after 5 years at University of California, Berkeley's Department of Bioengineering and embarking on daring new projects. We're recruiting multiple postdocs and technical staffwho share our vision of programming biology and building tangible products that impact everyday lives in the real world! - Lab website: https://lnkd.in/ggTqJEgh - Machine learning for biology postdoc: https://lnkd.in/g2vuPUzv - Biological design postdoc: https://lnkd.in/gW7UEqZ2 - Molecular technology development scientist: https://lnkd.in/gfabFssU - Biochemistry scientist: https://lnkd.in/gM5znDUi We are a hybrid lab of experimental and computational scientists working across bioengineering and machine learning. Recent work has included genome foundation models, metagenomic mining, DNA recombinases, AI-designed molecular systems, ML-guided directed evolution, and perturbation prediction Lately, we have been thinking about the following research frontiers (your innovative ideas could be related, or broadly fit into our thematic interests): - AI agents and virtual cell models for scientific research and drug discovery - Discovery and engineering of modulators/probes/drug-like molecules for physiological control and human enhancement (e.g. sleep, appetite, energy, etc) - Immune rejuvenation and skin/microbe/barrier tissue rejuvenation for aging-related phenotypes - New programmable molecular tools for synthetic biology and AI-guided perturbations of cellular behavior
Experience & Education
-
Databricks
*********
-
********** ** *********** ********
****** ** ********** ******* ******** ******* undefined
-
-
********** ** *******
**** *********** *******
View Reynold’s full experience
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
View Reynold’s full profile
-
See who you know in common
-
Get introduced
-
Contact Reynold directly
Other similar profiles
Explore more posts
-
Bunty Shah
MSCI Inc. • 4K followers
[AI paper] GRPO has a hidden flaw in Multi-Reward settings. It’s time to decouple your normalization. 📉 As AI Architects, we are moving from single-objective RL (just "get the answer right") to multi-objective RL (accuracy + format + length + safety). We typically sum these rewards and throw them into GRPO. A new paper from NVIDIA, "GDPO: Group reward-Decoupled Normalization Policy Optimization," demonstrates that this naive summation causes Reward Signal Collapse. The Architectural Failure Mode: If you sum disparate rewards (e.g., a binary Format reward and a scalar Accuracy reward) and then normalize, different raw reward combinations can map to identical advantage values. Example: A rollout with rewards (0, 2) might yield the same normalized advantage as (0, 1) due to group statistics, effectively deleting the gradient signal for the second objective . The Solution: GDPO The fix is architectural: Decoupled Normalization. Instead of normalizing the sum, GDPO normalizes each reward objective independently within the group before aggregation. Impact: This restores the resolution of the training signal. On Tool Calling and Math Reasoning tasks (DeepSeek-R1/Qwen), GDPO converges where GRPO fails or plateaus. If you are training agents that must balance strict formatting (JSON) with reasoning quality, this is the correct objective function to use. 👇 Link to the paper in the comments. #AIArchitecture #RLHF #NVIDIA #GRPO #DeepLearning #LLM #Alignment #Research
21
1 Comment -
Jiaqi Xu
Meta • 529 followers
KDD 2026 paper +3 🚀 Excited to share that our team had 3 papers accepted to the 2026 KDD! These works represent recent years of our efforts on large-scale recommendation model training, spanning model co-design, GPU kernel optimization, and compiler optimization across the full training stack 🚀 Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design RecCompl: Efficient Model Compilation for Industrial Scale Recommendation Models with PyTorch 2 Optimus: A Generic Operator-Level PyTorch Model Transformation Framework The first work introduces Kunlun, our next-generation scaling-up foundation model architecture for GEM (Generative Ads Model). We recently uploaded the paper to arXiv (https://lnkd.in/gKzFSzcF), which also includes our newly open-sourced GDPA kernel work (https://lnkd.in/gpJ879HG) The second and third works focus on our optimizations on PyTorch 2 compiler stack for large-scale recommendation systems. RecCompl shares how we achieved full-model compilation for production recommendation models with PT2. Optimus is a continuation of our operator-level graph optimization work on PT2. We previously shared some of the core ideas in our PyTorch blog. (https://lnkd.in/gBi3fUqJ) Over the past three years, thanks to the amazing effort from the team, we’ve solved many real-world production challenges and successfully launched these optimizations across most Ads models in production. Huge congrats to the entire team — I’m incredibly proud of what we’ve built together. A lot of hard work, late nights, and iteration went into making these systems work in real production environments 💪 Excited to continue pushing large-scale recommendation systems forward and looking forward to sharing more details soon 🔥
139
2 Comments -
Rivindu Perera
Shōgun Ventures • 11K followers
For decades, companies chased cheaper labour. Now they’re eliminating it. Oracle’s AI pivot signals the end of traditional labour arbitrage. They moved software engineering to lower-income countries to save money. The logic was simple: cut costs and improve the P&L by shifting roles to cheaper markets. Cheap labour was never the real advantage in tech. It often led to lower-quality software. What matters is great engineers, regardless of location. If AI is writing the code, the value shifts. Not to more engineers, but to better ones. The future is smaller teams of exceptional engineers, amplified by AI. #AI #layoffs #employment #jobs https://lnkd.in/eXgdBj-W
20
1 Comment -
Shirly Ozer
Team8 • 2K followers
Solid (AI for Data) is out of stealth! I am excited to share the news about our portfolio company's public launch with $20M in seed funding! ✨ Solid is tackling the biggest bottleneck in Enterprise AI: the lack of context. They are building the AI enablement layer, that moves enterprises’ AI from pilot to real-world production. Huge congratulations to ProFounders Yoni Leitersdorf and Tal Segalov and the entire Solid team. I enjoyed being part of your early ideation and company-building journey! Now let's scale it big time 🚀 This is a generational company at the foundation of the enterprise AI stack, reflecting our belief that the AI era is defined by reliability and results. #SuccessbyDesign Read more: https://lnkd.in/dX6JH6J5 Aviad Harell, Amir Zilberstein, Nathaniel Tavisal, Omri Sela, Noa Bar-Yosef, Robert Wiseman, Nick Aharoni, Noa Hen, Tomer Tirosh, Alon Melcer, Jonathan Bergerbest, Omer Biran, Tal Levi, Aviv Turecki, Liran Grinberg, Ori Barzilay, Ori Yankelev, Asaf Azulay, Tal Levi, Aviv Turecki
89
3 Comments -
Liran Hason
Coralogix • 13K followers
Less than 6 months after joining the Coralogix family, the Aporia team is back with a bang: Olly. Olly is an AI-native Observability Agent that turns observability data into actionable insights. * 3 months → integrated Aporia into the Coralogix platform (aka AI Center). * <6 months → a brand-new offering that redefines Observability. We’re just getting started. Watch the demo, share your thoughts, and feel free to guess what's next on our roadmap. 😉
363
23 Comments -
Nikhil Kassetty
Gen AI Global • 5K followers
𝗛𝗼𝘄 𝗠𝗖𝗣 𝗦𝗲𝗿𝘃𝗲𝗿𝘀 𝗠𝗮𝗻𝗮𝗴𝗲 𝗦𝗲𝘀𝘀𝗶𝗼𝗻𝘀 𝘄𝗶𝘁𝗵 𝗟𝗟𝗠𝘀 𝗮𝗻𝗱 𝗖𝗼𝗻𝘁𝗲𝘅𝘁 𝗪𝗶𝗻𝗱𝗼𝘄𝘀 Managing sessions in LLMs isn’t just about keeping track of prompts. It is about orchestrating memory, context, and tools seamlessly across multiple agents. That’s where MCP servers come in. Here’s the flow I captured visually → Context Window as Working Memory: Holds system prompt, user inputs, assistant responses, tool calls, and rolling context → MCP Servers: Each server maintains its own session state, ensuring consistency across interactions → System Prompt + Tools + Code Loops: Work together to power agentic behavior while grounding outputs in real tasks → LLM Integration: The context is constantly summarized and updated before being sent to the model This structure ensures scalable, multi-session orchestration, which is critical for enterprise AI where multiple tasks and tools interact simultaneously.
148
14 Comments -
Tim (Xin) Zhou
Micas Networks • 1K followers
The most interesting portion of the paper to me is chapter 4.2, a significant optimization for inference workload on a small GPU cluster affordable to midsize enterprise on-prem deployment. This design ensures a balanced computational load across devices, substantially boosting overall throughput, particularly during the inference phase. A key enabler for this advancement is Huawei's unique full-stack ownership of the NPU hardware (specifically the Ascend NPU platform), kernel, operating system and the pangu LLM. This vertical integration empowers Huawei to develop highly specialized optimizations, including the creation of innovative fused operators like MulAttention and SwiftGMM, which are custom-engineered to dramatically accelerate model inference specifically on the Ascend NPU platform. https://lnkd.in/gKiNYpat
7
1 Comment -
Venkatesh Peddi
Chiratae Ventures • 6K followers
CtrlB – changing orchestration of telemetry data. Enterprises generate terabytes of logs every day, but traditional data architectures make it impossible to store and search this data comprehensively. CtrlB’s disruptive architecture is questioning these long-held industry assumptions and have built a platform that delivers 10x faster search at 90% lower cost. Excited to announce our investment in CtrlB’s $2.5M seed round and thrilled to join Adarsh Srivastava & team CtrlB in their journey! More details of why we invested in CtrlB - https://lnkd.in/gZNEVzER (by Naimil Shah) Aman Raj Chiratae Ventures
154
4 Comments -
Christopher McMahon
GoodLabs Studio • 444 followers
A More Principled Metric for RAG Evaluation Evaluating retrieval-augmented generation systems has long been entangled with downstream performance metrics. Existing approaches often assess the quality of the final output, which makes it difficult to isolate the contribution of retrieval. Others rely on retrieval-specific metrics like NDCG or recall\@k, which are agnostic to how the retrieved context is actually used by the language model. SePer (Semantic Perplexity Reduction) proposes a more direct and model-aware alternative. It quantifies how much the retrieved documents reduce the semantic perplexity of the model on the input prompt. In effect, it asks the model: given this prompt, do these documents reduce your uncertainty? This is done without relying on gold answers or generated completions. The model computes a semantic probability distribution over possible answers before and after conditioning on the retrieved documents. The reduction in perplexity is treated as a measure of how useful the context was for informing the model’s internal representation of the query. Empirical results show that SePer correlates more strongly with human preferences than standard retrieval metrics. On five open-domain QA datasets, SePer achieved a Kendall tau correlation of 0.43 with human rankings. By comparison, NDCG\@20 reached only 0.20. Other common metrics, such as recall\@k, precision\@k, and reranking scores, performed similarly poorly. Notably, SePer maintains high correlation with human judgment even when varying only the retriever while holding the generator fixed. This suggests it can serve as a standalone retriever evaluation metric, decoupled from full-pipeline performance. Because SePer is generator-aware but does not require final output generation, it enables more targeted, efficient, and interpretable evaluation of retrieval in RAG systems. Rather than inferring retrieval quality through surrogate signals, it leverages the model’s own internal distribution as a source of supervision. Find the paper here arxiv.org/abs/2503.01478 #RetrievalAugmentedGeneration #LLMEvaluation #RAGMetrics
1
-
Shubham Baldava
Datazip • 5K followers
Nvidia GTC presentation about structured & unstructured data being foundation for AI/LLMs used for business needs. We at OLake strongly believe the same and developing features around Iceberg. Apache Iceberg as per Feb 2026 release will soon be supporting unstructured data along side with structured and semi. https://lnkd.in/geYYF--6
22
-
Mark Kerzner
Novus • 11K followers
Session date: May 01, 2026, noon CST Math challenge: Can you explain Compton’s ring experiment? New prompting with Andrew Ng (repo) How to prepare your data for the future life with Snowflake or FreeEed (repo) AI News Beyond Vibe: more spec-driven development (repo) How to build with Anthropic Claude Console (course)
2
1 Comment -
Pete Wilkins
HPA • 7K followers
Join the Databricks Data and AI Experts: Steve Sobel --Global Leader, Startup & Venture Programs; Kevin Rasmussen -- Field CTO, Early Stage Programs, and Sriharsha Tikkireddy -- Technical Lead, Early Stage Programs, for the next Omaxn event: Workshop: #VibeCoding Unlocked: Effortless App Building with Databricks Details" TechNexus Venture Collaborative on Aug 26 What you'll learn: #FasterDevelopment and #Iteration: Databrick's "#Vibecoding" allows developers to use natural language prompts to generate code and build applications, cutting the time from idea to deployment from weeks to minutes or hours. Engineers can test, refine, and redeploy applications in a highly efficient, conversational way—enabling tight feedback loops. #FullCustomization and #CodeOwnership: Engineers get complete control over the generated code instead of being limited by drag-and-drop low-code templates. You can modify, extend, and host the application anywhere, ensuring flexibility for enterprise needs and long-term maintainability. #PatternExtraction and #Maintainable Architecture: The AI focuses on extracting repetitive code patterns and parameterizing them, producing clean, documented configurations. This ensures that applications are not only quick to build, but also easy for future engineers to understand and maintain. #Integration with #DatabricksInfrastructure: Applications inherit benefits from Databricks’ robust governance and security: access controls, audit trails, RBAC, and deployment on the trusted Lakehouse platform. #TurningIdeas into #Apps: Fast: Vibe coding enables business stakeholders, analysts, and subject matter experts to describe their needs in plain English (or any natural language). The system translates these ideas into working applications, so non-techies can rapidly bring their insights to life—no computer science degree required.
10
1 Comment -
Chris Li
Yotta Labs • 2K followers
AWS Trainium receives insufficient attention. While most of the industry has been laser-focused on NVIDIA GPU availability, Trainium has been quietly offering a compelling cost and performance profile — with very little of the open-source tooling needed to use it effectively for LLM inference. Our team just took a real step toward fixing that with Mini-SGLang-Neuron — a lightweight framework that brings SGLang to AWS Trainium and Inferentia silicons, with features like radix attention caching, chunked prefill, and tensor parallelism across Neuron cores. Benchmarked against vLLM-Neuron and available to run on Trainium1 instances through the Yotta console today. The broader point: the teams building hardware-agnostic inference infrastructure right now are positioning themselves well for a world where compute is abundant but fragmented. Trainium is one piece of that puzzle worth paying attention to. Open source — repo linked in comments.
2
1 Comment -
Andrew Ng
DeepLearning.AI • 3M followers
New short course: DSPy: Build and Optimize Agentic Apps DSPy is a powerful open-source framework for automatically tuning prompts for GenAI applications. In this course, you'll learn to use DSPy, together with MLflow. This is built in partnership with Databricks and taught by Chen Qian, co-lead of the DSPy framework. Many AI builders spend hours hand-tuning prompts. When given a set of evals, DSPy automates this process. It’s especially useful for optimizing prompts, including few-shot prompts, in complex agentic AI workflows. Further, if you switch an application to a newer LLM, performance can degrade if your prompts were optimized to the previous model. DSPy automatically optimizes the entire system for the new LLM as well, using just a few evaluation examples. This course teaches DSPy works, and best practices for using it. You’ll write programs using DSPy’s signature-based programming model, debug them with MLflow tracing -- to gain visibility into how different parts of a pipeline, as well as how the overall system, are performing -- and automatically improve their accuracy with DSPy Optimizer. Please sign up here: https://lnkd.in/gdjae8AX
4,672
190 Comments -
Jonathan Vickery
Activant Capital • 1K followers
"Taken together, these aren’t just high-profile database deals but proof that the AI infrastructure race is moving down-stack. While sophisticated models may still grab headlines, the battle is now increasingly about who can serve AI-ready data, fast, resiliently and at scale." An article from Forbes on how 3 recent database acquisitions are starting to prove a prediction that we made in March: Data is the undervalued element of AI. 👉 https://lnkd.in/dSWMak-J
12
2 Comments -
Niall Murphy
6K followers
YellowDog.ai just set a 10x benchmark uplift in scale computing, delivering 40,000 tasks per second (TPS) and managing 100,000 compute nodes in the cloud. What's even more interesting, that's 2x IBM Symphony and opens an intriguing pathway for these until-know closed/captive systems.
11
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content